BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040445
(358 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255544181|ref|XP_002513153.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223548164|gb|EEF49656.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 408
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 224/386 (58%), Positives = 258/386 (66%), Gaps = 81/386 (20%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSC 67
TPPMGWNSWNHFHCQINE +I+ET DDCWGES RD Q
Sbjct: 49 TPPMGWNSWNHFHCQINETVIKETADALVSTGLAKLGYLYVNIDDCWGESKRDRQ----- 103
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSL--- 117
GNL ANKTTFPSG KA+ADYVH I L
Sbjct: 104 ---------------------GNLEANKTTFPSGIKAVADYVHSKGLKLGIYADAGLRTC 142
Query: 118 DNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
R+ ++ E+ GIDYLKYDNC+N+DT+PT+RY+ MSDALKK GR IFFS+
Sbjct: 143 TGRVPGSLGHEEQDAATFASWGIDYLKYDNCYNNDTKPTVRYKVMSDALKKTGRSIFFSL 202
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
CEWGDM PALWG +I NSWRTTDDISDSW ML IADMN+VYAD+AKPGGWNDPDMLEVG
Sbjct: 203 CEWGDMSPALWGGDIGNSWRTTDDISDSWESMLKIADMNQVYADYAKPGGWNDPDMLEVG 262
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------- 278
NGGMKY+EYVVHFSIWAISKAPLLLGCDV ++T ETM II NEEVI +NQ
Sbjct: 263 NGGMKYSEYVVHFSIWAISKAPLLLGCDVRSMTDETMQIISNEEVIGINQDSLGVQAKKV 322
Query: 279 -AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL------FEHRTLE 331
++ IWAGPLSGNRIV+LL +RK + S MTAHWDDIGI N+TL +EH+TL
Sbjct: 323 RMEGERDIWAGPLSGNRIVILLVNRKKYKSTMTAHWDDIGITPNHTLVEARDVWEHQTLT 382
Query: 332 TKFQGKLTAKVDGHSCKMYELTPVTK 357
T+FQ KLT VD +CK+Y LTP+++
Sbjct: 383 TQFQDKLTTDVDPRACKLYVLTPISQ 408
>gi|255544185|ref|XP_002513155.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223548166|gb|EEF49658.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 415
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 222/385 (57%), Positives = 258/385 (67%), Gaps = 81/385 (21%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSC 67
TPPMGWNSWNHFHC+INE ++RET DDCWG RD
Sbjct: 55 TPPMGWNSWNHFHCEINETVVRETADALVLSGLARRGYKYVNIDDCWGAYDRD------- 107
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNS----L 117
+GNLVAN++TFPSG KALADYVH + ++S
Sbjct: 108 -------------------FKGNLVANRSTFPSGIKALADYVHSKGLKMGIYADSGYRTC 148
Query: 118 DNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
R+ ++ E+ GIDYLKYDNC+NDDTRPTIRY AMS ALKK RPIFFSM
Sbjct: 149 SGRMPGSLGLEEKDAKTFASWGIDYLKYDNCYNDDTRPTIRYLAMSRALKKTKRPIFFSM 208
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
CEWGDMRPALWG+ + NSWRTTDDISDSW ML IAD+NEVYA++AKPG WNDPDMLEVG
Sbjct: 209 CEWGDMRPALWGAKLANSWRTTDDISDSWESMLKIADLNEVYANYAKPGAWNDPDMLEVG 268
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------- 278
NGGMKY+EYVVHFSIWAISKAPLLLGCD+ ++ ETM IIGNEEVIAVNQ
Sbjct: 269 NGGMKYSEYVVHFSIWAISKAPLLLGCDIRHMNYETMKIIGNEEVIAVNQDQLGVQARKV 328
Query: 279 -AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT------LFEHRTLE 331
+++IWAGPLSGNR VVL+ +RKT+ S +TAHWDDIG SNN L+EH TLE
Sbjct: 329 RMEGEREIWAGPLSGNRKVVLMVNRKTWGSPLTAHWDDIGFNSNNIVVQARDLWEHHTLE 388
Query: 332 TKFQGKLTAKVDGHSCKMYELTPVT 356
FQG +TA ++ HSCKMY LTP++
Sbjct: 389 KPFQGNMTAYINPHSCKMYVLTPIS 413
>gi|15241601|ref|NP_196455.1| alpha-galactosidase 1 [Arabidopsis thaliana]
gi|10178280|emb|CAC08338.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
gi|332003909|gb|AED91292.1| alpha-galactosidase 1 [Arabidopsis thaliana]
Length = 410
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/417 (50%), Positives = 264/417 (63%), Gaps = 91/417 (21%)
Query: 1 LFIVSSSAADSNGSD-----RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----- 50
+ +V SS + +NG D RR+LL+NGLG+TPPMGWNSWNHF C I+EK+I+ET
Sbjct: 22 MTMVESSRSVNNGHDDSEILRRHLLTNGLGVTPPMGWNSWNHFSCNIDEKMIKETADALV 81
Query: 51 --------------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKT 96
DDCW E SRDS+ G+LV K+
Sbjct: 82 TTGLSKLGYNYVNIDDCWAEISRDSK--------------------------GSLVPKKS 115
Query: 97 TFPSGTKALADYVHILISNSLDNRIESNIAF----------------------EQGIDYL 134
TFPSG KA+ADYVH S L I S+ + E GIDYL
Sbjct: 116 TFPSGIKAVADYVH---SKGLKLGIYSDAGYFTCSKTMPGSLGYEEHDAKTFAEWGIDYL 172
Query: 135 KYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDIS 194
KYDNC +D ++PT+RY M+ AL K+GRPIF S+CEWGDM PALWGS + NSWRTT+DI
Sbjct: 173 KYDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPALWGSPVGNSWRTTNDIK 232
Query: 195 DSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
D+W M++IADMNEVYA+HA+PGGWNDPDMLEVGNGGM +EY+VHFSIWAISKAPLLLG
Sbjct: 233 DTWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYIVHFSIWAISKAPLLLG 292
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQAIN-----------DQQIWAGPLSGNRIVVLLESRK 303
CD+ N+T ETM I+ N+EVIA+NQ + D ++WAGPLSG R+ +LL +R
Sbjct: 293 CDIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRMEGDLEVWAGPLSGYRVALLLLNRG 352
Query: 304 TFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+ +TA W+DI I +N+ L+EH+TL+ KF G LTA VD H+CK+Y L PV
Sbjct: 353 PSRTSITALWEDIEIPANSIVEARDLWEHQTLKQKFVGNLTATVDSHACKLYVLKPV 409
>gi|20260602|gb|AAM13199.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
gi|30725668|gb|AAP37856.1| At5g08380 [Arabidopsis thaliana]
Length = 410
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/417 (50%), Positives = 264/417 (63%), Gaps = 91/417 (21%)
Query: 1 LFIVSSSAADSNGSD-----RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----- 50
+ +V SS + +NG D RR+LL+NGLG+TPPMGWNSWNHF C I+EK+I+ET
Sbjct: 22 MTMVESSRSVNNGHDDSEILRRHLLTNGLGVTPPMGWNSWNHFSCNIDEKMIKETANALV 81
Query: 51 --------------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKT 96
DDCW E SRDS+ G+LV K+
Sbjct: 82 TTGLSKLGYNYVNIDDCWAEISRDSK--------------------------GSLVPKKS 115
Query: 97 TFPSGTKALADYVHILISNSLDNRIESNIAF----------------------EQGIDYL 134
TFPSG KA+ADYVH S L I S+ + E GIDYL
Sbjct: 116 TFPSGIKAVADYVH---SKGLKLGIYSDAGYFTCSKTMPGSLGYEEHDAKTFAEWGIDYL 172
Query: 135 KYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDIS 194
KYDNC +D ++PT+RY M+ AL K+GRPIF S+CEWGDM PALWGS + NSWRTT+DI
Sbjct: 173 KYDNCNSDGSKPTVRYPVMTRALMKSGRPIFHSLCEWGDMHPALWGSPVGNSWRTTNDIK 232
Query: 195 DSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
D+W M++IADMNEVYA+HA+PGGWNDPDMLEVGNGGM +EY+VHFSIWAISKAPLLLG
Sbjct: 233 DTWLSMISIADMNEVYAEHARPGGWNDPDMLEVGNGGMTKDEYIVHFSIWAISKAPLLLG 292
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQAIN-----------DQQIWAGPLSGNRIVVLLESRK 303
CD+ N+T ETM I+ N+EVIA+NQ + D ++WAGPLSG R+ +LL +R
Sbjct: 293 CDIRNMTKETMEIVANKEVIAINQDPHGVQAKKVRMEGDLEVWAGPLSGYRVALLLLNRG 352
Query: 304 TFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+ +TA W+DI I +N+ L+EH+TL+ KF G LTA VD H+CK+Y L PV
Sbjct: 353 PSRTSITALWEDIEIPANSIVEARDLWEHQTLKQKFVGNLTATVDSHACKLYVLKPV 409
>gi|255544191|ref|XP_002513158.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223548169|gb|EEF49661.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 412
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/398 (53%), Positives = 254/398 (63%), Gaps = 86/398 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+NGL +TPPMGWNSWNHF CQI+EK+I+ET DDCW E
Sbjct: 44 RRNLLANGLAITPPMGWNSWNHFSCQIDEKMIKETADALVSTGLYKLGYTYVNIDDCWAE 103
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
+RD + GNLV +TFPSG KALADYVH S
Sbjct: 104 IARDEK--------------------------GNLVPKNSTFPSGIKALADYVH---SKG 134
Query: 117 LDNRIESNIAF--------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ + EQ GIDYLKYDNC ND T+PT+RY M+
Sbjct: 135 LKLGIYSDAGYFTCSKTMPGSLGHEEQDAKTFALWGIDYLKYDNCNNDGTKPTVRYPVMT 194
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
AL KAGRPIFFS+CEWGD+ PALWG+ + NSWRTT+DISD+W M++ ADMNEVYAD A
Sbjct: 195 RALMKAGRPIFFSLCEWGDLHPALWGAKVGNSWRTTNDISDNWESMISRADMNEVYADLA 254
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
+PGGWNDPDMLEVGNGGM +EY+VHFSIWA+SKAPLLLGCDV N+T ETM I+ N+EVI
Sbjct: 255 RPGGWNDPDMLEVGNGGMTKDEYIVHFSIWAMSKAPLLLGCDVRNMTKETMDIVMNKEVI 314
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
AVNQ D +IWAGPLSG R+VVLL +R + + +TA+WDDIG+ N+
Sbjct: 315 AVNQDRLGVQAKKVRMEGDLEIWAGPLSGYRLVVLLLNRGPWRNSITANWDDIGLAPNSV 374
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TL+ F LTA VD H+CKMY L P++
Sbjct: 375 VEARDLWEHKTLKNHFVENLTATVDSHACKMYILKPIS 412
>gi|225463807|ref|XP_002270791.1| PREDICTED: alpha-galactosidase [Vitis vinifera]
gi|297742709|emb|CBI35343.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 207/398 (52%), Positives = 252/398 (63%), Gaps = 86/398 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+N LG+TPPMGWNSWNHF+C+I+EK I+ET DDCW E
Sbjct: 40 RRNLLANDLGITPPMGWNSWNHFNCKIDEKTIKETADALVATGLVKLGYEYVNIDDCWAE 99
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
+RD + G LVA K+TFPSG KALADYVH S
Sbjct: 100 INRDEK--------------------------GTLVAKKSTFPSGIKALADYVH---SKG 130
Query: 117 LDNRIESNIAF----------------------EQGIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ + GIDYLKYDNC ND +RPT RY M+
Sbjct: 131 LKLGIYSDAGYFTCSKTMPGSLGHEEKDAKTFASWGIDYLKYDNCNNDGSRPTDRYPVMT 190
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
AL KAGRPIFFS+CEWGDM PALWGS + NSWRTT+DI+D+W M++ ADMN+VYA +A
Sbjct: 191 RALMKAGRPIFFSLCEWGDMHPALWGSKVGNSWRTTNDIADTWDSMMSRADMNDVYAQYA 250
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
+PGGWNDPDMLEVGNGGM +EY+VHFSIWAISKAPLL+GCDV N T ET+ IIGN+EVI
Sbjct: 251 RPGGWNDPDMLEVGNGGMTNDEYIVHFSIWAISKAPLLIGCDVRNTTKETLDIIGNKEVI 310
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
AVNQ + DQ+IWAGPLS R+ +LL +R + +TA+WDDIG+ +
Sbjct: 311 AVNQDPLGVQAKKVRSEGDQEIWAGPLSDYRVALLLVNRGPWRYSVTANWDDIGLPTGTV 370
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TLE +F G LTA +D H+CKMY L P++
Sbjct: 371 VEARDLWEHKTLEKRFVGSLTATMDSHACKMYILKPIS 408
>gi|377655463|gb|AFB73771.1| alpha-galactosidase [Chimonanthus praecox]
Length = 412
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 204/397 (51%), Positives = 254/397 (63%), Gaps = 80/397 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR+LLSNGLGLTP MGWNSWNHF+C+I EK+IRET DDCW E
Sbjct: 41 RRSLLSNGLGLTPQMGWNSWNHFNCKIEEKVIRETADAPVSTGLAKLGYQYVNIDDCWAE 100
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ 110
RDS+ G LV +K TFPSG KALADYVH
Sbjct: 101 LERDSK--------------------------GYLVPHKVTFPSGIKALADYVHSKGLKL 134
Query: 111 ------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + + F GIDYLKYDNC ND ++PT+RY M+ AL
Sbjct: 135 GIYSDAGYLTCSKKMPGSLGHEEQDAMTFASWGIDYLKYDNCNNDGSKPTVRYPVMTRAL 194
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
KAGRPIFFS+CEWGD+ PALWG+ + NSWRTT+DISD+W M++ ADMNEVYA+ A+PG
Sbjct: 195 MKAGRPIFFSLCEWGDLHPALWGAQLGNSWRTTNDISDNWDSMVSRADMNEVYAEPARPG 254
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM +EY+VHFSIWAISKAPLL+GCDV N+T ETM I+GN+EVIAVN
Sbjct: 255 GWNDPDMLEVGNGGMTKDEYIVHFSIWAISKAPLLIGCDVRNMTKETMYIVGNKEVIAVN 314
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++WAGPLS R+ +LL +R + S +TAHWDDIG+
Sbjct: 315 QDPLGVQAKKVRMEGDLEVWAGPLSNYRVALLLVNRGPWPSPVTAHWDDIGLPMETVVEA 374
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
L++H+TL +F+ KLTA ++ H+C+MY LTP++ +
Sbjct: 375 RDLWKHKTLPRRFKNKLTATMNSHACQMYVLTPISSV 411
>gi|356532145|ref|XP_003534634.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 410
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/395 (52%), Positives = 248/395 (62%), Gaps = 80/395 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R NLL+NGLG TPPMGWNSWNHF CQINEK+IRET DDCW E
Sbjct: 42 RGNLLANGLGRTPPMGWNSWNHFSCQINEKMIRETADALVSTGLSKLGYTYVNIDDCWAE 101
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ 110
+RD++ GNLVA K+TFPSG KALADYVH
Sbjct: 102 LNRDAK--------------------------GNLVAKKSTFPSGIKALADYVHSKGLKL 135
Query: 111 ------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC ND ++PT RY M+ AL
Sbjct: 136 GIYSDAGYFTCSKQMPGSLGHEFQDAKTFASWGIDYLKYDNCNNDGSKPTDRYPIMTRAL 195
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
AGRPIFFS+CEWGD+ PALWG+ + NSWRTT+DI+DSW M++ ADMNEVYA++A+PG
Sbjct: 196 MMAGRPIFFSLCEWGDLHPALWGAKVGNSWRTTNDINDSWESMISRADMNEVYAEYARPG 255
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM NEY VHFS+WA+SKAPLLLGCDV N+T ETM I+ N+EVIAVN
Sbjct: 256 GWNDPDMLEVGNGGMTKNEYTVHFSLWALSKAPLLLGCDVRNITKETMEIVANKEVIAVN 315
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D++IWAGPLSG R+ V+L +R + +TA+WDDIGI +
Sbjct: 316 QDPLGVQGKKVRMEGDEEIWAGPLSGYRVAVVLLNRGPWKISITANWDDIGIPPKSAVEA 375
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TL F KLTA VD H CKMY L P+
Sbjct: 376 RDLWEHKTLMRPFVDKLTATVDPHGCKMYVLKPLA 410
>gi|356568242|ref|XP_003552322.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 410
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/395 (52%), Positives = 247/395 (62%), Gaps = 80/395 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+NGLG TPPMGWNSWNHF CQINEK+IRET DDCW E
Sbjct: 42 RRNLLANGLGRTPPMGWNSWNHFSCQINEKMIRETADALVSTGLSKLGYTYVNIDDCWAE 101
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ 110
+RD++ GNLVA +TFPSG KALADYVH
Sbjct: 102 LNRDAK--------------------------GNLVAKNSTFPSGIKALADYVHGKGLKL 135
Query: 111 ------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC ND ++PT RY M+ AL
Sbjct: 136 GIYSDAGYFTCSKQMPGSLGHEFQDAKTFASWGIDYLKYDNCNNDGSKPTDRYPIMTRAL 195
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
AGRPIFFS+CEWGD+ PALWG+ + NSWRTT+DI+DSW M++ ADMNEVYA++A+PG
Sbjct: 196 MMAGRPIFFSLCEWGDLHPALWGAKVGNSWRTTNDINDSWESMVSRADMNEVYAEYARPG 255
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM NEY+VHFS+WA+SKAPLLLGCDV N+T ETM I+ N+EVI+VN
Sbjct: 256 GWNDPDMLEVGNGGMTKNEYIVHFSLWALSKAPLLLGCDVRNMTKETMEIVANKEVISVN 315
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q DQ+IWAGPLSG R+ V+L +R +TA+WDDI I +
Sbjct: 316 QDPLGVQGKKVRMEGDQEIWAGPLSGYRVAVVLLNRGPSKISITANWDDIDIPPKSAVEA 375
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TL F KLTA VD H CKMY L PV
Sbjct: 376 RDLWEHKTLMRPFVDKLTATVDPHGCKMYVLKPVA 410
>gi|297810943|ref|XP_002873355.1| alpha-galactosidase 1 [Arabidopsis lyrata subsp. lyrata]
gi|297319192|gb|EFH49614.1| alpha-galactosidase 1 [Arabidopsis lyrata subsp. lyrata]
Length = 430
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 207/408 (50%), Positives = 254/408 (62%), Gaps = 96/408 (23%)
Query: 12 NGSD----RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----------------- 50
NG D RR+LL+NGLG+TPPMGWNSWNHF C I+EK+I+ET
Sbjct: 37 NGDDSEILRRHLLTNGLGVTPPMGWNSWNHFSCNIDEKMIKETADALVTTGLSKLGYKYV 96
Query: 51 --DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADY 108
DDCW E SRDS+ G+LV K+TFPSG KALADY
Sbjct: 97 NIDDCWAEISRDSK--------------------------GSLVPKKSTFPSGIKALADY 130
Query: 109 VHILISNSLDNRIESNIAF----------------------EQGIDYLKYDNCFNDDTRP 146
VH S L I S+ + E GIDYLKYDNC ND ++P
Sbjct: 131 VH---SKGLKLGIYSDAGYFTCSKTMPGSLGYEEHDAKTFAEWGIDYLKYDNCNNDGSKP 187
Query: 147 TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM 206
T+RY M+ AL K+GRPIF S+CEWGDM PALWGS + NSWRTT+DI D+W M++IADM
Sbjct: 188 TVRYPVMTRALMKSGRPIFHSLCEWGDMHPALWGSPVGNSWRTTNDIKDTWLSMISIADM 247
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
NEVYA+HA+PGGWNDPDMLEVGNGGM EY+VHFSIWAISKAPLLLGCD+ N+T ET+
Sbjct: 248 NEVYAEHARPGGWNDPDMLEVGNGGMTKEEYIVHFSIWAISKAPLLLGCDIRNMTKETVE 307
Query: 267 IIGNEEVIAVNQAIN-----------------DQQIWAGPLSGNRIVVLLESRKTFSSMM 309
I+ N+EVIA+NQ D ++WAGPLSG R+ +LL +R +++
Sbjct: 308 IVANKEVIAINQEKESSCRSTRCQAKKVRMEGDLEVWAGPLSGYRVALLLLNRGPSRTLI 367
Query: 310 TAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
TA WDDI I +N+ L++H+TL+ KF G LTA VD H+CK+Y L
Sbjct: 368 TALWDDIEIPANSIVEARDLWQHKTLKQKFVGNLTATVDSHACKLYIL 415
>gi|224113219|ref|XP_002316426.1| predicted protein [Populus trichocarpa]
gi|222865466|gb|EEF02597.1| predicted protein [Populus trichocarpa]
Length = 369
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 208/398 (52%), Positives = 250/398 (62%), Gaps = 86/398 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+NGL TPPMGWNSWNHF+C+I+EKII+ T DDCW E
Sbjct: 1 RRNLLANGLADTPPMGWNSWNHFNCKIDEKIIKATADFLVSTGLSKLGYTYVNIDDCWAE 60
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
+RD + GNLV K+TFPSG KALADYVH S
Sbjct: 61 MARDGK--------------------------GNLVPKKSTFPSGIKALADYVH---SKG 91
Query: 117 LDNRIESNIAF--------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ + EQ GIDYLKYDNC ND T+PT+RY M+
Sbjct: 92 LKLGIYSDAGYFTCSKTMPGSLGHEEQDAKSFASWGIDYLKYDNCNNDGTKPTVRYPVMT 151
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
AL K G PIFFS+CEWGDM PA WG+ + NSWRTT+DISD+W M++ ADMNEVYA+ A
Sbjct: 152 RALMKTGHPIFFSLCEWGDMHPATWGAKVGNSWRTTNDISDTWDSMVSRADMNEVYAELA 211
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
+PGGWNDPDMLEVGNGGM +EY +HFSIWAISKAPLLLGCDV N+T ETM II N+EVI
Sbjct: 212 RPGGWNDPDMLEVGNGGMTKDEYTLHFSIWAISKAPLLLGCDVRNMTKETMDIIANKEVI 271
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
AVNQ D +IWAGPL+G R+ VLL +R + + ++A WDDIGI N+
Sbjct: 272 AVNQDPLGVQAKKVRMEGDLEIWAGPLAGYRVAVLLVNRGPWRNSISAQWDDIGIPLNSI 331
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TL+T F G LTA +D H+CKMY L P++
Sbjct: 332 VKARDLWEHKTLKTHFVGNLTATMDSHACKMYILKPIS 369
>gi|53747927|emb|CAF34023.1| alpha-galactosidase 1 [Pisum sativum]
Length = 405
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 208/413 (50%), Positives = 253/413 (61%), Gaps = 85/413 (20%)
Query: 3 IVSSSAADSNGSD-----RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET------- 50
+ SSS A++ ++ RRNLL+NGL TPPMGWNSWNHF CQI+EK+IRET
Sbjct: 19 VTSSSLANNKNNEEEHLLRRNLLANGLARTPPMGWNSWNHFACQIDEKMIRETADALIST 78
Query: 51 ------------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTF 98
DDCW E +RD + GNLVA +TF
Sbjct: 79 GLSKLGYTYVNIDDCWAELNRDDK--------------------------GNLVAKNSTF 112
Query: 99 PSGTKALADYVH------------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNC 139
PSG KALADYVH + SL + + F GIDYLKYDNC
Sbjct: 113 PSGIKALADYVHSKGLKLGIYSDAGYFTCSKQMPGSLGHEFQDAKTFASWGIDYLKYDNC 172
Query: 140 FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR 199
FN ++PT RY M+ AL KAGRPIFFS+CEWGD+ PALWG+ + NSWRTT DISD+W
Sbjct: 173 FNGGSKPTKRYPVMTRALVKAGRPIFFSLCEWGDLHPALWGAKVGNSWRTTGDISDTWES 232
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
M++ AD NEVYA+ A+PGGWNDPDMLEVGNGGM +EY+VHFS+WAISKAPLLLGCDV N
Sbjct: 233 MISKADTNEVYAELARPGGWNDPDMLEVGNGGMTKSEYIVHFSLWAISKAPLLLGCDVRN 292
Query: 260 LTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSM 308
++ +TM II N+EVIAVNQ D +IWAGPLSG R+ V+L ++
Sbjct: 293 VSKDTMEIISNKEVIAVNQDSLGVQAKKVRMEGDLEIWAGPLSGYRVAVVLLNKGAQRMA 352
Query: 309 MTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
MTA+WDDIGI + L+EH+TLE F KL+ V+ H+CKMY L PV
Sbjct: 353 MTANWDDIGIPPKSVVEARDLWEHKTLEKHFVDKLSVTVESHACKMYVLKPVA 405
>gi|449458720|ref|XP_004147095.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
gi|449503497|ref|XP_004162032.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
gi|84310212|gb|ABC55266.1| acid alpha galactosidase 1 [Cucumis sativus]
Length = 413
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 202/395 (51%), Positives = 247/395 (62%), Gaps = 80/395 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+NGLG+TPPMGWNSWNHF C INEK+IRET DDCW E
Sbjct: 45 RRNLLANGLGVTPPMGWNSWNHFACNINEKMIRETADALVSTGLSKLGYEYVNIDDCWAE 104
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RD + GNLV +TFPSG KALADYVH
Sbjct: 105 IARDDK--------------------------GNLVPKNSTFPSGMKALADYVHAKGLKI 138
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC N + +PTIRY M+ AL
Sbjct: 139 GIYSDAGYFTCSKTMPGSLGHEEQDAKTFAAWGIDYLKYDNCNNGNIKPTIRYPVMTRAL 198
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
KAGRPIF S+CEWGD+ PALWG + NSWRTT+DI+DSW M++ AD+NE+YAD+A+PG
Sbjct: 199 MKAGRPIFLSLCEWGDLHPALWGDKLGNSWRTTNDINDSWESMISRADLNEIYADYARPG 258
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM +EY+VHFS+WAISKAPLLLGCD+ NLT ET I+ N EVIAVN
Sbjct: 259 GWNDPDMLEVGNGGMTKDEYIVHFSLWAISKAPLLLGCDLRNLTKETKAIVTNTEVIAVN 318
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q + D ++WAGPLSG R+ V+L +R + + ++A WDDIGI N+
Sbjct: 319 QDPLGVQAKKVRSEGDLEVWAGPLSGYRVAVVLLNRGPWRNAISAQWDDIGIPPNSNVEA 378
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH TL+T F LTA VD H+CK+Y L P++
Sbjct: 379 RDLWEHTTLKTTFVANLTATVDSHACKLYILKPIS 413
>gi|37722165|gb|AAR02007.1| galactan:galactan galactosyltransferase 1 [Ajuga reptans]
Length = 404
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 198/411 (48%), Positives = 250/411 (60%), Gaps = 80/411 (19%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET---------- 50
L +++++ +D + RRNLL NGLG TPPMGWNSWN F C I+E IIRET
Sbjct: 14 LLLITTTTSDDDLHHRRNLLGNGLGATPPMGWNSWNGFGCMIDENIIRETADALVSSGLA 73
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCWGE RD Q G LV NK FPSG
Sbjct: 74 KLGYEYVTIDDCWGEVDRDEQ--------------------------GALVPNKAAFPSG 107
Query: 102 TKALADYVHIL------------------ISNSLDNR-IESNIAFEQGIDYLKYDNCFND 142
ALADYVH SL N +++ GIDYLKYDNC ND
Sbjct: 108 MNALADYVHSKGLKLGIYADAGYFTCSKKTPGSLGNEELDAETFASWGIDYLKYDNCNND 167
Query: 143 DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
++PTIRY M+ AL +GRPIFF++CEWGDM PA WG + NSWRT+ DI+D+W RM++
Sbjct: 168 GSKPTIRYPVMTRALMNSGRPIFFAICEWGDMHPATWGYTVGNSWRTSCDIADNWERMVS 227
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
ADMNE YAD+A+PGGWNDPDMLE+GNGGM +EY+VH S+WAISKAPL++GCDV N++
Sbjct: 228 RADMNEYYADYARPGGWNDPDMLEIGNGGMNKDEYIVHMSLWAISKAPLIIGCDVRNMSK 287
Query: 263 ETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTA 311
+T+ I+ NEEVIAV++ D ++WAGPLS RI VLL +R ++ +TA
Sbjct: 288 DTLEILTNEEVIAVDKDNLGVQAKKVRMDGDWEVWAGPLSHYRIAVLLLNRGGWTVNITA 347
Query: 312 HWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
+WDDIGI S+ + L+EH+TL +KF+G TA VD H+CK Y TP K
Sbjct: 348 YWDDIGITSSTSVVARDLWEHKTLTSKFRGSFTAAVDPHTCKFYIFTPADK 398
>gi|30016922|gb|AAP04002.1| alpha-galactosidase [Carica papaya]
Length = 410
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/398 (49%), Positives = 244/398 (61%), Gaps = 86/398 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR LL+NGLGLTPPMGWNSWNHF+C I+EK I++ DDCW E
Sbjct: 42 RRTLLANGLGLTPPMGWNSWNHFYCNIDEKTIKKAADALIATGLSKLGYTYVNIDDCWAE 101
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
+RD + GN V K+TFPSG KALADYVH S
Sbjct: 102 LTRDEK--------------------------GNSVPRKSTFPSGIKALADYVH---SKG 132
Query: 117 LDNRIESNIAF--------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ + EQ GIDYLKYDNC+ND ++PT+RY M+
Sbjct: 133 LKLGIYSDAGYFTCSKTMPGSLGHEEQDAKTFASWGIDYLKYDNCYNDGSKPTVRYPVMT 192
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
AL K G PI+FS+CEWGD+ PAL G+ + NSWRTT+DI+D+ M++ AD+NE YAD+A
Sbjct: 193 RALMKCGPPIYFSLCEWGDLHPALRGAPVGNSWRTTNDIADTRESMVSRADLNEFYADYA 252
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
+PGGWNDPDMLEVGNGGM +EY+VHFSIWAISKAPLLLGCD+ N+T +T+ II N+EVI
Sbjct: 253 RPGGWNDPDMLEVGNGGMTEDEYMVHFSIWAISKAPLLLGCDLRNVTKDTLDIIANKEVI 312
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
AVNQ + D +IW GPLSG R+ +LL +R +TA+WDD+ I N
Sbjct: 313 AVNQDPQGVQAHKVRSEGDLEIWTGPLSGYRVALLLVNRGPLRRAITANWDDLAIPPNTY 372
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L+EH+TL T F G LTA V+ H CKMY L P++
Sbjct: 373 VEARDLWEHKTLTTPFVGNLTATVNSHGCKMYILKPIS 410
>gi|356560095|ref|XP_003548331.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 405
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 197/400 (49%), Positives = 243/400 (60%), Gaps = 81/400 (20%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------D 51
S RRNL++NGLG TPPMGWNSWNHFH +INE +IRET D
Sbjct: 31 SESQHRRNLVANGLGATPPMGWNSWNHFHLKINETVIRETADALVSTGLSKLGYTYVNID 90
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI 111
D WGE +RD GNLVA+KT FPSG KALADYVH
Sbjct: 91 DGWGEMTRD--------------------------VDGNLVAHKTKFPSGIKALADYVHG 124
Query: 112 ------------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRA 152
++ SL + + F G+DYLKYDNC N +P RY
Sbjct: 125 KGLKLGIYSAAGYFTCANVMPGSLGHEEQDAKTFASWGVDYLKYDNCNNGGIKPIDRYPI 184
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
M+ +L KAGRPI+FS+CEWGDM PALWG + NSWRTTDDI D+W ML+ ADMNEVYAD
Sbjct: 185 MTRSLMKAGRPIYFSLCEWGDMHPALWGYQVGNSWRTTDDIRDNWDSMLSKADMNEVYAD 244
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+A+PGGWNDPDMLE+GNGGM ++Y+VHFS+WAISKAPL++GCDV N+T +T I+ N E
Sbjct: 245 YARPGGWNDPDMLEIGNGGMNKSQYIVHFSLWAISKAPLIIGCDVRNMTEDTKEILSNTE 304
Query: 273 VIAVNQAINDQQ-----------IWAGPLSGNRI-VVLLESRKTFSSMMTAHWDDIGIKS 320
VIAVNQ +Q +W+GPLS R+ VVLL + +TA W+DIG+
Sbjct: 305 VIAVNQDPLGKQGKKVRMESALEVWSGPLSEYRVAVVLLNKYSDRRASITALWEDIGLDP 364
Query: 321 NNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
++ L+EH+TLE +F GKLT V+ HSCKMY L P+
Sbjct: 365 SSVVEARDLWEHKTLERQFVGKLTDTVEPHSCKMYVLKPI 404
>gi|147810287|emb|CAN75822.1| hypothetical protein VITISV_001661 [Vitis vinifera]
Length = 355
Score = 366 bits (940), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 195/384 (50%), Positives = 237/384 (61%), Gaps = 86/384 (22%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGWNSWNHF+C+I+EK I+ET DDCW E +RD +
Sbjct: 1 MGWNSWNHFNCKIDEKTIKETADALVATGLVKLGYEYVNIDDCWAEINRDEK-------- 52
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF--- 127
G LVA K+TFPSG KALADYVH S L I S+ +
Sbjct: 53 ------------------GTLVAKKSTFPSGIKALADYVH---SKGLKLGIYSDAGYFTC 91
Query: 128 -------------------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
GIDYLKYDNC ND +RPT RY M+ AL KAGRPIFFS+
Sbjct: 92 SKTMPGSLGHEEKDAKTFASWGIDYLKYDNCNNDGSRPTDRYPVMTRALMKAGRPIFFSL 151
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
CEWGDM PALWGS + NSWRTT+DI+D+W M++ ADMN+VYA +A+PGGWNDPDMLEVG
Sbjct: 152 CEWGDMHPALWGSKVGNSWRTTNDIADTWDSMMSRADMNDVYAQYARPGGWNDPDMLEVG 211
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------- 278
NGGM +EY+VHFSIWAISKAPLL+GCDV N T ET+ IIGN+EVIAVNQ
Sbjct: 212 NGGMTNDEYIVHFSIWAISKAPLLIGCDVRNTTKETLDIIGNKEVIAVNQDPLGVQAKKV 271
Query: 279 -AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLET 332
+ DQ+IWAGPLS R+ +LL +R + +TA+WDDIG+ L+EH+TLE
Sbjct: 272 RSEGDQEIWAGPLSDYRVALLLVNRGPWRYSVTANWDDIGLPXGTVVEARDLWEHKTLEK 331
Query: 333 KFQGKLTAKVDGHSCKMYELTPVT 356
+F G L A +D H+CKMY L P++
Sbjct: 332 RFVGSLXATMDSHACKMYILKPIS 355
>gi|363543533|ref|NP_001241777.1| alpha-galactosidase precursor [Zea mays]
gi|195638500|gb|ACG38718.1| alpha-galactosidase precursor [Zea mays]
Length = 428
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 186/401 (46%), Positives = 238/401 (59%), Gaps = 72/401 (17%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------D 51
+ G RR L +NGLGLTP MGWNSWNHF C INE ++R T D
Sbjct: 42 AGGGGRRVLFANGLGLTPQMGWNSWNHFQCDINEAVVRSTADALVATGLAKAGYTYVNLD 101
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI 111
DCW +S R ++ A+ L G +VAN TFPSG KALADYVH
Sbjct: 102 DCWADSERTKEV----------------ARVLLRDLVGYMVANPKTFPSGIKALADYVHS 145
Query: 112 ----------LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRA 152
+ + NR+ ++ +E+ G+DYLKYDNC+ D T T+R+
Sbjct: 146 KGLKLGIYSSAGTRTCSNRMPGSLGYEERDAKMFASWGVDYLKYDNCYRDGTPETVRFGR 205
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
MS AL +GRPIF+S+CEWG M WG NSWRTT DI+D+W+ ML D+N+ YA
Sbjct: 206 MSRALVNSGRPIFYSLCEWGFMEVPKWGGMYGNSWRTTGDINDTWSGMLDNIDLNDAYAR 265
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+AKPGGWNDPDMLEVGNGGM YNEYVVHFS+WAI+KAPL++GCDV ++ ET+ I+ N E
Sbjct: 266 YAKPGGWNDPDMLEVGNGGMAYNEYVVHFSLWAIAKAPLVIGCDVTRVSNETLGILSNAE 325
Query: 273 VIAVNQ------------AINDQQIWAGPLSGNRIVVLLESR-KTFSSMMTAHWDDIGIK 319
VIA++Q ND ++WAG LSG+R VLL +R T S+ +TA W D+GI+
Sbjct: 326 VIAISQDRLGVQGQKVSKYGNDLEVWAGQLSGHRKAVLLLNRGATRSASITAAWPDVGIR 385
Query: 320 SN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+++H TL F G LTA V HSCK++ LTPV
Sbjct: 386 RGVAVQARDVWKHETLPGWFTGSLTAVVGPHSCKLFVLTPV 426
>gi|414884440|tpg|DAA60454.1| TPA: hypothetical protein ZEAMMB73_299281 [Zea mays]
Length = 419
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 183/382 (47%), Positives = 237/382 (62%), Gaps = 43/382 (11%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFH 70
+ G RR L +NGLGLTP MGWNSWNHF C INE ++R T D I L+ + +
Sbjct: 42 AGGGGRRVLFANGLGLTPQMGWNSWNHFQCDINEAVVRSTADAL------VAIGLAKAGY 95
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------LISNSLDNR 120
+ +L A + G +VAN TFPSG KALADYVH + + NR
Sbjct: 96 TYVNLDDCWADSERTKEVGYMVANPKTFPSGIKALADYVHSKGLKLGIYSSAGTRTCSNR 155
Query: 121 IESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ ++ +E+ G+DYLKYDNC+ D T T+R+ MS AL +GRPIF+S+CEW
Sbjct: 156 MPGSLGYEERDAKMFASWGVDYLKYDNCYRDGTPETVRFGRMSRALVNSGRPIFYSLCEW 215
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
G M WG NSWRTT DI+D+W+ ML D+N+ YA +AKPGGWNDPDMLEVGNGG
Sbjct: 216 GFMEVPKWGGMYGNSWRTTGDINDTWSGMLDNIDLNDAYARYAKPGGWNDPDMLEVGNGG 275
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------A 279
M YNEYVVHFS+WAI+KAPL++GCDV ++ ET+ I+ N EVIA++Q
Sbjct: 276 MAYNEYVVHFSLWAIAKAPLVIGCDVTRVSNETLGILSNAEVIAISQDRLGVQGQKVSKY 335
Query: 280 INDQQIWAGPLSGNRIVVLLESR-KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETK 333
ND ++WAG LSG+R VLL +R T S+ +TA W D+GI+ +++H TL
Sbjct: 336 GNDLEVWAGQLSGHRKAVLLLNRGATRSASITAAWPDVGIRRGVAVEARDVWKHETLPGW 395
Query: 334 FQGKLTAKVDGHSCKMYELTPV 355
F G LTA V HSCK++ LTPV
Sbjct: 396 FTGSLTAVVGPHSCKLFVLTPV 417
>gi|242048350|ref|XP_002461921.1| hypothetical protein SORBIDRAFT_02g010650 [Sorghum bicolor]
gi|241925298|gb|EER98442.1| hypothetical protein SORBIDRAFT_02g010650 [Sorghum bicolor]
Length = 424
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 185/384 (48%), Positives = 239/384 (62%), Gaps = 44/384 (11%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFH 70
+G RR L++NGLGLTP MGWNSWNHF C INE ++R T D L+ + +
Sbjct: 48 GDGGARRVLVANGLGLTPQMGWNSWNHFQCGINEAVVRRTADAL------VATGLAKAGY 101
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------LISNSLDNR 120
+ +L A N+ +G +VAN TFPSG KALADYVH + + NR
Sbjct: 102 KYVNLDDCWADYQRNK-EGYMVANPKTFPSGIKALADYVHSKGLKLGIYSSAGTRTCSNR 160
Query: 121 IESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ ++ E G+DYLKYDNC+ D T T+R+ MS ALK +GRPIFFS+CEW
Sbjct: 161 MPGSLGHEDKDAKTFASWGVDYLKYDNCYRDGTPETVRFGRMSRALKNSGRPIFFSLCEW 220
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
G M WG NSWRTT DI+D+W+ ML D N+ YA +AKPGGWNDPDMLEVGNGG
Sbjct: 221 GYMEVPKWGGMYGNSWRTTGDINDTWSGMLDNIDRNDAYARYAKPGGWNDPDMLEVGNGG 280
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------A 279
M Y+EYVVHFS+WAI+KAPL++GCD+ ++ ET+ I+ N EVIA+NQ
Sbjct: 281 MAYDEYVVHFSLWAIAKAPLVIGCDLTRVSKETLGILSNAEVIAINQDRMGVQGKKVKKY 340
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSM-MTAHWDDIGIKSNNT-----LFEHRTLETK 333
ND ++WAG LS +R VLL +R T SS +TA W D+GI+ T +++H TL +
Sbjct: 341 GNDLEVWAGQLSRHRKAVLLLNRGTTSSASITAAWPDVGIRRGVTVEARDVWKHETLPGR 400
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
F G LTA V HSCK++ LTPV +
Sbjct: 401 FTGSLTAVVGPHSCKLFVLTPVPR 424
>gi|414884442|tpg|DAA60456.1| TPA: hypothetical protein ZEAMMB73_299281 [Zea mays]
Length = 418
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 183/382 (47%), Positives = 238/382 (62%), Gaps = 44/382 (11%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFH 70
+ G RR L +NGLGLTP MGWNSWNHF C INE ++R T D I L+ + +
Sbjct: 42 AGGGGRRVLFANGLGLTPQMGWNSWNHFQCDINEAVVRSTADAL------VAIGLAKAGY 95
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------LISNSLDNR 120
+ +L A + +G +VAN TFPSG KALADYVH + + NR
Sbjct: 96 TYVNLDDCWADSERTK-EGYMVANPKTFPSGIKALADYVHSKGLKLGIYSSAGTRTCSNR 154
Query: 121 IESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ ++ +E+ G+DYLKYDNC+ D T T+R+ MS AL +GRPIF+S+CEW
Sbjct: 155 MPGSLGYEERDAKMFASWGVDYLKYDNCYRDGTPETVRFGRMSRALVNSGRPIFYSLCEW 214
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
G M WG NSWRTT DI+D+W+ ML D+N+ YA +AKPGGWNDPDMLEVGNGG
Sbjct: 215 GFMEVPKWGGMYGNSWRTTGDINDTWSGMLDNIDLNDAYARYAKPGGWNDPDMLEVGNGG 274
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------A 279
M YNEYVVHFS+WAI+KAPL++GCDV ++ ET+ I+ N EVIA++Q
Sbjct: 275 MAYNEYVVHFSLWAIAKAPLVIGCDVTRVSNETLGILSNAEVIAISQDRLGVQGQKVSKY 334
Query: 280 INDQQIWAGPLSGNRIVVLLESR-KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETK 333
ND ++WAG LSG+R VLL +R T S+ +TA W D+GI+ +++H TL
Sbjct: 335 GNDLEVWAGQLSGHRKAVLLLNRGATRSASITAAWPDVGIRRGVAVEARDVWKHETLPGW 394
Query: 334 FQGKLTAKVDGHSCKMYELTPV 355
F G LTA V HSCK++ LTPV
Sbjct: 395 FTGSLTAVVGPHSCKLFVLTPV 416
>gi|255569397|ref|XP_002525666.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223535102|gb|EEF36784.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 415
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 182/406 (44%), Positives = 239/406 (58%), Gaps = 80/406 (19%)
Query: 4 VSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET------------- 50
++++AA RRNLL NGLGLTPPMGWNSWNHF C I E +IRET
Sbjct: 35 MATAAARPFMEIRRNLLGNGLGLTPPMGWNSWNHFQCNIEETLIRETADAMVSSGLAALG 94
Query: 51 ------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKA 104
DDCW E RDSQ GNLV +TFPSG KA
Sbjct: 95 YQYINLDDCWAELQRDSQ--------------------------GNLVPKVSTFPSGIKA 128
Query: 105 LADYVH-----ILI-----SNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTR 145
LADY+H + I + + + + +E+ GIDYLKYDNC ND T
Sbjct: 129 LADYIHGKGLKLGIYSDAGTQTCSRTMPGSFGYEEQDAKTFASWGIDYLKYDNCNNDGTS 188
Query: 146 PTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
P +RY MS AL +GRPIFFS+CEWG PA W +I NSWRTT DISD+W M++ A
Sbjct: 189 PKLRYPIMSKALLNSGRPIFFSLCEWGQEDPATWAPSIGNSWRTTGDISDNWESMISRAG 248
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
N+ +A +A+PGGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ +++ ET
Sbjct: 249 ENDKWASYARPGGWNDPDMLEVGNGGMTTEEYRSHFSIWALAKAPLLIGCDIRSMSNETY 308
Query: 266 PIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
I+ N+EVIAVNQ D ++WAGPLS ++ ++L +R + + +TA+W
Sbjct: 309 EILSNKEVIAVNQDKLGVQGKKVKNDGDLEVWAGPLSNRKVAIVLWNRGSSRATVTAYWS 368
Query: 315 DIGIKSNNTLFEHRTL-----ETKFQGKLTAKVDGHSCKMYELTPV 355
DIG+ ++ T+ R L ++ +G ++A +D H+CKMY LTP+
Sbjct: 369 DIGLNNSTTMVHARDLWAHSTQSSVKGHISADLDSHACKMYVLTPL 414
>gi|449458718|ref|XP_004147094.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
gi|86278368|gb|ABC88435.1| acid alpha galactosidase 2 [Cucumis sativus]
Length = 421
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 184/397 (46%), Positives = 233/397 (58%), Gaps = 82/397 (20%)
Query: 12 NGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DD 52
+ S RR LL+NGL LTPPMGWNSWNHF C +NE +I+ET DD
Sbjct: 52 SASSRRILLNNGLALTPPMGWNSWNHFQCNLNENLIKETADAMVSTGLAALGYQYINLDD 111
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI- 111
CW E RDS+ GNLVA +TFPSG KALADYVH
Sbjct: 112 CWAELDRDSK--------------------------GNLVAKASTFPSGIKALADYVHRK 145
Query: 112 -----LISN----SLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAM 153
+ S+ + R+ ++ E+ GIDYLKYDNC N T P RY M
Sbjct: 146 GLKLGIYSDAGIRTCSKRMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKM 205
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
+ AL+++GRPI FS+CEWG PA W N+ NSWRTT DI D+W M TIAD N+ +A +
Sbjct: 206 TKALQQSGRPILFSLCEWGQEDPATWAVNVGNSWRTTSDIQDNWISMTTIADQNDKWASY 265
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
AKPGGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ TM ++ N+EV
Sbjct: 266 AKPGGWNDPDMLEVGNGGMTIAEYRSHFSIWALAKAPLLIGCDIRSMDNNTMKLLSNKEV 325
Query: 274 IAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN 322
IAVNQ D ++WAG LSG R+ V+L +R + + +TA+W DIG+ S+
Sbjct: 326 IAVNQDKLGVQGKKVHKYGDLEVWAGLLSGKRVAVVLWNRSLWRANITAYWSDIGLSSST 385
Query: 323 T-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
T L+EH + + LTA+VD H CKM+ LTP
Sbjct: 386 TVTARDLWEHSSQVVRHH--LTAQVDSHDCKMFVLTP 420
>gi|357146796|ref|XP_003574114.1| PREDICTED: alpha-galactosidase-like [Brachypodium distachyon]
Length = 400
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 187/400 (46%), Positives = 233/400 (58%), Gaps = 88/400 (22%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET-------------------DDCWG 55
RR++L+NGLG TPPMGWNSWNHF C N E +IRET DDCW
Sbjct: 26 RRSMLANGLGSTPPMGWNSWNHFGCDGNGEVVIRETADALISTGLAAAGYRYVNLDDCWA 85
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN 115
ES RD++ GNLVA K+TFP+G KALADYVH S
Sbjct: 86 ESERDAK--------------------------GNLVAKKSTFPNGIKALADYVH---SK 116
Query: 116 SLDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTIRYRAM 153
L I S+ ++ G+DYLKYDNC N D +P RY M
Sbjct: 117 GLKLGIYSDAGYKTCAKAQPGSLGYEEQDAKTFASWGVDYLKYDNCNNGDLKPLERYPEM 176
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
S AL KAG PI+FS+CEWGDM PA W + NSWRTT+DI D+W M + AD NEV+A++
Sbjct: 177 SKALMKAGHPIYFSICEWGDMHPARWAAAYGNSWRTTNDIEDTWDSMTSRADQNEVWAEY 236
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A+PGGWNDPDMLEVGNGGM +EY+VHFSIWAISKAPL++GCDV +++ +T I+ N EV
Sbjct: 237 ARPGGWNDPDMLEVGNGGMTNDEYIVHFSIWAISKAPLIIGCDVRHMSQDTYDILANTEV 296
Query: 274 IAVNQ-----------AINDQQIWAGPLSGNRI-VVLLESRKTFSSMMTAHWDDIGIKSN 321
IAVNQ +IWA PLSG R VVLL + +TA WDDIG+ +
Sbjct: 297 IAVNQDPLGVQGKKVRMEGSNEIWAAPLSGYRTAVVLLNRHAKDEAEITARWDDIGLPAG 356
Query: 322 -----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H+TL+ KF K+ V H+ +M+ LTP+T
Sbjct: 357 MAVEARDLWQHKTLDAKFTDKMPFNVTPHAARMFLLTPLT 396
>gi|927577|gb|AAA73964.1| alpha-galactosidase [Phaseolus vulgaris]
Length = 425
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 183/392 (46%), Positives = 226/392 (57%), Gaps = 81/392 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNL+ NGLG TPPMGWNSWNHF C INE +IRET DDCWGE
Sbjct: 59 RRNLVGNGLGQTPPMGWNSWNHFSCNINEDLIRETADAMVSTGLAALGYQYINIDDCWGE 118
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDSQ GNLV +TFPSG KALADYVH
Sbjct: 119 LNRDSQ--------------------------GNLVPKASTFPSGMKALADYVHKKGLKL 152
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC N + P RY MS AL
Sbjct: 153 GIYSDAGTQTCSKTMPGSLGHEEQDAKTFASWGIDYLKYDNCENKNISPKERYPPMSKAL 212
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
+GRPIFFS+CEWG PA W ++ NSWRTT DI D W M++ AD+N+ +A +A PG
Sbjct: 213 ANSGRPIFFSLCEWGSEDPATWAKSVGNSWRTTGDIEDKWESMISRADLNDEWASYAGPG 272
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ L T ++ NEEVIAVN
Sbjct: 273 GWNDPDMLEVGNGGMTTEEYRAHFSIWALAKAPLLIGCDIRALDVTTKELLSNEEVIAVN 332
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE 326
Q + ND ++WAGPLS NR+ V+L +R + + +TA W DIG+K TL +
Sbjct: 333 QDKLGVQGKKVKSNNDLEVWAGPLSNNRLAVILWNRSSSKAKVTASWSDIGLKP-GTLVD 391
Query: 327 HRTL-----ETKFQGKLTAKVDGHSCKMYELT 353
R L ++ G+++A++D H+C MY LT
Sbjct: 392 ARDLWKHSTQSSVSGEISAELDSHACNMYVLT 423
>gi|351724789|ref|NP_001235022.1| alpha galactosidase precursor [Glycine max]
gi|927575|gb|AAA73963.1| alpha galactosidase [Glycine max]
Length = 422
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 180/393 (45%), Positives = 229/393 (58%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL NGLG TPPMGWNSWNHF C I E +IRET DDCWGE
Sbjct: 56 RRNLLDNGLGHTPPMGWNSWNHFACNIKEDLIRETADAMVSTGLAALGYQYINIDDCWGE 115
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDS+ GNLV +TFPSG KALADYVH
Sbjct: 116 LNRDSK--------------------------GNLVPKASTFPSGMKALADYVHKNGLKL 149
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC N++ P RY MS+AL
Sbjct: 150 GIYSDAGNQTCSKTMPGSLGHEEQDAKTFASWGIDYLKYDNCENNNISPKERYPPMSEAL 209
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIFFS+CEWG PA W ++ NSWRTT DI D W M++ AD+N+ +A +A PG
Sbjct: 210 ANTGRPIFFSLCEWGSEDPATWAKSVGNSWRTTGDIQDKWDSMISRADLNDKWASYAGPG 269
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIW+++KAPLL+GCD+ L A T ++ N+EVIAVN
Sbjct: 270 GWNDPDMLEVGNGGMTTEEYRAHFSIWSLAKAPLLIGCDIRALDATTKELLSNKEVIAVN 329
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q + ND ++WAGPLS N++ V+L +R + + +TA W DIG+K +
Sbjct: 330 QDKLGVQGKKVKSTNDLEVWAGPLSNNKVAVILWNRSSSKAKVTASWSDIGLKPGTSVEA 389
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T ++ G+++A++D H+CKMY +TP
Sbjct: 390 RDLWAHST-QSSVSGEISAELDSHACKMYVVTP 421
>gi|224118658|ref|XP_002331416.1| predicted protein [Populus trichocarpa]
gi|222873630|gb|EEF10761.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 181/392 (46%), Positives = 227/392 (57%), Gaps = 81/392 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R L +NGLGL PPMGWNSWNHFHC I EK+IR+T DDCW E
Sbjct: 2 RGLSANGLGLAPPMGWNSWNHFHCNIEEKLIRDTADAMVSSGLAALGYEHVNLDDCWAEL 61
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------- 110
+RDS+ GNLV +TFPSG KALADY+H
Sbjct: 62 NRDSE--------------------------GNLVPKASTFPSGIKALADYIHGKGLKLG 95
Query: 111 ---ILISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
S + + ++ E+ G+DYLKYDNC ND T P RY MS AL
Sbjct: 96 IYSDAGSQTCSGTMPGSLGHEEQDAKTFASWGVDYLKYDNCNNDGTSPKERYPVMSKALL 155
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGG 218
+GRPIFFS+CEWG PA W SN+ NSWRTT DISD+W M + AD N+ +A +A PGG
Sbjct: 156 NSGRPIFFSLCEWGQEDPATWASNVGNSWRTTGDISDNWDSMTSRADQNDQWASYAAPGG 215
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ ET+ I+ N EVIAVNQ
Sbjct: 216 WNDPDMLEVGNGGMTTEEYRSHFSIWALAKAPLLIGCDVRTMSDETIEILSNREVIAVNQ 275
Query: 279 AI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---- 323
D ++WAGPLS N+I V+L +R + + +TA+W DIG+ T
Sbjct: 276 DKLGVQGKKVKNNGDLEVWAGPLSNNKIAVVLWNRGSSRATVTAYWSDIGLDPTTTVNAR 335
Query: 324 -LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H + +G+++A +D H+CKMY LTP
Sbjct: 336 DLWAHSN-QPSVKGQISADLDSHACKMYVLTP 366
>gi|242039253|ref|XP_002467021.1| hypothetical protein SORBIDRAFT_01g018410 [Sorghum bicolor]
gi|241920875|gb|EER94019.1| hypothetical protein SORBIDRAFT_01g018410 [Sorghum bicolor]
Length = 409
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 185/400 (46%), Positives = 228/400 (57%), Gaps = 88/400 (22%)
Query: 14 SDRRNLLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET-------------------DDC 53
+ RR++L+NGLG PPMGWNSWNHF C N E +IRET DDC
Sbjct: 34 AHRRSMLANGLGSAPPMGWNSWNHFQCDGNGEVVIRETADALVSTGLAALGYRYVNIDDC 93
Query: 54 WGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILI 113
W E RD++ GNLVAN TFP G KALADYVH
Sbjct: 94 WAEPKRDAK--------------------------GNLVANTKTFPQGIKALADYVH--- 124
Query: 114 SNSLDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTIRYR 151
S L I S+ F+ G+DYLKYDNC N D +P RY
Sbjct: 125 SKGLKLGIYSDAGFQTCAKVQPGSLGHEEQDAKTFAAWGVDYLKYDNCNNGDLKPLERYP 184
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
MS AL K GRPI+FS+CEWGDM PA WG+ NSWRTT+DI+D+W M+ AD NEV+A
Sbjct: 185 EMSRALMKVGRPIYFSLCEWGDMHPAKWGAAYGNSWRTTNDIADTWDSMIATADQNEVWA 244
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
++A+PGGWNDPDMLEVGNGGM +EYVVHFS+WAISKAPL++GCDV +++ ET I+ N+
Sbjct: 245 EYARPGGWNDPDMLEVGNGGMTNSEYVVHFSLWAISKAPLIIGCDVRHMSQETYDILANK 304
Query: 272 EVIAVNQ-----------AINDQQIWAGPLSGNRI-VVLLESRKTFSSMMTAHWDDIGIK 319
EVIAVNQ +IWA PLS R VVLL + +TAHWDDIG+
Sbjct: 305 EVIAVNQDPLGVPGKKVRMEGSSEIWAAPLSEYRTAVVLLNRHAKDQATITAHWDDIGLP 364
Query: 320 SNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ L+ H T++ F ++ V HSC+M+ L P
Sbjct: 365 AGTPVEATDLWLHETVDATFTDTMSFDVAPHSCRMFVLKP 404
>gi|60417424|emb|CAI47559.1| alpha galactosidase [Coffea arabica]
Length = 420
Score = 337 bits (863), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 180/393 (45%), Positives = 226/393 (57%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR+LL+NGLGLTPPMGWNSWNHF C ++EK+IRET DDCW E
Sbjct: 54 RRSLLANGLGLTPPMGWNSWNHFSCNLDEKLIRETADAMASKGLAALGYKYINLDDCWAE 113
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDSQ GNLV +TFPSG KALADYVH
Sbjct: 114 LNRDSQ--------------------------GNLVPKGSTFPSGIKALADYVHSKGLKL 147
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F G+DYLKYDNC +++ P RY MS AL
Sbjct: 148 GIYSDAGTQTCSKTMPGSLGHEEQDAKTFASWGVDYLKYDNCNDNNISPKERYPIMSKAL 207
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
+GR IFFS+CEWGD PA W + NSWRTT DI DSW+ M + ADMN+ +A +A PG
Sbjct: 208 LNSGRSIFFSLCEWGDEDPATWAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPG 267
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ T ++ N EVIAVN
Sbjct: 268 GWNDPDMLEVGNGGMTTTEYRSHFSIWALAKAPLLIGCDIRSIDGATFQLLSNAEVIAVN 327
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++WAGPLSG R+ V L +R + ++ +TA+W D+G+ S
Sbjct: 328 QDKLGVQGKKVKTYGDLEVWAGPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNA 387
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T E +G+++A VD H KMY LTP
Sbjct: 388 RDLWAHST-EKSVKGQISAAVDAHDSKMYVLTP 419
>gi|2492782|sp|Q42656.1|AGAL_COFAR RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|504489|gb|AAA33022.1| alpha-galactosidase [Coffea arabica]
Length = 378
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 180/393 (45%), Positives = 226/393 (57%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR+LL+NGLGLTPPMGWNSWNHF C ++EK+IRET DDCW E
Sbjct: 12 RRSLLANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCWAE 71
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDSQ GNLV +TFPSG KALADYVH
Sbjct: 72 LNRDSQ--------------------------GNLVPKGSTFPSGIKALADYVHSKGLKL 105
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F G+DYLKYDNC N++ P RY MS AL
Sbjct: 106 GIYSDAGTQTCSKTMPGSLGHEEQDAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKAL 165
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
+GR IFFS+CEWG+ PA W + NSWRTT DI DSW+ M + ADMN+ +A +A PG
Sbjct: 166 LNSGRSIFFSLCEWGEEDPATWAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPG 225
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ T ++ N EVIAVN
Sbjct: 226 GWNDPDMLEVGNGGMTTTEYRSHFSIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVN 285
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++WAGPLSG R+ V L +R + ++ +TA+W D+G+ S
Sbjct: 286 QDKLGVQGNKVKTYGDLEVWAGPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNA 345
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T E +G+++A VD H KMY LTP
Sbjct: 346 RDLWAHST-EKSVKGQISAAVDAHDSKMYVLTP 377
>gi|226529467|ref|NP_001147362.1| alpha-galactosidase precursor [Zea mays]
gi|195610514|gb|ACG27087.1| alpha-galactosidase precursor [Zea mays]
Length = 412
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 182/385 (47%), Positives = 231/385 (60%), Gaps = 50/385 (12%)
Query: 14 SDRRNLLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMF 72
+ RR++L+NGLG PPMGWNSWNHF C N E +IRET D L+ + +
Sbjct: 37 AHRRSMLANGLGSAPPMGWNSWNHFQCDGNGEVVIRETADAL------VSTGLAALGYRY 90
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ--- 129
++ A P +GNLVAN TFP G KALADYVH L I S+ F+
Sbjct: 91 VNIDDCWAGP-QRDAKGNLVANTKTFPHGIKALADYVH---GKGLKLGIYSDAGFQTCAK 146
Query: 130 -------------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
G+DYLKYDNC N D +P RY MS AL K GRPI+FS+CE
Sbjct: 147 AQPGSLGHEELDAKTFAAWGVDYLKYDNCNNGDLKPLERYPEMSRALMKVGRPIYFSLCE 206
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WGDM PA WG+ NSWRTT+DI+D+W M+ AD NEV++++A+PGGWNDPDMLEVGNG
Sbjct: 207 WGDMHPATWGATYGNSWRTTNDIADTWDSMIATADQNEVWSEYARPGGWNDPDMLEVGNG 266
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
GM +EYVVHFS+WAISKAPL++GCDV +++ ET I+ N+EVIAVNQ
Sbjct: 267 GMTNSEYVVHFSLWAISKAPLIIGCDVRHMSRETYAILANKEVIAVNQDPLGVQGKKVRM 326
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTF-SSMMTAHWDDIGIKSNNT-----LFEHRTLETK 333
+IWA PLS R VLL +R + +TAHWDDIG+ + L+ H TL+
Sbjct: 327 EGSSEIWAAPLSEYRTAVLLLNRHAKDEATITAHWDDIGLPAGTPVEARDLWLHETLDAT 386
Query: 334 FQGKLTAKVDGHSCKMYELTPVTKL 358
F K++ V HSC+M L P ++
Sbjct: 387 FTDKMSFDVAPHSCRMLVLKPRIQI 411
>gi|356559985|ref|XP_003548276.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 425
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 183/407 (44%), Positives = 232/407 (57%), Gaps = 87/407 (21%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----------- 50
F+VS A RRNLL NGLG TPPMGWNSWNHF C I E +IRET
Sbjct: 51 FMVSREA------HRRNLLDNGLGHTPPMGWNSWNHFACNIKEDLIRETADAMVSTGLAA 104
Query: 51 --------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGT 102
DDCWGE +RDS+ GNLV +TFPSG
Sbjct: 105 LGYQYINIDDCWGELNRDSK--------------------------GNLVPKASTFPSGM 138
Query: 103 KALADYVHI------------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDD 143
KALADY+H + SL + + F GIDYLKYDNC N++
Sbjct: 139 KALADYIHKKGLKLGIYSDAGNQTCSKTMPGSLGHEKQDAKTFASWGIDYLKYDNCENNN 198
Query: 144 TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
P RY MS+AL GRPIFFS+CEWG PA W ++ NSWRTT DI D W M++
Sbjct: 199 ISPKERYPPMSEALANTGRPIFFSLCEWGSEDPATWAKSVGNSWRTTGDIEDKWESMISR 258
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
AD+N+ +A A PGGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ L A
Sbjct: 259 ADLNDKWASCAGPGGWNDPDMLEVGNGGMTTEEYRAHFSIWALAKAPLLIGCDIRALDAT 318
Query: 264 TMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
T ++ N EVIAVNQ + ND ++WAGPLS N++ V+L +R + + +TA
Sbjct: 319 TKELLSNNEVIAVNQDKLGVQGKKVKSNNDLEVWAGPLSNNKVAVILWNRSSSKAKVTAS 378
Query: 313 WDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
W DIG+K + L+ H T ++ G++++++D H+CKMY L+P
Sbjct: 379 WSDIGLKPGTSVKARDLWAHST-QSSVSGEISSELDSHACKMYVLSP 424
>gi|194689658|gb|ACF78913.1| unknown [Zea mays]
gi|194699772|gb|ACF83970.1| unknown [Zea mays]
gi|414870947|tpg|DAA49504.1| TPA: alpha-galactosidase [Zea mays]
Length = 414
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 184/404 (45%), Positives = 229/404 (56%), Gaps = 88/404 (21%)
Query: 14 SDRRNLLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET-------------------DDC 53
+ RR++L+NGLG PPMGWNSWNHF C N E +IRET DDC
Sbjct: 39 AHRRSMLANGLGSAPPMGWNSWNHFQCDGNGEVVIRETADALVSTGLAALGYRYVNIDDC 98
Query: 54 WGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILI 113
W E RD++ GNLVAN TFP G KALADYVH
Sbjct: 99 WAEPQRDAK--------------------------GNLVANTKTFPHGIKALADYVH--- 129
Query: 114 SNSLDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTIRYR 151
L I S+ F+ G+DYLKYDNC N D +P RY
Sbjct: 130 GKGLKLGIYSDAGFQTCAKAQPGSLGHEELDAKTFAAWGVDYLKYDNCNNGDLKPLERYP 189
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
MS AL K GRPI+FS+CEWGDM PA WG+ NSWRTT+DI+D+W M+ AD NEV++
Sbjct: 190 EMSRALMKVGRPIYFSLCEWGDMHPATWGATYGNSWRTTNDIADTWDSMIATADQNEVWS 249
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
++A+PGGWNDPDMLEVGNGGM +EYVVHFS+WAISKAPL++GCDV +++ ET I+ N+
Sbjct: 250 EYARPGGWNDPDMLEVGNGGMTNSEYVVHFSLWAISKAPLIIGCDVRHMSRETYAILANK 309
Query: 272 EVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTF-SSMMTAHWDDIGIK 319
EVIAVNQ +IWA PLS R VLL +R + + AHWDDIG+
Sbjct: 310 EVIAVNQDPLGVQGKKVRMEGSSEIWAAPLSEYRTAVLLLNRHAKDEATIAAHWDDIGLP 369
Query: 320 SNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ L+ H TL+ F K++ V HSC+M L P ++
Sbjct: 370 AGTPVEARDLWLHETLDATFTDKMSFDVAPHSCRMLVLKPRIQI 413
>gi|329741907|gb|AEB98601.1| alpha-galactosidase [Nicotiana tabacum]
Length = 413
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 181/409 (44%), Positives = 237/409 (57%), Gaps = 81/409 (19%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET---------- 50
L I S++ SN RR+LLSNGLG TP MGW+SWNHF C I EK+IRET
Sbjct: 31 LIIADSNSTTSNAYIRRSLLSNGLGRTPQMGWSSWNHFACNIEEKMIRETADAMVSTGLA 90
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW E +RDSQ GN+V +TFPSG
Sbjct: 91 SLGYEYVNIDDCWAELNRDSQ--------------------------GNMVPKSSTFPSG 124
Query: 102 TKALADYVH----------ILISNSLDNRIESNIAFEQ---------GIDYLKYDNCFND 142
KALADYVH S + ++ ++ E+ G+DYLKYDNC N+
Sbjct: 125 IKALADYVHGKGLKLGIYSDAGSQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCNNE 184
Query: 143 DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
+ P RY MS AL+ +GR IF+S+CEWGD PA W S++ NSWRTT DISD+W M +
Sbjct: 185 NRSPRERYPIMSKALQNSGRAIFYSLCEWGDDDPATWASSVGNSWRTTGDISDNWDSMTS 244
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
ADMN+ +A +A PGGWNDPDMLEVGNGGM EY HFSIWA++KAPL++GCD+ ++
Sbjct: 245 RADMNDKWASYAGPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLIIGCDLRSMDQ 304
Query: 263 ETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTA 311
I+ N+EVIAVNQ D ++WAGPLSG R+ ++L +R + + +TA
Sbjct: 305 TAHEILSNKEVIAVNQDKLGVQGKKVKQNGDLEVWAGPLSGKRLAMVLWNRSSSKADITA 364
Query: 312 HWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+W DIG+ S+ L+ H T + +G+L+A +D H C+MY LTP
Sbjct: 365 YWSDIGLDSSTVVDARDLWAHST-KGSVKGQLSASIDSHDCRMYVLTPT 412
>gi|329741905|gb|AEB98600.1| alpha-galactosidase [Nicotiana tabacum]
Length = 413
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 181/408 (44%), Positives = 237/408 (58%), Gaps = 81/408 (19%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET---------- 50
L I S++ SN RR+LLSNGLG TP MGW+SWNHF C I EK+IRET
Sbjct: 31 LIIADSNSTTSNAYIRRSLLSNGLGRTPQMGWSSWNHFACNIEEKMIRETADAMVSTGLA 90
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW E +RDSQ GN+V +TFPSG
Sbjct: 91 SLGYEYVNIDDCWAELNRDSQ--------------------------GNMVPKSSTFPSG 124
Query: 102 TKALADYVH----------ILISNSLDNRIESNIAFEQ---------GIDYLKYDNCFND 142
KALADYVH S + ++ ++ E+ G+DYLKYDNC N+
Sbjct: 125 IKALADYVHGKGLKLGIYSDAGSQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCNNE 184
Query: 143 DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
+ P RY MS AL+ +GR IF+S+CEWGD PA W S++ NSWRTT DISD+W M +
Sbjct: 185 NRSPRERYPIMSKALQNSGRAIFYSLCEWGDDDPATWASSVGNSWRTTGDISDNWDSMTS 244
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
ADMN+ +A +A PGGWNDPDMLEVGNGGM EY HFSIWA++KAPL++GCD+ ++
Sbjct: 245 RADMNDKWASYAGPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLIIGCDLRSMDQ 304
Query: 263 ETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTA 311
I+ N+EVIAVNQ D ++WAGPLSG R+ ++L +R + + +TA
Sbjct: 305 TAHEILSNKEVIAVNQDKLGVQGKKVKQNGDLEVWAGPLSGKRLAMVLWNRSSSKADITA 364
Query: 312 HWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+W DIG+ S+ L+ H T + +G+L+A +D H C+MY LTP
Sbjct: 365 YWSDIGLDSSTVVDARDLWAHST-KGSVKGQLSASIDSHDCRMYVLTP 411
>gi|255646697|gb|ACU23822.1| unknown [Glycine max]
Length = 426
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 179/393 (45%), Positives = 228/393 (58%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL NGLG TPPMGWNSWNHF C I E +IRET DDCWGE
Sbjct: 60 RRNLLDNGLGHTPPMGWNSWNHFACNIKEDLIRETADAMVSTGLAALGYQYINIDDCWGE 119
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDS+ GNLV +TFPSG KALADYVH
Sbjct: 120 LNRDSK--------------------------GNLVPKASTFPSGMKALADYVHKNGLKL 153
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC N++ P RY MS+AL
Sbjct: 154 GIYSDAGNQTCSKTMPGSLGHEEQDAKTFASWGIDYLKYDNCENNNISPKERYPPMSEAL 213
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIFFS+CEWG PA W ++ NSWRTT DI D W M++ AD+N+ +A +A G
Sbjct: 214 ANTGRPIFFSLCEWGSEDPATWAKSVGNSWRTTGDIQDKWDSMISRADLNDKWAFYAGLG 273
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIW+++KAPLL+GCD+ L A T ++ N+EVIAVN
Sbjct: 274 GWNDPDMLEVGNGGMTTEEYRAHFSIWSLAKAPLLIGCDIRALDATTKELLSNKEVIAVN 333
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q + ND ++WAGPLS N++ V+L +R + + +TA W DIG+K +
Sbjct: 334 QDKLGVQGKKVKSTNDLEVWAGPLSNNKVAVILWNRSSSKAKVTASWSDIGLKPGTSVEA 393
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T ++ G+++A++D H+CKMY +TP
Sbjct: 394 RDLWAHST-QSSVSGEISAELDSHACKMYVVTP 425
>gi|60417426|emb|CAI47560.1| alpha-galactosidase [Coffea canephora]
Length = 378
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 179/393 (45%), Positives = 224/393 (56%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR+LL+NGLGLTPPMGWNS NHF C ++EK+IRET DDCW E
Sbjct: 12 RRSLLANGLGLTPPMGWNSRNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCWAE 71
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDSQ GNLV +TFPSG KALADYVH
Sbjct: 72 LNRDSQ--------------------------GNLVPKGSTFPSGIKALADYVHSKGLKL 105
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL N + F G+DYLKYDNC N++ P RY MS AL
Sbjct: 106 GIYSDAGTQTCSKTMPGSLGNEEQDAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKAL 165
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
+GR IFFS+CEWG+ PA W + NSWRTT DI DSW+ M + ADMN+ +A +A PG
Sbjct: 166 LNSGRSIFFSLCEWGEEDPATWAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPG 225
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ T ++ N EVIAVN
Sbjct: 226 GWNDPDMLEVGNGGMTTTEYRSHFSIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVN 285
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++WAGPLSG R+ V L +R + ++ +TA+W D+G+ S
Sbjct: 286 QDKLGVQGNKVKTYGDLEVWAGPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNA 345
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T E +G+++A D H KMY LTP
Sbjct: 346 RDLWAHST-EKSVKGQISAAADAHDSKMYVLTP 377
>gi|10140702|gb|AAG13536.1|AC023240_9 putative alpha-galactosidase [Oryza sativa Japonica Group]
gi|31432821|gb|AAP54408.1| Alpha-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|218184802|gb|EEC67229.1| hypothetical protein OsI_34149 [Oryza sativa Indica Group]
gi|222613063|gb|EEE51195.1| hypothetical protein OsJ_32004 [Oryza sativa Japonica Group]
Length = 413
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 184/383 (48%), Positives = 237/383 (61%), Gaps = 53/383 (13%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDS 74
RR++L+NGL PPMGWNSWNHF C N E +IRET D L+ + + + +
Sbjct: 35 RRSMLANGLATAPPMGWNSWNHFACDGNGEDVIRETADAL------VSTGLAAAGYKYVN 88
Query: 75 LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----- 129
+ A+P +GNLVANK TFP G KALADYVH S L I S+ F+
Sbjct: 89 IDDCWAEP-QRDSKGNLVANKKTFPHGIKALADYVH---SKGLKLGIYSDAGFKTCAKVQ 144
Query: 130 -----------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
G+DYLKYDNC N D +P RY MS AL AGRPI+FS+CEWG
Sbjct: 145 PGSLGHEEQDAKTFASWGVDYLKYDNCNNGDLKPLERYPEMSKALMMAGRPIYFSLCEWG 204
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
DM PA WG+ NSWRTT+DI+D+W M++ AD NEV+AD+A+PGGWNDPDMLEVGNGGM
Sbjct: 205 DMHPAKWGAAYGNSWRTTNDIADTWESMVSRADENEVWADYARPGGWNDPDMLEVGNGGM 264
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ-------- 283
+EY+VHFS+WAISKAPL++GCDV +++ ET I+ N+EVIAVNQ A+ Q
Sbjct: 265 TNDEYIVHFSLWAISKAPLIIGCDVRHMSQETYDILANKEVIAVNQDALGIQGKKVRMEG 324
Query: 284 --QIWAGPLSGNRIVVLLESRK-TFSSMMTAHWDDIGIKSNNT-----LFEHRTLE---T 332
++WA PLSG R VLL +R ++ +TAHWDD+G+ + L+ H T+
Sbjct: 325 SSEVWAAPLSGYRTAVLLLNRHAAEAAAITAHWDDVGLPAGTAVEARDLWRHATVAGAGG 384
Query: 333 KFQGKLTAKVDGHSCKMYELTPV 355
+ K+T V HSC+M L P+
Sbjct: 385 ELTEKITLDVAPHSCRMLLLKPL 407
>gi|357499025|ref|XP_003619801.1| Alpha galactosidase [Medicago truncatula]
gi|355494816|gb|AES76019.1| Alpha galactosidase [Medicago truncatula]
Length = 420
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 175/393 (44%), Positives = 232/393 (59%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
+R +L NG+G TPPMGWNSWNHF C I+E +I+ET DDCW E
Sbjct: 54 KRTILDNGVGQTPPMGWNSWNHFQCDISEALIKETADAMVSTGLADLGYQYINIDDCWAE 113
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----I 111
+RDS+ G +VA +TFPSG KALADYVH +
Sbjct: 114 LNRDSK--------------------------GYMVAKSSTFPSGMKALADYVHSKGLKL 147
Query: 112 LISNSLDN-----RIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I + N R+ ++ E+ GIDYLKYDNC N+D P RY M +AL
Sbjct: 148 GIYSDAGNQTCSKRMPGSLGHEEQDAKTFASWGIDYLKYDNCENNDVTPKERYPRMGEAL 207
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
++GRPIF+S+CEWG P++WG ++ NSWRTT DI D W M++ AD N +A +A PG
Sbjct: 208 VQSGRPIFYSLCEWGAEDPSIWGKSVGNSWRTTSDIEDKWDSMISCADQNNKWASYAGPG 267
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA+ KAPLL+GCD+ L T ++ N EVIAVN
Sbjct: 268 GWNDPDMLEVGNGGMTTEEYRAHFSIWALVKAPLLIGCDIRALDDITKELLRNSEVIAVN 327
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q + +D ++WAGPLS NR+ V+L +R + +++TA W D+G+K +
Sbjct: 328 QDKLGIQGKKVKSNDDLEVWAGPLSDNRLAVVLWNRSSSKAIVTASWSDLGLKPGTSIDA 387
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+EH T ++K G ++A++D H+CKMY LTP
Sbjct: 388 RDLWEHST-QSKVSGDISAELDSHACKMYILTP 419
>gi|356574823|ref|XP_003555544.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 420
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 176/407 (43%), Positives = 236/407 (57%), Gaps = 81/407 (19%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----------- 50
+V+ S S ++RRNL+ +GLG TPPMGWNSWNHF C +NE ++RET
Sbjct: 38 LLVNGSRRLSVLTERRNLVQSGLGQTPPMGWNSWNHFGCDVNESVVRETADAMMSTGLAA 97
Query: 51 --------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGT 102
DDCW E +RD++ GN+V N +FPSG
Sbjct: 98 LGYQYVNIDDCWAELNRDTE--------------------------GNMVPNAASFPSGI 131
Query: 103 KALADYVH-----ILISNSLDN-----RIESNIAFEQ---------GIDYLKYDNCFNDD 143
KALADYVH + I + N R+ ++ E+ G+DYLKYDNC N+
Sbjct: 132 KALADYVHSKGLKLGIYSDAGNQTCSKRMPGSLGHEEQDAKTFASWGVDYLKYDNCENNG 191
Query: 144 TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
+ T RY MS+AL K GRP+FFSMCEWG PA W + NSWRTT DI D+W M I
Sbjct: 192 IKATDRYPPMSEALLKTGRPVFFSMCEWGWQDPATWAKTVGNSWRTTGDIEDNWNSMTNI 251
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
AD N+ + +A PGGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++
Sbjct: 252 ADANDQWVSYAGPGGWNDPDMLEVGNGGMTTEEYRAHFSIWALAKAPLLVGCDIRSMDKT 311
Query: 264 TMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
T +I N EVIAVNQ + ND ++WAGPL N++ V+L +R + ++ +TA
Sbjct: 312 TFELISNPEVIAVNQDKEGVQGKKVKSNNDLEVWAGPLIDNKVAVVLWNRSSSNATVTAS 371
Query: 313 WDDIGIKSNNTLFEHR-----TLETKFQGKLTAKVDGHSCKMYELTP 354
W DIG++ T+ + + T + G+++A++D H+CKMY LTP
Sbjct: 372 WSDIGLEP-GTMVDAKDLWANTTQQSVSGEISAQLDSHACKMYVLTP 417
>gi|356533701|ref|XP_003535398.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 422
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 179/393 (45%), Positives = 228/393 (58%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNL+ NGLG TPPMGWNSWNHF C INE +IRET DDCW E
Sbjct: 54 RRNLVENGLGQTPPMGWNSWNHFGCDINESVIRETADAMVSTGLAALGYRYVNIDDCWAE 113
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----I 111
+RDS+ N+V N FPSG KALADYVH +
Sbjct: 114 LNRDSE--------------------------DNMVPNAAAFPSGIKALADYVHSKGLKL 147
Query: 112 LISNSLDN-----RIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I + N R+ ++ E+ GIDYLKYDNC N+ + T RY MS+AL
Sbjct: 148 GIYSDAGNQTCSKRMPGSLGHEEQDAKTFASWGIDYLKYDNCENNGIKATDRYPPMSEAL 207
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
K GRPIFFSMCEWG PA W + NSWRTT DI D+W M TIAD N+ + +A G
Sbjct: 208 LKTGRPIFFSMCEWGWQDPATWAKTVGNSWRTTGDIEDNWNSMTTIADANDRWVSYAGSG 267
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ + T +I N EVIAVN
Sbjct: 268 GWNDPDMLEVGNGGMTTEEYRAHFSIWALAKAPLLVGCDIRAMDNTTYDLISNREVIAVN 327
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE 326
Q + ND ++WAGPLS N++ V+L +R + ++ +TA W +IG+K T+ +
Sbjct: 328 QDKEGVQGKKVKSNNDLEVWAGPLSDNKVAVILWNRSSSNATVTASWSNIGLKP-GTMVD 386
Query: 327 HRTL-----ETKFQGKLTAKVDGHSCKMYELTP 354
+ L ++ G+++A++D H+CKMY LTP
Sbjct: 387 AKDLWANSTQSSVSGEISAELDSHACKMYVLTP 419
>gi|116788809|gb|ABK25009.1| unknown [Picea sitchensis]
Length = 399
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 182/401 (45%), Positives = 223/401 (55%), Gaps = 85/401 (21%)
Query: 12 NGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DD 52
N + R+LL +GL TPP+GWNSWNHF C INE +IRET DD
Sbjct: 27 NTTSGRSLLQSGLAQTPPLGWNSWNHFGCHINEDVIRETADALVSTGLSKLGYQYVNIDD 86
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL 112
CW E +RDS G LVA+ + FPSG KALADYVH
Sbjct: 87 CWAELNRDSN--------------------------GRLVASASRFPSGIKALADYVH-- 118
Query: 113 ISNSLDNRIESNIAF------------EQ---------GIDYLKYDNCFNDDTRPTIRYR 151
S L I S+ + EQ G+DYLKYDNC+ND ++P RY
Sbjct: 119 -SKGLKLGIYSDAGYYTCQKQPGSLGHEQVDADTFAYWGVDYLKYDNCYNDGSKPENRYP 177
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
M DAL RPIF+S+CEWG PA W I +SWRTTDDI D W M++ AD N +A
Sbjct: 178 KMRDALLSTRRPIFYSLCEWGLDVPATWARQIGSSWRTTDDIEDKWESMISRADQNNEFA 237
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
+A PGGWNDPDMLEVGNG M EY HFSIWA+ KAPLL+GCDV ++ +T I+ N
Sbjct: 238 QYAGPGGWNDPDMLEVGNGNMTPEEYGSHFSIWALMKAPLLIGCDVTSMDKKTYGILSNS 297
Query: 272 EVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
EVIAVNQ + D ++WAGPLS NR+ V+L +R +++TA W+DIG+
Sbjct: 298 EVIAVNQDPLGVQGKKVNKLGDLEVWAGPLSNNRVAVVLLNRSNSRTIITAKWEDIGLNP 357
Query: 321 N-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
+ L+ HR LE QG LTA V H CKMY LT V+
Sbjct: 358 SAVVRVRNLWRHRELEANHQGSLTASVKPHGCKMYVLTSVS 398
>gi|350536051|ref|NP_001233971.1| alpha-galactosidase precursor [Solanum lycopersicum]
gi|6164944|gb|AAF04591.1|AF191823_1 alpha-galactosidase [Solanum lycopersicum]
Length = 409
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 177/394 (44%), Positives = 228/394 (57%), Gaps = 81/394 (20%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWG 55
+RRNLL NGLG TP MGW+SWNHF C I+E II+ T DDCW
Sbjct: 41 NRRNLLGNGLGQTPQMGWSSWNHFGCNIDENIIKGTADAMVHTGLASLGYEYINIDDCWA 100
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----- 110
E++RDSQ GN+VA +TFPSG KALADYVH
Sbjct: 101 EANRDSQ--------------------------GNMVAKGSTFPSGIKALADYVHGKGLK 134
Query: 111 -------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDA 156
+ SL + + F G+DYLKYDNC N+D P RY MS+A
Sbjct: 135 LGVYSDAGTQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCTNEDRSPRERYPIMSNA 194
Query: 157 LKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKP 216
L+ +GR IF+SMCEWGD PA W S++ NSWRTT DI+D W M + AD+N+ +A +A P
Sbjct: 195 LQNSGRAIFYSMCEWGDDNPATWASSVGNSWRTTGDITDDWNSMTSRADLNDQWASYAGP 254
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWNDPDMLEVGNGGM + EY HFSIWA+ KAPL++GCD+ ++ I+ N EVIAV
Sbjct: 255 GGWNDPDMLEVGNGGMSFGEYRSHFSIWALVKAPLIIGCDLRSMDNTAHDILSNPEVIAV 314
Query: 277 NQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF 325
NQ D ++WAGPLSG R+ V+L +R ++ + +TA+W DIG+ +TL
Sbjct: 315 NQDKLGVQGKKVKQYGDLEVWAGPLSGKRVAVVLWNRGSYKADITAYWSDIGLDY-STLV 373
Query: 326 EHRTL-----ETKFQGKLTAKVDGHSCKMYELTP 354
+ R L + +G+L+A V+ H C MY LTP
Sbjct: 374 DARDLWAHSTKGSVKGQLSASVESHDCSMYVLTP 407
>gi|294464392|gb|ADE77708.1| unknown [Picea sitchensis]
Length = 396
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 186/395 (47%), Positives = 221/395 (55%), Gaps = 86/395 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R+L SNGLG TP MGWNSWNHF C I+E IIR T DDCWGE
Sbjct: 27 HRHLSSNGLGKTPQMGWNSWNHFGCNIDENIIRGTADAMVSTGLSKVGYEYVNIDDCWGE 86
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
+RD+Q GNLVA +TFPSG KAL DYVH S
Sbjct: 87 LNRDAQ--------------------------GNLVAKASTFPSGMKALVDYVH---SKG 117
Query: 117 LDNRIESNIAF--------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ + EQ GIDYLKYDNCFN+ T P RY MS
Sbjct: 118 LKLGIYSDAGYYTCSKTMPGSLGYEEQDAKTFASWGIDYLKYDNCFNNGTNPQERYPKMS 177
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
+AL AG PIF+SMCEWGD PA WG I NSWRTT DI D+W + + AD N+ +A +A
Sbjct: 178 EALSNAGCPIFYSMCEWGDQNPATWGPKIANSWRTTGDIQDNWDSITSRADQNDQWAAYA 237
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWNDPDMLEVGNG M EY HFSIWA+ KAPLL+GCD+ + T+ET+ I+ N EVI
Sbjct: 238 GPGGWNDPDMLEVGNGNMSTVEYRSHFSIWALMKAPLLIGCDIRSATSETLEILSNSEVI 297
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-- 321
VNQ D ++WAGPLS NR+ V+L +R T +TA+W IG+ S+
Sbjct: 298 NVNQDPLGVQGKKVSQQGDLEVWAGPLSNNRVAVILWNRSTSQDAITANWGAIGLPSDAI 357
Query: 322 ---NTLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+ H L + QG LT+ VD H KMY LT
Sbjct: 358 VQARDLWAHSYLPSDLQGSLTSTVDSHDVKMYILT 392
>gi|356533703|ref|XP_003535399.1| PREDICTED: alpha-galactosidase-like [Glycine max]
Length = 408
Score = 327 bits (837), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 178/374 (47%), Positives = 227/374 (60%), Gaps = 43/374 (11%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
R LL NGLG TPPMGWNSWNHF C INE +IRET D S L+ + + +L
Sbjct: 40 ERYLLENGLGQTPPMGWNSWNHFGCDINESLIRETADALVSSG------LAALGYKYINL 93
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN-----RIESNI 125
A+ L QGN+V +TFPSG KALA YVH + I + N R+ ++
Sbjct: 94 DDCWAE-LNRDHQGNMVPKASTFPSGIKALAHYVHGKGLKLGIYSDAGNQTCSKRMPGSL 152
Query: 126 AFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
E+ G+DYLKYDNC N+ T RY MS+AL K GRPIFFSMCEWG P
Sbjct: 153 GHEEQDAKTFASWGVDYLKYDNCENNGILATERYPPMSEALLKTGRPIFFSMCEWGWQDP 212
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
A W + NSWRTT DI D+W M +IAD N+ +A +A PGGWNDPDMLEVGNGGM E
Sbjct: 213 ATWAKTVGNSWRTTGDIEDNWNSMTSIADANDRWASYAGPGGWNDPDMLEVGNGGMTTEE 272
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQI 285
Y HFSIWA++KAPLL+GCD+ + T +I N EV+AVNQ N ++
Sbjct: 273 YRAHFSIWALAKAPLLVGCDIQAMDNTTYELISNREVVAVNQDKLGIQGKKVKNNNGLEV 332
Query: 286 WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTA 340
W GPL GN++ V+L +R + ++ +TA W DIG++ L+EH T +T GK++A
Sbjct: 333 WTGPLRGNKVAVILWNRSSSNATVTACWSDIGLEPGTIVDARDLWEHST-QTSVSGKISA 391
Query: 341 KVDGHSCKMYELTP 354
++D H+CKMY LTP
Sbjct: 392 ELDSHACKMYVLTP 405
>gi|116788561|gb|ABK24922.1| unknown [Picea sitchensis]
Length = 396
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 180/402 (44%), Positives = 221/402 (54%), Gaps = 80/402 (19%)
Query: 9 ADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET------------------ 50
A+ S R+L SNGLG P MGWNSWNHF C I++ IIR T
Sbjct: 20 ANGARSIHRHLSSNGLGRRPQMGWNSWNHFGCNIDDNIIRGTADAMISTGLSKVGYQYVN 79
Query: 51 -DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYV 109
DDCW E +RD QGNLVA +TFPSG K LADYV
Sbjct: 80 IDDCWAEQNRD--------------------------FQGNLVAKASTFPSGMKVLADYV 113
Query: 110 HI----------LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRY 150
H SN+ + ++ E GIDYLKYDNCFN T P RY
Sbjct: 114 HSKGLKLGIYSDAGSNTCSKTMPGSLGHEDQDARTFASWGIDYLKYDNCFNGGTSPQERY 173
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
MS+AL+ AGR IF+SMCEWGD PA W I NSWRTT DI D W + + AD N +
Sbjct: 174 PKMSEALRNAGRLIFYSMCEWGDQNPATWAPKIANSWRTTGDIEDKWESITSRADQNNQW 233
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
A +A PGGWNDPDMLEVGNG M EY HFSIWA+ KAPLL+GCD+ + + +T+ I+GN
Sbjct: 234 AAYAGPGGWNDPDMLEVGNGNMNIEEYRSHFSIWALMKAPLLIGCDIRSASPQTLEILGN 293
Query: 271 EEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
+EVI VNQ D ++WAGPLS R+ ++L +R + +TA W DIG+
Sbjct: 294 KEVINVNQDPLGVQGKKVSKQGDIEVWAGPLSNRRVAIILWNRSGSQAYITARWADIGLP 353
Query: 320 SN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
SN L+ HR L T +G LT+ V H+CKMY L P++
Sbjct: 354 SNALVQARDLWAHRYLPTSLRGSLTSTVASHACKMYILNPIS 395
>gi|225463809|ref|XP_002268711.1| PREDICTED: alpha-galactosidase [Vitis vinifera]
gi|297742707|emb|CBI35341.3| unnamed protein product [Vitis vinifera]
Length = 409
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 179/392 (45%), Positives = 224/392 (57%), Gaps = 81/392 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
RNLL+NGL TP MGWNSWNHF C+I+E +I+ET DDCWGE
Sbjct: 44 RNLLANGLACTPQMGWNSWNHFGCKIDENLIKETADAMVSSGLAALGYHYVNLDDCWGEK 103
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
+RDS GNLVA +TFPSG KALADYVH
Sbjct: 104 NRDSD--------------------------GNLVAKHSTFPSGIKALADYVHKKGLKLG 137
Query: 112 ------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
+ SL + + F +DYLKYDNC N RP RY MS AL
Sbjct: 138 IYSDAGTQTCSKTMPGSLGHEEKDAKTFASWEVDYLKYDNCENTGIRPQERYIKMSKALL 197
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGG 218
+GR IFFS+CEWG PA W ++ NSWRTT DI D+W M + AD N+ +A HA PGG
Sbjct: 198 NSGRSIFFSLCEWGQEDPATWAKDVGNSWRTTGDIEDNWDSMTSRADENDKWAAHAGPGG 257
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDMLEVGNGGM+ EY HFSIWA++KAPLL+GCDV ++ T ++ N+EVIAVNQ
Sbjct: 258 WNDPDMLEVGNGGMRKEEYQSHFSIWALAKAPLLIGCDVRSMDNSTFELLSNKEVIAVNQ 317
Query: 279 -----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---- 323
+ ++WAGPLS N++ V+L +R + + MTAHW DIG+ +
Sbjct: 318 DELGVQGRKVKKDGNLEVWAGPLSDNKVAVVLWNRGSSQANMTAHWSDIGLSPSAVVDAR 377
Query: 324 -LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T ++ QG+L A+VD H+CKMY LTP
Sbjct: 378 DLWAHST-QSSVQGQLWAQVDSHACKMYILTP 408
>gi|7572929|emb|CAB87430.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
Length = 434
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 181/396 (45%), Positives = 225/396 (56%), Gaps = 82/396 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R L+NGL TP MGWNSWN F C INE +I+ET DDCW
Sbjct: 59 RLQLNNGLARTPQMGWNSWNFFACNINETVIKETADALVSSGLADLGYIHVNIDDCWSNL 118
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RDS+ G LV + TFPSG K LADYVH
Sbjct: 119 LRDSE--------------------------GQLVPHPETFPSGIKLLADYVHSKGLKLG 152
Query: 112 -----------LISNSLDNRIE-SNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
+ SL + ++ ++I G+DYLKYDNCFN +P RY M DAL
Sbjct: 153 IYSDAGVFTCEVHPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIERYPPMRDALNA 212
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IF+S+CEWG PALW + NSWRTTDDI+D+WA M TIAD+N +A +A PGGW
Sbjct: 213 TGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGW 272
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLE+GNGGM Y EY HFSIWA+ KAPLL+GCDV N+TAET+ I+ N+E+IAVNQ
Sbjct: 273 NDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETLEILSNKEIIAVNQD 332
Query: 279 ------------AIND-QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---- 321
ND QQ+W+GPLSG+R+VV L +R + + +TA WD IG++S
Sbjct: 333 PLGVQGRKIQANGENDCQQVWSGPLSGDRMVVALWNRCSEPATITASWDMIGLESTISVS 392
Query: 322 -NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H+ + G A+VD H C MY LTP T
Sbjct: 393 VRDLWQHKDVTENTSGSFEAQVDAHDCHMYVLTPQT 428
>gi|297816956|ref|XP_002876361.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
lyrata]
gi|297322199|gb|EFH52620.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 181/396 (45%), Positives = 223/396 (56%), Gaps = 82/396 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R L+NGL TP MGWNSWN F C INE +I+ET DDCW
Sbjct: 55 RLQLNNGLARTPQMGWNSWNFFACNINETVIKETADALLSSGLADLGYIHVNIDDCWSNL 114
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RDS+ G LV + TFPSG K LADYVH
Sbjct: 115 LRDSK--------------------------GQLVPHPETFPSGIKLLADYVHSKGLKLG 148
Query: 112 -----------LISNSLDNRIE-SNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
+ SL + ++ ++I G+DYLKYDNCFN +P RY M DAL
Sbjct: 149 IYSDAGVFTCQVRPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNA 208
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IF+S+CEWG PALW + NSWRTTDDI+D+WA M TIAD+N +A +A PGGW
Sbjct: 209 TGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGW 268
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLE+GNGGM Y EY HFSIWA+ KAPLL+GCDV N+TAET I+ N+EVIAVNQ
Sbjct: 269 NDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETFEILSNKEVIAVNQD 328
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---- 321
+ QQ+W+GPLSG+RIVV L +R + + +TA WD IG++S
Sbjct: 329 PLGVQGRKIQANGEDDCQQVWSGPLSGDRIVVALWNRCSEQATITASWDVIGLESTISVS 388
Query: 322 -NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H+ + G A+VD H C MY LTP T
Sbjct: 389 VRDLWQHKDVTENASGSFEAQVDAHDCHMYVLTPQT 424
>gi|113497|sp|P14749.1|AGAL_CYATE RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|18292|emb|CAA32772.1| alpha-galactosidase preproprotein [Cyamopsis tetragonoloba]
Length = 411
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 176/393 (44%), Positives = 223/393 (56%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR L NGLG TPPMGWNSWNHF C INE ++RET DDCW E
Sbjct: 44 RRYLAENGLGQTPPMGWNSWNHFGCDINENVVRETADAMVSTGLAALGYQYINLDDCWAE 103
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RDS+ GN+V N FPSG KALADYVH
Sbjct: 104 LNRDSE--------------------------GNMVPNAAAFPSGIKALADYVHSKGLKL 137
Query: 112 -----LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ + R+ ++ E+ G+DYLKYDNC N RY M AL
Sbjct: 138 GVYSDAGNQTCSKRMPGSLGHEEQDAKTFASWGVDYLKYDNCENLGISVKERYPPMGKAL 197
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
+GRPIFFSMCEWG P +W +I NSWRTT DI D+W M +IAD N+ +A +A PG
Sbjct: 198 LSSGRPIFFSMCEWGWEDPQIWAKSIGNSWRTTGDIEDNWNSMTSIADSNDKWASYAGPG 257
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ + T +I N EVIAVN
Sbjct: 258 GWNDPDMLEVGNGGMTTEEYRSHFSIWALAKAPLLVGCDIRAMDDTTHELISNAEVIAVN 317
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q + ND ++WAGPLS N++ V+L +R + + +TA W DIG++ T
Sbjct: 318 QDKLGVQGKKVKSTNDLEVWAGPLSDNKVAVILWNRSSSRATVTASWSDIGLQQGTTVDA 377
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+EH T ++ G+++A++D H+CKMY LTP
Sbjct: 378 RDLWEHST-QSLVSGEISAEIDSHACKMYVLTP 409
>gi|22331822|ref|NP_191190.2| alpha-galactosidase [Arabidopsis thaliana]
gi|18377735|gb|AAL67017.1| putative alpha-galactosidase [Arabidopsis thaliana]
gi|21281028|gb|AAM45068.1| putative alpha-galactosidase [Arabidopsis thaliana]
gi|332645987|gb|AEE79508.1| alpha-galactosidase [Arabidopsis thaliana]
Length = 437
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 181/396 (45%), Positives = 225/396 (56%), Gaps = 82/396 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R L+NGL TP MGWNSWN F C INE +I+ET DDCW
Sbjct: 62 RLQLNNGLARTPQMGWNSWNFFACNINETVIKETADALVSSGLADLGYIHVNIDDCWSNL 121
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RDS+ G LV + TFPSG K LADYVH
Sbjct: 122 LRDSE--------------------------GQLVPHPETFPSGIKLLADYVHSKGLKLG 155
Query: 112 -----------LISNSLDNRIE-SNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
+ SL + ++ ++I G+DYLKYDNCFN +P RY M DAL
Sbjct: 156 IYSDAGVFTCEVHPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIERYPPMRDALNA 215
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IF+S+CEWG PALW + NSWRTTDDI+D+WA M TIAD+N +A +A PGGW
Sbjct: 216 TGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGW 275
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLE+GNGGM Y EY HFSIWA+ KAPLL+GCDV N+TAET+ I+ N+E+IAVNQ
Sbjct: 276 NDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETLEILSNKEIIAVNQD 335
Query: 279 ------------AIND-QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---- 321
ND QQ+W+GPLSG+R+VV L +R + + +TA WD IG++S
Sbjct: 336 PLGVQGRKIQANGENDCQQVWSGPLSGDRMVVALWNRCSEPATITASWDMIGLESTISVS 395
Query: 322 -NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H+ + G A+VD H C MY LTP T
Sbjct: 396 VRDLWQHKDVTENTSGSFEAQVDAHDCHMYVLTPQT 431
>gi|29335747|dbj|BAC66445.1| alpha-galactosidase [Helianthus annuus]
Length = 428
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 177/389 (45%), Positives = 223/389 (57%), Gaps = 81/389 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLG TP MGWNSWNHF C I EK+IRET DDCW E +RD
Sbjct: 66 LDNGLGRTPQMGWNSWNHFACNIEEKLIRETADAMVSTGLAAAGYQYVNIDDCWAELNRD 125
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
SQ GN V TFPSG +ALADYVH
Sbjct: 126 SQ--------------------------GNFVPKADTFPSGIEALADYVHNKGLKLGIYS 159
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAG 161
+ + SL + + F GIDYLKYDNC + P RY M+ AL+KAG
Sbjct: 160 DAGTQTCSLKMPGSLGHEEQDAKTFASWGIDYLKYDNCNDQGRSPKERYPIMTKALQKAG 219
Query: 162 RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWND 221
RPIFFS+CEWG PA W + + NSWRTT DISD+W M + AD N+ +A +AKPGGWND
Sbjct: 220 RPIFFSLCEWGREDPATWANEVGNSWRTTGDISDNWDSMTSRADENDKWAAYAKPGGWND 279
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ +++ ET I+ N EVIAVNQ
Sbjct: 280 PDMLEVGNGGMTTEEYRSHFSIWALAKAPLLVGCDMRSMSKETHDILSNREVIAVNQDSL 339
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LF 325
D ++WAGPL+ N++ V+L +R + + +TA+W DIG+ S L+
Sbjct: 340 GVQGKKVKKNGDLEVWAGPLAHNKVAVILWNRGSSRAQITAYWSDIGLNSTTVVNVRDLW 399
Query: 326 EHRTLETKFQGKLTAKVDGHSCKMYELTP 354
HRT + +G+++A V+ H+CKMY LTP
Sbjct: 400 AHRT-QRSVKGQISATVESHACKMYVLTP 427
>gi|224108391|ref|XP_002314831.1| predicted protein [Populus trichocarpa]
gi|222863871|gb|EEF01002.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 179/393 (45%), Positives = 226/393 (57%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL NGLG TP MGWNS+NHF C++NE++IRET DDCW E
Sbjct: 20 RRNLLDNGLGHTPQMGWNSYNHFPCKLNEELIRETADAMVSSGMAALGYKYINLDDCWAE 79
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----I 111
RDS+ GNLV K+ FPSG KALADYVH I
Sbjct: 80 LKRDSK--------------------------GNLVPKKSIFPSGMKALADYVHSKGLKI 113
Query: 112 LISN-----SLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I + + ++ +E+ GIDYLKYDNC N+ P RY MS AL
Sbjct: 114 GIYGDAGTLTCAKTMPGSLGYEEQDAKTFASWGIDYLKYDNCHNNGVSPQERYSKMSKAL 173
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIFFS+CEWG PA W + I NSWRTT DI D+W M++ AD N+ +A +A PG
Sbjct: 174 LSTGRPIFFSLCEWGQDNPATWAAGIGNSWRTTGDIKDTWDSMISRADQNDDWASYAGPG 233
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA+ KAPLLLGCD+ ++ +T ++ N+EVIAVN
Sbjct: 234 GWNDPDMLEVGNGGMSTEEYRSHFSIWALMKAPLLLGCDIRSMDNDTFELLSNKEVIAVN 293
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++WAGPLSG RI +L +R + + +TA + DIG+KS+
Sbjct: 294 QDKLGLQGKKVKKTGDLEVWAGPLSGKRIAAVLWNRGSSRASITADFKDIGLKSSAVVNA 353
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T+ T + +LT V+ H+CKM+ LTP
Sbjct: 354 RDLWAHSTM-TSVRHQLTVTVESHACKMFVLTP 385
>gi|222636967|gb|EEE67099.1| hypothetical protein OsJ_24103 [Oryza sativa Japonica Group]
Length = 409
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 177/379 (46%), Positives = 224/379 (59%), Gaps = 54/379 (14%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
RR+LL NGLG TP MGWNSWNHF C INE IR T D + +
Sbjct: 42 RRSLLDNGLGRTPQMGWNSWNHFGCNINENTIRSTVDA-----------------LISTG 84
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-----IESNI 125
A N G + A+ FPSG KALADYVH + I +S +R + ++
Sbjct: 85 LTKAGYTYVNLGGGYIAADPKKFPSGIKALADYVHSKGLKLGIYSSAGSRTCSKTMPGSL 144
Query: 126 AFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+E G+DYLKYDNC +D + T+R+ MS AL+K GRPIF+S+CEWG
Sbjct: 145 GYEDIDAKTFASWGVDYLKYDNCNSDGSSETVRFPKMSFALRKTGRPIFYSICEWGQRNV 204
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
A WG NSWRTT DI+DSWA ML+ D N+ A +AKPGGWNDPDMLEVGNGGM +E
Sbjct: 205 ATWGGQYGNSWRTTGDINDSWASMLSNIDSNDASASYAKPGGWNDPDMLEVGNGGMTNDE 264
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI------------NDQQ 284
YVVH S+WAI+KAPL++GCDV +++ ETM I+ N EVIA+NQ N+ +
Sbjct: 265 YVVHISLWAIAKAPLIIGCDVRSISRETMEILSNPEVIAINQDPLGVQGKKVRKYDNEIE 324
Query: 285 IWAGPLSGNRIVVLLESR-KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKL 338
+WAGPLS R VLL +R T S +TA W DIG+ ++ H T +F G L
Sbjct: 325 VWAGPLSQQRTAVLLLNRGATGSRQITAAWQDIGVGPGVAVEAKNVWLHATAPGRFTGSL 384
Query: 339 TAKVDGHSCKMYELTPVTK 357
TA+V HSCK++ LTPV +
Sbjct: 385 TAEVAAHSCKLFVLTPVGR 403
>gi|359496639|ref|XP_002271908.2| PREDICTED: alpha-galactosidase-like [Vitis vinifera]
gi|297744869|emb|CBI38328.3| unnamed protein product [Vitis vinifera]
Length = 404
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 175/394 (44%), Positives = 224/394 (56%), Gaps = 86/394 (21%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL+NG+ TPPMGWNSWNHF C+I+E ++ T DDCWGE +R
Sbjct: 40 LLANGVARTPPMGWNSWNHFQCKIDESTVKATADALISTGLAALGYKYVNIDDCWGEGNR 99
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
DS+ GNLVA +TFPSG KALADYVH + L
Sbjct: 100 DSR--------------------------GNLVARSSTFPSGIKALADYVH---ARGLKL 130
Query: 120 RIESNIAF----------------------EQGIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I S+ + E G+DYLKYDNC+ND ++P RY MS AL
Sbjct: 131 GIYSDAGYSTCSKTMPGSFGHEDQDARTFAEWGVDYLKYDNCYNDGSKPQDRYARMSSAL 190
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
KAGRPI +S+CEWG PA W S+I N+WRTT DI D+W + +IAD N ++ +A PG
Sbjct: 191 HKAGRPILYSLCEWGQENPATWASSIGNAWRTTGDIKDNWDSITSIADANNIWGRYAGPG 250
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPDMLEVGNGGM EY HFSIWA+ KAPLL+GCDV + + ET+ I+GN+EVI VN
Sbjct: 251 RWNDPDMLEVGNGGMSLEEYRSHFSIWALMKAPLLIGCDVRSSSKETLSILGNKEVIDVN 310
Query: 278 Q---AINDQQI--------WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q I ++I WAGPLS R+ V+L +R + + +T W +IG+ +N
Sbjct: 311 QDPLGIQGRKIHSKASVEVWAGPLSKRRVAVVLWNRGSSQAPITVGWREIGLSPSNPVTV 370
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
L+ H L + +LTA V H+CKMY LTP+
Sbjct: 371 RDLWTHSFLSRSMRSRLTAYVAPHACKMYILTPL 404
>gi|147810310|emb|CAN66885.1| hypothetical protein VITISV_000962 [Vitis vinifera]
Length = 392
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 178/392 (45%), Positives = 223/392 (56%), Gaps = 81/392 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
RNLL+NGL TP MGWNSWNHF C+I+E +I+ET DDCWGE
Sbjct: 27 RNLLANGLACTPQMGWNSWNHFGCKIDENLIKETADAMVSSGLAALGYHYVNLDDCWGEK 86
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
+RDS GNLVA + FPSG KALADYVH
Sbjct: 87 NRDSD--------------------------GNLVAKHSAFPSGIKALADYVHKKGLKLG 120
Query: 112 ------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
+ SL + + F +DYLKYDNC N RP RY MS AL
Sbjct: 121 IYSDAGTQTCSKTMPGSLGHEEQDAKTFASWEVDYLKYDNCENSGIRPQERYIKMSKALL 180
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGG 218
+GR IFFS+CEWG PA W ++ NSWRTT DI D+W M + AD N+ +A HA PGG
Sbjct: 181 NSGRSIFFSLCEWGQEDPATWAKDVGNSWRTTGDIEDNWDSMTSRADENDKWAAHAGPGG 240
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ T ++ N+EVIAVNQ
Sbjct: 241 WNDPDMLEVGNGGMXKEEYQSHFSIWALAKAPLLIGCDVRSMDNSTFELLSNKEVIAVNQ 300
Query: 279 -----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEH 327
+ ++WAGPLS N++ V+L +R + + MTAHW DIG+ S + + +
Sbjct: 301 DELGVQGRKVKKDGNLEVWAGPLSDNKVAVVLWNRGSSQANMTAHWSDIGL-SPSAVVDA 359
Query: 328 RTL-----ETKFQGKLTAKVDGHSCKMYELTP 354
R L ++ QG+L A+VD H+CKMY LTP
Sbjct: 360 RDLWAXSTQSSVQGQLWAQVDSHACKMYILTP 391
>gi|357146802|ref|XP_003574116.1| PREDICTED: alpha-galactosidase-like [Brachypodium distachyon]
Length = 411
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 180/394 (45%), Positives = 227/394 (57%), Gaps = 84/394 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR L NGLG TP MGWNSWNHF C I+E++IRET DDCW E
Sbjct: 46 RRMALQNGLGRTPQMGWNSWNHFFCGISEEMIRETADALVNTGLAKLGYQYVNIDDCWAE 105
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
+RD Q GNLV NK TFPSG KALADYVH
Sbjct: 106 LNRDYQ--------------------------GNLVPNKKTFPSGIKALADYVHAKGLKL 139
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDA 156
+ SLD+ + F G+DYLKYDNC ND R + RY MS+A
Sbjct: 140 GVYSDAGTKTCSNQMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERYTKMSNA 198
Query: 157 LKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKP 216
+K G+ IFFS+CEWG PA WG+ + NSWRTT DI+D+WA M + AD N+ +A +A P
Sbjct: 199 MKTYGKNIFFSLCEWGKENPATWGAGMGNSWRTTADIADNWASMTSCADQNDRWASYAGP 258
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ +T I+ N EVIAV
Sbjct: 259 GGWNDPDMLEVGNGGMSDAEYRSHFSIWALAKAPLLIGCDVRTMSQQTKDILSNGEVIAV 318
Query: 277 N-----------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-- 323
N Q+ N ++WAGPLSGNR V+L +R+ + + +TAHW ++G+ ++
Sbjct: 319 NQDSLGVQGKKVQSDNGLEVWAGPLSGNRKAVVLWNRQGYQATITAHWSNVGLPASAAVT 378
Query: 324 ---LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H + QG+++A V H CKMY LTP
Sbjct: 379 ARDLWAHSSFSA--QGQISASVAPHDCKMYILTP 410
>gi|52699549|gb|AAU86897.1| glycosyl hydrolase family-like protein [Salvia miltiorrhiza]
Length = 431
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 175/391 (44%), Positives = 218/391 (55%), Gaps = 82/391 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NGL TP MGWNSWN F C INE +I+ET DDCW E RD
Sbjct: 60 LNNGLAKTPQMGWNSWNFFACDINEIVIKETADALLTTGLAKLGYNYVNIDDCWSELVRD 119
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH--------IL 112
S+ G LV + TFPSG KALADYVH
Sbjct: 120 SK--------------------------GQLVPDSKTFPSGIKALADYVHSKGLKLGIYS 153
Query: 113 ISNSLDNRIESNIAFEQ----------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + ++ F + G+DYLKYDNCFN P RY M DAL GR
Sbjct: 154 DAGAFTCQVRPGSIFHESDDAKLFASWGVDYLKYDNCFNLGIPPKERYPPMRDALNATGR 213
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+CEWG PALW + NSWRTTDDI+DSWA M TIAD+N+ +A +A PGGWNDP
Sbjct: 214 SIFYSLCEWGVDDPALWAGTVGNSWRTTDDINDSWASMTTIADLNDKWAAYAGPGGWNDP 273
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGNGGM Y EY HFSIWA+ KAPLL+GCDV N+T+E I+ NEEVI+VNQ
Sbjct: 274 DMLEVGNGGMTYEEYRAHFSIWALMKAPLLIGCDVRNITSEAFEILSNEEVISVNQDPLG 333
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NT 323
A N Q+WAGPLSG R+ V+L +R + ++ +T WD +G++S+
Sbjct: 334 VQGRKVYAYGADNSYQVWAGPLSGQRLAVVLWNRGSKTASITVKWDVLGLESSISVSVRD 393
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++H + +A+VD HS +MY TP
Sbjct: 394 LWKHNYVSANSMAYFSARVDAHSSEMYVFTP 424
>gi|212274575|ref|NP_001130466.1| uncharacterized protein LOC100191564 [Zea mays]
gi|194689202|gb|ACF78685.1| unknown [Zea mays]
Length = 358
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 172/363 (47%), Positives = 225/363 (61%), Gaps = 44/363 (12%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MGWNSWNHF C INE ++R T D I L+ + + + +L A + +G
Sbjct: 1 MGWNSWNHFQCDINEAVVRSTADAL------VAIGLAKAGYTYVNLDDCWADSERTK-EG 53
Query: 90 NLVANKTTFPSGTKALADYVHI----------LISNSLDNRIESNIAFEQ---------G 130
+VAN TFPSG KALADYVH + + NR+ ++ +E+ G
Sbjct: 54 YMVANPKTFPSGIKALADYVHSKGLKLGIYSSAGTRTCSNRMPGSLGYEERDAKMFASWG 113
Query: 131 IDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTT 190
+DYLKYDNC+ D T T+R+ MS AL +GRPIF+S+CEWG M WG NSWRTT
Sbjct: 114 VDYLKYDNCYRDGTPETVRFGRMSRALVNSGRPIFYSLCEWGFMEVPKWGGMYGNSWRTT 173
Query: 191 DDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAP 250
DI+D+W+ ML D+N+ YA +AKPGGWNDPDMLEVGNGGM YNEYVVHFS+WAI+KAP
Sbjct: 174 GDINDTWSGMLDNIDLNDAYARYAKPGGWNDPDMLEVGNGGMAYNEYVVHFSLWAIAKAP 233
Query: 251 LLLGCDVGNLTAETMPIIGNEEVIAVNQ------------AINDQQIWAGPLSGNRIVVL 298
L++GCDV ++ ET+ I+ N EVIA++Q ND ++WAG LSG+R VL
Sbjct: 234 LVIGCDVTRVSNETLGILSNAEVIAISQDRLGVQGQKVSKYGNDLEVWAGQLSGHRKAVL 293
Query: 299 LESR-KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
L +R T S+ +TA W D+GI+ +++H TL F G LTA V HSCK++ L
Sbjct: 294 LLNRGATRSASITAAWPDVGIRRGVAVEARDVWKHETLPGWFTGSLTAVVGPHSCKLFVL 353
Query: 353 TPV 355
TPV
Sbjct: 354 TPV 356
>gi|302758534|ref|XP_002962690.1| hypothetical protein SELMODRAFT_78329 [Selaginella moellendorffii]
gi|300169551|gb|EFJ36153.1| hypothetical protein SELMODRAFT_78329 [Selaginella moellendorffii]
Length = 390
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 174/399 (43%), Positives = 224/399 (56%), Gaps = 83/399 (20%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------D 51
S+ +R LL NGLG TPPMGWNSWN+FHC INE+IIR T D
Sbjct: 21 SDSFERHRLLENGLGRTPPMGWNSWNYFHCGINEEIIRATTDAIVSTGLRDVGYEYVNID 80
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI 111
DCW E SRD++ GNL A +TFPSG KALADYVH
Sbjct: 81 DCWAELSRDNE--------------------------GNLQARNSTFPSGIKALADYVH- 113
Query: 112 LISNSLDNRIESNIAF----EQ-----------------GIDYLKYDNCFNDDTRPTIRY 150
S +L I S+ + EQ G+DYLKYDNC D + P IRY
Sbjct: 114 --SKNLKFGIYSDAGYLTCQEQPGSLNHEDQDAKTFAAWGVDYLKYDNCHTDGSSPRIRY 171
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
M DAL AGRPIFFSMCEWG PA W S++ NSWRTT DI + W M++IAD N +
Sbjct: 172 PIMRDALSAAGRPIFFSMCEWGQEDPATWASSVGNSWRTTGDIENKWESMISIADKNNAW 231
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
A+HA PGGWNDPDMLE+GNGGM + E HFS+WA+ KAPL++GCDV N A+ + I+ N
Sbjct: 232 AEHAAPGGWNDPDMLEIGNGGMSFQESRTHFSLWALMKAPLIIGCDVRNTVAQDLAILMN 291
Query: 271 EEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
+EVI +NQ +Q++W GPLS R +L+ +R + + +TA + +GI
Sbjct: 292 KEVIQINQDALGVQGRKVAGDGEQEVWGGPLSNGRFALLMLNRGSDPADITAEFAALGIP 351
Query: 320 SNNTLFEHRTLETK---FQGKLTAKVDGHSCKMYELTPV 355
SN ++ + + + +T++V+ H MY LTPV
Sbjct: 352 SNVSVMIRDVWKVRTGTYNSSITSRVESHDVAMYILTPV 390
>gi|297610712|ref|NP_001064939.2| Os10g0493600 [Oryza sativa Japonica Group]
gi|60389815|sp|Q9FXT4.1|AGAL_ORYSJ RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|9971577|dbj|BAB12570.1| alpha-galactosidase [Oryza sativa Japonica Group]
gi|22128720|gb|AAM92832.1| putative alpha-galactosidase preproprotein [Oryza sativa Japonica
Group]
gi|31432825|gb|AAP54412.1| Alpha-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|78708844|gb|ABB47819.1| Alpha-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|255679520|dbj|BAF26853.2| Os10g0493600 [Oryza sativa Japonica Group]
Length = 417
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/390 (45%), Positives = 224/390 (57%), Gaps = 84/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF+C INE+IIRET DDCW E SRD
Sbjct: 56 FENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRD 115
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
SQ GN V N+ TFPSG KALADYVH
Sbjct: 116 SQ--------------------------GNFVPNRQTFPSGIKALADYVHAKGLKLGIYS 149
Query: 112 -LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
S + N++ ++ E+ G+DYLKYDNC ND R + RY MS+A+K
Sbjct: 150 DAGSQTCSNKMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERYTRMSNAMKTY 208
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTT DI+D+W M + AD N+ +A +A PGGWN
Sbjct: 209 GKNIFFSLCEWGKENPATWAGRMGNSWRTTGDIADNWGSMTSRADENDQWAAYAGPGGWN 268
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV +++ +T I+ N EVIAVN
Sbjct: 269 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDS 328
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTL 324
Q+ N ++WAGPLS NR V+L +R+++ + +TAHW +IG+ + L
Sbjct: 329 LGVQGKKVQSDNGLEVWAGPLSNNRKAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDL 388
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ H + QG+++A V H CKMY LTP
Sbjct: 389 WAHSSFAA--QGQISASVAPHDCKMYVLTP 416
>gi|33358179|pdb|1UAS|A Chain A, Crystal Structure Of Rice Alpha-Galactosidase
Length = 362
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/390 (45%), Positives = 224/390 (57%), Gaps = 84/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF+C INE+IIRET DDCW E SRD
Sbjct: 1 FENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRD 60
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
SQ GN V N+ TFPSG KALADYVH
Sbjct: 61 SQ--------------------------GNFVPNRQTFPSGIKALADYVHAKGLKLGIYS 94
Query: 112 -LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
S + N++ ++ E+ G+DYLKYDNC ND R + RY MS+A+K
Sbjct: 95 DAGSQTCSNKMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERYTRMSNAMKTY 153
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTT DI+D+W M + AD N+ +A +A PGGWN
Sbjct: 154 GKNIFFSLCEWGKENPATWAGRMGNSWRTTGDIADNWGSMTSRADENDQWAAYAGPGGWN 213
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV +++ +T I+ N EVIAVN
Sbjct: 214 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDS 273
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTL 324
Q+ N ++WAGPLS NR V+L +R+++ + +TAHW +IG+ + L
Sbjct: 274 LGVQGKKVQSDNGLEVWAGPLSNNRKAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDL 333
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ H + QG+++A V H CKMY LTP
Sbjct: 334 WAHSSFAA--QGQISASVAPHDCKMYVLTP 361
>gi|302797288|ref|XP_002980405.1| hypothetical protein SELMODRAFT_112569 [Selaginella moellendorffii]
gi|300152021|gb|EFJ18665.1| hypothetical protein SELMODRAFT_112569 [Selaginella moellendorffii]
Length = 391
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 174/400 (43%), Positives = 224/400 (56%), Gaps = 84/400 (21%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------D 51
S+ +R LL NGLG TPPMGWNSWN+FHC INE+IIR T D
Sbjct: 21 SDSFERHRLLENGLGRTPPMGWNSWNYFHCGINEEIIRATADAIVSTGLRDVGYEYVNID 80
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI 111
DCW E SRD++ GNL A +TFPSG KALADYVH
Sbjct: 81 DCWAELSRDNE--------------------------GNLQARNSTFPSGIKALADYVH- 113
Query: 112 LISNSLDNRIESNIAF----EQ-----------------GIDYLKYDNCFNDDTRPTIRY 150
S +L I S+ + EQ G+DYLKYDNC D + P IRY
Sbjct: 114 --SKNLKFGIYSDAGYLTCQEQPGSLNHEDQDAKTFAAWGVDYLKYDNCHTDGSSPRIRY 171
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
M DAL AGRPIFFSMCEWG PA W S++ NSWRTT DI + W M++IAD N +
Sbjct: 172 PIMRDALSAAGRPIFFSMCEWGQEDPATWASSVGNSWRTTGDIQNKWESMISIADKNNAW 231
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
A+HA PGGWNDPDMLE+GNGGM + E HFS+WA+ KAPL++GCDV N+ A+ + I+ N
Sbjct: 232 AEHAAPGGWNDPDMLEIGNGGMSFQESRTHFSLWALMKAPLIIGCDVRNIAAQDLAILMN 291
Query: 271 EEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
+EVI +NQ +Q++W GPLS R +L+ +R + + +TA + +GI
Sbjct: 292 KEVIQINQDALGVQGRKVAGDGEQEVWGGPLSNGRFALLMLNRGSDPADITAEFAALGIP 351
Query: 320 SNNTLFEHRTLET----KFQGKLTAKVDGHSCKMYELTPV 355
SN ++ + + +T++V+ H MY LTPV
Sbjct: 352 SNVSVMIRDVWKLPGTGTYNSSITSRVESHDVAMYILTPV 391
>gi|297810941|ref|XP_002873354.1| alpha-galactosidase 2 [Arabidopsis lyrata subsp. lyrata]
gi|297319191|gb|EFH49613.1| alpha-galactosidase 2 [Arabidopsis lyrata subsp. lyrata]
Length = 394
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 176/396 (44%), Positives = 224/396 (56%), Gaps = 81/396 (20%)
Query: 12 NGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DD 52
+G R L+ NGL L+P MGWNSWNHF C INE +I++T DD
Sbjct: 22 DGFQSRMLMHNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDD 81
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI- 111
CWGE RDSQ G+LVA +TFPSG KAL+DYVH
Sbjct: 82 CWGELKRDSQ--------------------------GSLVAKASTFPSGIKALSDYVHSK 115
Query: 112 -----------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAM 153
+ SL + + F GIDYLKYDNC N T P RY M
Sbjct: 116 GLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPRERYPKM 175
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
S AL +GR IFFS+CEWG PA W +I NSWRTT DI D+W M IAD N+ +A +
Sbjct: 176 SKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWKSMTLIADQNDRWASY 235
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A+PG WNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ T ++ N+EV
Sbjct: 236 ARPGSWNDPDMLEVGNGGMTKEEYRSHFSIWALAKAPLLIGCDLRSMDNVTFELLSNKEV 295
Query: 274 IAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN 322
IAVNQ D ++WAGPLS R+ V+L +R + S+ +TA W DIG+ S++
Sbjct: 296 IAVNQDKLGIQGKKIKKDGDLEVWAGPLSKKRVAVILWNRGSVSAKITARWADIGLSSSD 355
Query: 323 T-----LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+EH T ++ + +L+A V+ H+CKMY LT
Sbjct: 356 IVNARDLWEHST-HSRVKNQLSALVEPHACKMYTLT 390
>gi|226492690|ref|NP_001142381.1| uncharacterized protein LOC100274554 precursor [Zea mays]
gi|194708560|gb|ACF88364.1| unknown [Zea mays]
Length = 423
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 177/390 (45%), Positives = 219/390 (56%), Gaps = 84/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF C INE +I++T DDCW ES RD
Sbjct: 62 FENGLGRTPQMGWNSWNHFGCGINENLIKQTADALVNTGLAKLGYEYVNIDDCWAESDRD 121
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
Q G+ VAN+ TFPSG KALADYVH
Sbjct: 122 HQ--------------------------GSFVANRQTFPSGIKALADYVHAKGLKLGIYS 155
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
+ SLD+ + F GIDYLKYDNC ND R + RY MS+A+K
Sbjct: 156 DAGTRTCSQKMPGSLDHEEQDVKTFSSWGIDYLKYDNC-NDAGRSVMERYTKMSNAMKTY 214
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTTDDI+D+W M + AD N+ +A +A PGGWN
Sbjct: 215 GKGIFFSLCEWGRQNPATWAGGMGNSWRTTDDIADNWGSMTSRADQNDRWASYAGPGGWN 274
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ +TM I+ N EVIAVN
Sbjct: 275 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRAMSQQTMGILSNSEVIAVNQDS 334
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----L 324
Q+ N ++WAGPLS NR V+L +R+ + + +TA W IG+ S+ L
Sbjct: 335 QGAQGKKVQSSNGLEVWAGPLSNNRKAVVLWNRQGYQATITAQWSSIGLASSTAVTARDL 394
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ H + QG+L+A V H CKMY LTP
Sbjct: 395 WAHSSFSA--QGQLSASVPPHDCKMYVLTP 422
>gi|414870942|tpg|DAA49499.1| TPA: hypothetical protein ZEAMMB73_096407 [Zea mays]
gi|414870943|tpg|DAA49500.1| TPA: hypothetical protein ZEAMMB73_096407 [Zea mays]
Length = 423
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 177/390 (45%), Positives = 219/390 (56%), Gaps = 84/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF C INE +I++T DDCW ES RD
Sbjct: 62 FENGLGRTPQMGWNSWNHFGCGINENLIKQTADALVNTGLAKLGYEYVNIDDCWAESDRD 121
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
QG+ VAN+ TFPSG KALADYVH
Sbjct: 122 --------------------------YQGSFVANRQTFPSGIKALADYVHAKGLKLGIYS 155
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
+ SLD+ + F GIDYLKYDNC ND R + RY MS+A+K
Sbjct: 156 DAGTRTCSQKMPGSLDHEEQDVKTFSSWGIDYLKYDNC-NDAGRSVMERYTKMSNAMKTY 214
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTTDDI+D+W M + AD N+ +A +A PGGWN
Sbjct: 215 GKGIFFSLCEWGRQNPATWAGGMGNSWRTTDDIADNWGSMTSRADQNDRWASYAGPGGWN 274
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ +TM I+ N EVIAVN
Sbjct: 275 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRAMSQQTMGILSNSEVIAVNQDS 334
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----L 324
Q+ N ++WAGPLS NR V+L +R+ + + +TA W IG+ S+ L
Sbjct: 335 QGAQGKKVQSSNGLEVWAGPLSNNRKAVVLWNRQGYQATITAQWSSIGLASSTAVTARDL 394
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ H + QG+L+A V H CKMY LTP
Sbjct: 395 WAHSSFSA--QGQLSASVPPHDCKMYVLTP 422
>gi|194693606|gb|ACF80887.1| unknown [Zea mays]
Length = 423
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 177/390 (45%), Positives = 219/390 (56%), Gaps = 84/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF C INE +I++T DDCW ES RD
Sbjct: 62 FENGLGRTPQMGWNSWNHFGCGINENLIKQTADALVNTGLAKLGYEYVNIDDCWAESDRD 121
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
QG+ VAN+ TFPSG KALADYVH
Sbjct: 122 --------------------------YQGSFVANRQTFPSGIKALADYVHAKGLKLGIYS 155
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
+ SLD+ + F GIDYLKYDNC ND R + RY MS+A+K
Sbjct: 156 DAGTRTCSQKMPGSLDHEEQDVKTFSSWGIDYLKYDNC-NDAGRSVMERYTKMSNAMKTY 214
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTTDDI+D+W M + AD N+ +A +A PGGWN
Sbjct: 215 GKGIFFSLCEWGRQNPATWAGGMGNSWRTTDDIADNWGSMTSRADQNDRWASYAGPGGWN 274
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV ++ +TM I+ N EVIAVN
Sbjct: 275 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRAMSQQTMGILSNSEVIAVNQDS 334
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----L 324
Q+ N ++WAGPLS NR V+L +R+ + + +TA W IG+ S+ L
Sbjct: 335 QGAQGKKVQSSNGLEVWAGPLSNNRKAVVLWNRQGYQATITAQWSSIGLASSTAVTARDL 394
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ H + QG+L+A V H CKMY LTP
Sbjct: 395 WAHSSFSA--QGQLSASVPPHDCKMYVLTP 422
>gi|18415796|ref|NP_568193.1| alpha-galactosidase 2 [Arabidopsis thaliana]
gi|19699069|gb|AAL90902.1| AT5g08370/F8L15_100 [Arabidopsis thaliana]
gi|21553660|gb|AAM62753.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
gi|23308433|gb|AAN18186.1| At5g08370/F8L15_100 [Arabidopsis thaliana]
gi|332003907|gb|AED91290.1| alpha-galactosidase 2 [Arabidopsis thaliana]
Length = 396
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 225/397 (56%), Gaps = 81/397 (20%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------D 51
++G R L++NGL L+P MGWNSWNHF C INE +I++T D
Sbjct: 23 ADGFQSRMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINID 82
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI 111
DCWGE RDSQ G+LVA +TFPSG KAL+DYVH
Sbjct: 83 DCWGELKRDSQ--------------------------GSLVAKASTFPSGIKALSDYVHS 116
Query: 112 ------------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRA 152
+ SL + + F GIDYLKYDNC N T P RY
Sbjct: 117 KGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPRERYPK 176
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
MS AL +GR IFFS+CEWG PA W +I NSWRTT DI D+W M IAD N+ +A
Sbjct: 177 MSKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWKSMTLIADQNDRWAS 236
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+A+PG WNDPDMLEVGNGGM EY+ HFSIWA++KAPLL+GCD+ ++ T ++ N+E
Sbjct: 237 YARPGSWNDPDMLEVGNGGMTKEEYMSHFSIWALAKAPLLIGCDLRSMDKVTFELLSNKE 296
Query: 273 VIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN 321
VIAVNQ D ++WAGPLS R+ V+L +R + S+ +TA W +IG+ S+
Sbjct: 297 VIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKKRVAVILWNRGSASANITARWAEIGLNSS 356
Query: 322 NT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
+ L+EH T + +L+A V+ H+CKMY LT
Sbjct: 357 DIVNARDLWEHSTYSC-VKKQLSALVEPHACKMYTLT 392
>gi|302753418|ref|XP_002960133.1| hypothetical protein SELMODRAFT_163627 [Selaginella moellendorffii]
gi|300171072|gb|EFJ37672.1| hypothetical protein SELMODRAFT_163627 [Selaginella moellendorffii]
Length = 404
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 170/396 (42%), Positives = 214/396 (54%), Gaps = 86/396 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R + NGLG TPPMGWNSWNHF C INE++IR+T DDCW +
Sbjct: 36 RHKKVGNGLGGTPPMGWNSWNHFACGINEQLIRQTADAMVSSGLSRLGYRYINLDDCWAD 95
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
SSRD QGNL A + FPSG KALADYVH S
Sbjct: 96 SSRD--------------------------WQGNLRAKSSAFPSGIKALADYVH---SRG 126
Query: 117 LDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ ++ G+DYLKYDNC N+ P RY M
Sbjct: 127 LKLGIYSDAGYQTCSKQQPGSLGREYQDAATFASWGVDYLKYDNCGNNGVSPRTRYHVMH 186
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
DAL K GRPIF+S+CEWG PA W SN+ NSWRTT DI+D+W M++ AD+N +A +A
Sbjct: 187 DALVKTGRPIFYSLCEWGQDNPATWASNVGNSWRTTGDIADNWNSMVSRADLNNQWASYA 246
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDMLEVGNGGM EY HFS+WAI KAPLL+GCD+ ++ + + I+ N EV+
Sbjct: 247 GPGHWNDPDMLEVGNGGMSVEEYRTHFSLWAIMKAPLLIGCDIRSIRKDYLGILSNWEVV 306
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-- 321
VNQ ++WAGPLSG R+ VL+ +R + A+W DIGIK N
Sbjct: 307 GVNQDSLGIQGRKVRTQGSTEVWAGPLSGGRVAVLMVNRGYMGWSINANWQDIGIKQNLR 366
Query: 322 ---NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++H+ L + + A V H C+M+ L P
Sbjct: 367 VTARDLWQHQNLRYTYTNGIRAYVPSHGCRMFVLKP 402
>gi|302804510|ref|XP_002984007.1| hypothetical protein SELMODRAFT_180602 [Selaginella moellendorffii]
gi|300148359|gb|EFJ15019.1| hypothetical protein SELMODRAFT_180602 [Selaginella moellendorffii]
Length = 404
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 170/396 (42%), Positives = 214/396 (54%), Gaps = 86/396 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R + NGLG TPPMGWNSWNHF C INE++IR+T DDCW +
Sbjct: 36 RHKKVGNGLGGTPPMGWNSWNHFACGINEQLIRQTADAMVSSGLSRLGYRYINLDDCWAD 95
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNS 116
SSRD QGNL A + FPSG KALADYVH S
Sbjct: 96 SSRD--------------------------WQGNLRAKSSAFPSGIKALADYVH---SRG 126
Query: 117 LDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTIRYRAMS 154
L I S+ ++ G+DYLKYDNC N+ P RY M
Sbjct: 127 LKLGIYSDAGYQTCSKQQPGSLGREYQDAATFASWGVDYLKYDNCGNNGVSPRTRYHVMH 186
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
DAL K GRPIF+S+CEWG PA W SN+ NSWRTT DI+D+W M++ AD+N +A +A
Sbjct: 187 DALVKTGRPIFYSLCEWGQDNPATWASNVGNSWRTTGDIADNWNSMVSRADLNNQWASYA 246
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDMLEVGNGGM EY HFS+WAI KAPLL+GCD+ ++ + + I+ N EV+
Sbjct: 247 GPGHWNDPDMLEVGNGGMSVEEYRTHFSLWAIMKAPLLIGCDIRSIRQDYLGILSNWEVV 306
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-- 321
VNQ ++WAGPLSG R+ VL+ +R + A+W DIGIK N
Sbjct: 307 GVNQDSLGIQGRKVRTQGSTEVWAGPLSGGRVAVLMVNRGYMGWSINANWQDIGIKQNLR 366
Query: 322 ---NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++H+ L + + A V H C+M+ L P
Sbjct: 367 VTARDLWQHQNLRYTYTNGIRAYVPSHGCRMFVLKP 402
>gi|297742706|emb|CBI35340.3| unnamed protein product [Vitis vinifera]
Length = 406
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 168/378 (44%), Positives = 227/378 (60%), Gaps = 42/378 (11%)
Query: 12 NGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHM 71
NG+ +R LL NG G TP MGWNS+NHF C+++E++I +T D + LS +
Sbjct: 35 NGAMKRKLLDNGPGQTPQMGWNSYNHFPCKLSEELIHQTADAMVSTG------LSALGYK 88
Query: 72 FDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH--------------ILISNSL 117
+ +L L +GNLV + FPSG KALADYVH + S ++
Sbjct: 89 YINLVDDCWAELNRDSKGNLVPKASIFPSGMKALADYVHSKGLKIGIYADAGTLTCSKTM 148
Query: 118 DNRI-----ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+ ++N GIDYLKYDNC N+ P RY MS AL GRPIF+S+CEWG
Sbjct: 149 PGSLNYEEQDANTFASWGIDYLKYDNCHNNGLSPQERYSNMSKALLNTGRPIFYSLCEWG 208
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
PA W S+I NSWRTT DI D+W M + AD+N+V+A +A PGGWNDPDMLEVGNGGM
Sbjct: 209 QDNPATWASSIGNSWRTTGDIKDTWESMTSHADLNDVWASYAGPGGWNDPDMLEVGNGGM 268
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
EY HFSIWA++KAPLL+GCD+ ++ ET ++ N+EVI VNQ
Sbjct: 269 STEEYRSHFSIWALAKAPLLIGCDIRSMDNETFELLSNKEVIEVNQDKLGAQGKKVKKTG 328
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQG 336
D ++W+GPLS NR+ V+L +R++ + + A W DIG+ S+ L+ H T+ + +
Sbjct: 329 DLEVWSGPLSDNRVAVVLWNRESSEATIIADWSDIGLNSSAVVDARDLWTHSTIYS-IRH 387
Query: 337 KLTAKVDGHSCKMYELTP 354
+L A V+ H+CKMY LTP
Sbjct: 388 QLKATVEAHACKMYALTP 405
>gi|242039251|ref|XP_002467020.1| hypothetical protein SORBIDRAFT_01g018400 [Sorghum bicolor]
gi|241920874|gb|EER94018.1| hypothetical protein SORBIDRAFT_01g018400 [Sorghum bicolor]
Length = 426
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 177/394 (44%), Positives = 221/394 (56%), Gaps = 84/394 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RR NGLG TP MGWNSWNHF C INE +I++T DDCW E
Sbjct: 61 RRRTFENGLGRTPQMGWNSWNHFGCGINENLIKQTADALVNTGLAKLGYEYVNIDDCWAE 120
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---- 112
S RD QGN VA++ FPSG KALADYVH
Sbjct: 121 SDRD--------------------------YQGNFVAHRQNFPSGIKALADYVHAKGLKL 154
Query: 113 --------------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDA 156
+ SLD+ + F GIDYLKYDNC ND R + RY MS+A
Sbjct: 155 GIYSDAGTRTCSQKMPGSLDHEEQDVKTFSSWGIDYLKYDNC-NDAGRSVMERYTKMSNA 213
Query: 157 LKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKP 216
+K G+ IFFS+CEWG PA W ++ NSWRTTDDI+D+W M + AD N+ +A +A P
Sbjct: 214 MKTYGKNIFFSLCEWGRENPATWAGSMGNSWRTTDDIADNWGSMTSRADQNDRWASYAGP 273
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV +++ +T I+ N EVIAV
Sbjct: 274 GGWNDPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMSQQTKDILSNSEVIAV 333
Query: 277 N-----------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-- 323
N Q+ N ++WAGPLS NR V+L +R+ + + +TA W IG+ S+
Sbjct: 334 NQDSLGVQGKKVQSNNGLEVWAGPLSNNRKAVVLWNRQGYQATITAQWSSIGLASSTAVT 393
Query: 324 ---LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H + QG+L+A V H CKMY LTP
Sbjct: 394 ARDLWAHSSFSA--QGQLSASVAPHDCKMYVLTP 425
>gi|79327298|ref|NP_001031855.1| alpha-galactosidase 2 [Arabidopsis thaliana]
gi|222423840|dbj|BAH19885.1| AT5G08370 [Arabidopsis thaliana]
gi|332003908|gb|AED91291.1| alpha-galactosidase 2 [Arabidopsis thaliana]
Length = 370
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 174/391 (44%), Positives = 222/391 (56%), Gaps = 81/391 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R L++NGL L+P MGWNSWNHF C INE +I++T DDCWGE
Sbjct: 3 RMLMNNGLALSPQMGWNSWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGEL 62
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RDSQ G+LVA +TFPSG KAL+DYVH
Sbjct: 63 KRDSQ--------------------------GSLVAKASTFPSGIKALSDYVHSKGLKLG 96
Query: 112 ------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
+ SL + + F GIDYLKYDNC N T P RY MS AL
Sbjct: 97 IYSDAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPRERYPKMSKALL 156
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGG 218
+GR IFFS+CEWG PA W +I NSWRTT DI D+W M IAD N+ +A +A+PG
Sbjct: 157 NSGRSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWKSMTLIADQNDRWASYARPGS 216
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDMLEVGNGGM EY+ HFSIWA++KAPLL+GCD+ ++ T ++ N+EVIAVNQ
Sbjct: 217 WNDPDMLEVGNGGMTKEEYMSHFSIWALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQ 276
Query: 279 -----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---- 323
D ++WAGPLS R+ V+L +R + S+ +TA W +IG+ S++
Sbjct: 277 DKLGIQGKKVKKEGDLEVWAGPLSKKRVAVILWNRGSASANITARWAEIGLNSSDIVNAR 336
Query: 324 -LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+EH T + +L+A V+ H+CKMY LT
Sbjct: 337 DLWEHSTYSC-VKKQLSALVEPHACKMYTLT 366
>gi|255640568|gb|ACU20569.1| unknown [Glycine max]
Length = 312
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/295 (54%), Positives = 190/295 (64%), Gaps = 64/295 (21%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
RRNLL+NGLG TPPMGWNSWNHF CQINEK+IRET DDCW E
Sbjct: 42 RRNLLANGLGRTPPMGWNSWNHFSCQINEKMIRETADALVSTGLSKLGYTYVNIDDCWAE 101
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ 110
+RD++ GNLVA +TFPSG KALADYVH
Sbjct: 102 LNRDAK--------------------------GNLVAKNSTFPSGIKALADYVHGKGLKL 135
Query: 111 ------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ SL + + F GIDYLKYDNC ND ++PT RY M+ AL
Sbjct: 136 GIYSDAGYFTCSKQMPGSLGHEFQDAKTFASWGIDYLKYDNCNNDGSKPTDRYPIMTRAL 195
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
AGRPIFFS+CEWGD+ PALWG+ + NSWRTT+DI+DSW M++ ADMNEVYA++A+PG
Sbjct: 196 MMAGRPIFFSLCEWGDLHPALWGAKVGNSWRTTNDINDSWESMVSRADMNEVYAEYARPG 255
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
GWNDPDMLEVGNGGM NEY+VHFS+WA+SKAPLLLGCDV N+T ETM I+ ++
Sbjct: 256 GWNDPDMLEVGNGGMTKNEYIVHFSLWALSKAPLLLGCDVRNMTKETMEIVAKKK 310
>gi|224144974|ref|XP_002325481.1| predicted protein [Populus trichocarpa]
gi|222862356|gb|EEE99862.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 170/391 (43%), Positives = 221/391 (56%), Gaps = 82/391 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NGL TP MGWNSWN F C INE +I+ET DDCW + RD
Sbjct: 16 LNNGLARTPQMGWNSWNFFACNINETVIKETADALISTGLAELGYVYVNIDDCWSSTKRD 75
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ G L+ + TFPSG KALADYVH
Sbjct: 76 SK--------------------------GQLIPDPKTFPSGIKALADYVHEKGLKLGIYS 109
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + L + ++ + G+DYLKYDNCFN P RY M DAL GR
Sbjct: 110 DAGAFTCQVRPGSLLHEKDDAELFASWGVDYLKYDNCFNLGINPKERYPPMRDALNSTGR 169
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+F+S+CEWG PALW + NSWRTTDDI+DSWA M T AD+N+ +A +A PGGWNDP
Sbjct: 170 TVFYSLCEWGVDDPALWAGKVGNSWRTTDDINDSWASMTTTADLNDKWASYAGPGGWNDP 229
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---A 279
DMLEVGNGGM Y+EY HFSIWA+ KAPLL+GCDV N+TAET+ I+ N+E+IAVNQ
Sbjct: 230 DMLEVGNGGMTYHEYRAHFSIWALMKAPLLIGCDVRNMTAETIEILTNKEIIAVNQDPLG 289
Query: 280 INDQ-----------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
I + Q+WAGPLSG+RIVV L +R + ++ +TA W +G++S+ +
Sbjct: 290 IQGRKVYSTGTDGCLQVWAGPLSGHRIVVALWNRCSKAATITAGWGALGLESSTSVSVRD 349
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++ + + A+VD H C ++ TP
Sbjct: 350 LWQGKDIVGDAVASFGARVDAHDCLIFIFTP 380
>gi|225463811|ref|XP_002271010.1| PREDICTED: alpha-galactosidase-like [Vitis vinifera]
Length = 368
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 169/393 (43%), Positives = 224/393 (56%), Gaps = 81/393 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
+R LL NG G TP MGWNS+NHF C+++E++I +T DDCW E
Sbjct: 2 KRKLLDNGPGQTPQMGWNSYNHFPCKLSEELIHQTADAMVSTGLSALGYKYINLDDCWAE 61
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ 110
+RDS+ GNLV + FPSG KALADYVH
Sbjct: 62 LNRDSK--------------------------GNLVPKASIFPSGMKALADYVHSKGLKI 95
Query: 111 --------ILISNSLDNRI-----ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
+ S ++ + ++N GIDYLKYDNC N+ P RY MS AL
Sbjct: 96 GIYADAGTLTCSKTMPGSLNYEEQDANTFASWGIDYLKYDNCHNNGLSPQERYSNMSKAL 155
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIF+S+CEWG PA W S+I NSWRTT DI D+W M + AD+N+V+A +A PG
Sbjct: 156 LNTGRPIFYSLCEWGQDNPATWASSIGNSWRTTGDIKDTWESMTSHADLNDVWASYAGPG 215
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFSIWA++KAPLL+GCD+ ++ ET ++ N+EVI VN
Sbjct: 216 GWNDPDMLEVGNGGMSTEEYRSHFSIWALAKAPLLIGCDIRSMDNETFELLSNKEVIEVN 275
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--- 323
Q D ++W+GPLS NR+ V+L +R++ + + A W DIG+ S+
Sbjct: 276 QDKLGAQGKKVKKTGDLEVWSGPLSDNRVAVVLWNRESSEATIIADWSDIGLNSSAVVDA 335
Query: 324 --LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L+ H T+ + + +L A V+ H+CKMY LTP
Sbjct: 336 RDLWTHSTIYS-IRHQLKATVEAHACKMYALTP 367
>gi|222613065|gb|EEE51197.1| hypothetical protein OsJ_32006 [Oryza sativa Japonica Group]
Length = 500
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 172/387 (44%), Positives = 221/387 (57%), Gaps = 84/387 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF+C INE+IIRET DDCW E SRD
Sbjct: 56 FENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRD 115
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
SQ GN V N+ TFPSG KALADYVH
Sbjct: 116 SQ--------------------------GNFVPNRQTFPSGIKALADYVHAKGLKLGIYS 149
Query: 112 -LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
S + N++ ++ E+ G+DYLKYDNC ND R + RY MS+A+K
Sbjct: 150 DAGSQTCSNKMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERYTRMSNAMKTY 208
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTT DI+D+W M + AD N+ +A +A PGGWN
Sbjct: 209 GKNIFFSLCEWGKENPATWAGRMGNSWRTTGDIADNWGSMTSRADENDQWAAYAGPGGWN 268
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV +++ +T I+ N EVIAVN
Sbjct: 269 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDS 328
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTL 324
Q+ N ++WAGPLS NR V+L +R+++ + +TAHW +IG+ + L
Sbjct: 329 LGVQGKKVQSDNGLEVWAGPLSNNRKAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDL 388
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYE 351
+ H + QG+++A V H CKM +
Sbjct: 389 WAHSSFAA--QGQISASVAPHDCKMIQ 413
>gi|218184804|gb|EEC67231.1| hypothetical protein OsI_34152 [Oryza sativa Indica Group]
Length = 500
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 172/387 (44%), Positives = 221/387 (57%), Gaps = 84/387 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF+C INE+IIRET DDCW E SRD
Sbjct: 56 FENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRD 115
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
SQ GN V N+ TFPSG KALADYVH
Sbjct: 116 SQ--------------------------GNFVPNRQTFPSGIKALADYVHAKGLKLGIYS 149
Query: 112 -LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
S + N++ ++ E+ G+DYLKYDNC ND R + RY MS+A+K
Sbjct: 150 DAGSQTCSNKMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERYTRMSNAMKTY 208
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
G+ IFFS+CEWG PA W + NSWRTT DI+D+W M + AD N+ +A +A PGGWN
Sbjct: 209 GKNIFFSLCEWGKENPATWAGRMGNSWRTTGDIADNWGSMTSRADENDQWAAYAGPGGWN 268
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN--- 277
DPDMLEVGNGGM EY HFSIWA++KAPLL+GCDV +++ +T I+ N EVIAVN
Sbjct: 269 DPDMLEVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDS 328
Query: 278 --------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTL 324
Q+ N ++WAGPLS NR V+L +R+++ + +TAHW +IG+ + L
Sbjct: 329 LGVQGKKVQSDNGLEVWAGPLSNNRKAVVLWNRQSYQATITAHWSNIGLAGSVAVTARDL 388
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYE 351
+ H + QG+++A V H CKM +
Sbjct: 389 WAHSSFAA--QGQISASVAPHDCKMIQ 413
>gi|356516253|ref|XP_003526810.1| PREDICTED: alpha-galactosidase-like isoform 1 [Glycine max]
Length = 418
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 167/367 (45%), Positives = 217/367 (59%), Gaps = 44/367 (11%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
LSNGLG TP MGWNSWN F C INE +I+ET D + L + D
Sbjct: 59 LSNGLGKTPQMGWNSWNFFACNINEMVIKETADALVSTGL---ADLGYVYVNIDDCWSSV 115
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------ILISNSLDNRI 121
+ L +G LV + TFPSG KALADYVH + +
Sbjct: 116 TRNL----KGQLVPDHKTFPSGIKALADYVHGKGLKLGIYSDAGVFTCQVRPGSIFHETD 171
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++++ G+DYLKYDNC+N P RY M DAL G+ IF+S+CEWG PALW
Sbjct: 172 DADLFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNATGQKIFYSLCEWGVEDPALWAD 231
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTT DI+DSWA M TIAD+N+ +A +A PGGWNDPDMLEVGNGGM Y EY HF
Sbjct: 232 KVGNSWRTTGDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHF 291
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA++KAPLL+GCDV NLTAET+ I+ N+EVIA+NQ A +Q+WA
Sbjct: 292 SIWALAKAPLLIGCDVRNLTAETLEILSNKEVIAINQDSLGVQGRKVQVSGADGCRQVWA 351
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKLTAKV 342
GPLSGNR+ V L +R + + +TA W+ +G++S L++H+ + +A+V
Sbjct: 352 GPLSGNRLAVALWNRCSKVATITASWEALGLESGVHVSVRDLWQHKVVTGDAVSSFSARV 411
Query: 343 DGHSCKM 349
D H C++
Sbjct: 412 DIHDCQL 418
>gi|449530772|ref|XP_004172366.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
Length = 414
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 170/377 (45%), Positives = 217/377 (57%), Gaps = 82/377 (21%)
Query: 32 WNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFHMF 72
WNSWNHF C +NE +I+ET DDCW E RDS+
Sbjct: 65 WNSWNHFQCNLNENLIKETADAMVSTGLAALGYQYINLDDCWAELDRDSK---------- 114
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN----SLDNRIE 122
GNLVA +TFPSG KALADYVH + S+ + R+
Sbjct: 115 ----------------GNLVAKASTFPSGIKALADYVHRKGLKLGIYSDAGIRTCSKRMP 158
Query: 123 SNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
++ E+ GIDYLKYDNC N T P RY M+ AL+++GRPI FS+CEWG
Sbjct: 159 GSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPKERYPKMTKALQQSGRPILFSLCEWGQ 218
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
PA W N+ NSWRTT DI D+W M TIAD N+ +A +AKPGGWNDPDMLEVGNGGM
Sbjct: 219 EDPATWAVNVGNSWRTTSDIQDNWISMTTIADQNDKWASYAKPGGWNDPDMLEVGNGGMT 278
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
EY HFSIWA++KAPLL+GCD+ ++ TM ++ N+EVIAVNQ D
Sbjct: 279 IAEYRSHFSIWALAKAPLLIGCDIRSMDNNTMKLLSNKEVIAVNQDKLGVQGKKVHKYGD 338
Query: 283 QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGK 337
++WAG LSG R+ V+L +R + + +TA+W DIG+ S+ T L+EH + +
Sbjct: 339 LEVWAGLLSGKRVAVVLWNRSLWRANITAYWSDIGLSSSTTVTARDLWEHSSQVVRHH-- 396
Query: 338 LTAKVDGHSCKMYELTP 354
LTA+VD H CKM+ LTP
Sbjct: 397 LTAQVDSHDCKMFVLTP 413
>gi|225448651|ref|XP_002279730.1| PREDICTED: alpha-galactosidase [Vitis vinifera]
gi|297736508|emb|CBI25379.3| unnamed protein product [Vitis vinifera]
Length = 427
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 172/393 (43%), Positives = 219/393 (55%), Gaps = 82/393 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NGL TP MGWNSWN F C INE +I+ET DDCW RD
Sbjct: 55 LNNGLARTPQMGWNSWNFFACNINETVIKETADALVSTGLADLGYVYVNIDDCWSSLERD 114
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ G LV + TFPSG KALADYVH
Sbjct: 115 SK--------------------------GQLVPDPKTFPSGIKALADYVHAKGLKLGIYS 148
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + R ++ + G+DYLKYDNC+N +P RY M +AL GR
Sbjct: 149 DAGIFTCQVRPGSIYHERDDAELFASWGVDYLKYDNCYNLGIKPEERYPPMRNALNATGR 208
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+CEWG PALW + NSWRTTDDI+DSWA M TIAD+N+ +A +A PGGWNDP
Sbjct: 209 TIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDSWASMTTIADLNDEWAAYAGPGGWNDP 268
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AIN 281
DMLEVGNGGM EY HFSIWA+ KAPLL+GCDV N+TAET IIGN+EVI +NQ ++
Sbjct: 269 DMLEVGNGGMTLEEYRAHFSIWALMKAPLLVGCDVRNITAETFEIIGNKEVIDINQDSLG 328
Query: 282 DQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
Q Q+WAGPLSG+R+VV L +R + ++ +T W+ +G++S+ +
Sbjct: 329 IQGRKVHVSGKDGCRQVWAGPLSGHRLVVALWNRCSKAATITVGWEVLGLESSMSVSIRD 388
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H L + V H C MY TPV+
Sbjct: 389 LWKHVDLSGDAVASFGSLVASHDCGMYIFTPVS 421
>gi|353441550|gb|AEQ94270.1| alpha galacdosidase 3 [Cucumis sativus]
Length = 430
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 170/395 (43%), Positives = 212/395 (53%), Gaps = 82/395 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL TP MGWNSWN F C INE +I+ET DDCW RD
Sbjct: 58 LQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRD 117
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ LV + FPSG K LADYVH
Sbjct: 118 SK--------------------------DQLVPDPKGFPSGIKPLADYVHSKDLKLGIYS 151
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + ++ + G+DYLKYDNCFN +P RY M DAL GR
Sbjct: 152 DAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGR 211
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+CEWG PALW + NSWRTTDDI+D+WA M T+AD+N +A +A PGGWNDP
Sbjct: 212 SIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDP 271
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AIN 281
DMLEVGNGGM Y EY HFSIWA+ K+PLL+GCDV N+T ET I+ N+EVIAVNQ +
Sbjct: 272 DMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLG 331
Query: 282 DQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
Q QIWAGPLSG+R+ V+L +R + +S +T W+ +G+K N +
Sbjct: 332 VQGRKVKVFGKDGCLQIWAGPLSGSRLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRD 391
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
L+ H + A+VD H CKM+ TPV
Sbjct: 392 LWLHEDVAGDAMSSFGAEVDPHDCKMFVFTPVATF 426
>gi|449457411|ref|XP_004146442.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
gi|449491655|ref|XP_004158965.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
Length = 430
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 169/395 (42%), Positives = 212/395 (53%), Gaps = 82/395 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL TP MGWNSWN F C INE +I+ET DDCW RD
Sbjct: 58 LQNGLARTPQMGWNSWNFFACDINETLIKETADALVSTGLAELGYVYVNIDDCWNTQKRD 117
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ LV + FPSG K LADYVH
Sbjct: 118 SK--------------------------DQLVPDPKGFPSGIKPLADYVHSKDLKLGIYS 151
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + ++ + G+DYLKYDNCFN +P RY M DAL GR
Sbjct: 152 DAGLFTCQVRAGSLYHENDDAQLFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGR 211
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+CEWG PALW + NSWRTTDDI+D+WA M T+AD+N +A +A PGGWNDP
Sbjct: 212 SIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDTWASMTTLADLNNKWAAYAGPGGWNDP 271
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AIN 281
DMLEVGNGGM Y EY HFSIWA+ K+PLL+GCDV N+T ET I+ N+EVIAVNQ +
Sbjct: 272 DMLEVGNGGMTYQEYRAHFSIWALMKSPLLIGCDVRNMTKETSEILMNKEVIAVNQDPLG 331
Query: 282 DQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
Q Q+WAGPLSG+R+ V+L +R + +S +T W+ +G+K N +
Sbjct: 332 VQGRKVKVFGKDGCLQVWAGPLSGSRLAVVLWNRCSVASTITTDWNALGLKPNTSVSVRD 391
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
L+ H + A+VD H CKM+ TPV
Sbjct: 392 LWLHEDVAGDAMSSFGAEVDPHDCKMFVFTPVATF 426
>gi|255555949|ref|XP_002519009.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223541672|gb|EEF43220.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 403
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 169/391 (43%), Positives = 220/391 (56%), Gaps = 80/391 (20%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL+NG+G TPPMGWNSWNHF C I+E ++ T DDCW E +R
Sbjct: 39 LLANGVGRTPPMGWNSWNHFQCNIDEWTVKTTADALISTGLAALGYKYVNIDDCWAEENR 98
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
DS+ GNL A +TFPSG KALADYVH
Sbjct: 99 DSK--------------------------GNLRAKSSTFPSGIKALADYVHSRDLKLGIY 132
Query: 113 -----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKA 160
+ SL + + F Q G+DYLKYDNC++D ++P IRY MS AL KA
Sbjct: 133 SDAGYTTCSKKMPGSLVHEEQDARTFAQWGVDYLKYDNCYHDGSKPQIRYARMSYALHKA 192
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
GRPIF+S+CEWG PA W N+WRTT DI D+W + ++AD N ++ +A PG WN
Sbjct: 193 GRPIFYSLCEWGQEDPAKWAGRYGNAWRTTGDIKDNWESVTSLADENNIWGRYAGPGRWN 252
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDMLEVGNGGM EY HFSIWA+ KAPLL+GCDV + + +T+ I+GN+EVI VNQ
Sbjct: 253 DPDMLEVGNGGMSLEEYRSHFSIWALMKAPLLIGCDVRSASKQTLRILGNKEVIDVNQDP 312
Query: 279 -AINDQ--------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK-----SNNTL 324
I + +IWAGPLS RIV+LL +R + ++ W ++G+ + L
Sbjct: 313 LGIQGRKIRSGAGLEIWAGPLSRKRIVILLWNRSGSKAPISVGWREVGLSPITPVNIRDL 372
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+ H + + + LTA V H+CKMY LTP+
Sbjct: 373 WAHSFISMRKRFGLTAYVAPHACKMYILTPL 403
>gi|10312171|gb|AAG16693.1| alpha-galactosidase [Solanum lycopersicum]
Length = 380
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 168/377 (44%), Positives = 217/377 (57%), Gaps = 76/377 (20%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET------------ 50
++ +S N +RRNLL NGLG TP MGW+SWNHF C I+E II+ T
Sbjct: 29 LIVNSNLSVNEFNRRNLLGNGLGQTPQMGWSSWNHFGCNIDENIIKGTADAMVHTGLASL 88
Query: 51 -------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTK 103
DDCW E++RDSQ GN+VA +TFPSG K
Sbjct: 89 GYEYINIDDCWAEANRDSQ--------------------------GNMVAKGSTFPSGIK 122
Query: 104 ALADYVH------------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT 144
ALADYVH + SL + + F G+DYLKYDNC N+D
Sbjct: 123 ALADYVHGKGLKLGVYSDAGTQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCNNEDR 182
Query: 145 RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
P RY MS+AL+ +GR IF+SMCEWGD PA W S++ NSWRTT DI+D W M + A
Sbjct: 183 SPRERYPIMSNALQNSGRAIFYSMCEWGDDNPATWASSVGNSWRTTGDITDDWNSMTSRA 242
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
D+N+ +A +A PGGWNDPDMLEVGNGGM + EY HFSIWA+ KAPL++GCD+ ++
Sbjct: 243 DLNDQWASYAGPGGWNDPDMLEVGNGGMSFGEYRSHFSIWALVKAPLIIGCDLRSMDNTA 302
Query: 265 MPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
I+ N EVIAVNQ D ++WAGPLSG R+ V+L +R ++ + +TA+W
Sbjct: 303 HDILSNPEVIAVNQDKLGVQGKKVKQYGDLEVWAGPLSGKRVAVVLWNRGSYKADITAYW 362
Query: 314 DDIGIKSNNTLFEHRTL 330
DIG+ +TL + R L
Sbjct: 363 SDIGLDY-STLVDARDL 378
>gi|242046940|ref|XP_002461216.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
gi|241924593|gb|EER97737.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
Length = 431
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 171/379 (45%), Positives = 216/379 (56%), Gaps = 44/379 (11%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL LTP MGWNSWN F C INE +IRET D + +L +F D
Sbjct: 57 RLQLNNGLALTPQMGWNSWNFFACNINETVIRETADALVSTGL---AALGYNFVNIDDCW 113
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ +Q L+ + TFPSG KALADYVH + SLD+
Sbjct: 114 SYVKRGKQDQ----LLPDPKTFPSGIKALADYVHGKGLKLGIYSDAGKFTCQVRPGSLDH 169
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ F G+DYLKYDNC+N +P RY M DAL GR IF+S+CEWG PAL
Sbjct: 170 ESDDAAIFASWGVDYLKYDNCYNLGIKPQKRYPPMRDALNSTGRQIFYSLCEWGQDDPAL 229
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W + NSWRTTDDI+D+W M IAD N +A +A PGGWNDPDMLEVGNGGM EY
Sbjct: 230 WAGKVGNSWRTTDDITDTWQSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTSAEYR 289
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQ 284
HFSIWA+ KAPLL+GCDV N+T+ETM I+ N+EVI VNQ ++
Sbjct: 290 SHFSIWALMKAPLLIGCDVRNMTSETMEILSNKEVIQVNQDPLGVQGRKILGEGKYGCRE 349
Query: 285 IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK-----SNNTLFEHRTLETKFQGKLT 339
+WAGPLSGNR+V+ L +R + ++ +T +G+ S L++H TL G
Sbjct: 350 VWAGPLSGNRLVIALWNRCSETANITMKLPAVGLDGSAAYSVRDLWKHETLSENVVGTFG 409
Query: 340 AKVDGHSCKMYELTPVTKL 358
A+VD H KMY +P T +
Sbjct: 410 AQVDVHDTKMYIFSPATSV 428
>gi|42572703|ref|NP_974447.1| alpha-galactosidase [Arabidopsis thaliana]
gi|332645986|gb|AEE79507.1| alpha-galactosidase [Arabidopsis thaliana]
Length = 413
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 169/360 (46%), Positives = 216/360 (60%), Gaps = 34/360 (9%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL TP MGWNSWN F C INE +I+ET D + S ++ H+
Sbjct: 62 RLQLNNGLARTPQMGWNSWNFFACNINETVIKETAD----ALVSSGLADLGYIHV----- 112
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIE-SNIAFEQGIDYLK 135
I K L + + K S VH SL + ++ ++I G+DYLK
Sbjct: 113 NIGIKLLADYVHSKGL--KLGIYSDAGVFTCEVH---PGSLFHEVDDADIFASWGVDYLK 167
Query: 136 YDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD 195
YDNCFN +P RY M DAL GR IF+S+CEWG PALW + NSWRTTDDI+D
Sbjct: 168 YDNCFNLGIKPIERYPPMRDALNATGRSIFYSLCEWGVDDPALWAKEVGNSWRTTDDIND 227
Query: 196 SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
+WA M TIAD+N +A +A PGGWNDPDMLE+GNGGM Y EY HFSIWA+ KAPLL+GC
Sbjct: 228 TWASMTTIADLNNKWAAYAGPGGWNDPDMLEIGNGGMTYEEYRGHFSIWALMKAPLLIGC 287
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ-------------AIND-QQIWAGPLSGNRIVVLLES 301
DV N+TAET+ I+ N+E+IAVNQ ND QQ+W+GPLSG+R+VV L +
Sbjct: 288 DVRNMTAETLEILSNKEIIAVNQDPLGVQGRKIQANGENDCQQVWSGPLSGDRMVVALWN 347
Query: 302 RKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
R + + +TA WD IG++S L++H+ + G A+VD H C MY LTP T
Sbjct: 348 RCSEPATITASWDMIGLESTISVSVRDLWQHKDVTENTSGSFEAQVDAHDCHMYVLTPQT 407
>gi|224078914|ref|XP_002305678.1| predicted protein [Populus trichocarpa]
gi|222848642|gb|EEE86189.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 167/388 (43%), Positives = 213/388 (54%), Gaps = 78/388 (20%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL+NG+ TPPMGWNSWNHF C I+E+ I+ T DDCWGE R
Sbjct: 17 LLANGVARTPPMGWNSWNHFQCNIDERTIKTTADALVSTGLAALGYKYVNIDDCWGEEKR 76
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH--------- 110
D +G+L A +TFPSG KALADYVH
Sbjct: 77 D--------------------------WKGSLRAKASTFPSGIKALADYVHSKGLKLGIY 110
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ SL + + F E G+DYLKYDNC++D ++P RY MS AL+K GR
Sbjct: 111 SDAGTCSKKMPGSLGHEEQDAKTFAEWGVDYLKYDNCYHDGSKPQDRYARMSYALRKVGR 170
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI +S+CEWG PA W N+WRTT DI D+W +++IAD N ++ +A PG WNDP
Sbjct: 171 PILYSLCEWGQEYPAKWAGFYGNAWRTTGDIKDTWESVISIADENNIWGRYAGPGRWNDP 230
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGNGGM EY HFSIWA+ KAPLL+GCDV + + ET+ I+GN+EVI VNQ
Sbjct: 231 DMLEVGNGGMSLEEYRSHFSIWALMKAPLLIGCDVQSASHETLRILGNKEVIDVNQDPLG 290
Query: 279 -------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFE 326
A +IWAG LS R+ V+L +R + +T W +IG+ N L+
Sbjct: 291 VQGRKIRAKAGLEIWAGSLSRKRVAVVLWNRSGSRAPITVGWREIGLSPYNPVIVRDLWA 350
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTP 354
H + + LTA V H+CKMY LTP
Sbjct: 351 HSFVSMRKLHGLTAYVASHACKMYILTP 378
>gi|212274867|ref|NP_001130661.1| hypothetical protein precursor [Zea mays]
gi|194689772|gb|ACF78970.1| unknown [Zea mays]
gi|194708268|gb|ACF88218.1| unknown [Zea mays]
gi|224031287|gb|ACN34719.1| unknown [Zea mays]
gi|414888111|tpg|DAA64125.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
Length = 425
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 170/377 (45%), Positives = 213/377 (56%), Gaps = 44/377 (11%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL LTP MGWNSWN F C INE +IRET D + +L +F D
Sbjct: 51 RLQLNNGLALTPQMGWNSWNFFACNINETVIRETADALVSTGL---AALGYNFVNIDDCW 107
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ +Q L+ + TFPSG KALADYVH + SL +
Sbjct: 108 SYVKRGKQDQ----LLPDPKTFPSGIKALADYVHGKGLKLGIYSDAGKFTCQVRPGSLGH 163
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ F GIDYLKYDNC+N +P RY M DAL GR IF+S+CEWG PAL
Sbjct: 164 ENDDAAIFASWGIDYLKYDNCYNLGIKPQKRYPPMRDALNSTGRQIFYSLCEWGQYDPAL 223
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W + NSWRTTDDI+D+W M IAD N +A +A PGGWNDPDMLEVGNGGM EY
Sbjct: 224 WAGKVGNSWRTTDDITDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTLAEYR 283
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQ 284
HFSIWA+ KAPLL+GCDV N+T+ETM I+ N+EVI VNQ ++
Sbjct: 284 SHFSIWALMKAPLLIGCDVRNMTSETMEILSNKEVIQVNQDPLGVQGRKILGQGKYGCRE 343
Query: 285 IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK-----SNNTLFEHRTLETKFQGKLT 339
+WAGPLSGNR+VV L +R + ++ +T +G+ S L++H T+ G
Sbjct: 344 VWAGPLSGNRLVVALWNRCSDTANVTMKLPAVGLDGYAAYSVRDLWKHETISENVVGTFG 403
Query: 340 AKVDGHSCKMYELTPVT 356
A+VD H MY +P T
Sbjct: 404 AQVDVHDTNMYIFSPAT 420
>gi|115474081|ref|NP_001060639.1| Os07g0679300 [Oryza sativa Japonica Group]
gi|33146441|dbj|BAC79549.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
gi|50509523|dbj|BAD31216.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
gi|113612175|dbj|BAF22553.1| Os07g0679300 [Oryza sativa Japonica Group]
gi|215708737|dbj|BAG94006.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 425
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 167/376 (44%), Positives = 210/376 (55%), Gaps = 44/376 (11%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL LTP MGWNSWN F C INE +IR+T D + L ++ D
Sbjct: 54 LNNGLALTPQMGWNSWNFFACNINETVIRDTADALVSTGL---ADLGYNYVNIDDCWSNV 110
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
+ +Q L+ + TFPSG K LADYVH + SL + +
Sbjct: 111 KRGKKDQ----LLPDPKTFPSGIKDLADYVHGKGLKLGIYSDAGIFTCQVRPGSLHHEKD 166
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
F G+DYLKYDNC+N +P RY M DAL GR IF+S+CEWG PALW
Sbjct: 167 DAAIFASWGVDYLKYDNCYNLGIKPKDRYPPMRDALNSTGRQIFYSLCEWGQDDPALWAG 226
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTTDDI D+W M IAD N +A +A PGGWNDPDMLEVGNGGM + EY HF
Sbjct: 227 KVGNSWRTTDDIQDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTFAEYRAHF 286
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA+ KAPLL+GCDV N+T ETM I+ N+EVI VNQ Q++WA
Sbjct: 287 SIWALMKAPLLIGCDVRNMTKETMEILSNKEVIQVNQDPLGVQGRRILGQGKNGCQEVWA 346
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKV 342
GPLSGNR+ V+L +R S+ + +G+ ++ L++H TL G A+V
Sbjct: 347 GPLSGNRLAVVLWNRCEESANIIVKLPSVGLDGSSPYSVRDLWKHETLSENVVGTFGAQV 406
Query: 343 DGHSCKMYELTPVTKL 358
D H CKMY TP +
Sbjct: 407 DVHDCKMYIFTPAVTV 422
>gi|168006281|ref|XP_001755838.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693157|gb|EDQ79511.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 402
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 168/390 (43%), Positives = 217/390 (55%), Gaps = 71/390 (18%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
+ WNSWNHF C +NE +I ET D + L+ + + +L A L QG
Sbjct: 11 ISWNSWNHFACNVNETVIFETADALVSTG------LAAKGYKYINLDDCWAA-LQRDSQG 63
Query: 90 NLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF---------------------- 127
NL+A TTFPSG KALADYVH S L I S+ +
Sbjct: 64 NLLARATTFPSGIKALADYVH---SKGLKLGIYSDAGYYTCASQPGSLGHETQDAQTFAS 120
Query: 128 -----------------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
IDYLKYDNCF D T+P RY M DAL K GRPIFFSMCE
Sbjct: 121 WASFLSTSAILTIIHRSRMEIDYLKYDNCFTDGTKPEYRYPVMRDALNKTGRPIFFSMCE 180
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WG PA W +N+ NSWRTT DI+D W M+ I ++N V+AD+A PGGWNDPDMLEVGNG
Sbjct: 181 WGIDSPATWANNVGNSWRTTGDITDDWNSMVGIVELNNVWADYAAPGGWNDPDMLEVGNG 240
Query: 231 GMKYNEYVVHFSIWAISK----APLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------- 278
GM EY HFS+WA+ K APLL+GCDV N+ AE + I+ NEEVIAVNQ
Sbjct: 241 GMTVEEYRSHFSLWALMKARSFAPLLIGCDVRNIGAEILEILANEEVIAVNQDPLGVQGK 300
Query: 279 -----AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHR 328
+ ++W+GPLS R+V+LL +R + S+ +TA W D+GI ++ L++H
Sbjct: 301 RVRMDGFSGLEVWSGPLSSGRVVLLLWNRSSESANITARWSDMGIPNDTLLTVRDLWKHE 360
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
T+ ++ +V H KM+ LTPV+++
Sbjct: 361 TISEAEMYEICQEVPSHGVKMFILTPVSQI 390
>gi|357121485|ref|XP_003562450.1| PREDICTED: alpha-galactosidase-like [Brachypodium distachyon]
Length = 428
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 164/379 (43%), Positives = 212/379 (55%), Gaps = 44/379 (11%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL L P MGWNSWN F C INE +IRET D + L ++ D
Sbjct: 54 RLQLNNGLALVPQMGWNSWNFFACNINETLIRETADALVSTG---LADLGYNYVNIDDCW 110
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ +Q L+ + TFPSG K+LADYVH + SL +
Sbjct: 111 SYVKRGKKDQ----LLPDPKTFPSGIKSLADYVHGKGLKLGIYSDAGVFTCQVRPGSLHH 166
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ F G+DYLKYDNC+N +P RY M DAL GR IF+S+CEWG PAL
Sbjct: 167 EKDDASLFASWGVDYLKYDNCYNLGIKPKERYPPMRDALNTTGRQIFYSLCEWGQDDPAL 226
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W + NSWRTTDDI D+W M IAD N +A +A PGGWNDPDMLEVGNGGM + EY
Sbjct: 227 WAGKVGNSWRTTDDIQDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTFAEYR 286
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQ 284
HFSIWA+ KAPLL+GCDV N+T+ET+ I+ N+EVI VNQ ++
Sbjct: 287 AHFSIWALMKAPLLIGCDVRNMTSETVEILSNKEVIQVNQDRLGVQGRRILGQGKDGCRE 346
Query: 285 IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLT 339
+WAGPLSGNR+ V L +R + ++ +T +G+ ++ L++H TL
Sbjct: 347 VWAGPLSGNRLAVALWNRCSATTNITMKLPAVGLDGSSAYSVRDLWKHETLSPNVVATFG 406
Query: 340 AKVDGHSCKMYELTPVTKL 358
A+V+ H CKMY TP +
Sbjct: 407 AQVNAHDCKMYVFTPAVSV 425
>gi|302792557|ref|XP_002978044.1| hypothetical protein SELMODRAFT_108242 [Selaginella moellendorffii]
gi|300154065|gb|EFJ20701.1| hypothetical protein SELMODRAFT_108242 [Selaginella moellendorffii]
Length = 402
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 167/396 (42%), Positives = 211/396 (53%), Gaps = 89/396 (22%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL NGLG TPPMGWN +NHFH +INE I+RET DD W E SR
Sbjct: 34 LLENGLGRTPPMGWNDFNHFHNKINETIVRETVDAIVSTGLAALGYNYINIDDEWAEQSR 93
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
D + GNL K FPSG K LADYVH S +L
Sbjct: 94 DKE--------------------------GNLQPRKDRFPSGIKNLADYVH---SKNLKF 124
Query: 120 RIESN-------------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I S+ + FE G+D+LKYDNC ND P IRY M AL
Sbjct: 125 GIYSDAGRLTCAKTQPGSLNFENQDAKTFAAWGVDFLKYDNCHNDGASPKIRYPKMQKAL 184
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIF+++CEWG PALW + NSWRTT DI D+W M+ AD N+ +A +A PG
Sbjct: 185 LATGRPIFYALCEWGYEDPALWAPGVGNSWRTTGDIKDNWQSMIVRADQNDKWAKYAGPG 244
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFS+WA+ KAPL++GCDV NL E + I+ N+EVIA+N
Sbjct: 245 GWNDPDMLEVGNGGMSLEEYRAHFSLWALMKAPLIIGCDVRNLKQEYLDILMNKEVIAIN 304
Query: 278 Q--------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
Q A+ + +W GPLS + V LL +R ++ + A+W+ IG K ++
Sbjct: 305 QDSMGVQGKRVSRKGALEARLVWGGPLSQGKFVALLLNRGDSTAKIRAYWEQIGAKPGSS 364
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
++ H+ L QG + VD H+ KMY + P
Sbjct: 365 VQVRDVWAHKDLGAFSQGYIEEAVDAHAIKMYVVVP 400
>gi|147780179|emb|CAN75523.1| hypothetical protein VITISV_043596 [Vitis vinifera]
Length = 449
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 164/364 (45%), Positives = 213/364 (58%), Gaps = 41/364 (11%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
+GWNSWN F C INE +I+ET + DS + + L IA QG
Sbjct: 84 VGWNSWNFFACNINETVIKETGKFFIGLIVDSHSQMIAGLPWSEILRFIAML----FGQG 139
Query: 90 NLVANKTTFPSGTKALADYVH------------------ILISNSLDNRIESNIAFEQGI 131
LV + TFPSG KALADYVH + + R ++ + G+
Sbjct: 140 QLVPDPKTFPSGIKALADYVHAKGLKLGIYSDAGIFTCQVRPGSIYHERDDAELFASWGV 199
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKYDNC+N +P RY M +AL GR IF+S+CEWG PALW + NSWRTTD
Sbjct: 200 DYLKYDNCYNLGIKPEERYPPMRNALNATGRTIFYSLCEWGVDDPALWAGKVGNSWRTTD 259
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI+DSWA M TIAD+N+ +A +A PGGWNDPDMLEVGNGGM EY HFSIWA+ KAPL
Sbjct: 260 DINDSWASMTTIADLNDEWAAYAGPGGWNDPDMLEVGNGGMTLEEYRAHFSIWALMKAPL 319
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ-------------QIWAGPLSGNRIVV 297
L+GCDV N+TAET IIGN+EVI +NQ ++ Q Q+WAGPLSG+R+VV
Sbjct: 320 LVGCDVRNITAETFEIIGNKEVIDINQDSLGIQGRKVHVSGKDGCRQVWAGPLSGHRLVV 379
Query: 298 LLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
L +R + ++ +T W+ +G++S+ + L++H L + V H C MY
Sbjct: 380 ALWNRCSKAATITVGWEVLGLESSMSVSIRDLWKHXDLSGDAVASFGSLVASHDCGMYIF 439
Query: 353 TPVT 356
TPV+
Sbjct: 440 TPVS 443
>gi|218200260|gb|EEC82687.1| hypothetical protein OsI_27343 [Oryza sativa Indica Group]
Length = 427
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 166/376 (44%), Positives = 209/376 (55%), Gaps = 44/376 (11%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL LTP MGWNSWN F C INE +IR+T D + L ++ D
Sbjct: 56 LNNGLALTPQMGWNSWNFFACNINETVIRDTADALVSTGL---ADLGYNYVNIDDCWSNV 112
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
+ +Q L+ + TFPSG K LADYVH + SL + +
Sbjct: 113 KRGKKDQ----LLPDPKTFPSGIKDLADYVHGKGLKLGIYSDAGIFTCQVRPGSLHHEKD 168
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
F G+DYLKYDNC+N +P RY M DAL GR IF+S+CEWG PALW
Sbjct: 169 DAAIFASWGVDYLKYDNCYNLGIKPKDRYPPMRDALNSTGRQIFYSLCEWGQDDPALWAG 228
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTTDDI D+W M IAD N +A +A PGGWNDPDMLEVGNGGM + EY HF
Sbjct: 229 KVGNSWRTTDDIQDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTFAEYRAHF 288
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA+ KAPLL+GCDV N+T ETM I+ N+EVI VNQ Q++WA
Sbjct: 289 SIWALMKAPLLIGCDVRNMTKETMEILSNKEVIQVNQDPLGVQGRRILGQGKNGCQEVWA 348
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKV 342
GPLSGNR+ V+L +R S+ + +G+ ++ L++H TL G A+V
Sbjct: 349 GPLSGNRLAVVLWNRCEESANIIVKLPSVGLDGSSPYSVRDLWKHETLSENVVGTFGAQV 408
Query: 343 DGHSCKMYELTPVTKL 358
D H CKM TP +
Sbjct: 409 DVHDCKMCIFTPAVTV 424
>gi|34394895|dbj|BAC84411.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
Length = 415
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 173/409 (42%), Positives = 217/409 (53%), Gaps = 108/409 (26%)
Query: 16 RRNLLSNGLGLTPPMG------------------------------WNSWNHFHCQINEK 45
RR+LL NGLG TP MG WNSWNHF C INE
Sbjct: 42 RRSLLDNGLGRTPQMGRIASVELIPSARSDRSLACVHVRSVFSFFRWNSWNHFGCNINEN 101
Query: 46 IIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKAL 105
IR T A+ FPSG KAL
Sbjct: 102 TIRST-----------------------------------------AADPKKFPSGIKAL 120
Query: 106 ADYVH-----ILISNSLDNR-----IESNIAFEQ---------GIDYLKYDNCFNDDTRP 146
ADYVH + I +S +R + ++ +E G+DYLKYDNC +D +
Sbjct: 121 ADYVHSKGLKLGIYSSAGSRTCSKTMPGSLGYEDIDAKTFASWGVDYLKYDNCNSDGSSE 180
Query: 147 TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM 206
T+R+ MS AL+K GRPIF+S+CEWG A WG NSWRTT DI+DSWA ML+ D
Sbjct: 181 TVRFPKMSFALRKTGRPIFYSICEWGQRNVATWGGQYGNSWRTTGDINDSWASMLSNIDS 240
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
N+ A +AKPGGWNDPDMLEVGNGGM +EYVVH S+WAI+KAPL++GCDV +++ ETM
Sbjct: 241 NDASASYAKPGGWNDPDMLEVGNGGMTNDEYVVHISLWAIAKAPLIIGCDVRSISRETME 300
Query: 267 IIGNEEVIAVNQAI------------NDQQIWAGPLSGNRIVVLLESR-KTFSSMMTAHW 313
I+ N EVIA+NQ N+ ++WAGPLS R VLL +R T S +TA W
Sbjct: 301 ILSNPEVIAINQDPLGVQGKKVRKYDNEIEVWAGPLSQQRTAVLLLNRGATGSRQITAAW 360
Query: 314 DDIGIKSN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
DIG+ ++ H T +F G LTA+V HSCK++ LTPV +
Sbjct: 361 QDIGVGPGVAVEAKNVWLHATAPGRFTGSLTAEVAAHSCKLFVLTPVGR 409
>gi|302766553|ref|XP_002966697.1| hypothetical protein SELMODRAFT_86208 [Selaginella moellendorffii]
gi|300166117|gb|EFJ32724.1| hypothetical protein SELMODRAFT_86208 [Selaginella moellendorffii]
Length = 402
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 166/396 (41%), Positives = 211/396 (53%), Gaps = 89/396 (22%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL NGLG TPPMGWN +NHFH +INE I+RET DD W E SR
Sbjct: 34 LLENGLGRTPPMGWNDFNHFHNKINETIVRETADAIVSTGLAALGYNYINIDDEWAEQSR 93
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
D + GNL K FPSG K LADYVH S +L
Sbjct: 94 DKE--------------------------GNLQPRKDRFPSGIKNLADYVH---SKNLKF 124
Query: 120 RIESN-------------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
I S+ + FE G+D+LKYDNC ND P IRY M AL
Sbjct: 125 GIYSDAGRFTCAKTQPGSLNFENQDAKTFAAWGVDFLKYDNCHNDGASPKIRYPKMQKAL 184
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
GRPIF+++CEWG PALW + NSWRTT DI D+W M+ AD N+ +A +A PG
Sbjct: 185 LATGRPIFYALCEWGYEDPALWAPGVGNSWRTTGDIKDNWKSMIVRADQNDKWAKYAGPG 244
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDMLEVGNGGM EY HFS+WA+ KAPL++GCDV NL E + I+ N+EVIA+N
Sbjct: 245 GWNDPDMLEVGNGGMSLEEYRTHFSLWALMKAPLIIGCDVRNLKQEYLNILMNKEVIAIN 304
Query: 278 Q--------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
Q A+ + +W GPLS + V LL ++ ++ + A+W+ IG K ++
Sbjct: 305 QDSMGVQGKRVSRKGALEARLVWGGPLSQGKFVALLLNQGDSTAKIRAYWEQIGAKPGSS 364
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
++ H+ L QG + VD H+ KMY + P
Sbjct: 365 VQVRDVWAHKDLGVFSQGYIEEAVDAHAIKMYVVVP 400
>gi|356516255|ref|XP_003526811.1| PREDICTED: alpha-galactosidase-like isoform 2 [Glycine max]
Length = 407
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 161/374 (43%), Positives = 210/374 (56%), Gaps = 68/374 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
LSNGLG TP MGWNSWN F C INE +I+ET D +L
Sbjct: 59 LSNGLGKTPQMGWNSWNFFACNINEMVIKETAD---------------------ALVSTG 97
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------ILISNSLDNRI 121
L G + N GT ++DYVH + +
Sbjct: 98 LADL-----GYVYVNI-----GTLLVSDYVHGKGLKLGIYSDAGVFTCQVRPGSIFHETD 147
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++++ G+DYLKYDNC+N P RY M DAL G+ IF+S+CEWG PALW
Sbjct: 148 DADLFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNATGQKIFYSLCEWGVEDPALWAD 207
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTT DI+DSWA M TIAD+N+ +A +A PGGWNDPDMLEVGNGGM Y EY HF
Sbjct: 208 KVGNSWRTTGDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHF 267
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA++KAPLL+GCDV NLTAET+ I+ N+EVIA+NQ A +Q+WA
Sbjct: 268 SIWALAKAPLLIGCDVRNLTAETLEILSNKEVIAINQDSLGVQGRKVQVSGADGCRQVWA 327
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKLTAKV 342
GPLSGNR+ V L +R + + +TA W+ +G++S L++H+ + +A+V
Sbjct: 328 GPLSGNRLAVALWNRCSKVATITASWEALGLESGVHVSVRDLWQHKVVTGDAVSSFSARV 387
Query: 343 DGHSCKMYELTPVT 356
D H C++Y P T
Sbjct: 388 DIHDCQLYIFAPFT 401
>gi|326529989|dbj|BAK08274.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 388
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 167/403 (41%), Positives = 216/403 (53%), Gaps = 85/403 (21%)
Query: 5 SSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------- 50
SS++ +NG R LGLTP MGWNSWNHF C+I+E I++
Sbjct: 17 SSTSPAANGQKNR------LGLTPQMGWNSWNHFGCKIDEATIQKIADAMVDTGLAKLGY 70
Query: 51 -----DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKAL 105
DDCW +RDSQ GNLVAN +TFPSG +AL
Sbjct: 71 EYINLDDCWAAYNRDSQ--------------------------GNLVANASTFPSGMRAL 104
Query: 106 ADYVHI------------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRP 146
ADY+H L+ SL + F GIDYLKYDNC N + P
Sbjct: 105 ADYMHKKGLKLGVYGDAGLRTCSKLMPGSLGYEDQDARTFAAWGIDYLKYDNCNNQNISP 164
Query: 147 TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM 206
RY MS+AL +GR IFFS+CEWG PA W ++ NSWRTT+DI ++W M D
Sbjct: 165 LTRYNRMSEALMNSGRDIFFSLCEWGVDEPATWAGSMANSWRTTEDIKNTWESMTDNIDK 224
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
+ +A +A PGGWNDPDMLEVGNGGM EY HFSIWA+ KAPL+LGCD+ ++T ET
Sbjct: 225 SNKWAPYAGPGGWNDPDMLEVGNGGMTTEEYRSHFSIWALVKAPLVLGCDISSMTPETKE 284
Query: 267 IIGNEEVIAVN-----------QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
I+ N++VIAVN Q D ++WAG LSG+R+ ++L + + +TA W +
Sbjct: 285 ILSNQKVIAVNQDKLGVQGLKVQQDEDLEVWAGRLSGDRVALVLWNSGPTEASITASWSN 344
Query: 316 IGIKSNNTLFEH----RTLETKFQGKLTAKVDGHSCKMYELTP 354
IG+ + + H + + QG L V H+CKMY LTP
Sbjct: 345 IGLSPSAVVDAHDLWTDEVTSSVQGNLKKTVASHACKMYVLTP 387
>gi|326508514|dbj|BAJ95779.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 431
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 163/374 (43%), Positives = 208/374 (55%), Gaps = 44/374 (11%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL L P MGWNSWN F C I E +IRET D + L ++ D
Sbjct: 59 LNNGLALAPQMGWNSWNFFACNITETLIRETADALVSTG---LADLGYNYVNIDDCWSYV 115
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
+ +Q L+ + TFPSG K+LADYVH + SL + +
Sbjct: 116 KRGNKDQ----LLPDPKTFPSGIKSLADYVHGKGLKLGIYSDAGVFTCQVRPGSLHHEND 171
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
F G+DYLKYDNC+N P RY M DAL GR IF+S+CEWG PALW
Sbjct: 172 DAALFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNSTGRQIFYSLCEWGQDDPALWAG 231
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTTDDI D+W M IAD N +A +A PGGWNDPDMLEVGNGGM + EY HF
Sbjct: 232 KVGNSWRTTDDIQDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTFAEYRAHF 291
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA+ KAPLL+GCDV N+T++T+ I+ N+EVI VNQ +++WA
Sbjct: 292 SIWALMKAPLLIGCDVRNMTSQTIEILSNKEVIQVNQDPLGVQGRRILGQGKGGCREVWA 351
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKV 342
GPLSGNR+ V L +R + + +T +G+ ++ L++H TL G A+V
Sbjct: 352 GPLSGNRLAVALWNRCSETVNITMTLPAVGLDGSSAYSVRDLWKHETLSANVVGTFGAQV 411
Query: 343 DGHSCKMYELTPVT 356
D H CKMY TP
Sbjct: 412 DMHDCKMYIFTPAV 425
>gi|255559665|ref|XP_002520852.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223539983|gb|EEF41561.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 360
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 155/350 (44%), Positives = 205/350 (58%), Gaps = 63/350 (18%)
Query: 46 IIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKAL 105
I+ + DDCW ++RD++ G LV + TFPSG KAL
Sbjct: 33 ILWDADDCWSAATRDAK--------------------------GQLVPDPKTFPSGIKAL 66
Query: 106 ADYVH---ILISNSLDNRI---------------ESNIAFEQGIDYLKYDNCFNDDTRPT 147
ADY+H + + D I ++++ G+DYLKYDNCFN +P
Sbjct: 67 ADYIHGKGLKLGIYSDAGIFTCQVRPGSLHHEEDDADLFASWGVDYLKYDNCFNLGIKPK 126
Query: 148 IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
RY M DAL +GR IF+S+CEWG PALW + NSWRTTDDI+DSW M TIAD+N
Sbjct: 127 ERYPPMRDALNASGRTIFYSLCEWGVDDPALWAGKVGNSWRTTDDINDSWVSMTTIADLN 186
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
+ +A +A PGGWNDPDMLEVGNGGM Y EY HFSIWA+ KAPLL+GCDV N+TAET I
Sbjct: 187 DKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALMKAPLLIGCDVRNMTAETYEI 246
Query: 268 IGNEEVIAVNQ-AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
+ N+EVIAVNQ ++ Q Q+WAGPLSG+R+ V+L +R + ++ +TA W
Sbjct: 247 LTNKEVIAVNQDSLGVQGRKVQASGTDGCLQVWAGPLSGHRMAVVLWNRCSKAATITARW 306
Query: 314 DDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
D +G++S + L++H+ + +VD H C MY TP T L
Sbjct: 307 DALGLESGTSVAVRDLWQHKDITGDSVASFGTRVDAHDCAMYTFTPKTVL 356
>gi|356506150|ref|XP_003521850.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase-like [Glycine
max]
Length = 374
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 166/376 (44%), Positives = 211/376 (56%), Gaps = 63/376 (16%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
+ +L++NGLG TPPMG + +I I DD WGE RD + +
Sbjct: 25 QHSLVANGLGATPPMGITLVSTGLSKIG-YIYVNIDDSWGEMIRDVDVXI---------- 73
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSL 117
+ + FPSG KA+ADYVH ++ SL
Sbjct: 74 ----------------LIKQNXFPSGIKAVADYVHGKGLKLGIYSAAGYFTCANVMPGSL 117
Query: 118 DNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ F G+DYLKYD C N T+P RY M +L K GRPI+FS+ E GDM P
Sbjct: 118 GYGEQDAKTFASWGVDYLKYDICNNGGTKPIDRYPIMPRSLMKVGRPIYFSLFEXGDMHP 177
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
ALWG + NSW TDDI D+W ML+ ADMNEVYAD+A+PGGWNDP+MLE+GN M NE
Sbjct: 178 ALWGYQVGNSWIITDDIRDNWDSMLSKADMNEVYADYARPGGWNDPNMLEIGNERMNKNE 237
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------I 285
Y+VHFS+WA SKAPL++GCDV N+T +T I+ N EVIAVNQ +Q +
Sbjct: 238 YIVHFSLWATSKAPLIIGCDVRNMTEDTKEILSNTEVIAVNQDPLGKQGKKVRMESTLEV 297
Query: 286 WAGPLSGNRI-VVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLT 339
WAGPLS R+ VVLL + +TA W+DIG+ + L+EH TLE +F GKLT
Sbjct: 298 WAGPLSEYRVDVVLLNKYSDLRASITALWEDIGLHPSTVVESRDLWEHNTLERQFSGKLT 357
Query: 340 AKVDGHSCKMYELTPV 355
V+ HSCKMY L P+
Sbjct: 358 DTVEPHSCKMYVLKPI 373
>gi|255640756|gb|ACU20662.1| unknown [Glycine max]
Length = 374
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 155/323 (47%), Positives = 193/323 (59%), Gaps = 39/323 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
LSNGLG TP MGWNSWN F C INE +I+ET D + L + D
Sbjct: 59 LSNGLGKTPQMGWNSWNFFACNINEMVIKETADALVSTGL---ADLGYVYVNIDDCWSSV 115
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------ILISNSLDNRI 121
+ L +G LV + TFPSG KALADYVH + +
Sbjct: 116 TRNL----KGQLVPDHKTFPSGIKALADYVHGKGLKLGIYSDAGVFTCQVRPGSIFHETD 171
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++++ G+DYLKYDNC+N P RY M DAL G+ IF+S+CEWG PALW
Sbjct: 172 DADLFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNATGQKIFYSLCEWGVEDPALWAD 231
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTT DI+DSWA M TIAD+N+ +A +A PGGWNDPDMLEVGNGGM Y EY HF
Sbjct: 232 KVGNSWRTTGDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMTYQEYRAHF 291
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWA 287
SIWA++KAPLL+GCDV NLT ET+ I+ N+EVIA+NQ A +Q+WA
Sbjct: 292 SIWALAKAPLLIGCDVRNLTGETLEILSNKEVIAINQDSLGVQGRKVQVSGADGCRQVWA 351
Query: 288 GPLSGNRIVVLLESRKTFSSMMT 310
GPLSGNR+ V L +R + + +T
Sbjct: 352 GPLSGNRLAVALWNRCSKVATIT 374
>gi|414884441|tpg|DAA60455.1| TPA: hypothetical protein ZEAMMB73_299281 [Zea mays]
Length = 370
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 145/288 (50%), Positives = 187/288 (64%), Gaps = 26/288 (9%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFH 70
+ G RR L +NGLGLTP MGWNSWNHF C INE ++R T D I L+ + +
Sbjct: 42 AGGGGRRVLFANGLGLTPQMGWNSWNHFQCDINEAVVRSTADAL------VAIGLAKAGY 95
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------LISNSLDNR 120
+ +L A + +G +VAN TFPSG KALADYVH + + NR
Sbjct: 96 TYVNLDDCWADSERTK-EGYMVANPKTFPSGIKALADYVHSKGLKLGIYSSAGTRTCSNR 154
Query: 121 IESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ ++ +E+ G+DYLKYDNC+ D T T+R+ MS AL +GRPIF+S+CEW
Sbjct: 155 MPGSLGYEERDAKMFASWGVDYLKYDNCYRDGTPETVRFGRMSRALVNSGRPIFYSLCEW 214
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
G M WG NSWRTT DI+D+W+ ML D+N+ YA +AKPGGWNDPDMLEVGNGG
Sbjct: 215 GFMEVPKWGGMYGNSWRTTGDINDTWSGMLDNIDLNDAYARYAKPGGWNDPDMLEVGNGG 274
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA 279
M YNEYVVHFS+WAI+KAPL++GCDV ++ ET+ I+ N EVIA++Q
Sbjct: 275 MAYNEYVVHFSLWAIAKAPLVIGCDVTRVSNETLGILSNAEVIAISQG 322
>gi|302809601|ref|XP_002986493.1| hypothetical protein SELMODRAFT_446636 [Selaginella moellendorffii]
gi|300145676|gb|EFJ12350.1| hypothetical protein SELMODRAFT_446636 [Selaginella moellendorffii]
Length = 409
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 162/394 (41%), Positives = 210/394 (53%), Gaps = 88/394 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLG TPPMGWNSWNHF C I+E ++ET DDCW E R
Sbjct: 33 LHNGLGKTPPMGWNSWNHFGCNIDELTVKETAEALVSTGLAALGYNYLNVDDCWAEMKRG 92
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S+ G L A TFPSG KALAD+VH L
Sbjct: 93 SE--------------------------GELTARAATFPSGIKALADFVH---EKGLKFG 123
Query: 121 IESNIAF-------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
I S+ + EQ GIDYLKYDNCF D ++P +RY M +AL K
Sbjct: 124 IYSDAGYFTCEKQPGSLGYEEQDAETFASWGIDYLKYDNCFTDGSKPELRYPIMQEALAK 183
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IFFS+CEWG PA W ++ NSWRTT DI+D+W M TIAD+N+ +A A PGGW
Sbjct: 184 TGRKIFFSICEWGVDDPAEWAPDVGNSWRTTGDITDTWKSMTTIADLNDRWASFAGPGGW 243
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLEVGNGGM +EY HFSIWA+ KAPL+LGCD+ +++ +T+ II N+EVI+VNQ
Sbjct: 244 NDPDMLEVGNGGMTIDEYRSHFSIWALMKAPLILGCDLRDMSNDTLEIITNKEVISVNQD 303
Query: 279 AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-- 323
++ Q +IWAGPLS R V++ +R + + W +I + +
Sbjct: 304 SLGIQGRKVCKKGPEECHEIWAGPLSLKRTAVVIWNRCEEDADIEVSWQEISLHPSTRVT 363
Query: 324 ---LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++H + + V H +M+ L+P
Sbjct: 364 IRDLWKHEYWSGVYNESIVVSVAPHGAEMFILSP 397
>gi|302762979|ref|XP_002964911.1| hypothetical protein SELMODRAFT_83268 [Selaginella moellendorffii]
gi|300167144|gb|EFJ33749.1| hypothetical protein SELMODRAFT_83268 [Selaginella moellendorffii]
Length = 413
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 161/394 (40%), Positives = 210/394 (53%), Gaps = 88/394 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLG TPPMGWNSWNHF C I+E ++ET DDCW E R
Sbjct: 28 LHNGLGKTPPMGWNSWNHFGCNIDELTVKETAEALVSTGLAALGYNYLNVDDCWAEMKRG 87
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S+ G L A TFPSG KALAD+VH L
Sbjct: 88 SE--------------------------GELTARAATFPSGIKALADFVH---EKGLKFG 118
Query: 121 IESNIAF-------------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
I S+ + EQ GIDYLKYDNC+ D ++P +RY M +AL K
Sbjct: 119 IYSDAGYFTCEKQPGSLGYEEQDAETFASWGIDYLKYDNCYTDGSKPELRYPIMQEALAK 178
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IFFS+CEWG PA W ++ NSWRTT DI+D+W M TIAD+N+ +A A PGGW
Sbjct: 179 TGRKIFFSICEWGVDDPAEWAPDVGNSWRTTGDITDTWKSMTTIADLNDRWASFAGPGGW 238
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLEVGNGGM +EY HFSIWA+ KAPL+LGCD+ +++ +T+ II N+EVI+VNQ
Sbjct: 239 NDPDMLEVGNGGMTIDEYRSHFSIWALMKAPLILGCDLRDMSNDTLEIITNKEVISVNQD 298
Query: 279 AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-- 323
++ Q +IWAGPLS R V++ +R + + W +I + +
Sbjct: 299 SLGIQGRKVCKKGPEECHEIWAGPLSLKRTAVVIWNRCEGDADIEVSWQEISLHPSTRVT 358
Query: 324 ---LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
L++H + + V H +M+ L+P
Sbjct: 359 IRDLWKHEYWSGVYNESIVVSVAPHGAEMFILSP 392
>gi|125558188|gb|EAZ03724.1| hypothetical protein OsI_25855 [Oryza sativa Indica Group]
Length = 385
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 164/379 (43%), Positives = 207/379 (54%), Gaps = 76/379 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
RR+LL NGLG TP MGWNSWNHF C INE IR T D + +
Sbjct: 40 RRSLLDNGLGRTPQMGWNSWNHFGCNINENTIRSTVDA-----------------LISTG 82
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-----IESNI 125
A N GN+ A+ FPSG KALADYVH + I +S +R + ++
Sbjct: 83 LAKAGYTYVNLGGGNMAADPKKFPSGIKALADYVHSKGLKLGIYSSAGSRTCSKTMPGSL 142
Query: 126 AFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+E G+DYLKYDNC +D + T+R G
Sbjct: 143 GYEDIDAKTFASWGVDYLKYDNCNSDGSSETVR----------------------GQRNV 180
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
A WG NSWRTT DI+DSWA ML+ D N+ A +AKPGGWNDPDMLEVGNGGM +E
Sbjct: 181 ATWGGQYGNSWRTTGDINDSWASMLSNIDSNDASASYAKPGGWNDPDMLEVGNGGMTNDE 240
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI------------NDQQ 284
YVVH S+WAI+KAPL++GCDV +++ ETM I+ N EVIA+NQ N+ +
Sbjct: 241 YVVHISLWAIAKAPLIIGCDVRSISRETMEILSNPEVIAINQDPLGVQGKKVRKYDNEIE 300
Query: 285 IWAGPLSGNRIVVLLESR-KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKL 338
+WAGPLS R VLL +R T S +TA W DIG+ ++ H T +F G L
Sbjct: 301 VWAGPLSQQRTAVLLLNRGATGSRQITAAWQDIGVGPGVAVEAKNVWLHATAPGRFTGSL 360
Query: 339 TAKVDGHSCKMYELTPVTK 357
TA+V HSCK++ LTPV +
Sbjct: 361 TAEVAAHSCKLFVLTPVGR 379
>gi|116787336|gb|ABK24468.1| unknown [Picea sitchensis]
Length = 377
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 152/334 (45%), Positives = 186/334 (55%), Gaps = 77/334 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NG+G TP MGWNSWN F C INE +IRET DDCW ++R+
Sbjct: 51 LNNGVGQTPQMGWNSWNFFACAINETVIRETADALISTGLADLGYVYVNIDDCWSSTTRN 110
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
+G+L + TFPSG KALADYVH
Sbjct: 111 --------------------------LEGDLEPDPKTFPSGIKALADYVHEKYLKLGIYS 144
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
+ + + ++N+ GIDYLKYDNC+N D P RY M +AL
Sbjct: 145 DAGIYTCQVRPGSIFNENRDANLFASWGIDYLKYDNCYNLDIPPKKRYPPMREALNATEH 204
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+CEWG PALW NI NSWRTTDDI+DSW M IAD+N+ +A +A PGGWNDP
Sbjct: 205 TIFYSLCEWGVDDPALWADNIGNSWRTTDDINDSWWSMTAIADLNDKWAAYAGPGGWNDP 264
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-- 280
DMLEVGNGGM Y EY HFSIWA+ KAPLL+GCDV N+T ET I+ N+EVIAVNQ
Sbjct: 265 DMLEVGNGGMTYQEYRSHFSIWALMKAPLLVGCDVRNMTKETFEILSNKEVIAVNQDTLG 324
Query: 281 ------------NDQQIWAGPLSGNRIVVLLESR 302
+Q+WAGPLSG R+ V+ +R
Sbjct: 325 IQGRKVSAEGLDGCRQVWAGPLSGGRLAVVFWNR 358
>gi|302788170|ref|XP_002975854.1| hypothetical protein SELMODRAFT_232618 [Selaginella moellendorffii]
gi|300156130|gb|EFJ22759.1| hypothetical protein SELMODRAFT_232618 [Selaginella moellendorffii]
Length = 378
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 152/393 (38%), Positives = 210/393 (53%), Gaps = 81/393 (20%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSR 59
+ NGLG TP MGW+ WN + C INEKI+ DDCW R
Sbjct: 2 VYDNGLGQTPLMGWSGWNFYECTINEKIVYGNMDALVRLDLPSYGYNYVIVDDCWSAYKR 61
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LI 113
D + GNL ++K TFPSG KALADY H L
Sbjct: 62 DKE--------------------------GNLRSDKKTFPSGMKALADYAHKRGMKFGLY 95
Query: 114 SNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPT-IRYRAMSDALKK 159
S++ + + A +G IDYLKYDNCF+ RP+ RY MS+AL K
Sbjct: 96 SDAGRMTCKCHRAASEGHEFRDARTFASWGIDYLKYDNCFHKKNRPSHARYPVMSEALNK 155
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
+GRPIF++MCEWG+ PA+W NSWRT+ D+ D W R+ +AD N ++A +A PGGW
Sbjct: 156 SGRPIFYAMCEWGEDHPAVWAGKYANSWRTSLDVKDRWPRIELLADDNNLWASYAGPGGW 215
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDML+VGNG M EY HFSIW+I KAPL++GC++ +T E + I N E+IAVNQ
Sbjct: 216 NDPDMLQVGNGRMTLAEYRSHFSIWSIMKAPLIIGCELRTITKEHLEIYKNTEIIAVNQD 275
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
I ++WAGPLSG++IVV +R +++T +W D G++ + T
Sbjct: 276 SLGIQGRKVSQIGQSEVWAGPLSGDKIVVAAWNRGWKRALITINWYDFGLEPSVTASVRD 335
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
L++H + + ++D H C +Y L+ ++
Sbjct: 336 LWKHEDWTQRQEDGFEVEIDSHDCGVYILSNIS 368
>gi|320168305|gb|EFW45204.1| alpha-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 204/370 (55%), Gaps = 43/370 (11%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+ NGL TP MG+N+WN FHC INE +IR T D + ++ + D Q++
Sbjct: 20 IDNGLARTPQMGFNTWNKFHCTINETLIRNTAD---SLLKTGLAAVGYKYINLDDCWQVS 76
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDNRIES---NIAFEQ- 129
Q +VA+ T FPSG ALA+Y+H L S++ N E ++ +E
Sbjct: 77 -----RTAQNVIVADPTAFPSGIAALANYIHSKGLLFGLYSDAGTNTCEGRPGSLGYETI 131
Query: 130 --------GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
G+DYLKYDNC D P +RY M DAL GRPIFFSMCEWG PA W
Sbjct: 132 DAQTYASWGVDYLKYDNCNAPSDQTPEVRYPVMRDALNATGRPIFFSMCEWGVDNPASWA 191
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ NSWRTT DISDSW M+ I D NE A PGGWNDPDMLEVGNGGM EY H
Sbjct: 192 GKVGNSWRTTGDISDSWTSMIGIVDQNEPLWQAAGPGGWNDPDMLEVGNGGMTTTEYRTH 251
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGP 289
F++W++ KAPL++GCD+ N++ +T+ I+ N E+I NQ ++ + Q+WAGP
Sbjct: 252 FTLWSLMKAPLIIGCDITNMSNDTLAILTNTELIEWNQDSLGVQGHRFTSVGNSQVWAGP 311
Query: 290 LSGNRI-VVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK----FQGKLTAKVDG 344
LS NR +VLL + ++ +T W DIG+K+ + K + G +A+V
Sbjct: 312 LSNNRYALVLLNVDNSATANITTTWADIGLKTGAKYMARDVWQHKNVGLYAGTFSAEVPP 371
Query: 345 HSCKMYELTP 354
H L+P
Sbjct: 372 HGVVAVTLSP 381
>gi|302764642|ref|XP_002965742.1| hypothetical protein SELMODRAFT_85032 [Selaginella moellendorffii]
gi|300166556|gb|EFJ33162.1| hypothetical protein SELMODRAFT_85032 [Selaginella moellendorffii]
Length = 444
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 156/413 (37%), Positives = 215/413 (52%), Gaps = 97/413 (23%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWG 55
+RR + NGLG TP MGW+ WN + C INEKI+ DDCW
Sbjct: 48 ERRMVYDNGLGQTPLMGWSGWNFYECTINEKIVYGNMDALVRLDLPSYGYNYVIVDDCWS 107
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI---- 111
RD + GNL ++K TFPSG KALADY H
Sbjct: 108 AYKRDKE--------------------------GNLRSDKKTFPSGMKALADYAHERGMK 141
Query: 112 --LISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPT-IRYRAMSD 155
L S++ + + A +G IDYLKYDNCF+ RP+ RY MS+
Sbjct: 142 FGLYSDAGRMTCKCHRAASEGHEFRDARTFASWGIDYLKYDNCFHKKNRPSHARYPVMSE 201
Query: 156 ALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAK 215
AL K+GRPIF++MCEWG+ PA+W NSWRT+ D+ D W R+ +AD N ++A +A
Sbjct: 202 ALNKSGRPIFYAMCEWGEDHPAVWAGKYANSWRTSLDVKDRWPRIELLADDNNLWASYAG 261
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PGGWNDPDML+VGNG M EY HFSIW+I KAPL++GC++ +T E + I N E+IA
Sbjct: 262 PGGWNDPDMLQVGNGRMSLAEYRSHFSIWSIMKAPLIIGCELRTITKEHLEIYKNTEIIA 321
Query: 276 VNQ---AINDQ------------------------QIWAGPLSGNRIVVLLESRKTFSSM 308
VNQ I + Q+WAGPLSG+RIV+ +R ++
Sbjct: 322 VNQDSLGIQGRKVSRIGQSEVITSCLTLTKLCVTTQVWAGPLSGDRIVMAAWNRGWKRAL 381
Query: 309 MTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
+T +W DIG++ + T L++H + + ++D H C +Y L+ ++
Sbjct: 382 ITINWYDIGLEPSVTASVRDLWKHEDWTQRQKDGFEVEIDSHDCGVYILSNIS 434
>gi|302758986|ref|XP_002962916.1| hypothetical protein SELMODRAFT_78263 [Selaginella moellendorffii]
gi|300169777|gb|EFJ36379.1| hypothetical protein SELMODRAFT_78263 [Selaginella moellendorffii]
Length = 401
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 151/379 (39%), Positives = 207/379 (54%), Gaps = 46/379 (12%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
RR + NGLG TP MGW+ WN + C+INEKI+ D D L ++ + D
Sbjct: 15 RRLVYDNGLGKTPIMGWSGWNKYECKINEKIVFGNTDALVRHGLDQ---LGYNYVIVDDC 71
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN------------SL 117
+ QGNL ++ TTFPSG KALADY H L S+ S
Sbjct: 72 WAAYKR----DKQGNLKSHPTTFPSGMKALADYAHSKGLKFGLYSDAGRRTCIGHRAGSE 127
Query: 118 DNRIESNIAFEQ-GIDYLKYDNCFNDD-TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
D+ I F G+DYLKYDNCF+ P +RY MS+AL K+GRPIF+++CEWG+
Sbjct: 128 DHEIRDAKTFASWGVDYLKYDNCFHKKHYYPHLRYPVMSEALNKSGRPIFYTICEWGEDH 187
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
PALW NSWRT+ D+ D W R+ +AD N ++A +A PGGWNDPDML+VGNG M
Sbjct: 188 PALWAGKYANSWRTSLDVKDRWDRIQILADDNNMWAAYAGPGGWNDPDMLQVGNGRMSVA 247
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AIN 281
EY HFSIW+I KAPL++GC++ + E + I N+E+I +NQ
Sbjct: 248 EYRSHFSIWSIMKAPLIIGCELRKIREEHLEIYKNKEIIDINQDPLGIQARKVSRQGRCK 307
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQG 336
+++WAGPLSG RIV+ L +R + +W D+G++ L++H + Q
Sbjct: 308 CREVWAGPLSGKRIVLALWNRDWRRVAIKVNWFDLGLEPGVSVCVRDLWKHEDWSERQQD 367
Query: 337 KLTAKVDGHSCKMYELTPV 355
+D H C +Y L+ V
Sbjct: 368 GFEVAIDSHDCGVYILSDV 386
>gi|118377295|ref|XP_001021827.1| Melibiase family protein [Tetrahymena thermophila]
gi|89303594|gb|EAS01582.1| Melibiase family protein [Tetrahymena thermophila SB210]
Length = 382
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 206/371 (55%), Gaps = 42/371 (11%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
N L+NGL LTP MGWNSWNHFHC INE +I++T D S + D Q
Sbjct: 19 NCLNNGLALTPAMGWNSWNHFHCDINEDLIKQTADAIVSSGL---ADAGYKYVNLDDCWQ 75
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDNRIE---SNIAFE 128
I+ +G + +K FPSG KALADYVH L S++ D E ++ +E
Sbjct: 76 IS-----RDSEGYIQEDKAKFPSGIKALADYVHSKGLKFGLYSDAGDFTCEHRPGSLGYE 130
Query: 129 Q---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
G+DYLKYDNC+N++ P +RY M DAL G PI+FSMCEWG+ PA W
Sbjct: 131 VKDAERYAEWGVDYLKYDNCYNNNIPPQVRYPPMRDALNSTGHPIYFSMCEWGEQDPATW 190
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
+ NSWRTT DISD++ L+I + +A G WNDPDMLEVGNGGM EY
Sbjct: 191 APEVGNSWRTTGDISDNYDSFLSILEQQVGLEKYAHRGAWNDPDMLEVGNGGMTTQEYEA 250
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAG 288
HF++WA+ KAPLL+GCDV N++ +T I+ N+E+IA+NQ + Q+W G
Sbjct: 251 HFALWALLKAPLLIGCDVNNMSPDTFRILSNQEIIAINQDPLGIQGRRVLKTDQYQLWIG 310
Query: 289 PLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLETKFQGKLTAKVDGH 345
+ + I V+L + + S ++ ++ IGI L++H+ L KF + V H
Sbjct: 311 TIV-DGIAVILFNTSSQDSDLSFTFEQIGITGQAIIRDLYKHQDL-GKFSNSFSTNVPTH 368
Query: 346 SCKMYELTPVT 356
+ ++ P++
Sbjct: 369 GVAVLKVVPLS 379
>gi|10178279|emb|CAC08337.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
Length = 365
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 153/364 (42%), Positives = 197/364 (54%), Gaps = 67/364 (18%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRD 60
++G R L++NGL L+P MG + F I K I DDCWGE RD
Sbjct: 23 ADGFQSRMLMNNGLALSPQMGNTNDLSFTAADAMVSSGLSAIGYKYIN-IDDCWGELKRD 81
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
SQ G+LVA +TFPSG KAL+DYVH
Sbjct: 82 SQ--------------------------GSLVAKASTFPSGIKALSDYVHSKGLKLGIYS 115
Query: 112 ---------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAG 161
+ SL + + F GIDYLKYDNC N T P RY MS AL +G
Sbjct: 116 DAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNCENTGTSPRERYPKMSKALLNSG 175
Query: 162 RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWND 221
R IFFS+CEWG PA W +I NSWRTT DI D+W M IAD N+ +A +A+PG WND
Sbjct: 176 RSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWKSMTLIADQNDRWASYARPGSWND 235
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDMLEVGNGGM EY+ HFSIWA++KAPLL+GCD+ ++ T ++ N+EVIAVNQ
Sbjct: 236 PDMLEVGNGGMTKEEYMSHFSIWALAKAPLLIGCDLRSMDKVTFELLSNKEVIAVNQDKL 295
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL 330
D ++WAGPLS R+ V+L +R + S+ +TA W +IG+ S++ +
Sbjct: 296 GIQGKKVKKEGDLEVWAGPLSKKRVAVILWNRGSASANITARWAEIGLNSSDIVNARDLW 355
Query: 331 ETKF 334
E +
Sbjct: 356 EVSY 359
>gi|340381848|ref|XP_003389433.1| PREDICTED: alpha-galactosidase-like [Amphimedon queenslandica]
Length = 383
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 153/392 (39%), Positives = 203/392 (51%), Gaps = 82/392 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLG TPPMGWNSWNHF+C I EKI+R T D+CW SSR
Sbjct: 18 LDNGLGKTPPMGWNSWNHFNCTITEKIVRATVTAIIEKGLDKHGYKYVNLDNCWAASSRA 77
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LIS 114
S G + N TTFP K L D+VH L S
Sbjct: 78 S--------------------------DGTIQPNPTTFPD-MKGLIDFVHSKGLKFGLYS 110
Query: 115 N----SLDNRIESNIAFEQG---------IDYLKYDNCFNDDTRPTIRYRAMSDALKKAG 161
+ + R ++ +E +DYLKYDNC D ++P +RY M DAL K G
Sbjct: 111 DAGYYTCGKRQPGSLGYETKDANTYAEWEVDYLKYDNCETDGSKPEVRYPVMRDALNKTG 170
Query: 162 RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWND 221
RPIF+S+CE G +LWG + NSWRTT DI D W +M++ AD+N +A PGGWND
Sbjct: 171 RPIFYSLCETGKDNVSLWGPKVGNSWRTTRDIGDYWDKMISRADVNADLWPYAGPGGWND 230
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDMLEVGNG M + EY HFS+W + KAPLL+GCD+ ++ +T I+ N+ VIAVNQ
Sbjct: 231 PDMLEVGNGNMTFEEYKTHFSLWCLMKAPLLIGCDITKMSDDTFKILTNDHVIAVNQDPL 290
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LF 325
+IW P+ V L +R +S +T W DIG ++ + L+
Sbjct: 291 GIQGNRTKQDGTNEIWQVPMKDGTRVALAFNRGDSASDITIQWTDIGFPNSASVNVYDLW 350
Query: 326 EHRTLETKFQGKLTAK-VDGHSCKMYELTPVT 356
+ + + KF G +TAK + H MY++ PV+
Sbjct: 351 DDKKMIGKFTGSVTAKAIPSHGVAMYQVEPVS 382
>gi|320168306|gb|EFW45205.1| alpha-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 383
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 200/371 (53%), Gaps = 45/371 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGLG TP MG+N+WNHF C INE IIR S+ D+ I + ++ +
Sbjct: 21 LDNGLGRTPQMGFNTWNHFGCNINETIIR--------STVDAMIDTGLTAVGYNYINLDD 72
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRI 121
L G +VA+ FPSG ALA YVH +L D +I
Sbjct: 73 CWALNRTAAGVIVADPVAFPSGIAALASYVHSKGLLFGVYSDAGTKTCAGRPGSLGYEKI 132
Query: 122 ESNIAFEQGIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
++ G+DYLKYDNC D P +RY M DAL GRPIFFSMCEWGD PA W
Sbjct: 133 DAQTYASWGVDYLKYDNCNAPADQTPMVRYPVMRDALNATGRPIFFSMCEWGDYDPAKWA 192
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ + NSWRTT D+SDSW+R++TI + NE A PGGWNDPDMLEVGNG + EY H
Sbjct: 193 APVGNSWRTTGDVSDSWSRIITILEANEPLWKAAGPGGWNDPDMLEVGNGVLSTTEYTSH 252
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ----------QIWAGP 289
F++WA+ KAPL++GCDV +T +T+ I+ N EVI NQ ++ Q ++W P
Sbjct: 253 FTLWALMKAPLIIGCDVTKMTNDTLRILTNTEVIEWNQDSLGIQGHRVKRGLLSEVWMAP 312
Query: 290 LSGNRI-VVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVD 343
L+ R VVL + S+ MT W ++G+ + + +++H+ + + G A+V
Sbjct: 313 LANGRFAVVLFNTDPLLSAHMTVTWAELGLTAGSKYTARDVWQHKNVGL-YAGTFDAQVA 371
Query: 344 GHSCKMYELTP 354
L P
Sbjct: 372 PKGVVAVTLVP 382
>gi|302757874|ref|XP_002962360.1| hypothetical protein SELMODRAFT_78029 [Selaginella moellendorffii]
gi|300169221|gb|EFJ35823.1| hypothetical protein SELMODRAFT_78029 [Selaginella moellendorffii]
Length = 362
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 194/338 (57%), Gaps = 41/338 (12%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
RR + NGLG TP MGW+ WN + C+INEKI+ D D L ++ + D
Sbjct: 15 RRLVYDNGLGKTPIMGWSGWNKYECKINEKIVFGNTDALVRHGLDQ---LGYNYVIVDDC 71
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN------------SL 117
+ QGNL ++ TTFPSG KALADY H L S+ S
Sbjct: 72 WAAYKR----DKQGNLKSHPTTFPSGMKALADYAHSKGLKFGLYSDAGRRTCIGHRAGSE 127
Query: 118 DNRIESNIAFEQ-GIDYLKYDNCFNDD-TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
D+ I F G+DYLKYDNCF+ P +RY MS+AL K+GRPIF+++CEWG+
Sbjct: 128 DHEIRDAKTFASWGVDYLKYDNCFHKKHYYPHLRYPVMSEALNKSGRPIFYTICEWGEDH 187
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
PALW NSWRT+ D+ D W R+ +AD N ++A +A PGGWNDPDML+VGNG M
Sbjct: 188 PALWAGKYANSWRTSLDVKDRWDRIQILADDNNMWAAYAGPGGWNDPDMLQVGNGRMSVA 247
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------AINDQ---- 283
EY HFSIW+I KAPL++GC++ + E + I N+E+I +NQ ++ Q
Sbjct: 248 EYRSHFSIWSIMKAPLIIGCELRKIREEHLEIYKNKEIIDINQDPLGIQARKVSRQGRCK 307
Query: 284 --QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
++WAGPLSG RIV+ L +R + +W D+G++
Sbjct: 308 CREVWAGPLSGKRIVLALWNRDWRRVAIKVNWFDLGLE 345
>gi|147860300|emb|CAN81856.1| hypothetical protein VITISV_041251 [Vitis vinifera]
Length = 472
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/346 (41%), Positives = 186/346 (53%), Gaps = 97/346 (28%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
LL+NG+ TPPMGWNSWNHF C+I+E ++ T DDCWGE +R
Sbjct: 40 LLANGVARTPPMGWNSWNHFQCKIDEWTVKATADALISTGLAALGYKYVNIDDCWGEGNR 99
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
DS+ GNLVA +TFPSG KALADYVH + L
Sbjct: 100 DSR--------------------------GNLVARSSTFPSGIKALADYVH---ARGLKL 130
Query: 120 RIESNIAFE----------QGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
I S+ + QG+DYLKYDNC+ND ++P RY MS AL KAGRPI +S+C
Sbjct: 131 GIYSDAGAKEGFSLCGHLVQGVDYLKYDNCYNDGSKPQDRYARMSSALHKAGRPILYSLC 190
Query: 170 EWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
E + +IAD N ++ +A PG WNDPDMLEVGN
Sbjct: 191 EC----------------------------ITSIADANNIWGRYAGPGRWNDPDMLEVGN 222
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---AINDQQI- 285
GGM EY HFSIWA+ KAPLL+GCDV + + ET+ I+GN+EVI VNQ I ++I
Sbjct: 223 GGMSLEEYRSHFSIWALMKAPLLIGCDVRSSSKETLSILGNKEVIDVNQDPLGIQGRKIR 282
Query: 286 -------WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
WAGPLS R+ V+L +R + + +T W +IG+ +N +
Sbjct: 283 SKASVEVWAGPLSKRRVAVVLWNRGSSQAPITVGWREIGLSPSNLV 328
>gi|118362944|ref|XP_001014570.1| alpha-galactosidase, putative [Tetrahymena thermophila]
gi|89296464|gb|EAR94452.1| alpha-galactosidase, putative [Tetrahymena thermophila SB210]
Length = 381
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 141/347 (40%), Positives = 190/347 (54%), Gaps = 45/347 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TP MGWNSWNHF C INE +I+ T D + +L ++ D QI+
Sbjct: 20 LDNGLAKTPAMGWNSWNHFACNINEDLIKATAD---QIVNSGLAALGYNYINLDDCWQIS 76
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIE 122
G +V +K FPSG KALADYVH L S+ SL ++
Sbjct: 77 -----RDADGYIVEDKEKFPSGIKALADYVHSKGLKFGLYSDAGEFTCQKRPGSLGYEVK 131
Query: 123 SNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ E +DYLKYDNCFN + P IRY M DAL GRPI+FSMCEWG PA W
Sbjct: 132 DAQRYAEWEVDYLKYDNCFNKNINPKIRYPPMRDALNATGRPIYFSMCEWGQYNPATWAP 191
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTT DI D ++ L+I + +A PG WNDPDMLEVGNGGM +EY HF
Sbjct: 192 EVGNSWRTTGDIKDRYSSFLSILEKQVGLEKYAHPGAWNDPDMLEVGNGGMTTHEYEAHF 251
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---AINDQ------------QIW 286
++WA+ KAPLL+GCDV ++ +T+ I+ N+E+IA+NQ I Q+W
Sbjct: 252 ALWALLKAPLLIGCDVSKMSEDTIRILTNKEIIAINQDPLGIQGHRVKKSGFWFWSWQLW 311
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTL 330
G L + + ++L + + ++ + ++GI T L++H L
Sbjct: 312 MGQLE-DGVAIILFNTSAWERNLSFTFKEVGIVGPATIRDLYQHEDL 357
>gi|300120197|emb|CBK19751.2| unnamed protein product [Blastocystis hominis]
Length = 366
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 147/374 (39%), Positives = 196/374 (52%), Gaps = 58/374 (15%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MG+N+WN F C INEK+ R+T D S D L + D Q+A
Sbjct: 1 MGFNTWNKFGCNINEKLARDTADAIVNSGLDK---LGYIYLNLDDCWQVARD------NN 51
Query: 90 NLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRIESNIAFEQGI 131
+++ FPSG KALADY+H +L D I++ + +
Sbjct: 52 SVIVEDANFPSGMKALADYIHSKGLLFGLYSDAGYKTCAGRPGSLGYEEIDAKTYAKWEV 111
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKYDNC D T P RY M+ AL+ GR IFFSMCEWG PA W + NSWRTT
Sbjct: 112 DYLKYDNCNTDGTTPEHRYPVMTKALQNCGRDIFFSMCEWGVDDPANWARAVGNSWRTTG 171
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI D+W M++IA NE A PGGWNDPDMLEVGNGGM Y EY HFS+W + KAPL
Sbjct: 172 DIQDNWGSMMSIAQANEKLWRRAGPGGWNDPDMLEVGNGGMNYEEYKTHFSLWCLMKAPL 231
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ------------------AIN------DQQIWA 287
L+GCD+ + ET+ I+ N+E IAVNQ +N D ++WA
Sbjct: 232 LIGCDLTKASQETLEILSNKEAIAVNQDGLGVQGHRVWSDKGGNKEVNGDVPEGDLEVWA 291
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAK- 341
GPL + V+L +R + +T ++D G++ ++T ++ H+ + T QG T K
Sbjct: 292 GPLMSGQFAVILLNRSEEEAEITFQFEDCGLRKDDTAMIRDIWVHQDITTA-QGSYTMKG 350
Query: 342 VDGHSCKMYELTPV 355
+ H K LTP+
Sbjct: 351 IPRHGVKFLILTPI 364
>gi|125601522|gb|EAZ41098.1| hypothetical protein OsJ_25590 [Oryza sativa Japonica Group]
Length = 437
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 176/342 (51%), Gaps = 73/342 (21%)
Query: 90 NLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAFEQ-GI 131
L+ + TFPSG K LADYVH + SL + + F G+
Sbjct: 93 QLLPDPKTFPSGIKDLADYVHGKGLKLGIYSDAGIFTCQVRPGSLHHEKDDAAIFASWGV 152
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKYDNC+N +P RY M DAL GR IF+S+CEWG PALW + NSWRTTD
Sbjct: 153 DYLKYDNCYNLGIKPKDRYPPMRDALNSTGRQIFYSLCEWGQDDPALWAGKVGNSWRTTD 212
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK--- 248
DI D+W M IAD N +A +A PGGWNDPDMLEVGNGGM + EY HFSIWA+ K
Sbjct: 213 DIQDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTFAEYRAHFSIWALMKAST 272
Query: 249 ---------------------------------APLLLGCDVGNLTAETMPIIGNEEVIA 275
APLL+GCDV N+T ETM I+ N+EVI
Sbjct: 273 HWVHSVITRMAPIFQYVAFGLHDSNLFVLLLDQAPLLIGCDVRNMTKETMEILSNKEVIQ 332
Query: 276 VNQ--------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN 321
VNQ Q++WAGPLSGNR+ V+L +R S+ + +G+ +
Sbjct: 333 VNQDPLGVQGRRILGQGKNGCQEVWAGPLSGNRLAVVLWNRCEESANIIVKLPSVGLDGS 392
Query: 322 NT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ L++H TL G A+VD H CKMY TP +
Sbjct: 393 SPYSVRDLWKHETLSENVVGTFGAQVDVHDCKMYIFTPAVTV 434
>gi|145522522|ref|XP_001447105.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414605|emb|CAK79708.1| unnamed protein product [Paramecium tetraurelia]
Length = 382
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 189/341 (55%), Gaps = 44/341 (12%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF---DS 74
N L NGLG PPMGWN WN + C INE+I+R+T D ++ L+ + +++ D
Sbjct: 15 NALDNGLGRKPPMGWNPWNKYGCNINEEIVRQTADALVKTG------LAAAGYIYLNLDD 68
Query: 75 LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SL 117
Q A P + ++A+ FPSG +L YVH L S+ SL
Sbjct: 69 CWQSARDPATKK----IIADPVKFPSGIPSLVQYVHSRGLKFGLYSDAGMQTCEGKPGSL 124
Query: 118 D-NRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
I++ E IDYLKYDNC D T P +RY MS AL K + I+FSMCEWG +P
Sbjct: 125 GYEEIDAQTYAEWDIDYLKYDNCHTDGTSPKVRYPPMSAALMKQTKKIYFSMCEWGLEKP 184
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
LW I NSWRTT DISD W + I + A +A PG WNDPDMLEVGNGGMK +E
Sbjct: 185 WLWAPPIANSWRTTGDISDHWYSFIAILEEQANLAQYAGPGQWNDPDMLEVGNGGMKTHE 244
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQI 285
Y HF++WAI KAPLL+GCD+ N++ +T I+ N EVIAVNQ + ++
Sbjct: 245 YQAHFALWAILKAPLLIGCDITNMSQDTKKILMNPEVIAVNQDSLGIQATRVKKVLTSEV 304
Query: 286 WAGPLS--GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
WA ++ G +V +L ++ + +T +D +GI + +
Sbjct: 305 WAAQVADNGAGVVAVLFNQASLQESITIEFDKLGISGDQNV 345
>gi|384146004|ref|YP_005528820.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|340524158|gb|AEK39363.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 657
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 140/369 (37%), Positives = 204/369 (55%), Gaps = 46/369 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L +GL LTPPMG+N+WN C ++E++IR+T D + + L + + + ++
Sbjct: 19 LPDGLALTPPMGFNNWNTTGCAVDEQLIRDTADIFVDKG------LKAAGYQYVNVDDCW 72
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSLDNR- 120
A+P G + ANK FP G KALADYVH +LD+
Sbjct: 73 AEP-ERDADGRMQANKARFPGGIKALADYVHSKGLKFGLYTSAGTLTCAKTQPGALDHED 131
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+++ + G+DYLKYDNC N+ RP + RY M DALKK GRPI +S+CEWG+ +P W
Sbjct: 132 VDAQTFADWGVDYLKYDNC-NNQGRPALERYTKMRDALKKTGRPIVYSLCEWGENKPWTW 190
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
G+++ + WRTT DI D+WA++L I N A +A PG WNDPDMLEVGNGGM EY
Sbjct: 191 GADVGHLWRTTGDIKDNWAKVLQILKANAPLAPYAGPGHWNDPDMLEVGNGGMTTEEYRS 250
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAG 288
HFS+WA+ APLL+G D+ ++ ++ N EVIA++Q Q ++A
Sbjct: 251 HFSLWAMMAAPLLIGADLRKVSPANFDVLRNAEVIALDQDRRGVQARVLSNQDGHWVFAK 310
Query: 289 PLSGNRIVVLLESRKTFSSMMTAHWDDIGIK-----SNNTLFEHRTLETKFQGKLTAKVD 343
PL G + + L + T S+ + D G+ + L+ HR L+T G+L+A V
Sbjct: 311 PLDGGDVAIALFNETTSSATIGTTAADAGLPQAAGYTARDLWAHRDLQTA--GRLSAVVP 368
Query: 344 GHSCKMYEL 352
H+ +Y +
Sbjct: 369 PHATVVYRV 377
>gi|300782784|ref|YP_003763075.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|399534670|ref|YP_006547332.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299792298|gb|ADJ42673.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|398315440|gb|AFO74387.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 663
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 140/369 (37%), Positives = 204/369 (55%), Gaps = 46/369 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L +GL LTPPMG+N+WN C ++E++IR+T D + + L + + + ++
Sbjct: 25 LPDGLALTPPMGFNNWNTTGCAVDEQLIRDTADIFVDKG------LKAAGYQYVNVDDCW 78
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSLDNR- 120
A+P G + ANK FP G KALADYVH +LD+
Sbjct: 79 AEP-ERDADGRMQANKARFPGGIKALADYVHSKGLKFGLYTSAGTLTCAKTQPGALDHED 137
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+++ + G+DYLKYDNC N+ RP + RY M DALKK GRPI +S+CEWG+ +P W
Sbjct: 138 VDAQTFADWGVDYLKYDNC-NNQGRPALERYTKMRDALKKTGRPIVYSLCEWGENKPWTW 196
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
G+++ + WRTT DI D+WA++L I N A +A PG WNDPDMLEVGNGGM EY
Sbjct: 197 GADVGHLWRTTGDIKDNWAKVLQILKANAPLAPYAGPGHWNDPDMLEVGNGGMTTEEYRS 256
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAG 288
HFS+WA+ APLL+G D+ ++ ++ N EVIA++Q Q ++A
Sbjct: 257 HFSLWAMMAAPLLIGADLRKVSPANFDVLRNAEVIALDQDRRGVQARVLSNQDGHWVFAK 316
Query: 289 PLSGNRIVVLLESRKTFSSMMTAHWDDIGIK-----SNNTLFEHRTLETKFQGKLTAKVD 343
PL G + + L + T S+ + D G+ + L+ HR L+T G+L+A V
Sbjct: 317 PLDGGDVAIALFNETTSSATIGTTAADAGLPQAAGYTARDLWAHRDLQTA--GRLSAVVP 374
Query: 344 GHSCKMYEL 352
H+ +Y +
Sbjct: 375 PHATVVYRV 383
>gi|242044160|ref|XP_002459951.1| hypothetical protein SORBIDRAFT_02g018600 [Sorghum bicolor]
gi|241923328|gb|EER96472.1| hypothetical protein SORBIDRAFT_02g018600 [Sorghum bicolor]
Length = 331
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 143/367 (38%), Positives = 182/367 (49%), Gaps = 70/367 (19%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQ 62
+V+ A S+ RR L NGLG TP MGWNSWNHFH +INE IIR+ D ++
Sbjct: 19 VVAIRPARSSPMARRAALENGLGRTPQMGWNSWNHFHRKINEDIIRQIADAMVDTGL--- 75
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIE 122
L + D QGNL AN +TFPSG
Sbjct: 76 AKLGYEYINIDDFW----AAYDRDSQGNLAANVSTFPSGV-------------------- 111
Query: 123 SNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
RA++D + G + G PA W
Sbjct: 112 ----------------------------RALADYVHGKGLKLGIYSDAGGVNDPATWAGG 143
Query: 183 IRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFS 242
I NSWRTT DI D+WA+M IAD N +A +A PGGWNDPDMLEVGNGGM EY HFS
Sbjct: 144 IGNSWRTTGDIKDTWAKMTAIADKNNKWASYAGPGGWNDPDMLEVGNGGMTTEEYRSHFS 203
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN-----------QAINDQQIWAGPLS 291
IWA+ KAPLL+GCD+ +++ ET I+ N VIAVN Q DQ++WAGPL+
Sbjct: 204 IWALMKAPLLIGCDIRSMSNETKEILSNRNVIAVNQDGLGVQGHKVQQDGDQEVWAGPLT 263
Query: 292 GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRT----LETKFQGKLTAKVDGHSC 347
G R V+L +R + + +TA W IG+ ++ H + + QG+L VD H+C
Sbjct: 264 GGRFAVVLWNRGSAQASITASWSSIGLNASTVADAHDLWIDDIISAVQGELEETVDSHAC 323
Query: 348 KMYELTP 354
KMY LTP
Sbjct: 324 KMYVLTP 330
>gi|414870945|tpg|DAA49502.1| TPA: hypothetical protein ZEAMMB73_583660 [Zea mays]
gi|414870946|tpg|DAA49503.1| TPA: hypothetical protein ZEAMMB73_583660 [Zea mays]
Length = 224
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 116/223 (52%), Positives = 150/223 (67%), Gaps = 17/223 (7%)
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
MS AL K GRPI+FS+CEWGDM PA WG+ NSWRTT+DI+D+W M+ AD NEV+++
Sbjct: 1 MSRALMKVGRPIYFSLCEWGDMHPATWGATYGNSWRTTNDIADTWDSMIATADQNEVWSE 60
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+A+PGGWNDPDMLEVGNGGM +EYVVHFS+WAISKAPL++GCDV +++ ET I+ N+E
Sbjct: 61 YARPGGWNDPDMLEVGNGGMTNSEYVVHFSLWAISKAPLIIGCDVRHMSRETYAILANKE 120
Query: 273 VIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTF-SSMMTAHWDDIGIKS 320
VIAVNQ +IWA PLS R VLL +R + + AHWDDIG+ +
Sbjct: 121 VIAVNQDPLGVQGKKVRMEGSSEIWAAPLSEYRTAVLLLNRHAKDEATIAAHWDDIGLPA 180
Query: 321 NNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
L+ H TL+ F K++ V HSC+M L P ++
Sbjct: 181 GTPVEARDLWLHETLDATFTDKMSFDVAPHSCRMLVLKPRIQI 223
>gi|242044156|ref|XP_002459949.1| hypothetical protein SORBIDRAFT_02g018590 [Sorghum bicolor]
gi|241923326|gb|EER96470.1| hypothetical protein SORBIDRAFT_02g018590 [Sorghum bicolor]
Length = 302
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 128/301 (42%), Positives = 171/301 (56%), Gaps = 56/301 (18%)
Query: 88 QGNLVANKTTFPSGTKALADYVHI----------LISNSLDNRIESNIAFEQ-------- 129
QGNL AN +TFPSG KALADYVH S + N++ ++ E+
Sbjct: 23 QGNLAANPSTFPSGIKALADYVHGKGLKLGVYSDAGSRTCSNQMPGSLGHEEQDAKTFAS 82
Query: 130 -GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
G+DYLKYDNC + P + R + D PA W S + NSWR
Sbjct: 83 WGVDYLKYDNCNDQGLIP--QPRGVDD--------------------PATWASGVGNSWR 120
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK 248
TT DI D+W M IAD N+ +A +A+PGGWNDPDMLEVGNGGM EY HFSIWA+ K
Sbjct: 121 TTGDIQDNWGSMTAIADANDKWASYAQPGGWNDPDMLEVGNGGMTTEEYRSHFSIWALVK 180
Query: 249 APLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVV 297
APLL+GCD+ +++++T I+ N+ VI VNQ + DQ++WAGPLSG R+ V
Sbjct: 181 APLLIGCDIRSMSSDTKEILSNQNVITVNQDVLGVQGRKVLQDGDQEVWAGPLSGGRVAV 240
Query: 298 LLESRKTFSSMMTAHWDDIGIKSNNTLFEHR----TLETKFQGKLTAKVDGHSCKMYELT 353
+L +R + + +TA W IG+ + + H + + QG+L VD H+CKMY LT
Sbjct: 241 VLWNRGSDQASITASWSSIGLNESTVVDAHDLWTGEVTSSVQGELKETVDTHACKMYVLT 300
Query: 354 P 354
P
Sbjct: 301 P 301
>gi|440792938|gb|ELR14143.1| ribosomal protein S32, putative [Acanthamoeba castellanii str.
Neff]
Length = 385
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/333 (39%), Positives = 186/333 (55%), Gaps = 41/333 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGLG TP MGWNSWN+F C +NE+I+ ET D S L+ + + + ++
Sbjct: 22 LNNGLGKTPQMGWNSWNYFACNLNEQIVNETIDNLISSG------LAAAGYRYVNMDDCW 75
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDN---------RIESN 124
A + G + + TFP+G + LAD H L S++ + E N
Sbjct: 76 AGWRDDA--GFIHPDNKTFPNGVRPLADKAHSKGLLFGLYSDAGEKTCAGRPGSLNYEKN 133
Query: 125 IAFEQ---GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
A+ G+DYLKYDNC+ND+ P RY M DAL R IF+SMCEWG PA W
Sbjct: 134 DAYTYASWGVDYLKYDNCYNDNISPLTRYPIMRDALNATNRKIFYSMCEWGVDNPATWAP 193
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
+ NSWRTT DIS WA +++ D+N+ + +A PGG+NDPDMLEVGN G+ + E HF
Sbjct: 194 LVGNSWRTTGDISAHWASVMSRIDLNDEWWKYAGPGGFNDPDMLEVGNKGLTHTEQKSHF 253
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------------AINDQQIW 286
S+WA++KAPLL+GCD+ NL+ E I+ EVIA+NQ A +W
Sbjct: 254 SLWALAKAPLLIGCDIRNLSREVFEILTAPEVIAINQDPLGVQGHKVWNKTFADGAIDVW 313
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
AGPL+ + V+ +R + +++ W +G+K
Sbjct: 314 AGPLANGDVAVITLNRASVQAVIPVTWSVVGLK 346
>gi|328869524|gb|EGG17902.1| hypothetical protein DFA_08903 [Dictyostelium fasciculatum]
Length = 382
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 149/390 (38%), Positives = 192/390 (49%), Gaps = 85/390 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
+ NGLGL P MGWNSWNHFHC INE +I E+ DDCW D
Sbjct: 20 VDNGLGLVPQMGWNSWNHFHCDINETVIMESALAMVTSGLKDAGYRYVNIDDCWAVGRDD 79
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSL 117
+ + I A P+ FP+G K +ADYVH +LI
Sbjct: 80 NGV--------------IQADPI-------------AFPNGIKYIADYVHSLGLLIGIYT 112
Query: 118 DNRIES------NIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
D I + + +EQ GIDYLK D C P RY MS AL GR
Sbjct: 113 DAGILTCQKRPGSYGYEQIDAQTYASWGIDYLKMDWCNTYLENPQERYTIMSKALNATGR 172
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PIFFS+C WG P +W +I NSWRTT DI+D+W M I D+ + GGWNDP
Sbjct: 173 PIFFSLCNWGISEPWMWAMDIGNSWRTTGDIADTWTSMTVILDLQVPITSFSGVGGWNDP 232
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGNGGM EY+ HFS+W++ APL+ G D+ ++ T I+ EVIAVNQ
Sbjct: 233 DMLEVGNGGMTTTEYISHFSLWSLLSAPLIAGNDIRSIDNTTFSILTAMEVIAVNQDTLG 292
Query: 279 -------AIN--DQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----- 323
+IN DQQIWA PL+ G++ V+LL S+ + W DI + +
Sbjct: 293 RQGSLIKSINGKDQQIWAKPLADGSKAVILLNRNDNESATIQLQWGDIWESPSTSLIVRD 352
Query: 324 LFEHRTLETKFQGKLTA-KVDGHSCKMYEL 352
L+ + ++T F G TA + H C M +
Sbjct: 353 LWAQQDIDT-FTGSYTATNIPPHGCVMLRV 381
>gi|451333074|ref|ZP_21903661.1| Alpha-galactosidase precursor [Amycolatopsis azurea DSM 43854]
gi|449424437|gb|EMD29736.1| Alpha-galactosidase precursor [Amycolatopsis azurea DSM 43854]
Length = 650
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/386 (36%), Positives = 197/386 (51%), Gaps = 83/386 (21%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSC 67
TPPMGWNSWN F C INE++IRET DDCW E +R
Sbjct: 34 TPPMGWNSWNKFGCDINEQLIRETADAMVSSGMKAAGYQYVNIDDCWAEKNRTPD----- 88
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----------------- 110
G ++T FPSG KALADYVH
Sbjct: 89 ---------------------GKYEPHRTRFPSGIKALADYVHGKGLKLGIYTSAGTETC 127
Query: 111 -ILISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFS 167
+ SLD+ +++ E G+DYLKYDNC N+ RP + RY M +ALKK RPI ++
Sbjct: 128 ARTMPGSLDHEEVDARTFAEWGVDYLKYDNC-NNQGRPALERYTKMGEALKKTSRPIVYA 186
Query: 168 MCEWGDMRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
+CEWG +P WG N WRTT DI+D+WA ++ + D ++ PGGWNDPDMLE
Sbjct: 187 LCEWGQNKPWEWGRNAGAQLWRTTGDITDTWASVMNLLDQQVGLEAYSGPGGWNDPDMLE 246
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI------ 280
VGNGGM EY HF++W++ APLL G D+ +++ T I+ N++++AVNQ
Sbjct: 247 VGNGGMTDTEYRSHFALWSLLNAPLLAGNDLRSMSEATKKILLNKDLLAVNQDWGGKQGH 306
Query: 281 -----NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----E 331
D ++WA P+S VV+L +R S+ ++A +IG +++ + R L E
Sbjct: 307 KVRDDGDTEVWAKPMSDGSNVVVLFNRGGASATVSATAKEIGAPASSG-YRVRDLWSGTE 365
Query: 332 TKFQGKLTAKVDGHSCKMYELTPVTK 357
T+ G L A + H + + P +K
Sbjct: 366 TESAGTLRAGLPSHGSATFRVWPSSK 391
>gi|254392540|ref|ZP_05007718.1| truncated alpha-galactosidase [Streptomyces clavuligerus ATCC
27064]
gi|197706205|gb|EDY52017.1| truncated alpha-galactosidase [Streptomyces clavuligerus ATCC
27064]
Length = 596
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 146/402 (36%), Positives = 198/402 (49%), Gaps = 87/402 (21%)
Query: 8 AADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----------------- 50
AA +NGS R L +GL TPPMG+N+WN C I+EK+IR+T
Sbjct: 58 AARANGSPR---LPDGLARTPPMGFNNWNSTGCAIDEKMIRDTADLFVSTGLKAAGYEYV 114
Query: 51 --DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADY 108
DDCW RD+ G L + FPSG KALADY
Sbjct: 115 NVDDCWAAERRDAG--------------------------GRLTHHPVRFPSGIKALADY 148
Query: 109 VHI------------------LISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTI- 148
VH + SLD+ F E G+DYLKYDNC N+ RP +
Sbjct: 149 VHSKGLKFGIYTSAGTQTCAKTMPGSLDHEETDARTFAEWGVDYLKYDNC-NNQNRPALE 207
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
RY M DAL+ GRPI +S+CEWG +P WG ++ + WRTT DI+DSW ML + N
Sbjct: 208 RYTKMRDALRATGRPILYSICEWGQNKPWEWGRDVGHLWRTTGDINDSWQSMLGLFKANA 267
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
A +A PG WNDPDMLEVGNGGM EY HFS+W++ APLL+G D+ T T+ I+
Sbjct: 268 PLARYAGPGHWNDPDMLEVGNGGMTDTEYRSHFSLWSVMAAPLLIGTDLRKATPGTLDIL 327
Query: 269 GNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
+ EVIAV+Q + N + A PL+ + V L + S+ + ++G
Sbjct: 328 ADREVIAVDQDPLGVQATVVRSANGHWVLAKPLANGDLAVALFNETEQSAAIATTAAELG 387
Query: 318 IKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ + L++ T T G + A + H+ ++ +TP
Sbjct: 388 LPKRSGYAIRDLWQKNTSHTA--GAIGATLPAHATALFRITP 427
>gi|294815997|ref|ZP_06774640.1| Alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|326444339|ref|ZP_08219073.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|294328596|gb|EFG10239.1| Alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 716
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 146/402 (36%), Positives = 198/402 (49%), Gaps = 87/402 (21%)
Query: 8 AADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET----------------- 50
AA +NGS R L +GL TPPMG+N+WN C I+EK+IR+T
Sbjct: 66 AARANGSPR---LPDGLARTPPMGFNNWNSTGCAIDEKMIRDTADLFVSTGLKAAGYEYV 122
Query: 51 --DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADY 108
DDCW RD+ G L + FPSG KALADY
Sbjct: 123 NVDDCWAAERRDAG--------------------------GRLTHHPVRFPSGIKALADY 156
Query: 109 VH------------------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTI- 148
VH + SLD+ F E G+DYLKYDNC N+ RP +
Sbjct: 157 VHSKGLKFGIYTSAGTQTCAKTMPGSLDHEETDARTFAEWGVDYLKYDNC-NNQNRPALE 215
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
RY M DAL+ GRPI +S+CEWG +P WG ++ + WRTT DI+DSW ML + N
Sbjct: 216 RYTKMRDALRATGRPILYSICEWGQNKPWEWGRDVGHLWRTTGDINDSWQSMLGLFKANA 275
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
A +A PG WNDPDMLEVGNGGM EY HFS+W++ APLL+G D+ T T+ I+
Sbjct: 276 PLARYAGPGHWNDPDMLEVGNGGMTDTEYRSHFSLWSVMAAPLLIGTDLRKATPGTLDIL 335
Query: 269 GNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
+ EVIAV+Q + N + A PL+ + V L + S+ + ++G
Sbjct: 336 ADREVIAVDQDPLGVQATVVRSANGHWVLAKPLANGDLAVALFNETEQSAAIATTAAELG 395
Query: 318 IKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ + L++ T T G + A + H+ ++ +TP
Sbjct: 396 LPKRSGYAIRDLWQKNTSHT--AGAIGATLPAHATALFRITP 435
>gi|403356745|gb|EJY77975.1| hypothetical protein OXYTRI_00382 [Oxytricha trifallax]
Length = 396
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/397 (34%), Positives = 198/397 (49%), Gaps = 89/397 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+ TP MGWN+WN F C INE +I+++ DDCW RD
Sbjct: 19 LENGIAQTPQMGWNTWNKFDCGINETLIKQSADKIIELGLDKLGYKYVNIDDCWNAVERD 78
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISN 115
S+ G + A+ TFP G KA+ DY+H I +
Sbjct: 79 SE--------------------------GRMQADPKTFPGGMKAVGDYIHSKGLKYGIYS 112
Query: 116 SLDN-------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
S N I++ + G+DYLKYDNCFN++ T+RY AM DAL K GR
Sbjct: 113 SAGNFTCQGRAGSLNHEDIDAQTWADWGVDYLKYDNCFNENVPATVRYPAMRDALLKTGR 172
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
IF+S+C WG+ WG + NSWRTT DI D W + N+V+ D A PG WNDP
Sbjct: 173 NIFYSICNWGNEETFKWGPDTGNSWRTTMDIKDIWPSIQYNYRQNDVHYDIAHPGAWNDP 232
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLE+GNGG+ N+ HF++WA++K+PL+LGCD+ N+ ++ II N E+IA+NQ
Sbjct: 233 DMLEIGNGGLNQNQERTHFALWAVAKSPLILGCDLDNIPKDSFEIITNTELIAINQDPLG 292
Query: 279 --------------AINDQQIWAGPLSGNRIVVLLESRKTFS-SMMTAHWDDIGIKSNNT 323
+N Q++AGPL I V++ + S T ++ DI + + +
Sbjct: 293 KQAHCRVHCSSTDILVNRAQVYAGPLENGDIAVVVVNWGFLSLGAFTLNFSDIDLSATTS 352
Query: 324 -----LFEHRTLETKFQGKLTA-KVDGHSCKMYELTP 354
L+EH+ + F G + K + + + +TP
Sbjct: 353 ATIRDLWEHKDI-GDFTGTYSIDKFEPYQSYAFRITP 388
>gi|34765755|gb|AAQ82455.1| alpha-galactosidase [Petunia x hybrida]
Length = 285
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 168/311 (54%), Gaps = 75/311 (24%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MG S F C I+EK+IRET DDCW E +RDSQ
Sbjct: 1 MGGRSGILFGCYIDEKMIRETADAMVYTGLSSLGYKYINLDDCWAELNRDSQ-------- 52
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------IL 112
GN+V +TFPSG KALADYVH
Sbjct: 53 ------------------GNMVPKGSTFPSGIKALADYVHNKGLNLGIYSDAGTQTCSKE 94
Query: 113 ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ SL + + F G+DYLKYDNC N++ P RY MS AL+ +GR IF+S+CEW
Sbjct: 95 MPGSLGHEEQDAKTFASWGVDYLKYDNCNNENRSPRERYPTMSKALQNSGRAIFYSLCEW 154
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
GD PA W ++ NSWRTT DISD+W M + AD N+ +A +A PGGWNDPDMLEVGNGG
Sbjct: 155 GDDDPATWAFSVGNSWRTTGDISDNWDSMTSRADQNDKWASYAGPGGWNDPDMLEVGNGG 214
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
M EY HFSIWA++KAPL++GCD+ ++ T I+ N+ V AVNQ +
Sbjct: 215 MTTAEYRSHFSIWALAKAPLIIGCDIRSMDETTKEILSNKGVFAVNQDKLGVQGKKVKSD 274
Query: 281 NDQQIWAGPLS 291
+ ++WAGPLS
Sbjct: 275 SGLEVWAGPLS 285
>gi|300789642|ref|YP_003769933.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384153150|ref|YP_005535966.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399541523|ref|YP_006554185.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299799156|gb|ADJ49531.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340531304|gb|AEK46509.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398322293|gb|AFO81240.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 526
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 149/388 (38%), Positives = 192/388 (49%), Gaps = 83/388 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L N L TP +GWN WN F C +NE ++R+T DDCW E +RD
Sbjct: 32 LENNLARTPQLGWNDWNSFGCNVNETLVRQTADVMVSSGMAAAGYQYVNIDDCWSEWNRD 91
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILI 113
S GNLV + T FPSG KALADYVH I
Sbjct: 92 SS--------------------------GNLVPDHTKFPSGMKALADYVHGKGLKLGIYS 125
Query: 114 SN----------SLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAG 161
S SL N + ++N+ GIDYLKYDNC + R RY AM DAL K+G
Sbjct: 126 SAGTTTCQGYPASLGNEQRDANLWASWGIDYLKYDNCGDHQGRSGQDRYVAMRDALAKSG 185
Query: 162 RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWND 221
RPI F++C WG WG NSWRTT DI +W +L I D +A AKPG WND
Sbjct: 186 RPILFAICNWGGDSVNQWGPATGNSWRTTWDIQGNWGSVLGILDAQPGWAGLAKPGAWND 245
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDMLEVGN G+ E HFS+WA+ APLL G D+ ++A T I+ N EVI VNQ
Sbjct: 246 PDMLEVGN-GLSDTEARAHFSLWALLNAPLLAGNDLRTMSAATRSILTNTEVIGVNQDWG 304
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LF 325
D ++W P++ + V+L +R + +S ++ IG+ S ++ L+
Sbjct: 305 GRQGNRIADYGDTEVWTKPMANGSMAVVLLNRGSGTSTISTSASQIGLGSASSYSVRDLW 364
Query: 326 EHRTLETKFQGKLTAKVDGHSCKMYELT 353
H T T G ++A V GH MY +T
Sbjct: 365 AHSTGTTS--GAISASVPGHGAAMYVVT 390
>gi|297197804|ref|ZP_06915201.1| alpha-galactosidase [Streptomyces sviceus ATCC 29083]
gi|297146873|gb|EDY60641.2| alpha-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 675
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 140/372 (37%), Positives = 191/372 (51%), Gaps = 49/372 (13%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
L++GL LTPPMG+N+WN HC + NE +++ D + + L + + + +L
Sbjct: 46 LADGLALTPPMGFNNWNSTHCRAEFNESMVKGIADIFVDKG------LKDAGYQYVNLDD 99
Query: 78 IAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF---- 127
A LPN+ G LV + FP+G KA+ADYVH + I S + F
Sbjct: 100 CWA--LPNRDADGRLVPDPVRFPNGIKAVADYVHSKGLKLGIYTSAGTKTCDGAGFPGAL 157
Query: 128 -----------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ G+DYLKYDNC N +RY M DALK GRPI +S+CEWG +P
Sbjct: 158 GHEYSDAQQFADWGVDYLKYDNCNNQGVDAKLRYTTMRDALKATGRPIVYSICEWGQNKP 217
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
W S++ + WRTT DISDSW ML+I N A HA PG WNDPDMLEVGNGGM E
Sbjct: 218 WEWASDVGHLWRTTGDISDSWGSMLSILKQNLPLAPHAGPGHWNDPDMLEVGNGGMTDTE 277
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIV 296
Y HFS+W++ APLL+G D+ +AET I+ N+EVIAV+Q +Q G R V
Sbjct: 278 YRSHFSMWSVMAAPLLIGSDLRTASAETFEILSNKEVIAVDQDPLGKQGTVVSSEGGRWV 337
Query: 297 VLLE----------------SRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTA 340
V E R+ ++ D + L++HR+ T G ++A
Sbjct: 338 VAKEMKDGSRTVALFNESGSPRRIATTAAAVGLPDADAYTLRDLWQHRSYNTA--GTISA 395
Query: 341 KVDGHSCKMYEL 352
V H + +
Sbjct: 396 TVPAHGTVLVRV 407
>gi|388513843|gb|AFK44983.1| unknown [Lotus japonicus]
Length = 229
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 115/223 (51%), Positives = 148/223 (66%), Gaps = 19/223 (8%)
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
M DAL G IF+S+CEWG PALW + NSWRTTDDI+D+WA M TIAD+N+ +A
Sbjct: 1 MRDALGATGVKIFYSICEWGVDDPALWAGTVGNSWRTTDDINDTWASMTTIADLNDKWAS 60
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+A PGGWNDPDMLEVGNGGM Y EY HFSIWA++KAPLL+GCD+ N+TAET+ II N+E
Sbjct: 61 YAGPGGWNDPDMLEVGNGGMTYQEYRAHFSIWALAKAPLLIGCDIRNMTAETLEIISNKE 120
Query: 273 VIAVNQ------------AIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
VIA+NQ A D Q+WAGPLSG+R VV L +R + + +TA W+ +GI
Sbjct: 121 VIAINQDSLGVQGRKVQVAGKDGCSQVWAGPLSGDRWVVALWNRCSKVANITASWEALGI 180
Query: 319 KSN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
+S L++H+ + +A+VD H +Y TP T
Sbjct: 181 ESGVYVTVRDLWQHKVVTEDAVSSFSAQVDTHDSHLYIFTPST 223
>gi|307105143|gb|EFN53394.1| hypothetical protein CHLNCDRAFT_26028 [Chlorella variabilis]
Length = 446
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 145/374 (38%), Positives = 189/374 (50%), Gaps = 48/374 (12%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGL P MG+N+WN F I+EK IR T D + L + +++ +L
Sbjct: 23 NGLGLRPAMGFNTWNAFRTNIDEKSIRATADL------IVSMGLRDAGYVYLNLDD-GWS 75
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLD-NRIES 123
G L ANK FPSG KALADYVH SL +++
Sbjct: 76 ERNRTADGRLAANKARFPSGIKALADYVHGKGLKFGIYGDAGSMTCAKYPGSLGYEEVDA 135
Query: 124 NIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
E G+DYLKYDNC+ + I RY AM DAL GRPI +S+C+WG P LW S
Sbjct: 136 QTFAEWGVDYLKYDNCYAKQEQWVIDRYAAMRDALNATGRPILYSLCDWGVADPWLWASE 195
Query: 183 IRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFS 242
I NSWRTT+DI W +L I D N A A+ G WNDPDMLEVGNG + E HFS
Sbjct: 196 IGNSWRTTEDIEPRWESILKILDYNTGLARFARKGAWNDPDMLEVGNGQLTSGEQRAHFS 255
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---AINDQQIW--------AGPLS 291
+WA+ KAPLL+G D+ + +++ I+ EVIAVNQ + IW A PL+
Sbjct: 256 LWALMKAPLLIGADLRTIHPDSLAILKAREVIAVNQDDLGVAGDLIWMQGANRVYAVPLA 315
Query: 292 -GNRIVVLLESRKT----FSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAK 341
G R VV L T +S +T +W +G+ + + LF + L + G TA
Sbjct: 316 GGGRAVVFLNLHTTGGQYLTSNITVYWGQLGLPAGRSALVRDLFAEQDL-GEHTGSFTAA 374
Query: 342 VDGHSCKMYELTPV 355
V H + + PV
Sbjct: 375 VQAHDVVVVRIMPV 388
>gi|347736013|ref|ZP_08868755.1| Alpha-galactosidase [Azospirillum amazonense Y2]
gi|346920613|gb|EGY01650.1| Alpha-galactosidase [Azospirillum amazonense Y2]
Length = 401
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 143/378 (37%), Positives = 193/378 (51%), Gaps = 52/378 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+NGL LTPPMGWN+WN F C INE +IR+ D S + + D A
Sbjct: 34 ANGLALTPPMGWNTWNKFACDINEDVIRKAADAMVSS---GMAGAGYKYVVIDDCWHGA- 89
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIES 123
G++ + FPSG KAL DY+H + S+ SL + +
Sbjct: 90 ----RDAHGDMQPDPVRFPSGLKALGDYIHSKGLKFGIYSDAGKMTCGKRPGSLGHEYQD 145
Query: 124 NIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
+ G+DYLKYD C+ Y MSDAL+ +GR I FSMCEWG +P LW
Sbjct: 146 AAQYAAWGVDYLKYDWCYTGTLDAKAAYTLMSDALRASGRDIVFSMCEWGTAKPWLWAQA 205
Query: 183 IRNSWRTTDDISDSW----ARMLTIADMNEVYAD---HAKPGGWNDPDMLEVGNGGMKYN 235
+ N WR+T DI D W L + D+ ++ AD +A PG WNDPDMLEVGNGGM
Sbjct: 206 VGNLWRSTGDIYDGWQGKKGYSLGVMDILDLEADLYPYAGPGHWNDPDMLEVGNGGMTDT 265
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQ 284
EY HFS+WA+ APL+ G D+ +++ T I+ N EVIAV+Q D +
Sbjct: 266 EYRAHFSLWALLAAPLIAGNDLTSMSPATKAILTNAEVIAVDQDALGAQGRRVAKQGDVE 325
Query: 285 IWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKL 338
+WA PL SG+R VVLL +R T + +T +W D+G + L+ + L T +G
Sbjct: 326 VWARPLASGDRAVVLL-NRGTTPARITVNWADLGYPAKLSAKVRDLWLAKDLGTS-KGGY 383
Query: 339 TAKVDGHSCKMYELTPVT 356
TA+V H M + P
Sbjct: 384 TAEVAPHGVVMVTVRPAV 401
>gi|242044154|ref|XP_002459948.1| hypothetical protein SORBIDRAFT_02g018580 [Sorghum bicolor]
gi|241923325|gb|EER96469.1| hypothetical protein SORBIDRAFT_02g018580 [Sorghum bicolor]
Length = 307
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 175/338 (51%), Gaps = 82/338 (24%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW +RDSQ GNLV +TFPSG KAL+DYVH
Sbjct: 17 DDCWAAYNRDSQ--------------------------GNLVPKPSTFPSGMKALSDYVH 50
Query: 111 I----------LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYR 151
S + ++ ++ E+ GIDYLKYDNC Y+
Sbjct: 51 GKGLKLGIYSDAGSRTCSQQMPGSLGHEEQDAKTFASWGIDYLKYDNC---------NYQ 101
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
+S + G P A W S++ NSWRT DI D+W M + AD N+ +A
Sbjct: 102 GLSPQPRGVGNP-------------ATWASSLGNSWRTAGDIKDNWGSMASNADSNDYWA 148
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
+A PGGWNDPDMLE+GNGGM EY HFSIWA++KAPLL GCD+ +++ +T I+ N+
Sbjct: 149 KYAGPGGWNDPDMLEIGNGGMTTEEYRSHFSIWALAKAPLLTGCDIRSMSKDTKDILSNQ 208
Query: 272 EVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
VIAVNQ DQ++WAGPLSG R+ V+L +R + +TA W IG+ +
Sbjct: 209 NVIAVNQDALGVQGLKVQKNGDQEVWAGPLSGGRVAVVLWNRGPTQASITASWSSIGLST 268
Query: 321 NNTLFEH----RTLETKFQGKLTAKVDGHSCKMYELTP 354
+ + H + QG+L A V+ H+CKMY LTP
Sbjct: 269 SAVVDAHDLWTGEATSSVQGELKATVETHACKMYVLTP 306
>gi|345011330|ref|YP_004813684.1| glycoside hydrolase clan GH-D [Streptomyces violaceusniger Tu 4113]
gi|344037679|gb|AEM83404.1| glycoside hydrolase clan GH-D [Streptomyces violaceusniger Tu 4113]
Length = 409
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 144/397 (36%), Positives = 196/397 (49%), Gaps = 94/397 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC--QINEKIIR-------------------ETDDCWGESS 58
L NGL TPPMG+N+WN HC NE +++ DDCW E S
Sbjct: 42 LDNGLARTPPMGFNNWNATHCDASFNESMVKGVADAFVSLGLKDVGYTYVNIDDCWAEPS 101
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------I 111
RD+ GNLV + T FP+G KA+ADYVH I
Sbjct: 102 RDAS--------------------------GNLVPHHTRFPNGIKAVADYVHAKGLKFGI 135
Query: 112 LIS-------------NSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
S +LD+ + F G+DYLKYDNC N RY+AM DAL
Sbjct: 136 YTSAGTKTCNAANGFPGALDHEKQDAKQFASWGVDYLKYDNCNNQGVDAQKRYKAMRDAL 195
Query: 158 KKAGRPIFFSMCEWGDMRPALW--GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAK 215
GRPI +S+CEWG P +W G + NSWRTT DISD W+ M+ +N+ A +A
Sbjct: 196 AATGRPILYSLCEWGQTTPKVWTWGEPVGNSWRTTGDISDKWSSMIDKLHINDDLAQYAG 255
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDMLEVGNGGM EY HFS+WA+ APLL+G D+ N +A I+ N ++IA
Sbjct: 256 PGHWNDPDMLEVGNGGMTATEYRTHFSLWAMMAAPLLIGSDIRNASATAQSILKNTDIIA 315
Query: 276 VNQAINDQQ-----------IWAGPLS-GNRIVVLL---ESRKTFSSMMTAHWDDIGIKS 320
++Q +Q ++ L+ G+ V LL S KT S+ +A IG+
Sbjct: 316 LDQDTLGKQATIVSSSGGKTVYTKQLANGDHAVALLNENSSSKTISTTASA----IGLGG 371
Query: 321 NNTLFEHRTLETKF----QGKLTAKVDGHSCKMYELT 353
+++ + + L +K G ++A V H +Y ++
Sbjct: 372 SSS-YRLKDLWSKATSTTSGSISASVPAHGTVVYRVS 407
>gi|374984776|ref|YP_004960271.1| alpha-galactosidase [Streptomyces bingchenggensis BCW-1]
gi|297155428|gb|ADI05140.1| alpha-galactosidase [Streptomyces bingchenggensis BCW-1]
Length = 666
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 139/381 (36%), Positives = 192/381 (50%), Gaps = 47/381 (12%)
Query: 11 SNGSDRRNLLSNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCS 68
S G++ +GL LTPPMG+N+WN HC + NE ++++ D + L +
Sbjct: 28 STGANAAAAPGDGLALTPPMGFNNWNSTHCRAEFNEAMVKDIADIF------VTKGLKEA 81
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIES 123
+ + +L A P G LV + FP+G KA+ADYVH I S + +
Sbjct: 82 GYQYVNLDDCWALPQ-RDADGKLVPDPERFPNGIKAVADYVHSKGLKFDIYTSAGTKTCN 140
Query: 124 NIAF---------------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
+ F + G+DYLKYDNC N RY AM DAL+ GRPI +S+
Sbjct: 141 SAGFPGALGHERSDAQQFADWGVDYLKYDNCNNQGVDAKQRYLAMRDALQATGRPIVYSI 200
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
CEWG +P W +++ WRTT DISDSW ML+IA N +A PG WNDPDMLEVG
Sbjct: 201 CEWGQNKPWEWAADVGQLWRTTGDISDSWGSMLSIAKKNLPLTPYAGPGHWNDPDMLEVG 260
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
NGGM EY HFS+W+I APLL+G D+ T ET I+GN EVIAV+Q +Q
Sbjct: 261 NGGMTPTEYRSHFSLWSIMAAPLLIGTDLRKATPETYEILGNREVIAVDQDGLGKQGQVL 320
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLET 332
+ A L+G V L + + +G+ + L+EHR+ T
Sbjct: 321 SSGGGGWVIAKELAGGDRAVALFNETDRPRHIATTAAALGLPEADGYQLRDLWEHRSYNT 380
Query: 333 KFQGKLTAKVDGHSCKMYELT 353
G + A V H ++ ++
Sbjct: 381 A--GDVAATVPAHGTALFRVS 399
>gi|386838616|ref|YP_006243674.1| alpha-galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374098917|gb|AEY87801.1| alpha-galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451791907|gb|AGF61956.1| alpha-galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 661
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 138/371 (37%), Positives = 186/371 (50%), Gaps = 47/371 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
L++GL LTPPMG+N+WN HC + N ++ D + E L + + + +L
Sbjct: 32 LADGLALTPPMGFNNWNSTHCRAEFNASMVEGIADLFVEKG------LKDAGYQYVNLDD 85
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF----- 127
A P G LV + FP G KA+ADYVH + I S + + F
Sbjct: 86 CWALP-DRDANGKLVPDPARFPDGIKAVADYVHAKGLKLGIYTSAGTKTCDSTGFPGALG 144
Query: 128 ----------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
+ G+DYLKYDNC N RYR M DALK GRPI +S+CEWG +P
Sbjct: 145 HEYSDARQFADWGVDYLKYDNCNNQGVDAQQRYRTMRDALKATGRPIVYSICEWGQNKPW 204
Query: 178 LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
W S++ + WRTT DISDSW+ ML+I N A +A PG WNDPDMLEVGNGGM EY
Sbjct: 205 EWASDVGHLWRTTGDISDSWSSMLSILKQNLPLAPYAGPGHWNDPDMLEVGNGGMTDTEY 264
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVV 297
HFS+W++ APLL+G D+ + T I+GN+EVIAV+Q +Q G R VV
Sbjct: 265 RSHFSLWSVMAAPLLIGTDLRKASPATFDILGNKEVIAVDQDPLGKQGTVVSSGGGRWVV 324
Query: 298 LLE----------------SRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTAK 341
E +++ +S D + L++HRT T G + A
Sbjct: 325 AKEMKDGSRAVALFNESGTAQRIATSASAVGLPDADAYTLRDLWQHRTYNTA--GTIAAT 382
Query: 342 VDGHSCKMYEL 352
V H + +
Sbjct: 383 VPAHGTVLLRV 393
>gi|371778348|ref|ZP_09484670.1| alpha-galactosidase [Anaerophaga sp. HS1]
Length = 395
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/356 (38%), Positives = 181/356 (50%), Gaps = 52/356 (14%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIA 79
++GL LTPPMGWNSWN F C ++E II+E + S RD+ F + D QI
Sbjct: 22 NDGLALTPPMGWNSWNKFGCNVDETIIKEMAEAMVNSGMRDA----GYEFIVIDDCWQIG 77
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----------ISNSLDNR--------I 121
L GN++ + FPSG KAL DY+H L S + R
Sbjct: 78 RDSL-----GNIIPDSERFPSGMKALGDYIHSLGLKFGIYSCAGSKTCQGRPGSRGYQFQ 132
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++ E G+DYLKYD C+N+ Y+ MSDALK GRPI FS+CEWGD P WG
Sbjct: 133 DARTYAEWGVDYLKYDWCYNEGQNAEAAYKTMSDALKATGRPIVFSICEWGDNEPWKWGK 192
Query: 182 NIRNSWRTTDDISD------SWARM--LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
I + WRTT DI D W + + I D +A PG WNDPDMLEVGNGGM
Sbjct: 193 GIGHLWRTTADIRDCFQCQFDWGGLGVMDIIDRQAELYPYAGPGHWNDPDMLEVGNGGMT 252
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
Y EY HFS+WA+ APL+ G D+ N+ +T I+ N +VI++NQ + D
Sbjct: 253 YTEYKTHFSMWAMLAAPLMAGNDLRNMDRQTREILTNLDVISINQDELGQQARRFMDMGD 312
Query: 283 QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKL 338
+IWA PL+ + V +R + + +W K + F K+Q K+
Sbjct: 313 YEIWAKPLTKGEVAVCFLNRSSEVWKLAYNW-----KQHTMYFVKDVNLHKYQYKI 363
>gi|134103490|ref|YP_001109151.1| alpha-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291005966|ref|ZP_06563939.1| putative alpha-galactosidase [Saccharopolyspora erythraea NRRL
2338]
gi|133916113|emb|CAM06226.1| putative alpha-galactosidase [Saccharopolyspora erythraea NRRL
2338]
Length = 392
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 137/361 (37%), Positives = 194/361 (53%), Gaps = 44/361 (12%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNSWN F C I+E++I ET D S + + + + ++ P
Sbjct: 36 TPPMGWNSWNKFACDIDERLITETADAMVASG------MKDAGYTYVNIDDCWMAP-ERD 88
Query: 87 CQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNR-IESNIAFE 128
+G L A+ FPSG KALAD+VH + SLD+ I++ E
Sbjct: 89 AEGRLQADPHRFPSGIKALADHVHAKGLKLGIYSSAGTKTCQGLPASLDHEEIDARSFAE 148
Query: 129 QGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS- 186
G+DYLKYDNC N++ RP + RY MS+AL+ GR I +S+CEWG+ P G ++
Sbjct: 149 WGVDYLKYDNC-NNEGRPAVERYSEMSEALRATGRRIVYSICEWGENDPWNRGRDVGGHL 207
Query: 187 WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAI 246
WRTT DISDSW+ M ++ D H+ PGGWNDPDMLEVGNGGM EY HFS+WA+
Sbjct: 208 WRTTGDISDSWSSMTSLLDQQVGIEQHSGPGGWNDPDMLEVGNGGMTDTEYRAHFSLWAL 267
Query: 247 SKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRI 295
APLL G D+ ++ T I+ N E+IA+NQ +Q +WA P+S +
Sbjct: 268 LNAPLLAGNDLRSMDEPTARILLNPELIAINQDWGGKQGYRVRDDGETEVWAKPVSDGSV 327
Query: 296 VVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKFQGKLTAKVDGHSCKMYE 351
V+L +R ++A +++G+ + + R L E++ G+L A V H Y
Sbjct: 328 AVVLFNRDGQERRISATTEEVGLPGADR-YRVRDLWTGDESQNDGELGASVPSHGAVAYR 386
Query: 352 L 352
+
Sbjct: 387 V 387
>gi|452959430|gb|EME64770.1| alpha-galactosidase [Amycolatopsis decaplanina DSM 44594]
Length = 650
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 137/380 (36%), Positives = 190/380 (50%), Gaps = 83/380 (21%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGWNSWN F C I E++IRET DDCW E +R
Sbjct: 37 MGWNSWNKFGCDITEELIRETADAMVSSGMKAAGYQYVNIDDCWAEKNRTPD-------- 88
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------IL 112
G ++T FPSG KALADYVH
Sbjct: 89 ------------------GKYEPHRTRFPSGIKALADYVHGKGLKLGIYTSAGTETCART 130
Query: 113 ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCE 170
+ SLD+ +++ + G+DYLKYDNC N RP + RY M +ALKK GRPI +++CE
Sbjct: 131 MPGSLDHEEVDAQTFADWGVDYLKYDNCHNQG-RPALERYTKMGEALKKTGRPIVYALCE 189
Query: 171 WGDMRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
WG+ +P WG WRTT DISD+W+ M + D +A PGGWNDPDMLEVGN
Sbjct: 190 WGENKPWEWGKAAGAQLWRTTGDISDTWSSMTNLLDQQVGLEGYAGPGGWNDPDMLEVGN 249
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI--------- 280
GGM EY HF++W++ APLL G D+ +++ T I+ N++++AV+Q
Sbjct: 250 GGMTDAEYRSHFALWSLLNAPLLAGNDLRSMSEATKKILLNKDLLAVDQDWGGEQGRKIR 309
Query: 281 --NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKF 334
D ++WA P+S VV+L +R + ++A +IG + + + R L ET+
Sbjct: 310 DDGDTEVWAKPMSDGSEVVVLFNRGRAPATLSATTGEIGAPAASG-YRVRDLWTGAETES 368
Query: 335 QGKLTAKVDGHSCKMYELTP 354
G L A V GH ++ + P
Sbjct: 369 AGTLRAGVPGHGSAVFRVWP 388
>gi|374984855|ref|YP_004960350.1| secreted alpha-galactosidase [Streptomyces bingchenggensis BCW-1]
gi|297155507|gb|ADI05219.1| secreted alpha-galactosidase [Streptomyces bingchenggensis BCW-1]
Length = 415
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 145/404 (35%), Positives = 199/404 (49%), Gaps = 95/404 (23%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHC--QINEKIIR-------------------ETD 51
G+ + L NGL TPPMG+N+WN HC NE +++ D
Sbjct: 42 GAGQAAALDNGLLRTPPMGFNNWNATHCDSSFNESMVKGVADAFVSLGLKDVGYEYVNID 101
Query: 52 DCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH- 110
DCW E+SRDS GNLV ++T FP+G KA+ADYVH
Sbjct: 102 DCWAETSRDSD--------------------------GNLVPDRTRFPNGIKAVADYVHS 135
Query: 111 ----ILISNSLDNRIESNIAFEQG---------------IDYLKYDNCFNDDTRPTIRYR 151
I S + ++ F G +DYLKYDNC N RY+
Sbjct: 136 KGLKFGIYTSAGTKTCNSAGFPGGLNHEKQDAKQFASWGVDYLKYDNCNNQGVDAQQRYK 195
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALW--GSNIRNSWRTTDDISDSWARMLTIADMNEV 209
AM DAL +GR I +S+CEWG P +W G + N WRTT DISD W+ M+ +N+
Sbjct: 196 AMRDALAASGRKIAYSICEWGQTSPKVWTWGEPVGNLWRTTGDISDKWSSMIDKVHINDD 255
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A +A PG WNDPDMLEVGNGGM EY HFS+WA+ APLL+G D+ + +A T I+
Sbjct: 256 LAQYAGPGHWNDPDMLEVGNGGMTATEYRTHFSLWAMMAAPLLIGSDIRDASATTQSILK 315
Query: 270 NEEVIAVNQAINDQQ-----------IWAGPLS-GNRIVVLL---ESRKTFSSMMTAHWD 314
N ++IAV+Q +Q ++ L+ G+R V LL S KT S+ TA
Sbjct: 316 NTDLIAVDQDTLGKQATIVSSSGGKVVYTKQLANGDRAVALLNENSSAKTISTTATA--- 372
Query: 315 DIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
IG+ +++ L+ T T G ++A V H ++ ++
Sbjct: 373 -IGLGGSSSYGLKDLWSKATSTT--TGSISASVPAHGTVVFRVS 413
>gi|330793725|ref|XP_003284933.1| hypothetical protein DICPUDRAFT_86485 [Dictyostelium purpureum]
gi|325085149|gb|EGC38562.1| hypothetical protein DICPUDRAFT_86485 [Dictyostelium purpureum]
Length = 385
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 138/375 (36%), Positives = 199/375 (53%), Gaps = 47/375 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
++NGLGLTP MGWNSWN++ C INE +I T + S+ ++ + + + ++
Sbjct: 19 INNGLGLTPQMGWNSWNYYACDINETVIMNT------ALAMSKNGMAAAGYKYVNIDDCW 72
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRI 121
A L G ++ + FP+G K +ADY+H +LI D I
Sbjct: 73 A--LERASNGTVIPDPKAFPNGIKYVADYIHSLGLLIGIYTDAGLYTCQKRPGSYGFEEI 130
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++ E GIDYLK D C++ P RY+ MS++L GR IFFS+C+WG P +G
Sbjct: 131 DAITYAEWGIDYLKEDWCYSFLENPQERYQIMSNSLNATGRQIFFSLCDWGTDNPWTFGG 190
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
I NSWRTT DI D+W M+ + ++ GGWNDPDMLEVGNGGM EY+ HF
Sbjct: 191 AIANSWRTTPDIKDNWDSMMANLMAQASISSYSGVGGWNDPDMLEVGNGGMTNTEYISHF 250
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND--QQIWAG 288
S+W+I APL+ G ++ ++ ET+ I+ EVIAVNQ + N QQIWA
Sbjct: 251 SLWSILNAPLIAGNNLIDIDQETLSILTATEVIAVNQDPLGVQGALVKSYNGGLQQIWAK 310
Query: 289 PLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT------LFEHRTLETKFQGKLTAK 341
P++ G+R VVL + T + +T +W DI + L++ L T ++
Sbjct: 311 PMADGSRAVVLFNT-DTNPATITLNWADIWVAPVTQQLVVRDLWQQSNLGTFATTFVSEV 369
Query: 342 VDGHSCKMYELTPVT 356
+ H C M +LTP T
Sbjct: 370 IPPHGCVMLKLTPAT 384
>gi|94968704|ref|YP_590752.1| alpha-galactosidase [Candidatus Koribacter versatilis Ellin345]
gi|94550754|gb|ABF40678.1| Alpha-galactosidase [Candidatus Koribacter versatilis Ellin345]
Length = 408
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 148/384 (38%), Positives = 188/384 (48%), Gaps = 64/384 (16%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N TPPMGWN+WN F C ++EK+IR D S + + D Q++
Sbjct: 38 NAYAPTPPMGWNTWNKFACNVDEKLIRGAADALVSSGMKDA---GYHYLVIDDCWQVS-- 92
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFE--QG--------- 130
+GN+VA+ FPSG KALA YVH L I S+I + QG
Sbjct: 93 ---RDGKGNIVADPQRFPSGIKALAAYVH---HKGLKFGIYSDIGTKTCQGRPGSRGHEF 146
Query: 131 ----------IDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
+DYLK D C Y+ M DA+ GRPI S+CEWG +P LW
Sbjct: 147 QDALQYAAWDVDYLKLDWCNTGTQNAPASYKTMRDAIDATGRPIVLSICEWGKSKPWLWA 206
Query: 181 SNIRNSWRTTDDISDSWA-------------RMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+ N WRTTDDI D WA ML I D A +A PG WNDPDMLEV
Sbjct: 207 KDTGNLWRTTDDIQDRWAGKKKWDEASCCSNGMLDILDEEADIASYAGPGHWNDPDMLEV 266
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
GNGGM EY HFS+WAI APL+ G D+ N+ E I+ N+EVIAV+Q
Sbjct: 267 GNGGMTNVEYRSHFSLWAILAAPLMAGNDLTNMRPEIREILTNKEVIAVDQDPLGKQGVR 326
Query: 279 -AIN-DQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTL 330
A N D +IW+ L+ G+R VLL +R +TA W D+G + L+ H+ L
Sbjct: 327 AAKNGDLEIWSKVLADGSRAAVLL-NRSASEQEITARWIDLGYPATLSVRVRDLWAHKDL 385
Query: 331 ETKFQGKLTAKVDGHSCKMYELTP 354
+ F+ K +AKV H M L P
Sbjct: 386 GS-FKDKFSAKVPSHGVVMIHLQP 408
>gi|320163986|gb|EFW40885.1| alpha-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 404
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 143/371 (38%), Positives = 196/371 (52%), Gaps = 46/371 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGLG TP MG+N+WNHF C I+E +IR T D + LS + + +L
Sbjct: 43 LDNGLGRTPQMGFNTWNHFGCNISEALIRSTVDVMVATG------LSAVGYKYINLDDCW 96
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRI 121
A + G +VA+ FPSG ALA YVH +L D +I
Sbjct: 97 A--VNRTAAGVIVADPVAFPSGIAALASYVHSKGMLFGLYSDAGTKTCAGRPGSVGYEKI 154
Query: 122 ESNIAFEQGIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
++ G+DYLKYDNC D P +RY AM DAL GRPIF+SMC+ D P+ W
Sbjct: 155 DAQTYAAWGVDYLKYDNCNAPADQTPQVRYNAMRDALNATGRPIFYSMCDSID-DPSAWA 213
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ NSWRT DISDSW+ ++ I D NE A PGGWNDPD+LEVGNGG+ EY H
Sbjct: 214 KPVVNSWRTASDISDSWSSIMKIVDKNEPLWKIAGPGGWNDPDVLEVGNGGLSTTEYTSH 273
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ----------QIWAGP 289
F++WA+ KAPL+ GCDV +T +T+ I+ N EVI NQ ++ Q ++W P
Sbjct: 274 FTLWALMKAPLIFGCDVTKMTNDTLRILTNTEVIEWNQDSLGIQGHRVKRGLLSEVWMAP 333
Query: 290 LSGNRI-VVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVD 343
L+ R VVL + S+ MT W ++G+ + + +++H+ + + G A+V
Sbjct: 334 LANGRFAVVLFNTDPLLSAHMTVTWAELGLTAGSKYTARDVWQHKNVGL-YAGTFDAQVA 392
Query: 344 GHSCKMYELTP 354
L P
Sbjct: 393 PKGVVAVTLVP 403
>gi|302540812|ref|ZP_07293154.1| alpha-galactosidase [Streptomyces hygroscopicus ATCC 53653]
gi|302458430|gb|EFL21523.1| alpha-galactosidase [Streptomyces himastatinicus ATCC 53653]
Length = 686
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 157/283 (55%), Gaps = 33/283 (11%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ--INEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+GL LTPPMG+N+WN HC+ NE +++ D + L + + + +L
Sbjct: 55 DGLALTPPMGFNNWNSTHCRADFNEAMVKGIADIF------VSKGLKDAGYTYVNLDDCW 108
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF------- 127
A+P G LV + FP G KA+ADYVH I I S + + F
Sbjct: 109 ARPQ-RDANGKLVPDPVRFPKGIKAVADYVHSKGLKIGIYTSAGTKTCNTAGFPGALGHE 167
Query: 128 --------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKA----GRPIFFSMCEWGDMR 175
+ GIDYLKYDNC N RYR M DALK A GRPI +S+CEWG +
Sbjct: 168 RSDAQQFADWGIDYLKYDNCNNQGVDAKQRYRTMRDALKAASETTGRPIVYSICEWGQNK 227
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
P W +++ WRTT DISDSW ML+I N A +A PG WNDPDMLEVGNGGM
Sbjct: 228 PWEWAADVGQLWRTTGDISDSWGSMLSITKKNLPLAPYAGPGHWNDPDMLEVGNGGMTDT 287
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
EY HFS+W++ APLL+G D+ A+T I+GN EVIAV+Q
Sbjct: 288 EYRSHFSLWSVMAAPLLIGTDLRKADAKTFEILGNREVIAVDQ 330
>gi|300783843|ref|YP_003764134.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384147082|ref|YP_005529898.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399535726|ref|YP_006548389.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299793357|gb|ADJ43732.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340525236|gb|AEK40441.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398316496|gb|AFO75443.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 525
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 140/389 (35%), Positives = 185/389 (47%), Gaps = 84/389 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL TPPMGWN+WN F C INE ++++T DDCW SRD
Sbjct: 24 LENGLARTPPMGWNTWNTFECNINETLVKQTTDLMVSSGMRDRGYTYVNLDDCWMTRSRD 83
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S GNLVA+ FPSG KAL DY+H
Sbjct: 84 SD--------------------------GNLVADPAKFPSGLKALGDYIHTRGMKFGIYE 117
Query: 113 ---------ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPT----IRYRAMSDALK 158
SL + + ++N G+DYLKYDNCFN+ + RY AM DAL+
Sbjct: 118 SAGTMTCQSYPGSLGHEQADANRFASWGVDYLKYDNCFNNGSNSQEDYIRRYSAMRDALR 177
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGG 218
GRPI +S+CEWGD PA W +++ N WRTT DI+++W + I N A AKPG
Sbjct: 178 ATGRPIAYSICEWGDFAPATWAADVGNLWRTTGDITNNWGTVDAIYRQNVGLASAAKPGA 237
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDMLEVG+ GM + E HF++WA APL+ G D+ + + T N +VIAV+Q
Sbjct: 238 WNDPDMLEVGD-GMDFQEDRAHFTLWAAMAAPLIAGADLRSASVATFSTYLNSDVIAVDQ 296
Query: 279 AINDQQ-----------IWAGPLSGNRIVVLL----ESRKTFSSMMTAHWDDIGIKSNNT 323
+Q + A PL G + V+L S +T S+ A T
Sbjct: 297 DSLGKQARRIATGDGMDVLAKPLRGGDVAVVLFNENGSARTISTTAAAAGLPSASSYRLT 356
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+ L T G ++A V HS +Y +
Sbjct: 357 NLWSKELTTS-TGGISANVPSHSTVIYRV 384
>gi|345013258|ref|YP_004815612.1| glycoside hydrolase [Streptomyces violaceusniger Tu 4113]
gi|344039607|gb|AEM85332.1| Glycosyl hydrolase family 98 putative carbohydrate binding module
[Streptomyces violaceusniger Tu 4113]
Length = 685
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/305 (42%), Positives = 161/305 (52%), Gaps = 33/305 (10%)
Query: 22 NGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+GL LTPPMG+N+WN HC + NE +++ D + L + + + +L
Sbjct: 54 DGLALTPPMGFNNWNSTHCRAEFNEAMVKGIADIF------VAKGLKDAGYQYVNLDDCW 107
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF------- 127
A P G LV + FP+G KA+ADY H I I S + + F
Sbjct: 108 ALPQ-RDADGKLVPDPVRFPNGIKAVADYAHSKGLKIGIYTSAGTKTCNTAGFPGALGHE 166
Query: 128 --------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKA----GRPIFFSMCEWGDMR 175
+ GIDYLKYDNC N RYRAM DALK A G PI +S+CEWG+ +
Sbjct: 167 KSDAQQFADWGIDYLKYDNCNNQGVDAKQRYRAMRDALKAASESTGHPIVYSICEWGENK 226
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
P W WRTT DISDSW ML+IA N A +A PG WNDPDMLEVGNGGM
Sbjct: 227 PWEWAGEFGQLWRTTGDISDSWGSMLSIAKQNLPLAQYAGPGHWNDPDMLEVGNGGMTDT 286
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRI 295
EY HFS+W+I APLL+G D+ T ET I+GN EVI V+Q +Q G R
Sbjct: 287 EYRSHFSLWSIMAAPLLIGTDLRKATDETYEILGNREVIGVDQDPLGKQGTVVSSEGGRW 346
Query: 296 VVLLE 300
V+ E
Sbjct: 347 VIAKE 351
>gi|399029623|ref|ZP_10730424.1| alpha-galactosidase [Flavobacterium sp. CF136]
gi|398072439|gb|EJL63655.1| alpha-galactosidase [Flavobacterium sp. CF136]
Length = 397
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 140/383 (36%), Positives = 193/383 (50%), Gaps = 66/383 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKP 82
L TPPMGWNSWN F C ++EK+I+E D +S RD+ ++ + D QI
Sbjct: 26 LAKTPPMGWNSWNKFGCDVSEKLIKEMADAMVKSGMRDA----GYNYLVIDDCWQIG--- 78
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRI--------------------- 121
+G+++A+ FPSG KAL DY+H S L I
Sbjct: 79 --RDAKGDIIADPERFPSGMKALGDYIH---SKGLKFGIYSCAGSMTCQSRPGSRGYQFQ 133
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++ E G+DYLKYD CF++ Y+ MSDALK+ RPI FS+CEWG +P WG
Sbjct: 134 DAKKYAEWGVDYLKYDWCFDEGQNAKAAYKTMSDALKETKRPIVFSICEWGGSKPWEWGE 193
Query: 182 NIRNSWRTTDDISDSWARM-----LTIADMNEVYAD---HAKPGGWNDPDMLEVGNGGMK 233
I + WRTT DI D + L I D+ + AD +A PG WNDPDMLEVGNGGM
Sbjct: 194 GIGHLWRTTFDIRDCYQCTFDWGGLGILDIIDRQADLWKYAGPGHWNDPDMLEVGNGGMT 253
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ--------- 284
Y+E HFS+W++ +PL+ G D+ NL T I+ N+EVIAVNQ QQ
Sbjct: 254 YDENKTHFSMWSMLASPLMAGNDLRNLDPTTAQILTNKEVIAVNQDAKGQQARRFMDMGE 313
Query: 285 --IWAGPLSGNRIVVLLESRKTFSSMMTAHW--------DDIGIKSNNT----LFEHRTL 330
IWA PL+ ++ + +R + W +D+ IK L+EH+++
Sbjct: 314 HEIWAKPLADGKVAICFLNRTETDWKLDYDWKKNTMYFVEDVKIKKETYIVRDLWEHKSI 373
Query: 331 ETKFQGKLTAKVDGHSCKMYELT 353
T + L + H M L+
Sbjct: 374 GTT-EKNLVKSIPAHGVLMVVLS 395
>gi|389735987|ref|ZP_10189591.1| alpha-galactosidase [Rhodanobacter sp. 115]
gi|388440094|gb|EIL96511.1| alpha-galactosidase [Rhodanobacter sp. 115]
Length = 408
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 136/379 (35%), Positives = 188/379 (49%), Gaps = 56/379 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGWNSWN FHC ++ K+I T D S + + L Q
Sbjct: 27 LDNGLARTPPMGWNSWNTFHCDVSAKLIEATADIMVSSGMKAAGYRYVNIDDCWLLKQRG 86
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
G LVA+ FP G KA+ADYVH SL + +
Sbjct: 87 P-------HGELVADPAKFPQGIKAVADYVHRKGLKLGIYESAGTITCAGYPGSLGHEKQ 139
Query: 123 SNIAFEQ-GIDYLKYDNC--FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
F + G+DYLKYDNC + +T P RY AMSDAL+ GRPI +S+CEWG+ P W
Sbjct: 140 DAKEFARWGVDYLKYDNCGDYRGETYPQ-RYTAMSDALRATGRPIVYSLCEWGNQSPWNW 198
Query: 180 GSNIRNSWRTTDDISDSW----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
I NSWRTT DI+ W +L I D + ++PG WNDPDMLEVGN
Sbjct: 199 AQAIGNSWRTTQDITPRWHTDQPANGYPQGILDILDQQAALSHASRPGAWNDPDMLEVGN 258
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ----- 284
G + +E HFS+WA+ APL+ G D+ ++ + I+ N EVIAV+Q +Q
Sbjct: 259 GYLNNDENRAHFSLWALLNAPLIAGNDLRKMSKDVRAILTNREVIAVDQDWGGRQGYRVH 318
Query: 285 ------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETK 333
+WA P+S + V+L +R + + IG+KS ++ L++H + +
Sbjct: 319 RDGNADVWAKPMSDGSVAVILLNRGSGPLHIATSATAIGLKSASSYTVRDLWQHHS--ST 376
Query: 334 FQGKLTAKVDGHSCKMYEL 352
G + A V H+ M+ +
Sbjct: 377 SAGAIEADVPTHAVAMFRV 395
>gi|281207323|gb|EFA81506.1| putative alpha-galactosidase [Polysphondylium pallidum PN500]
Length = 393
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 144/401 (35%), Positives = 196/401 (48%), Gaps = 96/401 (23%)
Query: 20 LSNGLGLTPPMGWNS------------WNHFHCQINEKIIRET----------------- 50
L+NGLGLTP MGWNS WN+F C INE +I ET
Sbjct: 20 LNNGLGLTPQMGWNSNWINSYYLITYSWNYFACNINETVIMETAKAMATNGMKDAGYVYV 79
Query: 51 --DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADY 108
DDCW ES RD G + + TFP+G +ADY
Sbjct: 80 NIDDCWAES-RDKN--------------------------GVIQPDSNTFPNGIAYIADY 112
Query: 109 VHIL---ISNSLDNRIES------NIAFEQ---------GIDYLKYDNCFNDDTRPTIRY 150
VH L + D E+ + +EQ GIDYLK D C +P RY
Sbjct: 113 VHGLGLKLGIYTDAGTETCAGRPGSFGYEQIDAQTYASWGIDYLKEDWCNTGSNQPLSRY 172
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
MS AL GRPIFFS+C+WG P WG + NS+RTT DI D+WA + ++
Sbjct: 173 SIMSQALNATGRPIFFSLCDWGTDNPWEWGPTVGNSFRTTGDIKDNWASFMNNLNLQIPI 232
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
+++ GGWNDPDMLEVGNGGM EY+ HFS+W+I APL+ G D+ N+ T+ I+
Sbjct: 233 TSYSQVGGWNDPDMLEVGNGGMTTTEYISHFSLWSILNAPLIAGNDLRNIDQTTLSILTA 292
Query: 271 EEVIAVNQ-----------AIND--QQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDI 316
EVIAVNQ + N QQ+WA P++ G+R+VVL + + S+ + W DI
Sbjct: 293 PEVIAVNQDPLGKQGALVRSYNGGLQQVWAKPMADGSRVVVLFNT-DSISADIQLEWVDI 351
Query: 317 GIKSNNTL----FEHRTLETKFQGK-LTAKVDGHSCKMYEL 352
+++ T+ R+ ++ G ++ V H C M ++
Sbjct: 352 YVQNTATMTVRDLWARSDVGQYTGSYVSLNVPSHGCAMLKI 392
>gi|398788359|ref|ZP_10550523.1| alpha-galactosidase [Streptomyces auratus AGR0001]
gi|396992276|gb|EJJ03389.1| alpha-galactosidase [Streptomyces auratus AGR0001]
Length = 658
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 186/375 (49%), Gaps = 53/375 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
L GL TPPMG+N+WN C + NE++++ D + L + + + +L
Sbjct: 29 LPGGLAKTPPMGFNNWNSTECRAEFNEEMVKGIADLF------VSKGLKAAGYQYVNLDD 82
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD 118
A P G LV + FP G KA+ADYVH L
Sbjct: 83 CWALPT-RDGAGKLVPDPRRFPHGIKAVADYVHAKGLKFGIYTSAGIKTCNPAGFPGGLG 141
Query: 119 NRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
+ F G+DYLKYDNC N RYR M DALK GRPI +S+CEWG+ +P
Sbjct: 142 HETSDAQQFADWGVDYLKYDNCNNLGVDAKQRYRTMRDALKATGRPIVYSICEWGENKPW 201
Query: 178 LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
W S++ + WRTT DISDSWA ML IA N A HA PG WNDPDMLEVGNGGM EY
Sbjct: 202 EWASDVGHLWRTTGDISDSWASMLGIAKQNLPLAPHAGPGHWNDPDMLEVGNGGMTDTEY 261
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------AINDQQIW-- 286
HFS+W++ APLL+G D+ T ET I+ N+++IAV+Q ++ W
Sbjct: 262 RSHFSLWSMMAAPLLIGSDLRKATPETFTILANKDLIAVDQDALGKPATVLASESGRWTL 321
Query: 287 ----AGPLSGNRIVVLL----ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKL 338
AG G+R V L + ++ ++ A L++HR +T G +
Sbjct: 322 VRQLAG---GDRAVALFNETDQPQRISTTAGEAGLPQAAGYRLRDLWQHR--DTHTTGTV 376
Query: 339 TAKVDGHSCKMYELT 353
+A V H +Y ++
Sbjct: 377 SATVPAHGTVVYRIS 391
>gi|403335733|gb|EJY67047.1| Alpha-galactosidase [Oxytricha trifallax]
Length = 413
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/395 (34%), Positives = 191/395 (48%), Gaps = 97/395 (24%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLGLTP MGWN+WN F C+INEK+I+ET DDCW RD
Sbjct: 23 LDNGLGLTPQMGWNTWNKFGCKINEKLIKETADYIVKLGLDKVGYQYVNVDDCWLLEERD 82
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
G+++ + FPSG KAL+DY+H S L
Sbjct: 83 KD--------------------------GHIIVDPVAFPSGMKALSDYIH---SKGLKFG 113
Query: 121 IESNIAF---------------------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
+ S+ + +DY+KYDNC + +R++ MSD LKK
Sbjct: 114 LYSSAGYFSCEHRAGSMGHEKTDAQDFASWEVDYIKYDNCNHGGQPNYVRFKKMSDELKK 173
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
GR IFFS+C WGD WG + NSWRTT DI +++ + N+ + +HA PG W
Sbjct: 174 TGRSIFFSICNWGDEDAPEWGKRVGNSWRTTQDIQNNFNSVEYNFIQNQKFQEHAGPGHW 233
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDML++GN GM +E HF++WAI+K PL++G D+ N+ E++ I+ N E+IAVNQ
Sbjct: 234 NDPDMLQIGNNGMNPDEERTHFALWAIAKGPLIMGNDLQNIRPESLEILKNTEIIAVNQD 293
Query: 279 --------AIN---------DQQIWAGP-LSGNRIVVLLESRKTFSSMMTAHWDDIG--I 318
AIN Q++ P ++G+ VV++ R+ T DIG I
Sbjct: 294 ELGLQGGCAINCGPFEFLLRRPQVYVSPMMNGDLAVVIVNWRELDYGEFTFELKDIGMNI 353
Query: 319 KSNNTLFE------HRTLETKFQGKLTA-KVDGHS 346
+ + L+ H+T Q T K+ GH
Sbjct: 354 RKDTDLYAVRDLWLHKTFSPYNQPTFTVPKIPGHG 388
>gi|300785539|ref|YP_003765830.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384148835|ref|YP_005531651.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399537423|ref|YP_006550085.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299795053|gb|ADJ45428.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340526989|gb|AEK42194.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398318193|gb|AFO77140.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 651
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/358 (36%), Positives = 175/358 (48%), Gaps = 80/358 (22%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
+G PPMGWNSWN F C INE ++R DDCW E +R
Sbjct: 33 VGERPPMGWNSWNRFGCDINEDLVRGAADALVSSGMKDAGYQYINIDDCWAEQNR----- 87
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ 112
+G L + FP G K LADYVH L
Sbjct: 88 ---------------------TAEGKLEPSHERFPHGIKTLADYVHGLGLKLGIYTSAGT 126
Query: 113 ------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPI 164
+ +LD+ +F G+DYLKYDNC N+ RP I RY M DA+KK GRP+
Sbjct: 127 LTCQKTMPGALDHEEADAQSFADWGVDYLKYDNC-NNQGRPAIERYTKMGDAIKKTGRPM 185
Query: 165 FFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPD 223
+++CEWG+ +P WG + WRTT DISD+W M I D A PGGWNDPD
Sbjct: 186 IYALCEWGENKPWTWGRDAGAQLWRTTGDISDNWGSMTGILDQQVGLEKFAGPGGWNDPD 245
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI--- 280
MLEVGNGGM EY HF++WA+ APLL G D+ ++ T I+ N +VIA++Q
Sbjct: 246 MLEVGNGGMTDTEYRSHFALWALLNAPLLAGNDLPAMSPATKAILENNDVIALDQDWAGT 305
Query: 281 --------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL 330
D ++WA P+S V+L +R + S+ ++ ++G++ + F R L
Sbjct: 306 QGRKLRDDGDTEVWAKPMSDGSAAVVLFNRGSASTTISTTAAELGLEGHT--FRVRDL 361
>gi|29828016|ref|NP_822650.1| alpha-galactosidase [Streptomyces avermitilis MA-4680]
gi|29605118|dbj|BAC69185.1| putative alpha-galactosidase [Streptomyces avermitilis MA-4680]
Length = 682
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/372 (36%), Positives = 188/372 (50%), Gaps = 49/372 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
++ L LTPPMG+N+WN C + NE +++ T D + E L + + + +L
Sbjct: 54 ADSLALTPPMGFNNWNSTGCRPEFNEDMVKGTADIFVEKG------LKDAGYQYVNLDDC 107
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF------ 127
A P G LV + FP G KA+ADYVH + I S + + F
Sbjct: 108 WALPA-RDSNGKLVPDPARFPGGIKAVADYVHSKGLKLGIYTSAGTKTCNEAGFPGALGH 166
Query: 128 ---------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ G+DYLKYDNC N +RY M DALK GRPI +S+CEWG+ +P
Sbjct: 167 EYSDAQQFADWGVDYLKYDNCNNQGVDAKLRYTTMRDALKATGRPIVYSLCEWGENKPWE 226
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W S++ WRTT DISDSW ML+I N A +A PG WNDPDMLEVGN GM EY
Sbjct: 227 WASDVGQLWRTTGDISDSWGSMLSILKQNLPLAPYAGPGHWNDPDMLEVGNSGMTDTEYR 286
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---------IWAGP 289
HFS+W+I APLL+G D+ +A T I+ N+EVIAV+Q +Q WA
Sbjct: 287 THFSMWSIMAAPLLIGSDLRKASAATFDILDNKEVIAVDQDPLGKQGTVLSSEGGRWAVT 346
Query: 290 LS---GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAK 341
G+R V L + + T +G+ + L++HR+ T G ++A
Sbjct: 347 KEMKDGSRAVALFNETDSAQRITTTA-QAVGLPKAHAYTLRDLWQHRSYNT--AGTISAT 403
Query: 342 VDGHSCKMYELT 353
V H + ++
Sbjct: 404 VPAHGTVLVRVS 415
>gi|326441974|ref|ZP_08216708.1| secreted alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 707
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 140/367 (38%), Positives = 187/367 (50%), Gaps = 43/367 (11%)
Query: 24 LGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
L TPPMG+N+WN HC + NE +I+ D L + + + +L A
Sbjct: 45 LAPTPPMGFNNWNSTHCRPEFNETMIKGVADTL------VAKGLRDAGYRYVNLDDCWAL 98
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLDN-RI 121
P N G+LV + FPSG KALADYVH L N R
Sbjct: 99 PQRNAA-GDLVPDPVRFPSGIKALADYVHAKGLKFGIYSSAGTRTCDVQGFPGGLGNERR 157
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
++ + G+DYLKYDNC N+ RYRAM++AL+ GR I S+CEWG+ P W
Sbjct: 158 DAALWASWGVDYLKYDNCHNNGVDARQRYRAMAEALRATGREILLSVCEWGENAPWEWAG 217
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
++WRTT DI+D+W ML IA N+ A +A+PG WNDPDMLEVGNGGM EY HF
Sbjct: 218 EYGSAWRTTGDIADTWDSMLGIARRNQELAPYARPGAWNDPDMLEVGNGGMTDTEYRTHF 277
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPL 290
S+W+ APLL+G D+ T T+ I+ N EVIAVNQ +Q + PL
Sbjct: 278 SLWSQMAAPLLIGSDLRTATPATLEILTNREVIAVNQDPLGRQGTVVSRSGGLVVMTKPL 337
Query: 291 S-GNRIVVLL-ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQG-KLTAKVDGHSC 347
+ G R V L E+ + TA +G ++ TL + T + + + A V H
Sbjct: 338 ADGGRSVTLTNETAAARTVTTTAEAIGLGGSASYTLRDLWTRQNRTTADTIAATVPAHGT 397
Query: 348 KMYELTP 354
MY +TP
Sbjct: 398 VMYRVTP 404
>gi|124495059|gb|ABN13605.1| alpha-galactosidase [Cerastium arcticum]
Length = 226
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 109/227 (48%), Positives = 146/227 (64%), Gaps = 17/227 (7%)
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKYDNC N +RP RY MS AL+ +G+ IFFS+CEWG PA W + NSWRTT
Sbjct: 1 DYLKYDNCENTGSRPEERYPKMSTALQTSGKEIFFSLCEWGQDNPATWAKGMGNSWRTTG 60
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI D+W M IAD N+ +A A PGGWNDPDMLEVGNGGM EY HFSIWA+ KAPL
Sbjct: 61 DIQDNWDSMTQIADQNDKWAPFAGPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALMKAPL 120
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLE 300
L+GCD+ + T I+ N VI+VNQ + D ++WAGPL+ NR+ V+L
Sbjct: 121 LIGCDLRAMNWVTYEILSNRGVISVNQDLLGVQRRKVKKDGDLEVWAGPLTNNRVAVILW 180
Query: 301 SRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKF-----QGKLTAKV 342
+R + ++ +TA+ DIG+ + ++ R L T++ Q +++A+V
Sbjct: 181 NRGSEAATITAYXTDIGLNPDTVVY-ARDLWTRYTQLGVQNQISAQV 226
>gi|281207322|gb|EFA81505.1| hypothetical protein PPL_05494 [Polysphondylium pallidum PN500]
Length = 766
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 172/355 (48%), Gaps = 77/355 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLGLTP MGWNSWNHF C INE II +T DDCW +
Sbjct: 405 LQNGLGLTPQMGWNSWNHFGCDINEDIIMQTAKAMATNGMKEAGYIYVNIDDCWASHRNE 464
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S G++ A+ TFP+G LADYVH L
Sbjct: 465 S---------------------------GHIQADSKTFPNGIAYLADYVHSLGMKLGIYT 497
Query: 113 ---------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
S D+ I++ G+DY+K D C+ + P RY MS AL GR
Sbjct: 498 DAGPLTCQRRPGSYDHEEIDAQTYAAWGVDYVKEDWCWAFLSNPLDRYAIMSQALNGTGR 557
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PIFFS+C+WG P WG + NS+RTT DI D+W L D +++ GGWNDP
Sbjct: 558 PIFFSLCDWGTDNPWEWGPTVGNSFRTTSDIKDTWDSFLDNLDKQIPITSYSQVGGWNDP 617
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGNGGM Y EY+ HF +W+I APL+ G D+ + + + I E++AVNQ
Sbjct: 618 DMLEVGNGGMSYTEYLSHFQLWSIINAPLIAGNDMRTVDQQYLDIFTAPEIVAVNQDPLG 677
Query: 279 -------AIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
+ N QQ+WA P++ +L +R + S+ + W DI + N ++
Sbjct: 678 KQGSLVRSYNSGLQQVWAKPMADGSRAAVLFNRDSSSAGIQLFWADIFLTPNTSM 732
>gi|18157238|dbj|BAB83765.1| alpha-galactosidase [Clostridium josui]
Length = 478
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 143/370 (38%), Positives = 184/370 (49%), Gaps = 47/370 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGLGLTPPMGWNSWN F INE I+E D + + +
Sbjct: 34 LNNGLGLTPPMGWNSWNIFGGDINEDKIKEIADAMVTTGMKD-----AGYEYVNLDDNWM 88
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNR 120
A P G L+ + FPSG KALADY+H I S S
Sbjct: 89 ANP-ARDANGKLIPDPKRFPSGMKALADYIHSKGLKFGIYGDRGVTTCCNIPQSGSQGYE 147
Query: 121 IESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+ F E G+DYLKYDNC +D + Y M DAL K GRPIF+S+C W P W
Sbjct: 148 EQDAKTFAEWGLDYLKYDNCASD-SNLQAGYEKMRDALLKTGRPIFYSICCWYFAGP--W 204
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
+ NSWRTT DISDSW ++ D N A +A PG WNDPDMLEVGNG M EY
Sbjct: 205 MVDCGNSWRTTGDISDSWGSIIRNIDENSKSAAYAGPGHWNDPDMLEVGNGNMTDTEYKA 264
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAG 288
HFS+W + APL+ G D+ N+T T I+ N+EVIA++Q + + ++W
Sbjct: 265 HFSMWCMMAAPLIAGNDLRNMTPATKEILTNKEVIAIDQDAAGVQGTKVSSSGELEVWCK 324
Query: 289 PLS--GNRIVVLLESRKTFSSMMTAHWDDI----GIKSNNTLFEHRTLETKFQGKLTAKV 342
PL G V L +R S+ +T +W DI G + L+EH+ T F TA V
Sbjct: 325 PLGTDGTTKAVALLNRGATSADITVNWSDIQLADGPVTVRDLWEHKDCGT-FNTGYTANV 383
Query: 343 DGHSCKMYEL 352
H + ++
Sbjct: 384 PSHGVVVLKV 393
>gi|326203063|ref|ZP_08192929.1| glycoside hydrolase clan GH-D [Clostridium papyrosolvens DSM 2782]
gi|325986709|gb|EGD47539.1| glycoside hydrolase clan GH-D [Clostridium papyrosolvens DSM 2782]
Length = 472
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 182/370 (49%), Gaps = 47/370 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+N LGLTPPMGWNSWN F INE I++ D + + +
Sbjct: 34 LNNNLGLTPPMGWNSWNIFGGDINEDKIKQIADAMVSTGMKD-----AGYEYVNLDDNWM 88
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
A P G L+ + FP+G KAL DY+H + S E
Sbjct: 89 ANP-ARDVNGKLIPDPKRFPNGMKALGDYIHAKGLKFGIYGDRGVTTCCNVPQSGSQGYE 147
Query: 123 ---SNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+N E G+DYLKYDNC +D + Y M AL K GRPIF+S+C W P W
Sbjct: 148 EQDANTFAEWGVDYLKYDNCASD-SNLQAGYEKMQSALLKTGRPIFYSICCWYFAGP--W 204
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
+ NSWRTT DISD+W + I D N A +A PG WNDPDMLEVGNG M EY
Sbjct: 205 IVDCGNSWRTTGDISDNWGSVAKIIDENSKSASYAGPGHWNDPDMLEVGNGNMTDTEYKA 264
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAG 288
HFS+W + APL+ G D+ N+T T I+ N+EVIA++Q A D ++W
Sbjct: 265 HFSMWCLMAAPLIAGNDLRNMTGTTKDILTNKEVIAIDQDAAGVQGTKVSASGDLEVWCK 324
Query: 289 PLS--GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----LFEHRTLETKFQGKLTAKV 342
PL G V+L +R SS +T +W DI + + L+EH+ T F TA V
Sbjct: 325 PLGTDGTTKAVILLNRGGVSSDITVNWKDIKLSDGSVKVRDLWEHKDCGT-FDTGYTANV 383
Query: 343 DGHSCKMYEL 352
H + ++
Sbjct: 384 PSHGAVVLKV 393
>gi|254391225|ref|ZP_05006431.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|197704918|gb|EDY50730.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 786
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 141/367 (38%), Positives = 188/367 (51%), Gaps = 43/367 (11%)
Query: 24 LGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
L TPPMG+N+WN HC + NE +I+ D L + + + +L A
Sbjct: 45 LAPTPPMGFNNWNSTHCRPEFNETMIKGVADTL------VAKGLRDAGYRYVNLDDCWAL 98
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------IESNIAFEQ- 129
P N G+LV + FPSG KALADYVH I +S R + E+
Sbjct: 99 PQRNAA-GDLVPDPVRFPSGIKALADYVHAKGLKFGIYSSAGTRTCDVQGFPGGLGNERR 157
Query: 130 --------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
G+DYLKYDNC N+ RYRAM++AL+ GR I S+CEWG+ P W
Sbjct: 158 DAALWASWGVDYLKYDNCHNNGVDARQRYRAMAEALRATGREILLSVCEWGENAPWEWAG 217
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
++WRTT DI+D+W ML IA N+ A +A+PG WNDPDMLEVGNGGM EY HF
Sbjct: 218 EYGSAWRTTGDIADTWDSMLGIARRNQELAPYARPGAWNDPDMLEVGNGGMTDTEYRTHF 277
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPL 290
S+W+ APLL+G D+ T T+ I+ N EVIAVNQ +Q + PL
Sbjct: 278 SLWSQMAAPLLIGSDLRTATPATLEILTNREVIAVNQDPLGRQGTVVSRSGGLVVMTKPL 337
Query: 291 S-GNRIVVLL-ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQG-KLTAKVDGHSC 347
+ G R V L E+ + TA +G ++ TL + T + + + A V H
Sbjct: 338 ADGGRSVTLTNETAAARTVTTTAEAIGLGGSASYTLRDLWTRQNRTTADTIAATVPAHGT 397
Query: 348 KMYELTP 354
MY +TP
Sbjct: 398 VMYRVTP 404
>gi|300788639|ref|YP_003768930.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384152097|ref|YP_005534913.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399540520|ref|YP_006553182.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299798153|gb|ADJ48528.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340530251|gb|AEK45456.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398321290|gb|AFO80237.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 522
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 141/377 (37%), Positives = 195/377 (51%), Gaps = 43/377 (11%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHCQ--INEKIIRETDDCWGESSRDSQISLSCSFH 70
G+ L NGL TPPMG+N+WN C+ NE +I+ D + L + +
Sbjct: 24 GAPTAAALENGLLRTPPMGFNNWNSTQCKADFNETMIKGIADIF------VSKGLKDAGY 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI---- 121
+ ++ A P N GNLV + FP G KALADYVH I S +
Sbjct: 78 TYVNIDDCWALPSRNSA-GNLVPDPARFPDGIKALADYVHGKGLKFGIYTSAGTKTCNKA 136
Query: 122 -----------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
++N+ G+DYLKYDNC N RY+AM DAL K+GR I +S+CE
Sbjct: 137 GFPGALNHEQQDANLFASWGVDYLKYDNCNNQGVDAQQRYKAMRDALAKSGRAIAYSICE 196
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WG +P W + + N WRTT DISD W+ M+ A N A +A PG WNDPDMLEVGNG
Sbjct: 197 WGQNQPWTWAAPVGNLWRTTGDISDKWSSMIGKAQTNRGLAQYAGPGHWNDPDMLEVGNG 256
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ------ 284
GM EY HFS+WA+ APLL+G D+ ++ + I+ N +VIA++Q +Q
Sbjct: 257 GMTAAEYRTHFSLWAMMAAPLLIGSDLRKVSDDNFAILKNTDVIALDQDPLGKQATVLSA 316
Query: 285 -----IWAGPLS-GNRIVVLL-ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK-FQG 336
++ LS G+R V L E+ T + TA IG S+ TL + + T+ G
Sbjct: 317 NAGLVVYGKVLSNGDRAVALSNETAATATIGTTASATGIGSASSYTLKDLWSKATRTTTG 376
Query: 337 KLTAKVDGHSCKMYELT 353
++A V HS +Y ++
Sbjct: 377 TISASVPSHSTVLYRVS 393
>gi|376261211|ref|YP_005147931.1| alpha-galactosidase [Clostridium sp. BNL1100]
gi|373945205|gb|AEY66126.1| alpha-galactosidase [Clostridium sp. BNL1100]
Length = 472
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 140/370 (37%), Positives = 185/370 (50%), Gaps = 47/370 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGLGLTPPMGWNSWN F INE I++ D + + +
Sbjct: 34 LNNGLGLTPPMGWNSWNIFGGDINEDKIKQIADAMVTTGMKD-----AGYEYVNLDDNWM 88
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNR 120
A P G L+ + FP+G KALADY+H I S S
Sbjct: 89 ANP-ARDANGKLIPDPKRFPNGMKALADYIHSKGLKFGIYGDRGVTTCCNIPQSGSQGYE 147
Query: 121 IESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+ F E G+DYLKYDNC +D + Y M DAL K GRPIF+S+C W P W
Sbjct: 148 EQDAKTFAEWGVDYLKYDNCASD-SNLQAGYEKMRDALLKTGRPIFYSICCWYFAGP--W 204
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
+ NSWRTT DISD+W + D N A +A PG WNDPDMLEVGNG M EY
Sbjct: 205 IIDCGNSWRTTGDISDNWGSITKNIDENSKSAAYAGPGHWNDPDMLEVGNGKMSDTEYKA 264
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQIWAG 288
HFS+W + APL+ G D+ N+T T I+ N+EVIA++Q + + ++W
Sbjct: 265 HFSMWCMMAAPLIAGNDIRNMTPATKEILTNKEVIAIDQDVAGVQGTKVSTSGELEVWCK 324
Query: 289 PLS--GNRIVVLLESRKTFSSMMTAHWDDI----GIKSNNTLFEHRTLETKFQGKLTAKV 342
PL G V+L +R S+ +T +W DI G + L+EH+ +T F TA V
Sbjct: 325 PLGVDGTTKAVILLNRGGASADITVNWRDIKLADGPATVRDLWEHKDCDT-FNTGYTANV 383
Query: 343 DGHSCKMYEL 352
H + ++
Sbjct: 384 PSHGAVVLKV 393
>gi|319640075|ref|ZP_07994802.1| alpha-galactosidase [Bacteroides sp. 3_1_40A]
gi|317388353|gb|EFV69205.1| alpha-galactosidase [Bacteroides sp. 3_1_40A]
Length = 408
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/387 (35%), Positives = 191/387 (49%), Gaps = 66/387 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN F C ++E++I+E D+ I+ ++ L
Sbjct: 34 NALAQTPPMGWNSWNKFGCNVSEQLIKEM--------ADAMIATGMKDAGYEYLVIDDCW 85
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ------------ 129
+ +GN+ + FP+G KALADYVH + L I S E
Sbjct: 86 QVGRDEEGNIQVDPKRFPNGMKALADYVH---AKGLKMGIYSCAGSETCQGRPGSRGYQF 142
Query: 130 ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
GIDYLKYD C N+ + YR MSDALK GRPI FS+CEWG+ P WG
Sbjct: 143 QDARTYAAWGIDYLKYDWCSNEGQKAEAAYRTMSDALKACGRPIVFSICEWGENEPWKWG 202
Query: 181 SNIRNSWRTTDDISDSW--------ARMLTIAD-MNEVYADHAKPGGWNDPDMLEVGNGG 231
I + WR T DI D + +L I D M ++Y +A PG WND +MLEVGNGG
Sbjct: 203 KGIGHLWRITPDIRDCYQCVFDWGGVGVLDIIDKMADLYP-YAGPGHWNDAEMLEVGNGG 261
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
M +EY+ HFS+W + APL+ G D+ N+ ET+ I+ N+EVIA+NQ +
Sbjct: 262 MTRDEYITHFSMWCMLAAPLMSGNDLRNMDKETIEILTNKEVIAINQDKKGEQARRFMDM 321
Query: 281 NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW--------DDIGIKSNNT----LFEHR 328
+++IWA PL+ + V +R S + W D++ I L++H+
Sbjct: 322 GEKEIWAKPLNDGELAVCFMNRTENSWKLDYDWKKQTIYFADEVNIHRKEYKIRDLWQHK 381
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPV 355
+ T + ++ H M LTP+
Sbjct: 382 DIGTT-ERNTRHEIPAHGVLMVRLTPI 407
>gi|345517090|ref|ZP_08796568.1| alpha-galactosidase [Bacteroides sp. 4_3_47FAA]
gi|254833860|gb|EET14169.1| alpha-galactosidase [Bacteroides sp. 4_3_47FAA]
Length = 415
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 136/387 (35%), Positives = 191/387 (49%), Gaps = 66/387 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN F C ++E++I+E D+ I+ ++ L
Sbjct: 41 NALAQTPPMGWNSWNKFGCNVSEQLIKEM--------ADAMIATGMKDAGYEYLVIDDCW 92
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ------------ 129
+ +GN+ + FP+G KALADYVH + L I S E
Sbjct: 93 QVGRDEEGNIQVDPKRFPNGMKALADYVH---AKGLKMGIYSCAGSETCQGRPGSRGYQF 149
Query: 130 ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
GIDYLKYD C N+ + YR MSDALK GRPI FS+CEWG+ P WG
Sbjct: 150 QDARTYAAWGIDYLKYDWCSNEGQKAEAAYRTMSDALKACGRPIVFSICEWGENEPWKWG 209
Query: 181 SNIRNSWRTTDDISDSW--------ARMLTIAD-MNEVYADHAKPGGWNDPDMLEVGNGG 231
I + WR T DI D + +L I D M ++Y +A PG WND +MLEVGNGG
Sbjct: 210 KGIGHLWRITPDIRDCYQCVFDWGGVGVLDIIDKMADLYP-YAGPGHWNDAEMLEVGNGG 268
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
M +EY+ HFS+W + APL+ G D+ N+ ET+ I+ N+EVIA+NQ +
Sbjct: 269 MTRDEYITHFSMWCMLAAPLMSGNDLRNMDKETIEILTNKEVIAINQDKKGEQARRFMDM 328
Query: 281 NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW--------DDIGIKSNNT----LFEHR 328
+++IWA PL+ + V +R S + W D++ I L++H+
Sbjct: 329 GEKEIWAKPLNDGELAVCFMNRTENSWKLDYDWKKQTIYFADEVNIHRKEYKIRDLWQHK 388
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPV 355
+ T + ++ H M LTP+
Sbjct: 389 DIGTT-ERNTRHEIPAHGVLMVRLTPI 414
>gi|328869050|gb|EGG17428.1| putative alpha-galactosidase [Dictyostelium fasciculatum]
Length = 381
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/390 (34%), Positives = 180/390 (46%), Gaps = 82/390 (21%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESS 58
N + NGLG TP +GWNSWN+F C INE +I +T DDCW S
Sbjct: 18 NAIDNGLGNTPQLGWNSWNYFGCDINETVIMQTAKAMATNGMLEAGYRYINIDDCWAVSR 77
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
S + + A P+ FP G +ADYVH L
Sbjct: 78 NSSGV--------------VQADPIK-------------FPQGIAYIADYVHSLGLKLGI 110
Query: 113 ---ISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKA 160
+ + +EQ GIDYLK D C RY MS AL
Sbjct: 111 YTDAGTATCQGRPGSYGYEQIDAETYASWGIDYLKEDWCNTQGDDQLERYTIMSKALNST 170
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
GRPIFFS+C WG WG+++ NSWRTT DI D+WA ML+ D + ++ GGWN
Sbjct: 171 GRPIFFSLCCWGTANVWEWGASVGNSWRTTGDIKDNWASMLSNLDQQIPISSFSQVGGWN 230
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDMLEVGNGGM EY+ HFS+W+I APL+ G D+ + T+ I+ EVIA+NQ
Sbjct: 231 DPDMLEVGNGGMTTTEYISHFSLWSILNAPLIAGNDLRTIDQTTLSILTAPEVIAINQDS 290
Query: 279 ---------AIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---- 323
+ N QQ+WA PL+ V+L + T S+ + W DI ++
Sbjct: 291 LGKQGALVRSYNGGLQQVWARPLADGSKAVVLFNTDTNSADIELQWADIWVQEGTQLTVR 350
Query: 324 -LFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
L++ + T + V+ H C M +
Sbjct: 351 DLWQQSDIGTYSDSYVAYNVESHGCVMLRV 380
>gi|294777474|ref|ZP_06742925.1| alpha-galactosidase [Bacteroides vulgatus PC510]
gi|294448542|gb|EFG17091.1| alpha-galactosidase [Bacteroides vulgatus PC510]
Length = 409
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 136/384 (35%), Positives = 191/384 (49%), Gaps = 60/384 (15%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN F C ++E++I+E D+ I+ ++ L
Sbjct: 35 NALAQTPPMGWNSWNKFGCNVSEQLIKEM--------ADAMIATGMKDAGYEYLVIDDCW 86
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH----------ILISNSLDNRIES-NIAFEQ- 129
+ +GN+ + FP+G KALADYVH S + R S F+
Sbjct: 87 QVGRDEEGNIQVDPKRFPNGMKALADYVHAKGLKMGIYSCAGSETCQGRPGSRGYQFQDA 146
Query: 130 ------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
GIDYLKYD C N+ + YR MSDALK GRPI FS+CEWG+ P WG I
Sbjct: 147 RTYAAWGIDYLKYDWCSNEGQKAEAAYRTMSDALKACGRPIVFSICEWGENEPWKWGKGI 206
Query: 184 RNSWRTTDDISDSW--------ARMLTIAD-MNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
+ WR T DI D + +L I D M ++Y +A PG WND +MLEVGNGGM
Sbjct: 207 GHLWRITPDIRDCYQCVFDWGGVGVLDIIDKMADLYP-YAGPGHWNDAEMLEVGNGGMTR 265
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQ 283
+EY+ HFS+W + APL+ G D+ N+ ET+ I+ N+EVIA+NQ + ++
Sbjct: 266 DEYITHFSMWCMLAAPLMSGNDLRNMDKETIEILTNKEVIAINQDKKGEQARRFMDMGEK 325
Query: 284 QIWAGPLSGNRIVVLLESRKTFSSMMTAHW--------DDIGIKSNNT----LFEHRTLE 331
+IWA PL+ + V +R S + W D++ I L++H+ +
Sbjct: 326 EIWAKPLNDGELAVCFMNRTENSWKLDYDWKKQTIYFADEVNIHRKEYKIRDLWQHKDIG 385
Query: 332 TKFQGKLTAKVDGHSCKMYELTPV 355
T + ++ H M LTP
Sbjct: 386 TT-ERNTRHEIPAHGVLMVRLTPT 408
>gi|21218834|ref|NP_624613.1| alpha-galactosidase [Streptomyces coelicolor A3(2)]
gi|5824093|emb|CAB54169.1| probable secreted alpha-galactosidase [Streptomyces coelicolor
A3(2)]
Length = 680
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 122/299 (40%), Positives = 160/299 (53%), Gaps = 29/299 (9%)
Query: 24 LGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
L LTPPMG+N+WN HC + NE +++ D + E L + + + +L A
Sbjct: 54 LALTPPMGFNNWNSTHCRDEFNESMVKGIADLFVERG------LKDAGYEYVNLDDCWAL 107
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF--------- 127
P G LV + FP+G +A+ADYVH I S + S+I F
Sbjct: 108 P-ERDADGRLVPDPKRFPNGIEAVADYVHSKGLKFGIYTSAGTKTCSSIGFPGALGHEYS 166
Query: 128 ------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ G+DYLKYDNC N RY M DAL GRPI +S+CEWG+ +P W
Sbjct: 167 DARQFADWGVDYLKYDNCNNQGVDAKQRYTTMRDALAATGRPIVYSICEWGENKPWEWAG 226
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
++ WRTT DI+DSW M +I N A++A+PGGWNDPDMLEVGNGGM EY HF
Sbjct: 227 DLGQLWRTTGDINDSWGSMSSIMKSNLELAEYARPGGWNDPDMLEVGNGGMTDTEYRTHF 286
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLE 300
S+W+I APLL+G D+ I+ N+EVIAV+Q +Q G R VV E
Sbjct: 287 SMWSIMAAPLLIGTDLRTAPESAFEILTNDEVIAVDQDPLGKQGEVVSSEGGRWVVSKE 345
>gi|7706787|gb|AAB35252.2| alpha-galactosidase [Umbelopsis vinacea]
Length = 417
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 139/415 (33%), Positives = 197/415 (47%), Gaps = 109/415 (26%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDS 61
+NGL +TP MGWN+WN + C I+E++I + DDCW ++ R++
Sbjct: 22 NNGLAITPQMGWNTWNKYGCNIDEQLILDAAKAIVSSGLKDYGYNYVVIDDCWQKNEREN 81
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--------- 112
+L LP+ T FP G K L D +H +
Sbjct: 82 NKTL-----------------LPD---------PTKFPRGMKPLVDDIHAMGLKVGIYSS 115
Query: 113 ---------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALK 158
I++ I++ G+DYLKYDNC+N T RY MS AL
Sbjct: 116 AGTLTCGGHIASLGYEEIDAKTWASWGMDYLKYDNCYNQGQSGTAKLSYDRYNVMSQALN 175
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------- 199
K GRPI +S+C WG+ P + S I NSWR + DI D++ R
Sbjct: 176 KTGRPILYSLCNWGEDGPWNFASTIGNSWRLSGDIYDNFNRPDPACPCETYDCLLPGFRC 235
Query: 200 -MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
++ I + + A A+ GGWND DMLEVGNGGM + EY +H+++WA K+PLLLG DV
Sbjct: 236 SVMNIINKSVAVAQKARSGGWNDMDMLEVGNGGMTHEEYRLHYTMWAALKSPLLLGNDVT 295
Query: 259 NLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSS 307
N+TA+T II N+EVIAVNQ DQQ+++G L+ N VV+L + +
Sbjct: 296 NMTAQTKEIIMNDEVIAVNQDSSYSPAVRMWVKGDQQMFSGSLANNTQVVILLNAGDNET 355
Query: 308 MMTAHWDDIGIKSNNTLFEHRTLET----------KFQGKLTAKVDGHSCKMYEL 352
MTA WDDI S + +++E KF ++T V H +M +
Sbjct: 356 KMTATWDDIWFYSLPNVDSSKSIEVRDLWQKKNLGKFSHQITLDVPAHGVRMVKF 410
>gi|371776064|ref|ZP_09482386.1| alpha-galactosidase [Anaerophaga sp. HS1]
Length = 404
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 144/389 (37%), Positives = 188/389 (48%), Gaps = 69/389 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L LTPPMGWNSWN F C++NE++IR D ES + + D Q L
Sbjct: 27 LALTPPMGWNSWNKFGCEVNEELIRSIADAMVESGMKDA---GYEYIVIDDCWQGGRDSL 83
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------------- 129
G + + FP+G KALADYVH S L I S+ +
Sbjct: 84 -----GFIYPDSAKFPNGMKALADYVH---SKGLKLGIYSDAGTKTCAGYPGSRGYEYQD 135
Query: 130 -------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
GIDYLKYD C ++ P Y M DAL KAGRPI FSMCEWG +P W +
Sbjct: 136 ALQYALWGIDYLKYDWCNTENINPIGAYSTMRDALYKAGRPILFSMCEWGTSKPWEWAKD 195
Query: 183 IRNSWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
I + WRTT DI W ++ I DM E +A PG WNDPDMLEVGN G
Sbjct: 196 IAHMWRTTGDIFICWDCKHDHGGWYSWGVMKILDMQEGLRQYAGPGHWNDPDMLEVGN-G 254
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
+ NE HFS+WA+ APL+ G D+ N++ ET+ I+ N+EVIAVNQ +I
Sbjct: 255 LSVNEDRAHFSMWAMLAAPLIAGNDLRNMSKETVEILTNQEVIAVNQDSLGIQGFKYNSI 314
Query: 281 NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW------DDI-GIKS--NNTLFEHRTLE 331
+ + W PLS V +R+ +++ W D+I G ++ N T++ R L
Sbjct: 315 DSLETWFKPLSNGEWAVCFLNRRDQPAVIKFDWKIENVVDEIFGFETAFNETIYNIRNLW 374
Query: 332 TK-----FQGKLTAKVDGHSCKMYELTPV 355
K + A V H M LTP+
Sbjct: 375 DKKNIGNTKRVFKATVPSHDVIMLRLTPL 403
>gi|66824895|ref|XP_645802.1| hypothetical protein DDB_G0271490 [Dictyostelium discoideum AX4]
gi|122058123|sp|Q55B10.1|AGAL_DICDI RecName: Full=Probable alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|60473936|gb|EAL71875.1| hypothetical protein DDB_G0271490 [Dictyostelium discoideum AX4]
Length = 385
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 139/394 (35%), Positives = 185/394 (46%), Gaps = 83/394 (21%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESS 58
N L NGL LTP MGW+SWN + C INE +I T DDCW
Sbjct: 18 NALDNGLALTPQMGWSSWNFYACNINESVIMNTAKAMVSNGMADAGYTYVNIDDCW---- 73
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILI----- 113
A PN G + A+ T FP+G K +ADY+H L
Sbjct: 74 --------------------AGGRYPN---GTVYADPTNFPNGIKYVADYIHSLGLKIGI 110
Query: 114 -----SNSLDNRI-----ESNIA---FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKA 160
+ + R+ E+N A E GIDY+K D C+ P RY+ MS AL
Sbjct: 111 YTDAGTETCQKRVGSYGYEANDAQTYAEWGIDYVKEDWCYATLENPQQRYQIMSQALNAT 170
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
GRP+FFS+C+WG P +G ++ NSWRTT DI D+W ML+ + GG+N
Sbjct: 171 GRPMFFSLCDWGYENPWTFGMSVGNSWRTTPDIKDNWDSMLSNLMAQAPITSFSGIGGFN 230
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDM+ VGNGGM EYV HFS+W++ APL+ GCD+ ++ ET+ I+ EVIA+NQ
Sbjct: 231 DPDMMMVGNGGMSNTEYVSHFSLWSLLNAPLIAGCDLIDIDQETLSILTASEVIAINQDP 290
Query: 279 ---------AIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEH 327
+ N QQIWA PLS V+L + T + + W +I ++ + L
Sbjct: 291 LGVQGSLVKSYNGGLQQIWAKPLSNGARAVVLFNTDTNPATIELLWGNIWMEPSQQLSIR 350
Query: 328 RTLETKFQGKLTAKVDG------HSCKMYELTPV 355
G T + H C M LTP
Sbjct: 351 NLWTQTNLGTFTESYESDSLIPPHGCIMLTLTPT 384
>gi|220929020|ref|YP_002505929.1| glycoside hydrolase [Clostridium cellulolyticum H10]
gi|219999348|gb|ACL75949.1| glycoside hydrolase clan GH-D [Clostridium cellulolyticum H10]
Length = 471
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 183/370 (49%), Gaps = 47/370 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+N LGLTPPMGWNSWN F INE+ I++ D + + +
Sbjct: 34 LNNSLGLTPPMGWNSWNIFGGDINEEKIKQITDAMVTTGMKD-----AGYEYVNIDDNWM 88
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNR 120
A P G L+ + FP+G KALADY+H I S S
Sbjct: 89 ANP-ARDANGILIPDPKRFPNGMKALADYIHSKGLKFGIYGDRGVTTCCNIPQSGSQGYE 147
Query: 121 IESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+ F Q G+DYLKYDNC +D + Y M DAL K GRPIF+S+C W W
Sbjct: 148 EQDAKTFAQWGVDYLKYDNCASD-SNLQAGYEKMRDALLKTGRPIFYSICCW--YFAGAW 204
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
+ NSWRTT DISD+W ++ D N A +A PG WNDPDMLEVGNG M EY
Sbjct: 205 MVDCGNSWRTTGDISDNWRSIIKNIDENSKSAAYAGPGHWNDPDMLEVGNGNMTETEYKA 264
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAG 288
HFS+W + APL+ G D+ N+T T I+ N+EVIA+NQ + ++W
Sbjct: 265 HFSMWCMMAAPLIAGNDLRNMTPATKDILTNKEVIAINQDAAGVQGTKVSTSGELEVWCK 324
Query: 289 PLS--GNRIVVLLESRKTFSSMMTAHWDDI----GIKSNNTLFEHRTLETKFQGKLTAKV 342
PL G V L +R S+ +T +W DI G + L+EH+ KF + TA V
Sbjct: 325 PLGTDGTTKAVALLNRGAASADITVNWRDIKLADGPATVRDLWEHKDY-GKFNTEYTANV 383
Query: 343 DGHSCKMYEL 352
H + ++
Sbjct: 384 PSHGVVVLKV 393
>gi|403333964|gb|EJY66120.1| hypothetical protein OXYTRI_13715 [Oxytricha trifallax]
Length = 410
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 162/289 (56%), Gaps = 37/289 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF---DSLT 76
L+NGLGL P MGWN+WN +HC I++ II + D + Q+ L +++ D
Sbjct: 30 LNNGLGLKPAMGWNTWNAYHCDISKDIIIKNAD------KIKQLGLDQVGYVYVNIDDCW 83
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------------ 124
Q+ A+ GN+ A+ T FP+G K + DY+H +N L I S+
Sbjct: 84 QLEARDK----DGNVQADTTKFPNGMKEVGDYLH---TNGLKFGIYSSAGTKTCQGKAGS 136
Query: 125 IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
+ FE G DYLKYDNC+N+ RY M DAL K+GRPI++S+C WG+
Sbjct: 137 LGFETADAKYYASIGADYLKYDNCYNNFVPALKRYTDMGDALTKSGRPIYYSICNWGNEN 196
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
WG+++ NSWRTT DI ++W M N + + +A PGGWNDPDMLEVGN +
Sbjct: 197 VWQWGASVGNSWRTTLDIENNWGSMRYNFVQNSILSQYAAPGGWNDPDMLEVGNNNLTIT 256
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ 284
E HF++W KAPL+LG D+ N+ E + II N+ +IAVNQ QQ
Sbjct: 257 EQRSHFALWCFVKAPLILGNDLTNMGPEVLAIISNKNLIAVNQDSLGQQ 305
>gi|289774046|ref|ZP_06533424.1| secreted alpha-galactosidase [Streptomyces lividans TK24]
gi|289704245|gb|EFD71674.1| secreted alpha-galactosidase [Streptomyces lividans TK24]
Length = 675
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/299 (40%), Positives = 160/299 (53%), Gaps = 29/299 (9%)
Query: 24 LGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
L LTPPMG+N+WN HC + +E +++ D + E L + + + +L A
Sbjct: 49 LALTPPMGFNNWNSTHCRDEFDESMVKGIADLFVERG------LKDAGYEYVNLDDCWAL 102
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF--------- 127
P G LV + FP+G +A+ADYVH I S + S+I F
Sbjct: 103 P-ERDADGRLVPDPKRFPNGIEAVADYVHSKGLKFGIYTSAGTKTCSSIGFPGALGHEYS 161
Query: 128 ------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ G+DYLKYDNC N RY M DAL GRPI +S+CEWG+ +P W
Sbjct: 162 DARQFADWGVDYLKYDNCNNQGVDAKQRYTTMRDALAATGRPIVYSICEWGENKPWEWAG 221
Query: 182 NIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHF 241
++ WRTT DI+DSW M +I N A++A+PGGWNDPDMLEVGNGGM EY HF
Sbjct: 222 DLGQLWRTTGDINDSWGSMSSIMKSNLELAEYARPGGWNDPDMLEVGNGGMTDTEYRTHF 281
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLE 300
S+W+I APLL+G D+ I+ N+EVIAV+Q +Q G R VV E
Sbjct: 282 SMWSIMAAPLLIGTDLRTAPESAFEILTNDEVIAVDQDPLGKQGEVVSSEGGRWVVSKE 340
>gi|441161626|ref|ZP_20967966.1| glycoside hydrolase [Streptomyces rimosus subsp. rimosus ATCC
10970]
gi|440616686|gb|ELQ79815.1| glycoside hydrolase [Streptomyces rimosus subsp. rimosus ATCC
10970]
Length = 689
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 186/372 (50%), Gaps = 46/372 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
L +GL TPPMG+N+WN HC NE +I+ D + E L + + + +L
Sbjct: 61 LPDGLARTPPMGFNNWNSTHCGADFNEAMIKGIADLFVEKG------LKAAGYEYVNLDD 114
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSLDN 119
A+ G LV + T FP+G KA+ADYVH + +L +
Sbjct: 115 CWAQ-KDRDANGKLVPDTTRFPNGIKAVADYVHSKGLKLGIYTSAGTTTCARAMPGALGH 173
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F G+DYLKYDNC N RY M DALK GRPI +S+CEWG+ +P
Sbjct: 174 EYSDARQFADWGVDYLKYDNCNNQGVDAKQRYTTMRDALKATGRPIVYSICEWGENKPWE 233
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W +++ + WRTT DI D W + I N A +A PG WNDPDMLEVGNGGM EY
Sbjct: 234 WAADVGHLWRTTGDIGDVWDSAVGIFKENAPLAKYAGPGHWNDPDMLEVGNGGMTDTEYR 293
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWA 287
HFS+W++ APLL+G D+ + T I+GN EVI ++Q +Q + +
Sbjct: 294 SHFSLWSMMAAPLLIGTDLRKASRATYDILGNREVIGLDQDPLGKQAEVLKSDGGRWVLS 353
Query: 288 GPLS-GNRIVVLL-ESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLETKFQGKLTAKV 342
PL+ G+R V L ES + TA + S L++H +T G++ A V
Sbjct: 354 KPLADGDRAVALFNESDRPQRIATTAKEAGLPQASGYRLRDLWKHE--DTHTAGEIAATV 411
Query: 343 DGHSCKMYELTP 354
H +Y ++P
Sbjct: 412 PAHGTLVYRVSP 423
>gi|256375800|ref|YP_003099460.1| alpha-galactosidase [Actinosynnema mirum DSM 43827]
gi|255920103|gb|ACU35614.1| Alpha-galactosidase [Actinosynnema mirum DSM 43827]
Length = 567
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 136/379 (35%), Positives = 181/379 (47%), Gaps = 78/379 (20%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
+L +G PPMG+N+WN F C + E++I T DDCW R
Sbjct: 72 MLDDGASRVPPMGFNNWNSFGCDVTEELIMATADIFVTSGLKDAGYQYVNIDDCWSTRER 131
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS-- 114
D+ GNLV + FP G K +ADYVH +L+
Sbjct: 132 DAD--------------------------GNLVPDPVKFPKGIKHVADYVHSRGLLLGLY 165
Query: 115 ------------NSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAG 161
SL + AF G+DYLKYDNC+N RY M DAL G
Sbjct: 166 GDAGTLTCAGYPGSLGHEQRDAAAFASWGVDYLKYDNCYNQGIDARQRYTTMRDALLATG 225
Query: 162 RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWND 221
R I FS+CEWG+ +P W ++ + WRTT DI+D+W ++ I N D A PGGWND
Sbjct: 226 RRIVFSVCEWGENQPWTWARDVGHLWRTTPDITDTWGSVVDIFHRNAPLNDAAGPGGWND 285
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDMLEVGNGGM EY HF++WA APLL+G D+ T ETM I N +IAVNQ
Sbjct: 286 PDMLEVGNGGMTTTEYRSHFTLWAQMAAPLLIGADLRVATPETMAIYLNRGLIAVNQDPL 345
Query: 279 --------AINDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIK-SNNTLFEHR 328
+ + + + PL+ G+R VVL + + T+ IG+ +T +
Sbjct: 346 GVQARPVSSSGARHVLSKPLANGDRSVVLFNEGGAEAVVGTS-LQAIGLPGGGSTATDLW 404
Query: 329 TLET-KFQGKLTAKVDGHS 346
T T +F G +TA+V H
Sbjct: 405 TGATRRFTGAITARVPAHG 423
>gi|329941761|ref|ZP_08291026.1| secreted alpha-galactosidase [Streptomyces griseoaurantiacus M045]
gi|329299478|gb|EGG43378.1| secreted alpha-galactosidase [Streptomyces griseoaurantiacus M045]
Length = 675
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/298 (39%), Positives = 161/298 (54%), Gaps = 29/298 (9%)
Query: 27 TPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
TPPMG+N+WN HC + +EK++ D + E L + + + +L A P
Sbjct: 53 TPPMGFNNWNSTHCRAEFDEKMVEGIADIFVEKG------LRDAGYQYVNLDDCWALPA- 105
Query: 85 NQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF------------ 127
G+LV + FP G KA+ADYVH + I S + ++ F
Sbjct: 106 RDADGHLVPDPARFPHGIKAVADYVHAKGLKLGIYTSAGTKTCNSAGFPGALGHEYSDAR 165
Query: 128 ---EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+ G+DYLKYDNC N RY M DAL+ GRPI +S+CEWG+ +P W +++
Sbjct: 166 QFADWGVDYLKYDNCNNQGVDAKQRYTTMRDALRATGRPIVYSICEWGENKPWEWAADLG 225
Query: 185 NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIW 244
WRTT DI DSW ML+I N A +A PG WNDPDMLEVGNGGM EY HFS+W
Sbjct: 226 QLWRTTGDIDDSWGSMLSILKKNLPLAPYAGPGHWNDPDMLEVGNGGMTDTEYRTHFSMW 285
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLESR 302
++ APLL+G D+ + T I+GN+EV+AV+Q +Q G R VV E +
Sbjct: 286 SVMAAPLLIGADLRTASDATFGILGNKEVLAVDQDPLGRQGTVVSSEGGRWVVAKEMK 343
>gi|357403202|ref|YP_004915127.1| alpha-galactosidase [Streptomyces cattleya NRRL 8057 = DSM 46488]
gi|386359282|ref|YP_006057528.1| secreted alpha-galactosidase [Streptomyces cattleya NRRL 8057 = DSM
46488]
gi|337769611|emb|CCB78324.1| putative secreted alpha-galactosidase [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365809790|gb|AEW98006.1| secreted alpha-galactosidase [Streptomyces cattleya NRRL 8057 = DSM
46488]
Length = 536
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/393 (34%), Positives = 181/393 (46%), Gaps = 82/393 (20%)
Query: 12 NGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DD 52
G+ + L +GL TPPMGWN WN F C + E+++ +T DD
Sbjct: 30 GGAGSAHALDDGLARTPPMGWNDWNAFGCSVTERLVEQTADTMVASGMKDAGYTYVNIDD 89
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-- 110
CW SRD+ G+LV + FP G K ADYVH
Sbjct: 90 CWMTHSRDAA--------------------------GHLVPDPAKFPDGIKGTADYVHRK 123
Query: 111 ---------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMS 154
SL + + F G+DYLKYDNC N RY AM
Sbjct: 124 GLKLGIYESAGTATCAGYPGSLGHERQDAADFAAWGVDYLKYDNCNNQGVPYQQRYDAMR 183
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
DALK GRPI +S+CEWG+ WG+ + WRTT DIS S+ ML+I N A +A
Sbjct: 184 DALKATGRPIVYSLCEWGEDSVWTWGAATGHLWRTTGDISASFGSMLSIYRSNVRLAGYA 243
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWNDPDMLEVGN GM + E F++WA APL+ G D+ T T+ + N +VI
Sbjct: 244 GPGGWNDPDMLEVGN-GMSFTEDRTEFTLWAEMAAPLIAGTDLRTATPATLSLYTNRDVI 302
Query: 275 AVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
AV+Q +Q + A PL+G + V+L + + +T G+ + +
Sbjct: 303 AVDQDPLGRQGTEVSSSGGLHVLAKPLAGGDVAVVLFNENAAPATITTSASTAGLPAAPS 362
Query: 324 -----LFEHRTLETKFQGKLTAKVDGHSCKMYE 351
L+ H+ T G ++A+V GH MY
Sbjct: 363 YRLTNLWSHQVTSTG--GTVSAQVPGHGTVMYR 393
>gi|256599572|pdb|3A5V|A Chain A, Crystal Structure Of Alpha-Galactosidase I From
Mortierella Vinacea
Length = 397
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 133/396 (33%), Positives = 196/396 (49%), Gaps = 71/396 (17%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+NGL +TP MGWN+WN + C ++E++I + S L ++ + D Q
Sbjct: 2 NNGLAITPQMGWNTWNKYGCNVDEQLILDAAKAIASSGLKD---LGYNYVIIDDCWQKNE 58
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ISNSLDNRIE 122
+ + L+A+ T FP G K L D +H L I++ I+
Sbjct: 59 R----ESSKTLLADPTKFPRGIKPLVDDIHNLGLKAGIYSSAGTLTCGGHIASLGYEDID 114
Query: 123 SNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
+ + GIDYLKYDNC+N T RY+AM +AL K GRP+ +S+C WG+ P
Sbjct: 115 AKTWAKWGIDYLKYDNCYNQGQSGTPKLSYDRYKAMGNALNKTGRPMLYSLCNWGEDGPW 174
Query: 178 LWGSNIRNSWRTTDDISDSWAR--------------------MLTIADMNEVYADHAKPG 217
+ S I NSWR + D+ D++ R ++ I + + A+ G
Sbjct: 175 NFASTISNSWRISGDVYDNFNRPDPACPCTTYDCVLAGFRCSVMNIINKAVAVSQKARSG 234
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWND DMLEVGNGGM EY VH++IWA K+PL+LG DV N+T T II N+EVIAVN
Sbjct: 235 GWNDLDMLEVGNGGMNQEEYRVHYTIWAALKSPLILGNDVTNITNTTKEIIMNKEVIAVN 294
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE 326
Q DQQ+++G L+ N VV+L + ++ MTA WDDI + + +
Sbjct: 295 QDSSFSPANRIWVKGDQQLFSGNLANNTQVVILLNAGDSAAKMTATWDDIWVYNLPNVDS 354
Query: 327 HRTLETK----------FQGKLTAKVDGHSCKMYEL 352
R++E + F +T V H ++ +
Sbjct: 355 SRSIEVRDLWKQKSLGNFSNHITLDVPAHGVRLLKF 390
>gi|423305443|ref|ZP_17283442.1| hypothetical protein HMPREF1072_02382 [Bacteroides uniformis
CL03T00C23]
gi|423311260|ref|ZP_17289229.1| hypothetical protein HMPREF1073_03979 [Bacteroides uniformis
CL03T12C37]
gi|392679307|gb|EIY72692.1| hypothetical protein HMPREF1073_03979 [Bacteroides uniformis
CL03T12C37]
gi|392681144|gb|EIY74505.1| hypothetical protein HMPREF1072_02382 [Bacteroides uniformis
CL03T00C23]
Length = 397
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 173/343 (50%), Gaps = 59/343 (17%)
Query: 1 LFIVSSS-AADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSR 59
LF+V + +A +N +D L LTPPMGWNSWN F C INEK IRE D
Sbjct: 9 LFVVCTVISARANSTDS-------LALTPPMGWNSWNCFSCNINEKQIREIADL------ 55
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
+S ++ L + +GN++ ++ FPSG KALADY+H S L
Sbjct: 56 --MVSTGMKDAGYEYLNIDDCWQVGRDNEGNILVDEKNFPSGIKALADYIH---SKGLKF 110
Query: 120 RI---------------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
I ++ E G+DYLKYD CF++ P Y+ MSDALK
Sbjct: 111 GIYSCAGTLTCAGRPGSRGYQFQDARTYAEWGVDYLKYDWCFDEGQNPQAAYKTMSDALK 170
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW--------ARMLTIADMNEVY 210
+GRPI FS+CEWG+ +P W I + WRTT DI +++ ++ I D N
Sbjct: 171 ASGRPIVFSICEWGNSQPWTWAKGIGHLWRTTGDIINAFKGINYWGGCGVVEIIDKNADL 230
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
+A PG WNDPDML+VGNG + E HF++W + APLL G D+ + ET+ I+ N
Sbjct: 231 HKYAGPGHWNDPDMLQVGNGVLTMEENRSHFTMWCMLAAPLLAGNDIRKMDKETLGILTN 290
Query: 271 EEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESR 302
+EVIAVNQ + + +IW LS V +R
Sbjct: 291 KEVIAVNQDKLGKQGGRYMKVGEHEIWVKQLSNGEAAVCFFNR 333
>gi|270294331|ref|ZP_06200533.1| ribosomal protein S32 [Bacteroides sp. D20]
gi|270275798|gb|EFA21658.1| ribosomal protein S32 [Bacteroides sp. D20]
Length = 403
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 173/343 (50%), Gaps = 59/343 (17%)
Query: 1 LFIVSSS-AADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSR 59
LF+V + +A +N +D L LTPPMGWNSWN F C INEK IRE D
Sbjct: 15 LFVVCTVISARANSTD-------SLALTPPMGWNSWNCFSCNINEKQIREIADL------ 61
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
+S ++ L + +GN++ ++ FPSG KALADY+H S L
Sbjct: 62 --MVSTGMKDAGYEYLNIDDCWQVGRDNEGNILVDEKNFPSGIKALADYIH---SKGLKF 116
Query: 120 RI---------------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALK 158
I ++ E G+DYLKYD CF++ P Y+ MSDALK
Sbjct: 117 GIYSCAGTLTCAGRPGSRGYQFQDARTYAEWGVDYLKYDWCFDEGQNPQAAYKTMSDALK 176
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW--------ARMLTIADMNEVY 210
+GRPI FS+CEWG+ +P W I + WRTT DI +++ ++ I D N
Sbjct: 177 ASGRPIVFSICEWGNSQPWTWAKGIGHLWRTTGDIINAFKGINYWGGCGVVEIIDKNADL 236
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
+A PG WNDPDML+VGNG + E HF++W + APLL G D+ + ET+ I+ N
Sbjct: 237 HKYAGPGHWNDPDMLQVGNGVLTMEENRSHFTMWCMLAAPLLAGNDIRKMDKETLGILTN 296
Query: 271 EEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESR 302
+EVIAVNQ + + +IW LS V +R
Sbjct: 297 KEVIAVNQDKLGKQGGRYMKVGEHEIWVKQLSNGEAAVCFFNR 339
>gi|167646092|ref|YP_001683755.1| alpha-galactosidase [Caulobacter sp. K31]
gi|167348522|gb|ABZ71257.1| Alpha-galactosidase [Caulobacter sp. K31]
Length = 407
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/381 (34%), Positives = 176/381 (46%), Gaps = 58/381 (15%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQ 77
L NGL TPPMGWNSWN F C ++E +IR+T D S RD+ + + D
Sbjct: 40 LKDNGLARTPPMGWNSWNRFACDVDETLIRKTADAMVSSGMRDA----GYQYVVIDDCWH 95
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------- 129
A G++ + FPSG KAL DY+H S L I S+ +
Sbjct: 96 GA-----RDAHGDIQPDPKRFPSGMKALGDYIH---SRGLKFGIYSDAGLKTCGGRPGSW 147
Query: 130 -------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
G+DYLKYD C Y MS AL+ +GR I S+CEWG +P
Sbjct: 148 GHEYQDAKQYAAWGVDYLKYDWCMAGTQDARSAYYIMSSALQASGRDIVLSICEWGTSKP 207
Query: 177 ALWGSNIRNSWRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
LW + N WRTT DI D W + ++ I D +A+PG WNDPDMLEVGN
Sbjct: 208 WLWADKVGNLWRTTGDIYDKWEGVRDYSSGVMNIIDKQVELYPYARPGHWNDPDMLEVGN 267
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
GGM EY HFS+WA+ APL+ G D+ + AET I+ N EVIA++Q
Sbjct: 268 GGMTTEEYRSHFSLWAMLAAPLIAGNDIAAMDAETKAILTNREVIAIDQDSLGQQARRVS 327
Query: 279 AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETK 333
D ++W PL G V+L +R + + W + L+ + + +
Sbjct: 328 KTGDLEVWVRPLQGGGRAVVLLNRGPAPAPIRLDWSQLDYPPTLKARVRDLWTGKDVGVR 387
Query: 334 FQGKLTAKVDGHSCKMYELTP 354
+ A V H M ++ P
Sbjct: 388 -EASYQATVASHGVAMLKIQP 407
>gi|329851881|ref|ZP_08266562.1| alpha-galactosidase [Asticcacaulis biprosthecum C19]
gi|328839730|gb|EGF89303.1| alpha-galactosidase [Asticcacaulis biprosthecum C19]
Length = 401
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 177/383 (46%), Gaps = 68/383 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWN F C I+E +R D + + + D Q
Sbjct: 37 LAATPPMGWNSWNKFACNIDEAKVRAVADAMASTGMKDA---GYQYIVIDDCWQT----- 88
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------------- 129
G + A+ FPSG KALADYVH S L + S+ +
Sbjct: 89 ERAADGTIQADPVKFPSGIKALADYVH---SKGLKFGLYSDAGVKTCGGRPGSAGYEFQD 145
Query: 130 -------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
G+DYLKYD C+ Y M+ AL+ +GR I S+CEWGD +P W +
Sbjct: 146 ARTYAGWGVDYLKYDWCYTGTRNAEAAYTLMAKALRASGRDILLSICEWGDNKPRDWAAK 205
Query: 183 IRNSWRTTDDISDSWARMLTIADMNEVYA--------------DHAKPGGWNDPDMLEVG 228
+ WRTT DI DSW D++E Y+ + P WNDPDMLEVG
Sbjct: 206 AGHQWRTTGDIRDSW-------DVDEGYSHSFTWILDRQADLWKDSGPNQWNDPDMLEVG 258
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
NGGM EY HFS+WA+ APL+ G D+ + ET I+ N++VIAV+Q QQ
Sbjct: 259 NGGMTTTEYKAHFSLWAMLAAPLIAGNDLSAMDRETHDILTNKDVIAVDQDPLGQQGRKL 318
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG----IKSNNTLFEHRTLETK 333
+W PL+G V+L +R + +M+ W+ + +K+ + + K
Sbjct: 319 VDDGDFEVWVRPLAGGDRAVVLFNRSAAAKVMSVDWETLQMPAEMKATVKDLWTKQVTKK 378
Query: 334 FQGKLTAKVDGHSCKMYELTPVT 356
+G+ +A+V H M +TP
Sbjct: 379 VKGRYSAEVPSHGAVMVRITPTV 401
>gi|443289919|ref|ZP_21029013.1| Alpha-galactosidase [Micromonospora lupini str. Lupac 08]
gi|385886831|emb|CCH17087.1| Alpha-galactosidase [Micromonospora lupini str. Lupac 08]
Length = 552
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 125/313 (39%), Positives = 165/313 (52%), Gaps = 38/313 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TP MGWN WN F C +NE +IR+T D S ++ + + + ++
Sbjct: 38 LENGLARTPQMGWNDWNSFGCNVNEGLIRQTADTLVSSG------MAAAGYRYVNIDDCW 91
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI----ESNIAFEQ- 129
+ N G+LVA+ FPSG KALADYVH + I +S ++I +EQ
Sbjct: 92 STKQRNG-SGDLVADPQKFPSGMKALADYVHDKGLKLGIYSSAGTLTCAGYPASINYEQR 150
Query: 130 --------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
GIDYLKYDNC + R RY AM DAL RPI +S+C WG WG
Sbjct: 151 DANLWASWGIDYLKYDNCGDHLGRSAQQRYTAMRDALLATNRPILYSLCSWGQDSVWTWG 210
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ + NSWRTT DI +W ++ I D +A+PG WNDPDMLEVGNG E H
Sbjct: 211 AGVGNSWRTTGDIGGNWNSIMGILDQQVGKESYARPGAWNDPDMLEVGNGPTD-TESRAH 269
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGP 289
FS+WA+ APLL G D+ ++A T I+ N +VIAVNQ QQ +W P
Sbjct: 270 FSLWALLNAPLLAGNDIRTMSAATRTILTNTDVIAVNQDWGGQQGRKISDTGDLEVWRKP 329
Query: 290 LSGNRIVVLLESR 302
+S + +L +R
Sbjct: 330 MSTGGVATVLLNR 342
>gi|294811122|ref|ZP_06769765.1| Alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|326439536|ref|ZP_08214270.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|294323721|gb|EFG05364.1| Alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 693
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/364 (36%), Positives = 182/364 (50%), Gaps = 49/364 (13%)
Query: 30 MGWNSWNHFHCQ--INEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
MG+N+WN HC+ +E +++ D + L + + + +L A P N
Sbjct: 74 MGFNNWNSTHCRADFHEAMVKGIADIF------VARGLKEAGYTYVNLDDCWALPERN-A 126
Query: 88 QGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF--------------- 127
+G LV + FP+G KA+ADYVH I S + + F
Sbjct: 127 EGKLVPDPVRFPNGIKAVADYVHAKGLKFGIYTSAGTKTCNRAGFPGALGHEYSDAQQFA 186
Query: 128 EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
+ G+DYLKYDNC N +RY M DAL+ GRPI +S+CEWG +P W ++ W
Sbjct: 187 DWGVDYLKYDNCNNQGVDAKLRYTTMRDALRATGRPIVYSICEWGQNKPWEWAKDVGQLW 246
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
RTT DISD+W ML+I N A +A PG WNDPDMLEVGNGGM EY HFS+W+I
Sbjct: 247 RTTGDISDNWGSMLSIMKRNLPLAPYAGPGHWNDPDMLEVGNGGMTDTEYRSHFSMWSIM 306
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS-GNRI 295
APLL+G D+ + T I+GN EVIAVNQ +Q + A L+ G+R
Sbjct: 307 AAPLLIGADLRKVPESTFEILGNREVIAVNQDRLGKQGTVVSSEGGRWVVAKELADGSRA 366
Query: 296 VVLL-ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKFQGKLTAKVDGHSCKMY 350
V L ES + TA +G+ + T ++ R L G L+A V H +
Sbjct: 367 VALFNESARPQRIATTAQ--AVGL-AQATAYKVRDLWRHGSAHTAGTLSATVPAHGTVLL 423
Query: 351 ELTP 354
+ P
Sbjct: 424 RVAP 427
>gi|384248217|gb|EIE21702.1| glycoside hydrolase [Coccomyxa subellipsoidea C-169]
Length = 351
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/332 (38%), Positives = 172/332 (51%), Gaps = 41/332 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NG +P +G+N+WN + I+E +IR T D S +H + A
Sbjct: 20 LDNGQARSPALGYNTWNAYGGDIDEDLIRATADLMVSSGLKK-----AGYHYL--VIDDA 72
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
L QG L AN FPSG K +ADYVH + S + E
Sbjct: 73 WSNLQRDDQGRLHANSDRFPSGMKTMADYVHSKGLKFGMYSDAGSHTCLGYPGSRYHEKE 132
Query: 123 SNIAF-EQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
+F + G+D+LKYDNC+ + I RY AM AL GRPI +SMC+WG P LW
Sbjct: 133 DAESFADWGVDFLKYDNCWAPASDWVIDRYTAMHTALNGTGRPILYSMCDWGVGDPWLWA 192
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
I NSWRTT DIS +W ML D + +AKPG WNDPDMLEVGN G+ E +
Sbjct: 193 PKIANSWRTTGDISPNWESMLRCLDNTIGLSKYAKPGAWNDPDMLEVGNPGLTEQEQRAN 252
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AINDQQIWAGPL 290
F++WA+ K+PL++G D+ L+ + I+ EEVIAVNQ + N +Q++AGPL
Sbjct: 253 FALWAVLKSPLMVGTDLRRLSKTALEILTAEEVIAVNQDKLGVAGDLASSNARQVYAGPL 312
Query: 291 S-GNRIVVLLESRKTFS----SMMTAHWDDIG 317
G+R VVL + + S +T W D+G
Sbjct: 313 EDGSRAVVLFNRHTSGTQYPLSNVTVQWQDLG 344
>gi|313204282|ref|YP_004042939.1| alpha-galactosidase [Paludibacter propionicigenes WB4]
gi|312443598|gb|ADQ79954.1| Alpha-galactosidase [Paludibacter propionicigenes WB4]
Length = 404
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 179/351 (50%), Gaps = 52/351 (14%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
GL LTPPMGWNSWN F C ++E++IRET D + + + D Q++
Sbjct: 26 QGLALTPPMGWNSWNKFACNVSEELIRETADAMVATGMKDA---GYEYIVIDDCWQVSRD 82
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI------------ESN 124
L G +VA+ FPSG KALADYVH I + N+ +
Sbjct: 83 SL-----GFIVADPKRFPSGIKALADYVHAKGLKFGIYSCAGNKTCGGRPAGRGHEYQDA 137
Query: 125 IAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
+++ + G+DYLKYD C ++ Y M DAL AGRPI FS+CEWG+ +P LW +
Sbjct: 138 LSYAKWGVDYLKYDWCDTENLNAIGAYTTMRDALFAAGRPIVFSLCEWGNNKPELWAKEV 197
Query: 184 RNSWRTTDDISD---------SWAR--MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
+ WRTT DI + +W ++ I D + +A PG WNDPDMLEVGN GM
Sbjct: 198 GHLWRTTGDIYNCFDCVQNNGTWKAFGVMQIMDKQDGLRKNAGPGHWNDPDMLEVGN-GM 256
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AIND 282
NE HFSIW + APL+ G D+ ++ AET+ ++ N+EVIA++Q A+ D
Sbjct: 257 SLNESRAHFSIWCMMAAPLIAGNDLRSMNAETLTVLSNKEVIAIDQDSLGVQGFRHAVKD 316
Query: 283 Q-QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLET 332
+ W PL V L +R T ++ W + + +RTL T
Sbjct: 317 SVETWLKPLKNGEWAVCLLNRSTTPRVVALDWKTFSVIDS---LSNRTLNT 364
>gi|254393286|ref|ZP_05008436.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|197706923|gb|EDY52735.1| alpha-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 517
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/364 (36%), Positives = 182/364 (50%), Gaps = 49/364 (13%)
Query: 30 MGWNSWNHFHCQ--INEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
MG+N+WN HC+ +E +++ D + L + + + +L A P N
Sbjct: 151 MGFNNWNSTHCRADFHEAMVKGIADIF------VARGLKEAGYTYVNLDDCWALPERN-A 203
Query: 88 QGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNRIESNIAF--------------- 127
+G LV + FP+G KA+ADYVH I S + + F
Sbjct: 204 EGKLVPDPVRFPNGIKAVADYVHAKGLKFGIYTSAGTKTCNRAGFPGALGHEYSDAQQFA 263
Query: 128 EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
+ G+DYLKYDNC N +RY M DAL+ GRPI +S+CEWG +P W ++ W
Sbjct: 264 DWGVDYLKYDNCNNQGVDAKLRYTTMRDALRATGRPIVYSICEWGQNKPWEWAKDVGQLW 323
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
RTT DISD+W ML+I N A +A PG WNDPDMLEVGNGGM EY HFS+W+I
Sbjct: 324 RTTGDISDNWGSMLSIMKRNLPLAPYAGPGHWNDPDMLEVGNGGMTDTEYRSHFSMWSIM 383
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS-GNRI 295
APLL+G D+ + T I+GN EVIAVNQ +Q + A L+ G+R
Sbjct: 384 AAPLLIGADLRKVPESTFEILGNREVIAVNQDRLGKQGTVVSSEGGRWVVAKELADGSRA 443
Query: 296 VVLL-ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKFQGKLTAKVDGHSCKMY 350
V L ES + TA +G+ + T ++ R L G L+A V H +
Sbjct: 444 VALFNESARPQRIATTAQ--AVGL-AQATAYKVRDLWRHGSAHTAGTLSATVPAHGTVLL 500
Query: 351 ELTP 354
+ P
Sbjct: 501 RVAP 504
>gi|384249058|gb|EIE22540.1| Melibiase family protein [Coccomyxa subellipsoidea C-169]
Length = 428
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 134/383 (34%), Positives = 193/383 (50%), Gaps = 55/383 (14%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
+ L NGL TPPMG+N+WN FH +I+EK++RE+ + S + ++ D
Sbjct: 19 KYCLDNGLARTPPMGYNTWNAFHEEIDEKMVRESAEILISSGLAAA---GYNYFNLDDGW 75
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ + + +G + N T FPSG ALAD++H + SL +
Sbjct: 76 NMKTRGV----EGPMAVNSTRFPSGIPALADWLHSKGLKLGVYSDAGSMTCARFAASLGH 131
Query: 120 RIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
E F E G+D+LKYDNCF RY AM DAL GRP+ FSMCEWG P +
Sbjct: 132 EKEDAKVFAEWGVDFLKYDNCFATPMSVRARYEAMRDALNATGRPMLFSMCEWGVSSPWV 191
Query: 179 WGSNIRNSWRTTDDIS----DSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
+G + ++WRTT DIS +WA ++ D A A PGGWND DMLE + Y
Sbjct: 192 YGHEVGHAWRTTKDISMEIEATWADVVENLDETAGLARFAGPGGWNDADMLE---ATLTY 248
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----AINDQ------- 283
E HF++WA+ K+PLL+G D+ L E + ++ + E+IA+NQ D+
Sbjct: 249 TEQRSHFALWALIKSPLLIGADLRKLKKEDLLLLKSREIIAINQDPLGVAGDRVWKQGPY 308
Query: 284 QIWAGPLSGNRIVVLLESRKTFSSM------MTAHWDDIGIKSN-----NTLFEHRTLET 332
++WA PL G V++ +R S MT HW IG + L++ R L
Sbjct: 309 EVWAAPLLGGARAVVMFNRHVASEEKFEEHNMTLHWSMIGYPVDMQVVVRDLYKERDL-G 367
Query: 333 KFQGKLTAKVDGHSCKMYELTPV 355
++ G+LT VD H +L+PV
Sbjct: 368 RYTGELTELVDAHGVLALKLSPV 390
>gi|388501780|gb|AFK38956.1| unknown [Lotus japonicus]
Length = 203
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/195 (49%), Positives = 134/195 (68%), Gaps = 17/195 (8%)
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
PA W ++ NSWRTT DI D W M++ AD+N+ +A +A PGGWNDPDMLEVGNGGM
Sbjct: 9 PATWARSVGNSWRTTGDIEDKWDSMISRADLNDKWASYAGPGGWNDPDMLEVGNGGMTTE 68
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQ 284
EY HFSIWA++KAPLL+GCD+ +L TM ++ N EVIAVNQ + +D +
Sbjct: 69 EYRAHFSIWALAKAPLLIGCDIRSLDHATMELLSNSEVIAVNQDKLGVQGKKVKSESDLE 128
Query: 285 IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLT 339
+WAGPLS NR+ V+L +R + + +TA W DIG++S + L+EH T ++ G+++
Sbjct: 129 VWAGPLSNNRLAVILWNRSSSKATVTASWSDIGLESGTSVDARDLWEHST-KSSISGEMS 187
Query: 340 AKVDGHSCKMYELTP 354
A++D H+CKMY LTP
Sbjct: 188 AELDSHACKMYVLTP 202
>gi|256397018|ref|YP_003118582.1| alpha-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256363244|gb|ACU76741.1| Alpha-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 535
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 131/385 (34%), Positives = 175/385 (45%), Gaps = 82/385 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL LTPPMGWN WN F C ++E ++ +T DDCW S+RD
Sbjct: 37 LGNGLALTPPMGWNDWNSFGCNVSESLVEQTADLIVSSGMKDAGYQYVNIDDCWMSSNRD 96
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
+ GNLV + FP G A YVH
Sbjct: 97 AG--------------------------GNLVPDPAKFPDGISGTAAYVHSKGLKLGIYE 130
Query: 111 -------ILISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
SL++ + ++N G+DYLKYDNC N RY AM DAL K GR
Sbjct: 131 SAGTATCAGYPGSLNHEQADANSFASWGVDYLKYDNCNNQGIPAQTRYTAMRDALAKTGR 190
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI +S+C WG WG+ + N WRTT DIS ++ ML+ A A PGGWNDP
Sbjct: 191 PIVYSLCNWGQESVWTWGAGVGNLWRTTGDISANFGSMLSNFHNTVGLASSAGPGGWNDP 250
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGN GM + E S+WA APL+ G D+ T T+ + N++VIAV+Q
Sbjct: 251 DMLEVGN-GMSFTEDRAEMSLWAEMAAPLISGTDLRKATTATLSLYTNKDVIAVDQDSLG 309
Query: 279 -------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NTLFE 326
+ + A PL+ + V L + + + ++ IGI S N L+
Sbjct: 310 KAGREIASSGGADVLAKPLANGDVAVALFNENSSAQTISTSASAIGIGSASSYKLNNLWS 369
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYE 351
H + T G ++A V GH +Y
Sbjct: 370 H--VLTSTSGSISASVPGHGVVLYR 392
>gi|302528738|ref|ZP_07281080.1| ribosomal protein S32 [Streptomyces sp. AA4]
gi|302437633|gb|EFL09449.1| ribosomal protein S32 [Streptomyces sp. AA4]
Length = 417
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 133/393 (33%), Positives = 176/393 (44%), Gaps = 85/393 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+ TPP+GWNSWN + C I++ I+ DDCW S+RD
Sbjct: 33 LDNGVAQTPPLGWNSWNAYRCGIDQTKIKAAADAAVSRGLKDAGYRYVNIDDCWQASTRD 92
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISN 115
+Q GNL + FP G KALADYVH + I
Sbjct: 93 AQ--------------------------GNLRPDPARFPDGIKALADYVHGKGLKLGIYA 126
Query: 116 SLDNRIESNI-------------------AFEQ-GIDYLKYDNCF--NDDTRPTIRYRAM 153
+ R +NI F G DYLKYD C D + AM
Sbjct: 127 TPGTRTCANIWDNYPGTLGSKGHEAQDAQTFASWGADYLKYDWCQADRDGVDAKKAFTAM 186
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
DAL GRPI +S+ + W + NSWRTT DI +W+ +++I D +
Sbjct: 187 RDALAATGRPIVYSIHREPQLPVESWRPQVANSWRTTADIRPTWSSLMSILDNQVGLERY 246
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
+KPG WNDPDML+VGNG + E HFS+WA+ APLL G D+ ++ T ++ N +V
Sbjct: 247 SKPGAWNDPDMLQVGNGSLTAEENRAHFSLWALLSAPLLAGNDLSAMSEATRQVLANTKV 306
Query: 274 IAVNQ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
IAVNQ +QQIW PLS V+L +R S+ TA D+G
Sbjct: 307 IAVNQDWAGSQGVRIRGGEQQIWRKPLSDGSQAVVLLNRSATSASFTASAGDLGFPGRTD 366
Query: 324 LFEH---RTLETKFQGKLTAKVDGHSCKMYELT 353
L R T G +TA V GH MY+++
Sbjct: 367 LTAEDLWRATSTAVAGSVTATVPGHGVVMYKVS 399
>gi|198274385|ref|ZP_03206917.1| hypothetical protein BACPLE_00530 [Bacteroides plebeius DSM 17135]
gi|198272751|gb|EDY97020.1| alpha-galactosidase [Bacteroides plebeius DSM 17135]
Length = 411
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 119/321 (37%), Positives = 167/321 (52%), Gaps = 51/321 (15%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN F C ++E +++E D ES + + D Q+
Sbjct: 38 NSLAQTPPMGWNSWNKFGCDVSESLMKEMADAMVESGMKDA---GYEYIVIDDCWQVGRD 94
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIES--------------NIAF 127
L GN++ + FP+G KALADY+H S L I S F
Sbjct: 95 SL-----GNIIPDPVRFPNGIKALADYIH---SKGLKLGIYSCAGSYTCQGRPGSRGYQF 146
Query: 128 EQ-------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
+ G+DYLKYD C N+ Y+ MSDA+K +GRPI FS+CEWG+ +P WG
Sbjct: 147 QDARQYAAWGVDYLKYDWCSNEGQNARAAYQTMSDAIKLSGRPIVFSICEWGENQPWKWG 206
Query: 181 SNIRNSWRTTDDISDSWARM-----LTIADMNEVYAD---HAKPGGWNDPDMLEVGNGGM 232
I + WR T DI D + + + D+ ++ AD +A PG WND +MLE+GNGGM
Sbjct: 207 KGIGHMWRVTPDIRDCYQCKFDWGGVGVLDIIDIMADLYPYAGPGHWNDAEMLEIGNGGM 266
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
+EY+ HFS+W + PL+ G D+ + ET I+ N+EVIAVNQ +
Sbjct: 267 TRDEYMTHFSMWCMLATPLMAGNDLRKMDTETKEILTNKEVIAVNQDKLGQQARRFMDMG 326
Query: 282 DQQIWAGPLSGNRIVVLLESR 302
+++IWA PL+ + V +R
Sbjct: 327 EKEIWAKPLANGELAVCFLNR 347
>gi|256375186|ref|YP_003098846.1| alpha-galactosidase [Actinosynnema mirum DSM 43827]
gi|255919489|gb|ACU35000.1| Alpha-galactosidase [Actinosynnema mirum DSM 43827]
Length = 693
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 113/278 (40%), Positives = 146/278 (52%), Gaps = 31/278 (11%)
Query: 24 LGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
L TPPMG+N+WN C + + +IR D + +SL ++ +
Sbjct: 62 LAPTPPMGFNNWNSTQCGPEFTDSMIRGIADLF--------LSLGLKDAGYEYVNIDDCW 113
Query: 82 PLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAFEQ------ 129
LP + G+LV + FP G K L DYVH I S R S F
Sbjct: 114 ALPQRDADGDLVPDPVRFPEGMKPLVDYVHSKGLKFGIYTSAGTRTCSERGFPGALGHER 173
Query: 130 ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
G+DYLKYDNC N +RYRAM DA+ GRPI S+CEWG+ RP W
Sbjct: 174 QDAALFASWGVDYLKYDNCHNQGVDARLRYRAMRDAIAATGRPIVLSVCEWGENRPWEWA 233
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ WRTT DI DSW +L IA N A+HA P WNDPDMLEVGNGG+ + E H
Sbjct: 234 FEVGQLWRTTPDIRDSWDSVLEIAKANMALAEHAGPNRWNDPDMLEVGNGGLTWEECRTH 293
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
FS+WA+ APLL+G D+ ++ E + I+ N EVIA++Q
Sbjct: 294 FSLWAMMAAPLLIGVDLRSVAPEAVEILTNREVIALDQ 331
>gi|346225126|ref|ZP_08846268.1| alpha-galactosidase [Anaerophaga thermohalophila DSM 12881]
gi|346227111|ref|ZP_08848253.1| alpha-galactosidase [Anaerophaga thermohalophila DSM 12881]
Length = 405
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 132/355 (37%), Positives = 173/355 (48%), Gaps = 61/355 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L LTPP+GWNSWN F C INE+IIR+ D ES + + D L
Sbjct: 27 LALTPPLGWNSWNKFACNINEEIIRDIADAMVESGMKDA---GYEYIVIDDCWHGGRDSL 83
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------------- 129
G + + T FP+G KALADYVH S L I S+ +
Sbjct: 84 -----GFIYPDFTRFPNGMKALADYVH---SKGLKLGIYSDAGTKTCGGYPGSRGYEYQD 135
Query: 130 -------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
GIDYLKYD C ++ P Y M DAL KAGRP+ FSMCEWG+ P W ++
Sbjct: 136 ALQYAKWGIDYLKYDWCNTENINPVGAYTTMRDALYKAGRPVLFSMCEWGNNEPWKWAAD 195
Query: 183 IRNSWRTTDDISDSWA------------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
I + WRTT DI W ++ I DM E +A PG WNDPDMLEVGN
Sbjct: 196 IAHMWRTTGDIFICWDCKHQHEGGWYSWGVMKILDMQEGLRKYAGPGHWNDPDMLEVGN- 254
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
G+ NE HFS+WA+ APL+ G D+ N++ ET+ I+ NEEVIA++Q +
Sbjct: 255 GLTVNEDRAHFSMWAMLAAPLIAGNDLRNMSEETIEILTNEEVIAIDQDSLGVQGFKYSS 314
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKF 334
+ + W PL V +R+ +++ W K+ N + E TKF
Sbjct: 315 KDSLETWFKPLKNGEWAVCFLNRRAEPAVVEFDW-----KNENVMDEIFGYVTKF 364
>gi|386850979|ref|YP_006268992.1| ricin B lectin [Actinoplanes sp. SE50/110]
gi|359838483|gb|AEV86924.1| ricin B lectin [Actinoplanes sp. SE50/110]
Length = 525
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 134/374 (35%), Positives = 182/374 (48%), Gaps = 52/374 (13%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L N L LTPPMGWN WN + C ++E ++++T DDCW + SRD
Sbjct: 29 LPNNLALTPPMGWNDWNAYGCNVSEALVKQTADKIVSAGLATAGYQYVNIDDCWMQKSRD 88
Query: 61 SQISLSCSFHMF-DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDN 119
+ +L F D + AA G + +GT A Y L + D
Sbjct: 89 AAGNLQPDLGKFPDGIAGTAA-----YVHGKGLKLGIYEDAGTATCAGYPGSLGHEAQDA 143
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTI----RYRAMSDALKKAGRPIFFSMCEWGDM 174
R +F G+DYLKYDNC N + T RY AM DAL GRPI +S+CEWG
Sbjct: 144 R-----SFAAWGVDYLKYDNCNNSGSTTTAQYIARYSAMRDALAATGRPIVYSICEWGVN 198
Query: 175 RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
P W ++ N WRTT DI ++A ML+I N A +A PG WNDPDMLEVGN GM
Sbjct: 199 APWTWAGDVGNLWRTTGDIQANYASMLSIFHQNVGLAAYAGPGRWNDPDMLEVGN-GMTA 257
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---------- 284
E HFS+WA APLL G ++ +A T+ I+GN VIAV+Q +Q
Sbjct: 258 TEDRAHFSLWAEMAAPLLAGNNLVTASATTLSILGNRSVIAVDQDSAGRQGHLVSSTGGL 317
Query: 285 -IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKFQGKLT 339
+ A PL+ + V+L + ++ ++ IG KS + + L G ++
Sbjct: 318 DVLAKPLANGDVSVVLFNENAATATISTTVAAIG-KSGASSYGLADLWAGTSGSTTGTIS 376
Query: 340 AKVDGHSCKMYELT 353
A V GH MY ++
Sbjct: 377 AGVPGHGVVMYRVS 390
>gi|2204226|emb|CAA74160.1| alpha-galactosidase [Hordeum vulgare subsp. vulgare]
Length = 204
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 101/205 (49%), Positives = 136/205 (66%), Gaps = 19/205 (9%)
Query: 167 SMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML 225
S+CEWG+ PA W + NSWRTT DI+D+W M + AD N+ +A +A PGGWNDPDML
Sbjct: 1 SLCEWGNENPATWARGMGGNSWRTTGDIADNWGSMTSRADQNDRWASYAGPGGWNDPDML 60
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN-------- 277
EVGNGGM EY HFSIWA++KAPLL+GCDV ++T +T II N+EVIAVN
Sbjct: 61 EVGNGGMSEAEYRSHFSIWALAKAPLLIGCDVRSMTPQTKTIISNQEVIAVNQDRLGVQG 120
Query: 278 ---QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRT 329
Q+ ++WAG LSGNR V+L +R+ + + +TAHW ++G+ ++ + L+ H +
Sbjct: 121 KKVQSDGGLEVWAGLLSGNRKAVVLWNRQGYQATITAHWSNVGLPASASVTARDLWAHSS 180
Query: 330 LETKFQGKLTAKVDGHSCKMYELTP 354
QG+L+A V H CKMY LTP
Sbjct: 181 FSA--QGQLSASVGPHDCKMYILTP 203
>gi|315498046|ref|YP_004086850.1| alpha-galactosidase [Asticcacaulis excentricus CB 48]
gi|315416058|gb|ADU12699.1| Alpha-galactosidase [Asticcacaulis excentricus CB 48]
Length = 404
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 172/371 (46%), Gaps = 54/371 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
PPMGWNSWN + C I E I+R+ D S + + D Q +
Sbjct: 43 APPMGWNSWNKYACNITEDIVRKQADAMAASGLKEA---GYQYIVIDDCWQKS-----RD 94
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA--------------FEQ--- 129
GN+ + FPSG KAL DYVH S L + S+ F+
Sbjct: 95 ADGNIQVDLERFPSGMKALIDYVH---SKGLKFGLYSDAGSLTCGGRPGSAGHEFQDARQ 151
Query: 130 ----GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
G+DYLKYD C+ Y M+ AL+ +GR I S+CEWGD P W + I +
Sbjct: 152 YARWGVDYLKYDWCYTGTRDAEAAYTIMAKALRSSGRDIVLSICEWGDNYPQRWAAPIGH 211
Query: 186 SWRTTDDISDSWA-------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
WRTT DI D+W M+ I D ++ P WNDPDMLEVGNGGM EY
Sbjct: 212 LWRTTGDIYDAWEGKKGYSIGMVNILDKQVDLWRYSGPNRWNDPDMLEVGNGGMTTTEYE 271
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWA 287
HFS+WA+ APL+ G D+ N+ A+T+ I+ N +VIAV+Q QQ +WA
Sbjct: 272 SHFSLWAMLAAPLIAGNDLSNMDADTLRILTNTDVIAVDQDPLGQQAKRIWKEGDLEVWA 331
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDI----GIKSNNTLFEHRTLETKFQGKLTAKVD 343
PL G V+L +R + M+ W+ + G+K + + + + V
Sbjct: 332 RPLKGGDQAVVLFNRSAAPAEMSVTWEQLNLPAGLKVQVKDLWSKKVTKNVKARFGGTVA 391
Query: 344 GHSCKMYELTP 354
H M LTP
Sbjct: 392 SHGVIMARLTP 402
>gi|403340526|gb|EJY69549.1| Glycosyl hydrolase family-like protein [Oxytricha trifallax]
Length = 392
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 174/344 (50%), Gaps = 47/344 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQI------SLSCSFHMFDS 74
+NG G TP MGWN+WN + C I+E II+ S +QI L + D
Sbjct: 18 NNGAGKTPAMGWNTWNKYACNISEDIIK---------SNANQIIALGLDQLGYKYVNIDD 68
Query: 75 LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN---------- 124
Q+ A+ N Q A+ T F +G KA+ D++H S SL I S+
Sbjct: 69 CWQLPARDKDNHVQ----ADTTRFSNGMKAVGDFLH---SKSLKFGIYSSAGTMTCQQKA 121
Query: 125 --IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
+ FE G+DYLKYDNC+N RY AM DAL+K GR I++S+C WG+
Sbjct: 122 GSLGFEDIDAADYASWGVDYLKYDNCYNKMVPAVQRYTAMRDALQKTGRQIYYSICNWGE 181
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
W +I NSWRTT+DI + WA M N + + A PGGWNDPDMLE+GNGG+
Sbjct: 182 EETWKWAKDIGNSWRTTNDIQNKWASMRENFKWNAQHPEIAGPGGWNDPDMLEIGNGGLT 241
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQQIWAGPLSG 292
E HF++W+ +KAPL+LG D+ +T + + II N I VNQ + Q +G
Sbjct: 242 PLEEKTHFALWSFAKAPLILGNDLTKMTPDQLSIISNTNFINVNQDSFGQQAKCVQGCNG 301
Query: 293 NRIVV---LLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK 333
I V + ++ ++ +W D K+ F+ + TK
Sbjct: 302 GDIEVYQSFQNDKGSYYGLLVVNWSDTQSKAILINFQIAGVTTK 345
>gi|409195670|ref|ZP_11224333.1| alpha-galactosidase [Marinilabilia salmonicolor JCM 21150]
Length = 404
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 136/376 (36%), Positives = 177/376 (47%), Gaps = 57/376 (15%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
GL TPPMGWNSWN F C +NE++IR D ES + + D
Sbjct: 26 GLADTPPMGWNSWNKFACDVNEEMIRGIADAMVESGMKDA---GYEYVVIDDCWHGGRDS 82
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA---------------- 126
L G + A+ FP+G KALADYVH S L I S+
Sbjct: 83 L-----GFIYADSAKFPNGMKALADYVH---SKGLKLGIYSDAGTKTCAGYPGSRGYEYQ 134
Query: 127 -----FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
E GIDYLKYD C ++ P Y M DAL KAGRPIFFSMCEWGD +P W
Sbjct: 135 DALQYAEWGIDYLKYDWCNTENVNPIGAYSTMRDALYKAGRPIFFSMCEWGDNQPWEWAG 194
Query: 182 NIRNSWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
++ + WRTT DI W + I DM + +A PG WNDPDMLEVGN
Sbjct: 195 DVGHMWRTTGDIYLCWDCKHDHGGYYSWGVTRILDMQDGLRKYAGPGHWNDPDMLEVGN- 253
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
GM NE HFS+WA+ APL+ G D+ ++T ET+ I+ N+EVIAV+Q +
Sbjct: 254 GMPVNEDRAHFSMWAMLAAPLISGNDLRDMTPETIEILTNDEVIAVDQDSLGVQGFKYGS 313
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK-- 337
+ + W PL V +R + W + +E R E + +
Sbjct: 314 EDSLETWFKPLDNGEWAVCFFNRGQEDVRLRFDWKKENVVDEIFGYEARFEEITYAMRNL 373
Query: 338 LTAKVDGHSCKMYELT 353
T K +G + K ++ T
Sbjct: 374 WTKKSEGKTEKAFKGT 389
>gi|345304436|ref|YP_004826338.1| alpha-galactosidase [Rhodothermus marinus SG0.5JP17-172]
gi|345113669|gb|AEN74501.1| Alpha-galactosidase [Rhodothermus marinus SG0.5JP17-172]
Length = 403
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/384 (35%), Positives = 181/384 (47%), Gaps = 65/384 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWNHF C INE+I+RE + R + + D Q L
Sbjct: 25 LARTPPMGWNSWNHFGCNINEQIVREVAEA---MVRSGMRDAGYEYVIIDDCWQGERDSL 81
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIA 126
G + + FPSG KALADY+H L S + + +
Sbjct: 82 -----GFIQPDPERFPSGMKALADYIHSLGLKFGIYSDAGDRTCAGRPGSRGHEYQDALT 136
Query: 127 FEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+ + G+DYLKYD C ++ P Y M DAL AGRPI FS+CEWG P WG I +
Sbjct: 137 YARWGVDYLKYDWCHTENLNPIGAYTTMRDALYAAGRPIVFSICEWGTNEPWKWGRTIGH 196
Query: 186 SWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
WRTT DI++ W +L I D + +A PG WNDPDMLEVGN GM+
Sbjct: 197 LWRTTGDITNCWDCVIDHGTWKSWGILQILDRQDGLRIYAGPGHWNDPDMLEVGN-GMRV 255
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---------- 284
+E HFS+WA+ APL+ G DV +++ T I+ N EVIAV+Q Q
Sbjct: 256 SEDRAHFSMWAMLAAPLIAGNDVRHMSETTRAILTNREVIAVDQDTLGIQGFPYRREAGV 315
Query: 285 -IWAGPLSGNRIVVLLESRK------TFSSMMTAHWDDIGIKSN---------NTLFEHR 328
IW PL+G + + +R TF+ T +DD + L+ HR
Sbjct: 316 EIWFRPLAGGDWAMAILNRTETPRTVTFNFRETYIFDDFSRRGTFFDRITYRLRDLWAHR 375
Query: 329 TLETKFQGKLTAKVDGHSCKMYEL 352
+ T + LT +V GH M L
Sbjct: 376 DIGTT-ETPLTVEVPGHDVVMLRL 398
>gi|403354222|gb|EJY76664.1| Glycosyl hydrolase family-like protein [Oxytricha trifallax]
Length = 432
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 179/348 (51%), Gaps = 85/348 (24%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGWNSWN F+C ++E +I++T DDCW E++RDS+
Sbjct: 1 MGWNSWNKFYCDVSETLIKQTADKLVELGLDKLGYNYVNVDDCWMEANRDSK-------- 52
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILI-------------- 113
G++V ++ FP+G KA+A+Y+H +L
Sbjct: 53 ------------------GHMVPDRKNFPNGMKAVAEYIHEKRLLFGLYSSAGTMTCQKR 94
Query: 114 SNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+ SL++ I++ + +DYLKYDNC+N++ RY M DAL + RPIF+S+C WG
Sbjct: 95 AGSLNHEDIDAQDFADWQVDYLKYDNCYNENVPAIKRYTKMRDALIQTARPIFYSICNWG 154
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
D W + NSWRTT DI +W + + N AD A PGGWNDPDM+E+GNG +
Sbjct: 155 DEDTPSWAPEVGNSWRTTLDIEMNWQTIERNIEQNNRRADVAGPGGWNDPDMMEIGNGVL 214
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------------- 278
+ + HF++WA KAPL++GCD+ + +++ I+ N+++I +NQ
Sbjct: 215 NHEQEKTHFALWAAVKAPLIIGCDLAKIDKKSLEILKNQQLIDINQDPLGVQAKCVQNCA 274
Query: 279 AINDQ-------QIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGI 318
ND+ Q++AGPL +G+ +V+++ ++ D IG+
Sbjct: 275 KKNDRIAVQQNIQVFAGPLENGDTVVIIVNWNNKPKALTNIKLDQIGL 322
>gi|3777481|dbj|BAA33931.1| alpha-galactosidase [Umbelopsis vinacea]
Length = 396
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 139/395 (35%), Positives = 189/395 (47%), Gaps = 92/395 (23%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
L TP MGWNSWN + C +NE +I T DDCW
Sbjct: 26 LAKTPQMGWNSWNKYQCNVNETVIINTANAMVSSGLKDLGYHYINIDDCW---------- 75
Query: 65 LSCSFHMFDSLTQ-IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ISNSL 117
S H D+ TQ IA P T FP+G +A VH L S++
Sbjct: 76 ---SLHQRDNTTQRIAPDP-------------TKFPNGISGVASKVHALGLKFGIYSDAG 119
Query: 118 DNR------------IESNIAFEQGIDYLKYDNCFN-----DDTRPTIRYRAMSDALKKA 160
N I++ + G+DYLKYDNC N + T + RY+ M DALK
Sbjct: 120 TNTCAGYPGSYGYEAIDAQAFSDWGVDYLKYDNCNNLGLAGNATISSKRYKRMGDALKNV 179
Query: 161 GRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
RPIFFS+C WG WG +I SWR + DISD+W+ +++I A+ + PGGW
Sbjct: 180 SRPIFFSLCSWGTDDVWDWGRSIGGQSWRMSGDISDNWSSVVSITGQAVPIANISAPGGW 239
Query: 220 NDPDMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
ND DMLEVG + M EY HFSIWA K+PL+LG D+ N+T + II N EVIA++Q
Sbjct: 240 NDMDMLEVGVHDQMTITEYTSHFSIWAAMKSPLILGNDITNMTNDIKNIITNNEVIAISQ 299
Query: 279 -----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI--GIKSNN--- 322
+ Q++AGPLS N V L + ++ MT W +I K+N
Sbjct: 300 DSLGASVQQRSMKGNTQLFAGPLSKNGYVSLFLNEGNSTTNMTGTWSEIFNNPKANTHKS 359
Query: 323 ----TLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+ H+ L +FQG ++ V+ H +M + +
Sbjct: 360 IAVRDLWAHKDLG-RFQGSISVSVESHGVRMLKFS 393
>gi|220928664|ref|YP_002505573.1| family 6 carbohydrate binding protein [Clostridium cellulolyticum
H10]
gi|219998992|gb|ACL75593.1| Carbohydrate binding family 6 [Clostridium cellulolyticum H10]
Length = 604
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 131/375 (34%), Positives = 181/375 (48%), Gaps = 52/375 (13%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL TPPMGWNSWN FH INE I++ D S + + +++ +L
Sbjct: 30 NGLAKTPPMGWNSWNIFHGDINETKIKQIADTMVSSG------MKEAGYVYLNLDDNWMA 83
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNRIE 122
GNL A+ T FPSG +ALADYVH I S S +
Sbjct: 84 NPARDSNGNLRADPTRFPSGIRALADYVHAKGLKLGIYGCRGTMTCMNIPQSGSKGYEDK 143
Query: 123 SNIAFEQ-GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
F GIDYLKYDNC + + Y+ M AL GRPI FS+C WG W
Sbjct: 144 DAKTFASWGIDYLKYDNCNIPNGSDMKTDYQKMQTALANCGRPIVFSICAWGYQS---WM 200
Query: 181 SNIRNSWRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
N WRTT DI+D W ++ D N Y A PG WNDPDMLE+GNGG
Sbjct: 201 PATGNLWRTTGDIADKWDNGNEWFKGIINAIDGNAQYTSSAAPGAWNDPDMLEIGNGGCT 260
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
EY S+W++ +PL+ G D+ ++ T I+ N+EVIA++Q + N
Sbjct: 261 TEEYRTQMSMWSMMASPLIAGNDIRTMSQTTKDILLNKEVIAIDQDPAGVQGKRVKSANG 320
Query: 283 QQIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE--HRTLETKFQGKL 338
+IW PL +G V L +R + +S +T +W DIG+ + T+ + ++ + F G
Sbjct: 321 LEIWVKPLGTNGTTKAVALLNRNSATSNITVNWSDIGVSGSVTVRDLWAKSDKGSFTGSY 380
Query: 339 TAKVDGHSCKMYELT 353
TA V H + +++
Sbjct: 381 TASVPSHGTVLIKIS 395
>gi|387790700|ref|YP_006255765.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
gi|379653533|gb|AFD06589.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
Length = 396
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 121/334 (36%), Positives = 171/334 (51%), Gaps = 56/334 (16%)
Query: 9 ADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS 68
+ +NG NL PPMGWNSWN F C+INEKII+E D + + +
Sbjct: 18 SQANGQKFENLAER-----PPMGWNSWNKFGCEINEKIIKEVADA---MTSNGMKAAGYE 69
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN--IA 126
+ + D QI L GN++A+ FPSG +L DYVH +N L I S+ A
Sbjct: 70 YIVIDDCWQIGRDSL-----GNILADPDRFPSGISSLVDYVH---ANGLKFGIYSDAGTA 121
Query: 127 FEQG-------------------IDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
QG +DYLKYD CF+ Y M DA+ KAGRP+ S
Sbjct: 122 TCQGRPGSRGYEFQDARTYAKWNVDYLKYDWCFHGKQNAEASYTLMRDAIYKAGRPMVLS 181
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDI------SDSWARM--LTIADMNEVYADHAKPGGW 219
+CEWG +P WG N+ + WRTT+DI ++W + L I D++ +++ PG W
Sbjct: 182 ICEWGTNKPWEWGKNVGHLWRTTEDIINCFDCKNNWGGLGVLQIIDLHTEIGEYSGPGHW 241
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDMLE+GNG + E +H S+WA+ APL+ G D+ N++AET+ ++ N+EV+ ++Q
Sbjct: 242 NDPDMLEIGNGVLTPAEERLHLSMWAMFSAPLMAGNDIRNMSAETLKLLTNKEVLEIDQD 301
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESR 302
D +IW PLS N +R
Sbjct: 302 KLGISATRWMKYGDLEIWFKPLSDNNYAFCFINR 335
>gi|395772523|ref|ZP_10453038.1| ricin B lectin [Streptomyces acidiscabies 84-104]
Length = 545
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 130/383 (33%), Positives = 177/383 (46%), Gaps = 77/383 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL LTP MG+N WN + C ++E +I+ T DDCW +RD
Sbjct: 46 LGNGLALTPQMGFNDWNAYGCNVSESLIKSTAQAMHTNGMQAAGYTYVNIDDCWMTHNRD 105
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
+ G LV + FP G K ADYVH L
Sbjct: 106 AA--------------------------GRLVPDPAKFPDGIKGTADYVHSLGLKLGIYE 139
Query: 113 ---------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
SL + +F G+DYLKYDNC N RY AM DAL GR
Sbjct: 140 DAGTATCAGYPGSLGHETTDAQSFASWGVDYLKYDNCNNTGAPARNRYTAMRDALAATGR 199
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI +S+C WG WG+++ NSWRTT DIS +++ ML+I N A +A PG WNDP
Sbjct: 200 PILYSLCNWGQDNVWTWGADVGNSWRTTGDISANFSSMLSIFHSNVGLASYAGPGHWNDP 259
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DMLEVGNG M E FS+WA APL+ G ++ +A+T+ ++ N VIAV+Q
Sbjct: 260 DMLEVGNGSMTATENRSEFSLWAEMAAPLIAGTNIPQASADTLAVLTNSRVIAVDQDPLG 319
Query: 279 -------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI--KSNNTLFEHRT 329
+ + + A PL+G + V L + ++ ++ IG S+ TL + +
Sbjct: 320 KQGTLVSSSGGRDVLAKPLAGGDVSVALFNETGSTTTISTTAGAIGKTGASSYTLTDLWS 379
Query: 330 LETK-FQGKLTAKVDGHSCKMYE 351
T G ++A V H MY
Sbjct: 380 GATSTTSGTISASVPAHGTVMYR 402
>gi|29827169|ref|NP_821803.1| alpha-galactosidase [Streptomyces avermitilis MA-4680]
gi|29604267|dbj|BAC68338.1| putative secreted alpha-galactosidase [Streptomyces avermitilis
MA-4680]
Length = 536
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 129/387 (33%), Positives = 186/387 (48%), Gaps = 82/387 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL TPPMGWN WN F C + E+++++T DDCW S+R+
Sbjct: 39 LDNGLARTPPMGWNDWNAFGCNVTEQLVKQTADYLVSSGLKGAGYQYVNIDDCWMTSARN 98
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISN 115
S G LV + FP G A YVH + I
Sbjct: 99 SV--------------------------GQLVPDPVKFPDGISGTAAYVHSKGLKLGIYE 132
Query: 116 SLDNRI----ESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
S ++ EQ G+DYLKYDNC + + RY AM DAL GR
Sbjct: 133 SAGTATCQGYPGSLGHEQTDADSFASWGVDYLKYDNCNHQNVPDQQRYTAMRDALVNTGR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI +S+C WG WG+ + NSWRTTDDI+ +++ +++I N A +AKPG WNDP
Sbjct: 193 PIVYSLCNWGLASVWTWGAGVGNSWRTTDDINVNFSTVVSIYKANVKLAPYAKPGAWNDP 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIND 282
DMLEVGN GM + E HFS+W+ APL+ G D+ +A T+ + GN++VIAV+Q
Sbjct: 253 DMLEVGN-GMSFTEDRSHFSLWSEMAAPLIAGTDLRKASAATLFLYGNKDVIAVDQDSLG 311
Query: 283 QQ-----------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFE 326
+Q + PL+ + V+L + + ++ +T G+ + ++ L+
Sbjct: 312 KQGTEVSSSGGLHVLTKPLANGDVSVVLFNENSSAATITTSATAAGLPAASSYRLDNLWS 371
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELT 353
H T G ++A V GH MY ++
Sbjct: 372 HVVSSTG--GSISASVPGHGSVMYRVS 396
>gi|182414117|ref|YP_001819183.1| Alpha-galactosidase [Opitutus terrae PB90-1]
gi|177841331|gb|ACB75583.1| Alpha-galactosidase [Opitutus terrae PB90-1]
Length = 399
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 133/386 (34%), Positives = 180/386 (46%), Gaps = 60/386 (15%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
GL LTPPMGWNSWN F I+EK++RET D ++ + + D + +
Sbjct: 21 EGLALTPPMGWNSWNTFANHISEKVVRETADA---MEKNGMRDAGYVYIVIDDTWSLRQR 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE-S 123
G+LVA+ FPSG KALADYVH S + + +
Sbjct: 78 ----DANGSLVADPEKFPSGMKALADYVHAKGFKLGIYSCAGKTTCGGYPGSWGHEFQDA 133
Query: 124 NIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
+ GIDYLKYD C + D Y MS AL+ AGRP+ FS+CEWG RP W I
Sbjct: 134 RLWASWGIDYLKYDWCDHGDANAKDAYARMSAALRAAGRPVVFSLCEWGQNRPWEWAEPI 193
Query: 184 RNSWRTTDDISDS----------WARML----TIADMNEVYA---DHAKPGGWNDPDMLE 226
+ WRTT DI DS W R+L ++ D + Y A PG WNDPDMLE
Sbjct: 194 GHLWRTTGDIYDSYDGRKGWESGWKRLLDLQYSLVDSSGAYDGLNKFAGPGHWNDPDMLE 253
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-- 284
VGN G+ E HFS W + APL+ G DV +++ E I+ ++EVIA+NQ +Q
Sbjct: 254 VGNDGLSLAESRAHFSFWCLLAAPLMAGNDVRHMSEEIRAIMTDKEVIAINQDPAGKQGF 313
Query: 285 -----------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI----GIKSNNTLFEHRT 329
IW LS V + T + +T W + G S ++ +
Sbjct: 314 RALAEPAKSIEIWVKELSNQEWAVCALNTDTSARELTIDWGRLWTIQGKHSVRDVWAKKA 373
Query: 330 LETKFQGKLTAKVDGHSCKMYELTPV 355
+ + T +V+ H M+ LTP
Sbjct: 374 VGDTSK-PYTVRVESHDVAMFRLTPA 398
>gi|331240690|ref|XP_003332995.1| alpha-galactosidase [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309311985|gb|EFP88576.1| alpha-galactosidase [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 448
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/321 (37%), Positives = 163/321 (50%), Gaps = 55/321 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL +TPPMGWN+WN + C INE +I D L ++ D Q +
Sbjct: 23 NGLAITPPMGWNTWNKYGCNINEDLILSAAKAIKSEGLDK---LGYTYVNIDDCWQAPHR 79
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESN 124
N+ +A+ FPSG K L++ +H L SL I
Sbjct: 80 GPNNEP----IADPEKFPSGIKHLSNQIHALGLKLGIYSDAGTYTCGKRFGSLGYEINDA 135
Query: 125 IAF-EQGIDYLKYDNCFND--DTRPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
A+ + G+DYLKYDNC+N+ P+I RYR M +AL GRPI +S+C+WG+ +
Sbjct: 136 QAYADWGVDYLKYDNCYNEGLSGTPSISATRYRTMRNALNDTGRPIVYSLCQWGEDQVWN 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WG+ I NSWR + DI D++ R ++ I + A P
Sbjct: 196 WGATIANSWRISGDIYDNFDRPDDRCPCPDSSVPCPLAGFHCSVMNILEKAAGLGQKAGP 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND DMLEVGNGGM Y+EYV HFS+WA+ K+PL+LG DV ++ ET II N+EVIA+
Sbjct: 256 GGWNDLDMLEVGNGGMSYDEYVTHFSMWALVKSPLILGNDVTKMSPETKSIISNKEVIAI 315
Query: 277 NQAINDQ---QIWAGPLSGNR 294
NQ ++W P R
Sbjct: 316 NQDAEHSAGYRVWKKPTPNGR 336
>gi|395334147|gb|EJF66523.1| alpha-galactosidase [Dichomitus squalens LYAD-421 SS1]
Length = 451
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 164/319 (51%), Gaps = 53/319 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL LTP MGWN+WNHF C IN+ I + + D L + + D Q
Sbjct: 18 LNNGLALTPQMGWNTWNHFGCGINQDTIVNSAKALISNGLDK---LGYEYVLVDDCWQAD 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRI 121
A+ + G V +K+ FP G KA+AD VH L SLD I
Sbjct: 75 AR---DPDTGAPVEDKSKFPDGIKAVADQVHSLGLKFGIYSSAGTYTCGGKFGSLDYEEI 131
Query: 122 ESNIAFEQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
++ G+DYLKYDNC N+ P I RY MS AL GRPI +SMC WG+ P
Sbjct: 132 DAKTYASWGVDYLKYDNCNNEGRAGTPQISYERYNNMSQALNATGRPILYSMCNWGEDGP 191
Query: 177 ALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAK 215
+ NI NSWR + DI D++ R M I D + A
Sbjct: 192 WNFAVNIANSWRISGDIYDNFDRYDDRCPCTSMLDCKLAGFHCAMSRIIDFSAPLGQKAG 251
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
G WND DMLEVGNGGM Y+EYV HFS+WA++K+PL+LG DV +++ ET II N+ +IA
Sbjct: 252 KGHWNDLDMLEVGNGGMTYDEYVTHFSMWALAKSPLILGNDVTDMSDETKQIITNDAIIA 311
Query: 276 VNQAINDQ---QIWAGPLS 291
VN+ N ++W P+S
Sbjct: 312 VNKDTNGSPAIRVWKKPVS 330
>gi|297197084|ref|ZP_06914481.1| secreted alpha-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714037|gb|EDY58071.1| secreted alpha-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 533
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/296 (38%), Positives = 144/296 (48%), Gaps = 63/296 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGL LTP MG+N WN + C ++E +I+ T DDCW +RD
Sbjct: 35 LGNGLALTPQMGFNDWNAYGCNVSESLIKSTAQAMHTNGMQAAGYSYVNIDDCWMTHNRD 94
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S G LV + FP G K ADYVH L
Sbjct: 95 SG--------------------------GRLVPDPAKFPDGIKGTADYVHSLGLKLGIYE 128
Query: 113 ---------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGR 162
SL + +F G+DYLKYDNC N+ RY AM DAL GR
Sbjct: 129 DAGTATCAGYPGSLGHESTDAQSFASWGVDYLKYDNCNNNGVSAQSRYTAMRDALAATGR 188
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI +S+C WG WG+ + NSWRTT DIS ++A ML+I N A +A PG WNDP
Sbjct: 189 PILYSLCNWGQDNVWTWGAGVGNSWRTTGDISANFASMLSIFHSNVGLASYAGPGHWNDP 248
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
DMLEVGNG M E FS+WA APL+ G ++ + +AET+ + N VIAV+Q
Sbjct: 249 DMLEVGNGSMTATESRSEFSLWAEMAAPLIAGTNIPSASAETLSTLTNSRVIAVDQ 304
>gi|427386285|ref|ZP_18882482.1| hypothetical protein HMPREF9447_03515 [Bacteroides oleiciplenus YIT
12058]
gi|425726325|gb|EKU89190.1| hypothetical protein HMPREF9447_03515 [Bacteroides oleiciplenus YIT
12058]
Length = 400
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/388 (33%), Positives = 183/388 (47%), Gaps = 62/388 (15%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
++ L LTPPMGWNSWN F C +NE+ IR+ D + ++ D Q+
Sbjct: 23 ADSLALTPPMGWNSWNCFSCDVNEQQIRDMADL---IVANGMKDAGYTYVNVDDCWQVG- 78
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRI------------------- 121
GN+V + FPSG KALADY+H S L I
Sbjct: 79 ----RDADGNIVVDSVRFPSGIKALADYIH---SKGLKFGIYSCAGSLTCAGRPGSRGYQ 131
Query: 122 --ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
++ E G+D+LKYD CF++ P YR M DAL+ +GRPI FS+CEWG +P W
Sbjct: 132 FQDARTYAEWGVDFLKYDWCFDEAQSPQGAYRTMRDALRASGRPIVFSICEWGSSKPWTW 191
Query: 180 GSNIRNSWRTTDDISDSW--------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
+ + WRTT DI +++ ++ I D N +A PG WNDPDML+VGNG
Sbjct: 192 AKGVGHLWRTTGDIINAFKGTVHWGGCSVVDIIDKNADLYPYAGPGHWNDPDMLQVGNGV 251
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
+ +E HF++W + APLL G D+ N+ ET+ I+ N++VIAVNQ I
Sbjct: 252 LTTDENRSHFTMWCMLAAPLLAGNDLRNMDKETLSILTNKDVIAVNQDKLGVQGRRYMKI 311
Query: 281 NDQQIWAGPLSGNRIVVLLESRK----TFSSMM---TAHWDDIGIKSNNTLFEHRTLETK 333
++W L+ V +R TF + H+ DI + K
Sbjct: 312 EQHEVWVKQLADGEAAVCFFNRDEQPWTFEYKIGKDNYHFADIRFWELEYDVYDIWNKNK 371
Query: 334 FQG----KLTAKVDGHSCKMYELTPVTK 357
+ G L+ KV H + +LTP K
Sbjct: 372 YLGTSSDNLSFKVPAHGVVLVKLTPKNK 399
>gi|384249062|gb|EIE22544.1| alpha-galactosidase [Coccomyxa subellipsoidea C-169]
Length = 377
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 182/348 (52%), Gaps = 50/348 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL TPPMG+N+WN FH +INE +I D E+ L+ + + + +
Sbjct: 29 LNNGLASTPPMGYNTWNAFHDEINETLIYHAADELVETG------LAAAGYDYLVIDADG 82
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLD---NRIESNIAFEQ- 129
G + AN T FP+G KA+ADYVH + S+S + +++ +E
Sbjct: 83 WANFQRNGSGPIEANSTRFPNGIKAVADYVHAKGLKLGIYSDSGTLTCAKYTASLGYEAI 142
Query: 130 --------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+D LKYDNCF+ R RY AMSDAL GRPI FSMCEWG P +G+
Sbjct: 143 DAAQFAAWEVDLLKYDNCFSRTVRS--RYEAMSDALNATGRPILFSMCEWGVSSPWEYGN 200
Query: 182 NIRNSWRTTDD----ISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG-NGGMKYN- 235
+ N+WRTT D IS +W ++ + + A PGGWND DMLEVG GG +
Sbjct: 201 QVGNTWRTTQDISLAISATWDSVMNNLEGTKGLARFGGPGGWNDADMLEVGWPGGALLSD 260
Query: 236 -EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQ 283
E HF++WAI K+PL+ G D+ N+T +T+ I+ + EV+A+NQ Q
Sbjct: 261 VEQRAHFALWAILKSPLIFGNDLRNMTRQTLNILKSPEVVAINQDPLGIPGDLVWKQGPQ 320
Query: 284 QIWAGPL-SGNRIVVLLESRKTF-----SSMMTAHWDDIGIKSNNTLF 325
+++A PL G+R VVL + S MT W IGI N T++
Sbjct: 321 EVYAAPLVDGSRAVVLFNRHQHLDPNFPSQNMTVFWRSIGIPPNVTVW 368
>gi|390595331|gb|EIN04737.1| alpha-galactosidase [Punctularia strigosozonata HHB-11173 SS5]
Length = 436
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 142/410 (34%), Positives = 191/410 (46%), Gaps = 85/410 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ--IA 79
NGL LTP MGWN+WNHF C I+E I S+ + +S + + + ++ +
Sbjct: 23 NGLALTPQMGWNTWNHFGCDISEDTIV--------SAAQAFVSYNLTQYGYEYIIMDDCW 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILIS----------NSLD-NRI 121
P + G VA+ FP+G K LAD +H I S SLD I
Sbjct: 75 QAPARDNSTGAPVADPQKFPNGIKDLADKIHDMGLKFGIYSSAGLYTCGGRFGSLDFEVI 134
Query: 122 ESNIAFEQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
++ G+DYLKYDNC+N+ P I R+ MS AL GRPI +SMC WG+ P
Sbjct: 135 DAQTYASWGVDYLKYDNCYNEGRAGTPQISFERFNNMSQALNATGRPILYSMCNWGEDGP 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAK 215
+ I NSWR + DI DS+ R M I D A
Sbjct: 195 WNFAVTIANSWRISGDIMDSFDRFDDRCPCTSVLDCKLPGFHCAMARIIDFAAPIGQKAG 254
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WND DMLEVGNGGM ++EYV HFS+W+I K+PL+LG DV N+T ET+ II NE +IA
Sbjct: 255 PGKWNDLDMLEVGNGGMTFDEYVTHFSMWSILKSPLILGNDVTNMTNETLSIITNEALIA 314
Query: 276 VNQAINDQ---------------QIWAGPLSGNRIVVLL----ESRKTFSSMMTAHWDDI 316
VNQ N +W G L + VV + + +T + M+ + D
Sbjct: 315 VNQDPNGSPASRQTNTAVDGGSISLWQGSLVNDAFVVAVLNTSPNNQTTTIQMSDVFFDQ 374
Query: 317 GIKSNNTLFEHRTLETK------------FQGKLTAKVDGHSCKMYELTP 354
G + + +E L K QG L + HS K++ P
Sbjct: 375 GSVAGSEPWEFFDLWQKDDAGTWGKSIGTVQGSLEVDIGPHSVKVWRAIP 424
>gi|110588921|gb|ABG76969.1| GH27 galactosidase [Clostridium cellulolyticum H10]
Length = 604
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 180/375 (48%), Gaps = 52/375 (13%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL TPPMGWNSWN FH INE I++ D S + + +++ +L
Sbjct: 30 NGLAKTPPMGWNSWNIFHGDINETKIKQIADTMVSSG------MKEAGYVYLNLDDNWMA 83
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNRIE 122
GNL A+ T FPSG +ALADYVH I S S +
Sbjct: 84 NPARDSNGNLRADPTRFPSGIRALADYVHAKGLKLGIYGCRGTMTCMNIPQSGSKGYEDK 143
Query: 123 SNIAFEQ-GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
F GIDYLKYDNC + + Y+ M AL GRPI FS+C WG W
Sbjct: 144 DAKTFASWGIDYLKYDNCNIPNGSDMKTDYQKMQTALANCGRPIVFSICAWGYQS---WM 200
Query: 181 SNIRNSWRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
N WRTT D +D W ++ D N Y A PG WNDPDMLE+GNGG
Sbjct: 201 PATGNLWRTTGDFADKWDNGNEWFKGIINAIDGNAQYTSSAAPGAWNDPDMLEIGNGGCT 260
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
EY S+W++ +PL+ G D+ ++ T I+ N+EVIA++Q + N
Sbjct: 261 TEEYRTQMSMWSMMASPLIAGNDIRTMSQTTKDILLNKEVIAIDQDPAGVQGKRVKSANG 320
Query: 283 QQIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE--HRTLETKFQGKL 338
+IW PL +G V L +R + +S +T +W DIG+ + T+ + ++ + F G
Sbjct: 321 LEIWVKPLGTNGTTKAVALLNRNSATSNITVNWSDIGVSGSVTVRDLWAKSDKGSFTGSY 380
Query: 339 TAKVDGHSCKMYELT 353
TA V H + +++
Sbjct: 381 TASVPSHGTVLIKIS 395
>gi|268318139|ref|YP_003291858.1| alpha-galactosidase [Rhodothermus marinus DSM 4252]
gi|262335673|gb|ACY49470.1| Alpha-galactosidase [Rhodothermus marinus DSM 4252]
Length = 403
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 134/384 (34%), Positives = 179/384 (46%), Gaps = 65/384 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWNHF C INE+I+RE R + + D Q L
Sbjct: 25 LARTPPMGWNSWNHFGCNINEQIVREVAQA---MVRSGMRDAGYEYVIIDDCWQGERDSL 81
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIA 126
G + + FPSG KALADY+H L S + + +
Sbjct: 82 -----GFIQPDPERFPSGMKALADYIHSLGLKFGIYSDAGDRTCAGRPGSRGHEYQDALT 136
Query: 127 FEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+ + G+DYLKYD C ++ P Y M DAL AGRP+ FS+CEWG P WG I +
Sbjct: 137 YARWGVDYLKYDWCHTENLNPIGAYTTMRDALYAAGRPVVFSICEWGTNEPWKWGRTIGH 196
Query: 186 SWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
WRTT DI++ W +L I D + +A PG WNDPDMLEVGN GM+
Sbjct: 197 LWRTTGDITNCWDCIIDHGTWKSWGILQILDKQDGLRIYAGPGHWNDPDMLEVGN-GMRV 255
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---------- 284
+E HFS+WA+ APL+ G D+ +++ T I+ N EVIAV+Q Q
Sbjct: 256 SEDRAHFSMWAMLAAPLIAGNDIRHMSEATREILTNREVIAVDQDTLGIQGFPYRREAGV 315
Query: 285 -IWAGPLSGNRIVVLLESRK------TFSSMMTAHWDDIGIKSN---------NTLFEHR 328
IW PL+G + + +R TF+ +DD + L+ HR
Sbjct: 316 EIWFRPLAGGDWAMAILNRTETPRTVTFNFREEYIFDDFSKRGTFFDRITYRLRDLWAHR 375
Query: 329 TLETKFQGKLTAKVDGHSCKMYEL 352
+ T + LT +V GH M L
Sbjct: 376 DIGTT-ETPLTVEVPGHDVVMLRL 398
>gi|333381639|ref|ZP_08473318.1| hypothetical protein HMPREF9455_01484 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829568|gb|EGK02214.1| hypothetical protein HMPREF9455_01484 [Dysgonomonas gadei ATCC
BAA-286]
Length = 412
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 47/348 (13%)
Query: 6 SSAADSNGSDRRNLLS--NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQI 63
S A G ++++S + L TPPMGWNSWN F C ++EK+I+E D S
Sbjct: 18 SLAQQEFGKQGQSVISEFDKLAQTPPMGWNSWNKFGCNVSEKLIKEMADAIVSSGMKDA- 76
Query: 64 SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILI---------- 113
+ + D Q +GN++ +K FP+G K +ADYVH L
Sbjct: 77 --GYEYVVIDDCWQTGRDE-----EGNIIVDKDHFPNGMKPVADYVHSLGLKFGIYSCAG 129
Query: 114 SNSLDNRIES-NIAFEQ-------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIF 165
S + R S F+ G+DYLKYD C+N+ Y+ MSDALK GRPI
Sbjct: 130 SKTCQGRPGSRGYQFQDTRQYAGWGVDYLKYDWCYNEGQDAKAAYKTMSDALKACGRPIV 189
Query: 166 FSMCEWGDMRPALWGSNIRNSWRTTDDISDSW--------ARMLTIADMNEVYADHAKPG 217
FS+CEWG+ +P WG I + WR T DI D + +L I D ++ PG
Sbjct: 190 FSICEWGENKPWEWGKGIGHLWRITADIRDCYDCKFNWGGVGVLQILDKALTINQYSGPG 249
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WND +MLE+GNGG NEY HF+IW++ APL+ G D+ N+ A T I+ N+E IA+N
Sbjct: 250 HWNDLEMLEIGNGGQTENEYRSHFAIWSMMSAPLMAGNDIRNMDALTKEILLNKEAIAIN 309
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
Q +N I LS + L +R F+ + +W+
Sbjct: 310 QDKLGKTAFRFVTLNGIDILVKALSDGDVAFLFINRNNFNIDLDYNWN 357
>gi|50425819|ref|XP_461506.1| DEHA2F26840p [Debaryomyces hansenii CBS767]
gi|49657175|emb|CAG89932.1| DEHA2F26840p [Debaryomyces hansenii CBS767]
Length = 417
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 133/401 (33%), Positives = 194/401 (48%), Gaps = 78/401 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLS---CSFHMFDSLTQI 78
NGLGLTP MGWNSWN + C INE II G + + ++ L + + D Q+
Sbjct: 23 NGLGLTPQMGWNSWNIYGCDINETIII------GAAEKIKELGLQDLGYEYIVMDDCYQL 76
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNR- 120
+ + ++A+ FP+G K LAD +H L SL N
Sbjct: 77 HER---DNTTNKILADPEKFPNGIKHLADSIHDLGFKFGMYSSAGRYTCGGYPGSLHNEE 133
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMR 175
++++ + GIDYLKYDNC+N+ T RY M+ AL GRPIF+S+C+WG+
Sbjct: 134 LDADTFADWGIDYLKYDNCYNEGNSGTAQISYKRYDKMAKALNATGRPIFYSLCQWGEDN 193
Query: 176 PALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHA 214
WGS + NSWR + DI D + R M I + A
Sbjct: 194 VWNWGSTVSNSWRISGDIYDHFDRYDDRCPCESYECLGLQGYMCSMTNILEKAVPLGQKA 253
Query: 215 KPG-GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G GWND D LEVGNGGM Y+EYV HF++WAI K+PL+LG DV N+T E + I+ N ++
Sbjct: 254 GTGQGWNDLDSLEVGNGGMNYDEYVAHFTLWAILKSPLVLGNDVTNMTDEDLGIVKNAQI 313
Query: 274 IAVNQAIN--------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
I +NQ ++ Q++A LS +V + + + ++DI +
Sbjct: 314 IEINQDLSAPAHRVWKKSVKGGSLQLFATTLSDETQIVAIFNSGDYEEDTELLFEDIFVD 373
Query: 320 S---NNTLFEHRTLETK----FQGKLTAKVDGHSCKMYELT 353
N + + L T F+ K++ V HS K+++LT
Sbjct: 374 DLTMKNKSYSGKELWTNETSTFEDKISTSVKTHSIKIWKLT 414
>gi|90021240|ref|YP_527067.1| 30S ribosomal protein S32 [Saccharophagus degradans 2-40]
gi|89950840|gb|ABD80855.1| putative a-galactosidase [Saccharophagus degradans 2-40]
Length = 408
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 134/393 (34%), Positives = 185/393 (47%), Gaps = 77/393 (19%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWN+F C ++EK+I+ET D +S ++ +
Sbjct: 30 LAQTPPMGWNSWNNFGCDVDEKLIKET--------ADYMVSSGMKDAGYEYVNIDDCWHG 81
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF---------------- 127
G + A+ FPSG KALADYVH S L I S+ +
Sbjct: 82 ERDANGFIQADPERFPSGIKALADYVH---SKGLKFGIYSDAGWTTCGGKPGSRGYEFQD 138
Query: 128 -----EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
+ G+DYLKYD C D + Y+ M +A+ KAGRP+ FS+CEWGD +P W
Sbjct: 139 AQMYAKWGVDYLKYDWCATDGLKAEGAYQTMREAIHKAGRPMVFSICEWGDNQPWEWAKP 198
Query: 183 IRNSWRTTDDISD------------SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
I + WRTT DI + SW +L I DM + +A PG WNDPDM+EVGN
Sbjct: 199 IGHLWRTTGDIYNCFDCEYDHGTWSSWG-VLQILDMQDDLRQYAGPGHWNDPDMMEVGN- 256
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
GM E HFS+WA+ APL+ G D+ N++ T I+ N+ VIAV+Q +
Sbjct: 257 GMTEAEDRSHFSMWAMLAAPLIAGNDIRNMSEATRKILTNKAVIAVDQDELGVQGFKYSS 316
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL---ETKFQG 336
N ++W PL+ + + + +R W + +K T HRT+ E KF
Sbjct: 317 KNGVEVWFKPLANDEWAMAVLNRNKGEVKFEFKWRNEVVKDELT---HRTITFNEQKFDW 373
Query: 337 K--------------LTAKVDGHSCKMYELTPV 355
+ L K+ GH M+ LTP
Sbjct: 374 QDLWNKSNKGHTKKFLKTKIAGHDTLMFRLTPA 406
>gi|376260259|ref|YP_005146979.1| alpha-galactosidase [Clostridium sp. BNL1100]
gi|373944253|gb|AEY65174.1| alpha-galactosidase [Clostridium sp. BNL1100]
Length = 603
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 131/376 (34%), Positives = 177/376 (47%), Gaps = 54/376 (14%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN FH INE I++ D S + + A
Sbjct: 30 NNLAKTPPMGWNSWNIFHGDINETKIKQIADTMVSSGMKD-----VGYVYLNLDDNWMAN 84
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILI---------------------SNSLDNR 120
P GNL A+ T FP+G KALADYVH S ++R
Sbjct: 85 P-ARDSNGNLRADPTRFPNGIKALADYVHAKGLKLGIYGCRGTMTCMNVPQSGSKGYEDR 143
Query: 121 IESNIAFEQGIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
A GIDYLKYDNC + + Y+ M AL GRPI FS+C WG W
Sbjct: 144 DAKTFA-SWGIDYLKYDNCNIPNGSDMKTDYQKMQTALANCGRPIVFSICAWGYQS---W 199
Query: 180 GSNIRNSWRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
N WRTT DI+D W ++ D N YA A PG WNDPDMLE+GNGG
Sbjct: 200 MPATGNLWRTTGDIADKWDNGTEWFKGIINAIDGNAQYASSAVPGAWNDPDMLEIGNGGC 259
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
EY S+W++ +PL+ G D+ ++ T I+ N+EVIA++Q + N
Sbjct: 260 TTEEYRTQMSMWSMMASPLIAGNDIRTMSQTTKDILMNKEVIAIDQDPAGVQGKRVKSAN 319
Query: 282 DQQIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE--HRTLETKFQGK 337
+IW PL +G V L +R + +S +T +W DIG+ + T+ + + + F G
Sbjct: 320 GLEIWVKPLGTNGTTKAVALLNRNSATSNITVNWSDIGVSGSVTVRDLWAKADKGSFTGS 379
Query: 338 LTAKVDGHSCKMYELT 353
TA V H + +++
Sbjct: 380 YTASVPSHGTVLLKIS 395
>gi|254787943|ref|YP_003075372.1| alpha-galactosidase [Teredinibacter turnerae T7901]
gi|237686725|gb|ACR13989.1| alpha-galactosidase [Teredinibacter turnerae T7901]
Length = 411
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 188/397 (47%), Gaps = 76/397 (19%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRE-TDDCWGESSRDS---QISLSCSFHMFDSLTQI 78
GL L+PPMGWNSWN F C I+E +I+ D RD+ I++ +H
Sbjct: 24 GLALSPPMGWNSWNTFGCDIDEGLIKTIADKMVASGMRDAGYQYINIDDCWHG------- 76
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------------I 121
G + A+KT FPSG KALADYVH + I + N+
Sbjct: 77 -----ERDENGFIQADKTRFPSGIKALADYVHSKGLKLGIYSDAGNKTCGGRPGSNGHEY 131
Query: 122 ESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
+ I + + GIDYLKYD C ++ YR M DAL AGRP+ S+CEWGD +P LWG
Sbjct: 132 QDAIQYARWGIDYLKYDWCNTENLSAPGAYRTMRDALAAAGRPVVLSICEWGDNKPWLWG 191
Query: 181 SNIRNSWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
+ + WRTT DI + W +L I DM + +A P WNDPDM+EVGN
Sbjct: 192 KEMGHLWRTTGDIINCWDCEVDHGSWSSWGILRILDMQDGLRQYAGPDHWNDPDMMEVGN 251
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
GM +E HFSIWA+ APL+ G D+ +++ T I+ + EVIAVNQ
Sbjct: 252 -GMSEHEDRAHFSIWAMLAAPLIAGNDLRSMSKSTQKILTHSEVIAVNQDALGVQAFKYK 310
Query: 279 AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD-----DIGIKSNNTLFEHRTLETK 333
ND +IW PL + +++ +R + ++ W D N + ++
Sbjct: 311 TQNDVEIWFKPLQDKQWAMMILNRGDKKAGVSFDWRNENVVDPDFPGNGQFYTAGFANSR 370
Query: 334 F--------------QGKLTAKVDGHSCKMYELTPVT 356
+ + L V+ H+ K+Y L +T
Sbjct: 371 YKIFDLWANKSAGTTKSSLRVSVEPHAVKLYRLEKIT 407
>gi|395327353|gb|EJF59753.1| glycoside hydrolase [Dichomitus squalens LYAD-421 SS1]
Length = 430
Score = 193 bits (491), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 139/417 (33%), Positives = 195/417 (46%), Gaps = 92/417 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS-----LSCSFHMFDS 74
L+NGL TP MGWN+WNHF C I++ I E+S + IS L + + D
Sbjct: 18 LNNGLARTPQMGWNTWNHFGCGISQDTI--------ENSAKALISNGLYKLGYEYVLVDD 69
Query: 75 LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSL 117
Q A+ PN G V +K+ FP G KA+AD +H + SL
Sbjct: 70 CWQADARD-PNT--GAPVEDKSKFPDGIKAVADKIHSMGLKFGIYSSAGTYTCGGKFGSL 126
Query: 118 D-NRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEW 171
D I++ G+DYLKYDNC N+ T RY MS AL GRPI +SMC W
Sbjct: 127 DYEEIDAKTYASWGVDYLKYDNCNNEGRAGTPQVSYERYNNMSKALLATGRPILYSMCNW 186
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVY 210
G+ P + NI NSWR + DI D++ R M I D +
Sbjct: 187 GEDGPWSFAVNIANSWRISGDIYDNFDRYDDRCPCTSVLDCKLAGFHCAMSRIIDFSAPL 246
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
+ G WND DMLEVGNGGM Y+EYV HFS+WA+ K+PL+LG ++ +++ ET II N
Sbjct: 247 LQKSGKGHWNDLDMLEVGNGGMTYDEYVTHFSMWALVKSPLILGNEITSMSDETKQIITN 306
Query: 271 EEVIAVNQAIN----------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
+ +IAVNQ N D Q+W+G LS N+ V + + + + +
Sbjct: 307 DAIIAVNQDSNGSPAGRVWKKPASQGGDIQLWSGSLSNNQYVFAVLNTSPVNQTVDISFL 366
Query: 315 DIGIKSNNTL----FEHRTLETK------------FQGKLTAKVDGHSCKMYELTPV 355
D+ + + F L K QG + VD H+ +++++ P
Sbjct: 367 DVFVDQGRSFGSGTFTLYDLWQKDPSGKWGAPVGNAQGTYSVSVDTHATRVFKVVPA 423
>gi|328851473|gb|EGG00627.1| family 27 glycoside hydrolase [Melampsora larici-populina 98AG31]
Length = 423
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 173/336 (51%), Gaps = 61/336 (18%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
+LL NG ++P MGWN+WN + C INE+II + + D ++ +C+ D Q
Sbjct: 24 SLLDNGF-VSPAMGWNTWNKYGCSINEQIILKAAEAIKNHGLD-KLGYNCTI---DDCWQ 78
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ISNSLDN 119
A + PN +A+ FP G KALAD +H + +
Sbjct: 79 -APQRGPNNVP---LADPKKFPKGIKALADEIHDMGLKVGIYSDAGTYTCGKRYGSLGHE 134
Query: 120 RIESNIAFEQGIDYLKYDNCFNDDTRPT-----IRYRAMSDALKKAGRPIFFSMCEWGDM 174
++++ GIDYLKYDNC+N+ T RYRAM DAL GRPI +S+C+WG+
Sbjct: 135 KVDAQTYANWGIDYLKYDNCYNEGLSGTPQISATRYRAMRDALNATGRPIVYSLCQWGED 194
Query: 175 RPALWGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYAD 212
WG I NSWR + DI D + R + I +
Sbjct: 195 AVWNWGWTIANSWRISGDIYDHFDRPDDRCPCKDSVSYCALAGFQCSAMNILEKAAGVGQ 254
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
A GGWND DMLEVGNGGM Y+EYV HFS+WA +K+PL+LG DV N++ ET+ II N+E
Sbjct: 255 KAGSGGWNDLDMLEVGNGGMSYDEYVAHFSMWAFAKSPLILGNDVTNMSKETLSIISNKE 314
Query: 273 VIAVNQAINDQQIWAGPLSGNRIVVLLESRKTFSSM 308
+IA+NQ D+ SG R+ + + RK+ S+
Sbjct: 315 IIALNQ---DKSF----DSGYRVWKMEDPRKSSGSI 343
>gi|418467038|ref|ZP_13037935.1| secreted alpha-galactosidase [Streptomyces coelicoflavus ZG0656]
gi|371552335|gb|EHN79586.1| secreted alpha-galactosidase [Streptomyces coelicoflavus ZG0656]
Length = 684
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/279 (42%), Positives = 153/279 (54%), Gaps = 29/279 (10%)
Query: 22 NGLGLTPPMGWNSWNHFHC--QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
GL LTPPMG+N+WN HC + +E +++ D + E L + + + +L
Sbjct: 56 EGLALTPPMGFNNWNSTHCRDEFDESMVKGIADLFVERG------LKDAGYEYVNLDDCW 109
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIAF------- 127
A P G LV + FP+G KA+ADYVH I S + S+I F
Sbjct: 110 ALP-ERDADGRLVPDPERFPNGIKAVADYVHSKGLKFGIYTSAGTKTCSSIGFPGALGHE 168
Query: 128 --------EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+ G+DYLKYDNC N RY M DAL GRPI +S+CEWG +P W
Sbjct: 169 YSDARQFADWGVDYLKYDNCNNQGVDAKQRYTTMRDALAATGRPIVYSICEWGQNKPWEW 228
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
++ WRTT DISDSW+ M +I N A +A+PGGWNDPDMLEVGNGGM EY
Sbjct: 229 AGDLGQLWRTTGDISDSWSSMSSIMKANLALAPYARPGGWNDPDMLEVGNGGMTDTEYRT 288
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
HFS+W+I APLL+G D+ I+ N+EVIAV+Q
Sbjct: 289 HFSMWSIMAAPLLIGTDLRTAPESAFEILTNDEVIAVDQ 327
>gi|10944324|gb|AAG24510.1|AF246262_1 alpha-galactosidase [Phanerochaete chrysosporium]
Length = 438
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 134/431 (31%), Positives = 181/431 (41%), Gaps = 120/431 (27%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGL +TP MGWN+WNHF C I+E I DDCW SRD+Q
Sbjct: 23 NGLAITPQMGWNTWNHFGCDISEDTILSAAQAILASNLTDFGYEYVIMDDCWHAPSRDNQ 82
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------- 112
G VA+ + FP+G K L+D +H +
Sbjct: 83 T-------------------------GAPVADPSKFPNGIKDLSDKIHAMGLKFGIYSDA 117
Query: 113 --------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKK 159
+ I++ E G+DYLKYDNC+N+ T RY MS AL
Sbjct: 118 GTYTCGGRFGSLGYEEIDAKTYAEWGVDYLKYDNCYNEGLSGTPHISHERYANMSRALNA 177
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR-------------------- 199
GRPI +SMC WG+ P + NI NSWR + DI D++ R
Sbjct: 178 TGRPILYSMCNWGEDGPWNFAQNIANSWRISGDIMDNFDRYDDRCPCTSVIDCKLPGFHC 237
Query: 200 -MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
M I D A PG WND DMLE+GNGGM ++EYV HFS+W+I K+PL+LG DV
Sbjct: 238 AMTRIIDFAAPVGQKAGPGHWNDLDMLEIGNGGMTFDEYVTHFSMWSILKSPLILGNDVT 297
Query: 259 NLTAETMPIIGNEEVIAVNQAI----------------NDQQIWAGPLSGNRIVVLLESR 302
++T ET+ II N+ +I +NQ D +WAG L N V L +
Sbjct: 298 DMTNETLTIITNKAIIDINQDAAGSPANRMWKRSVDEGGDLSLWAGSLVNNTFVFTLLNT 357
Query: 303 KTFSSMMTAHWDDIGIKSNNTL----FEHRTLETK------------FQGKLTAKVDGHS 346
+ + D+ T +E L K QG + + H
Sbjct: 358 SPAEQTVQVDFADVFFDQGKTYQTQPYEVFDLWQKDDEGTWGKSIGIIQGSMNVTIGVHQ 417
Query: 347 CKMYELTPVTK 357
K+++ P +
Sbjct: 418 TKVWKAVPAPQ 428
>gi|116620708|ref|YP_822864.1| alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
gi|116223870|gb|ABJ82579.1| Alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
Length = 518
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/372 (35%), Positives = 188/372 (50%), Gaps = 50/372 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+NGL TPPMGWNSWN F Q+++K++RE D +S + + D + A
Sbjct: 156 ANGLAKTPPMGWNSWNKFRNQVSDKMVREIADAMVKSGMKAA---GYVYVNIDDTWEGAH 212
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI----ESNIAFEQ-- 129
+ QGN+ N FP KAL+ YVH + I +S + E + E+
Sbjct: 213 R----DAQGNITTN-NKFPD-MKALSAYVHSKGLKLGIYSSPGPKTCAGYEGSYQHEEQD 266
Query: 130 -------GIDYLKYDNC----FNDDTRPTI--RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYD C D+T+ T+ Y M L +GR I +S+C++G++
Sbjct: 267 AKTYAAWGIDYLKYDWCSASQVYDNTQATMVNAYAKMGLGLLNSGRKIVYSLCQYGNLDV 326
Query: 177 ALWGSNIR-NSWRTTDDISDSWARMLTIA-DMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
WG + N WRTT DISD W M + ++ A PG WNDPDMLE+GNGGM
Sbjct: 327 GAWGERVGGNLWRTTGDISDRWDSMQRLGFELQTDREKVAGPGHWNDPDMLEIGNGGMTN 386
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQ 283
+EY H S+W I +PLL G D+ ++ E I+ N+EVIAV+Q D+
Sbjct: 387 DEYKTHMSLWCILASPLLAGNDIRDMKPEVADILMNKEVIAVDQDKLGKQGKRVAKDGDK 446
Query: 284 QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTA 340
++WA PL+ V L +R ++ +TA W D+GI T L++H L K G+ TA
Sbjct: 447 EVWAKPLADGSHAVALFNRGADTASVTAKWSDLGINGARTIRDLWKHADLGRK-SGEFTA 505
Query: 341 KVDGHSCKMYEL 352
V H M +
Sbjct: 506 TVPSHGVVMVRI 517
>gi|423216525|ref|ZP_17203048.1| hypothetical protein HMPREF1074_04580 [Bacteroides xylanisolvens
CL03T12C04]
gi|392690735|gb|EIY83991.1| hypothetical protein HMPREF1074_04580 [Bacteroides xylanisolvens
CL03T12C04]
Length = 399
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/385 (34%), Positives = 182/385 (47%), Gaps = 62/385 (16%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
++ L LTPPMGW+SWN F+ I+E+ IRE D +S ++ L
Sbjct: 24 ADSLALTPPMGWSSWNCFNSDISEQKIREI--------ADFMVSTGMKDAGYEYLNIDDC 75
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIES------------NIAFE 128
+ GN++ + FPSG KALADYVH S L I S + +E
Sbjct: 76 WQIGRDEDGNIIVDDKNFPSGMKALADYVH---SKGLKFGIYSCAGTMTCAGRPGSFGYE 132
Query: 129 ---------QGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
G+DYLKYD C N+ Y+ MSDALKK+GRPI S+CEWG +P W
Sbjct: 133 FQDARTYASWGVDYLKYDWCNNEGRNAQAAYKIMSDALKKSGRPIILSICEWGHSKPWTW 192
Query: 180 GSNIRNSWRTTDDI------SDSWARM--LTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
G I WRTT DI + W + + I D N ++ PG WNDPDML+VGN G
Sbjct: 193 GQGIGQLWRTTHDIISVFSGTIHWGALGIVEIIDQNAELYKYSGPGHWNDPDMLQVGNPG 252
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
+ E HF++W + APL+ G D+ + E I+ N+EVIAV+Q
Sbjct: 253 LSMEENRSHFTMWCMLAAPLMAGNDIRKMDKEVAKILMNKEVIAVDQDRLGKQGRRYKVF 312
Query: 281 NDQQIWAGPLSGNRIVVLLESRKT-FSSMMTAHWDD-----IGIKSNNTLFEHRTL-ETK 333
+IW LSG+ I V L +R FS + W +G+ ++ R L + K
Sbjct: 313 GKNEIWVKQLSGDEIAVCLFNRDDHFSWNLDIDWQKEDFSLVGVNLTEKKYKVRDLWKQK 372
Query: 334 FQG----KLTAKVDGHSCKMYELTP 354
G K++ V H + LTP
Sbjct: 373 DLGTAADKMSFDVPVHGVVLLRLTP 397
>gi|326201457|ref|ZP_08191328.1| Alpha-galactosidase [Clostridium papyrosolvens DSM 2782]
gi|325988057|gb|EGD48882.1| Alpha-galactosidase [Clostridium papyrosolvens DSM 2782]
Length = 604
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 178/375 (47%), Gaps = 52/375 (13%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L TPPMGWNSWN FH INE I++ D S + + +++ +L
Sbjct: 30 NNLAKTPPMGWNSWNIFHGDINETKIKQIADTMVSSG------MKDAGYVYLNLDDNWMA 83
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-------------------ILISNSLDNRIE 122
GNL A+ FP+G KALADYVH I S S +
Sbjct: 84 NPARDSNGNLKADPKRFPNGIKALADYVHDKGLKLGIYGCRGTMTCMNIPQSGSKGYEDK 143
Query: 123 SNIAFEQ-GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
F GIDYLKYDNC + + Y+ M AL GRPI FS+C WG W
Sbjct: 144 DAKTFASWGIDYLKYDNCNIPNGSDMKTDYQKMQTALANCGRPIVFSICAWGYQS---WM 200
Query: 181 SNIRNSWRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
N WRTT DI+D W ++ D N Y A PG WNDPDMLE+GNGG
Sbjct: 201 PATGNLWRTTGDIADKWDNGNEWFKGIINAIDGNAQYTSSAAPGAWNDPDMLEIGNGGCT 260
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
EY S+W++ +PL+ G D+ ++ T I+ N+EVIA++Q + N
Sbjct: 261 AEEYRTQMSMWSMMASPLIAGNDIRTMSQTTKDILMNKEVIAIDQDPAGIQGKRVKSSNG 320
Query: 283 QQIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE--HRTLETKFQGKL 338
Q+IW L +G V L +R + +S +T +W DIG+ + T+ + + + F G
Sbjct: 321 QEIWVKSLGTNGTTKAVALLNRNSSTSNITVNWSDIGVTGSVTVRDLWAKADKGSFTGSY 380
Query: 339 TAKVDGHSCKMYELT 353
TA V H + +++
Sbjct: 381 TASVPAHGTVLLKIS 395
>gi|10944326|gb|AAG24511.1|AF246263_1 alpha-galactosidase [Phanerochaete chrysosporium]
Length = 438
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 125/357 (35%), Positives = 159/357 (44%), Gaps = 104/357 (29%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGL +TP MGWN+WNHF C I+E I DDCW SRD+Q
Sbjct: 23 NGLAITPQMGWNTWNHFGCDISEDTILSAAQAIIANNLTDLGYEYVIMDDCWHAPSRDNQ 82
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------- 112
G VA+ FP+G K L+D +H +
Sbjct: 83 T-------------------------GAPVADPAKFPNGIKDLSDKIHAMGLKFGIYSDA 117
Query: 113 --------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKK 159
+ I++ E G+DYLKYDNC+N+ T RY MS AL
Sbjct: 118 GTYTCGGRFGSLGYEDIDAKTYAEWGVDYLKYDNCYNEGLSGTPHISHERYANMSRALNA 177
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR-------------------- 199
GRPI +SMC WG+ P + NI NSWR + DI D++ R
Sbjct: 178 TGRPILYSMCNWGEDGPWNFAQNIANSWRISGDIMDNFDRYDDRCPCTSVIDCKLPGFHC 237
Query: 200 -MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
M I D A PG WND DMLE+GNGGM ++EYV HFS+W+I K+PL+LG DV
Sbjct: 238 AMTRIIDFAAPVGQKAGPGHWNDLDMLEIGNGGMTFDEYVTHFSMWSILKSPLILGNDVT 297
Query: 259 NLTAETMPIIGNEEVIAVNQAI----------------NDQQIWAGPLSGNRIVVLL 299
N+T ET+ II N+ +I +NQ D +WAG L N V L
Sbjct: 298 NMTNETLTIITNKAIIDINQDAAGSPTNRMWKRSVDEGGDLSLWAGSLVNNTYVFAL 354
>gi|340521580|gb|EGR51814.1| predicted protein [Trichoderma reesei QM6a]
Length = 496
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 168/350 (48%), Gaps = 75/350 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
++NGL TP MGWN+WN F C ++ ++ T E L + + D
Sbjct: 20 VNNGLARTPQMGWNNWNTFACSVSSALLTSTAQLLTEYGLQD---LGYKYVVLDDCWSAG 76
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
++ G LVA++ FP G K LAD +H + SLD+
Sbjct: 77 R----DEADGRLVADEAKFPGGMKPLADALHEKGFLFGMYSSAGEMTCARYAGSLDHEEA 132
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+DYLKYDNC++ P I R+ AM+ ALKK GR I +S+C WG+
Sbjct: 133 DAASFASWGVDYLKYDNCYHMGRFGTPLISFDRFNAMAKALKKTGRAILYSLCSWGEDYV 192
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYADH 213
WG +I NSWR + DI DS+AR +L I + Y D
Sbjct: 193 HTWGGSIANSWRVSGDIYDSFARPDDLCSCTDAADPACIAPGTHCSVLAIINKVAPYIDR 252
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
PGGWND DMLEVG+GGM EY HFS+WA K+PLLLG D+ +T T+ II N +
Sbjct: 253 GLPGGWNDLDMLEVGHGGMTEEEYKAHFSMWAALKSPLLLGNDLRIMTPSTLTIITNPAI 312
Query: 274 IAVNQ---------------------AINDQQIWAGPLS-GNRIVVLLES 301
IA+NQ + + +W+GPL+ G+++V+ L +
Sbjct: 313 IALNQDPRGRAVQRISRNTSVPKDRYGVGETHVWSGPLANGDQVVIFLNA 362
>gi|392571696|gb|EIW64868.1| alpha-galactosidase [Trametes versicolor FP-101664 SS1]
Length = 436
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 131/357 (36%), Positives = 164/357 (45%), Gaps = 104/357 (29%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGL TP MGWN+WNHF C I+E I DDCW ++RD
Sbjct: 20 NGLARTPQMGWNTWNHFGCGISEDTILSAAKAMVSNNLTAFGYEYVIMDDCWHAAARDPD 79
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------- 112
G VA+ T FP+G KALAD VH L
Sbjct: 80 T-------------------------GAPVADATKFPNGIKALADEVHGLGLKFGIYSSA 114
Query: 113 -------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKK 159
SLD+ I++ G+DYLKYDNCFN+ P I R+ MS AL
Sbjct: 115 GTYTCGGRFGSLDHEEIDAQTYASWGVDYLKYDNCFNEGRAGTPKISYDRFNNMSRALNA 174
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW---------------------A 198
GRPI +SMC WG+ P + +I NSWR + DI D +
Sbjct: 175 TGRPILYSMCNWGEDGPWNFAVDIANSWRISGDIHDDFDRFDDRCPCTDILDCKLPGFHC 234
Query: 199 RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
M I D A G WND DMLEVGNGGM ++EYV HFS+WA+ K+PL+LG DV
Sbjct: 235 AMSRIIDFAAPVGQKAGAGKWNDLDMLEVGNGGMTFDEYVTHFSMWALVKSPLILGNDVT 294
Query: 259 NLTAETMPIIGNEEVIAVNQAIN----------------DQQIWAGPLSGNRIVVLL 299
++T ET+ II N+ +IAVNQ N D Q+W+G L+ N+ V L
Sbjct: 295 DMTNETLSIITNDALIAVNQDSNGSPANRIWKRTVAEGGDLQLWSGSLANNQFVFAL 351
>gi|403415516|emb|CCM02216.1| predicted protein [Fibroporia radiculosa]
Length = 438
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 129/357 (36%), Positives = 173/357 (48%), Gaps = 66/357 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL LTP MGW++WN++ C I+E+ I D G D ++ + D Q A+
Sbjct: 23 NGLALTPQMGWDTWNYYGCSISEQTI--LDAAQGFIKYDLP-KYGYNYVVMDDCWQAPAR 79
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD-------------NRIES 123
N G +A+ T FPSG KALAD +H + I +S I++
Sbjct: 80 ---NATTGAPIADPTKFPSGMKALADQIHSMGLKFGIYSSAGMYTCGGHFGSLGYETIDA 136
Query: 124 NIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
E G DYLKYDNC+N+ T RY MS AL GRPI +SMC WG+ P
Sbjct: 137 QTYAEWGADYLKYDNCYNEGQAGTPKLSYDRYANMSFALNATGRPILYSMCNWGEDGPWN 196
Query: 179 WGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPG 217
+ I NSWR + DI D++ R M I D A
Sbjct: 197 FAPTIANSWRISGDIYDNYNRFDERCPCLSMIDCKLAGYHCAMTRIIDFAAPVGQKAGTN 256
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WND DMLEVGNGGM Y+EYV HFS+W+I K+PL+LG +V N+T ET+ II N+ +IAVN
Sbjct: 257 HWNDLDMLEVGNGGMTYDEYVTHFSMWSILKSPLILGNNVLNMTNETLEIITNDAIIAVN 316
Query: 278 Q----------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
Q + D W+G L N V+ L + + +T ++ D+ I
Sbjct: 317 QDSTGSAAERMWKVPVFSGGDLSSWSGSLVNNTFVIALLNTSPETQYVTVNFTDVFI 373
>gi|261416165|ref|YP_003249848.1| alpha-galactosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791034|ref|YP_005822157.1| alpha-galactosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|261372621|gb|ACX75366.1| Alpha-galactosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302326665|gb|ADL25866.1| alpha-galactosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
Length = 630
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 131/340 (38%), Positives = 169/340 (49%), Gaps = 56/340 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDS-QISLSCSFHMFDSLTQIAAK 81
L LTPP+GWNSWN FH INEK I+E D ES RD+ + L+ + D+
Sbjct: 34 LVLTPPLGWNSWNVFHENINEKQIQEIADAMVESGLRDAGYVFLNLDDNWMDT------- 86
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL----------------------ISNSLDN 119
QGNL N TFPSG KA+ADYVH S S N
Sbjct: 87 --KRDAQGNLQNNPKTFPSGMKAIADYVHKKGLKFGLYGDRGKRTCHHYNSNWQSESGSN 144
Query: 120 RIESNIA---FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
E A E G+DY KYDNC +D Y AMS+AL+ +GR I FS+C W
Sbjct: 145 GHEVQDAKKLAEWGVDYWKYDNCDSDPRTQEKDYTAMSNALRNSGRDIVFSICMW---EY 201
Query: 177 ALWGSNIRNSWRTTDDI-----SDSWAR-MLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
W I N WRTT DI S SW R + I D N Y AKPG WNDPDMLEVGN
Sbjct: 202 KDWMPKIANLWRTTFDIGPEWISTSWYRGVYEIIDANNKYWQIAKPGHWNDPDMLEVGNR 261
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
G+ Y E ++W+I AP+++ DV N++ ET + N+++IA+NQ
Sbjct: 262 GLSYEEQRSQMTMWSIMAAPIMISSDVRNMSNETKELYLNKDMIAINQDSLGVQGHRISD 321
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
N +Q+W PL I V L + + + ++ DIG++
Sbjct: 322 KNGKQVWTKPLKNGDIAVALLNNNNSTQTVECNFKDIGVE 361
>gi|242215448|ref|XP_002473539.1| candidate alpha-galactosidase from glycoside hydrolase family GH27
[Postia placenta Mad-698-R]
gi|220727325|gb|EED81247.1| candidate alpha-galactosidase from glycoside hydrolase family GH27
[Postia placenta Mad-698-R]
Length = 438
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 171/359 (47%), Gaps = 70/359 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ--IA 79
NGL LTP MGWN+WN+F C ++E I S+ + I + ++ +
Sbjct: 23 NGLALTPQMGWNTWNYFGCDVSEDTIV--------SAAKAMIQYDLPKYGYEYVISDDCW 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD-------------NRI 121
P N G +A+ T FPSG A+AD VH+L I +S I
Sbjct: 75 QAPDRNATTGAPIADPTKFPSGMAAVADQVHLLGLKFGIYSSAGLYTCGGHFGSLGYETI 134
Query: 122 ESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
++ E G DYLKYDNC+N+ T RY MS AL GR I +SMC WG+ P
Sbjct: 135 DAQTYAEWGADYLKYDNCYNEGQAGTPKLSYDRYANMSLALNATGRTILYSMCNWGEDGP 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAK 215
+ I NSWR + DI D++ R M I D A
Sbjct: 195 WNFAPTIANSWRISGDIYDNYNRFDERCPCLSMIDCKLAGYHCAMTRIIDFAAPVGQKAG 254
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
WND DMLE+GNGGM Y+EYV HFS+W++ K+PL+LG DV N+T ET+ II N+ VIA
Sbjct: 255 VNHWNDLDMLEIGNGGMTYDEYVTHFSMWSVLKSPLILGNDVTNMTNETLEIITNDAVIA 314
Query: 276 VNQAIN----------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
+NQ N D +W+G L N+ V+ L + + ++ + D+ I
Sbjct: 315 INQDPNGSAANRLWKVPVASGGDLSLWSGSLVNNQFVIALLNTSPDTQVVDVEFTDVFI 373
>gi|404404847|ref|ZP_10996431.1| alpha-galactosidase [Alistipes sp. JC136]
Length = 407
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 133/411 (32%), Positives = 179/411 (43%), Gaps = 102/411 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
GL TPPMGW+SWN F C ++EK+IRE DDCW + RD+
Sbjct: 23 EGLADTPPMGWSSWNKFACNVDEKMIREIADALVSSGLADAGYVYLNIDDCWHAAERDAD 82
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------- 111
C + FPSG KALADYVH
Sbjct: 83 GFPQC--------------------------DPERFPSGMKALADYVHAKGLKLGIYSDA 116
Query: 112 ------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPI 164
L SL + + + + + GIDYLKYD C +++ P Y M DAL+ AGRPI
Sbjct: 117 GCKTCALRFGSLGHEYQDALQYARWGIDYLKYDWCNSENVNPVGAYTLMRDALRAAGRPI 176
Query: 165 FFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA------------RMLTIADMNEVYAD 212
FS+CEWG +P W + + WRTT DI +++ +L D N
Sbjct: 177 LFSICEWGTNKPWEWAQEVGHLWRTTQDIGLAFSDPADFKVDWRPRTVLENLDSNAGLRR 236
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
A PG WNDPDMLEVGN GM N+ HF++W + APL+LG DV ++ ET I+ + +
Sbjct: 237 FAGPGHWNDPDMLEVGN-GMSVNQDRAHFTMWCMMAAPLILGNDVRTMSDETAAILLDRD 295
Query: 273 VIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD-----DI 316
VIA++Q N ++W PL+G L +R W D+
Sbjct: 296 VIAIDQDSLGVQGLRYETDNGLEVWFKPLAGGDWAFCLLNRTLEPRRYMIDWQRFCFTDV 355
Query: 317 GIKSNNTLFEHRTLETK---FQGK-------LTAKVDGHSCKMYELTPVTK 357
+ +T F+ E + F GK V +Y LTPV K
Sbjct: 356 EVSQRSTDFDKIVYEGRDLWFGGKPFRTNRVREVVVPAEDVVLYRLTPVKK 406
>gi|358378186|gb|EHK15868.1| glycoside hydrolase family 27 protein [Trichoderma virens Gv29-8]
Length = 496
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 169/369 (45%), Gaps = 114/369 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
++NGL TP MGWN+WN FHC ++ ++ T DDCW S RD
Sbjct: 20 VNNGLARTPQMGWNNWNTFHCSVSSTLLTNTAKLLTEYGLQDLGYKYVVLDDCW-SSGRD 78
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
+ G LVA+ T FP G A+AD +H
Sbjct: 79 AN--------------------------GKLVADTTKFPDGMGAVADALHEQGFLFGMYS 112
Query: 111 -------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
+ SLD +F G+DYLKYDNC++ P I R+ M+ AL
Sbjct: 113 SAGEMTCARYAGSLDYEENDAQSFADWGVDYLKYDNCYHMGRFGTPLISFNRFNEMAKAL 172
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
KK GR I +S+C WG+ WG +I NSWR + DI DS+AR
Sbjct: 173 KKTGRSILYSLCNWGEDYVHTWGGSIANSWRISGDIYDSFARPDDLCSCTNAADPACIAP 232
Query: 200 -----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+L I + Y D PGGWND DMLEVG+GGM EY HFS+WA K+PLLLG
Sbjct: 233 GTHCSVLAIINKVAPYIDRGLPGGWNDLDMLEVGHGGMTEEEYKAHFSMWAALKSPLLLG 292
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPLS-G 292
D+ +TA ++ II N +IA+NQ + + +W+GPL+ G
Sbjct: 293 NDLRAMTASSLAIINNPAIIALNQDPRGRAIQRISRDLDVPVDRHGVGETHVWSGPLANG 352
Query: 293 NRIVVLLES 301
+++V+ L +
Sbjct: 353 DQVVIFLNA 361
>gi|387791649|ref|YP_006256714.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
gi|379654482|gb|AFD07538.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
Length = 412
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 168/345 (48%), Gaps = 64/345 (18%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISL----SCSFHMFDSLTQ 77
GL LTPPMGWNSWN F C ++E +I+ D S +D+ + C DSL
Sbjct: 33 GLALTPPMGWNSWNKFACNVDENLIKSIADAMVSSGMKDAGYTYINIDDCWHGDRDSLGF 92
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------- 129
I P FPSG KALADY+H S L I S+ +
Sbjct: 93 IHPDP-------------KRFPSGMKALADYIH---SKGLKIGIYSDAGSQTCGGRPGSR 136
Query: 130 -------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYD C + + Y+ ++ AL++AGRPI S+CEWG+ +P
Sbjct: 137 GYEFQDAMTYAAWGIDYLKYDWCNTEGLKAEGAYKTITAALRRAGRPIVLSICEWGNDKP 196
Query: 177 ALWGSNIRNSWRTTDDISD---------SWARM--LTIADMNEVYADHAKPGGWNDPDML 225
WG + + WRTT DI + +W + I DM + +A PG WNDPDML
Sbjct: 197 WEWGKTVGHLWRTTGDIYNCFDCIEDHGTWKSFGTMQILDMQDGLRKYAGPGHWNDPDML 256
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVGNG + E HF++WA+ APL+ G D+ N+ ET+ ++ N+++IA+NQ
Sbjct: 257 EVGNGKLTPREDRAHFTMWAMLAAPLIAGNDIRNMDKETVEVLTNKDIIAINQDSLGIQG 316
Query: 279 ----AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
+++ Q W PL G + +R +++ W + I+
Sbjct: 317 FKHTSVDSLQTWLKPLKGGDWAICFLNRSKSEKVVSLDWKKVIIQ 361
>gi|297610710|ref|NP_001064935.2| Os10g0492900 [Oryza sativa Japonica Group]
gi|255679519|dbj|BAF26849.2| Os10g0492900, partial [Oryza sativa Japonica Group]
Length = 138
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 85/128 (66%), Positives = 101/128 (78%)
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
I S QG+DYLKYDNC N D +P RY MS AL AGRPI+FS+CEWGDM PA WG
Sbjct: 2 IGSVTPLSQGVDYLKYDNCNNGDLKPLERYPEMSKALMMAGRPIYFSLCEWGDMHPAKWG 61
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ NSWRTT+DI+D+W M++ AD NEV+AD+A+PGGWNDPDMLEVGNGGM +EY+VH
Sbjct: 62 AAYGNSWRTTNDIADTWESMVSRADENEVWADYARPGGWNDPDMLEVGNGGMTNDEYIVH 121
Query: 241 FSIWAISK 248
FS+WAISK
Sbjct: 122 FSLWAISK 129
>gi|389740758|gb|EIM81948.1| alpha-galactosidase [Stereum hirsutum FP-91666 SS1]
Length = 450
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 160/343 (46%), Gaps = 70/343 (20%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
SNGL +TP MGWN+WNHF C I+E I + +L+ + + +
Sbjct: 27 SNGLAITPQMGWNTWNHFGCDISEDTIVSAAQAF------VNYNLTQYGYEYILMDDCWQ 80
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRIE 122
P + G VA+ FP+G K L+D +H + SLD I+
Sbjct: 81 APARDNATGAPVADPDKFPNGVKDLSDKIHAMGLKFGIYSSAGTYTCGGKFGSLDYEEID 140
Query: 123 SNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
+ + G+DYLKYDNC+N T RY MS AL GRPI +SMC WG+ P
Sbjct: 141 AQTYADWGVDYLKYDNCYNQGRAGTPLISYERYNNMSMALNATGRPILYSMCNWGEDGPW 200
Query: 178 LWGSNIRNSWRTTDDISDSW---------------------ARMLTIADMNEVYADHAKP 216
+ I NSWR + DI D + M I D A P
Sbjct: 201 NFAVTIANSWRISGDIFDVYDGYDDRCPCTSMLDCKLAGFHCSMTRIIDFAAPLGQKAGP 260
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WND DMLEVGNGGM Y+EYV HFS+W+I K+PL+LG DV ++T +T+ II N+ +IA+
Sbjct: 261 GKWNDLDMLEVGNGGMSYDEYVTHFSMWSILKSPLILGNDVTDMTNDTLEIITNDAIIAL 320
Query: 277 NQ--------------------AINDQQIWAGPLSGNRIVVLL 299
NQ D +W G L+ + V+ L
Sbjct: 321 NQDSSGSPAVRIWKKATNDTGSGTGDLSLWVGSLANSEYVIAL 363
>gi|307105141|gb|EFN53392.1| hypothetical protein CHLNCDRAFT_137182 [Chlorella variabilis]
Length = 436
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 130/381 (34%), Positives = 185/381 (48%), Gaps = 53/381 (13%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L +G TPPMG+N+WN F +INE +IR+ D R ++ L + + F L
Sbjct: 24 LRDGQATTPPMGFNTWNAFGARINEDLIRDAAD------RIVELGLRDAGYTFLVLDD-G 76
Query: 80 AKPLPNQCQGNLVANKTTFPSGT---------KALADYV--------------HIL---I 113
+ G L ++ F SG KALADYV H
Sbjct: 77 WSDVERTGDGRLQGDRERFSSGAPLLLLLLRIKALADYVRGKGLRLGIYSDSGHFTCQGF 136
Query: 114 SNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
S D+ E +F + G+DYLKYDNCF D R+ AM DAL GRP +S+ EWG
Sbjct: 137 PGSRDHEREDAQSFADWGVDYLKYDNCFVHDDL-LGRFVAMRDALNATGRPFVYSLSEWG 195
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG- 231
P +WG + ++WRTT D SW ++ D + A +A PG WND D+LEVG G
Sbjct: 196 IGDPWVWGPQVAHAWRTTFDSHPSWPSIMLNLDESVALARYAGPGAWNDLDLLEVGPTGS 255
Query: 232 ------MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---AIND 282
+ + E HF++WA+ K+PL + ++ L+ T+ I+ EVIAVNQ +
Sbjct: 256 PNARSYLSHQEEQAHFALWALLKSPLFVAANLRQLSPVTLAILKTHEVIAVNQDELGVPG 315
Query: 283 QQIWAGPLS-GNRIVVLLESRKTF-SSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQ 335
++A PL+ G R VVLL + +S +T W +G + LF+ R + ++
Sbjct: 316 DLVYAAPLADGGRAVVLLNRHSQYQASNLTLRWQLVGYPPDTRVVARDLFQERDI-GQYA 374
Query: 336 GKLTAKVDGHSCKMYELTPVT 356
G LTA+V H LTPV
Sbjct: 375 GSLTAEVHVHGVVALRLTPVA 395
>gi|414588075|tpg|DAA38646.1| TPA: putative SERRATE-related C2H2 zinc-finger family protein [Zea
mays]
Length = 641
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 122/207 (58%), Gaps = 32/207 (15%)
Query: 90 NLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAFEQ-GI 131
L+ + TFPSG K LADYV + SL + + F GI
Sbjct: 303 QLLPDPKTFPSGIKYLADYVDGKGLKLGIYSDAGKFTCQVRPGSLGHENDDVAIFASWGI 362
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKY P +R DAL G IF+S+CEWG PALW + NSWRTTD
Sbjct: 363 DYLKY---------PPMR-----DALNSTGHQIFYSLCEWGQYDPALWAGKVGNSWRTTD 408
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI+D+W M IAD N +A +A PGGWNDPDMLEV NGGM EY HFSIWA+ KAPL
Sbjct: 409 DITDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVDNGGMTLAEYRSHFSIWALMKAPL 468
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ 278
L+GCDV N+T+ETM I+ N+EVI VN+
Sbjct: 469 LIGCDVRNMTSETMEILSNKEVIQVNK 495
>gi|433802364|gb|AGB51443.1| alpha galactosidase isoform 2, partial [Cocos nucifera]
Length = 210
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 103/236 (43%), Positives = 123/236 (52%), Gaps = 64/236 (27%)
Query: 31 GWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFHM 71
GWNSWNHF C I+E++I+ET DDCW ES+R SQ
Sbjct: 1 GWNSWNHFACNIDEQMIKETADALVSTGLARLGYQYVNLDDCWAESNRGSQ--------- 51
Query: 72 FDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LI 113
GNL A +TFPSG KALADYVH +
Sbjct: 52 -----------------GNLAAKSSTFPSGIKALADYVHAKGLKLGIYGDAGFYTCSKQM 94
Query: 114 SNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
SL + + F +DYLKYDNC+ND + P RY MS AL +GR IFFS+CEWG
Sbjct: 95 PGSLGHEDQDAKTFASWEVDYLKYDNCYNDGSSPQDRYNPMSKALLNSGRTIFFSLCEWG 154
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
+ PA W + NSWRTT DI D W M++ AD N+ +A HA PGGWNDPDMLEVG
Sbjct: 155 EADPATWARGVGNSWRTTGDIQDKWESMISRADENDKWAGHAGPGGWNDPDMLEVG 210
>gi|449550724|gb|EMD41688.1| glycoside hydrolase family 27 protein [Ceriporiopsis subvermispora
B]
Length = 407
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 171/368 (46%), Gaps = 105/368 (28%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGWN+WNHF C I+E +I DDCW ++R++
Sbjct: 1 MGWNTWNHFGCSIDEDLILSAAKAMIQYDLPSYGYNYVNMDDCWHAATRNA--------- 51
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ 112
+ P P+ T FP+G KAL+D +H +
Sbjct: 52 --------SGAPQPD---------PTKFPNGIKALSDQIHSMGLKFGIYSDAGTMTCGGH 94
Query: 113 ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFS 167
+ I++ E G+DYLKYDNCFN T RY MS AL GRPI +S
Sbjct: 95 FGSLGYEEIDAQTYAEWGVDYLKYDNCFNQGLSGTPKLSYDRYNNMSVALNNTGRPILYS 154
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADM 206
MC WG+ P + S I NSWR + D+ D++ R M I D
Sbjct: 155 MCNWGEDGPWNFASTIANSWRISGDVYDNYDRWDVRCPCDSFIDCKLAGFHCAMTRIIDF 214
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
A PG WND DMLEVGNGGM Y+EYV HFS+WA+ K+PL+LG DV N+T ET+
Sbjct: 215 AAPLGQKAGPGHWNDLDMLEVGNGGMTYDEYVTHFSMWALLKSPLILGNDVTNMTNETLS 274
Query: 267 IIGNEEVIAVNQ-------------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMT 310
II N+ +IAVNQ ++++ ++W+G L ++ ++ L + + +T
Sbjct: 275 IITNDAIIAVNQDSAGSPATRQWKRSVSEGGYLELWSGSLVNSQFIIALMNTSPTNQTVT 334
Query: 311 AHWDDIGI 318
++D+ I
Sbjct: 335 VSFEDVFI 342
>gi|298351556|sp|A4DA70.2|AGALD_ASPFU RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
gi|298351562|sp|B0YEK2.2|AGALD_ASPFC RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
Length = 648
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 182/389 (46%), Gaps = 67/389 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
+ NGL TP MGWNS+N++ C NE IIR S+ + + L + + +T
Sbjct: 23 IDNGLARTPQMGWNSYNYYSCSPNEAIIR--------SNAKALVDLGLAELGYRYVTTDC 74
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------ISNSLDNRIE 122
A LPN G L N+T FPSG A+ +Y+H L + S +++
Sbjct: 75 GWSVADRLPN---GTLTWNETLFPSGFPAMGEYLHELGLLFGVYGDSGTKLCGSPPDQVG 131
Query: 123 SNIAFEQ--------GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S EQ G D LKYDNC++D T P RY MS AL + GR
Sbjct: 132 SLYHEEQDAKTFAEWGADSLKYDNCYSDAATNYPNVNYEPSTSPRPRYEIMSSALARVGR 191
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI F +CEWG PALW + NSWR +DI +W + + D A PG W D
Sbjct: 192 PILFQICEWGIDFPALWAPALGNSWRIGNDIIPAWRSIFRTLNQAVPNTDFAGPGQWADL 251
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGNG E HFS+WAI K+PL +G D +++ ++ ++ ++VI NQ
Sbjct: 252 DMLYVGNGVFSLPEEQTHFSLWAILKSPLTIGAALKDDDTSISQASLEVLKQKDVIGFNQ 311
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF 325
+ ++W+GPLSGNR VV + + + S +T D+G++
Sbjct: 312 DALGVSASLKRRWSDEGYEVWSGPLSGNRTVVAVINWRNESRDLTLDLPDVGLQYAQVAR 371
Query: 326 E--HRTLETKFQGKLTAKVDGHSCKMYEL 352
+T+ + TA V GH + EL
Sbjct: 372 NIWGKTVVRDVRTSYTAGVAGHGTILLEL 400
>gi|390601837|gb|EIN11230.1| glycoside hydrolase [Punctularia strigosozonata HHB-11173 SS5]
Length = 439
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/358 (34%), Positives = 165/358 (46%), Gaps = 102/358 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
++NGL TP MGWN+WN F C INE++I + DDCW R
Sbjct: 23 VNNGLARTPVMGWNTWNRFVCGINEELILSSAKLLVSTGLKDAGYNYVNVDDCWHAPERA 82
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISN 115
S G+ + TFP G KALAD VH L I +
Sbjct: 83 SD--------------------------GSPAWDPNTFPRGIKALADDVHDLGLKFGIYS 116
Query: 116 SLDN-------------RIESNIAFEQGIDYLKYDNCFNDDT--RPTIRYRAMSDALKKA 160
S I++ E G+D LKYDNCFND T+ Y M++AL
Sbjct: 117 SAGTMTCQRRFGSLGYEEIDAKAYAEWGVDLLKYDNCFNDGLFGNETVSYAKMANALNAT 176
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDS---------------------WAR 199
GRPI +SMC WG W I N WR + DISD +
Sbjct: 177 GRPIVYSMCNWGQDLSWTWAGKIANMWRMSGDISDDFDGYDSRCPCLQLENCTEFGYYCS 236
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLE----VGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
+ I D ++++PG WND DMLE VGNGGM Y+EYV HFS+WA+ K+PL+LG
Sbjct: 237 AVRILDWAAAMLEYSEPGAWNDLDMLEAGTYVGNGGMSYDEYVSHFSLWALVKSPLILGN 296
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPL-SGNRIVVLLES 301
D+ ++ ET+ II N+ +IA NQ +D ++WAGPL G+ +V +L +
Sbjct: 297 DLAQMSDETLEIITNDAIIAANQDPLGVPAKRVWKRDDLELWAGPLHDGSTVVAVLNT 354
>gi|414588074|tpg|DAA38645.1| TPA: putative SERRATE-related C2H2 zinc-finger family protein [Zea
mays]
Length = 538
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 102/207 (49%), Positives = 122/207 (58%), Gaps = 32/207 (15%)
Query: 90 NLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAFEQ-GI 131
L+ + TFPSG K LADYV + SL + + F GI
Sbjct: 200 QLLPDPKTFPSGIKYLADYVDGKGLKLGIYSDAGKFTCQVRPGSLGHENDDVAIFASWGI 259
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKY P +R DAL G IF+S+CEWG PALW + NSWRTTD
Sbjct: 260 DYLKY---------PPMR-----DALNSTGHQIFYSLCEWGQYDPALWAGKVGNSWRTTD 305
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI+D+W M IAD N +A +A PGGWNDPDMLEV NGGM EY HFSIWA+ KAPL
Sbjct: 306 DITDTWKSMTDIADKNNKWASYAGPGGWNDPDMLEVDNGGMTLAEYRSHFSIWALMKAPL 365
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ 278
L+GCDV N+T+ETM I+ N+EVI VN+
Sbjct: 366 LIGCDVRNMTSETMEILSNKEVIQVNK 392
>gi|397691600|ref|YP_006528854.1| alpha-galactosidase [Melioribacter roseus P3M]
gi|395813092|gb|AFN75841.1| alpha-galactosidase [Melioribacter roseus P3M]
Length = 406
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 137/394 (34%), Positives = 183/394 (46%), Gaps = 81/394 (20%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDS---QISLSCSFH-MFDSLTQ 77
GL LTPPMGWNSWN F C I+E +I+E D ES +D+ ++L +H DSL
Sbjct: 30 GLALTPPMGWNSWNKFACDIDETLIKEIADAIVESGMKDAGYIYVNLDDCWHGERDSLGF 89
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-------- 129
I P FPSG KALADY+H S L I S ++
Sbjct: 90 IHPDP-------------ERFPSGMKALADYIH---SKGLKIGIYSCAGYKTCGGRPGSR 133
Query: 130 -------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYD C + Y + DA++ AGRP+ S+CEWGD +P
Sbjct: 134 GYEYQDALTYAKWGIDYLKYDWCNTEGLCAEGAYMTIRDAIRSAGRPMVLSICEWGDNQP 193
Query: 177 ALWGSNIRNSWRTTDDISD------------SWARMLTIADMNEVYADHAKPGGWNDPDM 224
WG ++ + WRTT DI++ SW + I DM E +A PG WNDPDM
Sbjct: 194 WEWGKDVGHLWRTTGDITNCFDCFVDHGTWKSWG-VTYILDMQEGLRQYAGPGHWNDPDM 252
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
LEVGN GM NE HFS+W + APL+ G D+ N++ ET+ I+ N+E IAV+Q
Sbjct: 253 LEVGN-GMSVNEDRAHFSMWCMLAAPLIAGNDIRNMSKETLEILTNKEAIAVDQDPLGIQ 311
Query: 279 -----AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD---------DIGIKSNNTL 324
+ + +IW PL + + +R W D + + N
Sbjct: 312 GFKYSSEDSVEIWFKPLVNDEWAFCILNRNETDKEFVFDWQKEKVIDELFDKKLDTQNKT 371
Query: 325 FEHRTL-ETKFQGK----LTAKVDGHSCKMYELT 353
+ R L + +F G L A V H M L
Sbjct: 372 YRLRDLWKHEFVGTTDEPLKAVVPSHDVLMLRLV 405
>gi|256394049|ref|YP_003115613.1| ricin B lectin [Catenulispora acidiphila DSM 44928]
gi|256360275|gb|ACU73772.1| Ricin B lectin [Catenulispora acidiphila DSM 44928]
Length = 547
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 133/392 (33%), Positives = 178/392 (45%), Gaps = 86/392 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+N L LTP MG+N WN + C ++E +I+ T DDCW +RD
Sbjct: 41 LNNQLALTPQMGFNDWNAYGCNVSESLIKSTALAMHNDGMQAAGYQYVNIDDCWLTHNRD 100
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S G LV + FP G A YVH L
Sbjct: 101 SS--------------------------GQLVPDPAKFPDGISGTAAYVHSLGLKLGIYE 134
Query: 113 ---------ISNSLDN-RIESNIAFEQGIDYLKYDNCFND------DTRPTI--RYRAMS 154
SL + + ++N G+DYLKYDNC+ D D P+ RY M
Sbjct: 135 DAGTMTCAGYPGSLGHEQTDANSFAAWGVDYLKYDNCYADGVHQSGDGGPSAQSRYTTMR 194
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
DAL K GRPI FS+C WG WGS + NSWRTT DI+ S+ ML+I N A HA
Sbjct: 195 DALAKTGRPILFSLCSWGLDSVWNWGSGVGNSWRTTGDINASFGSMLSIFHSNVGLASHA 254
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDMLEVGN GM E FS+WA APL+ G ++ N ++ T+ + N VI
Sbjct: 255 GPGAWNDPDMLEVGN-GMSATEDRSEFSLWAEMAAPLISGTNLANASSTTLATLTNSRVI 313
Query: 275 AVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI--KSN 321
AV+Q +Q + A PL+ + V L + + ++ + IG S
Sbjct: 314 AVDQDSLGKQGTEVSSSGGLDVLAKPLANGDVSVALFNENSGTATINTTVAAIGKTGASG 373
Query: 322 NTLFEHRTLETK-FQGKLTAKVDGHSCKMYEL 352
TL + + T G ++A V GH M+ +
Sbjct: 374 YTLTDLWSGATSTTTGAISASVPGHGTVMFRV 405
>gi|452819290|gb|EME26353.1| alpha-galactosidase [Galdieria sulphuraria]
Length = 535
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 190/379 (50%), Gaps = 52/379 (13%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMG+++WN +HCQ + + + +T ++ +D+ + +++ A
Sbjct: 27 LDNGLARTPPMGYDTWNFYHCQYDGETLMKT----AKAMKDAGMLELGYEYIYPDDCWEA 82
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNRIESNIA-----FEQ 129
+ P+ G+L N FP G K + DY+H L I N R + +E+
Sbjct: 83 PERAPD---GSLQPNPFKFPHGIKPVIDYIHSLGFKFGIYNCAGTRTCAGFPGSYGHYEE 139
Query: 130 --------GIDYLKYDNC---------FNDDTRPTIRYRAMSDALKKAGRPIFFSMCE-W 171
G+DY+K+D C ++ + Y DAL K GR I FSMC W
Sbjct: 140 YAQQFADWGVDYIKFDWCNVPFWEFPGWSHEQVAQKLYSDFRDALNKTGRHIVFSMCNGW 199
Query: 172 G-DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
D+ P W S++ N WRTTDDI+D++ M + N ++ A PG WN+PDMLEVGNG
Sbjct: 200 DPDVYPWRWASDVANLWRTTDDIADNYDVMRDRYEQNILHGSKAGPGHWNNPDMLEVGNG 259
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIND-------- 282
GM EY+ HFS+W+I APL++G DV N+T T I+ N+EVI V+Q +
Sbjct: 260 GMTTEEYITHFSLWSIMAAPLVVGTDVINMTEATKMILTNKEVIDVDQDVLGVQGIRVKS 319
Query: 283 ---QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL----ETKFQ 335
Q + PL+ + ++L S ++ + + + S ++ R L ET +
Sbjct: 320 DIAQDVVTKPLANGDVAIVLFSSNG-PGVINTTIEQVRMPSYYDVYILRDLWTKQETTIR 378
Query: 336 GKLTAKVDGHSCKMYELTP 354
++A V GH KMY + P
Sbjct: 379 HFISAYVQGHGVKMYRIRP 397
>gi|19115924|ref|NP_595012.1| alpha-galactosidase, melibiase [Schizosaccharomyces pombe 972h-]
gi|51701357|sp|Q9URZ0.1|AGAL_SCHPO RecName: Full=Alpha-galactosidase mel1; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|6224598|emb|CAB60017.1| alpha-galactosidase, melibiase [Schizosaccharomyces pombe]
Length = 436
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 179/359 (49%), Gaps = 66/359 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGL P MGWNSWN + C I+E II E + + L + + D +K
Sbjct: 27 NGLGLKPQMGWNSWNKYACDIDESIILNNAKAIKE---EGLLDLGYEYIVMD---DCWSK 80
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDN-RIES 123
N G L AN FP+G ++A +H + SL++ +I++
Sbjct: 81 HERNATTGRLEANPDKFPNGIGSMAKKLHDMGFKFGMYSSAGKYTCAGFPGSLNHEQIDA 140
Query: 124 NIAFEQGIDYLKYDNCFNDDTR--PTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ + G+DYLKYDNCFN+ P I RY+ MSDAL K GRPIF+S+C+WG+
Sbjct: 141 DTFADWGVDYLKYDNCFNEGKSGVPLISYERYKRMSDALNKTGRPIFYSLCQWGEDFVWN 200
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WG+ I NSWR + DI D+++R M I+ + + +
Sbjct: 201 WGNTIANSWRISGDIFDTFSRKDVRCPCETIECFALQGDHCSVMNIISKASFLSSKAGMN 260
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GWND D LEVGNGGM + EY HF++WAI K+PL+LG DV +++ I+ N+E+I++
Sbjct: 261 SGWNDLDSLEVGNGGMSFEEYKTHFTMWAILKSPLILGNDVSSMSPMDKLIVSNKELISI 320
Query: 277 NQAINDQ---------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
NQ I ++++G LS N VV + + + M H DI + +
Sbjct: 321 NQDIGTNPAALIWKKKYGDEYIELFSGRLSNNDWVVAVLNAASEPLKMGIHLSDIFVDA 379
>gi|347829162|emb|CCD44859.1| glycoside hydrolase family 27 protein [Botryotinia fuckeliana]
Length = 502
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 152/309 (49%), Gaps = 52/309 (16%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
R + ++NGL TP MGW++WN C ++E+++ +T D + L + + D
Sbjct: 17 RVDAVNNGLARTPQMGWDNWNALGCDVSEELLLQTADLIVDYGLKD---LGYQYVILDDC 73
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD 118
N LVA+ FP G KA+AD +H L + SL
Sbjct: 74 WSNGRNASDNN---TLVADADKFPQGMKAVADAIHDLGLKFGMYSDAGEYTCGGYAGSLG 130
Query: 119 -NRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWG 172
+++N GIDYLKYDNC+N T RY+ M+ AL GRPI +S+C WG
Sbjct: 131 YETVDANYFASVGIDYLKYDNCYNTGQAGTQYLSSQRYQVMAKALNATGRPIVYSLCNWG 190
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEV 209
+ P WGS NSWR + D+ DSW R ++ I +
Sbjct: 191 EDSPWNWGSTTANSWRISGDVYDSWDRADARCPCDGPDAFNCVLPGFHCSIVNIMNKASF 250
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A+PG WND DMLEVGNGGM +EYV HFS+WA+ K+PL+LG D+ + + I+
Sbjct: 251 IVSKAQPGAWNDLDMLEVGNGGMTDSEYVAHFSMWAVVKSPLILGNDLREIQPADLAILS 310
Query: 270 NEEVIAVNQ 278
N VIAVNQ
Sbjct: 311 NPAVIAVNQ 319
>gi|154311387|ref|XP_001555023.1| hypothetical protein BC1G_06546 [Botryotinia fuckeliana B05.10]
Length = 460
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 152/309 (49%), Gaps = 52/309 (16%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
R + ++NGL TP MGW++WN C ++E+++ +T D + L + + D
Sbjct: 17 RVDAVNNGLARTPQMGWDNWNALGCDVSEELLLQTADLIVDYGLKD---LGYQYVILDDC 73
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD 118
N LVA+ FP G KA+AD +H L + SL
Sbjct: 74 WSNGRNASDNN---TLVADADKFPQGMKAVADAIHDLGLKFGMYSDAGEYTCGGYAGSLG 130
Query: 119 -NRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWG 172
+++N GIDYLKYDNC+N T RY+ M+ AL GRPI +S+C WG
Sbjct: 131 YETVDANYFASVGIDYLKYDNCYNTGQAGTQYLSSQRYQVMAKALNATGRPIVYSLCNWG 190
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEV 209
+ P WGS NSWR + D+ DSW R ++ I +
Sbjct: 191 EDSPWNWGSTTANSWRISGDVYDSWDRADARCPCDGPDAFNCVLPGFHCSIVNIMNKASF 250
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A+PG WND DMLEVGNGGM +EYV HFS+WA+ K+PL+LG D+ + + I+
Sbjct: 251 IVSKAQPGAWNDLDMLEVGNGGMTDSEYVAHFSMWAVVKSPLILGNDLREIQPADLAILS 310
Query: 270 NEEVIAVNQ 278
N VIAVNQ
Sbjct: 311 NPAVIAVNQ 319
>gi|358397432|gb|EHK46807.1| glycoside hydrolase family 27 protein [Trichoderma atroviride IMI
206040]
Length = 496
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 165/365 (45%), Gaps = 114/365 (31%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGL TP MGWN+WN F C ++ ++ +T DDCW S RD
Sbjct: 22 NGLARTPQMGWNNWNTFACSVSSTLLLDTSKLLTEYGLQDLGYKYVVLDDCW-SSGRDDN 80
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------ 110
G LVA+ FP G A+AD +H
Sbjct: 81 --------------------------GKLVADSAKFPDGMGAVADALHEQGFLFGMYSSA 114
Query: 111 -----ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKK 159
+ SLD +F G+DYLKYDNC++ T R+ AM++A+KK
Sbjct: 115 GEMTCARYAGSLDYEEADAQSFADWGVDYLKYDNCYHMGRFGTPLVSFNRFNAMAEAIKK 174
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR-------------------- 199
GR + +S+C WG+ WG +I NSWR + DI DS+AR
Sbjct: 175 TGRAMLYSLCSWGEDYVHTWGGSIANSWRISGDIYDSFARPDDLCSCTNAADPECIAPGT 234
Query: 200 ---MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
+L I + Y D PGGWND DMLEVG+GGM EY HFS+WA K+PLLLG D
Sbjct: 235 HCSVLAIVNKVAPYIDRGLPGGWNDLDMLEVGHGGMTEEEYKAHFSMWAALKSPLLLGND 294
Query: 257 VGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPLS-GNR 294
+ ++TA + I+ N +IA+NQ + + +W+GPL+ G++
Sbjct: 295 LRSMTASALAIVNNPAIIALNQDPRGRAVQRIQRNLDVPRDRHGVGEAHVWSGPLANGDQ 354
Query: 295 IVVLL 299
+V+ L
Sbjct: 355 VVIFL 359
>gi|212538145|ref|XP_002149228.1| alpha-galactosidase, putative [Talaromyces marneffei ATCC 18224]
gi|210068970|gb|EEA23061.1| alpha-galactosidase, putative [Talaromyces marneffei ATCC 18224]
Length = 646
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 186/394 (47%), Gaps = 69/394 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ---- 77
N L TP MGWN++NH+ C NE I+ S+ + + L + + +T
Sbjct: 18 NRLAQTPQMGWNTYNHYGCSPNEAIVH--------SNAQALVDLGLASLGYRYVTTDCGW 69
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIESN 124
A LPN G+L + T FPSG AL Y+H IL+ S N+I S
Sbjct: 70 TVADRLPN---GSLTWDSTRFPSGFPALGQYLHGLGLLFGVYQDAGILLCGSPPNQIGSL 126
Query: 125 IAFEQ--------GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGRPI 164
Q G+D LKYDNC++D T P RY MS+AL + R I
Sbjct: 127 YHELQDAQTFASWGVDSLKYDNCYSDAATNYPNVNYAPSTSPEPRYANMSNALSQQDRAI 186
Query: 165 FFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDM 224
F +CEWG PA W ++ NSWR +DIS SW + + + D A PG W D DM
Sbjct: 187 LFQVCEWGVDFPAAWAPSLGNSWRIGNDISSSWTSIYRLVNQIAPQTDFAGPGQWPDLDM 246
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN----LTAETMPIIGNEEVIAVNQ-- 278
LEVGN E HFS+WAI K+PL++G + + + E++ I+ ++VI+ NQ
Sbjct: 247 LEVGNNVYSTAEEQTHFSLWAILKSPLVIGAALKDAATSINEESLAILKQKDVISYNQDE 306
Query: 279 -----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---L 324
++ +W+GPLSG RIV L + +T DIGI+ +
Sbjct: 307 LSVSANLSRRYTEDEYDVWSGPLSGKRIVAALVNWADEKRYLTIALPDIGIQYAGMVKDI 366
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+++ T++ + TA+V+ H + EL T L
Sbjct: 367 WKNVTIQ-DVKTSYTAEVEAHGTMLLELQNTTSL 399
>gi|190348102|gb|EDK40496.2| hypothetical protein PGUG_04594 [Meyerozyma guilliermondii ATCC
6260]
Length = 428
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/402 (33%), Positives = 192/402 (47%), Gaps = 77/402 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
NGLG P MGWNSWN F C INE +I +T + + ++ L + + +
Sbjct: 17 FDNGLGKKPQMGWNSWNKFACNINETVILQT------AEKMKELGLLEYGYEYIVMDDCY 70
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRI 121
A + +V + FP+G ++L+ +H L SL +I
Sbjct: 71 ALKERDPISHKMVEDAAKFPNGIRSLSRRIHELGFKFGMYSSAGKYTCAGYPGSLHYEKI 130
Query: 122 ES-NIAFEQGIDYLKYDNCFNDDTR--PTI---RYRAMSDALKKAGRPIFFSMCEWGDMR 175
++ + IDYLKYDNCFN+ P I RY AMS AL + GRP+F+S+C+WG+ +
Sbjct: 131 DAETFVNDWEIDYLKYDNCFNEGNSGTPQISYQRYEAMSKALLQTGRPVFYSLCQWGEDQ 190
Query: 176 PALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHA 214
WGS + NSWR T DI DS+ R M I + A
Sbjct: 191 VWDWGSTVANSWRITGDIYDSFDRYDDRCPCQTYDCRAVQGGMCSMTNILEKAVPLGQKA 250
Query: 215 KP-GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
P GWND D LEVGNGGM +EY HF++WAI K+PL+LG DV +++ E I+ N+ +
Sbjct: 251 GPFSGWNDLDSLEVGNGGMSTDEYKAHFTLWAILKSPLVLGNDVTDMSKEDFNIVTNKAI 310
Query: 274 IAVNQAIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL------- 324
IA+NQ ++ ++W P+SG + + K + +T + G K NNT+
Sbjct: 311 IAINQDDSNPAYRVWKKPVSGGHLHLFTNILKDGTFAVTLF--NSGNKVNNTVLNFEDIF 368
Query: 325 ----------FEHRTL---ETKFQGK-LTAKVDGHSCKMYEL 352
FE L ET K L+ +D HS K++ L
Sbjct: 369 LTDRVNAAKSFEFTELWTNETTLVSKELSTSIDAHSVKIWWL 410
>gi|51701639|sp|Q99172.1|MEL_ZYGCI RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|407012|gb|AAA35280.1| alpha-galactosidase [Lachancea cidri]
Length = 469
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 128/366 (34%), Positives = 174/366 (47%), Gaps = 80/366 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGLGLTP MGW++WN F C + E+++ T DDCW S R S
Sbjct: 23 NGLGLTPQMGWDNWNSFGCSVKEELLLGTAEKIVKLGLKDLGYNYIILDDCW-SSGRSSN 81
Query: 63 ISLSCSFHMFDSLTQIAAKPLPN-QCQGNLVANKTTFPSGTKALADYVHILISNSLDNRI 121
SL F + A+ L N Q + + ++ +G A Y L +D
Sbjct: 82 GSLLADDSKFPHGMKYVAEQLHNSQLKFGMYSS-----AGEYTCAGYAGSLGYEDMDAAT 136
Query: 122 ESNIAFEQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
++ +DYLKYDNC+N P I RY+AMSDAL K GRPIF+S+C WG
Sbjct: 137 FAS----WDVDYLKYDNCYNKGEFGTPEISYKRYKAMSDALNKTGRPIFYSLCNWGQDLT 192
Query: 177 ALWGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHA 214
WGS I NSWR + D+ + R ++ I + +A
Sbjct: 193 FYWGSAISNSWRMSGDVYPQFDRPDSRCPCSGDEYDCSYPGFHCSIMNILNKAAPMGQNA 252
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWND DMLEVG G M +E V HFS+WAI K+PL++G D+ +L ++ + N VI
Sbjct: 253 APGGWNDLDMLEVGVGNMSDSEEVAHFSMWAIVKSPLIIGADIDDLKDSSLSVYSNPAVI 312
Query: 275 AVNQAI---------------NDQ------QIWAGPL-SGNRIVVLLESRKTFSSMMTAH 312
A+NQ + DQ Q+W+GPL +G+ +V LL S M A
Sbjct: 313 AINQDVLGTPATRIWKYHVSDKDQYGEGEIQLWSGPLDNGDHVVALLNGGNNERS-MNAS 371
Query: 313 WDDIGI 318
W+DI I
Sbjct: 372 WNDIFI 377
>gi|319642861|ref|ZP_07997497.1| alpha-galactosidase [Bacteroides sp. 3_1_40A]
gi|317385409|gb|EFV66352.1| alpha-galactosidase [Bacteroides sp. 3_1_40A]
Length = 499
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 130/411 (31%), Positives = 196/411 (47%), Gaps = 72/411 (17%)
Query: 1 LFIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GE 56
++ V A + +G D + LL N L LTPPMGW+SWN F + ++I+E D
Sbjct: 96 VYKVKLVAENKSGKDEKELLINIGDALALTPPMGWSSWNTFGRHLTGELIKEVADAMVAN 155
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
RD L ++ D Q+A + G++ +K FP G K +ADY+H
Sbjct: 156 GMRD----LGYAYINIDDFWQLAER----GADGHMQVDKEKFPDGIKPIADYLHERGFKL 207
Query: 112 -LISNSLDNR------------IESNIAFEQGIDYLKYDNCFNDDTRPTI--RYRAMSDA 156
+ S++ D ++N G+D LKYD C D R RY M A
Sbjct: 208 GIYSDAADKTCGGVCGSYGHEVTDANDFASWGVDLLKYDYCNAPDGRQDAMERYTVMGKA 267
Query: 157 LKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLT--------IADMN 207
L+ GR I FS+CEWG P W ++ + WRTT DI D W+ +T I D+N
Sbjct: 268 LRATGRSIVFSICEWGQREPWKWAKSVGGHYWRTTGDIGDHWSNAVTGPGWGVMEITDIN 327
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGCD 256
++A PGGWNDPDML VG G + +Y H S+W + +PLL G D
Sbjct: 328 GKLDNYAGPGGWNDPDMLIVGISGKSMSIADAKSGCSDEQYRSHMSLWCMMASPLLSGND 387
Query: 257 VGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTF 305
V ++T+ T+ + N E+IA+NQ + +Q +WA PLS + V+ +R F
Sbjct: 388 VRDMTSVTLETLTNPEIIAINQDVLGRQAERVIRHDKYDVWAKPLSDGSVAVMCLNR--F 445
Query: 306 SSMMTAHWDD---IGIKSNNTLFE---HRTLETKFQGKLTAKVDGHSCKMY 350
+T H +D G+ +N + + H+ L KF + ++ + CK++
Sbjct: 446 DEPVTVHLNDKTIPGVTTNTGIRDVWLHKEL-GKFPQGMNVELGAYQCKVF 495
>gi|396486397|ref|XP_003842406.1| similar to alpha-galactosidase precursor [Leptosphaeria maculans
JN3]
gi|312218982|emb|CBX98927.1| similar to alpha-galactosidase precursor [Leptosphaeria maculans
JN3]
Length = 508
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 176/353 (49%), Gaps = 77/353 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL +TPPMGWN+WN F C ++EK++ +T S++ + L + + L
Sbjct: 25 LNNGLAITPPMGWNNWNSFGCDVSEKLLLDT------SAQLVHLGLRDLGYQYVVLDDCW 78
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
P G L + + FP+G A+++++H L SLD+ +
Sbjct: 79 QDPKGRDGTGKLQVDYSKFPNGLNAISEHLHGLGLKYGMYSSAGEMTCARFQGSLDHEGD 138
Query: 123 SNIAFEQ-GIDYLKYDNCFN-----DDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+D LKYD+C++ T R++AM+DALK GR I F++C WG+
Sbjct: 139 DAQSFADWGVDMLKYDSCYHMGRVGTPTASFNRFKAMADALKATGRNILFNLCNWGEDLV 198
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-------------------------MLTIADMNEVYA 211
WG +I NSWR T DI DS+ R +L I + +A
Sbjct: 199 HTWGMSISNSWRITGDIYDSFTRPDDLCGCSTLPLGATNCIAPGTHCSVLFILNKVAPFA 258
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM +EY HF++WA K+PL LG D+ +++A + I+ N
Sbjct: 259 DRSIPGGWSDLDMLEVGQGGMTDDEYKAHFALWAALKSPLFLGNDLRSMSAAALSIVNNP 318
Query: 272 EVIAVNQ----------------------AINDQQIWAGPLS-GNRIVVLLES 301
+IA++Q + + IWAG L+ G+++V+LL +
Sbjct: 319 AIIALSQDPHGRSATRVRRETQDVAKDEWGVGELHIWAGHLANGDQVVILLNA 371
>gi|146323621|ref|XP_001481538.1| alpha-galactosidase [Aspergillus fumigatus Af293]
gi|129555310|gb|EBA27184.1| alpha-galactosidase, putative [Aspergillus fumigatus Af293]
gi|159122224|gb|EDP47346.1| alpha-galactosidase, putative [Aspergillus fumigatus A1163]
Length = 663
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 182/404 (45%), Gaps = 82/404 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
+ NGL TP MGWNS+N++ C NE IIR S+ + + L + + +T
Sbjct: 23 IDNGLARTPQMGWNSYNYYSCSPNEAIIR--------SNAKALVDLGLAELGYRYVTTDC 74
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----------------------- 112
A LPN G L N+T FPSG A+ +Y+H L
Sbjct: 75 GWSVADRLPN---GTLTWNETLFPSGFPAMGEYLHELGLLFGVYGDSGTKLCGSPPDQVG 131
Query: 113 ------------ISNSLDNRIESNIAF-EQGIDYLKYDNCFND------------DTRPT 147
++N D+ + F E G D LKYDNC++D T P
Sbjct: 132 SLCKSPSSMILTLANVPDHEEQDAKTFAEWGADSLKYDNCYSDAATNYPNVNYEPSTSPR 191
Query: 148 IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
RY MS AL + GRPI F +CEWG PALW + NSWR +DI +W + +
Sbjct: 192 PRYEIMSSALARVGRPILFQICEWGIDFPALWAPALGNSWRIGNDIIPAWRSIFRTLNQA 251
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAE 263
D A PG W D DML VGNG E HFS+WAI K+PL +G D +++
Sbjct: 252 VPNTDFAGPGQWADLDMLYVGNGVFSLPEEQTHFSLWAILKSPLTIGAALKDDDTSISQA 311
Query: 264 TMPIIGNEEVIAVNQ-------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMT 310
++ ++ ++VI NQ + ++W+GPLSGNR VV + + + S +T
Sbjct: 312 SLEVLKQKDVIGFNQDALGVSASLKRRWSDEGYEVWSGPLSGNRTVVAVINWRNESRDLT 371
Query: 311 AHWDDIGIKSNNTLFE--HRTLETKFQGKLTAKVDGHSCKMYEL 352
D+G++ +T+ + TA V GH + EL
Sbjct: 372 LDLPDVGLQYAQVARNIWGKTVVRDVRTSYTAGVAGHGTILLEL 415
>gi|159900407|ref|YP_001546654.1| alpha-galactosidase [Herpetosiphon aurantiacus DSM 785]
gi|159893446|gb|ABX06526.1| Alpha-galactosidase [Herpetosiphon aurantiacus DSM 785]
Length = 387
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 167/350 (47%), Gaps = 64/350 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWN F I+E +R T D S C ++ + + + +
Sbjct: 8 LAPTPPMGWNSWNMFGSTIHEDSVRATADVLVSSGLKD-----CGYN-YVVIDDCWSTKV 61
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ-- 129
G+LVA+ FPSG KALADYVH L L I S+ A FE+
Sbjct: 62 GRDGNGDLVADPEKFPSGIKALADYVHSL---GLKIGIYSDAAHLTCASYPGSFGFEEQD 118
Query: 130 -------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
GID+LKYD CF D RY M +AL+K R +S+CEWG P LWG
Sbjct: 119 AQLWASWGIDFLKYDFCFAPTDQATAIDRYTRMGEALRKTKRQFLYSLCEWGGRSPQLWG 178
Query: 181 SNIR-NSWRTTDDISDSWARML------------TIADMNEVYADHAKPGGWNDPDMLEV 227
++ + WR T DI DSW + T D+ A++A P WND DML V
Sbjct: 179 RSVGGHMWRVTGDIFDSWVDIWVAPHKYYGVGIDTAIDIAANLAEYAGPDAWNDLDMLVV 238
Query: 228 G--------NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
G GG+ + EY H S+W I+ +PL++GCD+ N+ +T ++ N EV+AVNQ
Sbjct: 239 GLKGKGQISGGGLSFIEYQTHMSLWTIACSPLMIGCDIRNMDRDTTSLLTNREVLAVNQD 298
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
+IW PL+ + V L +R + S ++ DIG+
Sbjct: 299 SLGIAGRRVKQTGTCEIWKKPLADGSLAVALINRGSIGSDLSLRASDIGL 348
>gi|340516346|gb|EGR46595.1| glycoside hydrolase family 27 [Trichoderma reesei QM6a]
Length = 496
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 169/367 (46%), Gaps = 114/367 (31%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRD 60
+ NGL TP MGWN+WN F C ++E ++ + DDCW + RD
Sbjct: 19 IDNGLARTPQMGWNNWNSFGCDVSENLLLDHAQLINEYGLQDLGYQYVVLDDCWSDG-RD 77
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ G L+A+K FP G A+AD +H
Sbjct: 78 SK--------------------------GKLIADKKKFPRGMAAVADDLHSQGFLFGMYS 111
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
+ SLD+ ++ +F + G+DYLKYDNC++ P I R+ M++AL
Sbjct: 112 SAGELTCARYAGSLDHEMDDAQSFADWGVDYLKYDNCYHMGRFGTPLISFERFNKMAEAL 171
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K G+ IF+S+C WG+ W ++I NSWR DI DS+AR
Sbjct: 172 KATGKNIFYSLCNWGEDYSYSWAASISNSWRVFGDIYDSFARPDDLCSCNDPANPACIAP 231
Query: 200 -----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+L I + Y D PGGWND DMLEVG+GGM EY HF+IWA KAPLLLG
Sbjct: 232 GTHCSVLAIINRVVPYIDRGLPGGWNDLDMLEVGHGGMTEEEYRAHFTIWAALKAPLLLG 291
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPLS-G 292
D+ + + I+ N VIA+NQ + + IW+GPL+ G
Sbjct: 292 TDLRKWSGSDLAIVTNPAVIAINQDPRGRAVQRIRRNFNVPKDEWGVGETHIWSGPLANG 351
Query: 293 NRIVVLL 299
++I+V L
Sbjct: 352 DQILVFL 358
>gi|242809757|ref|XP_002485438.1| alpha-galactosidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218716063|gb|EED15485.1| alpha-galactosidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 512
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 119/354 (33%), Positives = 168/354 (47%), Gaps = 66/354 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NGL +TP MGW+ WN F C +++ ++ T DDCW S R
Sbjct: 20 LNNGLAVTPQMGWDDWNAFGCSLSQNLVLSTANTILKTGLRDLGYHYIILDDCW-SSGRT 78
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S L + F + K L +Q +GTK A Y L ++D
Sbjct: 79 SSNVLIPDANKFPN----GMKYLGDQLHAQGFGFGIYSSAGTKTCAGYPGSLGYETVD-- 132
Query: 121 IESNIAFEQGIDYLKYDNCFND-----DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
+N G+DYLKYDNC N+ + RY AM AL +GR I +++C WG
Sbjct: 133 --ANTFASWGVDYLKYDNCNNNGQSGSQAASSARYNAMEKALAASGRNILYAICNWGQDS 190
Query: 176 PALWGSNIRNSWRTTDDISDSW----------------ARMLTIADMNEVYADHAKPGGW 219
P +WG ++ NSWR T DISD++ + I + ++ GGW
Sbjct: 191 PWIWGPSVGNSWRITGDISDNFNTQNSACPVPNSGGYDCSVTQIMSKQATISQYSAKGGW 250
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
ND DMLEVGNGGM +EYV HFS+WA +K+PL++G D+ L A I+ N +IAVNQ
Sbjct: 251 NDLDMLEVGNGGMSDSEYVAHFSVWAAAKSPLIMGNDMSKLIASDYSILANPAIIAVNQD 310
Query: 279 ----------AINDQQIWAGPL------SGNRIVVLLESRKTFSSMMTAHWDDI 316
N+ Q+W+GPL S N VV+L + S+ ++ D+
Sbjct: 311 PLGVAATYRWTRNNVQLWSGPLVSTTGSSVNDQVVVLYNNGGSSTTVSVALSDV 364
>gi|42556012|gb|AAS19696.1| Aga27A [Cellvibrio mixtus]
Length = 405
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 180/384 (46%), Gaps = 62/384 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TP +GWNSWN F C +NEK+IRE D+ ++ ++ +
Sbjct: 30 LAKTPQLGWNSWNTFACDVNEKMIREM--------ADAMVASGMKDAGYEYINIDDCWHG 81
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIESNIA 126
QG + +K +FPSG KALADYVH + S+ S + + +
Sbjct: 82 ERDKQGFIQVDKKSFPSGMKALADYVHSKGLKLGIYSDAGNTTCAGRPGSRGHEYQDAVT 141
Query: 127 FEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+ GIDY+KYD C D P Y M DA+ KAGRP+ FS+CEWGD +P W +++ +
Sbjct: 142 YASWGIDYVKYDWCDTKDINPKAAYATMRDAIHKAGRPMLFSICEWGDNKPWEWATDVGH 201
Query: 186 SWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
SWRTT DI W +L I D +A PG WND DM+EVGN GM
Sbjct: 202 SWRTTGDIYPCWNCEHNHGSWSSWGVLPILDKQAGLRKYAGPGHWNDMDMMEVGN-GMNE 260
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQ 283
+E HFS+WA+ +PL+ G D+ ++ T I+ N++++A+NQ D
Sbjct: 261 DEDRAHFSLWAMMASPLIAGNDLRKMSEATKKILTNKDMLAINQDKLGIQAMKWIDEGDI 320
Query: 284 QIWAGPLSGNRIVVLLESRKTFSSMMTAHW------DDIG---IKSNNTLFEHRTLETKF 334
+I+ PL VL +R + + W DDI I + F R +
Sbjct: 321 EIYVKPLEKGDYAVLFLNRADTTVNYSLDWGFHYMKDDISKHEIFFDKKKFNWRDIWNGG 380
Query: 335 QG----KLTAKVDGHSCKMYELTP 354
+G KL + HS + LTP
Sbjct: 381 KGSTAEKLNLTMAAHSVAVLRLTP 404
>gi|119487192|ref|XP_001262451.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
gi|298351521|sp|A1D9S3.1|AGALD_NEOFI RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
gi|119410608|gb|EAW20554.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
Length = 648
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/389 (32%), Positives = 179/389 (46%), Gaps = 67/389 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
+ NGL TP MGWNS+N++ C NE I+R S+ + + L + + +T
Sbjct: 23 IDNGLAKTPQMGWNSYNYYSCSPNEAIVR--------SNAKALVDLGLADLGYRYVTTDC 74
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------ISNSLDNRIE 122
A LPN G L N+T FPSG A+ Y+H L + S +++
Sbjct: 75 GWSVADRLPN---GTLTWNETLFPSGFPAMGKYLHELGLLFGVYGDSGTKLCGSPPDQVG 131
Query: 123 SNIAFEQ--------GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S EQ G D LKYDNC++D T P RY MS AL + GR
Sbjct: 132 SLYHEEQDAKTFAEWGADSLKYDNCYSDAATNYPNVNYEPSTSPRPRYEIMSSALARVGR 191
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
PI F +CEWG PALW + +SWR +DI W + + D A PG W D
Sbjct: 192 PILFQICEWGIDFPALWAPALGSSWRIGNDIIPEWRSIFRTLNQAVPNTDFAGPGQWADL 251
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGNG E HFS+WAI K+PL +G D ++ ++ ++ ++VI NQ
Sbjct: 252 DMLYVGNGVFSLPEEQTHFSLWAILKSPLTIGAALKDDDTSINQASLEVLKQKDVIGFNQ 311
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF 325
+ ++W+GPLSGNR VV + + + S +T D+G++
Sbjct: 312 DALGVSASLKRRWSDEGYEVWSGPLSGNRTVVAVINWRDESRDLTLDLPDVGLQYAQVAR 371
Query: 326 E--HRTLETKFQGKLTAKVDGHSCKMYEL 352
+T+ + TA V GH + EL
Sbjct: 372 NIWGKTVVRDVRTSYTAGVAGHGTMLLEL 400
>gi|116621935|ref|YP_824091.1| alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
gi|116225097|gb|ABJ83806.1| Alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
Length = 495
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 176/355 (49%), Gaps = 53/355 (14%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIA 79
+NGL TPPMGWNSWN F ++++K +R+ D S RD+ ++
Sbjct: 137 ANGLAKTPPMGWNSWNKFAGRVDDKSVRDMADAIVTSGMRDAGYVYVNIDDTWEG----- 191
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDNR---IESNIAFEQ- 129
+GN+ +NK FP KALADYVH + S+ N E + E+
Sbjct: 192 ----ERDAKGNITSNKK-FPD-MKALADYVHGKGLKLGIYSSPGPNTCAGYEGSYGHEEQ 245
Query: 130 --------GIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYD C + DD + Y+ M DAL+ +GRPI +S+C++G
Sbjct: 246 DAKTWAAWGIDYLKYDWCGARNLYTDDEMQAV-YQKMGDALRASGRPILYSLCQYGRADV 304
Query: 177 ALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
WG + N+WRTT DI D+W M I A A PG WNDPDMLE+GNGGM +
Sbjct: 305 WKWGPEVGGNAWRTTGDIKDTWDSMTNIGFKQNELAQWAAPGHWNDPDMLEIGNGGMNDD 364
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQ 284
EY H S+W++ APLL G D+ + + I+ N +VIA++Q DQ+
Sbjct: 365 EYRTHMSLWSMLAAPLLAGNDLRTASPAILAILTNRDVIAIDQDKDGKQGRRAWKSGDQE 424
Query: 285 IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS---NNTLFEHRTLETKFQG 336
IW L+G V + +R + + W ++ + + L+ H+ E K QG
Sbjct: 425 IWVRDLAGGDRAVAIFNRGAEKADVKMKWSELEVNAPAQAKNLWSHQ--EEKLQG 477
>gi|388256581|ref|ZP_10133762.1| Aga27A [Cellvibrio sp. BR]
gi|387940281|gb|EIK46831.1| Aga27A [Cellvibrio sp. BR]
Length = 405
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 180/384 (46%), Gaps = 62/384 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TP +GWNSWN F C +NEK+IRE D+ ++ ++ +
Sbjct: 30 LAKTPQLGWNSWNTFACDVNEKMIREM--------ADAMVASGMKDAGYEYINIDDCWHG 81
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIESNIA 126
QG + +K +FPSG KALADYVH + S+ S + + +
Sbjct: 82 ERDKQGFIQVDKKSFPSGMKALADYVHSKGLKLGIYSDAGNTTCAGRPGSRGHEYQDAVT 141
Query: 127 FEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+ GIDY+KYD C D P Y M DA+ KAGRP+ FS+CEWGD +P W +++ +
Sbjct: 142 YASWGIDYVKYDWCDTKDINPKAAYATMRDAIHKAGRPMLFSICEWGDNKPWEWATDVGH 201
Query: 186 SWRTTDDISDSWA-----------RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
SWRTT DI W +L I D +A PG WND DM+EVGN GM
Sbjct: 202 SWRTTGDIYPCWNCEHNHGSWSSWGVLPILDKQAGLRKYAGPGHWNDMDMMEVGN-GMNE 260
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQ 283
+E HFS+WA+ +PL+ G D+ ++ T I+ N++++A+NQ D
Sbjct: 261 DEDRAHFSLWAMMASPLIAGNDLRKMSEATKKILTNKDMLAINQDKLGIQAMKWIDEGDI 320
Query: 284 QIWAGPLSGNRIVVLLESRKTFSSMMTAHW------DDIG---IKSNNTLFEHRTLETKF 334
+I+ PL VL +R + + W DDI I + F R +
Sbjct: 321 EIYVKPLEKGDYAVLFLNRADTTVNYSLDWGFHYMKDDISKHEIFFDKKKFNWRDIWNGG 380
Query: 335 QG----KLTAKVDGHSCKMYELTP 354
+G KL + HS + LTP
Sbjct: 381 KGSTAEKLNLTMAAHSVVVLRLTP 404
>gi|146415790|ref|XP_001483865.1| hypothetical protein PGUG_04594 [Meyerozyma guilliermondii ATCC
6260]
Length = 428
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 176/354 (49%), Gaps = 56/354 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
NGLG P MGWNSWN F C INE +I +T + + ++ L + + +
Sbjct: 17 FDNGLGKKPQMGWNSWNKFACNINETVILQT------AEKMKELGLLEYGYEYIVMDDCY 70
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRI 121
A + +V + FP+G ++L+ +H L SL +I
Sbjct: 71 ALKERDPISHKMVEDAAKFPNGIRSLSRRIHELGFKFGMYSSAGKYTCAGYPGSLHYEKI 130
Query: 122 ES-NIAFEQGIDYLKYDNCFNDDTR--PTI---RYRAMSDALKKAGRPIFFSMCEWGDMR 175
++ + IDYLKYDNCFN+ P I RY AMS AL + GRP+F+S+C+WG+ +
Sbjct: 131 DAETFVNDWEIDYLKYDNCFNEGNSGTPQISYQRYEAMSKALLQTGRPVFYSLCQWGEDQ 190
Query: 176 PALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHA 214
WGS + NSWR T DI DS+ R M I + A
Sbjct: 191 VWDWGSTVANSWRITGDIYDSFDRYDDRCPCQTYDCRAVQGGMCSMTNILEKAVPLGQKA 250
Query: 215 KPG-GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
P GWND D LEVGNGGM +EY HF++WAI K+PL+LG DV +++ E I+ N+ +
Sbjct: 251 GPFLGWNDLDSLEVGNGGMSTDEYKAHFTLWAILKSPLVLGNDVTDMSKEDFNIVTNKAI 310
Query: 274 IAVNQAIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF 325
IA+NQ ++ ++W P+SG + + K + +T + G K NNT+
Sbjct: 311 IAINQDDSNPAYRVWKKPVSGGHLHLFTNILKDGTFAVTLF--NSGNKVNNTVL 362
>gi|408528774|emb|CCK26948.1| secreted alpha-galactosidase [Streptomyces davawensis JCM 4913]
Length = 563
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 126/391 (32%), Positives = 180/391 (46%), Gaps = 90/391 (23%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQISLSC 67
TP MGWNSWNHF C + ++ I++ DDCW +RD+
Sbjct: 50 TPLMGWNSWNHFGCNVTQQNIKDAADQIVAQGLDKLGYRYVNVDDCWMARTRDAG----- 104
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----------------- 110
G+L + F G +ALADYVH
Sbjct: 105 ---------------------GHLQPDPVRFKDGIRALADYVHSKGLKFGIYQSAGTTTC 143
Query: 111 ILISNSLDNRI-ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+ SL + ++N G+D LKYDNC + RY+AM DALK +GR I FS+C
Sbjct: 144 AGLPGSLGHETTDANDFAAWGVDLLKYDNCPDGQGSIEQRYKAMGDALKASGRAIVFSLC 203
Query: 170 EWGDMRP-ALWGSNIRNS-WRTTDDISDSW---------ARMLTIADMNEVYADHAKPGG 218
WG P A +GS S WRTT DI DSW ++ I D + + P
Sbjct: 204 SWGQGSPWAGFGSVSGGSQWRTTYDIRDSWYDNKWTSSPMGVIDILDQQKGLEVFSGPSK 263
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WND DMLEVGNG ++ +EY HFS+WA+ +PL+LG D+ ++ T II N +VIAVNQ
Sbjct: 264 WNDMDMLEVGNGALRDDEYRSHFSLWALLNSPLILGNDLTRMSDATKAIIKNADVIAVNQ 323
Query: 279 ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL-- 324
N +Q+WA P+S + V+L +R ++ +T +IG+ +++
Sbjct: 324 DWGGSQGRLMRDLGNGRQVWAKPMSDGSVAVVLLNRSGAAAAITTSAAEIGLGGSSSYAL 383
Query: 325 --FEHRTLETKFQGKLTAKVDGHSCKMYELT 353
T T G ++ +V H MY ++
Sbjct: 384 KDLWTGTSSTSANGTISGQVPSHGVAMYRVS 414
>gi|399071246|ref|ZP_10749945.1| alpha-galactosidase [Caulobacter sp. AP07]
gi|398043616|gb|EJL36507.1| alpha-galactosidase [Caulobacter sp. AP07]
Length = 503
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 128/373 (34%), Positives = 178/373 (47%), Gaps = 53/373 (14%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQ 77
L +GL TPPMGW+SWN F +I++K +R D + RD+ + D Q
Sbjct: 142 LAPDGLAKTPPMGWSSWNRFAEKIDDKTVRAMADALVATGLRDA----GYIYVNIDDGWQ 197
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNR 120
P G + N+ FP KALADYVH S +
Sbjct: 198 GTRGP-----DGAIRPNEK-FPD-MKALADYVHARGLKFGLYSAPGPKTCAGYEGSYGHV 250
Query: 121 IESNIAFEQ-GIDYLKYDNC----FNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDM 174
+ F G+DYLKYD C F DD R Y M AL+ GR I +S+CE+G
Sbjct: 251 QQDAQTFADWGVDYLKYDLCSGEWFYDDADKVQRTYYEMGAALRATGRSIVYSLCEYGRF 310
Query: 175 RPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
WG N+ WRTT DI+D + M I A+HA PGGWNDPDMLE+GNGGM
Sbjct: 311 EVGAWGRNVGGHLWRTTGDITDDYPTMARIGFDKNGRAEHAGPGGWNDPDMLEIGNGGMI 370
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ--------- 283
++EY H S+WA+S APL++G D+ + E + ++ N EVIAV+Q A+ Q
Sbjct: 371 HDEYATHMSLWAMSAAPLMMGHDLRQSSPEVLAMLENREVIAVDQDALGVQGKAVRREGS 430
Query: 284 -QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS---NNTLFEHRTLETKFQGKLT 339
++W+ PL+ + V L +R ++ M D+ + + L+ L FQ T
Sbjct: 431 TEVWSKPLADGGVAVALFNRGEAATPMMLQAGDVSMSALTGVRDLWRGAELAPAFQ---T 487
Query: 340 AKVDGHSCKMYEL 352
+V H M +
Sbjct: 488 FQVPAHGVVMLRV 500
>gi|358379709|gb|EHK17389.1| glycoside hydrolase family 27 protein [Trichoderma virens Gv29-8]
Length = 496
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 169/367 (46%), Gaps = 114/367 (31%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINE-------KIIRE------------TDDCWGESSRD 60
+ NGL TP MGWN+WN F C ++E K+I E DDCW + RD
Sbjct: 19 IDNGLARTPQMGWNNWNSFGCDVSEHLLLDHAKLITEYGLQDLGYEYVVLDDCWSDG-RD 77
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+ G L+A+K FP G A+AD +H
Sbjct: 78 SK--------------------------GKLIADKKKFPRGMAAVADDLHSQGFLFGMYS 111
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
+ SLD+ ++ +F + G+DYLKYDNC++ P I R+ M++AL
Sbjct: 112 SAGELTCARYAGSLDHEMDDAQSFADWGVDYLKYDNCYHMGRFGTPLISFERFNKMAEAL 171
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K G+ IF+S+C WG+ W ++I NSWR DI DS+AR
Sbjct: 172 KATGKNIFYSLCNWGEDYSYSWAASIGNSWRIFGDIYDSFARPDDLCSCNDPANPACIAP 231
Query: 200 -----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+L I + Y D PGGWND DMLEVG+GG EY HF+IWA KAPLLLG
Sbjct: 232 GTHCSVLAIINRVVPYIDRGLPGGWNDLDMLEVGHGGQTEEEYRAHFTIWAALKAPLLLG 291
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPLS-G 292
D+ + + I+ N VIA+NQ + + IW+GPL+ G
Sbjct: 292 TDLRKWSGSDLAIVTNPAVIAINQDPRGRAVQRIRRNFNVPKDEWGVGETHIWSGPLANG 351
Query: 293 NRIVVLL 299
++I+V L
Sbjct: 352 DQILVFL 358
>gi|302847865|ref|XP_002955466.1| hypothetical protein VOLCADRAFT_106804 [Volvox carteri f.
nagariensis]
gi|300259308|gb|EFJ43537.1| hypothetical protein VOLCADRAFT_106804 [Volvox carteri f.
nagariensis]
Length = 800
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 126/380 (33%), Positives = 166/380 (43%), Gaps = 126/380 (33%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+G TP MGWNSWN+F C INE IIR+ DDCW E RD
Sbjct: 30 LDNGVGRTPAMGWNSWNYFRCNINETIIRQVADAIVSSGLKDAGYVYVNIDDCWMEK-RD 88
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISN 115
Q G + + FPSG KALADY+H L + +
Sbjct: 89 PQT-------------------------GRIQPFASKFPSGMKALADYIHGLGLRFGVYS 123
Query: 116 SLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR--YRAMSDALKKA 160
N+ +++ E G+DYLKYD C D + +++ Y M DAL
Sbjct: 124 DTGNKTCEGYPGSWGYEKLDAATYAEWGVDYLKYDYCGMDGVQESVKASYERMRDALAAT 183
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW----ARMLTIADM---------- 206
GRPI FS+C WG +P +WG + NSWRT D+ +W AR L +
Sbjct: 184 GRPILFSLCSWGSGQPWVWGKEVGNSWRTGIDVFAAWDPAQARALKLPSFLQPILGAVRQ 243
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGM----------------KYNEYV------------ 238
+ A+HA PGG+NDPDML VG GM K EY+
Sbjct: 244 TQGLAEHAGPGGFNDPDMLVVGLDGMYPYGIVQECPEHVKDCKPGEYISRDRWGKVGGLT 303
Query: 239 -----VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ--------- 283
HFS W I APL+LG D ++ T+ I+ EV++VNQ A+ Q
Sbjct: 304 QTEQRTHFSYWCIMAAPLILGNDPRVMSKATLQILLAREVLSVNQDALGVQGRPVWTSPG 363
Query: 284 ----QIWAGPLSGNRIVVLL 299
++WA PL+ R +LL
Sbjct: 364 GGALEVWAKPLADGRTALLL 383
>gi|307106372|gb|EFN54618.1| hypothetical protein CHLNCDRAFT_35902 [Chlorella variabilis]
Length = 366
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 172/360 (47%), Gaps = 84/360 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRE------------------TDDCWGESSRDS 61
L NGL P +G+N+WN F+ +I+E +++E DDCW E SR+
Sbjct: 19 LDNGLNAKPALGFNTWNAFYREIHEDLVKEHADLMVSLGLRDAGQYLVLDDCWSERSREE 78
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--------- 112
L A+K FPSG KA+ DY+H
Sbjct: 79 GERLQ--------------------------ASKEKFPSGMKAMGDYIHAKGLKYGIYSD 112
Query: 113 --------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGR 162
SLD+ +++ G+DYLKYDNC R I RY AM DAL GR
Sbjct: 113 AGTLTCAKYPGSLDHEELDAQTFAGWGVDYLKYDNCHVRRDRWVIDRYAAMRDALNATGR 172
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG-GWND 221
PI +S+CEWG M P LW + NSWRTT+DI W ++ D N + A P GWND
Sbjct: 173 PIVYSLCEWGVMEPHLWAPQVGNSWRTTEDIRPWWDSIVKTLDYNVGLSRFAGPHLGWND 232
Query: 222 PDMLEVGNG-GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DM GN G+ + E HF++WA+ K+PL++G D+ + + ++ I+ +EVIA+NQ
Sbjct: 233 LDM---GNDTGLSHAEQRTHFALWALLKSPLMIGHDLRDFSKTSLGILLAKEVIAINQDD 289
Query: 279 ---------AINDQQIWAGPLS-GNRIVVLLESRKTFSSM----MTAHWDDIGIKSNNTL 324
++++AGPL+ G R VVL + T+S +T W +G++ +
Sbjct: 290 LGVAGDLVWRQGTKRVYAGPLAGGGRAVVLANFQTTYSQYPATNITVFWTQVGLQPGQRV 349
>gi|310798838|gb|EFQ33731.1| melibiase [Glomerella graminicola M1.001]
Length = 506
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 172/376 (45%), Gaps = 116/376 (30%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCW-G 55
R L+NGL +TPPMGWN+WN F C ++E +I T DDCW
Sbjct: 17 RVAALNNGLAITPPMGWNNWNAFACDVSEDLILSTSERVISLGLRDLGYDHVVLDDCWQD 76
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI---- 111
E+ RD+Q G + N FP+G K ++D +H
Sbjct: 77 ENGRDAQ--------------------------GKIHPNLAKFPNGLKHVSDQLHAQGLK 110
Query: 112 -------------LISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRA 152
+ SLD+ E F G+D+LKYDNC++ P I R++
Sbjct: 111 YGMYSSAGEMTCARFAGSLDHEKEDAENFASWGVDFLKYDNCYHMGRMGTPKISFDRFKV 170
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------- 199
M+DAL GRPI ++C WG+ WG +I ++WR + DI DS+ R
Sbjct: 171 MADALNATGRPIALNLCNWGEDYVHTWGMSIAHAWRMSGDIYDSFTRPDDLCSCTTAADP 230
Query: 200 ----------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKA 249
+L I + +AD + PGGWND DMLEVG GGM EY HF++WA K+
Sbjct: 231 FCVAPGTHCSVLFILNRVAAFADRSIPGGWNDLDMLEVGQGGMTDEEYKAHFALWAALKS 290
Query: 250 PLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------------------AINDQQIW 286
PL+LG D+ ++ AE + I+ N +IA+NQ + + Q+W
Sbjct: 291 PLMLGNDLRDMPAEALSIVNNPAIIALNQDPHGRSVLRVRRDVGGVLTPDEYGVAETQVW 350
Query: 287 AGPL-SGNRIVVLLES 301
+G L +G++ V+ L +
Sbjct: 351 SGRLENGDQAVIFLNA 366
>gi|298351559|sp|Q0CVX4.2|AGALD_ASPTN RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
Length = 655
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 131/402 (32%), Positives = 194/402 (48%), Gaps = 63/402 (15%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDC---WGES 57
L + ++ AAD S R L +GL TP MGWN++NH++C +E+I R +G +
Sbjct: 9 LLLPAAFAADH--SPLRARLQDGLARTPQMGWNTYNHYNCYPDEEIFRSNAKALVDFGLA 66
Query: 58 SRDSQI-SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILI 113
+ ++ C + + + L G+L N T FPSG A+ADY+H +L
Sbjct: 67 DLGYRYATIDCGWTLTERL-----------ANGSLTWNATRFPSGFPAIADYLHDLGLLF 115
Query: 114 SNSLDNRIE----SNIAFEQ--------GIDYLKYDNCFNDDT--RPTI----------R 149
D I+ S+ EQ G D LKYDNCF++ + P + R
Sbjct: 116 GVYGDAGIKLCGPSDEHEEQDAQTFAAWGADSLKYDNCFSEASLGYPNVEYAPSSPLKPR 175
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEV 209
Y MS+AL+K RPI F +CEWG PALW + +SWR +DI +W + +
Sbjct: 176 YEVMSNALQKLDRPILFQICEWGIDFPALWAPALGHSWRIGNDIIPAWRSVFRTLNQAVP 235
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT----AETM 265
D A PG W D DML VGNG E HFS+WAI K+PL++G + + T +E++
Sbjct: 236 QTDFAGPGQWPDLDMLMVGNGVYSVPEEETHFSLWAILKSPLIIGSALKDATTEINSESL 295
Query: 266 PIIGNEEVIAVNQ-----------AINDQ--QIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
I+ + VI NQ DQ ++W+GPLS R V + + + + +T
Sbjct: 296 RILKQKAVIGYNQDKLGVSASLRRRWTDQGYEVWSGPLSNGRTVAAVINWRGEARDLTLD 355
Query: 313 WDDIGIKSNNTL--FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
DIG++S + + Q TA+V+GH + EL
Sbjct: 356 LPDIGLQSAGLVKNIWAGSTSRNVQTSYTARVEGHGTMLLEL 397
>gi|414878417|tpg|DAA55548.1| TPA: hypothetical protein ZEAMMB73_350257 [Zea mays]
Length = 669
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/217 (44%), Positives = 129/217 (59%), Gaps = 18/217 (8%)
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEV 209
++ + LKK P + + G PALW + NSW TTDDI+ +W M IAD N
Sbjct: 327 HQEQAGKLKKTTNPRV--LWDGGQYDPALWAGKVGNSWHTTDDITVTWKSMTYIADKNNK 384
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
+A +A PGGW+DPDMLEVGNGGM EY+ HFSIWA+ KAPLL+GCDV N+T+ETM I+
Sbjct: 385 WASYAGPGGWDDPDMLEVGNGGMTLAEYLSHFSIWALMKAPLLIGCDVRNMTSETMEILS 444
Query: 270 NEEVIAVNQAINDQQI-----------WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
N+EVI + ++I WAGPLSGNR+VV L +R + ++ +T +G+
Sbjct: 445 NKEVIQNPLGVQGRKILGQGKYGCREVWAGPLSGNRLVVALWNRCSDTANVTMKLPAVGL 504
Query: 319 K-----SNNTLFEHRTLETKFQGKLTAKVDGHSCKMY 350
S L++H TL G A+VD H KMY
Sbjct: 505 DGYAAYSVRDLWKHETLSENVVGTFGAQVDVHDTKMY 541
>gi|390601316|gb|EIN10710.1| glycoside hydrolase [Punctularia strigosozonata HHB-11173 SS5]
Length = 380
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 159/352 (45%), Gaps = 61/352 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TP MGWN +N F C NE +S L + D Q
Sbjct: 21 LDNGLERTPAMGWNPYNAFLCDTNEAQYHAAAQALAQSGLPK---LGYKYFNIDCGWQGT 77
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF------------ 127
+ G N T PSG ALA YVH L LD + S+ +
Sbjct: 78 NR----TADGVFTWNTTRIPSGIPALASYVHDL---GLDFGVYSDAGYFSCDFVGGTAGW 130
Query: 128 ------EQ---------GIDYLKYDNCFND------DTRPTI----RYRAMSDALKKAGR 162
EQ G DYLKYDNC+ D P I + AM DAL GR
Sbjct: 131 LGSLNHEQSDADTFASWGADYLKYDNCYAVSPTDFVDMNPPISLEPHFTAMRDALNSTGR 190
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD--SWARMLTIADMNEVYADHAKPGGWN 220
PI FS+CEWG PA W S + NSWR ++DI SW ++ I + A PG +N
Sbjct: 191 PIVFSVCEWGVQDPARWASEVGNSWRVSNDIGPPPSWDNLVRIINQVVPITQFASPGAFN 250
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
D D+LEVGN G+ E HF+ WA +K+PL + D+ T ET+ I+ N +IA+NQ +
Sbjct: 251 DLDLLEVGNQGLTTAEQETHFAFWAAAKSPLFISTDLTTATKETLSILSNPGIIALNQDV 310
Query: 281 ------------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
ND IWAGPL+ N V++ + + S +T + D+G +
Sbjct: 311 LGKSIGFKRRYTNDSDIWAGPLADNSTAVVIINWQNVSRPVTFYLSDVGFSA 362
>gi|429853041|gb|ELA28142.1| alpha-galactosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 502
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 171/358 (47%), Gaps = 76/358 (21%)
Query: 14 SDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFD 73
+ R L NGL TPPMGWN+WN F C ++E ++ T S R + L +
Sbjct: 11 ASRVAALMNGLADTPPMGWNNWNAFACDVSEHLLLST------SERIVSLGLRDLGYNHV 64
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNS 116
L G L N FP+G ++D++H + S
Sbjct: 65 VLDDCWQDENGRDANGKLQPNLEKFPNGLNHVSDHIHKHGMKYGMYSTAGEMTCARFAGS 124
Query: 117 LDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCE 170
LD+ + +F G+D+LKYDNCF+ P I R++ MSDAL++ GRPI ++C
Sbjct: 125 LDHEKDDAESFASWGVDFLKYDNCFHMGRMGTPEISFNRFKKMSDALQETGRPISLNLCN 184
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMN 207
WG+ WG +I NSWR + DI DS+ R +L I +
Sbjct: 185 WGEDYVHTWGMSIANSWRMSGDIYDSFTRPDDLCSCNTAADPFCIAPGTHCSVLFIINRV 244
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
+AD + PGGWND DMLEVG GGM EY HF++WA K+PL+LG D+ ++ AE + I
Sbjct: 245 AAFADKSIPGGWNDLDMLEVGQGGMTDEEYKAHFALWAALKSPLMLGNDLRDMPAEALSI 304
Query: 268 IGNEEVIAVNQ-----------------------AINDQQIWAGPL-SGNRIVVLLES 301
+ N +IA++Q + + Q+W+G L +G+++V+ L +
Sbjct: 305 VNNPAIIAISQDPHGRSALRVRRDVGGGLVPDEFGVAEAQVWSGRLENGDQVVIFLNA 362
>gi|344231839|gb|EGV63718.1| hypothetical protein CANTEDRAFT_105318 [Candida tenuis ATCC 10573]
Length = 446
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 161/327 (49%), Gaps = 53/327 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
NG+G TP MGWNSWN + C INE +I + + + + + D Q+
Sbjct: 19 FDNGVGHTPQMGWNSWNKYGCNINESVIISAAETMNQLGL---LQYGYEYIVMDDCYQLK 75
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNSLDNRIESNIAFEQ- 129
+ + G +V + FP+G K L+D VH L R ++ +E+
Sbjct: 76 ER---DSETGKIVVDPQRFPNGIKYLSDRVHQLGFKFGMYSSAGKFTCGRYPGSLGYEEL 132
Query: 130 --------GIDYLKYDNCFND-----DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
G+DYLKYDNCFN+ + + RY MS AL GR IF+S+C+WG+
Sbjct: 133 DAQTFADWGVDYLKYDNCFNEGMSGNEKISSDRYLKMSQALNATGRTIFYSLCQWGEDNV 192
Query: 177 ALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAK 215
WG+ + NSWR T DI DS+ R M+ I + A +
Sbjct: 193 WNWGTTLSNSWRVTGDIYDSFNRYDDRCPCESFDCAGIQGNSCSMVNIFEKAIPLAQKSG 252
Query: 216 P-GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
P GWND D LEVGNGGM NEY HF++WAI K+PL+LG D+ ++ II N+++I
Sbjct: 253 PFEGWNDLDSLEVGNGGMTTNEYKAHFTLWAILKSPLVLGNDLPRMSQTDFAIITNKDII 312
Query: 275 AVNQAINDQ--QIWAGPLSGNRIVVLL 299
A+NQ + ++W + G + +L+
Sbjct: 313 AINQDFSSPAVRVWKRAVKGGHLSLLV 339
>gi|358398162|gb|EHK47520.1| glycoside hydrolase family 27 protein [Trichoderma atroviride IMI
206040]
Length = 496
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 168/367 (45%), Gaps = 114/367 (31%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINE-------KIIRE------------TDDCWGESSRD 60
+ NGL TP MGWN+WN F C ++E K+I E DDCW + RD
Sbjct: 19 IDNGLARTPQMGWNNWNSFGCDVSEHLLLDHAKLISEYGLQDLGYGYVVLDDCWSDG-RD 77
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
+ G LV +K FP G A+AD +H
Sbjct: 78 RK--------------------------GKLVVDKKKFPRGMAAVADDLHAQGFLFGMYS 111
Query: 112 --------LISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
+ SLD+ + +F + G+DYLKYDNC++ P I R+ AM++AL
Sbjct: 112 SAGELTCARYAGSLDHEKDDAQSFADWGVDYLKYDNCYHMGRFGTPLISFERFNAMAEAL 171
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K G+ IF+S+C WG+ W ++I NSWR DI DS+AR
Sbjct: 172 KATGKNIFYSLCNWGEDYSYSWAASIGNSWRIFGDIYDSFARPDDLCSCNDPANPACIAP 231
Query: 200 -----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+L I + Y D PGGWND DMLEVG+GGM EY HF++WA KAPLLLG
Sbjct: 232 GTHCSVLAIINRVVPYIDRGLPGGWNDLDMLEVGHGGMTEEEYKAHFTMWAALKAPLLLG 291
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPLS-G 292
D+ + + I+ N VIA+NQ + + IW+GPL+ G
Sbjct: 292 TDLRKWSGSDLSIVTNPAVIAINQDPRGRAVQRVRRNFNVPKDEWGVGETHIWSGPLANG 351
Query: 293 NRIVVLL 299
++I++ L
Sbjct: 352 DQILIFL 358
>gi|347735140|ref|ZP_08868080.1| alpha-galactosidase [Azospirillum amazonense Y2]
gi|346921721|gb|EGY02326.1| alpha-galactosidase [Azospirillum amazonense Y2]
Length = 501
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 179/359 (49%), Gaps = 54/359 (15%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLT 76
++ NGL TPPMGW+SWN F I++K +R+ D S RD+ + D
Sbjct: 137 HVPGNGLARTPPMGWSSWNKFAEAIDDKTVRDIADALVRSGLRDA----GYVYVNIDDGW 192
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIA----- 126
Q P G L N FP KALADYVH + I +S + + A
Sbjct: 193 QGERGP-----DGVLHPN-AKFPD-MKALADYVHARGLKLGIYSSPGPKSCAGYAGSYGH 245
Query: 127 --------FEQGIDYLKYDNCFNDD---TRPTIR--YRAMSDALKKAG----RPIFFSMC 169
E G+DYLKYD C + T T++ Y+ M AL AG RP+ +S+C
Sbjct: 246 VEQDARTWAEWGVDYLKYDLCSGEGFYHTAETVQAVYQQMGAALAAAGAATGRPVVYSLC 305
Query: 170 EWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
E+G WG ++ N WRTT DI DS+A M I H PGGWNDPDMLEVG
Sbjct: 306 EYGRFDVGAWGRDVGGNLWRTTGDIEDSYASMAAIGFDKNGVPHHTGPGGWNDPDMLEVG 365
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
NGGM EY H S+WA+ APLL+G D ++T +T+ ++GN EVIA++Q +Q
Sbjct: 366 NGGMTTEEYRTHISLWALMAAPLLMGNDARSMTKDTLALLGNAEVIAIDQDPLGRQGLPV 425
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI---KSNNTLFEHRTLETK 333
+W PL+ + V L +R +MTA W +G+ S L+ HR++ +
Sbjct: 426 RKRDGMEVWTRPLADGSVAVGLFNRTDKPVVMTADWPALGLGDHPSVRDLWAHRSVAPR 484
>gi|414888112|tpg|DAA64126.1| TPA: hypothetical protein ZEAMMB73_968166, partial [Zea mays]
Length = 294
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 121/222 (54%), Gaps = 25/222 (11%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL LTP MGWNSWN F C INE +IRET D + +L +F D
Sbjct: 51 RLQLNNGLALTPQMGWNSWNFFACNINETVIRETADALVSTGL---AALGYNFVNIDDCW 107
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ +Q L+ + TFPSG KALADYVH + SL +
Sbjct: 108 SYVKRGKQDQ----LLPDPKTFPSGIKALADYVHGKGLKLGIYSDAGKFTCQVRPGSLGH 163
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ F GIDYLKYDNC+N +P RY M DAL GR IF+S+CEWG PAL
Sbjct: 164 ENDDAAIFASWGIDYLKYDNCYNLGIKPQKRYPPMRDALNSTGRQIFYSLCEWGQYDPAL 223
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
W + NSWRTTDDI+D+W M IAD N +A +A PGGWN
Sbjct: 224 WAGKVGNSWRTTDDITDTWKSMTDIADKNNKWASYAGPGGWN 265
>gi|254443372|ref|ZP_05056848.1| Melibiase subfamily [Verrucomicrobiae bacterium DG1235]
gi|198257680|gb|EDY81988.1| Melibiase subfamily [Verrucomicrobiae bacterium DG1235]
Length = 416
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 125/405 (30%), Positives = 179/405 (44%), Gaps = 100/405 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
GL LTPPMGWNSWN F INE+++ +T DDCW RD
Sbjct: 38 EGLALTPPMGWNSWNTFESAINEQLVLDTAQSMIDSGMRDAGYTYIVLDDCWALRDRDP- 96
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSL 117
+GNLV + FP+G KALAD +H I I
Sbjct: 97 -------------------------EGNLVPDPEKFPNGIKALADQLHEMGFKIGIYGDA 131
Query: 118 DNRIESNIAFEQ-------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPI 164
+ Q GIDYLKYD C Y M DAL+ AGRPI
Sbjct: 132 GRTTCAGYPGSQGHEYQDARTWASWGIDYLKYDWCATGTRDAKEAYTTMRDALRTAGRPI 191
Query: 165 FFSMCEWGDMRPALWGSNIRNSWRTTDDISD------SWARMLTIADMNEVYA------- 211
FS+CEWGD P LW ++ + WR + DI D W+R + ++ +A
Sbjct: 192 TFSICEWGDNEPWLWAQDVGHLWRISGDIYDCWDCEQEWSRGFKVI-LDRYHALKPSVVG 250
Query: 212 -----DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
++ PGGWND DMLEVGN G+ E HF++WA+ +PL+ G DV +++ E +
Sbjct: 251 RDGLGQYSGPGGWNDADMLEVGNPGLTLAESRSHFTLWAMITSPLMAGNDVRSMSPEIID 310
Query: 267 IIGNEEVIAVNQ------------AINDQQIWAGPLSGN--RIVVLLESRKTFSSMMTAH 312
++ N++VIA+NQ + W PL G + +L S +T ++ + H
Sbjct: 311 VLTNKDVIAINQDPDGVSAWRFGIVPGKYETWIKPLQGGDWAVCILNTSEETQTAEIQWH 370
Query: 313 WDDIGIKSN---NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
+ + + L+ ++L T + L AK+ H + L+P
Sbjct: 371 RMERALSGEFEIHDLWADKSLSTTHK-DLKAKIASHDVLLMRLSP 414
>gi|336417196|ref|ZP_08597523.1| hypothetical protein HMPREF1017_04631 [Bacteroides ovatus
3_8_47FAA]
gi|423297814|ref|ZP_17275874.1| hypothetical protein HMPREF1070_04539 [Bacteroides ovatus
CL03T12C18]
gi|335936516|gb|EGM98442.1| hypothetical protein HMPREF1017_04631 [Bacteroides ovatus
3_8_47FAA]
gi|392664451|gb|EIY57989.1| hypothetical protein HMPREF1070_04539 [Bacteroides ovatus
CL03T12C18]
Length = 394
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 129/419 (30%), Positives = 186/419 (44%), Gaps = 99/419 (23%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRE----------- 49
LFI S+ A N + +L TPPMGWNSWN F+ I+E+ IRE
Sbjct: 8 LFIFLSTFA--NAQKHKKVLCP----TPPMGWNSWNCFNKNISEEQIREIANLMVSTGLK 61
Query: 50 --------TDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW ++ R+S + + ++ FPSG
Sbjct: 62 DAGYTYLNVDDCW-QTHRESCV---------------------------IQSDSVKFPSG 93
Query: 102 TKALADYVH----------ILISNSLDNR--------IESNIAFEQGIDYLKYDNCFNDD 143
KALADYVH S + R I++ E G+D+LKYD C N+
Sbjct: 94 IKALADYVHSKGLKFGIYSCAGSKTCAGRPGSRGYEYIDAVTYAEWGVDFLKYDWCHNNG 153
Query: 144 TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDI------SDSW 197
Y M DALK GRPI S+CEWG RP WG I WR ++DI + W
Sbjct: 154 ANAREAYFTMCDALKSTGRPIVLSICEWGTNRPWEWGKGIGEMWRVSEDIIATTEGTAYW 213
Query: 198 --ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
A +L I D+ +A PG WND DML+VGNG + Y+E V+HFS+W + +PL++GC
Sbjct: 214 GGASILKIIDIMAPLWPYAGPGHWNDADMLQVGNGDLTYDENVMHFSMWCMLASPLIVGC 273
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ------------IWAGPLSGNRIVVLLESRK 303
D+ N+ + + I+ N+E IA+NQ +Q W LS RI + ++
Sbjct: 274 DLRNIDKKILDILTNKEAIAINQDKLGKQGRMFERSGRYHETWIKELSDGRIALCFLNKD 333
Query: 304 TFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQ--------GKLTAKVDGHSCKMYELTP 354
+ H+ I + ++H + ++ + + KV H + L P
Sbjct: 334 EKEWSLDYHFWGFDIIGIDRTYDHYIVRDVWKHQDIGDTGDRYSFKVPAHGVVLLCLVP 392
>gi|451849867|gb|EMD63170.1| glycoside hydrolase family 27 protein [Cochliobolus sativus ND90Pr]
Length = 511
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 173/353 (49%), Gaps = 77/353 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
++NGL +TPPMGWN+WN F C ++E ++ T S++ + L + + L
Sbjct: 21 VNNGLAITPPMGWNNWNAFGCDVSENLLLTT------SAQIIDLGLRDLGYNYVVLDDCW 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
P +G L FP+G K+++D++H + SLD+ I+
Sbjct: 75 QNPKGRDKEGKLQPALEKFPNGLKSISDHLHDRNLKFGMYSSAGEMTCARFAGSLDHEID 134
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+D LKYD+C++ P++ R++ M+DALK GR I F++C WG+
Sbjct: 135 DAKSFAGWGVDMLKYDSCYHMGRVGTPSVSFNRFKTMADALKATGRNILFNLCNWGEDLV 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-------------------------MLTIADMNEVYA 211
WG +I NSWR T DI DS+ R +L I + +A
Sbjct: 195 HTWGMSISNSWRITGDIYDSFTRPDDLCGCNTVAPGDVNCVAPGTHCSVLFILNKVAPFA 254
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM EY HF++WA K+PL LG D+ ++ A + II N
Sbjct: 255 DRSIPGGWSDLDMLEVGQGGMTDEEYKAHFALWAALKSPLFLGNDLRDMPAPALTIINNP 314
Query: 272 EVIAVNQ----------------------AINDQQIWAGPL-SGNRIVVLLES 301
+IA++Q + + +WAG L +G+ +V+LL +
Sbjct: 315 AIIALSQDPHGRSVTRVRRDTEGVAKDEWGVGETHVWAGHLHNGDEVVILLNA 367
>gi|300783669|ref|YP_003763960.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384146905|ref|YP_005529721.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399535552|ref|YP_006548215.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299793183|gb|ADJ43558.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340525059|gb|AEK40264.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398316322|gb|AFO75269.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 405
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/383 (32%), Positives = 177/383 (46%), Gaps = 64/383 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC-QINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQ 77
L+NGL TPPMGWNSWN C + E ++++ D ++ RD+ + + T+
Sbjct: 38 LANGLAGTPPMGWNSWNQVRCYDLTEDVVKQAADALADTGLRDAGYHYVVVDDCWQAPTR 97
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS-------------------- 114
A G+L + FP G LADYVH +L
Sbjct: 98 AA--------DGSLQPDPKRFPHGIADLADYVHSRGLLFGIYAVPGSRTCAMANDAYPAA 149
Query: 115 --NSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRA----MSDALKKAGRPIFFS 167
SL + + F++ G+DYLKYD C N DT + +A M D L RPI ++
Sbjct: 150 GIGSLGHERQDAETFDRWGVDYLKYDWC-NADTVDGLDRKAAFEKMRDELAALPRPIVYA 208
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA-KPGGWNDPDMLE 226
+ E+G P W + N WRTT D+ +W +L D A H+ PGGWNDPDML+
Sbjct: 209 ISEYGVSSPWTWARPVANLWRTTYDLVPTWDSVLATIDQQAGVAAHSGSPGGWNDPDMLQ 268
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------- 278
VGNG + +E HFS+WA+ APL G D L+ + +GN E IAV+Q
Sbjct: 269 VGNGTLTADEARAHFSVWAVLNAPLFAGTDPAKLSDTDLATLGNAEAIAVDQDFAGSQGR 328
Query: 279 ---AINDQQIWAGPLSGNRIVVLL----ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLE 331
A Q+W PLSG V+L + T S+ + W+ L+ HR
Sbjct: 329 QLDAGPGYQVWGKPLSGGGFAVVLLNTGSTTATVSAAIPGAWN------VRDLWAHRETG 382
Query: 332 TKFQGKLTAKVDGHSCKMYELTP 354
T G +++ + H+ + +LTP
Sbjct: 383 TVV-GTVSSTLRPHAAALLKLTP 404
>gi|51701634|sp|Q03647.1|MEL_SACPS RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=MELx; AltName: Full=Melibiase; Flags: Precursor
gi|171924|gb|AAA34769.1| alpha-galactosidase [Saccharomyces carlsbergensis]
Length = 471
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/366 (33%), Positives = 174/366 (47%), Gaps = 76/366 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L++ +S AA S G + NGLGLTP MGW++WN F C ++E+++ +T D R
Sbjct: 4 LYLFTSFAAVS-GVLGSSPSYNGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RI 56
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+I L + + L + G LVA+K FP+G +AD++H
Sbjct: 57 SEIGLKDLGYTYVILDDCWSSG--RTANGTLVADKEKFPNGMSHVADHLHNNNFLFGMYS 114
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT-----RPTIRYRAMSDAL 157
SL + E F G+DYLKYDNC+N RY+AMSDAL
Sbjct: 115 SAGEYTCAGYPGSLGHEEEDAEFFASNGVDYLKYDNCYNKGQFGAPETSYKRYKAMSDAL 174
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K GRPIF+S+C WG WGS+I NSWR + DI + R
Sbjct: 175 NKTGRPIFYSLCNWGQDLTHYWGSDIANSWRMSGDIYPQFTRPDSRCPCDGDQFDCAYAG 234
Query: 200 ----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + +A GGWND D LEVG G + +E HFS+WA+ K+PL++G
Sbjct: 235 FHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLVIGA 294
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPL-SGN 293
DV +L A + I VIA+NQ + Q W+GPL +G+
Sbjct: 295 DVNHLKASSYSIYSQASVIAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGD 354
Query: 294 RIVVLL 299
+++ LL
Sbjct: 355 QVIALL 360
>gi|385250213|emb|CCG27771.1| alpha-galactosidase [Saccharomyces pastorianus]
Length = 471
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/345 (34%), Positives = 165/345 (47%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S+I L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISEIGLKDLGYTYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESN 124
G LVA+K FP+G +AD++H L SL + E
Sbjct: 78 G--RTANGTLVADKEKFPNGMSHVADHLHNNNFLFGMYSSAGEYTCAGYPGSLGHEEEDA 135
Query: 125 IAF-EQGIDYLKYDNCFNDDT-----RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F G+DYLKYDNC+N RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 EFFASNGVDYLKYDNCYNKGQFGAPETSYKRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS+I NSWR + DI + R ++ I + +A
Sbjct: 196 WGSDIANSWRMSGDIYPQFTRPDSRCPCDGDQFDCAYAGFHCSIMNILNKAAPMGQNAGI 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G DV +L A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLVIGADVNHLKASSYSIYSQASVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLL 299
NQ + Q W+GPL +G++++ LL
Sbjct: 316 NQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGDQVIALL 360
>gi|385250211|emb|CCG27770.1| alpha-galactosidase [Saccharomyces eubayanus]
Length = 471
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 164/345 (47%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S+I L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISEIGLKDLGYTYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESN 124
G LVA+K FP+G +AD++H SL + E
Sbjct: 78 G--RTANGTLVADKEKFPNGMSHVADHLHNNNFLFGMYSSAGEYTCAGYPGSLGHEEEDA 135
Query: 125 IAF-EQGIDYLKYDNCFNDDT-----RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F G+DYLKYDNC+N RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 EFFASNGVDYLKYDNCYNKGQFGAPETSYKRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS+I NSWR + DI + R ++ I + +A
Sbjct: 196 WGSDIANSWRMSGDIYPQFTRPDSRCPCDGDQFDCAYAGFHCSIMNILNKAAPMGQNAGI 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G DV +L A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLVIGADVNHLKASSYSIYSQASVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLL 299
NQ + Q W+GPL +G++++ LL
Sbjct: 316 NQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGDQVIALL 360
>gi|332687426|emb|CBY84985.1| alpha-galactosidase [Saccharomyces bayanus]
gi|332687432|emb|CBY84988.1| alpha-galactosidase [Saccharomyces uvarum]
gi|332687434|emb|CBY84989.1| alpha-galactosidase [Saccharomyces uvarum]
gi|334084824|emb|CCA89532.1| alpha-galactosidase [Saccharomyces uvarum]
gi|385250225|emb|CCG27777.1| alpha-galactosidase [Saccharomyces uvarum]
Length = 471
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/377 (33%), Positives = 177/377 (46%), Gaps = 76/377 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L++ SS AA S G + NGLGLTP MGW++WN F C ++E+++ +T D R
Sbjct: 4 LYLFSSFAAVS-GVLGSSPSYNGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RI 56
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS--- 114
S I L + + L + G LVA+K FP+G +AD++H L
Sbjct: 57 SDIGLKDLGYTYVILDDCWSSG--RSSNGTLVADKEKFPNGMSHVADHLHNNSFLFGMYS 114
Query: 115 -----------NSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
SL + + F G+DYLKYDNC+N P RY+AMSDAL
Sbjct: 115 SAGEYTCAGYPGSLGHEEDDADFFASNGVDYLKYDNCYNKGQFGAPETSYNRYKAMSDAL 174
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K GRPIF+S+C WG WGS+I NSWR + DI + R
Sbjct: 175 NKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSGDIYPEFTRPDSRCPCDGDQFDCAYAG 234
Query: 200 ----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + +A GGWND D LEVG G + +E HFS+WA+ K+PL++G
Sbjct: 235 FHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGA 294
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPL-SGN 293
DV L A + I +IA+NQ + Q W+GPL +G+
Sbjct: 295 DVNQLKASSYSIYSQASIIAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGD 354
Query: 294 RIVVLLESRKTFSSMMT 310
+++ LL SM T
Sbjct: 355 QVIALLNGGNKARSMNT 371
>gi|406864931|gb|EKD17974.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Marssonina
brunnea f. sp. 'multigermtubi' MB_m1]
Length = 504
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 109/328 (33%), Positives = 151/328 (46%), Gaps = 90/328 (27%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R + ++NGL +TP MGWN+WN C ++E ++ +T DDCW
Sbjct: 18 RASAVNNGLAVTPQMGWNNWNALGCDVSEDLLVQTANLIVDYGLKDLGYHYVVLDDCW-S 76
Query: 57 SSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---- 112
S R++ + S L+ + FP G A+AD +H L
Sbjct: 77 SGRNASTNNS------------------------LIPDAKKFPRGMAAVADDMHALGLGF 112
Query: 113 -----ISNSLDNRIESNIAFEQ---------GIDYLKYDNCFN-----DDTRPTIRYRAM 153
R E ++ +E+ G+DYLKYDNC+N D RY+ M
Sbjct: 113 GMYSSAGRYTCARYEGSLGYEEVDAQTWADWGVDYLKYDNCYNEGQAGDQLISESRYQKM 172
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW---------------- 197
+DAL GRPI +S+C WG+ P WGS I NSWR + DI D W
Sbjct: 173 ADALNATGRPILYSLCNWGEDYPWNWGSTIANSWRISGDIFDYWDTYDPRCPCEGADAWN 232
Query: 198 -------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAP 250
+ I + A+PGGWND DMLEVGNG M EYV HFS+W+ K+P
Sbjct: 233 CGLPGFHCSITNIMNKVAFIVSKAEPGGWNDLDMLEVGNGAMTDAEYVSHFSMWSAVKSP 292
Query: 251 LLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
L++G D+ + + + I+ N VIAVNQ
Sbjct: 293 LIMGNDIREIAPQDLSILSNAAVIAVNQ 320
>gi|213406708|ref|XP_002174125.1| alpha-galactosidase [Schizosaccharomyces japonicus yFS275]
gi|212002172|gb|EEB07832.1| alpha-galactosidase [Schizosaccharomyces japonicus yFS275]
Length = 415
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/344 (34%), Positives = 161/344 (46%), Gaps = 91/344 (26%)
Query: 22 NGLGLTPPMGWNSWNHFHCQI--NEKI-----IRET------------DDCWGESSRDSQ 62
NGLGLTP MGWNSWN + C I N+ I I+E DDCW +RD
Sbjct: 20 NGLGLTPQMGWNSWNKYGCNISANDIISAATSIKEKGLLDLGYEYIVMDDCWALKTRD-- 77
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------- 112
P+ Q +V + + FP+G K L+D +H +
Sbjct: 78 -------------------PVTKQ----IVPDPSKFPNGIKNLSDSIHDMGFKWGMYSSA 114
Query: 113 -------ISNSLD-NRIESNIAFEQGIDYLKYDNCFNDDTRPT-----IRYRAMSDALKK 159
SLD I++ G+DYLKYDNC+N T RY MS AL
Sbjct: 115 GKYTCAGYPGSLDYEEIDATTFASWGVDYLKYDNCYNQGRSGTPSESFKRYEVMSKALNS 174
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR-------------------- 199
GRPIF+S+C+WG+ P W S I NSWR + DI D++ R
Sbjct: 175 TGRPIFYSLCQWGEDGPWNWASTIANSWRISGDIYDNFNRYDDRCPCESFQCIGLQGFDC 234
Query: 200 --MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDV 257
M I + GWND D LEVGNGGM Y+EYV HF++WAI K+PL+LG DV
Sbjct: 235 SVMNIIRKALPLSQKARDRDGWNDLDSLEVGNGGMTYDEYVTHFTVWAILKSPLMLGNDV 294
Query: 258 GNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLS--GNRIVVLL 299
++ + + I+ N ++I +NQ Q ++ G+ I++LL
Sbjct: 295 SAMSDQDLSIVSNRDLITINQDRGTQPAYSKWTKNYGDSIIILL 338
>gi|340368548|ref|XP_003382813.1| PREDICTED: alpha-galactosidase-like [Amphimedon queenslandica]
Length = 242
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/224 (42%), Positives = 123/224 (54%), Gaps = 27/224 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NGLGLTP MGWNSWNHFHC +++ +I+ T D+CW SSR
Sbjct: 20 LDNGLGLTPQMGWNSWNHFHCNVSQDLIKATAKAMIDKGLDKHGYQYVNIDNCWAASSRA 79
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S S+ D +T K L + + +GTK AD+ + +
Sbjct: 80 SDGSIRS-----DPVTFPDMKGLIDYVHSLGLKFGLYSDAGTKTCADHQPGSLGHETQ-- 132
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
++N + G+DYLKYDNC + ++P +RY M DAL K GRPIFFSMCEWG PA W
Sbjct: 133 -DANTYAQWGVDYLKYDNCNSGGSKPEVRYPVMRDALNKTGRPIFFSMCEWGVDNPATWA 191
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDM 224
S + NSWRTT DI D+W M++ AD N +A PGGWNDPDM
Sbjct: 192 SRVGNSWRTTGDIKDNWKSMISRADQNAGLWSYAGPGGWNDPDM 235
>gi|395804003|ref|ZP_10483244.1| alpha-galactosidase [Flavobacterium sp. F52]
gi|395433647|gb|EJF99599.1| alpha-galactosidase [Flavobacterium sp. F52]
Length = 410
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 171/356 (48%), Gaps = 49/356 (13%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
GL +TPPMGWNSWN F I+EK+++ET D S + ++ + D +
Sbjct: 32 GLAMTPPMGWNSWNTFATNIDEKLVKETADIMVSSGL---AAAGYNYIVLDDGWMTHER- 87
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI-------------ESN 124
G+LV + FPSG KAL DYVH + N + ++
Sbjct: 88 ---DANGDLVPDPEKFPSGMKALIDYVHNKGLKFGLYNCAGTKTCAGYPGTRGYEYQDAR 144
Query: 125 IAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+ GID+LKYD C + Y MS+ALK AGRPI FS+CEWGD +P WG +
Sbjct: 145 FYAKLGIDFLKYDWCNTEGITAKEAYTTMSNALKTAGRPIVFSLCEWGDNQPWEWGKPVG 204
Query: 185 NSWRTTDDI----------SDSWAR--MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
N WR + DI ++W+ + IADM + ++ P WND DM+EVGN M
Sbjct: 205 NLWRISGDIYPCFDCEFKHPENWSSWGFMKIADMRKDIRKYSGPDHWNDFDMMEVGN-EM 263
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
E HF++W + +PL G D ++ ET+ I+ N+E++AVNQ ++
Sbjct: 264 NDTEDKTHFAMWCMLSSPLFTGNDYRKMSKETLSILTNKELLAVNQDKLGIQGFKYAILD 323
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK 337
++W PLS N + +R S + W IK + +E +T F+ K
Sbjct: 324 GVEVWVKPLSDNTWAMSFVNRTETSKKINFDWKKNNIKDADFGYEADFNKTIFKIK 379
>gi|159463552|ref|XP_001690006.1| alpha-galactosidase [Chlamydomonas reinhardtii]
gi|158283994|gb|EDP09744.1| alpha-galactosidase [Chlamydomonas reinhardtii]
Length = 328
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 161/326 (49%), Gaps = 55/326 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NG+G TP MGWNSWN+F C INE IIR D S L + +++ ++
Sbjct: 1 LKNGVGKTPAMGWNSWNYFRCNINETIIRSVADAIVSS------GLMDAGYVYVNIDDCW 54
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ISNSLDNRIE---SNIAFEQ- 129
+ N G +V FPSG KAL DY+H L S++ + E + +E+
Sbjct: 55 MEKRDN-ATGRIVPFADKFPSGMKALGDYIHSLGLKFGVYSDTGKHTCEGYPGSAGYEEQ 113
Query: 130 --------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
G+DYLK+D C DT+ + Y M DAL GRPI FS+C WG +P LWG
Sbjct: 114 DAATYAEWGVDYLKFDYCDMQDTKESATYERMRDALAATGRPILFSLCSWGSGQPWLWGK 173
Query: 182 NIRNSWRTTDDI----SDSWARMLTIAD-----MNEVYADHAKPGGWNDPDMLEVGNGGM 232
++ NSWRT D+ + A+ L + + + A +A PGG+NDPDML V GG+
Sbjct: 174 DVGNSWRTGIDVFAAWDAAQAKALKLPNFLQPILGAGLAPYAGPGGFNDPDMLVV--GGL 231
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------------- 278
E HF+ W I APL+LG D ++ T+ I+ EV+AVNQ
Sbjct: 232 TQTEQRTHFAFWCIMAAPLILGNDPRAMSKATLEILLAREVLAVNQDPLGLQGRPVWTQE 291
Query: 279 -----AINDQQIWAGPLSGNRIVVLL 299
A +IW PL+ R +LL
Sbjct: 292 LDGASAGKSLEIWTKPLADGRTAMLL 317
>gi|403364578|gb|EJY82054.1| Melibiase family protein [Oxytricha trifallax]
Length = 381
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/273 (37%), Positives = 146/273 (53%), Gaps = 25/273 (9%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MGW++WN + C IN II++ D Q+ L + + ++ A P N+ G
Sbjct: 1 MGWSTWNSYGCNINLDIIKKNTDI------IVQLGLDTIGYKYVNIDDCWALPDRNK-DG 53
Query: 90 NLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIESNIAFEQ-GI 131
+ +++ FP G + DY+H L S+ SL + E F G
Sbjct: 54 RIQHDESKFPGGMTPVGDYIHSKGLKFGLYSSAGTLTCQKKAGSLGHEKEDASDFSMVGT 113
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DYLKYDNC N + RY AM+DAL GRPI++S+C WG+ W S I NSWRTT
Sbjct: 114 DYLKYDNCNNGNVPAITRYTAMADALNATGRPIYYSVCNWGEDNVWQWASKISNSWRTTG 173
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DI++++ M N ++ + A G WNDPDML++GNG + Y E HF++W KAPL
Sbjct: 174 DINNNFQSMKYNFYQNNLHPEVAGKGAWNDPDMLQIGNGNLTYAESRSHFALWCFVKAPL 233
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ 284
+LG D+ N+ E + II N+ +IA+NQ N Q
Sbjct: 234 ILGNDLTNIAPEVLNIISNKNLIAINQDSNTVQ 266
>gi|330944730|ref|XP_003306408.1| hypothetical protein PTT_19548 [Pyrenophora teres f. teres 0-1]
gi|311316079|gb|EFQ85485.1| hypothetical protein PTT_19548 [Pyrenophora teres f. teres 0-1]
Length = 507
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/353 (32%), Positives = 172/353 (48%), Gaps = 77/353 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+SNGL +TPPMGWN+WN F C ++E ++ T S++ + L + + L
Sbjct: 21 ISNGLAITPPMGWNNWNAFGCDVSENLLLTT------SAQIVSLGLRDLGYDYVVLDDCW 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
P +G L FP+G +++D++H SLD+ ++
Sbjct: 75 QDPKGRDKKGKLQPALEKFPNGLNSISDHLHGQNLKFGMYSSAGEMTCARFEGSLDHEVD 134
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+D LKYD+C++ P++ R++AMSDALK GR I ++C WG+
Sbjct: 135 DAKSFAGWGVDMLKYDSCYHMGRVGTPSVSFNRFKAMSDALKATGRNILLNLCNWGEDLV 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR--------MLTIADMNEV-----------------YA 211
WG +I NSWR T DI DS+ R L+ D+N V +A
Sbjct: 195 HTWGMSISNSWRITGDIYDSFTRPDDLCGCNSLSPGDVNCVAPGTHCSVVFILNKVAPFA 254
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM EY HF++WA K+PL LG D+ ++ A + II N
Sbjct: 255 DRSIPGGWSDLDMLEVGQGGMTDEEYKAHFALWAALKSPLFLGNDLRDMPASALTIINNP 314
Query: 272 EVIAVNQ----------------------AINDQQIWAGPL-SGNRIVVLLES 301
+IA++Q + + +WAG L +G+ V+LL +
Sbjct: 315 AIIALSQDPHGRSVTRVRRDTAGVAKDEWGVGETHVWAGHLQNGDEAVILLNA 367
>gi|402074382|gb|EJT69911.1| alpha-galactosidase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 510
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 167/353 (47%), Gaps = 77/353 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
++NGL LTPPMGWN+WN F C ++E ++ T + + + L + + L
Sbjct: 26 INNGLALTPPMGWNNWNAFGCDVSEHLLLST------AEKLVSLGLRDLGYNYVVLDDCW 79
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
P G L T FP G KA++D +H + SLD+ +
Sbjct: 80 QDPKGRDEHGKLQPEATKFPRGLKAISDDLHAMGFKYGMYSSAGEMTCARFEGSLDHEKD 139
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+D LKYD+C++ P I R++ MSDAL+ GR + ++C WG+
Sbjct: 140 DAQSFAAWGVDMLKYDSCYHMGRIGTPQISFNRFKVMSDALRATGRNVLLNLCNWGEDLV 199
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-------------------------MLTIADMNEVYA 211
WG +I NSWR T DI DS+ R +L I + +A
Sbjct: 200 HTWGMSIANSWRVTGDIYDSFTRPDDLCGCDENRLSDPSCIAPGTHCSVLFILNKVAAFA 259
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM +EY HF++WA K+PL LG DV ++ + I+ N
Sbjct: 260 DRSIPGGWSDLDMLEVGQGGMTDDEYKAHFALWAALKSPLFLGNDVREMSPRALSIVNNP 319
Query: 272 EVIAVNQ----------------------AINDQQIWAGPLS-GNRIVVLLES 301
+IA++Q + + +WAG L+ G++ VVLL +
Sbjct: 320 AIIALSQDPHGRSLTRVSRATDGVAKDVYGVGETHVWAGHLANGDQAVVLLNA 372
>gi|315499582|ref|YP_004088385.1| alpha-galactosidase [Asticcacaulis excentricus CB 48]
gi|315417594|gb|ADU14234.1| Alpha-galactosidase [Asticcacaulis excentricus CB 48]
Length = 501
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 123/341 (36%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSL 75
R + L TPPMGWNSWN F+ +I++K +RE D S RD+ + +
Sbjct: 139 RVVPQKALAPTPPMGWNSWNRFNTEIDDKTVREIADALVSSGLRDAGYVYVNIDDGWQGM 198
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA--------- 126
+PN FP KALADYVH S L I S+
Sbjct: 199 RDADGVLMPN----------AKFPD-MKALADYVH---SRGLKIGIYSSQGPKTCGGYEG 244
Query: 127 ----FEQ--------GIDYLKYDNCFNDD---TRPTI--RYRAMSDALKKAGRPIFFSMC 169
EQ G+DYLKYD C + T+ + Y+ M +AL GR I FS+C
Sbjct: 245 SYGHVEQDARTFANWGMDYLKYDLCSGEAFYYTKEAVYASYQQMGEALAATGRDIVFSLC 304
Query: 170 EWGDMRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
++G WG ++ WRTT DI D++ARM I +H P GWNDPDMLEVG
Sbjct: 305 QYGRFDVGSWGRDVGGHLWRTTGDIEDNYARMAWIGFDANGKPNHTGPNGWNDPDMLEVG 364
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
NGGM +EY H S+WA+ APLLLG DV ++T ET I+ N +VIA++Q I Q
Sbjct: 365 NGGMTTDEYKTHMSLWALMAAPLLLGNDVRSMTPETAAILSNRDVIAIDQDIAGVQGLPV 424
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
IW LS V L + ++ W IG+
Sbjct: 425 KKDATHEIWTKKLSDGATAVGLFNHSDRPITLSGEWAQIGL 465
>gi|393213793|gb|EJC99288.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Fomitiporia
mediterranea MF3/22]
Length = 497
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 170/368 (46%), Gaps = 82/368 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDC-WGESSRDSQISLSCSFHMFDSLTQIAA 80
NGL +TP MGWN+WN F C +E ++ T G RD + + Q
Sbjct: 23 NGLAVTPQMGWNTWNAFGCSTSEDLLLSTGKLIAGLGLRDLG---------YKYVVQDDC 73
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIES 123
G+L + T FP+G +AD +H L + SL + +
Sbjct: 74 WSAGRNSTGHLQVDTTKFPNGLSTVADELHGLGLGFGIYSDAGALTCGRFAGSLGHETQD 133
Query: 124 NIAFEQ-GIDYLKYDNCFNDDTRPT-----IRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
+ G+DYLKYDNC+N+ T RY+ M+DAL GR I +SMC WG+ RP
Sbjct: 134 AETWASWGVDYLKYDNCYNEGQSGTPQISYTRYKTMADALNATGRHILYSMCNWGEDRPW 193
Query: 178 LWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYADHA 214
W + NSWR + DI D++ R M+ I + + D
Sbjct: 194 NWAQTVANSWRMSGDIYDNFDRPDQACPCPGEQGIDCALPGFKCSMMNILNKVASFPDKG 253
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
G WND DMLE+GNGGM +EY HF++WA K+PL++G D+ +T ET+ I+ N V+
Sbjct: 254 VTGAWNDMDMLEIGNGGMTDDEYKTHFTMWAALKSPLIMGNDLRKITPETLSILSNPAVL 313
Query: 275 AVNQAINDQ---------------------QIWAGPLSGNRIVVLL-----ESRKTFSSM 308
A++Q + Q+W+G LSG +V+L + R+ +++
Sbjct: 314 AISQDTGSRPAFRRWRYFVGDTDKFGQGEIQMWSGSLSGGDALVVLLNGGDKEREMNATL 373
Query: 309 MTAHWDDI 316
+ WD++
Sbjct: 374 VDIFWDNM 381
>gi|225871747|ref|YP_002753201.1| alpha-galactosidase [Acidobacterium capsulatum ATCC 51196]
gi|225793573|gb|ACO33663.1| alpha-galactosidase [Acidobacterium capsulatum ATCC 51196]
Length = 406
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 128/390 (32%), Positives = 171/390 (43%), Gaps = 84/390 (21%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ-- 62
L TPPMGWNSWNHF +++E +R DD W + RD+Q
Sbjct: 40 LAPTPPMGWNSWNHFAGKVDEADVRAAAKAMVDSGMAAAGYKYIVIDDTW-QGKRDAQGF 98
Query: 63 ISLSCSFHMFDSLTQ----------IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL 112
I + F L Q I + P P C G Y H+
Sbjct: 99 IHPNSKFPDMPGLIQYVHSLGLKFGIYSSPGPQTCAGYE--------------GSYGHV- 143
Query: 113 ISNSLDNRIESNIAFEQGIDYLKYDNC----------FNDDTRP----TIRYRAMSDALK 158
+ ++ G+DYLKYD C N+ + Y M AL
Sbjct: 144 -------QQDAETYARWGVDYLKYDLCSYLGIMHKEAANNPAKALAMQQAAYLKMYKALA 196
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
AGRPI FS+C++G WG ++ N WRTT DI D++ARM TI A +A PG
Sbjct: 197 AAGRPIVFSLCQYGIGDVWKWGPSVGGNLWRTTGDIQDNYARMATIGFGQAGLAKYAGPG 256
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPDMLEVGNGGM EY H S+WA+ APL+ G D+ +++ T+ I+ N EVIAV+
Sbjct: 257 HWNDPDMLEVGNGGMTNEEYRTHMSLWALLAAPLIAGNDLSHMSPATLAILTNREVIAVD 316
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---NT 323
Q +IWA PL+G V L +R T MT + +
Sbjct: 317 QDRLGREGDRVSKNGALEIWAKPLTGGAKAVGLFNRDTQPHSMTLQLSVVNFPPHAHLRD 376
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+ H+T+ G T V H + +LT
Sbjct: 377 LWRHKTVHAH-HGAYTVTVPAHGVVLLKLT 405
>gi|192361444|ref|YP_001980770.1| putative alpha-galactosidase, aga27A [Cellvibrio japonicus Ueda107]
gi|294956467|sp|B3PGJ1.1|AGAL_CELJU RecName: Full=Alpha-galactosidase A; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Alpha-galactosidase 27A; AltName: Full=Melibiase;
Flags: Precursor
gi|190687609|gb|ACE85287.1| putative alpha-galactosidase, aga27A [Cellvibrio japonicus Ueda107]
Length = 404
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 154/319 (48%), Gaps = 49/319 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TP MGWNSWN F C ++EK+IR + D+ ++ ++ +
Sbjct: 29 LAKTPQMGWNSWNTFGCNVDEKMIR--------AMADAMVTSGMKAAGYEYINIDDCWHG 80
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIESNIA 126
G + A+K FPSG KALADYVH + S+ S + + +
Sbjct: 81 ERDKNGFIQADKKHFPSGMKALADYVHAKGLKLGIYSDAGNTTCAGRPGSRGHEYQDALT 140
Query: 127 FEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+ GIDY+KYD C D P Y M DA+ KAGRP+ FS+CEWGD +P W ++ +
Sbjct: 141 YASWGIDYVKYDWCDTQDINPKSAYATMRDAIHKAGRPMLFSICEWGDNQPWEWAQDVGH 200
Query: 186 SWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
SWRTT DI W +L I D +A PG WND DM+EVGN GM
Sbjct: 201 SWRTTGDIYPCWNCEHNHGSWSSFGVLPILDKQAGLRKYAGPGHWNDMDMMEVGN-GMTE 259
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQ 283
E HFS+WA +PL+ G D+ N++ T I+ ++E IA+NQ D
Sbjct: 260 EEDRAHFSLWAFMASPLIAGNDLRNMSDTTRAILTHKETIAINQDKLGIQAMKWIDEGDL 319
Query: 284 QIWAGPLSGNRIVVLLESR 302
+I+ PL VL +R
Sbjct: 320 EIYIKPLEKGHYAVLFLNR 338
>gi|440465350|gb|ELQ34674.1| alpha-galactosidase [Magnaporthe oryzae Y34]
gi|440487839|gb|ELQ67608.1| alpha-galactosidase [Magnaporthe oryzae P131]
Length = 568
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 81/371 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS--LTQ 77
+ NGL +TPPMGWN+WN F C ++E ++ S+ ++ ++L +D L
Sbjct: 82 VKNGLAITPPMGWNNWNAFGCDVSEHLLL--------STSEAVVNLGLRDLGYDHVVLDD 133
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNR 120
P G L N FP G KA++D +H SLD+
Sbjct: 134 CWQDPDGRDPNGKLQPNADKFPRGLKAISDDLHSENLKFGMYSTSGEMTCARFEGSLDHE 193
Query: 121 IESNIAFEQ-GIDYLKYDNCFN-----DDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
++ +F G+D+LKYDNC+N R++ M+DAL GRPI ++C WG+
Sbjct: 194 VDDANSFASWGVDFLKYDNCYNMGRIGSPVATFNRFKVMADALNATGRPIQLNLCNWGED 253
Query: 175 RPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYA 211
WG +I NSWR + DI DS+ R +L I + +A
Sbjct: 254 YVHTWGMSIANSWRMSGDIYDSFTRPDDLCSCTDPHDPLCVAPGTHCSVLFILNKVAPFA 313
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM E HF++WA K+PL+LG D+ ++ AE + I+ N
Sbjct: 314 DKSIPGGWSDLDMLEVGQGGMTREEQRAHFAMWAALKSPLMLGNDLRSMPAEALAIVSNP 373
Query: 272 EVIAVNQ-----------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSS 307
VIA++Q + +W+G L +G+++V+LL +
Sbjct: 374 AVIALSQDPHGRSVLRVKRTVGGELQPDEFGAQESHVWSGRLQNGDQVVILLNAAGN-DV 432
Query: 308 MMTAHWDDIGI 318
MTA +DI +
Sbjct: 433 EMTASLNDIFV 443
>gi|328869357|gb|EGG17735.1| putative alpha-N-acetylgalactosaminidase [Dictyostelium
fasciculatum]
Length = 585
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 130/405 (32%), Positives = 183/405 (45%), Gaps = 97/405 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC---QINEKIIRET-------------------DDCWGES 57
L+NG+ +TPPMGWN+WNHF C +IN ++I ET DDCW
Sbjct: 169 LNNGVAITPPMGWNTWNHFGCDTSEINSQLIYETATAMVTSGMAAVGYEYINLDDCWLAK 228
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
RD+ G L A+ FP G LADYVH L
Sbjct: 229 ERDAD--------------------------GRLQADPIRFPQGIAPLADYVHSLGLKMG 262
Query: 113 ISNSLDN-------------RIESNIAFEQGIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
I + N I+ E G+DY+K D C+ DD T Y +S+ L
Sbjct: 263 IYEDVGNLTCGGYPGSENYFEIDMKTFAEWGMDYVKMDGCYFPVDDMEET--YTQLSEIL 320
Query: 158 KKAGRPIFFSMCEWGDMRPA-------LWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
GRP+ +S C W + I N WR DDI D++ I D Y
Sbjct: 321 NATGRPMVYS-CSWPTYAFVNNQTVNFTYIGEICNLWREFDDIRDNFDSWTGILDQMMQY 379
Query: 211 --ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
A +A PG WNDPDMLE+GNGG EY FS+W+I APL+ G D+ N++A+ + I+
Sbjct: 380 NRAPYASPGHWNDPDMLEIGNGGQTTAEYQSFFSLWSIIAAPLIAGNDIRNMSADIINIL 439
Query: 269 GNEEVIAVNQ---AINDQ--------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
N ++IAV+Q I Q ++W PL G I +L +R + + D +
Sbjct: 440 TNADIIAVDQDALGIQGQRAYQSGTSEVWTRPLVGGAIAAVLFNRGEQPATIELTSDILQ 499
Query: 318 IKSNNTLFE----HRTLETK-FQGKLT-AKVDGHSCKMYELTPVT 356
+ SN+ +T T+ FQG + + +D H+ M +L+P+T
Sbjct: 500 VSSNSQFLMRDLWQKTAPTQVFQGSSSFSNIDAHAVIMLKLSPIT 544
>gi|1170911|sp|P41946.1|MEL5_YEAST RecName: Full=Alpha-galactosidase 5; AltName:
Full=Alpha-D-galactoside galactohydrolase 5; AltName:
Full=Melibiase 5; Flags: Precursor
gi|547464|emb|CAA85740.1| alpha-galactosidase [Saccharomyces cerevisiae]
Length = 471
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 170/356 (47%), Gaps = 75/356 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW+SWN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDSWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA+K FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADKHKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPDVSYHRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI+ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDITAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G DV +L A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGADVNHLKASSYSIYSQASVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMT 310
NQ + Q+W+GPL +G+++V LL M T
Sbjct: 316 NQDPKGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALLNGGSVARPMNT 371
>gi|284172749|ref|YP_003406131.1| Alpha-galactosidase [Haloterrigena turkmenica DSM 5511]
gi|284017509|gb|ADB63458.1| Alpha-galactosidase [Haloterrigena turkmenica DSM 5511]
Length = 541
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 136/408 (33%), Positives = 199/408 (48%), Gaps = 66/408 (16%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-R 59
L S+ AD N SDR L +PPMGWNSWN F+C I+E++I++ D ES +
Sbjct: 12 LLANSTVTADDN-SDRSESADKTLVESPPMGWNSWNTFYCDIDEELIKDAADAMAESGMK 70
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILIS 114
++ C + + + A G L + TFP+G ALADYVH + I
Sbjct: 71 EAGYEYVCIDDCWMAPERDA--------NGKLQPDPETFPNGISALADYVHDKGLKLGIY 122
Query: 115 NSLDNR----IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
S + ++ +E+ G+D+LKYDNC + + RY M +AL+
Sbjct: 123 ESAGTTTCQGLPGSLGYEETDAQTFADWGVDFLKYDNCGDHYGLSAVERYTRMHNALEAV 182
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR--------MLTIADMNEVYA 211
R I FS+CEWGD P +W + WRTT DI W ++ I D NE A
Sbjct: 183 DRDIIFSICEWGDNDPWMWAPEVGGDLWRTTGDIKPLWRAQEDLWGNGIIDIIDQNEPLA 242
Query: 212 DHAKPGGWNDPDMLEVGNGGMKY-----NEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
++A PG WNDPDML VG +Y E HF +WA+ APL+ G D+ N++ ET
Sbjct: 243 EYAGPGRWNDPDMLVVGVDLPEYPNLTEAEDRTHFGMWAMMAAPLMAGNDIRNMSDETRD 302
Query: 267 IIGNEEVIAV------NQAINDQQI----------WAGPLS-GNRIVVLLESRKTFSSMM 309
I+ N+EVIA+ NQA Q I WA L+ G+R V LL +R + +
Sbjct: 303 ILTNDEVIAIDQDPAGNQATRIQHIRGEDGLSRSVWAKTLANGDRAVGLL-NRSDRRTTV 361
Query: 310 TAHWDDIGIKSNNTLFEHRTL--ETKFQ--GKLTAKVDGHSCKMYELT 353
T +G+++ + + R L T +Q G ++A V H ++ ++
Sbjct: 362 TTSAQAVGLEA-ASCYVARDLWNGTDWQTAGLISASVPSHGLALFRVS 408
>gi|374313643|ref|YP_005060073.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
gi|358755653|gb|AEU39043.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
Length = 526
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 165/324 (50%), Gaps = 59/324 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL TPPMGWNSWN F ++++ +R D+ +S S + +
Sbjct: 161 NGLARTPPMGWNSWNKFAGRVDDAAVR--------GMADAMVSSGMSKAGYVYINIDDTW 212
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI----ESNIAFEQ--- 129
L GN+ NK FP KALADYVH I I +S + E + E
Sbjct: 213 ELGRDANGNVTTNKK-FPD-MKALADYVHSKGLKIGIYSSPGPKTCAGYEGSFGHEVQDA 270
Query: 130 ------GIDYLKYDNC-----FNDDTRPTIR--YRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYD C + DD PT++ Y+ M AL++ RPI +S+CE+G R
Sbjct: 271 KTYASWGIDYLKYDLCSGLDIYKDDA-PTLQAIYQKMGQALQETNRPIIYSLCEYG--RA 327
Query: 177 ALW-------GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
++W G N+ WRTT DISD W M I A++AKPG WNDPDMLE+GN
Sbjct: 328 SVWTGWGTKSGGNL---WRTTGDISDRWDSMDKIGFSQIKIAEYAKPGHWNDPDMLEIGN 384
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
GGM +EY H S+W++ APL+ G D+ +T ET I+ N EVIA++Q
Sbjct: 385 GGMTADEYRTHMSLWSLLAAPLIAGNDLRTMTDETKSILMNSEVIAIDQDPEYKPVVSVS 444
Query: 279 AINDQQIWAGPLSGNRIVVLLESR 302
+ N ++ PL ++V L +R
Sbjct: 445 SENKVEVLMRPLHDGSVIVGLFNR 468
>gi|322433177|ref|YP_004210398.1| alpha-galactosidase [Granulicella tundricola MP5ACTX9]
gi|321165569|gb|ADW71271.1| Alpha-galactosidase [Granulicella tundricola MP5ACTX9]
Length = 488
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 129/376 (34%), Positives = 182/376 (48%), Gaps = 52/376 (13%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSL 75
N+ NG+ TPPMGWNSWN F +I++K +RE D S RD+ ++ D+
Sbjct: 121 HNVPPNGMAKTPPMGWNSWNLFESRIDDKTVREIADAMVTSGMRDAGY---VYVNIDDTW 177
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI----ESNIA 126
+ QGNL +NK FP KALADYVH I +S R +
Sbjct: 178 EGV------RDAQGNLGSNKK-FPD-MKALADYVHSKGLKFGIYSSPGPRTCAEYPGSYG 229
Query: 127 FEQ---------GIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+E G+DYLKYD C F+ + + Y+ M DAL K+GRPI +S+CE+G
Sbjct: 230 YEDQDARTFAAWGVDYLKYDWCGARMTFSVEDLQAV-YQKMGDALLKSGRPIVYSLCEYG 288
Query: 173 DMRPALWGSNI-RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
+ WG+ + N WRTT DI + W M+ A A PG WNDPDMLE+GN
Sbjct: 289 NGHVETWGTKVGANLWRTTGDIHEPWTSMIGNITKQIPTAPFAGPGHWNDPDMLEIGNHH 348
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ------- 284
M +E S+WA++ APLL G D+ + A T I+ N EVIAV+Q QQ
Sbjct: 349 MSDDEDRTQMSLWALTAAPLLAGNDIRLMPASTRAILMNAEVIAVDQDALGQQAVPIKSG 408
Query: 285 ---IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG----IKSNNTLFEHRTLETKFQGK 337
+W PL+ + V + + + +++G +KS L+ H ++ G
Sbjct: 409 GLDMWVKPLADGSVAVGMVNMGATEATAEIKANELGLGTQVKSARDLWNHTSVAFH-NGA 467
Query: 338 LTAKVDGHSCKMYELT 353
AKV H M ++
Sbjct: 468 YLAKVPSHGVLMLRVS 483
>gi|389632001|ref|XP_003713653.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
gi|351645986|gb|EHA53846.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
Length = 512
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 174/369 (47%), Gaps = 81/369 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS--LTQ 77
+ NGL +TPPMGWN+WN F C ++E ++ S+ ++ ++L +D L
Sbjct: 26 VKNGLAITPPMGWNNWNAFGCDVSEHLLL--------STSEAVVNLGLRDLGYDHVVLDD 77
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNR 120
P G L N FP G KA++D +H SLD+
Sbjct: 78 CWQDPDGRDPNGKLQPNADKFPRGLKAISDDLHSENLKFGMYSTSGEMTCARFEGSLDHE 137
Query: 121 IESNIAFEQ-GIDYLKYDNCFN-----DDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
++ +F G+D+LKYDNC+N R++ M+DAL GRPI ++C WG+
Sbjct: 138 VDDANSFASWGVDFLKYDNCYNMGRIGSPVATFNRFKVMADALNATGRPIQLNLCNWGED 197
Query: 175 RPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYA 211
WG +I NSWR + DI DS+ R +L I + +A
Sbjct: 198 YVHTWGMSIANSWRMSGDIYDSFTRPDDLCSCTDPHDPLCVAPGTHCSVLFILNKVAPFA 257
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM E HF++WA K+PL+LG D+ ++ AE + I+ N
Sbjct: 258 DKSIPGGWSDLDMLEVGQGGMTREEQRAHFAMWAALKSPLMLGNDLRSMPAEALAIVSNP 317
Query: 272 EVIAVNQ-----------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSS 307
VIA++Q + +W+G L +G+++V+LL +
Sbjct: 318 AVIALSQDPHGRSVLRVKRTVGGELQPDEFGAQESHVWSGRLQNGDQVVILLNAAGN-DV 376
Query: 308 MMTAHWDDI 316
MTA +DI
Sbjct: 377 EMTASLNDI 385
>gi|51701643|sp|Q9UVD6.1|MEL_TORDE RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=MELt; AltName: Full=Melibiase; Flags: Precursor
gi|6429221|dbj|BAA86883.1| alpha-galactosidase [Torulaspora delbrueckii]
Length = 474
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 131/386 (33%), Positives = 174/386 (45%), Gaps = 116/386 (30%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGLGLTP MGWN+WN F C ++E ++ T DDCW + RDS
Sbjct: 25 NGLGLTPQMGWNNWNTFACNVSEDLLLSTVDRIAALGLRDIGYHYVILDDCWSDG-RDSD 83
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------ 110
G LV + T FP+G K +ADY+H
Sbjct: 84 --------------------------GMLVPDSTKFPNGMKHVADYLHGKDFLFGMYSSA 117
Query: 111 -----ILISNSLDNRIESNIAF--EQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALK 158
+ SLD+ E + AF + +DYLKYDNC+N T RYRAMS+AL
Sbjct: 118 GEYTCAGYAGSLDHE-EDDAAFFAKNEVDYLKYDNCYNRGQFGTPETSFNRYRAMSEALN 176
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------- 199
K RPIF+S+C WG WGS I NSWR + DI+ + R
Sbjct: 177 KTERPIFYSLCNWGQDLTFYWGSGIANSWRISGDITAEFDRPDSRCPCDGDEYDCPYAGF 236
Query: 200 ---MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
++ I + +A GGWND D LEVG G + +E HFS+WAI K+ +++G D
Sbjct: 237 HCSIMNILNKAAPMGQNAGVGGWNDLDCLEVGVGNLTDDEEKAHFSMWAIVKSAMVIGAD 296
Query: 257 VGNLTAETMPIIGNEEVIAVNQ------AI---------NDQ------QIWAGPL-SGNR 294
V NL + I V+A+NQ AI DQ Q+W+GPL +G+R
Sbjct: 297 VRNLKPSSFSIYSQASVLAINQDPAGAPAIRVWRRYVPETDQHGQGEVQLWSGPLDNGDR 356
Query: 295 IVVLLESRKTFSSMMTAHWDDIGIKS 320
+V LL M A+ +DI I S
Sbjct: 357 VVALLNGGAK-ERPMVAYLEDIFIDS 381
>gi|15788341|gb|AAL07760.1| alpha-galactosidase precursor [Saccharomyces cerevisiae]
Length = 454
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 167/359 (46%), Gaps = 81/359 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S I L + + L +
Sbjct: 7 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDIGLKDLGYTYVILDDCWSS 60
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSL---------------------DNR 120
G LVA+K FP+G +AD++H +NS
Sbjct: 61 G--RSSNGTLVADKEKFPNGMSHVADHLH---NNSFLFGMYSSAGEYTCAGYPGSLGHEE 115
Query: 121 IESNIAFEQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMR 175
+++ G+DYLKYDNC+N P RY+AMSDAL K GRPIF+S+C WG
Sbjct: 116 GDADFFASNGVDYLKYDNCYNKGQFGAPETSYNRYKAMSDALNKTGRPIFYSLCNWGQDL 175
Query: 176 PALWGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADH 213
WGS+I NSWR + DI + R ++ I + +
Sbjct: 176 TFYWGSDIANSWRMSGDIYPEFTRPDSRCPCDGDQFDCAYAGFHCSIMNILNKAAPMGQN 235
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A GGWND D LEVG G + +E HFS+WA+ K+PL++G DV L A + I +
Sbjct: 236 AGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGADVNQLKASSYSIYSQASI 295
Query: 274 IAVNQ---------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMT 310
IA+NQ + Q W+GPL +G++++ LL SM T
Sbjct: 296 IAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGDQMIALLNGGSKARSMNT 354
>gi|113501|sp|P04824.1|MEL1_YEAST RecName: Full=Alpha-galactosidase 1; AltName:
Full=Alpha-D-galactoside galactohydrolase 1; AltName:
Full=Melibiase 1; Flags: Precursor
gi|5257486|gb|AAD41358.1|AF152092_1 alpha-galactosidase [Cloning vector pGB-MEL1]
gi|3920|emb|CAA26888.1| unnamed protein product [Saccharomyces cerevisiae]
gi|171926|gb|AAA34770.1| pre-alpha galactosidase (melibiase) [Saccharomyces carlsbergensis]
gi|332687422|emb|CBY84983.1| alpha-galactosidase [Saccharomyces cerevisiae]
gi|332687424|emb|CBY84984.1| alpha-galactosidase [Saccharomyces cerevisiae]
Length = 471
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 174/366 (47%), Gaps = 74/366 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYIILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA++ FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADEQKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N P I RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPEISYHRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + D++ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDVTAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G +V NL A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGANVNNLKASSYSIYSQASVIAI 315
Query: 277 NQAIN---------------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
NQ N + Q+W+GPL VV L + + S M ++
Sbjct: 316 NQDSNGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALLNGGSVSRPMNTTLEE 375
Query: 316 IGIKSN 321
I SN
Sbjct: 376 IFFDSN 381
>gi|332687428|emb|CBY84986.1| alpha-galactosidase [Saccharomyces uvarum]
Length = 471
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 175/377 (46%), Gaps = 76/377 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L++ SS AA S G + NGLGLTP MGW++WN F C ++E+++ +T D R
Sbjct: 4 LYLFSSFAAVS-GVLGSSPSYNGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RI 56
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S I L + + L + G LVA+K FP+G +AD++H
Sbjct: 57 SDIGLKDLGYTYVILDDCWSSG--RSSNGTLVADKEKFPNGMSHVADHLHNNSFLFGMYS 114
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
SL + + F G+DYLKYDNC+N P RY+AMSDAL
Sbjct: 115 SAGEYTCAGYPGSLGHEEDDADFFASNGVDYLKYDNCYNKGQFGAPETSYNRYKAMSDAL 174
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K GRPIF+S+C WG WGS+I NSWR + DI + R
Sbjct: 175 NKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSGDIYPEFTRPDSRCPCDGDQFDCAYAG 234
Query: 200 ----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + +A GGWND D LEVG G + +E HFS+WA+ K+PL++G
Sbjct: 235 FHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGA 294
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPL-SGN 293
DV L A + I +IA+NQ + Q W+GPL +G+
Sbjct: 295 DVNQLKASSYSIYSQASIIAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGD 354
Query: 294 RIVVLLESRKTFSSMMT 310
+++ LL M T
Sbjct: 355 QVIALLNGGNKARPMNT 371
>gi|146302674|ref|YP_001197265.1| alpha-galactosidase [Flavobacterium johnsoniae UW101]
gi|146157092|gb|ABQ07946.1| Candidate Alpha-galactosidase; Glycoside hydrolase family 27
[Flavobacterium johnsoniae UW101]
Length = 411
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 180/386 (46%), Gaps = 63/386 (16%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
GL +TPPMGWNSWN F I+EK+++ET D S + ++ + D +
Sbjct: 33 GLAMTPPMGWNSWNTFAANIDEKLVKETADIMVSSGL---AAAGYNYIVLDDGWMTKER- 88
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI-------------ESN 124
G+LV + FP+G KAL DYVH + N + ++
Sbjct: 89 ---DANGDLVPDPAKFPNGMKALIDYVHSKGLKFGLYNCAGTQTCAGYPGTRGYEYQDAR 145
Query: 125 IAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+ GID+LKYD C Y MS+ALK AGRPI FS+CEWGD +P WG +
Sbjct: 146 FYAKLGIDFLKYDWCNTKGITAPEAYTTMSNALKTAGRPIVFSLCEWGDNQPWEWGKPVG 205
Query: 185 NSWRTTDDI----------SDSWAR--MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
N WR + DI ++W+ + IA+M + ++ P WND DM+EVGN M
Sbjct: 206 NLWRISGDIYPCFDCEFKHPENWSSWGFMKIAEMRKDIRKYSGPDHWNDFDMMEVGN-EM 264
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
E HF++W + +PL G D ++ ET+ I+ N+E+IA+NQ A +
Sbjct: 265 NDTEDKSHFAMWCMLSSPLFTGNDYRKMSKETLAILTNKELIAINQDKLGIQGFKHSAED 324
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD-------DIGIKS--NNTLFEHRTLET 332
++W PLS V +R S + W D G ++ N T F+ + L T
Sbjct: 325 GLEVWVKPLSDGNWAVTFLNRSDVSKKINFDWKKNTFKDVDFGYEADFNKTAFKLKNLWT 384
Query: 333 KFQG-----KLTAKVDGHSCKMYELT 353
+ TA++ H C +L+
Sbjct: 385 NKEAGSTKKNFTAELASHDCITLKLS 410
>gi|452001692|gb|EMD94151.1| glycoside hydrolase family 27 protein [Cochliobolus heterostrophus
C5]
Length = 511
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 172/353 (48%), Gaps = 77/353 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
++NGL +TPPMGWN+WN F C ++E ++ T S++ + L + + L
Sbjct: 21 VNNGLAITPPMGWNNWNAFGCDVSENLLLTT------SAQIIDLGLRDLGYNYVVLDDCW 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIE 122
P G L FP+G K++++++H + SLD+ I+
Sbjct: 75 QNPKGRDKDGKLQPALDKFPNGLKSISNHLHDRNLKFGMYSSAGEMTCARFAGSLDHEID 134
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+F G+D LKYD+C++ P++ R++ M+DALK GR I F++C WG+
Sbjct: 135 DAKSFAGWGVDMLKYDSCYHMGRVGTPSVSFNRFKTMADALKATGRNILFNLCNWGEDLV 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-------------------------MLTIADMNEVYA 211
WG +I NSWR T DI DS+ R +L I + +A
Sbjct: 195 HTWGMSISNSWRITGDIYDSFTRPDDLCGCNTVAPGDVNCVAPGTHCSVLFILNKVAPFA 254
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
D + PGGW+D DMLEVG GGM EY HF++WA K+PL LG D+ ++ A + II N
Sbjct: 255 DRSIPGGWSDLDMLEVGQGGMTDEEYKAHFALWAALKSPLFLGNDLRDMPASALTIINNP 314
Query: 272 EVIAVNQ----------------------AINDQQIWAGPL-SGNRIVVLLES 301
+IA++Q + + +WAG L +G+ +V+LL +
Sbjct: 315 AIIALSQDPHGRSVTRVRRDTEGVAKDEWGVGETHVWAGHLHNGDEVVILLNA 367
>gi|332687430|emb|CBY84987.1| alpha-galactosidase [Saccharomyces bayanus]
gi|332687436|emb|CBY84990.1| alpha-galactosidase [Saccharomyces uvarum]
Length = 471
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 175/377 (46%), Gaps = 76/377 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L++ SS AA S G + NGLGLTP MGW++WN F C ++E+++ +T D R
Sbjct: 4 LYLFSSFAAVS-GVLGSSPSYNGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RI 56
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S I L + + L + G LVA+K FP+G +AD++H
Sbjct: 57 SDIGLKDLGYTYVILDDCWSSG--RSSNGTLVADKEKFPNGMSHVADHLHNNSFLFGMYS 114
Query: 111 -------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDAL 157
SL + + F G+DYLKYDNC+N P RY+AMSDAL
Sbjct: 115 SAGEYTCAGYPGSLGHEEDDADFFASNGVDYLKYDNCYNKGQFGAPETSYNRYKAMSDAL 174
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K GRPIF+S+C WG WGS+I NSWR + DI + R
Sbjct: 175 NKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSGDIYPEFTRPDSRCPCDGDQFDCAYAG 234
Query: 200 ----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + +A GGWND D LEVG G + +E HFS+WA+ K+PL++G
Sbjct: 235 FHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGA 294
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPL-SGN 293
DV L A + I +IA+NQ + Q W+GPL +G+
Sbjct: 295 DVNKLKASSYSIYSQASIIAINQDPKGVPATRVWRHQVPQTDKYGQGEIQFWSGPLDNGD 354
Query: 294 RIVVLLESRKTFSSMMT 310
+++ LL M T
Sbjct: 355 QVIALLNGGNKARPMNT 371
>gi|426195050|gb|EKV44980.1| hypothetical protein AGABI2DRAFT_194078 [Agaricus bisporus var.
bisporus H97]
Length = 515
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 74/305 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQ 62
+G+G P MGWN+WN +HC+INE I+ + DDC+ E RDS
Sbjct: 106 DGVGRLPFMGWNTWNAYHCEINETIVLDNAKLMKSMGLLDAGYNYVNVDDCYSEKQRDSD 165
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------- 111
GN+VANK FPSG ++L D +H
Sbjct: 166 --------------------------GNIVANKERFPSGMRSLTDKLHAMGFKAGIYSDS 199
Query: 112 ------LISNSLDNRIESNIAFEQ--GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL- 157
L S N F++ G D LKYDNC F++ + + +++ MSDA+
Sbjct: 200 GWFTCQLYPGSYQNEDRDIELFQEQWGFDLLKYDNCAVPFDEVIKEGMVGKFKRMSDAIG 259
Query: 158 ---KKAGR-PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
++ G+ P+ +S+C+WG +P LW + +WRTTDDI+ SW I + N Y+
Sbjct: 260 RLSERTGKLPMLYSLCQWGREQPWLWAKKLGQTWRTTDDINPSWGSTTNILNQNSFYSWA 319
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G+ D DMLEVGNGGM + E HF+ WA+ K+PLL+G D+ T ET+ I+ N+E+
Sbjct: 320 NDYYGYGDMDMLEVGNGGMNFEESKSHFTAWALMKSPLLIGTDLTKATKETITILKNKEL 379
Query: 274 IAVNQ 278
I ++Q
Sbjct: 380 IKIHQ 384
>gi|299856761|pdb|3LRK|A Chain A, Structure Of Alfa-Galactosidase (Mel1) From Saccharomyces
Cerevisiae
gi|299856762|pdb|3LRL|A Chain A, Structure Of Alfa-Galactosidase (Mel1) From Saccharomyces
Cerevisiae With Melibiose
Length = 479
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 174/366 (47%), Gaps = 74/366 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYIILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA++ FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADEQKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N P I RY+AMSDAL K GRP+F+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPEISYHRYKAMSDALNKTGRPVFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + D++ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDVTAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G +V NL A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGANVNNLKASSYSIYSQASVIAI 315
Query: 277 NQAIN---------------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
NQ N + Q+W+GPL VV L + + S M ++
Sbjct: 316 NQDSNGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALLNGGSVSRPMNTTLEE 375
Query: 316 IGIKSN 321
I SN
Sbjct: 376 IFFDSN 381
>gi|1170912|sp|P41947.1|MEL6_YEAST RecName: Full=Alpha-galactosidase 6; AltName:
Full=Alpha-D-galactoside galactohydrolase 6; AltName:
Full=Melibiase 6; Flags: Precursor
gi|547466|emb|CAA85739.1| alpha-galactosidase [Saccharomyces cerevisiae]
Length = 471
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 170/356 (47%), Gaps = 75/356 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA+K FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADKHKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPDVSYHRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI+ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDITAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G DV +L A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGADVNHLKASSYSIYSQASVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMT 310
NQ + Q+W+GPL +G+++V LL M T
Sbjct: 316 NQDPKGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALLNGGSVARPMNT 371
>gi|281207657|gb|EFA81837.1| putative alpha-N-acetylgalactosaminidase [Polysphondylium pallidum
PN500]
Length = 651
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 134/398 (33%), Positives = 179/398 (44%), Gaps = 91/398 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC---QINEKIIRET-------------------DDCWGES 57
L+NG+ LTPPMGWN+WNHF C QIN +I ET DDCW
Sbjct: 281 LNNGVALTPPMGWNTWNHFGCETSQINATLIIETAMAMVTSGMAAVGYEYVNLDDCWLAK 340
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
RD+Q G L A+ FP G LA YVH L
Sbjct: 341 ERDAQ--------------------------GRLQADPIRFPDGIAPLAAYVHSLGLKMG 374
Query: 113 ISNSLDNRI-------ESNIAFEQ------GIDYLKYDNC-FNDDTRPTIRYRAMSDALK 158
I + N+ E+ +A + GIDY+K D C F D I Y +S L
Sbjct: 375 IYGDVGNQTCAGFPGSENYLALDAKTYASWGIDYVKMDGCNFPVDEMKEI-YTDLSQYLN 433
Query: 159 KAGRPIFFSMCEW-----GDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD--MNEVYA 211
GRP+ +S C W G+ + I N WR DDI+D++ I D M+ A
Sbjct: 434 ATGRPMVYS-CSWPAYAQGEYVNFTYVGEICNLWREFDDINDNFDTWTAILDQMMSTGRA 492
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
A PG WNDPDMLE+GNGG EY FS+W+I APL+ G D+ ++ +T+ I+ N
Sbjct: 493 PFAGPGNWNDPDMLEIGNGGQTTTEYTSFFSLWSIIAAPLIAGNDLRFMSQDTIDILTNS 552
Query: 272 EVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
EVI V+Q N +IW+ L G V+L +R + +S +T + I S
Sbjct: 553 EVIQVDQDPLGIQGTRIFQQNGNEIWSRQLVGGNTAVVLFNRGSSTSTITLTSQLLNIPS 612
Query: 321 NNTLFEHRTLETK----FQGKLTAKVDGHSCKMYELTP 354
L + K FQG ++ V H C M +L+P
Sbjct: 613 TTKLLTRDLWQHKSAGIFQGSMSFSVPSHGCVMLKLSP 650
>gi|51701642|sp|Q9P4V4.1|MEL_ZYGMR RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=MELr; AltName: Full=Melibiase; Flags: Precursor
gi|9188530|dbj|BAA99555.1| alpha-galactosidase [Zygotorulaspora mrakii]
Length = 470
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 168/367 (45%), Gaps = 76/367 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGWN+WN F C + E+++ T D R S++ L + + L +
Sbjct: 24 NGLGLTPQMGWNNWNTFACNVTEQLLLGTAD------RISELGLKDVGYNYVILDDCWSG 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHI--LISNSLDNRIESNIAFEQG--------- 130
G+LV + FP G K +AD++H L+ + E A G
Sbjct: 78 G--RSSNGSLVPDLNKFPHGMKYVADHLHDQDLLFGMYSSAGEYTCAGYPGSLGHEEKDA 135
Query: 131 -------IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+DYLKYDNC+N T RY+AMSDAL GRPIF+S+C WG
Sbjct: 136 QFFARNEVDYLKYDNCYNKGQFGTPQASYERYKAMSDALNNTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI+ + R ++ I + +A P
Sbjct: 196 WGSAIANSWRMSGDITADFDRPDSRCPCGDDEYDCKYAGYHCSIMNILNKAAPMGQNANP 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND DMLEVG G + +E HFS+WA+ ++PL++G DV +L + I VIA+
Sbjct: 256 GGWNDLDMLEVGVGNLTDDEEKAHFSMWAMVRSPLIIGADVNHLKPSSFSIYAQSPVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWD 314
NQ + Q W+GPL +G++++ L M A D
Sbjct: 316 NQDPRGVPATRVWRRQVSDTDAYGRGEVQFWSGPLENGDQVIAFLNGGNRMRP-MNAGLD 374
Query: 315 DIGIKSN 321
DI S+
Sbjct: 375 DIFFDSH 381
>gi|409074214|gb|EKM74620.1| hypothetical protein AGABI1DRAFT_116817 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 515
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 152/305 (49%), Gaps = 74/305 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQ 62
+G+G P MGWN+WN +HC+INE I+ + DDC+ E RDS
Sbjct: 106 DGVGRLPFMGWNTWNAYHCEINETIVLDNAKLMKSMGLLDAGYNYVNVDDCYSEKQRDSD 165
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------- 111
GN+VANK FPSG ++L D +H
Sbjct: 166 --------------------------GNIVANKERFPSGMRSLTDKLHAMGFKAGIYSDS 199
Query: 112 ------LISNSLDNRIESNIAFEQ--GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL- 157
L S N F++ G D LKYDNC F++ + + +++ MSDA+
Sbjct: 200 GWFTCQLYPGSYQNEDRDIELFQEQWGFDLLKYDNCAVPFDEVIKEGMVGKFKRMSDAIG 259
Query: 158 ---KKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH 213
++ G+P + +S+C+WG +P LW + +WRTTDDI+ SW I + N Y+
Sbjct: 260 RLSERTGKPPMLYSLCQWGREQPWLWAKKLGQTWRTTDDINPSWGSTTNILNQNSFYSWA 319
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G+ D DMLEVGNGGM + E HF+ WA+ K+PLL+G D+ T ET+ I+ N+E+
Sbjct: 320 NDYYGYGDMDMLEVGNGGMNFEESKSHFTAWALMKSPLLIGTDLTKATKETITILKNKEL 379
Query: 274 IAVNQ 278
I ++Q
Sbjct: 380 IKIHQ 384
>gi|302868145|ref|YP_003836782.1| ricin B lectin [Micromonospora aurantiaca ATCC 27029]
gi|302571004|gb|ADL47206.1| Ricin B lectin [Micromonospora aurantiaca ATCC 27029]
Length = 551
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 128/405 (31%), Positives = 184/405 (45%), Gaps = 97/405 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+ TPPMGWN+WN F C INE +IR+T DDCW + +RD
Sbjct: 36 LDNGVARTPPMGWNTWNTFGCNINETLIRQTADAIVSNGLRDLGYKYVVVDDCWFDPNRD 95
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
SQ GNL A+ FPSG K+L DY+H
Sbjct: 96 SQ--------------------------GNLQAHPQRFPSGMKSLGDYLHARGLLFGIYQ 129
Query: 113 ----------------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAM 153
+ SL++ + F G+DYLKYD C + T R+ M
Sbjct: 130 VPLDKTCAQVGGAFPGATGSLNHEYQDARQFAAWGVDYLKYDWCSSTGTINDQVARFGIM 189
Query: 154 SDALKKAGRPIFFSM---CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT-------- 202
DAL GRPI +S+ P S++ N WRTT+DI+++W T
Sbjct: 190 RDALASTGRPIVYSINPNSYHAKTGPQRNWSDVANLWRTTEDITNAWNTGQTNGYPMGIQ 249
Query: 203 -IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
I D+ A +A PGG+NDPDMLEVG GGM E HF++WA+ APL++G DV +
Sbjct: 250 NIVDVTVPLAGYAAPGGFNDPDMLEVGRGGMNDTEMRSHFALWAVLAAPLMMGNDVRSAN 309
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMT 310
A T+ I+ N ++A+NQ +Q + A PL+ + V L ++ ++ ++
Sbjct: 310 AATLAILRNANLVAINQDTLGRQAAQVSFDGTRRVLAKPLANGDVAVALFNQGGSTTTVS 369
Query: 311 AHWDDIGIKSNN-TLFEHRTLETKFQ-GKLTAKVDGHSCKMYELT 353
+G N+ TL + T T G ++A V H +Y ++
Sbjct: 370 TTAAAVGKSGNSFTLRDAWTDATSTSTGTISASVPAHGTVVYRVS 414
>gi|389645608|ref|XP_003720436.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
gi|351640205|gb|EHA48069.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
gi|440463566|gb|ELQ33138.1| alpha-galactosidase [Magnaporthe oryzae Y34]
gi|440477127|gb|ELQ58260.1| alpha-galactosidase [Magnaporthe oryzae P131]
Length = 502
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 153/303 (50%), Gaps = 53/303 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAA 80
NGL L P MGWN+WN +HC ++E+++ +T + RD L ++ + D I
Sbjct: 36 NGLNLVPQMGWNNWNAYHCDVSEELLLKTAEAMVTYGLRD----LGYNYVVLDDCWSIGR 91
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD----NRIESNIAFEQ-- 129
G L+ N FPSG K++AD +H + + +S R ++ FEQ
Sbjct: 92 NE-----SGYLLHNPVKFPSGMKSIADKLHAMKFKFGMYSSAGVFTCGRYPGSLGFEQKD 146
Query: 130 -------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
G+DYLKYDNC+N T RY MS AL GRP+ ++MC WG+ P
Sbjct: 147 ADTFASWGVDYLKYDNCYNQGQSGTPQLSFNRYNVMSKALNNTGRPMVYAMCNWGNDDPY 206
Query: 178 LWGSNIRNSWRTTDDISDSWAR--------------------MLTIADMNEVYADHAKPG 217
W I NS+R + DI DS+ R ++ I + +PG
Sbjct: 207 DWAYTIANSYRMSGDIYDSFQRPDSRCPCTETPCNWPGFHCSVMNILNKMAPITSRTQPG 266
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
+ND DMLEVGNGG +EYV+HFS+WA+ +PLL+G ++ L+ + I+ N VIA+N
Sbjct: 267 AFNDMDMLEVGNGGQSDSEYVLHFSMWAMMSSPLLMGTNIPGLSPAHLSILSNPAVIALN 326
Query: 278 QAI 280
Q +
Sbjct: 327 QDV 329
>gi|51701636|sp|Q11129.1|MEL_SACMI RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=MELj; AltName: Full=Melibiase; Flags: Precursor
gi|1246856|emb|CAA64760.1| alpha-galactosidase MEL [Saccharomyces mikatae]
Length = 471
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 178/387 (45%), Gaps = 79/387 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
LFI +++ + GS NGLGLTP MGW++WN F C ++E+++ T D R
Sbjct: 6 LFITAAAVTGALGSSPS---YNGLGLTPQMGWDNWNTFACDVSEQLLLNTAD------RI 56
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S+I L + + L + G LVA+K FP+G +A ++H
Sbjct: 57 SEIGLKDLGYKYVILDDCWSSG--RNSNGTLVADKNKFPNGMDHVARHLHNNNFLFGMYS 114
Query: 111 -------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPT-----IRYRAMSDAL 157
SL + E F + G+DYLKYDNC+N T RY+AMSDAL
Sbjct: 115 SAGEYTCAGYPGSLGHEQEDAEFFARNGVDYLKYDNCYNKGKFGTPETSYKRYKAMSDAL 174
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K GRPIF+S+C WG WGS+I NSWR + DI + R
Sbjct: 175 NKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSGDIYPEFDRPDSRCPCDGDQYDCSYAG 234
Query: 200 ----MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + +A GGWND D LEVG G + +E HFS+WA+ K+PL++G
Sbjct: 235 FHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGA 294
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------------------AINDQQIWAGPL-SGN 293
DV +L + I VIA+NQ + Q+W+ PL +G+
Sbjct: 295 DVNHLKESSYSIYSQASVIAINQDPKGVPATRVWRHYVSQTDKYGKGEIQLWSCPLDNGD 354
Query: 294 RIVVLLESRKTFSSMMTAHWDDIGIKS 320
+++ LL M A +DI S
Sbjct: 355 QVIALLNGSNK-KRPMNASLEDIFFDS 380
>gi|329847088|ref|ZP_08262116.1| melibiase family protein [Asticcacaulis biprosthecum C19]
gi|328842151|gb|EGF91720.1| melibiase family protein [Asticcacaulis biprosthecum C19]
Length = 496
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 161/340 (47%), Gaps = 48/340 (14%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWG 55
+ R + L PPMGW+SWN F I++ IR+ DD W
Sbjct: 134 EERVVAQKSLAPRPPMGWSSWNKFATNIDDTTIRQIADAMVSSGLRDAGYIYVNIDDGW- 192
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN 115
+ RD+ L + H D K L + + G K A Y
Sbjct: 193 QGERDAAGVLHPNAHFPD------MKDLADYVHARGLKLGLYSGPGPKTCAGYEGAYGHV 246
Query: 116 SLDNRIESNIAFEQGIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+ D R + E G+DYLKYD C +N Y+ M +ALK GR I +S+C+
Sbjct: 247 AQDARTFA----EWGVDYLKYDLCSGEAFYNTAETVYATYQQMGEALKAPGRDIVYSLCQ 302
Query: 171 WGDMRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
+G WG ++ WRTT DI D++A+M +I +H P GWNDPDMLEVGN
Sbjct: 303 YGRFDVGSWGRDVGGHLWRTTGDIEDTYAKMSSIGFDKNGVPNHTGPNGWNDPDMLEVGN 362
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ----- 283
GGM ++EY H S+WA+ APL+LG DV ++T ET+ ++ N EVIAV+Q + Q
Sbjct: 363 GGMTHDEYRTHMSLWALLAAPLILGNDVRDMTPETVALLTNSEVIAVDQDPLGAQGLPVK 422
Query: 284 -----QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
+IW PLS + V L +T +T W ++ I
Sbjct: 423 KDGAAEIWVKPLSNGKAVGLFNRGET-PLTITGTWTELCI 461
>gi|315505458|ref|YP_004084345.1| ricin b lectin [Micromonospora sp. L5]
gi|315412077|gb|ADU10194.1| Ricin B lectin [Micromonospora sp. L5]
Length = 551
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 128/405 (31%), Positives = 185/405 (45%), Gaps = 97/405 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+ TPPMGWN+WN F C INE +IR+T DDCW + +RD
Sbjct: 36 LDNGVARTPPMGWNTWNTFGCNINETLIRQTADAIVSNGLRDLGYKYVVVDDCWFDPNRD 95
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
SQ GNL A+ FPSG K+L DY+H
Sbjct: 96 SQ--------------------------GNLQAHPQRFPSGMKSLGDYLHARGLLFGIYQ 129
Query: 113 ----------------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAM 153
+ SL++ + F G+DYLKYD C + T R+ M
Sbjct: 130 VPLDKTCAQVGGAFPGATGSLNHEYQDARQFAAWGVDYLKYDWCSSTGTINDQVARFGIM 189
Query: 154 SDALKKAGRPIFFSM---CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT-------- 202
DAL GRPI +S+ P S++ N WRTT+DI+++W T
Sbjct: 190 RDALASTGRPIVYSINPNSYHAKTGPQRNWSDVANLWRTTEDITNAWNTGQTNGYPMGIQ 249
Query: 203 -IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
I D+ A +A PGG+NDPDMLEVG GGM E HF++WA+ APL++G DV +
Sbjct: 250 NIVDVTVPLAGYAAPGGFNDPDMLEVGRGGMNDTEMRSHFALWAVLAAPLMMGNDVRSAN 309
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMT 310
A T+ I+ N ++A+NQ +Q + A PL+ + V L ++ + ++ ++
Sbjct: 310 AATLAILRNANLVAINQDTLGRQAAQVSFDGTRRVLAKPLANGDVAVALFNQGSSTTTVS 369
Query: 311 AHWDDIGIK-SNNTLFEHRTLETKFQ-GKLTAKVDGHSCKMYELT 353
+G S+ TL + T T G ++A V H +Y ++
Sbjct: 370 TTAAAVGKSGSSFTLRDAWTDATSTSTGTISASVPAHGTVVYRVS 414
>gi|399028218|ref|ZP_10729521.1| alpha-galactosidase [Flavobacterium sp. CF136]
gi|398074295|gb|EJL65446.1| alpha-galactosidase [Flavobacterium sp. CF136]
Length = 410
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 170/356 (47%), Gaps = 49/356 (13%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
GL +TPPMGWNSWN F I+EK+++ET D S + ++ + D +
Sbjct: 32 GLAMTPPMGWNSWNTFGTNIDEKLVKETADIMVSSG---MAAAGYNYIVLDDGWMAKERD 88
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI-------------ESN 124
+ G+LVA+ FP+G KA+ DYVH + N + ++
Sbjct: 89 V----NGDLVADPVKFPNGMKAVIDYVHGKGLKFGLYNCAGTQTCAGYPGTRGYEYQDAR 144
Query: 125 IAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+ GID+LKYD C Y MS+ALK AG+PI FS+CEWGD +P WG +
Sbjct: 145 FYAKLGIDFLKYDWCNTQGITAKEAYTTMSNALKTAGKPIVFSLCEWGDNQPWEWGKPVG 204
Query: 185 NSWRTTDDI----------SDSWAR--MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
N WR + DI ++W+ + I +M + ++ P WND DM+EVGN M
Sbjct: 205 NLWRISGDIYPCFDCEFKHPENWSSWGFMKIVEMRKDIRKYSGPDHWNDFDMMEVGN-EM 263
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
E HF++W + +PL G D ++ ET+ I+ N+E+IAVNQ A +
Sbjct: 264 NDTEDKSHFAMWCMMASPLFAGNDFRKISKETLAILTNKELIAVNQDKLGIQGFKYSAED 323
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK 337
++W PLS V +R + + W IK + +E +T F+ K
Sbjct: 324 GLEVWVKPLSDGNWAVTFLNRSDVAKKINFDWKKHTIKDADFGYEADFNKTVFKLK 379
>gi|299856763|pdb|3LRM|A Chain A, Structure Of Alfa-Galactosidase From Saccharomyces
Cerevisiae With Raffinose
gi|299856764|pdb|3LRM|B Chain B, Structure Of Alfa-Galactosidase From Saccharomyces
Cerevisiae With Raffinose
gi|299856765|pdb|3LRM|C Chain C, Structure Of Alfa-Galactosidase From Saccharomyces
Cerevisiae With Raffinose
gi|299856766|pdb|3LRM|D Chain D, Structure Of Alfa-Galactosidase From Saccharomyces
Cerevisiae With Raffinose
Length = 479
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 173/366 (47%), Gaps = 74/366 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYIILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA++ FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADEQKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKY NC+N P I RY+AMSDAL K GRP+F+S+C WG
Sbjct: 136 QFFANNRVDYLKYANCYNKGQFGTPEISYHRYKAMSDALNKTGRPVFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + D++ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDVTAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G +V NL A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKAHFSMWAMVKSPLIIGANVNNLKASSYSIYSQASVIAI 315
Query: 277 NQAIN---------------------DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
NQ N + Q+W+GPL VV L + + S M ++
Sbjct: 316 NQDSNGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALLNGGSVSRPMNTTLEE 375
Query: 316 IGIKSN 321
I SN
Sbjct: 376 IFFDSN 381
>gi|254443006|ref|ZP_05056482.1| Melibiase subfamily [Verrucomicrobiae bacterium DG1235]
gi|198257314|gb|EDY81622.1| Melibiase subfamily [Verrucomicrobiae bacterium DG1235]
Length = 410
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 119/339 (35%), Positives = 158/339 (46%), Gaps = 60/339 (17%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDS---QISLSCSFHMFDSLTQI 78
GL TPPMGWNSWN F INE++I D E+ RD+ I+L + M +
Sbjct: 29 GLAETPPMGWNSWNTFATNINEELIIGVADSMVENGMRDAGYIYINLDDGWMMRE----- 83
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSLDNR 120
G+LV + FP+G K LADY+H N
Sbjct: 84 ------RDENGDLVPDPEKFPNGLKHLADYLHERGFKFGIYGDAGRKTCAGYPGNQGHEY 137
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
++ E G+DYLKYD C + Y M DAL AGRP+ FSMCEWG P LW
Sbjct: 138 QDARKYAEWGVDYLKYDWCSTESRDAVEAYTTMRDALYAAGRPVVFSMCEWGTAEPWLWA 197
Query: 181 SNIRNSWRTTDDI----------SDSWARMLTIA-----DMNEVYADHAKPGGWNDPDML 225
+I + WRTT DI S W ++L + +N + +A PG WNDPDM+
Sbjct: 198 EDIGHLWRTTGDIISCYDCVQEWSMGWKKILDLQMSLNPGINGLEG-YAGPGHWNDPDMM 256
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI----- 280
EVGN G+ E HFS+WAI APL+ G DV +T + + I+ NEE IA+NQ
Sbjct: 257 EVGNHGISDVESKSHFSLWAILAAPLIAGNDVREMTPQIVDILTNEEAIAMNQDALGKQG 316
Query: 281 ------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
++++IW L G V + + T W
Sbjct: 317 TRIYQDDEKEIWLKYLEGGDFGVCVLNASEEKRETTMEW 355
>gi|307103618|gb|EFN51877.1| hypothetical protein CHLNCDRAFT_8622, partial [Chlorella
variabilis]
Length = 319
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 169/328 (51%), Gaps = 55/328 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NG+GLTP MGWNSWN F + E++IR+T + + L + ++ + +
Sbjct: 2 LDNGVGLTPAMGWNSWNQFGLNVTEELIRQTSE--------QIVDLGLADVGYEYV--VI 51
Query: 80 AKPLPNQ---CQGNLVANKTTFPSGTKALADYVHILI----------SNSLDNRIESNIA 126
PN+ QG + AN T FPSG L+DYVH L + + + S
Sbjct: 52 DDGWPNKTRDAQGRMQANLTIFPSGIANLSDYVHSLGLKFGIYSDAGALTCGGHMGSYGH 111
Query: 127 FEQ--------GIDYLKYDNCFNDD----TRPTIRYRAMSDALKK-------AGRPIFFS 167
FE G+DYLK + P ++ +++ AGR I++S
Sbjct: 112 FEDDAETWAEWGVDYLKAGGKRGRGEGWRSDPYTQHHSLAAPSHAGRLVSLGAGREIYYS 171
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+C WG + P WG+ + NSWRTT DI+ ++ R+ + D N + A PGGWNDPDMLEV
Sbjct: 172 ICNWGIVNPWRWGAEVGNSWRTTADIAANFDRVFSNLDNNVGLSQFAAPGGWNDPDMLEV 231
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN--LTAETMPIIGNEEVIAVNQ---AIND 282
G+G + + HF++WA K+PL++G D+ ++ E++ I+ N ++IAVNQ +
Sbjct: 232 GSGALTLQDQRSHFALWAAVKSPLIIGTDLRRVAISPESLAILKNRDIIAVNQDPLGVAA 291
Query: 283 QQIW--------AGPLSGNRIVVLLESR 302
+++W AGPL G VV+L +R
Sbjct: 292 ERVWKQGSAEVFAGPLQGGDRVVVLFNR 319
>gi|374384441|ref|ZP_09641961.1| hypothetical protein HMPREF9449_00347 [Odoribacter laneus YIT
12061]
gi|373228349|gb|EHP50658.1| hypothetical protein HMPREF9449_00347 [Odoribacter laneus YIT
12061]
Length = 491
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 187/408 (45%), Gaps = 69/408 (16%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDS 61
I + +A + S+ + + L LTP MGWNSWN F ++ E ++ ET D RD
Sbjct: 93 ITAENAKGKSVSELNIQIGDELALTPVMGWNSWNTFGPELTEALVLETADAMIANGMRD- 151
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----------- 110
L + D Q+ + G + NK FP G K +ADY+H
Sbjct: 152 ---LGYQYINIDDYWQLKDR----GADGRIQINKEKFPRGIKYVADYLHERGFRLGIYSD 204
Query: 111 --------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPT--IRYRAMSDALKKA 160
+ S ++ I++ G+D LKYD C + R T +RYR M +AL+
Sbjct: 205 ASRYTCGGVCGSYGYED-IDARDFASWGVDLLKYDYCGAPEERDTAIVRYRKMGEALRAT 263
Query: 161 GRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWAR-------MLTIADMNEVYAD 212
R I FS+CEWG P W + + WRTT DI D W+ ++ I D N+ AD
Sbjct: 264 DRSIVFSVCEWGGREPWTWAKEVGGHYWRTTGDIRDKWSTDNKNFLGIVNILDRNKNLAD 323
Query: 213 HAKPGGWNDPDMLEVGNGGMKYN-------------EYVVHFSIWAISKAPLLLGCDVGN 259
+A PG WNDPDML VG G ++ EY H S+W + APLL G DV +
Sbjct: 324 YAGPGAWNDPDMLTVGIFGKSFSINDGRKDFGCTLEEYRSHMSLWCMMAAPLLSGNDVRS 383
Query: 260 LTAETMPIIGNEEVIAVNQ-AINDQ----------QIWAGPLSGNRIVVLLESRKTFSSM 308
+ I+ NEE+IA+NQ A+ Q +IW L V +L +R
Sbjct: 384 MADSVKDILLNEEIIAINQDALGKQAVVVKTEGNCEIWQKNLEDGVAVAVL-NRGEQPET 442
Query: 309 MTAHWDDIGIKSN---NTLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
+T H+ ++G+K +++H L T LT H ++Y+LT
Sbjct: 443 VTLHFKEMGMKGKVKVRDIWKHVNLGT--METLTVSPKAHGTEVYKLT 488
>gi|115387665|ref|XP_001211338.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195422|gb|EAU37122.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 641
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/391 (31%), Positives = 186/391 (47%), Gaps = 55/391 (14%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDC---WGES 57
L + ++ AAD S R L +GL TP MGWN++NH++C +E+I R +G +
Sbjct: 9 LLLPAAFAADH--SPLRARLQDGLARTPQMGWNTYNHYNCYPDEEIFRSNAKALVDFGLA 66
Query: 58 SRDSQI-SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---- 112
+ ++ C + + + L G+L N T FPSG A+ADY+H L
Sbjct: 67 DLGYRYATIDCGWTLTERL-----------ANGSLTWNATRFPSGFPAIADYLHDLGLLF 115
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKA 160
S+ + N ++ N E + Y + + +P RY MS+AL+K
Sbjct: 116 GVYGDAGIKLCGPSDEVGN-LDDNCFSEASLGYPNVEYAPSSPLKP--RYEVMSNALQKL 172
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN 220
RPI F +CEWG PALW + +SWR +DI +W + + D A PG W
Sbjct: 173 DRPILFQICEWGIDFPALWAPALGHSWRIGNDIIPAWRSVFRTLNQAVPQTDFAGPGQWP 232
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT----AETMPIIGNEEVIAV 276
D DML VGNG E HFS+WAI K+PL++G + + T +E++ I+ + VI
Sbjct: 233 DLDMLMVGNGVYSVPEEETHFSLWAILKSPLIIGSALKDATTEINSESLRILKQKAVIGY 292
Query: 277 NQ-----------AINDQ--QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
NQ DQ ++W+GPLS R V + + + + +T DIG++S
Sbjct: 293 NQDKLGVSASLRRRWTDQGYEVWSGPLSNGRTVAAVINWRGEARDLTLDLPDIGLQSAGL 352
Query: 324 L--FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+ + Q TA+V+GH + EL
Sbjct: 353 VKNIWAGSTSRNVQTSYTARVEGHGTMLLEL 383
>gi|396484503|ref|XP_003841960.1| similar to alpha-galactosidase precursor [Leptosphaeria maculans
JN3]
gi|312218535|emb|CBX98481.1| similar to alpha-galactosidase precursor [Leptosphaeria maculans
JN3]
Length = 507
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/374 (32%), Positives = 172/374 (45%), Gaps = 83/374 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-DDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
NGL L P MGWN+WN FHC +NE ++ T D RD L + + D +
Sbjct: 44 NGLNLVPQMGWNNWNAFHCDVNESLLLTTAQDMVDYGLRD----LGYNHIVLDDCWAVGR 99
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD----NRIESNIAFEQ-- 129
G LVA+ FPSG +++ D +H L I +S R ++ +E+
Sbjct: 100 NE-----SGYLVADSYKFPSGMQSIVDQIHALGFKFGIYSSAGVLTCGRFPGSLGYEKQD 154
Query: 130 -------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
G+DYLKYDNCFN T RY+ MSDAL GR + ++MC WG+ P
Sbjct: 155 ADLWASWGVDYLKYDNCFNQGQSGTPKLSFDRYKVMSDALNATGREMVYAMCNWGNDDPF 214
Query: 178 LWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKP 216
W I NS R + DI DS+ R ++ I +
Sbjct: 215 DWAYRIANSGRMSGDIYDSFNRPDDRCPCTEAVGCPWPGFHCSVMNILNKMPAITSRTMS 274
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G +ND DMLE+GNGG +EYVVHFS+WAI+ +P+L+G +VG L+ + I N ++A+
Sbjct: 275 GYFNDMDMLEIGNGGQSDSEYVVHFSMWAINSSPMLIGTNVGTLSPANLAIYSNPAILAI 334
Query: 277 NQ---AINDQQIW----------AGPLS--------GNRIVVLLESRKTFSSMMTAHWDD 315
NQ A +++W G +S G+R+V LL + SM
Sbjct: 335 NQDRSAGAAKRVWRYLEAADDNGQGEISLWTRTLGNGDRVVALLNAANVSMSM------- 387
Query: 316 IGIKSNNTLFEHRT 329
+ + F+ RT
Sbjct: 388 -NVTIDEVFFDERT 400
>gi|302883605|ref|XP_003040702.1| hypothetical protein NECHADRAFT_78539 [Nectria haematococca mpVI
77-13-4]
gi|256721591|gb|EEU34989.1| hypothetical protein NECHADRAFT_78539 [Nectria haematococca mpVI
77-13-4]
Length = 407
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/339 (34%), Positives = 160/339 (47%), Gaps = 68/339 (20%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+ GL P MGWNSWN F INE II T + +S SL ++ + D+ Q
Sbjct: 16 AQGLATKPQMGWNSWNSFKAIINESIIESTANTIVKSGL---ASLGYNYVLMDAGWQT-- 70
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHILI-------------------SNSLDNRI 121
L +G N T FP G K +ADY+H L S+ +
Sbjct: 71 --LERDKEGRQQVNLTRFPGGIKPVADYIHKLGLKVGIYSDAGIYGCDFAPGSHGYEELD 128
Query: 122 ESNIAFEQGIDYLKYDNC--FNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
S A E GIDYLKYDNC F +T R+ MS ALK GR IF+S+CEWG P
Sbjct: 129 ASQYA-EWGIDYLKYDNCGGFQANTLSVQERFLRMSHALKDTGRDIFYSLCEWGHQFPWF 187
Query: 179 WGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPG 217
W +S+R + DI + + + I M E+ + KPG
Sbjct: 188 WADQFSDSYRMSGDIHSEYKKDNSNGCATAYCLNTGYAGVSVLTMIRKMREI-SRFQKPG 246
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
W D DMLEVG G M ++ HFS WA K+PL++G DV N+ ++ ++ N+E+IA+N
Sbjct: 247 SWADMDMLEVGVGNMTRHQEETHFSFWAALKSPLIIGADVNNIRESSLEVLKNKEIIALN 306
Query: 278 Q-----AIN---------DQQIWAGPLSG--NRIVVLLE 300
Q A+N Q+WAGPL +R V+L++
Sbjct: 307 QDKLGVAVNYIPSLSREGKYQVWAGPLKSGKSRHVILVQ 345
>gi|407918679|gb|EKG11948.1| Glycoside hydrolase clan GH-D [Macrophomina phaseolina MS6]
Length = 494
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 146/319 (45%), Gaps = 89/319 (27%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NGL L P MGWN+WN FHC ++E+++ T DDCW +S
Sbjct: 31 NGLNLVPQMGWNNWNAFHCDVSEELLLNTAQAMVDYGLRDLGYKYVVLDDCWSAGRNES- 89
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSL 117
G LV N FP+G K +AD VH L + +S
Sbjct: 90 --------------------------GYLVHNPKKFPNGMKYVADRVHDLGMKYGMYSSA 123
Query: 118 D----NRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKK 159
+ ++ +EQ G+DYLKYDNC+N T RY MS AL
Sbjct: 124 GVYTCGKYPGSLGYEQNDADLFASWGVDYLKYDNCYNQGQSGTPKLSFDRYNVMSKALNN 183
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR-------------------- 199
GRPI +++C WG+ P W I NS+R + DI DS+ R
Sbjct: 184 TGRPIVYALCNWGNDDPYDWAYTISNSYRMSGDIYDSFQRPDDRCPCVETPCSWPGFHCS 243
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
++ I + +PG +ND DMLEVGNGG +EYVVHFS+WA+ +PLL+G ++
Sbjct: 244 VMNILNKMAPIVSRTQPGAFNDMDMLEVGNGGQSDSEYVVHFSMWALMSSPLLIGTNIPT 303
Query: 260 LTAETMPIIGNEEVIAVNQ 278
L+ + I N VIA+NQ
Sbjct: 304 LSPANLAIYSNPAVIALNQ 322
>gi|317034650|ref|XP_001400806.2| alpha-galactosidase D [Aspergillus niger CBS 513.88]
gi|298351557|sp|A2R2S6.2|AGALD_ASPNC RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
Length = 660
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 177/395 (44%), Gaps = 67/395 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L +GL LTP MGWN++NH+ C NE I+ +S+ + + L S + +T
Sbjct: 24 LQDGLALTPQMGWNTYNHYSCSPNETIV--------QSNAQALVDLGLSSLGYRYVTTDC 75
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIE 122
A LP+ G+L N T FP G A+ D++H IL+ S N
Sbjct: 76 GWTVADRLPD---GSLTWNDTLFPQGFPAMGDFLHDLGLLFGVYQDSGILLCGSPPNETG 132
Query: 123 SNIAFEQ--------GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S EQ +D LKYDNC++D T P R+ MS AL + R
Sbjct: 133 SLYHEEQDARTFASWNVDSLKYDNCYSDAATGYPNVNYAPSTSPEPRFANMSHALLQQNR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
I F +CEWG PA W + +SWR +DI W + I + D A PG W D
Sbjct: 193 TILFQICEWGISFPANWAPALGHSWRIGNDIIPDWRTIFRIINQAAPQTDVAGPGQWPDL 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN-LTA---ETMPIIGNEEVIAVNQ 278
DMLEVGN E HFS+WAI K+PL++G + + LTA ++ ++ ++VIA NQ
Sbjct: 253 DMLEVGNNIFSLPEEQTHFSLWAILKSPLIIGAALKDELTAINDASLAVLKQKDVIAFNQ 312
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLS R V + + S +T D+G++ T+
Sbjct: 313 DALGKSASLRRRWTEEGYEVWSGPLSNGRTVAAVINWHNESKDLTLDLPDVGLQHAGTVK 372
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
T TA V GH + EL T +
Sbjct: 373 NIWDGTTARDVLTSYTATVAGHGTMLLELQNTTAV 407
>gi|119467278|ref|XP_001257445.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
gi|119405597|gb|EAW15548.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
Length = 481
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 171/368 (46%), Gaps = 75/368 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
+ + L+NGL TP MGWN+WN F C++NE +I ++ + +SL ++ +
Sbjct: 16 KASALNNGLARTPQMGWNTWNSFACELNETVIL--------NAAERIVSLGFRDLGYEYV 67
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN----RIESNIA 126
G L+A+ FP+G LAD VH L I +S R E ++
Sbjct: 68 VLDDCWSAGRNSSGYLIADSEKFPNGIAHLADKVHELGLKIGIYSSAGTWTCARYEGSLG 127
Query: 127 FEQ---------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWG 172
+E+ GIDYLKYDNC+N+ T RY AM AL GRP+ +S+C WG
Sbjct: 128 YEEKDAALWASWGIDYLKYDNCYNEGEEGTPKLSFDRYNAMFKALNATGRPMLYSLCNWG 187
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEV 209
P + I NSWRT D+S+ W R ++ + +
Sbjct: 188 VDGPWNFAPTIANSWRTAGDLSNVWDRDDVNCPCSELEGLDCKTPGYKCSIMNVLNKAVY 247
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
Y A PG WND DML+VGNGG+ +E + H S+WA K+PLL+ + + T+ I+
Sbjct: 248 YPSKAIPGAWNDLDMLQVGNGGLTDDESIAHMSLWAALKSPLLMTNVMTKIDPPTLSILQ 307
Query: 270 NEEVIAVNQ---------------------AINDQQIWAGPLSGNRIVVLLESRKTFSSM 308
N V+AV+Q + Q+++GPLSG +VLL + + +
Sbjct: 308 NPAVLAVSQDPVASTPVRQWRYFVDDVDENGKGEIQMYSGPLSGGDQLVLLLNAGSKARE 367
Query: 309 MTAHWDDI 316
M A DI
Sbjct: 368 MNATLVDI 375
>gi|281425762|ref|ZP_06256675.1| alpha-galactosidase [Prevotella oris F0302]
gi|281400023|gb|EFB30854.1| alpha-galactosidase [Prevotella oris F0302]
Length = 407
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 108/290 (37%), Positives = 153/290 (52%), Gaps = 50/290 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDS---QISLSCSFHMFDSLTQIA 79
L TP MGWNSWN F I+E II+ D S RD+ I++ +H
Sbjct: 27 LSQTPQMGWNSWNKFQGNIDEDIIKGIADAMVSSGLRDAGYVYINMDDCWH--------- 77
Query: 80 AKPLPNQCQGN--LVANKTTFPSGTKALADYVHI------LISN-----------SLDNR 120
+C N + A+ FP G KALADY+H + S+ SL +
Sbjct: 78 -----GKCDANGFIQADPKHFPHGIKALADYIHARGLKLGIYSDAGTETCAGRPGSLGHE 132
Query: 121 IESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+ + + + +DYLKYD C + P Y+ MSDAL+ +GRPIFFSMCEWGD +P W
Sbjct: 133 YQDALQYARWDVDYLKYDWCNTVNVNPRGAYQLMSDALRASGRPIFFSMCEWGDNQPWRW 192
Query: 180 GSNIRNSWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
+I NSWR DI S+ +L + N+ +A PG WNDPDMLEVG
Sbjct: 193 ARDIGNSWRIGPDIWCSFDSTRVFPTYVQYSVLDCINKNDSLRRYAGPGHWNDPDMLEVG 252
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
N G+ N+ HF++W + +PL+LG D+ +++AET I+ N ++IA++Q
Sbjct: 253 N-GLTVNQDRAHFTMWCMMASPLILGNDIRHMSAETKAILTNRDLIAIDQ 301
>gi|134081479|emb|CAK46492.1| unnamed protein product [Aspergillus niger]
Length = 431
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 177/395 (44%), Gaps = 67/395 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L +GL LTP MGWN++NH+ C NE I+ +S+ + + L S + +T
Sbjct: 24 LQDGLALTPQMGWNTYNHYSCSPNETIV--------QSNAQALVDLGLSSLGYRYVTTDC 75
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIE 122
A LP+ G+L N T FP G A+ D++H IL+ S N
Sbjct: 76 GWTVADRLPD---GSLTWNDTLFPQGFPAMGDFLHDLGLLFGVYQDSGILLCGSPPNETG 132
Query: 123 SNIAFEQ--------GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S EQ +D LKYDNC++D T P R+ MS AL + R
Sbjct: 133 SLYHEEQDARTFASWNVDSLKYDNCYSDAATGYPNVNYAPSTSPEPRFANMSHALLQQNR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
I F +CEWG PA W + +SWR +DI W + I + D A PG W D
Sbjct: 193 TILFQICEWGISFPANWAPALGHSWRIGNDIIPDWRTIFRIINQAAPQTDVAGPGQWPDL 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN-LTA---ETMPIIGNEEVIAVNQ 278
DMLEVGN E HFS+WAI K+PL++G + + LTA ++ ++ ++VIA NQ
Sbjct: 253 DMLEVGNNIFSLPEEQTHFSLWAILKSPLIIGAALKDELTAINDASLAVLKQKDVIAFNQ 312
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLS R V + + S +T D+G++ T+
Sbjct: 313 DALGKSASLRRRWTEEGYEVWSGPLSNGRTVAAVINWHNESKDLTLDLPDVGLQHAGTVK 372
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
T TA V GH + EL T +
Sbjct: 373 NIWDGTTARDVLTSYTATVAGHGTMLLELQNTTAV 407
>gi|317503004|ref|ZP_07961088.1| alpha-galactosidase [Prevotella salivae DSM 15606]
gi|315665869|gb|EFV05452.1| alpha-galactosidase [Prevotella salivae DSM 15606]
Length = 408
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 149/288 (51%), Gaps = 46/288 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDS---QISLSCSFHMFDSLTQIA 79
L TP MGW+SWN F I+E II+ D S RD+ +++ +H
Sbjct: 27 LSKTPQMGWSSWNKFQGNIDEDIIKSIADAMVSSGLRDAGYIYVNIDDCWHG-------- 78
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIE 122
G + A+ FP G KALADY+H + S+ SL + +
Sbjct: 79 ----KRDSNGFIQADPKHFPHGIKALADYIHARGLKLGIYSDAGSETCAGRPGSLGHEYQ 134
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ + + IDYLKYD C + Y+ MSDAL+ AGRPIFFSMCEWGD P W
Sbjct: 135 DALQYARWDIDYLKYDWCNTTNINARGAYQLMSDALRAAGRPIFFSMCEWGDNNPWRWAR 194
Query: 182 NIRNSWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
+I NSWR DI S+ +L + N+ +A PG WNDPDMLEVGN
Sbjct: 195 DIGNSWRIGPDIWCSFDSTRVFPTYVQYSVLDCINKNDSLRSYAGPGHWNDPDMLEVGN- 253
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ N+ HF++W + +PL+LG DV N++AET I+ N E+IA++Q
Sbjct: 254 GLTVNQDRAHFTMWCMMASPLILGNDVRNMSAETKAILTNRELIAIDQ 301
>gi|285018714|ref|YP_003376425.1| glycoside hydrolase family 27 [Xanthomonas albilineans GPE PC73]
gi|283473932|emb|CBA16433.1| putative glycoside hydrolase, family 27 transmembrane protein
[Xanthomonas albilineans GPE PC73]
Length = 531
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 180/380 (47%), Gaps = 61/380 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQ 77
L S+GL TPP+GWNSWN F I++ +R D S RD+ + D Q
Sbjct: 162 LPSDGLARTPPLGWNSWNRFGEHIDDATVRRIADAMVASGLRDA----GYVYVNIDDGWQ 217
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN----SLDNRIESNIAF 127
+ + ++ FP +ALADYVH L S+ S S
Sbjct: 218 ------GQRDRDGVLQPNARFPD-MRALADYVHAKGLKLGLYSSPGPKSCAGYTGSYGHV 270
Query: 128 EQ--------GIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
EQ GIDYLKYD C F + + Y M AL+ +GRPI +S+CE+G
Sbjct: 271 EQDARTWAGWGIDYLKYDLCSGEGIFREPQQVRRAYLQMGQALRASGRPIVYSLCEYGRD 330
Query: 175 RPALWGSNIRNS-WRTTDDISDSWARMLTIA-DMNEVYADHAKPGGWNDPDMLEVGNGGM 232
WG + WRTT DI D++A+M D N AD A PGGWNDPDMLEVGNGGM
Sbjct: 331 HVGQWGREVGGHLWRTTGDIEDTYAKMAANGFDHNGDPAD-AGPGGWNDPDMLEVGNGGM 389
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ-------- 283
+EY H ++WA+S APLLLG D+ +T T+ ++ N V+A++Q A+ Q
Sbjct: 390 SEDEYRTHLALWAMSAAPLLLGNDLRQMTPVTVALLRNRAVLAIDQDALGVQGRAVRKDG 449
Query: 284 --QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKL--- 338
+IW PL+ + + + +R ++ D G T + R +QG+
Sbjct: 450 AIEIWRKPLADGGVALGVFNRDAQPHLVVLTAQDAG-----TALQGRRWRDLWQGQTRPA 504
Query: 339 ----TAKVDGHSCKMYELTP 354
+ +V H + L+P
Sbjct: 505 RQLRSVQVAAHGVVLLRLSP 524
>gi|440796286|gb|ELR17395.1| Alphagalactosidase [Acanthamoeba castellanii str. Neff]
Length = 398
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 120/398 (30%), Positives = 175/398 (43%), Gaps = 75/398 (18%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MGWN+WN F C +E +++ S + + + ++ + QG
Sbjct: 1 MGWNTWNTFQCNYDESTLKQMAHALISSGMAKAGYTFLNIDDWHATLSLSTRE--RDMQG 58
Query: 90 NLVANKTTFPSGTKALADYVHILI----------SNSLDNRIESNIAFEQ--------GI 131
NL + T FP+G KA DY+H L + + SN ++Q GI
Sbjct: 59 NLQPDPTKFPNGMKAFVDYIHSLGLKVGIYSDVGTYTCQGFPGSNGYYQQDANTFASWGI 118
Query: 132 DYLKYDNCF---NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
DYLK+D C+ + +P + Y MS AL K GRPI FS+C WG W I N WR
Sbjct: 119 DYLKFDTCYLTNEIENQPWLFYGQMSQALNKTGRPIVFSICNWGRKDSWNWAPKIGNLWR 178
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN------------E 236
TT DI W R+L I D + +A PG +NDPDMLEVG G +N +
Sbjct: 179 TTYDIFPWWGRVLEILDAQKPLYPYAAPGAFNDPDMLEVGVNGTFFNLPLAPAAWLDDTQ 238
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------------ 278
HFS+WAI APL+ G D+ +T I+ N EVIAVNQ
Sbjct: 239 ARSHFSLWAIMAAPLITGNDLRTMTPAIQAILTNSEVIAVNQDPLGLQGRVVASVQSGLD 298
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTF--------SSMMTAHWDDIGIKSNN 322
+ ++WA ++ RI V+L +R + +T W +G+ S++
Sbjct: 299 LDGACLWLKCSWTEVWAKNMTNGRIAVVLFNRAGVLYDFNQWGNEDVTVTWTQLGLSSSS 358
Query: 323 TLFEHRTLET-----KFQGKLTAK-VDGHSCKMYELTP 354
+ L + +F G+ TAK + H+ + L P
Sbjct: 359 ATYRATDLWSGQALGQFTGQFTAKNLAPHAVQFIVLDP 396
>gi|260593558|ref|ZP_05859016.1| alpha-galactosidase [Prevotella veroralis F0319]
gi|260534546|gb|EEX17163.1| alpha-galactosidase [Prevotella veroralis F0319]
Length = 410
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/288 (36%), Positives = 147/288 (51%), Gaps = 46/288 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD----SQISLSCSFHMFDSLTQIA 79
L LTP MGW+SWN+F INE II+ D S + I++ +H
Sbjct: 29 LSLTPQMGWSSWNNFQGNINEDIIKSIADAMVASGLKDAGYTYINIDDCWHG-------- 80
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIE 122
G + A+ FP G KALADYVH + S+ SL + +
Sbjct: 81 ----KRDADGFIQADPKHFPHGIKALADYVHSRGLKLGIYSDAGSETCAGRPGSLGHEYQ 136
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ + + GIDYLKYD C + Y+ M DA++ AGRPIFFSMCEWGD P W
Sbjct: 137 DALQYARWGIDYLKYDWCNTTNVNAQGAYQLMRDAIQAAGRPIFFSMCEWGDNHPWRWAK 196
Query: 182 NIRNSWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
I +SWR DI S+ ++ + N+ +A PG WNDPDMLEVGN
Sbjct: 197 GIGDSWRIGPDIWCSFDSTHVFPTYIQCSIIDCINRNDSLRSYAGPGHWNDPDMLEVGN- 255
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ N+ HF++W + +PL+LG DV N++ ET I+ N ++IA++Q
Sbjct: 256 GLSVNQDRAHFTMWCMMASPLILGNDVRNMSDETKAILTNRDLIAIDQ 303
>gi|393245406|gb|EJD52916.1| alpha-galactosidase [Auricularia delicata TFB-10046 SS5]
Length = 457
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 121/395 (30%), Positives = 174/395 (44%), Gaps = 109/395 (27%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET---------- 50
L +V+ + AD +++R GL PP+GWNSWN F C I++ I +
Sbjct: 9 LALVTVTVADVGTNNQRP----GLAPRPPLGWNSWNKFGCDISQDTILDAAKSIVSTGLK 64
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW RD++ + KP+P+ F G
Sbjct: 65 DFGYEYIVIDDCWHADQRDNETN----------------KPVPH---------PERFSLG 99
Query: 102 TKALADYVHILI----------SNSLDNRIESNIAFEQ---------GIDYLKYDNCFND 142
K +AD VH L +++ R S + FE+ IDYLKYDNC+N+
Sbjct: 100 IKHIADEVHKLGLKFGIYSSAGTHTCAGRFGS-LGFEEIDAQTYASWDIDYLKYDNCYNE 158
Query: 143 DTRPT-----IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW 197
T RY MS AL GR I +++C WG+ P W I +WR + D+ D++
Sbjct: 159 GQSGTPEISFTRYEKMSKALGSTGREIVYALCNWGEDGPWNWAPTIAETWRISGDVWDNF 218
Query: 198 ---------------------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
+ I + + A PG WND DMLEVGNGG Y+E
Sbjct: 219 DTVDDYCPCETMIDCKLPGFHCAVEKIIEFSAPLGQKAGPGHWNDMDMLEVGNGGQSYDE 278
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQ------------- 283
YV HFS+WAI+K+PL+LG D+ +T ET II N+ ++A+NQ
Sbjct: 279 YVTHFSMWAINKSPLILGHDLTKMTNETYEIITNKAILAINQDEQGSPAHRKWKRPVEGG 338
Query: 284 --QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
Q+W G L+ VV L + + ++A + DI
Sbjct: 339 TVQMWTGSLANGDTVVALLNLSPGNVTVSADFADI 373
>gi|440790280|gb|ELR11563.1| alphagalactosidase, putative [Acanthamoeba castellanii str. Neff]
Length = 345
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 168/366 (45%), Gaps = 63/366 (17%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MGWNSWNHF C I+ R+ +C IS + M D++ K
Sbjct: 1 MGWNSWNHFGCSID--CDRQPFNC---------ISQALLIQMADAMVLAGMK-------- 41
Query: 90 NLVANKTTFPSGTKALADYVH---ILISNSLDNRIESNIAF---------------EQGI 131
A FP G K +ADY+H + D ++ F E G+
Sbjct: 42 --YALFPRFPLGIKYVADYIHKRGLKFGIYQDCGKQTCEGFPGSFGYEALDVKTFAEWGV 99
Query: 132 DYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW---GDMR----PALWGSNIR 184
D LK D C+ + +Y S L A RP+ FS C W D++ P +++
Sbjct: 100 DLLKLDGCYFPVDQMEAKYTQWSQLLGNASRPMVFS-CSWPTYADLKNMSVPWAKVASVC 158
Query: 185 NSWRTTDDISDSWARMLTIADMNE--VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFS 242
N WR DI D WA +L I D E A A+PG WNDPDMLEVGNGGM Y+EY HFS
Sbjct: 159 NLWREWIDIRDEWATILKIVDYQEEMKLAQWARPGAWNDPDMLEVGNGGMTYDEYKSHFS 218
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
+WA+ APL+ G D+ ++ ET I+ EVIAVNQ A + ++W L
Sbjct: 219 LWAMMAAPLIAGNDIRTMSHETRTILTAREVIAVNQDPLGRQGGRLRAQSGLEVWTRRLH 278
Query: 292 GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLET---KFQGKLTAKVDGHSCK 348
N VLL +R T + +T W ++G+ S + E+ F TA V+ H
Sbjct: 279 DNSQAVLLWNRNTSRTNITVTWAEMGLASRTQRVRNLWTESDEGTFVNAFTAPVEPHGVV 338
Query: 349 MYELTP 354
M +++P
Sbjct: 339 MVKVSP 344
>gi|288929235|ref|ZP_06423080.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288329337|gb|EFC67923.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 406
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 149/288 (51%), Gaps = 46/288 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RD---SQISLSCSFHMFDSLTQIA 79
L TP MGW+SWN F INE II+ D S RD + I++ +H
Sbjct: 26 LAQTPQMGWSSWNKFQGNINEDIIKGIADAMVSSGLRDAGYTYINIDDCWHG-------- 77
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIE 122
G + + FPSG KALADYVH + S+ SL + +
Sbjct: 78 ----QRDADGFIQPDSKHFPSGMKALADYVHARGLKLGIYSDAGTETCAGRPGSLGHEYQ 133
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ + + +DYLKYD C + P Y+ +SDAL AGRPIF SMCEWGD +P W
Sbjct: 134 DALQYARWEVDYLKYDWCNTTNVNPRGAYQLISDALCAAGRPIFLSMCEWGDNQPWRWAR 193
Query: 182 NIRNSWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
+I +SWR DI S+ +L + N+ +A PG WNDPDMLEVGN
Sbjct: 194 DIGHSWRIGPDIWCSFDSTRVFPTYVQYSVLDCINKNDSLRRYAGPGHWNDPDMLEVGN- 252
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ N+ HF++W + +PL+LG DV N++AET I+ N ++IA+NQ
Sbjct: 253 GLSVNQDRAHFAMWCMMASPLILGNDVRNMSAETKAILTNRDLIAINQ 300
>gi|336370252|gb|EGN98593.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 393
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/352 (32%), Positives = 156/352 (44%), Gaps = 59/352 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQI-SLSCSFHMFDSLTQI 78
L+N L TP MGWN +N F C E+ + D+ + SL + FD Q
Sbjct: 20 LNNDLERTPAMGWNPYNAFLCSTTEEQYLTA----ASNLIDTGLKSLGYEYFNFDCGWQG 75
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------------ISN 115
+ G + N+T PSG AL +VH L
Sbjct: 76 TNR----TASGTITWNETRIPSGIPALGQHVHGLGFKFGVYSDAGYYSCDFVNGQAHWIG 131
Query: 116 SLDNRIESNIAFEQ-GIDYLKYDNCFN------------DDTRPTIRYRAMSDALKKAGR 162
SL + F G DYLKYDNC++ D ++ Y AM DAL GR
Sbjct: 132 SLGYELSDAKTFTSWGADYLKYDNCYSVIVFSFPSSNNADSSKSKPHYTAMRDALAGTGR 191
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD--SWARMLTIADMNEVYADHAKPGGWN 220
P+ FSMCEWG PA W + NSWR +DI SW + I + A PG WN
Sbjct: 192 PVAFSMCEWGVQDPARWAPAVGNSWRIANDIGPPPSWDNVFRIINQVVPITGFAGPGAWN 251
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
D D+LEVGN G+ E HF+ WA +K+PLL+ D+ + T+ I+ N +IA+NQ
Sbjct: 252 DLDLLEVGNEGLSVAEQASHFAFWAAAKSPLLVSTDLTSPAESTLTILKNTRIIALNQDP 311
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
ND +WAGPL+ V +L + + S +T D+G +S
Sbjct: 312 LGTSITFKRRYTNDYDVWAGPLADGSTVAVLLNWQNSSRSLTFDLSDVGFES 363
>gi|391865565|gb|EIT74844.1| alpha-D-galactosidase [Aspergillus oryzae 3.042]
Length = 420
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 170/393 (43%), Gaps = 67/393 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS----QISLSCSFHMFDSL 75
L NGL TP MGWN++NH+ C NE I+R + S ++ C + + D L
Sbjct: 24 LDNGLARTPQMGWNTYNHYSCSPNETIVRSNAQALVDLGLASLGYRYVTTDCGWTVADRL 83
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIE 122
+ G+L N+T FP G AL Y+H I + S + +
Sbjct: 84 SD-----------GSLTWNETLFPEGFPALGKYLHDLNLLFGVYQDSGIKLCGSPPDNVG 132
Query: 123 SNIAFEQG--------IDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S +Q +D LKYDNC++D T P R+ MS AL R
Sbjct: 133 SLYHEDQDARTFASWEVDSLKYDNCYSDAATGYPNVNYEPSTSPQPRFANMSRALAAQNR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+ F +CEWG PA W + +SWR +DI W + + A PG W D
Sbjct: 193 SMVFQVCEWGIDFPARWAPALGHSWRIGNDIIPHWRAIYRTLNQAVPQTSFAGPGQWPDL 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGN + E HFS+WAI K+PL +G D ++ E++ I+ EEVI NQ
Sbjct: 253 DMLFVGNDILSIPEEQTHFSLWAILKSPLTIGAALKDDNTSINDESLQILKQEEVIGYNQ 312
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLSG R V L + + S +T DIG++ T+
Sbjct: 313 DSLGVSASLRRRWTEEGYEVWSGPLSGGRTVAALINWRNESRDLTLDLPDIGLQYAGTVK 372
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
T + TAKV GH + EL T
Sbjct: 373 NIWDGTTAQNVKTSYTAKVQGHGTILLELQDTT 405
>gi|29468168|gb|AAO85428.1| alpha-galactosidase [Oryza sativa Japonica Group]
Length = 184
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/206 (45%), Positives = 114/206 (55%), Gaps = 47/206 (22%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E SRDSQ GN V N+ TFPSG KALADYVH
Sbjct: 6 DDCWAEYSRDSQ--------------------------GNFVPNRQTFPSGIKALADYVH 39
Query: 111 I----------LISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTI-RY 150
S + N++ ++ E+ G+DYLKYDNC ND R + RY
Sbjct: 40 AKGLKLGIYSDAGSQTCSNKMPGSLDHEEQDVKTFASWGVDYLKYDNC-NDAGRSVMERY 98
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
MS+A+K G+ IFFS+CEWG PA W + NSWRTT DI+D+W M + AD N+ +
Sbjct: 99 TRMSNAMKTYGKNIFFSLCEWGKENPATWAGRMGNSWRTTGDIADNWGSMTSRADENDQW 158
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNE 236
A +A PGGWNDPDMLEVGNGGM E
Sbjct: 159 AAYAGPGGWNDPDMLEVGNGGMSEAE 184
>gi|393718125|ref|ZP_10338052.1| glycoside hydrolase family 27 [Sphingomonas echinoides ATCC 14820]
Length = 509
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 157/343 (45%), Gaps = 47/343 (13%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLT 76
L +GL LTPPMGW+SWN F +I++ IR D + RD+ + + +
Sbjct: 148 TLAPDGLALTPPMGWSSWNKFADKIDDATIRAIADAMVSTGLRDAGYTYVNIDDGWQGVR 207
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDN----------- 119
+ PN FP KALADYVH L S+
Sbjct: 208 GADGQIRPN----------AKFPD-MKALADYVHARGLKFGLYSSPGPRTCAGYEGSYGH 256
Query: 120 -RIESNIAFEQGIDYLKYDNC----FNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGD 173
R ++ + G+DYLKYD C F D R Y M AL+ GRPI +S+CE+G
Sbjct: 257 VRQDAQTFADWGVDYLKYDLCSGEYFYADADTVQRSYYEMGAALRATGRPIVYSLCEYGR 316
Query: 174 MRPALWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
R WG + WRTT DI+D +A M I A PG WNDPDMLE+GNGGM
Sbjct: 317 FRVGSWGRAVGGHLWRTTGDITDDYATMARIGFEKNGNPSDAGPGEWNDPDMLEIGNGGM 376
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-------- 284
Y H ++WA+S APL++G DV +A ++ + VIA++Q Q
Sbjct: 377 SEAAYRSHMTLWAMSAAPLMMGHDVRTTSAAAKRLLEDRAVIAIDQDARGVQGKPVRKQG 436
Query: 285 ---IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
IW PL+G + + +R +T DIG+ S +
Sbjct: 437 TSEIWLKPLAGGAYAIAVFNRADAPVTLTLRPADIGLDSFTAM 479
>gi|374312326|ref|YP_005058756.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
gi|358754336|gb|AEU37726.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
Length = 395
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/339 (35%), Positives = 159/339 (46%), Gaps = 58/339 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDSQISLSCSFHMFDSLTQIAAKPLPN 85
TPPMGWNSWNHF + + +R D RD+ + D Q +P
Sbjct: 31 TPPMGWNSWNHFFQHVTDADVRAAADAMVANGMRDA----GYVYVNIDDGWQGTRRP--- 83
Query: 86 QCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN--RIESNIAFEQ------- 129
G + +N FP K LADYVH I S L + E + E
Sbjct: 84 --DGTIQSNDR-FPD-MKTLADYVHTRGLKLGIYSSPGLKTCAKFEGSYGHEMQDAKTYA 139
Query: 130 --GIDYLKYDNC-FNDD---------------TRPTIRYRAMSDALKKAGRPIFFSMCEW 171
GIDYLKYD C F D YR M DAL AGRPI FS+C++
Sbjct: 140 AWGIDYLKYDLCSFGDKLTVAGNGDQAAPAAFAAQQEAYRKMHDALVAAGRPIVFSLCQY 199
Query: 172 GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
G WG + N WRTT DISD + RM I + +A PG WNDPDMLEVGNG
Sbjct: 200 GMSNVWEWGPEVGGNLWRTTGDISDKYDRMALIGFSQAGLSRYAGPGHWNDPDMLEVGNG 259
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
GM +EY H S+WAI APLL G D+ ++ E + I+ N++ IA++Q A
Sbjct: 260 GMTLDEYRTHMSLWAILAAPLLAGNDLRHMKPEHLAILENKDAIAIDQDSLGKQGDRVSA 319
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
+ ++W P+S R + L +R + M + ++GI
Sbjct: 320 LGPYELWTKPMSHGRTAIALFNRGELAHTMRVNLRELGI 358
>gi|380692933|ref|ZP_09857792.1| alpha-galactosidase [Bacteroides faecis MAJ27]
Length = 503
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 124/413 (30%), Positives = 189/413 (45%), Gaps = 78/413 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+IV+ A +S G+ + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 99 YIVTLKAKNSQGTCTQKLEIHIGDDLLLTPPMGWNSWNTFGQHLTEELVLQTADALVANG 158
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G++ +K FP G K +ADY+H
Sbjct: 159 MRD----LGYSYINIDDFWQLPER----GADGHIQIDKAKFPRGIKYVADYLHERGFKLG 210
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 211 IYSDAADKTCGGVCGSYGYEEMDAKDFASWGVDLLKYDYCNAPAGRVEAMERYAKMGKAL 270
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ GR I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 271 RATGRSIVFSVCEWGQREPWKWAKQVGGHLWRVSGDIGDVWDREANRMGGLRGILNILEI 330
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N ++A P GWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 331 NAPLNEYAGPSGWNDPDMLVVGIGGKSMSIGSESKGCTQEQYKSHFALWCMMASPLLCGN 390
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ ++ ++++IA+NQ + +Q IW PL+ R V
Sbjct: 391 DVRNMDDSTLQVLLDKDLIAINQDVLGKQAERSIRADYYDIWVKPLADGRKAV-----AC 445
Query: 305 FSSMMTAHWDDIGIKSNNTL-------FEHRTLETKFQGKLTAKVDGHSCKMY 350
F+ M T ++ K+ L + R+ E G + K+ + CK+Y
Sbjct: 446 FNRMDTPQNIELNAKTVEGLSLEQVYSLDSRSTENAADG-MIVKLAPYQCKVY 497
>gi|1170910|sp|P41945.1|MEL2_YEAST RecName: Full=Alpha-galactosidase 2; AltName:
Full=Alpha-D-galactoside galactohydrolase 2; AltName:
Full=Melibiase 2; Flags: Precursor
gi|547462|emb|CAA85737.1| Alpha-Galactosidase [Saccharomyces cerevisiae]
Length = 471
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 165/345 (47%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYIILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA++ FP+G +AD++H + S++ + R E +
Sbjct: 78 G--RDSDGFLVADEQKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N P I RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPEISYHRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI+ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDITAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEV G + +E HF +WA+ K+PL++G DV L + I VIA+
Sbjct: 256 GGWNDLDNLEVRVGNLTDDEEKAHFPMWAMVKSPLIIGADVNTLKPSSYSIYSQASVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLL 299
NQ + Q+W+GPL +G+++V LL
Sbjct: 316 NQDPKGIPATRVWRYYVSDTDEYGQGEIQMWSGPLDNGDQVVALL 360
>gi|260909846|ref|ZP_05916538.1| alpha-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260636077|gb|EEX54075.1| alpha-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 406
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 109/288 (37%), Positives = 149/288 (51%), Gaps = 46/288 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RD---SQISLSCSFHMFDSLTQIA 79
L TP MGW+SWN F INE II+ D S RD + I++ +H
Sbjct: 26 LSQTPQMGWSSWNKFQGNINEDIIKGIADAMVSSGLRDAGYTYINIDDCWHG-------- 77
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN-----------SLDNRIE 122
G + + FPSG KALADYVH + S+ SL + +
Sbjct: 78 ----KRDVDGFIQPDPKHFPSGMKALADYVHARGLKLGIYSDAGSETCAGRPGSLGHEYQ 133
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ + + +DYLKYD C + P Y+ +SDAL AGRPIF SMCEWGD +P W
Sbjct: 134 DALQYARWEVDYLKYDWCNTTNINPRGAYQLISDALCAAGRPIFLSMCEWGDNQPWRWAR 193
Query: 182 NIRNSWRTTDDISDSW-----------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
+I +SWR DI S+ +L + N+ +A PG WNDPDMLEVGN
Sbjct: 194 DIGHSWRIGPDIWCSFDSTRVFPTYVQYSVLDCINKNDSLRRYAGPGHWNDPDMLEVGN- 252
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ N+ HF++W + +PL+LG DV N++AET I+ N ++IA+NQ
Sbjct: 253 GLTVNQDRAHFTMWCMMASPLILGNDVRNMSAETKLILTNRDLIAINQ 300
>gi|238490436|ref|XP_002376455.1| alpha-galactosidase, putative [Aspergillus flavus NRRL3357]
gi|298351525|sp|B8N7Z0.1|AGALD_ASPFN RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
gi|220696868|gb|EED53209.1| alpha-galactosidase, putative [Aspergillus flavus NRRL3357]
Length = 655
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 119/393 (30%), Positives = 169/393 (43%), Gaps = 69/393 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS----QISLSCSFHMFDSL 75
L NGL TP MGWN++NH+ C NE I+R + S ++ C + + D L
Sbjct: 24 LDNGLARTPQMGWNTYNHYSCSPNETIVRSNAQALVDLGLASLGYRYVTTDCGWTVADRL 83
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------------IS 114
+ G+L N+T FP G AL Y+H L +
Sbjct: 84 SD-----------GSLTWNETLFPEGFPALGKYLHDLDLLFGVYQDSGIKLCGSPPDNVG 132
Query: 115 NSLDNRIESNIAFEQGIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
N D + ++E +D LKYDNC++D T P R+ MS AL R
Sbjct: 133 NYEDQDARTFASWE--VDSLKYDNCYSDAATGYPNVNYEPSTSPQPRFANMSRALAAQNR 190
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+ F +CEWG PA W + +SWR +DI W + + A PG W D
Sbjct: 191 SMVFQVCEWGIDFPARWAPALGHSWRIGNDIIPHWRAIYRTLNQAVPQTSFAGPGQWPDL 250
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGN + E HFS+WAI K+PL +G D ++ E++ I+ ++I NQ
Sbjct: 251 DMLFVGNDILSIPEEQTHFSLWAILKSPLTIGAALKDDETSINDESLQILKQADIIGYNQ 310
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLSG R V L + + S +T DIG++ T+
Sbjct: 311 DSLGVSASLRRRWTEEGYEVWSGPLSGGRTVAALINWRNESRDLTLDLPDIGLQYAGTVK 370
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
T + TAKV GH + EL T
Sbjct: 371 NIWDGTTAQNVKTSYTAKVQGHGTILLELQDTT 403
>gi|298351560|sp|Q5AX28.2|AGALD_EMENI RecName: Full=Alpha-galactosidase D; AltName: Full=Melibiase D;
Flags: Precursor
gi|95025935|gb|ABF50881.1| alpha-galactosidase [Emericella nidulans]
Length = 659
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 117/404 (28%), Positives = 180/404 (44%), Gaps = 64/404 (15%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS 61
+V+++A S + L +GL TP MGWN++N ++C NE I+ E ++
Sbjct: 7 MVVAATALASPAPALKPRLDDGLARTPQMGWNTYNQYNCFPNESIVHENAQALVDTGLAD 66
Query: 62 ----QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------- 110
+++ C + + D L PN G + N FP G A+ Y+H
Sbjct: 67 LGYRYVTIDCGWGVEDRL--------PN---GTITWNPELFPQGFPAMGQYLHDLGLLFG 115
Query: 111 ------ILISNSLDN--------RIESNIAFEQGIDYLKYDNCFND------------DT 144
IL+ S N I++ E G D LKYDNC++D T
Sbjct: 116 VYGDSGILLCGSPPNITGSLYYEDIDARTFAEWGADSLKYDNCYSDAATNYPNVNYAPST 175
Query: 145 RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
P R+ MS ++ R I F +CEWG PALW I +SWR +DI W +
Sbjct: 176 SPHPRFANMSRYIQAQDRDILFQVCEWGIDFPALWAPEIGHSWRIGNDIIPHWRSIFRTL 235
Query: 205 DMNEVYADHAKPGGWNDPDMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
+ D A PG W D DML VG +G + E HFS+W+I K+PL +G + + AE
Sbjct: 236 NQAVPQTDFAGPGQWPDLDMLLVGLDGVLTVPEEQTHFSLWSILKSPLTIGAAIPGMRAE 295
Query: 264 TMPIIGNEEVIAVNQ-------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMT 310
++ I+ N +VIA NQ + ++W+GPL G R + + + + +T
Sbjct: 296 SLEILSNADVIAFNQDALGVSAALRRRWSDEGYEVWSGPLEGGRTIAAVINWRDEDREIT 355
Query: 311 AHWDDIGIKSNNTLFEHRTLET--KFQGKLTAKVDGHSCKMYEL 352
DIG++ TL ET + ++ V+ H + +L
Sbjct: 356 LDLPDIGLQYAETLQNVWADETVNGVKTSYSSVVEAHGVMLVQL 399
>gi|116619835|ref|YP_821991.1| alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
gi|116222997|gb|ABJ81706.1| Alpha-galactosidase [Candidatus Solibacter usitatus Ellin6076]
Length = 499
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 168/370 (45%), Gaps = 51/370 (13%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAA 80
NGL TPPMGWNSWN F +I++ +R D ++ RD+ ++
Sbjct: 141 NGLARTPPMGWNSWNKFANRIDDATVRGIADAMVDNGMRDAGYVYVNIDDTWEGTRDTHG 200
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN--RIESNIAFEQ-- 129
PNQ FP KALADYVH I S D E + E
Sbjct: 201 NIHPNQ----------KFPD-MKALADYVHSKGMKVGIYSSPGPDTCEGYEGSYGHEAQD 249
Query: 130 -------GIDYLKYDNC----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
GIDYLKYD C D+ Y+ M DAL +GR I FS+C++G +
Sbjct: 250 ARTYAAWGIDYLKYDWCGAFTIYKDSEMQAVYQKMGDALLASGRDIVFSLCQYGLVDVWK 309
Query: 179 WGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
W + N WRT DISDSW + I + +A +A PG WNDPDMLE+GNG M EY
Sbjct: 310 WAPAVGGNLWRTAGDISDSWESVSRIGFSQDRFAPYASPGHWNDPDMLEIGNGHMTDTEY 369
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIW 286
H S+W++ APL+ G DV ++ A I+ N +VIA++Q Q++W
Sbjct: 370 RTHMSLWSMLAAPLIAGNDVRDMRASIRDILTNRDVIAIDQDKAGHQAKRVWKSGQQEVW 429
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----LFEHRTLETKFQGKLTAKV 342
L+G V + +R ++ + W D G+ + L+ H G T ++
Sbjct: 430 TRELAGGDTAVAVFNRAPGNARVAFRWADAGLLKTPSCLRDLWSHTERRATGPGYST-EI 488
Query: 343 DGHSCKMYEL 352
GH + +
Sbjct: 489 PGHGVALLRV 498
>gi|443898456|dbj|GAC75791.1| alpha-d-galactosidase [Pseudozyma antarctica T-34]
Length = 473
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 161/358 (44%), Gaps = 70/358 (19%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDS 61
+NGL LTP MGWN+WN F C I+E I DDCW RD
Sbjct: 22 NNGLALTPQMGWNTWNTFACNISEDTILSAAKSIKSEGLDKLGYNYVIIDDCWQADQRDP 81
Query: 62 QI-SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
+ + F + KP+ ++ + + +GT ++ L +D +
Sbjct: 82 DTKEIPANPDKFPN----GLKPIVDEIKALGLKAGIYSSAGTMTCGHHIGSLGYEDVDAK 137
Query: 121 IESNIAFEQGIDYLKYDNCFNDDTR--PTI---RYRAMSDALKKA-GRPIFFSMCEWGDM 174
SN FE YLKYDNCFN + P I RY AMS AL K G PI +SMC WG+
Sbjct: 138 SWSNAGFE----YLKYDNCFNQGQQGTPKISFDRYNAMSQALNKTEGDPILYSMCNWGED 193
Query: 175 RPALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADH 213
P L+ I NSWR + DI S+ R + I D
Sbjct: 194 WPWLFAQEIANSWRISGDIYPSFNRDDDRCPCTDITTCNLQGFHCSVAKILDFAASLGQK 253
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A PG WND DMLEVGN G+ +E +VHFS+WA+ K+PL+LG D+ +T +T II N+ V
Sbjct: 254 AYPGAWNDLDMLEVGNRGLSLDESLVHFSMWAMVKSPLILGNDLTKMTNQTRAIIKNKHV 313
Query: 274 IAVNQ---------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
I ++Q + + Q+W LS N V + + M+ DD+
Sbjct: 314 IDISQDPKGSPATRMWKKQESDGNTQLWKIGLSNNTYAVAVINVSESPKNMSISLDDV 371
>gi|323450097|gb|EGB05980.1| hypothetical protein AURANDRAFT_29988 [Aureococcus anophagefferens]
Length = 418
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 115/380 (30%), Positives = 162/380 (42%), Gaps = 77/380 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL LTPP+GWNSWN F C +++ +IR D + L + + D Q A+
Sbjct: 18 NGLALTPPLGWNSWNQFQCNVSDALIRRQADA---MVSLGLVDLGYQYVVIDDCWQADAR 74
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ------------ 129
G L + T FP+G AL+ YV S L I S++ +
Sbjct: 75 ----DVSGRLAPDATRFPAGIAALSAYVR---SRGLKLGIYSDVGTKTCAGYPGSFGHYD 127
Query: 130 ---------GIDYLKYDNCF-----NDDTRPTIRYRA--MSDALKKAGRPIFFSMCEWGD 173
GIDYLK+D C D RP R MS AL GRP+ +SMC WG
Sbjct: 128 LDARTFADWGIDYLKFDTCSLTWKETLDPRPFYARRGANMSAALNATGRPVLYSMCNWGR 187
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
P LW I N WRTT D+ W R+ +I D + + GG+ND DM+EVG
Sbjct: 188 HDPWLWAPEIANMWRTTMDVWPQWHRVASILDSMAGLSAYGGRGGFNDADMVEVGVDSRI 247
Query: 234 YN------------EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
+N E HF++WAI APL+LG D+ + ++ + V+AVNQ
Sbjct: 248 FNWAGMPETNLTEREAAAHFTMWAIMGAPLVLGLDLEAAERWALDVVSHAGVLAVNQDPL 307
Query: 279 ---------------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
Q+WA LSG R+ V + S ++H+ D
Sbjct: 308 AYPGRRVTSDSDAVLGVCLKSRCASTQVWAKDLSGGRVAVAFLNAGDAYSQYSSHYGDEA 367
Query: 318 IKSNNTLFEHRTLETKFQGK 337
I + F+ + +F+ +
Sbjct: 368 IAVD---FDELHIAGRFRAR 384
>gi|317145371|ref|XP_001820730.2| alpha-galactosidase D [Aspergillus oryzae RIB40]
gi|298351558|sp|Q2UI87.2|AGALD_ASPOR RecName: Full=Probable alpha-galactosidase D; AltName:
Full=Melibiase D; Flags: Precursor
Length = 657
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/393 (30%), Positives = 169/393 (43%), Gaps = 67/393 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS----QISLSCSFHMFDSL 75
L NGL TP MGWN++NH+ C NE I+R + S ++ C + + D L
Sbjct: 24 LDNGLARTPQMGWNTYNHYSCSPNETIVRSNAQALVDLGLASLGYRYVTTDCGWTVADRL 83
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIE 122
+ G+L N+T FP G AL Y+H I + S + +
Sbjct: 84 SD-----------GSLTWNETLFPKGFPALGKYLHDLDLLFGVYQDSGIKLCGSPPDNVG 132
Query: 123 SNIAFEQG--------IDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S +Q +D LKYDNC++D T P R+ MS AL R
Sbjct: 133 SLYHEDQDARTFASWEVDSLKYDNCYSDAATGYPNVNYEPSTSPQPRFANMSRALAAQNR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+ F +CEWG PA W + +SWR +DI W + + A PG W D
Sbjct: 193 SMVFQVCEWGIDFPARWAPALGHSWRIGNDIIPHWRAIYRTLNQAVPQTSFAGPGQWPDL 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGN + E HFS+WAI K+PL +G D ++ E++ I+ EEVI NQ
Sbjct: 253 DMLFVGNDILSIPEEQTHFSLWAILKSPLTIGAALKDDNTSINDESLQILKQEEVIGYNQ 312
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLSG R V L + K + +T DIG++ T+
Sbjct: 313 DSLGVSASLRRRWTEEGYEVWSGPLSGGRTVAALINWKDEARELTLDLPDIGLQFAGTVK 372
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
T + TAKV H + EL T
Sbjct: 373 NIWDGTTAQNVKTSYTAKVQSHGTILLELQDTT 405
>gi|328868802|gb|EGG17180.1| hypothetical protein DFA_08164 [Dictyostelium fasciculatum]
Length = 399
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/282 (35%), Positives = 142/282 (50%), Gaps = 30/282 (10%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NG+ TP MGW++WN + C INE +I + + +S ++ + D +
Sbjct: 28 LNNGVAPTPGMGWSTWNRYGCNINESVIIDNANAMIKSGL---VNYGYEYINIDDCWAVG 84
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNSLDNRIESNIAFEQ- 129
N+ L+A+ FP+G K +ADY+H + S + + FEQ
Sbjct: 85 R----NESTNELIADPVAFPNGIKYIADYLHSVGLKMGIYTDASPITWQKRPGSFGFEQL 140
Query: 130 --------GIDYLKYD----NCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
GIDY+K D + +N P Y MSD L GR I+F++CEWG
Sbjct: 141 DADTFASWGIDYVKEDWMKQSFYNWTGDPIALYTKMSDCLNATGRDIYFAVCEWGIDNVQ 200
Query: 178 LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG-GWNDPDMLEVGNGGMKYNE 236
W I NSWRT DI D W R++ I D+ + + G GWND D LEVGNG M E
Sbjct: 201 TWAPAIANSWRTAPDIKDEWTRIMEILDLQLPLTNASGVGSGWNDMDSLEVGNGNMSTIE 260
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
Y HFSIW + +PL++G D+ N+ +T+ I+ E+I VNQ
Sbjct: 261 YTTHFSIWCMMASPLIMGNDLTNMDDDTLAILSATEMIEVNQ 302
>gi|170100256|ref|XP_001881346.1| glycoside hydrolase family 27 protein [Laccaria bicolor S238N-H82]
gi|164644025|gb|EDR08276.1| glycoside hydrolase family 27 protein [Laccaria bicolor S238N-H82]
Length = 438
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/304 (34%), Positives = 150/304 (49%), Gaps = 73/304 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NG+G P MG+N+WN +HC I+E II ET DDC+ E +R +
Sbjct: 29 NGVGRLPFMGYNTWNAYHCDIDESIILETARLMKALGLSDAGYDHVNIDDCYSERNRSAS 88
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ISNS 116
G++VANK TFPSG +L + +H L S+S
Sbjct: 89 --------------------------GDIVANKQTFPSGMNSLTNKIHKLGLKAGIYSDS 122
Query: 117 ------------LDNRIESNIAFEQGIDYLKYDNC---FNDDTRPTI--RYRAMSDALKK 159
L+ + + G D LKYDNC F++ + I +Y MS+A+ K
Sbjct: 123 GWFTCQLYPGSYLNEERDIKTFQDWGFDLLKYDNCAVPFDEVIKEGIVGKYTRMSNAIAK 182
Query: 160 AGR-----PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
+ PI FS+CEWG P LW + SWRTT DI +W + +I + N Y+ +
Sbjct: 183 LSKESGKPPILFSLCEWGREEPWLWARRLGQSWRTTGDIEPNWNAIASILNENSFYSWAS 242
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
G ND D+LEVGNG + + E HF+ WA K+PLL+G ++ +T +T+ I+ N E+I
Sbjct: 243 DFYGHNDMDILEVGNGDLTFEESKSHFTAWAFMKSPLLVGTELSKVTQQTLDILKNHEII 302
Query: 275 AVNQ 278
A+NQ
Sbjct: 303 AINQ 306
>gi|83768591|dbj|BAE58728.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 420
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/393 (30%), Positives = 169/393 (43%), Gaps = 67/393 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS----QISLSCSFHMFDSL 75
L NGL TP MGWN++NH+ C NE I+R + S ++ C + + D L
Sbjct: 24 LDNGLARTPQMGWNTYNHYSCSPNETIVRSNAQALVDLGLASLGYRYVTTDCGWTVADRL 83
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIE 122
+ G+L N+T FP G AL Y+H I + S + +
Sbjct: 84 SD-----------GSLTWNETLFPKGFPALGKYLHDLDLLFGVYQDSGIKLCGSPPDNVG 132
Query: 123 SNIAFEQG--------IDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
S +Q +D LKYDNC++D T P R+ MS AL R
Sbjct: 133 SLYHEDQDARTFASWEVDSLKYDNCYSDAATGYPNVNYEPSTSPQPRFANMSRALAAQNR 192
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+ F +CEWG PA W + +SWR +DI W + + A PG W D
Sbjct: 193 SMVFQVCEWGIDFPARWAPALGHSWRIGNDIIPHWRAIYRTLNQAVPQTSFAGPGQWPDL 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ 278
DML VGN + E HFS+WAI K+PL +G D ++ E++ I+ EEVI NQ
Sbjct: 253 DMLFVGNDILSIPEEQTHFSLWAILKSPLTIGAALKDDNTSINDESLQILKQEEVIGYNQ 312
Query: 279 -------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL- 324
++W+GPLSG R V L + K + +T DIG++ T+
Sbjct: 313 DSLGVSASLRRRWTEEGYEVWSGPLSGGRTVAALINWKDEARELTLDLPDIGLQFAGTVK 372
Query: 325 -FEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
T + TAKV H + EL T
Sbjct: 373 NIWDGTTAQNVKTSYTAKVQSHGTILLELQDTT 405
>gi|159124808|gb|EDP49926.1| alpha-galactosidase [Aspergillus fumigatus A1163]
Length = 406
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 177/384 (46%), Gaps = 81/384 (21%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG TP MGWNSWN F QIN +I T + + L + + L +
Sbjct: 26 LGQTPQMGWNSWNTFKSQINSSVIENTVQLF------EHLGLKDVGYEYILLDE-GWSDY 78
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-------------ILI------SNSLDNRIESN 124
G L N T+FP+G K L D +H IL S S + R
Sbjct: 79 SRTADGYLQPNLTSFPNGIKPLIDDIHAKGLKIGLYGDSGILTCGFRPGSWSYEERDAQT 138
Query: 125 IAFEQGIDYLKYDNC--FNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+A G+DY KYDNC F T P +R+ M AL+ +GR IF+S+CEWG P WG
Sbjct: 139 LA-RWGVDYWKYDNCGGFQAMTEPPQVRFGIMQKALELSGRQIFYSVCEWGYQFPWHWGG 197
Query: 182 NIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPGGWN 220
I +S+R + DI+ + + I M E+ + + PG W
Sbjct: 198 KIGHSYRMSGDITAKFTNETGCACKTAYCLNTGYAGCSVLSIIRKMREI-SQYQTPGHWL 256
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
D DMLE+GNG M + HF+ WA K+PL++G D+ L+ E++ ++ N+++IAVNQ
Sbjct: 257 DMDMLEIGNGEMTLYQQQTHFAFWAALKSPLIIGADLSKLSDESVAVLKNKDIIAVNQDS 316
Query: 281 NDQ--------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI-----GIKSN 321
Q Q+WAGP++G VVLL + K++ ++ + D+ G
Sbjct: 317 LGQAVHYIESASKEGAWQVWAGPVNGG-FVVLLLNEKSYPQALSVSFADLRLGLDGPVQV 375
Query: 322 NTLFEHRTLETKFQGKLTAKVDGH 345
L+ H++L KVDG+
Sbjct: 376 TELWSHKSL---------GKVDGY 390
>gi|350639314|gb|EHA27668.1| hypothetical protein ASPNIDRAFT_41606 [Aspergillus niger ATCC 1015]
Length = 668
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/403 (31%), Positives = 177/403 (43%), Gaps = 75/403 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L +GL LTP MGWN++NH+ C NE I+ +S+ + + L S + +T
Sbjct: 24 LQDGLALTPQMGWNTYNHYSCSPNETIV--------QSNAQALVDLGLSSLGYRYVTTDC 75
Query: 78 --IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNS------ 116
A LP+ G+L N T FP G A+ D++H IL+ S
Sbjct: 76 GWTVADRLPD---GSLTWNDTLFPQGFPAMGDFLHDLGLLFGVYQDSGILLCGSPPNETG 132
Query: 117 -------LDNRIESNIA---FEQGIDYLKYDNCFND------------DTRPTIRYRAMS 154
LD E A +D LKYDNC++D T P R+ MS
Sbjct: 133 SLCMPTILDRHHEEQDARTFASWNVDSLKYDNCYSDAATGYPNVNYAPSTSPEPRFANMS 192
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
AL + R I F +CEWG PA W + +SWR +DI W + I + D A
Sbjct: 193 HALLQQNRTILFQICEWGISFPANWAPALGHSWRIGNDIIPDWRTIFRIINQAAPQTDVA 252
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN-LTA---ETMPIIGN 270
PG W D DMLEVGN E HFS+WAI K+PL++G + + LTA ++ ++
Sbjct: 253 GPGQWPDLDMLEVGNNIFSLPEEQTHFSLWAILKSPLIIGAALKDELTAINDASLAVLKQ 312
Query: 271 EEVIAVNQ-------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
++VIA NQ ++W+GPLS R V + + S +T D+G
Sbjct: 313 KDVIAFNQDALGKSASLRRRWTEEGYEVWSGPLSNGRTVAAVINWHNESKDLTLDLPDVG 372
Query: 318 IKSNNTL--FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
++ T+ T TA V GH + EL T +
Sbjct: 373 LQHAGTVKNIWDGTTARDVLTSYTATVAGHGTMLLELQNTTAV 415
>gi|424793912|ref|ZP_18219964.1| glycoside hydrolase, family 27 transmembrane protein [Xanthomonas
translucens pv. graminis ART-Xtg29]
gi|422796309|gb|EKU24837.1| glycoside hydrolase, family 27 transmembrane protein [Xanthomonas
translucens pv. graminis ART-Xtg29]
Length = 527
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 176/378 (46%), Gaps = 60/378 (15%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCW-GESS 58
L S+GL TPP+GW+SWN F I++ +R DD W G+
Sbjct: 162 LPSDGLARTPPLGWSSWNRFAEHIDDATVRRIADAMVASGLRDAGYVYVNIDDGWQGQRD 221
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLD 118
RD + + F +L + G + G K A Y D
Sbjct: 222 RDGVLQPNARFPDMRALA--------DYVHGKGLKLGLYSSPGPKTCAGYTGSYGHVEQD 273
Query: 119 NRIESNIAFEQGIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
R + G+DY+KYD C F + + Y M AL+ GRPI +S+C++G
Sbjct: 274 ARTWAGW----GVDYVKYDLCSGEGIFREPQQVRRAYLQMGQALRATGRPIVYSLCQYGR 329
Query: 174 MRPALWGSNIRNS-WRTTDDISDSWARMLTIA-DMNEVYADHAKPGGWNDPDMLEVGNGG 231
WG + WRTT DI DS+A+M +I D N AD A PGGWNDPDMLEVGNGG
Sbjct: 330 DHVGQWGREVGGHLWRTTGDIEDSYAKMASIGFDRNGDPAD-AGPGGWNDPDMLEVGNGG 388
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ------- 283
M +EY H ++WA+S APLLLG D+ + T+ ++ N +V+A++Q A+ Q
Sbjct: 389 MSVDEYRTHLALWALSAAPLLLGNDLRQM---TLALLRNRDVLAIDQDALGVQGKAVRKD 445
Query: 284 ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL---ETKFQGK 337
+IW PL+ + + + +R T + +D G + R L ++ G+
Sbjct: 446 GAIEIWRKPLADGGVALGVFNRDTQPHRVALSAEDAGTALRGRRW--RDLWRGGSRPAGE 503
Query: 338 L-TAKVDGHSCKMYELTP 354
L + +V H + L+P
Sbjct: 504 LRSVQVAAHGVVLLRLSP 521
>gi|448372356|ref|ZP_21557198.1| Alpha-galactosidase [Natrialba aegyptia DSM 13077]
gi|445646142|gb|ELY99132.1| Alpha-galactosidase [Natrialba aegyptia DSM 13077]
Length = 384
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/342 (34%), Positives = 167/342 (48%), Gaps = 58/342 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNSWN F+C I+ +I++T D E + + + D + + +
Sbjct: 21 TPPMGWNSWNTFYCDIDAGLIKDTADAMVE---NGMKEVGYEYICIDDCWMASERDV--- 74
Query: 87 CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----IESNIAFEQ-------- 129
GNL + TFP+G A+ADYVH + I S + ++ +E+
Sbjct: 75 -NGNLQPDPETFPNGIDAVADYVHKRGLKLGIYQSAGTTTCEGLPGSLGYEEKDAQSFAD 133
Query: 130 -GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI-RNS 186
G+DYLKYDNC I RY M AL+ R I S+CEWGD P +W N
Sbjct: 134 WGVDYLKYDNCGEHYGLSAIERYTRMHKALEATDRDILLSICEWGDNDPWMWAPEAGGNL 193
Query: 187 WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY---- 234
WRTT DI W+ ++ I D NE A++A PG WNDPDML VG +Y
Sbjct: 194 WRTTGDIKPLWSAKENLWGNGIIDIIDQNEPLAEYAGPGHWNDPDMLVVGVDLPEYPNLT 253
Query: 235 -NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV------NQAINDQQ--- 284
E HF +WA+ APL+ G D+ N++ ET I+ N E+I + NQAI Q+
Sbjct: 254 EAEDRTHFGMWAMMAAPLIAGNDIRNMSEETRNILTNYELIEINQDPAGNQAIRIQRTTG 313
Query: 285 -------IWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGI 318
+WA L+ G+R V LL +R + +T ++G+
Sbjct: 314 QDGLSRSVWAKKLANGDRAVGLL-NRSDRRTTITTSAQEVGL 354
>gi|190348345|gb|EDK40785.2| hypothetical protein PGUG_04883 [Meyerozyma guilliermondii ATCC
6260]
Length = 431
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 110/341 (32%), Positives = 165/341 (48%), Gaps = 54/341 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
NGLG P MGWNSWN F C INE +I T + + ++ L + + +
Sbjct: 17 FDNGLGKKPQMGWNSWNKFGCDINETVILST------AQKMKELGLLEYGYEYIVMDDCY 70
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRI 121
A + +V + FP+G L++ +H L SL I
Sbjct: 71 ALKQRDPVTHKIVEDPDKFPNGILDLSNKIHDLGFKFGMYSSAGKYTCAGYPGSLHYEEI 130
Query: 122 ESN-IAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMR 175
+++ A + IDYLKYDNCFN+ T RY MS AL GRPIF+S+C+WG+
Sbjct: 131 DADTFANDWEIDYLKYDNCFNEGNSGTAKISYERYNNMSQALLNTGRPIFYSLCQWGEDH 190
Query: 176 PALWGSNIRNSWRTTDDISDSWARM------------------LTIADMNE----VYADH 213
WGS + NSWR + DI D++ R +I ++ E +
Sbjct: 191 VWDWGSTLANSWRISGDIYDNFDRYDDRCPCETYECPGLQGYSCSINNILEKAVPLGQKA 250
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
++ GWND D LEVGNGGM +EY HF++WAI K+PL+LG DV ++ E I+ N+ +
Sbjct: 251 SEFSGWNDLDSLEVGNGGMTTSEYKAHFTLWAILKSPLVLGNDVTGMSDEDFAIVTNKAI 310
Query: 274 IAVNQ--AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
IA+NQ + ++W G + + S ++++T +
Sbjct: 311 IAINQDGSRPANRMWKRKTEGGYLHLFTNSLNNGTTVVTLY 351
>gi|51701635|sp|Q09187.1|MEL_SACPA RecName: Full=Alpha-galactosidase; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=MELp; AltName: Full=Melibiase; Flags: Precursor
gi|1246850|emb|CAA64759.1| alpha-galactosidase MEL [Saccharomyces paradoxus]
Length = 471
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 165/345 (47%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW++WN F C ++E ++ T D R S I L + + L +
Sbjct: 24 NGLGLTPQMGWDNWNTFACNVSEDLLLNTAD------RISDIGLKDLGYKYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLD----------NRIESNI 125
G LVA++ FP+G +AD +H + S++ + R E +
Sbjct: 78 GRDED--GFLVADEQKFPNGMGHVADRLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 126 AF--EQGIDYLKYDNCFNDDTRPT-----IRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGRFGTPESSHKRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS+I NSWR + DI+ ++R ++ I + +
Sbjct: 196 WGSDIANSWRMSGDITAEFSRPDSRCPCDGDEYDCKYAGYHCSIMNILNKAAPMGQNGGI 255
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E HFS+WA+ K+PL++G DV +L A + I VIA+
Sbjct: 256 GGWNDLDNLEVGVGNLTDDEEKTHFSMWAMVKSPLIIGADVNHLKASSYSIYSQSSVIAI 315
Query: 277 NQ---------------------AINDQQIWAGPL-SGNRIVVLL 299
NQ + Q W+GPL +G++++ LL
Sbjct: 316 NQDPKGIPATRVWRYYVPQTDKYGQGEIQFWSGPLENGDQVIALL 360
>gi|146413915|ref|XP_001482928.1| hypothetical protein PGUG_04883 [Meyerozyma guilliermondii ATCC
6260]
Length = 431
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 149/305 (48%), Gaps = 52/305 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
NGLG P MGWNSWN F C INE +I T + + ++ L + + +
Sbjct: 17 FDNGLGKKPQMGWNSWNKFGCDINETVILST------AQKMKELGLLEYGYEYIVMDDCY 70
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRI 121
A + +V + FP+G L++ +H L SL I
Sbjct: 71 ALKQRDPVTHKIVEDPDKFPNGILDLSNKIHDLGFKFGMYSSAGKYTCAGYPGSLHYEEI 130
Query: 122 ESN-IAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMR 175
+++ A + IDYLKYDNCFN+ T RY MS AL GRPIF+S+C+WG+
Sbjct: 131 DADTFANDWEIDYLKYDNCFNEGNSGTAKISYERYNNMSQALLNTGRPIFYSLCQWGEDH 190
Query: 176 PALWGSNIRNSWRTTDDISDSWARM------------------LTIADMNEVYADHAKPG 217
WGS + NSWR + DI D++ R +I ++ E +
Sbjct: 191 VWDWGSTLANSWRISGDIYDNFDRYDDRCPCETYECPGLQGYSCSINNILEKAVPLGQKA 250
Query: 218 ----GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
GWND D LEVGNGGM +EY HF++WAI K+PL+LG DV ++ E I+ N+ +
Sbjct: 251 SEFLGWNDLDSLEVGNGGMTTSEYKAHFTLWAILKSPLVLGNDVTGMSDEDFAIVTNKAI 310
Query: 274 IAVNQ 278
IA+NQ
Sbjct: 311 IAINQ 315
>gi|53715671|ref|YP_101663.1| alpha-galactosidase [Bacteroides fragilis YCH46]
gi|423259902|ref|ZP_17240825.1| hypothetical protein HMPREF1055_03102 [Bacteroides fragilis
CL07T00C01]
gi|423267557|ref|ZP_17246538.1| hypothetical protein HMPREF1056_04225 [Bacteroides fragilis
CL07T12C05]
gi|52218536|dbj|BAD51129.1| alpha-galactosidase precursor [Bacteroides fragilis YCH46]
gi|387775547|gb|EIK37653.1| hypothetical protein HMPREF1055_03102 [Bacteroides fragilis
CL07T00C01]
gi|392696400|gb|EIY89594.1| hypothetical protein HMPREF1056_04225 [Bacteroides fragilis
CL07T12C05]
Length = 496
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 173/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGTDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L +F D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAFINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|409079186|gb|EKM79548.1| hypothetical protein AGABI1DRAFT_106995 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 562
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 177/405 (43%), Gaps = 81/405 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L++GL TPPMGWN +N F C E+ R + + + LS + F +L
Sbjct: 20 LNDGLARTPPMGWNPYNAFLCSTTEQQYRTA------AQKLIDLGLSEVGYQFVNL-DCG 72
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------------ISNS 116
+ G + T FPSG AL+ ++H L S
Sbjct: 73 WQGKARNASGGFTWDTTAFPSGIPALSTFIHNLGLKFGVYSDGGFFSCDFVGGTAHYIGS 132
Query: 117 LDNRIESNIAFEQ-GIDYLKYDNCFNDD---TRPTIRYRA-------------------- 152
L + +F G DYLKYDNC++ T P ++R+
Sbjct: 133 LGHETSDANSFAAWGADYLKYDNCYSGSVSYTPPYFKFRSPLHAVSPTDFVNFNPPIQIE 192
Query: 153 -----MSDALKKAGRPIFFSMCEWGDMRPALW-GSNIRNSWRTTDDISD--SWARMLTIA 204
M DAL RP+ FS+CEWG PA W S + +SWR ++DI SW + I
Sbjct: 193 PHYVTMRDALAATNRPVIFSICEWGVQDPARWPASAVGHSWRISNDIGPPASWDNLFRII 252
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
+ A PGGWND DMLEVGN G+ E HF+ WA K+PLL+ D+ + + +
Sbjct: 253 NQVVPLTQFAGPGGWNDLDMLEVGNSGLTTVEQQTHFAFWAAVKSPLLISTDLTRIASNS 312
Query: 265 MPIIGNEEVIAVNQ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
+ I+ N +IA+NQ ND IW+GPL+ V ++ + + +T +
Sbjct: 313 LNILKNNRIIALNQDRLGKSISFKRRYTNDHDIWSGPLADGAFVAVIINWQNSRRSLTFN 372
Query: 313 WDDIGIKSNNTLFEHRTLETKFQGKL----TAKVDGHSCKMYELT 353
D+G+ S + + F G++ TA ++ H + +L+
Sbjct: 373 LADVGLSSADAT---DLISGSFLGRINQSHTAVLEPHGVMVLKLS 414
>gi|302697919|ref|XP_003038638.1| glycoside hydrolase family 27 protein [Schizophyllum commune H4-8]
gi|300112335|gb|EFJ03736.1| glycoside hydrolase family 27 protein [Schizophyllum commune H4-8]
Length = 488
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/289 (34%), Positives = 148/289 (51%), Gaps = 38/289 (13%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NG+G P MG+N+WN + C INE +IRE S D +SL ++ +
Sbjct: 77 LDNGVGRLPFMGYNTWNAYVCDINETVIRE--------SADLLVSLGLKDLGYNYMNIDD 128
Query: 80 AKPLPNQCQ-GNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRI 121
N+ + G++V + FPSG + L DY+H L S N
Sbjct: 129 CYAEKNRTEDGDIVESAERFPSGMRNLTDYIHDLGLKTGIYSDSGWFTCQLYPGSFQNEE 188
Query: 122 ESNIAFEQ--GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL----KKAGR-PIFFSMC 169
F + D LKYDNC F++ R + ++ M+DA+ K +G+ PI FS+C
Sbjct: 189 RDIKLFRETWNFDLLKYDNCAVPFDEIIREGMVGKFTRMADAIAAQAKSSGKDPILFSLC 248
Query: 170 EWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
+WG +P LW + SWRTTDDI W + +I + N Y + G D D+LEVGN
Sbjct: 249 QWGREQPWLWAKRLGQSWRTTDDIGPHWDAISSIINQNSFYTWSSDFYGHGDMDILEVGN 308
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G + + E HF+ WA+ K+PLL+G D+ +T E++ I+ N E+I +NQ
Sbjct: 309 GDLTFEEAKSHFTAWALMKSPLLIGTDLSKITKESLTILTNTEIIGINQ 357
>gi|336383008|gb|EGO24158.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 562
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 157/368 (42%), Gaps = 75/368 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQI-SLSCSFHMFDSLTQI 78
L+N L TP MGWN +N F C E+ + D+ + SL + FD Q
Sbjct: 29 LNNDLERTPAMGWNPYNAFLCSTTEEQYLTA----ASNLIDTGLKSLGYEYFNFDCGWQG 84
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHILIS-----------------------N 115
+ G + N+T PSG AL +VH L
Sbjct: 85 TNR----TASGTITWNETRIPSGIPALGQHVHGLGFKFGVYSDAGYYSCDFVNGQAHWIG 140
Query: 116 SLDNRIESNIAFEQ-GIDYLKYDNCFN------DDTRPTIR------------------- 149
SL + F G DYLKYDNC++ D P IR
Sbjct: 141 SLGYELSDAKTFTSWGADYLKYDNCYSVSPTDFVDDYPPIRLEVIVFSFPSSNNADSSKS 200
Query: 150 ---YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDIS--DSWARMLTIA 204
Y AM DAL GRP+ FSMCEWG PA W + NSWR +DI SW + I
Sbjct: 201 KPHYTAMRDALAGTGRPVAFSMCEWGVQDPARWAPAVGNSWRIANDIGPPPSWDNVFRII 260
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
+ A PG WND D+LEVGN G+ E HF+ WA +K+PLL+ D+ + T
Sbjct: 261 NQVVPITGFAGPGAWNDLDLLEVGNEGLSVAEQASHFAFWAAAKSPLLVSTDLTSPAEST 320
Query: 265 MPIIGNEEVIAVNQ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
+ I+ N +IA+NQ ND +WAGPL+ V +L + + S +T
Sbjct: 321 LTILKNTRIIALNQDPLGTSITFKRRYTNDYDVWAGPLADGSTVAVLLNWQNSSRSLTFD 380
Query: 313 WDDIGIKS 320
D+G +S
Sbjct: 381 LSDVGFES 388
>gi|423280844|ref|ZP_17259756.1| hypothetical protein HMPREF1203_03973 [Bacteroides fragilis HMW
610]
gi|404583647|gb|EKA88323.1| hypothetical protein HMPREF1203_03973 [Bacteroides fragilis HMW
610]
Length = 496
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 171/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLL---SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ GSD R LL + L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVVLKAENALGSDTRELLIKIGDELLLTPPMGWNSWNTFGRHLTEELVLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L ++ D Q+ + G++ +KT FP G + +ADY+H
Sbjct: 154 MRD----LGYAYINIDDFWQLPER----GADGHIQVDKTKFPRGIRYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEETDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGKAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ GR + FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATGRSMVFSVCEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDQKGGLRGVLNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N ++A P GWNDPDML VG +GG +Y HF++W + +PLL G
Sbjct: 326 NAPLNEYAGPAGWNDPDMLVVGIDGKSKSIGYESGGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + IW PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDIWVKPLSDGSKAIACL 441
>gi|423268613|ref|ZP_17247585.1| hypothetical protein HMPREF1079_00667 [Bacteroides fragilis
CL05T00C42]
gi|423273827|ref|ZP_17252774.1| hypothetical protein HMPREF1080_01427 [Bacteroides fragilis
CL05T12C13]
gi|392703897|gb|EIY97038.1| hypothetical protein HMPREF1079_00667 [Bacteroides fragilis
CL05T00C42]
gi|392707260|gb|EIZ00379.1| hypothetical protein HMPREF1080_01427 [Bacteroides fragilis
CL05T12C13]
Length = 500
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 186/411 (45%), Gaps = 68/411 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMYEL 352
+ + + I S ++ H T +L K+ + CK+Y L
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVYIL 496
>gi|383778001|ref|YP_005462567.1| putative alpha-galactosidase [Actinoplanes missouriensis 431]
gi|381371233|dbj|BAL88051.1| putative alpha-galactosidase [Actinoplanes missouriensis 431]
Length = 537
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 145/320 (45%), Gaps = 90/320 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRD 60
L NG+ TPPMGWNSWN F C INE +IR DDCW +RD
Sbjct: 27 LDNGVARTPPMGWNSWNTFGCNINEGLIRGMADAMVSSGMRDLGYQYVVVDDCWFNPNRD 86
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S GNL + T FPSG KAL DY+H
Sbjct: 87 SA--------------------------GNLQGDPTRFPSGMKALGDYLHGKGLKFGIYQ 120
Query: 113 ----------------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAM 153
+ SL + + F G+DYLKYD C T + M
Sbjct: 121 GPLDKTCAQYFNSYPGATGSLGHEAQDARQFAAWGVDYLKYDWCSPTGTINDQVATFAKM 180
Query: 154 SDALKKAGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT----- 202
DAL GRPI +S+ + G R WG ++ N WRTT+DISD+W T
Sbjct: 181 RDALAATGRPIVYSINPNSIHAKTGPQRN--WG-DVANMWRTTEDISDAWDTGQTNGYPM 237
Query: 203 ----IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D+ A +A+PGG+NDPDM+EVG GGM E HF++WA+ +PL+ G D+
Sbjct: 238 GVKNIVDVTVPLAGYARPGGFNDPDMMEVGRGGMTDTEQRSHFALWAMMASPLIAGNDLR 297
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
+++ T I+ N +IA+NQ
Sbjct: 298 SMSTATQTILKNPRLIAINQ 317
>gi|313149327|ref|ZP_07811520.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
gi|313138094|gb|EFR55454.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
Length = 453
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 174/357 (48%), Gaps = 66/357 (18%)
Query: 1 LFIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
++ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 50 MYKVMLKAENALGTDIQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVEN 109
Query: 58 S-RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----- 111
RD L ++ D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 110 GMRD----LGYAYINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKL 161
Query: 112 -LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDA 156
+ S++ D + + +E+ G+D LKYD C R RY M A
Sbjct: 162 GIYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRA 221
Query: 157 LKKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIAD 205
L+ R I FS+CEWG P W + WR + DI D W R +L I +
Sbjct: 222 LRATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILE 281
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLG 254
+N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 282 INAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCG 341
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + +W PLS G++ + L
Sbjct: 342 NDVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 398
>gi|60683610|ref|YP_213754.1| alpha-galactosidase [Bacteroides fragilis NCTC 9343]
gi|265767384|ref|ZP_06095050.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
gi|336411589|ref|ZP_08592053.1| hypothetical protein HMPREF1018_04071 [Bacteroides sp. 2_1_56FAA]
gi|60495044|emb|CAH09862.1| putative alpha-galactosidase/melibiase [Bacteroides fragilis NCTC
9343]
gi|263252689|gb|EEZ24201.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
gi|335941385|gb|EGN03242.1| hypothetical protein HMPREF1018_04071 [Bacteroides sp. 2_1_56FAA]
Length = 496
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 173/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGTDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L ++ D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAYINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|70990042|ref|XP_749870.1| alpha-galactosidase [Aspergillus fumigatus Af293]
gi|66847502|gb|EAL87832.1| alpha-galactosidase [Aspergillus fumigatus Af293]
Length = 406
Score = 160 bits (405), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 176/384 (45%), Gaps = 81/384 (21%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG TP MGWNSWN F QIN +I T + + L + + L +
Sbjct: 26 LGQTPQMGWNSWNTFKSQINSSVIENTVQLF------EHLGLKDVGYEYILLDE-GWSDY 78
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-------------ILI------SNSLDNRIESN 124
G L N T+FP+G K L D H IL S S + R
Sbjct: 79 SRTADGYLQPNLTSFPNGIKPLIDDNHAKELKIGLYGDSGILTCGFRPGSWSYEERDAQT 138
Query: 125 IAFEQGIDYLKYDNC--FNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+A G+DY KYDNC F T P +R+ M AL+ +GR IF+S+CEWG P WG
Sbjct: 139 LA-RWGVDYWKYDNCGGFQAMTEPPQVRFGIMQKALELSGRQIFYSVCEWGYQFPWHWGG 197
Query: 182 NIRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPGGWN 220
I +S+R + DI+ + + I M E+ + + PG W
Sbjct: 198 KIGHSYRMSGDITAKFTNETGCACKTAYCLNTGYAGCSVLSIIRKMREI-SQYQTPGHWL 256
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
D DMLE+GNG M + HF+ WA K+PL++G D+ L+ E++ ++ N+++IAVNQ
Sbjct: 257 DMDMLEIGNGEMTLYQQQTHFAFWAALKSPLIIGADLSKLSDESVAVLKNKDIIAVNQDS 316
Query: 281 NDQ--------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI-----GIKSN 321
Q Q+WAGP++G VVLL + K++ ++ + D+ G
Sbjct: 317 LGQAVHYIESASKEGAWQVWAGPVNGG-FVVLLLNEKSYPQALSVSFADLRLGLDGPVQV 375
Query: 322 NTLFEHRTLETKFQGKLTAKVDGH 345
L+ H++L KVDG+
Sbjct: 376 TELWSHKSL---------GKVDGY 390
>gi|423282834|ref|ZP_17261719.1| hypothetical protein HMPREF1204_01257 [Bacteroides fragilis HMW
615]
gi|404581443|gb|EKA86141.1| hypothetical protein HMPREF1204_01257 [Bacteroides fragilis HMW
615]
Length = 496
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 173/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGTDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L ++ D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAYINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|336375286|gb|EGO03622.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.3]
Length = 439
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 146/306 (47%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L G+ P MG+N+WN ++C I+E +I +T DDC+ E +R
Sbjct: 30 LREGVARLPVMGYNTWNAYYCDIDEALILKTANLMKSLGLADAGYDHVNIDDCYSEKTRS 89
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
+ G++VANK FPSG L D +H L
Sbjct: 90 AS--------------------------GDIVANKDRFPSGMNNLTDQIHALGFKAGIYS 123
Query: 113 ---------ISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYR----AM 153
S N F+ G DYLKYDNC F++ TR I RY A+
Sbjct: 124 DSGWFTCQLYPGSYSNEARDAKLFQDWGFDYLKYDNCAVPFDNITRQNIIGRYEPMTTAI 183
Query: 154 SDALKKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
SD + +G+P + S+C+WG +P LW SWRTTDDI +W + +I + N
Sbjct: 184 SDLAQMSGKPPLALSLCQWGREQPWLWARKFGQSWRTTDDIESTWDSIASIINQNSFITW 243
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+ G ND D+LEVGNG + Y E HF+ WA+ K+PLL+G D+ + T +T+ I+ N E
Sbjct: 244 ASDFYGHNDMDILEVGNGALTYEETKSHFTAWAMMKSPLLIGTDLSSATKDTLEILTNPE 303
Query: 273 VIAVNQ 278
+IA+NQ
Sbjct: 304 IIAINQ 309
>gi|336388348|gb|EGO29492.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 480
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 146/306 (47%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L G+ P MG+N+WN ++C I+E +I +T DDC+ E +R
Sbjct: 71 LREGVARLPVMGYNTWNAYYCDIDEALILKTANLMKSLGLADAGYDHVNIDDCYSEKTRS 130
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
+ G++VANK FPSG L D +H L
Sbjct: 131 AS--------------------------GDIVANKDRFPSGMNNLTDQIHALGFKAGIYS 164
Query: 113 ---------ISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYR----AM 153
S N F+ G DYLKYDNC F++ TR I RY A+
Sbjct: 165 DSGWFTCQLYPGSYSNEARDAKLFQDWGFDYLKYDNCAVPFDNITRQNIIGRYEPMTTAI 224
Query: 154 SDALKKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
SD + +G+P + S+C+WG +P LW SWRTTDDI +W + +I + N
Sbjct: 225 SDLAQMSGKPPLALSLCQWGREQPWLWARKFGQSWRTTDDIESTWDSIASIINQNSFITW 284
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+ G ND D+LEVGNG + Y E HF+ WA+ K+PLL+G D+ + T +T+ I+ N E
Sbjct: 285 ASDFYGHNDMDILEVGNGALTYEETKSHFTAWAMMKSPLLIGTDLSSATKDTLEILTNPE 344
Query: 273 VIAVNQ 278
+IA+NQ
Sbjct: 345 IIAINQ 350
>gi|343425248|emb|CBQ68784.1| probable alpha-galactosidase [Sporisorium reilianum SRZ2]
Length = 477
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 142/322 (44%), Gaps = 88/322 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NGL TP MGWN+WN F C I++ + DDCW RD
Sbjct: 21 LNNGLARTPQMGWNTWNTFACNISQDTVLSAARAIKSENLDQYGYNYVVIDDCWQADQRD 80
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
+ K LP AN FP+G KA+ D + L
Sbjct: 81 AD-----------------TKALP--------ANPDKFPNGLKAVVDEIKGLGLKAGIYS 115
Query: 113 ----------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDAL 157
I + +++ G YLKYDNCF+ T RY AMS AL
Sbjct: 116 SAGVMTCGHHIGSLGYEDVDAQSYANDGFSYLKYDNCFSQGESGTPKLSFDRYNAMSQAL 175
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------------ 199
K G+PI + MC WG+ P L+ + I NSWR + DI S+ R
Sbjct: 176 NKTGQPILYFMCNWGEDWPWLFATEIANSWRISGDIYPSFNRDDDRCPCTDITHCNLQGF 235
Query: 200 ---MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
+ I D A PG WND DMLEVGN G+ +E +VHFS+WA+ K+PL+LG D
Sbjct: 236 HCSIAKIIDFAASLGQKAYPGAWNDLDMLEVGNRGLSLDESLVHFSMWAMLKSPLILGND 295
Query: 257 VGNLTAETMPIIGNEEVIAVNQ 278
+ +T +T II N+ VI ++Q
Sbjct: 296 LTKMTNQTRAIIKNKHVIDISQ 317
>gi|115390200|ref|XP_001212605.1| hypothetical protein ATEG_03427 [Aspergillus terreus NIH2624]
gi|114195001|gb|EAU36701.1| hypothetical protein ATEG_03427 [Aspergillus terreus NIH2624]
Length = 503
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/401 (30%), Positives = 175/401 (43%), Gaps = 98/401 (24%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCW--GESS 58
L+NGL +TP MGW++WN F C +++ +I T DDCW G +S
Sbjct: 20 LNNGLAVTPQMGWDNWNAFGCSLSQSLILSTASFIKKTGLRDAGYHYIILDDCWSSGRAS 79
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLD 118
+ I+ S F S L NQ + +GTK A Y L ++D
Sbjct: 80 NGALIADSTKFPNGMSY-------LGNQLHADGFGFGIYSSAGTKTCAGYPGSLGYETID 132
Query: 119 NRIESNIAFEQGIDYLKYDNCFNDDTR-----PTIRYRAMSDALKKAGRPIFFSMCEWGD 173
+N G+DYLKYDNC N+ + RY M AL +GR I +++C WG+
Sbjct: 133 ----ANTFASWGVDYLKYDNCNNNGQSGSQAVSSARYAVMQKALAASGRQILYAICNWGE 188
Query: 174 MRPALWGSNIRNSWRTTDDISDSW----------------ARMLTIADMNEVYADHAKPG 217
+P +WG ++ NSWR T DI D + + I M + ++ G
Sbjct: 189 DKPWIWGPSVGNSWRITGDIFDRFNTAYSTCPVPNSGGYQCSVTQIMSMQATISSYSAKG 248
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWND DMLEVGNGGM +YV HFSI I+ N VIAVN
Sbjct: 249 GWNDMDMLEVGNGGMSDAQYVAHFSI--------------------GFSILANPAVIAVN 288
Query: 278 Q-----------AINDQQIWAGPLSG------NRIVVLLESRKTFSSMMTAHWDDIGIKS 320
Q N+ Q+W+GPL+ N +VV+L + S+ +A +I S
Sbjct: 289 QDPLGVAASYRWTRNNVQLWSGPLASTTGGSVNDVVVVLFNNGGSSTTASATLSEIFGSS 348
Query: 321 N--NTLFEHRTL------ETKFQGKLTAKVDGHSCKMYELT 353
+ +T FE R L +++ Q L HS +Y T
Sbjct: 349 SVPSTQFEIRDLWGSRLSDSQAQSILNDGAAAHSSLLYNAT 389
>gi|336409082|ref|ZP_08589570.1| hypothetical protein HMPREF1018_01586 [Bacteroides sp. 2_1_56FAA]
gi|335947236|gb|EGN09029.1| hypothetical protein HMPREF1018_01586 [Bacteroides sp. 2_1_56FAA]
Length = 500
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 186/411 (45%), Gaps = 68/411 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGISVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMYEL 352
+ + + I S ++ H T +L K+ + CK+Y L
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVYIL 496
>gi|330467865|ref|YP_004405608.1| ricin B lectin [Verrucosispora maris AB-18-032]
gi|328810836|gb|AEB45008.1| ricin B lectin [Verrucosispora maris AB-18-032]
Length = 550
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 153/323 (47%), Gaps = 96/323 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRD 60
L+NG+ TPPMGWN+WN F C INE +IR+ DDCW + +RD
Sbjct: 35 LNNGVARTPPMGWNTWNTFGCNINETLIRQMADAIVNTGMRDLGYKYVVVDDCWFDPNRD 94
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILI---- 113
+Q GNL A+ FPSG KAL DY+H +L
Sbjct: 95 AQ--------------------------GNLQAHAGRFPSGMKALGDYLHSRGLLFGLYQ 128
Query: 114 -----------------SNSLDNRIESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIRY 150
+ SL + + F G+D+LKYD C ND +
Sbjct: 129 VPREKTCAQYFGAWPGSTGSLGHEYQDARQFAAWGVDFLKYDWCSPWGTINDQI---AGF 185
Query: 151 RAMSDALKKAGRPIFFSMC------EWGDMRPALWGSNIRNSWRTTDDISDSWARMLT-- 202
M DAL GRPI +S+ + G R WG +I N WRTT+DI+++W T
Sbjct: 186 SLMRDALAATGRPIVYSINPNSIHEKTGPQRN--WG-DIANMWRTTEDITNAWNTGQTNG 242
Query: 203 -------IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
I ++N A +A PG +NDPDML VGNGGM E HF++WAI APL+ G
Sbjct: 243 YPMGIQNIVNVNVPIASYASPGSFNDPDMLVVGNGGMTDTEMRSHFALWAIMAAPLIAGN 302
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ 278
D+ N +A T+ I+ N+ ++A+NQ
Sbjct: 303 DIRNASAATLDILRNQNLVAINQ 325
>gi|346978344|gb|EGY21796.1| alpha-galactosidase [Verticillium dahliae VdLs.17]
Length = 480
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 164/363 (45%), Gaps = 109/363 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------DDCWGESSRDSQISLSCSFHMF 72
L+NGL TP MGW H ++ + +R+ DDCW RD
Sbjct: 20 LNNGLARTPQMGWV---HTSKKLVDLGLRDLGYNYVVLDDCW-SGGRDED---------- 65
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISN 115
G ++T FPSG KA++D +H +
Sbjct: 66 ----------------GFQYEDRTKFPSGMKAVSDAIHDMGMLFGMYGTAGEMTCARYPG 109
Query: 116 SLDNRIESNIAFEQ-GIDYLKYDNCFN--DDTRPTI---RYRAMSDALKKAGRPIFFSMC 169
SLD +F G+DYLKYDNC++ P I R+ M AL GRP+ +S+C
Sbjct: 110 SLDWEENDAKSFAAWGVDYLKYDNCYSMGRHGSPKISFDRFNTMRKALNATGRPMVYSLC 169
Query: 170 EWGDMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADM 206
WG+ WG +I NSWR + DI DS++R +L I +
Sbjct: 170 NWGEDYVYSWGVSIANSWRMSGDIYDSFSRPDVLCSCEDPSNPLCTSWGTHCSVLNILNK 229
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
YAD A+PGGWND DMLEVG GGM +EY HF++WA+ K+PLL+G D+ L+A +
Sbjct: 230 VAAYADRAQPGGWNDLDMLEVGQGGMTDDEYKAHFTMWALVKSPLLIGTDLRKLSASALT 289
Query: 267 IIGNEEVIAVNQ---------------------AINDQQIWAGPLS-GNRIVVLL----E 300
I+ N +IAV+Q + + Q+W+G L+ G+++V L E
Sbjct: 290 ILNNPAIIAVSQDPAARSALRVAIDYDVKKDKYGVGETQVWSGWLANGDQVVAFLNAADE 349
Query: 301 SRK 303
RK
Sbjct: 350 DRK 352
>gi|424665496|ref|ZP_18102532.1| hypothetical protein HMPREF1205_01371 [Bacteroides fragilis HMW
616]
gi|404574740|gb|EKA79488.1| hypothetical protein HMPREF1205_01371 [Bacteroides fragilis HMW
616]
Length = 496
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 173/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGTDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L +F D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAFINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA++Q AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAIDQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|256377862|ref|YP_003101522.1| Ricin B lectin [Actinosynnema mirum DSM 43827]
gi|255922165|gb|ACU37676.1| Ricin B lectin [Actinosynnema mirum DSM 43827]
Length = 597
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 122/420 (29%), Positives = 182/420 (43%), Gaps = 86/420 (20%)
Query: 4 VSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQ 62
+ S AD +D R TPPMGWNSWN F C I+E IR+ D S RD+
Sbjct: 72 AAGSPADVRPADVRPAAPGSPATTPPMGWNSWNTFGCNISESTIRDGADALVSSGMRDAG 131
Query: 63 ISL----SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------- 110
C F + + LP+ G+L + T FPSG KAL DY+H
Sbjct: 132 YQYVVVDDCWFDV---------QRLPD---GSLRGDPTRFPSGMKALGDYIHARGLKFGI 179
Query: 111 --ILISNSLDNR------IESNIAFEQ---------GIDYLKYDNCFNDDTR--PTIRYR 151
+ + R ++ E+ G+DYLKYD C + R R+
Sbjct: 180 YQVPTDRTCAQRGGAYPGSTGSVGHEELDARTFASWGVDYLKYDWCSPEGDRDEQVARFA 239
Query: 152 AMSDALKKAGRPIFFSM------------CEWGDMRPALWGSNIRNSWRTTDDISDSWAR 199
M DAL+ GRPI +S+ +WG+ + + WRTT+D+ D W
Sbjct: 240 LMRDALRATGRPIVYSINPNSYHAITGSTYDWGE---------VADLWRTTEDLLDIWRN 290
Query: 200 ---------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAP 250
++ + D+N A A PG WNDPDML VG G+ + HF++WA+ AP
Sbjct: 291 ENTNSYPMGVVNVVDVNAPLAAQAGPGRWNDPDMLVVGRPGLTTQQSRAHFALWALMAAP 350
Query: 251 LLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLL 299
L+ G DV + AE I+ ++AVNQ D +++A PL+ + V L
Sbjct: 351 LMAGNDVRAMPAEISSILRTPGLVAVNQDALGAGGRRVRDDGDTEVFAKPLADGSVAVGL 410
Query: 300 ESRKTFSSMMTAHWDDIGIKSNN-TLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+R + ++A ++G+ + L + T T ++TA V + +T L
Sbjct: 411 FNRGAQPARISAGPAEVGLAGTSLALTDLWTGATSTGARITADVPSQGVAAFRVTGAGPL 470
>gi|298385824|ref|ZP_06995381.1| alpha-galactosidase [Bacteroides sp. 1_1_14]
gi|298261052|gb|EFI03919.1| alpha-galactosidase [Bacteroides sp. 1_1_14]
Length = 503
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 166/358 (46%), Gaps = 65/358 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G+ L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 98 YTVTLKAKNALGTSTEKLVIRIGDDLLLTPPMGWNSWNTFGRHLTEELVLQTADALVANG 157
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L NK FP G K +ADY+H
Sbjct: 158 MRD----LGYSYINIDDFWQLPER----GADGHLQINKDKFPRGIKYVADYLHERGFKLG 209
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C D RY M AL
Sbjct: 210 IYSDAADKTCGGVCGSYGYEEVDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGKAL 269
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ GR I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 270 RATGRSIVFSICEWGQREPWKWAKQVGGHLWRVSGDIGDVWNREANKLGGLRGILNILEI 329
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMK-----------YNEYVVHFSIWAISKAPLLLGC 255
N +++ P GWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 330 NAPLSEYGGPSGWNDPDMLVVGIGGKSMSIGYESEGCTHEQYKSHFALWCMMASPLLCGN 389
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESR 302
DV ++ T+ ++ + ++IA+NQ + +Q IW PL+ R V +R
Sbjct: 390 DVRSMNDSTLQVLLDRDLIAINQDVLGKQAERSIRADHYDIWVKPLADGRKAVACFNR 447
>gi|383117649|ref|ZP_09938392.1| hypothetical protein BSHG_0212 [Bacteroides sp. 3_2_5]
gi|251947011|gb|EES87293.1| hypothetical protein BSHG_0212 [Bacteroides sp. 3_2_5]
Length = 500
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRTS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|375357809|ref|YP_005110581.1| putative alpha-galactosidase [Bacteroides fragilis 638R]
gi|301162490|emb|CBW22036.1| putative alpha-galactosidase [Bacteroides fragilis 638R]
Length = 500
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|423249451|ref|ZP_17230467.1| hypothetical protein HMPREF1066_01477 [Bacteroides fragilis
CL03T00C08]
gi|423256236|ref|ZP_17237164.1| hypothetical protein HMPREF1067_03808 [Bacteroides fragilis
CL03T12C07]
gi|423284830|ref|ZP_17263713.1| hypothetical protein HMPREF1204_03251 [Bacteroides fragilis HMW
615]
gi|392649427|gb|EIY43105.1| hypothetical protein HMPREF1067_03808 [Bacteroides fragilis
CL03T12C07]
gi|392655536|gb|EIY49178.1| hypothetical protein HMPREF1066_01477 [Bacteroides fragilis
CL03T00C08]
gi|404579419|gb|EKA84133.1| hypothetical protein HMPREF1204_03251 [Bacteroides fragilis HMW
615]
Length = 500
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|29348474|ref|NP_811977.1| alpha-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29340378|gb|AAO78171.1| alpha-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
Length = 503
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 166/358 (46%), Gaps = 65/358 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G+ L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 98 YTVTLKAKNALGTSTEKLVIRIGDDLLLTPPMGWNSWNTFGRHLTEELVLQTADALVANG 157
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L NK FP G K +ADY+H
Sbjct: 158 MRD----LGYSYINIDDFWQLPER----GADGHLQINKDKFPRGIKYVADYLHERGFKLG 209
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C D RY M AL
Sbjct: 210 IYSDATDKTCGGVCGSYGYEEVDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGKAL 269
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ GR I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 270 RGTGRSIVFSICEWGQREPWKWAKQVGGHLWRVSGDIGDVWNREANKLGGLRGILNILEI 329
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMK-----------YNEYVVHFSIWAISKAPLLLGC 255
N +++ P GWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 330 NAPLSEYGGPSGWNDPDMLVVGIGGKSMSIGYESEGCTHEQYKSHFALWCMMASPLLCGN 389
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESR 302
DV ++ T+ ++ + ++IA+NQ + +Q IW PL+ R V +R
Sbjct: 390 DVRSMNDSTLQVLLDRDLIAINQDVLGKQAERSIRADHYDIWVKPLADGRKAVACFNR 447
>gi|60680928|ref|YP_211072.1| alpha-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60492362|emb|CAH07128.1| putative alpha-galactosidase [Bacteroides fragilis NCTC 9343]
Length = 500
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDNHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|423258228|ref|ZP_17239151.1| hypothetical protein HMPREF1055_01428 [Bacteroides fragilis
CL07T00C01]
gi|423264804|ref|ZP_17243807.1| hypothetical protein HMPREF1056_01494 [Bacteroides fragilis
CL07T12C05]
gi|387777674|gb|EIK39771.1| hypothetical protein HMPREF1055_01428 [Bacteroides fragilis
CL07T00C01]
gi|392704537|gb|EIY97672.1| hypothetical protein HMPREF1056_01494 [Bacteroides fragilis
CL07T12C05]
Length = 500
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMGRYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSISYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|148909357|gb|ABR17777.1| unknown [Picea sitchensis]
Length = 133
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 75/128 (58%), Positives = 89/128 (69%)
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
M DAL RPIF+S+CEWG PA W I +SWRTTDDI D W M++ AD N +A
Sbjct: 1 MRDALLSTRRPIFYSLCEWGLDVPATWARQIGSSWRTTDDIEDKWESMISRADQNNEFAQ 60
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+A PGGWNDPDMLEVGNG M EY HFSIWA+ KAPLL+GCDV ++ +T + N E
Sbjct: 61 YAGPGGWNDPDMLEVGNGNMTPEEYGSHFSIWALMKAPLLIGCDVTSMDKKTYGTLSNSE 120
Query: 273 VIAVNQAI 280
VIAVNQ +
Sbjct: 121 VIAVNQGL 128
>gi|265762880|ref|ZP_06091448.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
gi|263255488|gb|EEZ26834.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
Length = 500
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 185/409 (45%), Gaps = 68/409 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ R V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGRKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMY 350
+ + + I S ++ H T +L K+ + CK+Y
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDNHLTKSGSDSKELIVKLAPYQCKVY 494
>gi|456392187|gb|EMF57530.1| alpha-galactosidase [Streptomyces bottropensis ATCC 25435]
Length = 548
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 172/400 (43%), Gaps = 99/400 (24%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW RDS
Sbjct: 40 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMKDAGYQYVVVDDCWFNPQRDSA---- 95
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL AN TTFPSG KAL DY+H
Sbjct: 96 ----------------------GNLRANPTTFPSGMKALGDYIHGKGLKFGIYQAPNERT 133
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C T ++ M DAL+
Sbjct: 134 CAQGVGTYPGSTGSKGHEAQDAATFASWGVDYLKYDWCSGSGTLNEQIAQFTIMRDALRA 193
Query: 160 AGRPIFFSMCEWGDMRPAL----WGSNIRNSWRTTDDISDSWARMLT---------IADM 206
GRPI +S+ P WG + + WRTT+D+ D W T + D+
Sbjct: 194 TGRPIIYSINPNSFHSPTGATYDWG-QVADLWRTTEDLLDIWQNGNTNSYPMGVGNVLDI 252
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++++
Sbjct: 253 TAPLAAQSGPGNWNDPDMLVVGRPGLSLTESRSHFALWALLSAPLIAGNDIRTMSSDVSA 312
Query: 267 IIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
I+ N ++AVNQ D +++A PLS + V L +R+ ++ +T+
Sbjct: 313 ILRNPRLLAVNQDPLGAGGRRVRDDGDTEVFAKPLSDGSVAVGLFNRRNSTATLTSTAAQ 372
Query: 316 IGIKSNN-TLFEHRTLETK-FQGKLTAKVDGHSCKMYELT 353
+G+ + TL + T T G+++ V H ++++T
Sbjct: 373 LGLTGGSFTLTDLWTGATSNTSGQISVSVPAHGVAVFKMT 412
>gi|383124665|ref|ZP_09945328.1| hypothetical protein BSIG_1583 [Bacteroides sp. 1_1_6]
gi|251841177|gb|EES69258.1| hypothetical protein BSIG_1583 [Bacteroides sp. 1_1_6]
Length = 503
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 166/358 (46%), Gaps = 65/358 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G+ L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 98 YTVTLKAKNALGTSTEKLVIRIGDDLLLTPPMGWNSWNTFGRHLTEELVLQTADVLVANG 157
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L NK FP G K +ADY+H
Sbjct: 158 MRD----LGYSYINIDDFWQLPER----GADGHLQINKDKFPRGIKYVADYLHERGFKLG 209
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C D RY M AL
Sbjct: 210 IYSDAADKTCGGVCGSYGYEEVDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGKAL 269
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ GR I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 270 RGTGRSIVFSICEWGQREPWKWAKQVGGHLWRVSGDIGDVWNREANKLGGLRGILNILEI 329
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMK-----------YNEYVVHFSIWAISKAPLLLGC 255
N +++ P GWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 330 NAPLSEYGGPSGWNDPDMLVVGIGGKSMSIGYESEGCTHEQYKSHFALWCMMASPLLCGN 389
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESR 302
DV ++ T+ ++ + ++IA+NQ + +Q IW PL+ R V +R
Sbjct: 390 DVRSMNDSTLQVLLDRDLIAINQDVLGKQAERSIRADHYDIWVKPLADGRKAVACFNR 447
>gi|313146012|ref|ZP_07808205.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
gi|313134779|gb|EFR52139.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
Length = 426
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/411 (30%), Positives = 184/411 (44%), Gaps = 72/411 (17%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 20 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMIANG 79
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 80 MRD----LGYSYINIDDFWQLPERG----ADGHLQIDKTKFPRGIKYVADYLHERGFKLG 131
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C R RY M AL
Sbjct: 132 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVGRVEAMERYVKMGRAL 191
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 192 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDANHVGGLRGILNILEI 251
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PL G
Sbjct: 252 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLFSGN 311
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ N ++IA+NQ I +Q IW PL+ R V +R
Sbjct: 312 DVRNMNDSTLKILLNADLIAINQDILGRQAERSIRSDHYDIWVKPLADGRKAVACFNRA- 370
Query: 305 FSSMMTAHWD-----DIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMY 350
SS T D D+ + L H T +L K+ + CK+Y
Sbjct: 371 -SSPQTVILDENTIADLSFEQIYCLDNHLTKSGSDSKELIVKLAPYQCKVY 420
>gi|423272008|ref|ZP_17250977.1| hypothetical protein HMPREF1079_04059 [Bacteroides fragilis
CL05T00C42]
gi|423275988|ref|ZP_17254931.1| hypothetical protein HMPREF1080_03584 [Bacteroides fragilis
CL05T12C13]
gi|392695695|gb|EIY88901.1| hypothetical protein HMPREF1079_04059 [Bacteroides fragilis
CL05T00C42]
gi|392699742|gb|EIY92912.1| hypothetical protein HMPREF1080_03584 [Bacteroides fragilis
CL05T12C13]
Length = 496
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 172/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGIDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L ++ D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAYINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA+NQ AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAINQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|358370493|dbj|GAA87104.1| melibiase D [Aspergillus kawachii IFO 4308]
Length = 627
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 172/385 (44%), Gaps = 67/385 (17%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ----IAAKPLPN 85
MGWN++NH+ C NE I+R S+ + + L S + +T A LP+
Sbjct: 1 MGWNTYNHYSCSPNETIVR--------SNAQALVDLGLSSLGYRYVTTDCGWTVADRLPD 52
Query: 86 QCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIESNIAFEQ--- 129
G+L N+T FP G A+ D++H IL+ S N S Q
Sbjct: 53 ---GSLTWNETLFPQGFPAMGDFLHDLGLLFGVYQDSGILLCGSPPNETGSLYHEAQDAR 109
Query: 130 -----GIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+D LKYDNC++D T P R+ MS AL + R I F +CEWG
Sbjct: 110 TFASWNVDSLKYDNCYSDAATNYPNVNYAPSTSPEPRFANMSHALSQQNRTILFQICEWG 169
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
PA W + +SWR +DI +W + I + D A PG W D DMLEVGN
Sbjct: 170 ISFPAGWAPALGHSWRIGNDIIPAWRTIYRIINQAAPQTDFAGPGQWPDLDMLEVGNNIF 229
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGN-LTA---ETMPIIGNEEVIAVNQ---------- 278
E HFS+WAI K+PL++G + + LTA ++ ++ ++V+A NQ
Sbjct: 230 SLPEEQTHFSLWAILKSPLIIGAALKDELTAINDASLAVLKQKDVVAFNQDALGKSASLR 289
Query: 279 ---AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL--FEHRTLETK 333
++W+GPLS R V + + + S +T DIG++ T+ T
Sbjct: 290 RRWTEEGYEVWSGPLSNGRTVAAVINWRNESRDLTLDLPDIGLQHAGTVKNIWDGTTAQD 349
Query: 334 FQGKLTAKVDGHSCKMYELTPVTKL 358
T+ V GH + EL T +
Sbjct: 350 VLTSYTSTVAGHGTMLLELQNTTAV 374
>gi|414870941|tpg|DAA49498.1| TPA: hypothetical protein ZEAMMB73_096407 [Zea mays]
Length = 255
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 110/219 (50%), Gaps = 66/219 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NGLG TP MGWNSWNHF C INE +I++T DDCW ES RD
Sbjct: 62 FENGLGRTPQMGWNSWNHFGCGINENLIKQTADALVNTGLAKLGYEYVNIDDCWAESDRD 121
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
Q G+ VAN+ TFPSG KALADYVH
Sbjct: 122 YQ--------------------------GSFVANRQTFPSGIKALADYVHAKGLKLGIYS 155
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTI-RYRAMSDALKKA 160
+ SLD+ + F GIDYLKYDNC ND R + RY MS+A+K
Sbjct: 156 DAGTRTCSQKMPGSLDHEEQDVKTFSSWGIDYLKYDNC-NDAGRSVMERYTKMSNAMKTY 214
Query: 161 GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR 199
G+ IFFS+CEWG PA W + NSWRTTDDI+D+W R
Sbjct: 215 GKGIFFSLCEWGRQNPATWAGGMGNSWRTTDDIADNWGR 253
>gi|375360444|ref|YP_005113216.1| putative alpha-galactosidase/melibiase [Bacteroides fragilis 638R]
gi|383119700|ref|ZP_09940438.1| hypothetical protein BSHG_3490 [Bacteroides sp. 3_2_5]
gi|423251884|ref|ZP_17232892.1| hypothetical protein HMPREF1066_03902 [Bacteroides fragilis
CL03T00C08]
gi|423252802|ref|ZP_17233733.1| hypothetical protein HMPREF1067_00377 [Bacteroides fragilis
CL03T12C07]
gi|251944689|gb|EES85164.1| hypothetical protein BSHG_3490 [Bacteroides sp. 3_2_5]
gi|301165125|emb|CBW24693.1| putative alpha-galactosidase/melibiase [Bacteroides fragilis 638R]
gi|392648760|gb|EIY42447.1| hypothetical protein HMPREF1066_03902 [Bacteroides fragilis
CL03T00C08]
gi|392659263|gb|EIY52884.1| hypothetical protein HMPREF1067_00377 [Bacteroides fragilis
CL03T12C07]
Length = 496
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 173/356 (48%), Gaps = 66/356 (18%)
Query: 2 FIVSSSAADSNGSDRRNLLSN---GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A ++ G+D + LL N L LTPPMGWNSWN F + E+++ +T D E+
Sbjct: 94 YKVMLKAENALGTDTQELLINIGDELLLTPPMGWNSWNTFGRHLTEELLLQTADAMVENG 153
Query: 59 -RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L ++ D Q+ + G++ +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYAYINIDDFWQLPER----GADGHIQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFNDDTRPTI--RYRAMSDAL 157
+ S++ D + + +E+ G+D LKYD C R RY M AL
Sbjct: 206 IYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLLKYDYCNAPAGRVEAMERYEKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I FS+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSGDIGDLWNRSTDEKGGLRGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGC 255
N +++A+PGGWNDPDML VG GG + +Y HF++W + +PLL G
Sbjct: 326 NAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESEGCTNEQYQSHFALWCMMASPLLCGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---------AI--NDQQIWAGPLS-GNRIVVLL 299
DV + T+ I+ N+++IA++Q AI + +W PLS G++ + L
Sbjct: 386 DVRQMNDSTLQILLNKDLIAIDQDPLGIQAERAIRADHYDVWVKPLSDGSKAIACL 441
>gi|392589366|gb|EIW78697.1| glycoside hydrolase family 27 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 432
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 183/390 (46%), Gaps = 66/390 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NG+ P +G+N+WN +HC I+++++ + DDCWG ++R
Sbjct: 17 LNNGVAKLPVLGFNTWNAYHCDISQELVLQQAQYMKTLGLLDAGYTQFNLDDCWGVTNRS 76
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S + + +F + A + + ++ SG + A Y+ S D+
Sbjct: 77 SSGEIQYNTTLFPDMNNYTATLNSMGFKAGIYSD-----SGWQTCAGYM-----GSFDHE 126
Query: 121 IESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDALKKA----GRPIFFSMCE 170
+ F+ G DYLKYDNC F+DD + I +Y+ M DAL+K G FS+CE
Sbjct: 127 DQDAATFQSWGFDYLKYDNCAIPFDDDVQQNILGKYQRMQDALEKVANSTGTTFVFSLCE 186
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WG + LWG+++ +SWR +DIS W+ + +I + ++ G ND D+LE+GNG
Sbjct: 187 WGWSQVWLWGASVGHSWRIDEDISPYWSSLTSIINQLSFVNFGSQFYGHNDLDILEIGNG 246
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------ 278
+ Y+E HF++WA +K+PLL+ D+ +++ E + I+ N E++A++Q
Sbjct: 247 NLTYDESKTHFTVWAFAKSPLLISADLSSISQENVEILTNSEILAISQDNVYGQSVSPFR 306
Query: 279 ----------AINDQQIWAGPLSGNRIVVLL----ESRKTFSSMMTAHWDDIGIK-SNNT 323
Q W+ P +++L+ +S F ++ + W GI+ S
Sbjct: 307 WGLNPDWTTNVTYPAQYWSLPSENGTVIMLINTLDDSADMFFNLTESPWLRAGIQYSVRD 366
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+ H T + A V H T
Sbjct: 367 LWSHTDNGTAVRNYTAASVPSHGVAALLFT 396
>gi|345564628|gb|EGX47588.1| hypothetical protein AOL_s00083g96 [Arthrobotrys oligospora ATCC
24927]
Length = 434
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 140/301 (46%), Gaps = 50/301 (16%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL LTP MGWN+WN F C INE I + D + + D + +
Sbjct: 26 NGLALTPQMGWNTWNTFACNINEDTILGAAKALKKLKLDE---YGYHYVVMDDCWSLHQR 82
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNSLDNRIESNIAFEQG-- 130
G L + FP G K LAD +H L R ++ FE+
Sbjct: 83 ----NATGYLQYDPVKFPKGIKHLADEIHALGLGFGMYSSAGRYTCGRYPGSLNFEEKDA 138
Query: 131 -------IDYLKYDNCFNDDTR--PTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+DYLKYDNC N+ P I RY MS AL K GRPI +SMC WG+ P
Sbjct: 139 DLFAEWEVDYLKYDNCHNEGQSGTPLISYNRYEVMSRALNKTGRPILYSMCNWGEDSPWD 198
Query: 179 WGSNIRNSWRTTDDISDSWAR--------------------MLTIADMNEVYADHAKPGG 218
W I NSWR + DI + + R ++ I + A G
Sbjct: 199 WAPTIANSWRMSGDIVNVFDRYDDRCPCDDYICRLPGHHCSIMNIINKAAPLGQKAHTGA 258
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WND DMLE+GN + Y E +HFSIWA+ K+PL++G D+ + + +T+ I+ N+++I ++Q
Sbjct: 259 WNDLDMLEIGNDNLDYEEQKLHFSIWAMVKSPLIMGHDIMHTSKQTLEILTNKDIIDLSQ 318
Query: 279 A 279
Sbjct: 319 G 319
>gi|346977931|gb|EGY21383.1| alpha-galactosidase [Verticillium dahliae VdLs.17]
Length = 549
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 174/379 (45%), Gaps = 56/379 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNS+N ++C NE IIR+ + + L + +D +T P ++
Sbjct: 31 TPPMGWNSYNRYNCFPNEAIIRQ--------NAQGLVDLGFAGLGYDIVTVDCGWPSRDR 82
Query: 87 -CQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLDNR-IESNI 125
QG L N+T FPSG L D++H L I SL I++
Sbjct: 83 DAQGRLQWNETLFPSGPVELGDFIHGLGLQFGLYSGAGYLQCGSSDIPASLGYEDIDAQS 142
Query: 126 AFEQGIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-D 173
E G D LKYDNC+ + + + R+ M+DAL KAGR I + +C+WG
Sbjct: 143 FAEWGGDSLKYDNCYATSRTEMVDWQSAEAQSPDRFVKMADALHKAGRDIKYFLCQWGIG 202
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
W + + N+WR ++DI ++W + I + +A H PG + D DML +G G +
Sbjct: 203 ENVPDWAAPLGNTWRMSNDIFNAWRAIWRITNQVVPHAKHTGPGAFADMDMLIIGLGALS 262
Query: 234 YNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQ------------- 278
Y E HF W++ K+PL +G D + AE++ I+ NEE IA+NQ
Sbjct: 263 YEEERFHFGFWSLMKSPLFIGGVMDRAEIPAESLKIMSNEEAIAINQDPLAKAAELVIRY 322
Query: 279 AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKL 338
+ IWAG LSG R + + + K + + IG+ S T + G
Sbjct: 323 TEEEWDIWAGELSGGRKALGIANWKNETQTVNVDLSLIGVASAKTRDIWAHADGAIAGTQ 382
Query: 339 TAKVDGHSCKMYELTPVTK 357
K+ H ++ L+ + +
Sbjct: 383 EVKLAPHELRLLVLSDIEE 401
>gi|346975587|gb|EGY19039.1| alpha-galactosidase [Verticillium dahliae VdLs.17]
Length = 479
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 162/328 (49%), Gaps = 58/328 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+GL TPPMGWN+WN F C ++E ++ T S R + L + L
Sbjct: 22 DGLADTPPMGWNNWNAFACDVSEHLLLRT------SERIVNLGLRDLGYNTVVLDDCWQD 75
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHI-----------------LISNSLDNRIESN 124
P +G + + FP G KA++D +H + SLD+ +
Sbjct: 76 PAGRDAKGKVQPDLAKFPRGMKAISDALHAQNLKFGMYSSAGELTCARFAGSLDHEQDDA 135
Query: 125 IAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+F G+D+LKYDNCF+ P I R++AMSDALK +GR I ++C WG+
Sbjct: 136 DSFAAWGVDFLKYDNCFHMGRMGTPEISFNRFKAMSDALKASGRDIALNLCNWGEDYVHT 195
Query: 179 WGSNIRNSWRTTDDISDSWAR------MLTIAD---------------MNEV--YADHAK 215
WG+++ ++WR + DI DS+ R ++AD +N+V +AD A
Sbjct: 196 WGASLAHAWRMSGDIYDSFTRPDDLCSCASVADPFCVAPGTQCSVLFILNKVAPFADRAI 255
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV-I 274
PGGWND DMLEVG GGM EY HF++WA K+P G V + + IG V +
Sbjct: 256 PGGWNDLDMLEVGRGGMTDEEYKAHFALWAALKSPDPHGRAVYRVRRD----IGPPRVPV 311
Query: 275 AVNQAINDQQIWAGPLS-GNRIVVLLES 301
A A + IW+G L+ G++ V+ L +
Sbjct: 312 ADEYAAQEAHIWSGRLANGDQTVIFLNA 339
>gi|93276954|dbj|BAE93466.1| alpha-galactosidase [Lachancea thermotolerans]
Length = 472
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 157/345 (45%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGWN+WN F C + E+++ S + +SL + +
Sbjct: 26 NGLGLTPQMGWNNWNSFACDVKEEMLL--------GSAEKLVSLGLKDLGYKYVILDDCW 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHI--LISNSLDNRIESNIAFEQG--------- 130
G+L + FP+G K +AD +H L+ + E A G
Sbjct: 78 SGGRSENGSLFPHPEKFPNGMKHVADRLHKQDLLFGMYSSAGEYTCAGYSGSLGHEETDA 137
Query: 131 -------IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 138 NSFASWDVDYLKYDNCYNRGQFGTPEASFNRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 197
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI + R ++ I +A
Sbjct: 198 WGSAISNSWRISGDIYPFFDRPDSRCPCEDDEYDCKYAGFHCSIMNILGKAAPMGQNAGV 257
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E + HFS+WAI K+PL++G ++ + + I+ N VIA+
Sbjct: 258 GGWNDLDALEVGVGNLTDDEEMAHFSMWAIVKSPLVIGANLEKINPSSYSILSNPAVIAI 317
Query: 277 NQ-------------AINDQ--------QIWAGPL-SGNRIVVLL 299
NQ I D Q+W+G L +G+++V LL
Sbjct: 318 NQDPAGTPAVRVWREQIRDTDEYGRGEIQLWSGSLHNGDQVVALL 362
>gi|119497533|ref|XP_001265525.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
gi|119413687|gb|EAW23628.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Neosartorya
fischeri NRRL 181]
Length = 406
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 179/383 (46%), Gaps = 79/383 (20%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG TP MGWNSWN F IN +I T + + L + + + L +
Sbjct: 26 LGQTPQMGWNSWNTFKSNINASVIENTVQLF------EDLGLKDAGYEYILLDE-GWSDY 78
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDNRI------ESNIAFEQ----- 129
G L N T+FP+G K L D +H + I D+ I + +E+
Sbjct: 79 SRTADGYLQPNLTSFPNGIKPLIDDIHAKGLKIGLYGDSGILTCGFRPGSWGYEERDAQT 138
Query: 130 ----GIDYLKYDNC--FNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
G+DY KYDNC F T P +R+ M AL+ +GR IF+S+CEWG P WG
Sbjct: 139 LAGWGVDYWKYDNCGGFQAMTEPPQVRFGVMQKALELSGREIFYSVCEWGYQFPWHWGGK 198
Query: 183 IRNSWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPGGWND 221
I +S+R + DI+ + + I M E+ + + PG W D
Sbjct: 199 IGHSYRMSGDITAKFTNETGCACKTAYCLNTGYAGCSVLSIIRKMREI-SQYQTPGHWLD 257
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
DMLE+GNG M + HF+ WA K+PL++G D+ L+ E++ ++ N+++IAV+Q
Sbjct: 258 MDMLEIGNGEMTLYQQQTHFAFWAALKSPLIIGADLSKLSDESVAVLKNKDIIAVSQDTL 317
Query: 282 DQ--------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN----- 322
Q Q+WAG + G +V+LL + K++ ++ + D+G+ +
Sbjct: 318 GQAVHYIESASKEGQWQVWAGQVKGGYVVLLL-NEKSYPQELSVSFADLGLGLDGPVLVT 376
Query: 323 TLFEHRTLETKFQGKLTAKVDGH 345
L+ H++L KVDG+
Sbjct: 377 ELWSHKSL---------GKVDGY 390
>gi|93276952|dbj|BAE93465.1| alpha-galactosidase [Lachancea thermotolerans]
Length = 472
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 113/345 (32%), Positives = 160/345 (46%), Gaps = 75/345 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGWN+WN F C + E+++ S + +SL + +
Sbjct: 26 NGLGLTPQMGWNNWNSFACDVKEEMLL--------GSAEKLVSLGLKDLGYKYVILDDCW 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILI--------------SNSLDNR-IES 123
G+L + FP+G K +AD +H +L S SL + ++
Sbjct: 78 SGGRSENGSLFPHPEKFPNGMKHVADRLHKQDLLFGMYSSAGEYTCAGYSGSLGHEETDA 137
Query: 124 NIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
N +DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 138 NSFASWDVDYLKYDNCYNRGQFGTPEASFNRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 197
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI + R ++ I +A
Sbjct: 198 WGSAISNSWRISGDIYPFFDRPDSRCPCDDDEYDCKYAGFHCSIMNILGKAAPMGQNAGV 257
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWND D LEVG G + +E + HFS+WAI K+PL++G ++ + + I+ N VIA+
Sbjct: 258 GGWNDLDALEVGVGNLTDDEEMAHFSMWAIVKSPLVIGANLEKINPSSYSILSNPAVIAI 317
Query: 277 NQ-------------AINDQ--------QIWAGPL-SGNRIVVLL 299
NQ I D Q+W+G L +G+++V LL
Sbjct: 318 NQDPAGTPAVRVWREQIRDTDEYGRGEIQLWSGSLHNGDQVVALL 362
>gi|443290878|ref|ZP_21029972.1| Alpha-galactosidase [Micromonospora lupini str. Lupac 08]
gi|385886433|emb|CCH18046.1| Alpha-galactosidase [Micromonospora lupini str. Lupac 08]
Length = 575
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 144/323 (44%), Gaps = 96/323 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG TPPMGWNSWN F C INE +IR+T DDCW +RD
Sbjct: 56 LENGTARTPPMGWNSWNSFGCNINEGLIRQTADAMVSNGMRDLGYQYVVVDDCWFNPNRD 115
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
S GNL + T FPSG KAL DY+H
Sbjct: 116 SS--------------------------GNLQGDPTRFPSGMKALGDYLHARGLKFGLYQ 149
Query: 113 ----------------ISNSLDNRIESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIRY 150
+ S + + F G+DYLKYD C ND +
Sbjct: 150 VPVDKTCAQYFNSYPGATGSQGHEAQDARQFAAWGVDYLKYDWCSPNGSINDQV---TTF 206
Query: 151 RAMSDALKKAGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT-- 202
M DAL GRPI +S+ + G R WG ++ N WRTT+DI+++W T
Sbjct: 207 AKMRDALAATGRPILYSINPNSIHAKTGPQRN--WG-DVANIWRTTEDITNAWDTGQTNG 263
Query: 203 -------IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
I ++ A +A+PG + DPDM+EVG GGM E HF++WA+ +PL+ G
Sbjct: 264 YPMGIQNIINVTVPLASYARPGSFTDPDMMEVGRGGMNDTEMRSHFAMWAVMASPLIAGN 323
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ 278
DV N+ A T I+ N +IA+NQ
Sbjct: 324 DVRNMNAATQTILKNANLIAINQ 346
>gi|409051424|gb|EKM60900.1| glycoside hydrolase family 27 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 408
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 132/299 (44%), Gaps = 88/299 (29%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGW++WNHF C ++E I DDCW SRD+Q
Sbjct: 1 MGWDTWNHFGCNVDEDTIVSAANAIVQNNLTQFGYEYVIIDDCWQAPSRDNQT------- 53
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ 112
G VA+ T FP+G + L++ +H +
Sbjct: 54 ------------------GAPVADPTKFPNGMEYLSNKIHSMGLKFGIYSDAGTLTCGGH 95
Query: 113 ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFS 167
+ I++ E G DYLKYDNC+N+ T RY MS AL GRPI +S
Sbjct: 96 FGSLGYEEIDAKTYAEWGADYLKYDNCYNEGLAGTPHISHERYANMSRALNVTGRPILYS 155
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWAR---------------------MLTIADM 206
MC WG+ P + I NSWR + D+ DS+ R + I D
Sbjct: 156 MCSWGEDGPWNYAQTIANSWRISGDVMDSFDRYDERCPCTSVIDCKLPGFHCAVSRIIDF 215
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
A PG WND DMLEVGNGGM + EY HFS+W+I K+PL+LG DV N+T ETM
Sbjct: 216 AAPLGQKAGPGHWNDLDMLEVGNGGMSFEEYRTHFSMWSILKSPLILGNDVTNMTNETM 274
>gi|374312900|ref|YP_005059330.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
gi|358754910|gb|AEU38300.1| Alpha-galactosidase [Granulicella mallensis MP5ACTX8]
Length = 389
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 115/339 (33%), Positives = 149/339 (43%), Gaps = 92/339 (27%)
Query: 27 TPPMGWNSWNHFHCQINEK-------------------IIRETDDCWGESSRDSQISLSC 67
TPPMGWNSWNHF ++ + + DD W E+ RD+
Sbjct: 28 TPPMGWNSWNHFADKVTDADVRAAADALVASGMRDAGYVYVNIDDTW-EAKRDAH----- 81
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI- 121
++ FP K LADYVH + I +S +
Sbjct: 82 ----------------------GVIQTNEKFPD-MKGLADYVHSKGLKLGIYSSPGPKTC 118
Query: 122 ---ESNIAFEQ---------GIDYLKYDNC-FND----------DTRPTIR---YRAMSD 155
E + E+ GIDYLKYD C F D D ++ Y M
Sbjct: 119 AGYEGSYGHEEQDAQTYAAWGIDYLKYDQCSFGDLIAKEAGNDLDKAAAMQRAAYEKMHV 178
Query: 156 ALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHA 214
A+ K GRP+ +S C++G W N WRTTDDI+D+W RM I A
Sbjct: 179 AIVKTGRPMVYSFCQYGLYSVWQWAPKAGGNLWRTTDDINDTWDRMTLIGFQQAGLESFA 238
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDMLEVGNGGMK +EY VH S+WA+ APLL G D+ +T ET I+ N EVI
Sbjct: 239 GPGHWNDPDMLEVGNGGMKKSEYEVHMSLWAMLSAPLLAGNDLSKMTPETKAILMNREVI 298
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESR 302
A++Q A +IW LSG + + +R
Sbjct: 299 AIDQDALGRPGRRVWAEGPMEIWVKDLSGGKKAIAFFNR 337
>gi|336375285|gb|EGO03621.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.3]
gi|336388347|gb|EGO29491.1| glycoside hydrolase family 27 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 446
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 104/355 (29%), Positives = 163/355 (45%), Gaps = 65/355 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
NG+G P MG+N+WN + C +NE +I +T DDCW +R
Sbjct: 37 FDNGVGRLPAMGFNAWNAYACNVNESLIVQTAQDMQKLGLQDVGYTQINLDDCWALRNRS 96
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
S + F +T + + + + ++ F A S ++
Sbjct: 97 STGEIQSDPTRFPDMTNLTSTLHDMGYKAGIYSDSGWFTCAGFA----------GSFEHE 146
Query: 121 IESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL----KKAGRPIFFSMCE 170
+ + F+ G D+LKYDNC F+D R + +Y+ MSDAL ++ G FS+C+
Sbjct: 147 EQDALTFQSWGFDFLKYDNCAIPFDDVIREGMVGKYQRMSDALVVVSQQTGTDFVFSLCQ 206
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WG + LWG+++ +SWR DI +W + +I + G ND DMLE+GNG
Sbjct: 207 WGWNQVWLWGASLSHSWRIDGDIEANWDSLASIINTVSFITQATNFYGRNDLDMLEIGNG 266
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------ 278
+ Y+E HF+ WA+ KAPLL+G DV LT + + I+ N+E+IA+NQ
Sbjct: 267 NLTYDESKTHFTAWALVKAPLLIGTDVATLTQQNVEILSNQEIIAINQDSVYGTSISPFR 326
Query: 279 -AIN---------DQQIWAGPLSGNRIVVLL----ESRKTFSSMMTAHWDDIGIK 319
IN Q W+GP + +L+ E F ++ + W GI+
Sbjct: 327 WGINADFTFNTTFPAQYWSGPSENGTVFMLINTLSEPADMFFNLTESPWTRAGIQ 381
>gi|440803877|gb|ELR24760.1| Aga27A, putative [Acanthamoeba castellanii str. Neff]
Length = 407
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 167/385 (43%), Gaps = 57/385 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NG+ LTPPMGW++WN C +E +RE D + L + D +
Sbjct: 18 LDNGVALTPPMGWSTWNTLACDYDEGDLREIADI---MVKTGLTKLGYVYFNIDDCWEDG 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
G L N T FPSG ++H L S + E
Sbjct: 75 -----RDANGRLRYNATQFPSGMHGWGQHLHSLGMKFGIYTSSGQYTCSGFEGSWGHEFE 129
Query: 123 SNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS--------MCEW-G 172
F + +D++K D C + Y S AL GRPI +S + W
Sbjct: 130 DAQTFADWEVDFMKLDCCNTSVEMKNVSYPKWSKALNATGRPIVYSCDTDELIGIFPWTN 189
Query: 173 DMRPALWGSNIRNSWRTTDDISDSW-ARMLTIADMNEVY-----ADHAKPGGWNDPDMLE 226
+ RP W + N WRT DI W + + +A E + + AKPG +NDPDMLE
Sbjct: 190 EERPWDWAPPVCNMWRTWKDIKPYWWSWTMNLAFNIETWDLLDLSTIAKPGAFNDPDMLE 249
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-- 284
VG G M Y E HFS+WA+ APL+LG D+ ++T ET+ II N EVIA++Q +Q
Sbjct: 250 VGVGEMTYEESKSHFSLWAMMAAPLILGNDLRHMTTETLDIITNAEVIAIDQDPLGRQGR 309
Query: 285 ---------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTL 330
I+A PL G I V L ++ + +T +W D+G T L+ HR L
Sbjct: 310 RVLQGIDSDIYAKPLVGGAIAVALWNKVDYPVDITLNWSDLGFAPAKTMALRDLWAHRDL 369
Query: 331 ETKFQGKLTAKVDGHSCKMYELTPV 355
Q V + +L+PV
Sbjct: 370 GPHTQSYTAKAVAPTGVVVLKLSPV 394
>gi|302411604|ref|XP_003003635.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
gi|261357540|gb|EEY19968.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
Length = 472
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 106/344 (30%), Positives = 163/344 (47%), Gaps = 76/344 (22%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPN 85
L P+ +N+WN C ++E ++ +T S + + L + + L +
Sbjct: 3 LILPVPFNNWNALGCDVSEALLLDT------SKKLVNLGLRDLGYNYVVLDDCWSGGRGE 56
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIAFE 128
G ++T FPSG KA++D +H + SLD +F
Sbjct: 57 D--GFQYEDRTKFPSGMKAVSDAIHDMGMLFGMYGTAGEMTCARYPGSLDWEENDAKSFA 114
Query: 129 Q-GIDYLKYDNCFN--DDTRPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN 182
G+DYLKYDNC++ P + R+ M AL GRP+ +S+C WG+ WG +
Sbjct: 115 AWGVDYLKYDNCYSMGRHGSPKVSFDRFDTMRKALNATGRPMIYSLCNWGEDYVYSWGVS 174
Query: 183 IRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYADHAKPGGW 219
I NSWR + DI DS++R +L I + YAD A+PGGW
Sbjct: 175 IANSWRMSGDIYDSFSRPDVLCSCEDPSNPLCTSWGTHCSVLNILNKVAAYADRAQPGGW 234
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
ND DMLEVG GGM +EY HF++WA+ K+PLL+G D+ L+A + I+ N +IAV+Q
Sbjct: 235 NDLDMLEVGQGGMTDDEYKAHFTMWALVKSPLLIGTDLRKLSASALTILNNPAIIAVSQD 294
Query: 279 --------------------AINDQQIWAGPLS-GNRIVVLLES 301
+ + Q+W+G L+ G+++V L +
Sbjct: 295 PAARSALRVAIDYDVKKDKYGVGETQVWSGWLANGDQVVAFLNA 338
>gi|156373810|ref|XP_001629503.1| predicted protein [Nematostella vectensis]
gi|156216505|gb|EDO37440.1| predicted protein [Nematostella vectensis]
Length = 406
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 181/392 (46%), Gaps = 64/392 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGE------SSRDSQISLSCSFHMFD 73
L NGL LTPPMGW W F C I+ + + ++C E + R ++ + + F
Sbjct: 22 LDNGLALTPPMGWMDWERFRCNIDCE--HDPNNCISEHLFMEIADRMAEDGYRGAGYEFI 79
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDNRIESNIAF--- 127
S+ ++ GNL N T FPSG ALA Y+H + + D + + +
Sbjct: 80 SIDDCYTNKKRDK-NGNLQLNTTRFPSGPHALAQYIHGKGLKLGVYADYGVLTCAGYPGS 138
Query: 128 ----EQ--------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
EQ G DYLK D C++D Y AL GRPI +S CEW
Sbjct: 139 YYYIEQDAKTFASWGADYLKLDGCYSDPKTMNKGYPEFRRALNATGRPIVYS-CEWPSY- 196
Query: 176 PALWGSNIR----------NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
+ G N+ N+WR DDI DSWA ++ I D + + A PG WND
Sbjct: 197 --MIGRNLTVNYTDIAENCNTWRNYDDIQDSWASVMHIVDYFAANQDTFIAAAGPGHWND 254
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PD L +GN G+ ++ ++WAI +PL++ D+ ++ I+ N+EVIAVNQ
Sbjct: 255 PDELIIGNFGLSDDQSRAQMALWAIMASPLIMSTDLRSIKPRHKEILLNKEVIAVNQDKL 314
Query: 279 ---------AINDQQIWAGPLSGNRIVVLLESRKT--FSSMMTAHWDDIGIK----SNNT 323
+ ++ ++WA PLS + V+L +R+ ++ A +D IG + +
Sbjct: 315 GKMGRQWIKSKDNTEVWARPLSDGSVAVVLLNRRDDGMPYLVEASFDKIGFQVHAATARD 374
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
LF H+ L + F A V+ M +LT +
Sbjct: 375 LFSHKHLGS-FINSFAAHVNPTGVVMVKLTKI 405
>gi|53712778|ref|YP_098770.1| alpha-galactosidase [Bacteroides fragilis YCH46]
gi|52215643|dbj|BAD48236.1| alpha-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 500
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/411 (29%), Positives = 185/411 (45%), Gaps = 68/411 (16%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GES 57
+ V+ A ++ G + L + + L LTPPMGWNSWN F + E+++ +T D
Sbjct: 94 YTVTLKAENALGVSVKQLVIRIGDELLLTPPMGWNSWNTFGQHLTEELVLQTADAMITNG 153
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L S+ D Q+ + G+L +KT FP G K +ADY+H
Sbjct: 154 MRD----LGYSYINIDDFWQLPER----GADGHLQIDKTKFPRGIKYVADYLHERGFKLG 205
Query: 112 LISNSLDNR---IESNIAFEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDAL 157
+ S++ + + + +E+ G+D LKYD C D RY M AL
Sbjct: 206 IYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLLKYDYCNAPVDRVEAMERYAKMGRAL 265
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTTDDISDSWAR----------MLTIADM 206
+ R I +S+CEWG P W + WR + DI D W R +L I ++
Sbjct: 266 RATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSGDIGDIWYRDGNRVGGLHGILNILEI 325
Query: 207 NEVYADHAKPGGWNDPDMLEVG-----------NGGMKYNEYVVHFSIWAISKAPLLLGC 255
N +++A P GWNDPDML VG + G +Y HFS+W + +PLL G
Sbjct: 326 NAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESEGCTQEQYKSHFSLWCMMASPLLSGN 385
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKT 304
DV N+ T+ I+ + ++IA+NQ + +Q IW PL+ V +R +
Sbjct: 386 DVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIRSDHYDIWVKPLADGCKAVACFNRAS 445
Query: 305 FSSMMTAHWDDIGIKSNNTLF---EHRTLETKFQGKLTAKVDGHSCKMYEL 352
+ + + I S ++ H T +L K+ + CK+Y L
Sbjct: 446 SPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSKELIVKLAPYQCKVYIL 496
>gi|169606079|ref|XP_001796460.1| hypothetical protein SNOG_06073 [Phaeosphaeria nodorum SN15]
gi|160706912|gb|EAT87137.2| hypothetical protein SNOG_06073 [Phaeosphaeria nodorum SN15]
Length = 396
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 178/388 (45%), Gaps = 72/388 (18%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQINEKI------IRETDDCWGESSRDSQISLSCS 68
D+R L NGLG TP +G+N+W I+ + DDCW R+S
Sbjct: 30 DKR--LENGLGRTPALGYNNWVVADLMISLGLKDVGYQYLNIDDCWHSKQRNSS-----G 82
Query: 69 FHMFD-SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF 127
F + D S KP+ +Q G + +G A Y + D + ++
Sbjct: 83 FLVPDPSKWPQGMKPVVDQIHGKGLKFGLYGDAGIMTCAGYPGSQGHEAQDAKTLAS--- 139
Query: 128 EQGIDYLKYDNCFND-----------DTRPTIR--YRAMSDALKKAGRPIFFSMCEWGDM 174
G+DY K+D C+ D+R + + Y M DAL AGRPI FSMC+WG
Sbjct: 140 -WGVDYWKHDACYLPCVTGGTPQTCWDSRSSTKSYYATMRDALAGAGRPILFSMCQWGRD 198
Query: 175 RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
WG + NSWR + DI+++W + +IA A +A PGG+ND DML+VGNG +
Sbjct: 199 SVWTWGKDYGNSWRMSGDIANTWNSLASIASNAAGMAQYAAPGGFNDLDMLQVGNGALNE 258
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------------- 278
NE H +W I+K+P+L+G D+ + T+ ++ N+ +IA+NQ
Sbjct: 259 NEERAHVGLWMIAKSPILIGTDLSKIKQSTLTLLKNKALIAINQDKLGKAAGYFRPSGQP 318
Query: 279 -AINDQ--QIWAGPLSGNRIVVLLE----------SRKTFSSMMTAHWDDIGIKSNNTLF 325
+N + WAGPLS +V L+ SRK +S+ W +
Sbjct: 319 APVNGKLYPYWAGPLSDGVVVGLVAADGAATLSAISRKCLASVQAWKWTE---------- 368
Query: 326 EHRTLETKFQGKLTAKVDGHSCKMYELT 353
+L + ++A++ H ++Y++T
Sbjct: 369 --SSLRYWYGTSVSAQLGSHDMRVYKVT 394
>gi|353239663|emb|CCA71565.1| related to alpha-galactosidase precursor [Piriformospora indica DSM
11827]
Length = 474
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 155/382 (40%), Gaps = 126/382 (32%)
Query: 22 NGLGLTPPMGW------------------NSWNHFHCQINEKIIRET------------- 50
+GL TPPMGW SWN F C I + I +
Sbjct: 24 DGLARTPPMGWVGLSFLVEFKHSTLTTLVPSWNFFACDITQDTILKAAQALVDTGLKDLG 83
Query: 51 ------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKA 104
DDCW +R S+ +P P VA+ FP G KA
Sbjct: 84 YEYVIMDDCWHAPARSSR------------------RPHPP------VADPIRFPDGIKA 119
Query: 105 LADYVHIL-----ISNSLDNR-------------IESNIAFEQGIDYLKYDNCFND---- 142
LAD +H L I +S + I++ E GIDYLKYDNC+N+
Sbjct: 120 LADKIHALGLKIGIYSSAGTKTCAGQFGSLGYEEIDAKTYAEWGIDYLKYDNCYNEGQAG 179
Query: 143 -DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW---- 197
D RY MS AL GRPI +S+C WG+ W I ++WR + DI DS+
Sbjct: 180 YDLVSYNRYAKMSRALSDTGRPILYSLCNWGEDHTWNWAPTIAHTWRMSGDIMDSYDAYD 239
Query: 198 -----------------ARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
M I + A G WND DMLEVGNGGM +EY H
Sbjct: 240 DRCPCESMIDCKLPGFHCSMTRILEYAAPLVQKAGAGQWNDLDMLEVGNGGMTTDEYKTH 299
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------------------A 279
F++WA+ K+PL+LG D+ + T II N+ +I +NQ
Sbjct: 300 FAMWAVIKSPLILGNDLTKMDEVTKAIITNKWLIKINQDRVALPVNRVRKSIIKNPHGEV 359
Query: 280 INDQQIWAGPLSGNRIVVLLES 301
+ Q+W+GPL +V ++ +
Sbjct: 360 TGNVQVWSGPLKDGAVVAIVNT 381
>gi|426192725|gb|EKV42660.1| hypothetical protein AGABI2DRAFT_228269 [Agaricus bisporus var.
bisporus H97]
Length = 504
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 144/306 (47%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
+ +G+G P MGWN+W + C+INE I+ + DDC+ E RD
Sbjct: 94 VEDGVGRLPIMGWNTWYAYMCEINETIVLDNAKLVKSLGLLDVGYNYINIDDCYSEKQRD 153
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S GN+VA+K FPSG + L D +H
Sbjct: 154 SD--------------------------GNIVASKERFPSGMRNLTDQLHEMGFKAGIYS 187
Query: 111 -------ILISNSLDNRIESNIAFEQ--GIDYLKYDNC---FNDDTRPTI--RYRAMSDA 156
L S N F + G D LKYDNC F++ + + +Y+ MSDA
Sbjct: 188 DSGWFTCQLYPGSYQNEDRDIALFSEDWGFDLLKYDNCAVPFDEVIKEGMVGKYKRMSDA 247
Query: 157 L----KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
+ K+ G+ I +S+C+WG +P LW + +WRTTDD W ++I + N Y+
Sbjct: 248 IERLRKRTGKSILYSLCQWGREQPWLWAKKLGQTWRTTDDTGSDWGHSVSILNQNSFYSW 307
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
G+ D D L+VGN GM + E HF+ WA+ K+PL++ D+ T ET+ I+ N+E
Sbjct: 308 ANDFYGYGDMDFLQVGNEGMTFEESKSHFTAWALMKSPLVISTDLTKATEETITILKNKE 367
Query: 273 VIAVNQ 278
+I ++Q
Sbjct: 368 LIDIHQ 373
>gi|255564098|ref|XP_002523047.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223537730|gb|EEF39351.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 184
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 111/174 (63%), Gaps = 19/174 (10%)
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
M TIAD+N+ +A +A PGGWNDPDMLEVGNGGM Y E +FSIWA+ KAPLL+GCDV N
Sbjct: 1 MTTIADLNDKWAAYAGPGGWNDPDMLEVGNGGMTYQECRTYFSIWALMKAPLLIGCDVRN 60
Query: 260 LTAETMPIIGNEEVIAVNQ---AINDQ-----------QIWAGPLSGNRIVVLLESRKTF 305
+TAET I+ N+EVIAVNQ + + Q+WAGPL+G+R+ V+L +R +
Sbjct: 61 MTAETYEILTNKEVIAVNQDPHGVQGRKVQTSRTDGCLQVWAGPLTGHRMAVVLCNRCSK 120
Query: 306 SSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
S+ +T WD +G++S T L++H+ + +VD H+ + L P
Sbjct: 121 SATITVRWDALGVESGTTVVVRDLWQHKDITGDSAASFGTRVDAHAAQCIFLLP 174
>gi|406865881|gb|EKD18922.1| alpha-galactosidase D [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 623
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 165/375 (44%), Gaps = 60/375 (16%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ-CQ 88
MGWNS+NH+ C NE IIR S+ + + L ++ +T LP + +
Sbjct: 1 MGWNSYNHYSCSPNESIIR--------SNAQALVDLGLQSLGYEFVTVDCGWTLPARTAE 52
Query: 89 GNLVANKTTFPSGTKALADYVHIL-----------ISNSLDNRIESNIAFEQ-------- 129
G L N FP+G AL +Y+H L I + E E+
Sbjct: 53 GTLPWNPDRFPNGFPALGEYIHGLGLKFGVYSDAGIRMCMTGEPEQVGIHEETDAQTFAS 112
Query: 130 -GIDYLKYDNCF--------NDDTRPTI----RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
G D LKYDNCF N D P RY M+ AL RPI F +C WG P
Sbjct: 113 WGADLLKYDNCFSEEASGYPNTDYNPVASASGRYANMTRALAATNRPIVFQICNWGVDFP 172
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
+ W ++ NSWR +DI ++ + I + A PG W D DMLEVGN E
Sbjct: 173 SAWAPDLGNSWRIANDIIPAYRTIPRILNQAVPQTSFAGPGRWLDLDMLEVGNNVFTIPE 232
Query: 237 YVVHFSIWAISKAPLLLGCDVGN----LTAETMPIIGNEEVIAVNQ-------------A 279
HFS+WAI K+PL++G + + + AE++ I+ N++VI NQ
Sbjct: 233 EQTHFSLWAIIKSPLVIGAALKDTSTSINAESLAILKNKDVIGYNQDPLGVAASFRRRWT 292
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE--HRTLETKFQGK 337
++WAG LSG R VV + + K + +T D G++ L + + +
Sbjct: 293 EEGYEVWAGELSGERTVVAVINLKNEARELTLDLPDAGVQRAGWLKDIWNNAAASDVLTS 352
Query: 338 LTAKVDGHSCKMYEL 352
TA V H + EL
Sbjct: 353 YTAPVGAHGTILLEL 367
>gi|317025918|ref|XP_001388572.2| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Aspergillus
niger CBS 513.88]
Length = 409
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 176/395 (44%), Gaps = 108/395 (27%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
L P MGWNSWN F +N I++E DD W +R S
Sbjct: 31 LAQKPQMGWNSWNAFKATVNYTIVQEVISLFDTLGLKEAGYEYVLLDDGWASYNRTSD-- 88
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNS-- 116
G L AN T+FP G KALA VH L +S
Sbjct: 89 ------------------------GYLQANATSFPQGIKALAQEVHGKGLKLGLYGDSGH 124
Query: 117 -----------LDNRIESNIAFEQGIDYLKYDNC--FNDDTR-PTIRYRAMSDALKKAGR 162
+ R A G+DYLKYDNC F T P IR+ AM +AL +GR
Sbjct: 125 YTCAWRPGSWGYEERDAQTFA-GWGVDYLKYDNCGGFQSMTEAPQIRFGAMKNALTLSGR 183
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA--------------------RMLT 202
IF+S+C WG P WG +I +S+R + DI+ S+ +LT
Sbjct: 184 DIFYSVCGWGYQFPWHWGGDIGHSYRMSGDITTSFTNETECQCKTAYCLNTGYAGCSVLT 243
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
I + + + + PG W D DMLEVGN N+ HF+ WA K+PL++G D+ L+
Sbjct: 244 IINKMKEISQYQTPGHWLDMDMLEVGNANFTLNQQQTHFAFWAALKSPLIIGADLSKLSN 303
Query: 263 ETMPIIGNEEVIAVNQ-AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSM 308
+++ ++ N+ +I++NQ A+ + Q+WAG + +V+LL + K++
Sbjct: 304 DSLAVLTNKAIISINQDALGEPVTYREAHSKEGLFQVWAGKVEDGYVVLLL-NEKSYPQT 362
Query: 309 MTAHWDDIGIKSNNT---LFEHRTLE--TKFQGKL 338
++ + +G+ S L+ +TL+ +KF G L
Sbjct: 363 VSLSFASLGLGSPQKVTELWSGQTLQDLSKFNGTL 397
>gi|361125030|gb|EHK97092.1| putative Alpha-galactosidase [Glarea lozoyensis 74030]
Length = 459
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 141/299 (47%), Gaps = 74/299 (24%)
Query: 96 TTFPSGTKALADYVHIL-----------------ISNSLDNR-IESNIAFEQGIDYLKYD 137
T FP G A+AD +H L SL + +++N E G+DYLKYD
Sbjct: 38 TKFPRGMAAVADDIHALNLGFGMYSDAGRYTCGMYEGSLGHETVDANSFAEWGVDYLKYD 97
Query: 138 NCFNDDTR-----PTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDD 192
NC+N+ + RYR M DAL GRPI +S+C WG+ P WGS + NSWR + D
Sbjct: 98 NCYNEGYAGNQLISSTRYRTMGDALNATGRPILYSLCNWGEDYPWNWGSTVANSWRISGD 157
Query: 193 ISDSW----AR-------------------MLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
I D W AR + I + A+PG WND DMLEVGN
Sbjct: 158 IFDDWDAYDARCPCEGPDAWNCELPGFHCSVTNIFNKVAFIVSKAQPGAWNDLDMLEVGN 217
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQ------ 283
G M EYV HFS+W+ K+PL++G D+ ++ + + I+ N VIAVNQ Q
Sbjct: 218 GAMTDPEYVAHFSMWSAVKSPLIMGNDLRDIAPQDLSILSNAAVIAVNQDPLGQSAARRW 277
Query: 284 ----------------QIWAGPLSG------NRIVVLLESRKTFSSMMTAHWDDIGIKS 320
Q+W+G L G N VVLL + +++M A +DI + S
Sbjct: 278 LCHGNGTNGESGGSGIQLWSGNLKGTVHEDYNDYVVLLVNGNNNATVMNATLEDIFVDS 336
>gi|388496374|gb|AFK36253.1| unknown [Medicago truncatula]
Length = 150
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 82/149 (55%), Positives = 102/149 (68%), Gaps = 17/149 (11%)
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
MLEVGNGGM NEY+VHFS+WAISKAPLLLGCDV N+T ET+ I+ N+EVIAVNQ
Sbjct: 1 MLEVGNGGMTKNEYIVHFSLWAISKAPLLLGCDVRNVTKETLEIVSNKEVIAVNQDSLGV 60
Query: 279 ------AINDQQIWAGPLSGNRIVVLLESRKTFSSM-MTAHWDDIGIKSNNT-----LFE 326
D +IWAGPLSG R+ V+L ++ T + +TA+WDDIGI L+E
Sbjct: 61 QAKKVRMEGDAEIWAGPLSGYRVAVVLLNKGTQKHIDITANWDDIGIPPKTVVQARDLWE 120
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTPV 355
H+TL+T F KL A V+ H+CKM+ L PV
Sbjct: 121 HKTLKTPFVNKLRATVESHACKMFVLKPV 149
>gi|401885367|gb|EJT49486.1| alpha-galactosidase [Trichosporon asahii var. asahii CBS 2479]
Length = 400
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/397 (30%), Positives = 175/397 (44%), Gaps = 66/397 (16%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
+ +++S S G+D L NGL TPPMGWNSWN C +NE I+R D +S
Sbjct: 10 ITVLASLLLASTGTD---ALDNGLARTPPMGWNSWNAVRCAVNETIVRSAADALVDSG-- 64
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
L+ + + + L + G L A+ T FPSG LA YV
Sbjct: 65 ----LAAAGYQYIVLDDCW-QDYKRGADGALRAHPTKFPSGIATLASYVRSKGLKFGLYG 119
Query: 111 -------ILISNSLDNRIESNIAFEQ---------GIDYLKYDNCFNDDT----RPTIRY 150
+I + + EQ G+ YLKYD C D +PT +
Sbjct: 120 SPGTRTCAMIYDKYPGTGLGSKGHEQQDADTWAAWGVQYLKYDWCLADKDGLVHQPT--F 177
Query: 151 RAMSDALKKAGR--PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
M DAL G+ I +S+ E+G +P W + + N WRTT DI W + I +
Sbjct: 178 ERMRDALAATGKGGEITYSISEYGYTKPWTWAAPVANLWRTTADIQPRWDSVARIIESQA 237
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
A + PG WNDPDML++GNG E H ++WA+ APL++G V L +T+ ++
Sbjct: 238 ALAGTSAPGAWNDPDMLQIGNGKFTPEETRSHVAMWAMLAAPLMVGTLVDKLPQQTLDVL 297
Query: 269 GNEEVIAVN------QAINDQQ-----IWAGPLSGNR--IVVLLESRKTFSSMMTAHWDD 315
N ++A++ QA QQ +W LSG R I V T S ++ D
Sbjct: 298 ANPRLVAIDQDSLGKQAARIQQTAGVDLWTRELSGGRQAIAVFNTGNSTQSVTLSLGQDA 357
Query: 316 --IGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMY 350
+ S N + H T K+TA+V H ++
Sbjct: 358 ALTDVWSGNAV-AHGT------AKVTAQVKAHDTAVF 387
>gi|350637801|gb|EHA26157.1| alpha-galactosidase [Aspergillus niger ATCC 1015]
Length = 384
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 117/381 (30%), Positives = 178/381 (46%), Gaps = 80/381 (20%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS-----LTQI 78
L P MGWNSWN F +N I++E ISL + + ++ L
Sbjct: 6 LAQKPQMGWNSWNAFKATVNYTIVQEV------------ISLFDTLGLKEAGYEYVLLDD 53
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNS-------------LDN 119
G L AN T+FP G KALA VH L +S +
Sbjct: 54 GWASYNRTSDGYLQANATSFPQGIKALAQEVHGKGLKLGLYGDSGHYTCAWRPGSWGYEE 113
Query: 120 RIESNIAFEQGIDYLKYDNC--FNDDTR-PTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
R A G+DYLKYDNC F T P IR+ AM +AL +GR IF+S+C WG P
Sbjct: 114 RDAQTFA-GWGVDYLKYDNCGGFQSMTEAPQIRFGAMKNALALSGRDIFYSVCGWGYQFP 172
Query: 177 ALWGSNIRNSWRTTDDISDSWAR--------------------MLTIADMNEVYADHAKP 216
WG +I +S+R + DI+ S+ +LTI + + + + P
Sbjct: 173 WHWGGDIGHSYRMSGDITTSFTNETECQCKTAYCLNTGYAGCSVLTIINKMKEISQYQTP 232
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G W D DMLEVGN N+ HF+ WA K+PL++G D+ L+ +++ ++ N+ +I++
Sbjct: 233 GHWLDMDMLEVGNANFTLNQQQTHFAFWAALKSPLIIGADLSKLSNDSLAVLTNKAIISI 292
Query: 277 NQ-AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN 322
NQ A+ + Q+WAG + +V+LL + K++ ++ + +G+ S
Sbjct: 293 NQDALGEPVTYREAHSKEGLFQVWAGKIEDGYVVLLL-NEKSYPQTVSLTFASLGLGSPQ 351
Query: 323 TLFE---HRTLE--TKFQGKL 338
+ E +TL+ +KF G L
Sbjct: 352 KVIELWSGQTLQDFSKFNGVL 372
>gi|388506542|gb|AFK41337.1| unknown [Medicago truncatula]
Length = 150
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/149 (54%), Positives = 102/149 (68%), Gaps = 17/149 (11%)
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
MLEVGNGGM NEY+VHFS+WAISKAPLLLGCDV N+T ET+ ++ N+EVIAVNQ
Sbjct: 1 MLEVGNGGMTKNEYIVHFSLWAISKAPLLLGCDVRNVTKETLEVVSNKEVIAVNQDSLGV 60
Query: 279 ------AINDQQIWAGPLSGNRIVVLLESRKTFSSM-MTAHWDDIGIKSNNT-----LFE 326
D +IWAGPLSG R+ V+L ++ T + +TA+WDDIGI L+E
Sbjct: 61 QAKKVRMEGDAEIWAGPLSGYRVAVVLLNKGTQKHIDITANWDDIGIPPKTVVQARDLWE 120
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTPV 355
H+TL+T F KL A V+ H+CKM+ L PV
Sbjct: 121 HKTLKTPFVNKLRATVESHACKMFVLKPV 149
>gi|359406211|ref|ZP_09198920.1| alpha-galactosidase [Prevotella stercorea DSM 18206]
gi|357556629|gb|EHJ38218.1| alpha-galactosidase [Prevotella stercorea DSM 18206]
Length = 440
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 176/388 (45%), Gaps = 72/388 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKP 82
L PPMGW +WN F INE++IRET D E RD+ + D L Q
Sbjct: 65 LAPKPPMGWMTWNLFQGNINEQLIRETADAMVEGGFRDA----GYEYIFIDDLWQGG--- 117
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ- 129
Q N++ + FPSG KALADYVH S L I S+ A FE+
Sbjct: 118 --RDRQNNIIPDPEKFPSGIKALADYVH---SKGLKLGIYSDAAQLTCGGWTASLGFEEQ 172
Query: 130 --------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
GIDYLKYD C D RYR M+DAL+K+GR I +CEWG + W
Sbjct: 173 DARTFASWGIDYLKYDYCNAPEDSATARQRYRTMADALQKSGRDIVLGICEWGQRQCEEW 232
Query: 180 GSNIRNS-WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDML----- 225
+ WRT+ D+ D W +L I ++ A HA+PG W D DML
Sbjct: 233 CEEVGGQLWRTSYDVRDMWKDIVKEGGMGILDIVNITAPLAKHARPGQWPDMDMLVVGLR 292
Query: 226 -------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
++G G EY S+W + + L + D+ N++ + I+ N+E+IA+NQ
Sbjct: 293 GTGGPSSDLGGVGCTQTEYQTQMSMWCMLSSVLAMTNDLRNVSDDDRRILLNKEIIAINQ 352
Query: 279 ---------AIND--QQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---L 324
+N+ QQ++ PL+ V + + + ++ ++ +G+ T +
Sbjct: 353 DALGKAAERIVNEPTQQVFVKPLADGSHAVAILNPSDAAQRVSLNFSTVGLNGKYTVRDV 412
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
++HR + K K V H K++ L
Sbjct: 413 WQHRDI-AKRANKWKGTVTAHETKVFVL 439
>gi|322437510|ref|YP_004219600.1| alpha-galactosidase [Granulicella tundricola MP5ACTX9]
gi|321165403|gb|ADW71106.1| Alpha-galactosidase [Granulicella tundricola MP5ACTX9]
Length = 524
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/287 (35%), Positives = 146/287 (50%), Gaps = 35/287 (12%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
+++ NGL TPPMGWNSWN F +I DD +S D+ ++ + + +
Sbjct: 154 KDIPDNGLVRTPPMGWNSWNKFAGKIT------ADDV--KSMADAMVATGMNKAGYQYIN 205
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI----ESNIAF 127
G + N FP K LADYVH I I +S + E +
Sbjct: 206 IDDTWEADRAADGTIQTNNK-FPD-MKGLADYVHSKGLKIGIYSSPGGKTCAGYEGSFGH 263
Query: 128 EQ---------GIDYLKYDNC----FNDDTRPTIR--YRAMSDALKKAGRPIFFSMCEWG 172
E GIDYLKYD C + T R Y+ M +AL ++ PI +S+C++G
Sbjct: 264 EAQDAKTFAAWGIDYLKYDLCGARAIYESTLENERGLYQKMGEALSQSSHPIVYSLCQYG 323
Query: 173 DMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
D WGS + N WRTT DI D+W M I + + + G WNDPDMLE+GNGG
Sbjct: 324 DADVWKWGSKVGGNLWRTTGDIRDTWESMDKIGFSQIAISSYTRAGHWNDPDMLEIGNGG 383
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
M +EY H S+W++ APL+ G D+ +++ ET I+ N+EVIA++Q
Sbjct: 384 MTPDEYRTHMSLWSMLAAPLIAGNDLRSMSPETSSILLNKEVIAIDQ 430
>gi|387207376|gb|AFJ69054.1| alpha-galactosidase, partial [Nannochloropsis gaditana CCMP526]
gi|422293824|gb|EKU21124.1| alpha-galactosidase, partial [Nannochloropsis gaditana CCMP526]
Length = 139
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 70/128 (54%), Positives = 84/128 (65%)
Query: 128 EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
E G+DYLKYDNC D T P RY M DAL GRP+ +SMCEWG P W + N W
Sbjct: 12 EWGVDYLKYDNCHTDGTSPQERYPPMRDALNATGRPVLYSMCEWGLDNPGAWAPAVSNLW 71
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
RTT DI D W+ ++ I ++N +A PGG+NDPDMLEVGNGGM EY H S+W +
Sbjct: 72 RTTPDIRDEWSSVMEIVEINGRRWRYAGPGGFNDPDMLEVGNGGMGLEEYRAHMSLWCVM 131
Query: 248 KAPLLLGC 255
KAPLL+GC
Sbjct: 132 KAPLLIGC 139
>gi|291227707|ref|XP_002733825.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 403
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 172/371 (46%), Gaps = 65/371 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHM-FDSLTQI 78
L NGL TPPMGW +W F C I+ + + D+C GE ++ + + HM D +
Sbjct: 15 LDNGLARTPPMGWMAWERFRCNID--CVNDPDNCIGE-----KLFMQIADHMSTDGFKDV 67
Query: 79 AAKPLP-NQC--------QGNLVANKTTFPSGTKALADYVH-----------------IL 112
+ + + C G L N T FPSG K LADYVH
Sbjct: 68 GYEYVAIDDCWMSHQRDSNGKLYGNTTRFPSGIKRLADYVHSKGLKLGIYEDYGKLTCAG 127
Query: 113 ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
SLD+ +++ + G+DYLK+D C+++ Y M+ AL K G PI FS C W
Sbjct: 128 YPGSLDHLEVDAQTFADWGVDYLKFDGCYSNPKVMDTGYPQMTKALNKTGIPIVFS-CSW 186
Query: 172 GD------MRP--ALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGW 219
D M+P L G N N WR +DI DSW + +I D ++ A PG W
Sbjct: 187 PDYQRASGMKPNYTLIGDNC-NLWRNFNDIQDSWDSVTSIIDYYAKEHDTLAAAQGPGKW 245
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDM+ +G+ G+ Y++ ++W+I +PL++ D+ +++ E I+ N+E+IAVNQ
Sbjct: 246 NDPDMVIIGDFGLSYDQSKSQMAMWSIFASPLMMSNDLRSVSDEAKEILLNKEIIAVNQD 305
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKT--FSSMMTAHWDDIGIKSNNTLFE 326
++W PL + +R T ++ D+G NT +
Sbjct: 306 ALGVMGRQVYKTGAMEVWTKPLVNKSFATVFLNRNTNGMPRSISMTLKDMGY---NTGVD 362
Query: 327 HRTLETKFQGK 337
H TL + K
Sbjct: 363 HYTLRDVYLHK 373
>gi|390599249|gb|EIN08646.1| glycoside hydrolase [Punctularia strigosozonata HHB-11173 SS5]
Length = 389
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 138/274 (50%), Gaps = 33/274 (12%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ- 88
MG+N+WN F C INE + T + +SL + + L N+
Sbjct: 1 MGYNAWNAFQCNINEDLFLTTANL--------MVSLGLKDAGYHYINLDDCYALKNRTSS 52
Query: 89 GNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIAF--EQ 129
G +V + FPSG ++L D +H + S + + F E
Sbjct: 53 GAIVEDPAKFPSGMRSLTDKIHAMGFKAGIYGDSGWYTCAGYPGSFQHEAQDAHTFFDEW 112
Query: 130 GIDYLKYDNC---FNDDTRPTI--RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
D LK+DNC ++D R I +Y+ M+DA+ K RPI FS+CEWG + LWG+ +
Sbjct: 113 NFDLLKFDNCAIPYDDIIRQGIIGKYQRMADAIAKVARPIVFSLCEWGWSQVWLWGAKLG 172
Query: 185 NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIW 244
SWRTT DI W + I + N G ND DM+++GNGG+ Y+E HF+ W
Sbjct: 173 QSWRTTGDIGPDWPSLANIINFNAFITQATNFYGRNDMDMVQLGNGGLTYDEAKSHFTAW 232
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
A+ K+PLL+G ++ +T + + I+ N+E+IA+NQ
Sbjct: 233 ALMKSPLLIGTNLSAITNDVLGILKNKEIIAINQ 266
>gi|390602280|gb|EIN11673.1| glycoside hydrolase [Punctularia strigosozonata HHB-11173 SS5]
Length = 457
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 136/299 (45%), Gaps = 72/299 (24%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISL 65
G P +GWNSWN + C INE + DDCW +RDS
Sbjct: 90 GRLPALGWNSWNAYGCNINETKVTAAANQFISLGLKDAGYEYVNIDDCWALMARDS---- 145
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
G +V + FP+G +++ VH L
Sbjct: 146 ---------------------ATGKIVPDPAKFPNGLTSVSSQVHDLGLKFGIYSDAGTA 184
Query: 113 ----ISNSLDNR-IESNIAFEQGIDYLKYDNC--------FNDDTRPTIRYRAMSDALKK 159
SL N ++++ G+DYLKYDNC +N +T IRYR M L
Sbjct: 185 TCAGFPGSLGNENLDASTFSSWGVDYLKYDNCNPVPGNDEYNSNT--AIRYRQMGAELAV 242
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
PI FS+C WG P LWGS + +SWR + D S SW+ + +I ++N Y D+
Sbjct: 243 QANPIQFSLCIWGVDSPWLWGSRVGHSWRMSGDSSASWSYITSIINLNAQYLDYVTFFSH 302
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
ND DM+E+GNG + E HF+ WA K+P+LLG D+ L+ + + II N E++A +Q
Sbjct: 303 NDMDMMEIGNGDLTIEEQRTHFAAWAFMKSPILLGTDLSTLSTDQLAIIKNTELLAFSQ 361
>gi|389639786|ref|XP_003717526.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
gi|351643345|gb|EHA51207.1| alpha-galactosidase [Magnaporthe oryzae 70-15]
Length = 427
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 170/386 (44%), Gaps = 66/386 (17%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET---------- 50
F ++ A +N S L NGLG TP +GWNSWN C E +T
Sbjct: 8 FASTAILALANASPMEKRLENGLGRTPALGWNSWNVAQCNAATEAFALDTANRFISMGLK 67
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW R+S L + KP+ ++ + +G
Sbjct: 68 DLGYTYVNIDDCWSTMQRNSSGYLVADPKKWPR----GIKPVVDEIHAKGLKFGLYGSAG 123
Query: 102 TKALADYVHILISNSLDNRIESNIAFEQGIDYLKYDNCFND------DTRPTIR------ 149
TK A Y S + + ++ + E G+DY K+DNC+ T P +
Sbjct: 124 TKTCAGYPA---SQGYEGK-DAQLLAEWGVDYWKHDNCYTPCRQGLPQTCPENQVAGNTR 179
Query: 150 --YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
Y M DA+ +PIFFS+C WG R WG + NSWR + DI + WA ++ I
Sbjct: 180 TWYGTMRDAVLATKKPIFFSLCNWGRDRVWEWGKDYGNSWRMSIDIWNDWASVIRIGSAA 239
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
A ++ PGG+ND DM+++ NG + + H IWAI+K+P++LG D+ ++A ++ I
Sbjct: 240 AGIAQYSAPGGFNDLDMMQISNGALNPAQERTHMGIWAIAKSPIILGMDLSKISASSLAI 299
Query: 268 IGNEEVIAVNQ-----------------AINDQ--QIWAGPLSGNRIVVLLESRKTFSSM 308
I N+ +IA+NQ ++ Q WAGPLS +V L + T
Sbjct: 300 IKNKGLIAINQDKLGKAATYFQPPGKPAPVSGQLYPYWAGPLSDGVVVGLTNAMGTGRQT 359
Query: 309 MTAHWDDI-----GIKSNNTLFEHRT 329
+ + D+ G S ++ RT
Sbjct: 360 LAVDFKDVPGLGAGTWSWTEMYSGRT 385
>gi|320106911|ref|YP_004182501.1| alpha-galactosidase [Terriglobus saanensis SP1PR4]
gi|319925432|gb|ADV82507.1| Alpha-galactosidase [Terriglobus saanensis SP1PR4]
Length = 394
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 171/368 (46%), Gaps = 71/368 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS---LTQI 78
N L TPPMGWNSWN F ++ +K IR+T D L S M D+ I
Sbjct: 24 NTLAATPPMGWNSWNFFAGRVTDKDIRDTAD------------LLVSTGMRDAGYIYVNI 71
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ--------- 129
+ ++ FP KALADYVH S L I S+ E
Sbjct: 72 DDTWEGKRDAKGVLHTNGKFPD-MKALADYVH---SKGLKLGIYSSPGRETCAHYAGSYQ 127
Query: 130 ------------GIDYLKYDNC-FNDDTR---PTIR----------YRAMSDALKKAGRP 163
GIDYLKYD C F D+ + P R Y M A+ K GRP
Sbjct: 128 HEEQDAKLYADWGIDYLKYDLCSFRDNMQQEAPNDRAKQNRMMRDAYEKMHQAILKTGRP 187
Query: 164 IFFSMCEWGDMRPALWGSNI-RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
+ +S+C++G WG + N WRTTDDI+ + +M IA A +A PG WNDP
Sbjct: 188 MIYSLCQYGFDSVWEWGPEVGANLWRTTDDINAHYQQMSYIALTQAGLAKYAGPGHWNDP 247
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AIN 281
DMLEVGNG + ++E + H ++WAI APLL G ++ ++ E I+ N EVIA++Q +
Sbjct: 248 DMLEVGNGKLTHDENLTHMTMWAILAAPLLAGNNLTAMSPEVKSILTNREVIAIDQDPLG 307
Query: 282 DQ----------QIWAGPLS-GNRIVVLLESRKTFSSM--MTAHWDDIGIKSNNTLFEHR 328
Q QIW+ PL+ G+ + + + + M M H +IG+ N++
Sbjct: 308 KQGDRIWGEGPLQIWSRPLADGSHALAIFNFGEDTAEMRGMNLHLKEIGL--GNSVHARN 365
Query: 329 TLETKFQG 336
E K G
Sbjct: 366 VWEAKDLG 373
>gi|440464931|gb|ELQ34281.1| alpha-galactosidase [Magnaporthe oryzae Y34]
Length = 811
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 108/368 (29%), Positives = 164/368 (44%), Gaps = 61/368 (16%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET---------- 50
F ++ A +N S L NGLG TP +GWNSWN C E +T
Sbjct: 8 FASTAILALANASPMEKRLENGLGRTPALGWNSWNVAQCNAATEAFALDTANRFISMGLK 67
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW R+S L + KP+ ++ + +G
Sbjct: 68 DLGYTYVNIDDCWSTMQRNSSGYLVADPKKWPR----GIKPVVDEIHAKGLKFGLYGSAG 123
Query: 102 TKALADYVHILISNSLDNRIESNIAFEQGIDYLKYDNCFND------DTRPTIR------ 149
TK A Y S + + ++ + E G+DY K+DNC+ T P +
Sbjct: 124 TKTCAGYP---ASQGYEGK-DAQLLAEWGVDYWKHDNCYTPCRQGLPQTCPENQVAGNTR 179
Query: 150 --YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
Y M DA+ +PIFFS+C WG R WG + NSWR + DI + WA ++ I
Sbjct: 180 TWYGTMRDAVLATKKPIFFSLCNWGRDRVWEWGKDYGNSWRMSIDIWNDWASVIRIGSAA 239
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
A ++ PGG+ND DM+++ NG + + H IWAI+K+P++LG D+ ++A ++ I
Sbjct: 240 AGIAQYSAPGGFNDLDMMQISNGALNPAQERTHMGIWAIAKSPIILGMDLSKISASSLAI 299
Query: 268 IGNEEVIAVNQ-----------------AINDQ--QIWAGPLSGNRIVVLLESRKTFSSM 308
I N+ +IA+NQ ++ Q WAGPLS +V L + T
Sbjct: 300 IKNKGLIAINQDKLGKAATYFQPPGKPAPVSGQLYPYWAGPLSDGVVVGLTNAMGTGRQT 359
Query: 309 MTAHWDDI 316
+ + D+
Sbjct: 360 LAVDFKDV 367
>gi|67541977|ref|XP_664756.1| hypothetical protein AN7152.2 [Aspergillus nidulans FGSC A4]
gi|40742214|gb|EAA61404.1| hypothetical protein AN7152.2 [Aspergillus nidulans FGSC A4]
gi|259483517|tpe|CBF78972.1| TPA: Alpha-galactosidase [Source:UniProtKB/TrEMBL;Acc:Q1HFR9]
[Aspergillus nidulans FGSC A4]
Length = 640
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 173/386 (44%), Gaps = 47/386 (12%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS 61
+V+++A S + L +GL TP MGWN++N ++C NE I+ E ++
Sbjct: 7 MVVAATALASPAPALKPRLDDGLARTPQMGWNTYNQYNCFPNESIVHENAQALVDTGLAD 66
Query: 62 ----QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS 114
+++ C + + D L PN G + N FP G A+ Y+H +L
Sbjct: 67 LGYRYVTIDCGWGVEDRL--------PN---GTITWNPELFPQGFPAMGQYLHDLGLLFG 115
Query: 115 NSLDNRIESNIAFEQGIDYLKYDNCFND------------DTRPTIRYRAMSDALKKAGR 162
D+ I I YDNC++D T P R+ MS ++ R
Sbjct: 116 VYGDSGILL-CGSPPNITGSLYDNCYSDAATNYPNVNYAPSTSPHPRFANMSRYIQAQDR 174
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
I F +CEWG PALW I +SWR +DI W + + D A PG W D
Sbjct: 175 DILFQVCEWGIDFPALWAPEIGHSWRIGNDIIPHWRSIFRTLNQAVPQTDFAGPGQWPDL 234
Query: 223 DMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
DML VG +G + E HFS+W+I K+PL +G + + AE++ I+ N +VIA NQ
Sbjct: 235 DMLLVGLDGVLTVPEEQTHFSLWSILKSPLTIGAAIPGMRAESLEILSNADVIAFNQDAL 294
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHR 328
+ ++W+GPL G R + + + + +T DIG++ TL
Sbjct: 295 GVSAALRRRWSDEGYEVWSGPLEGGRTIAAVINWRDEDREITLDLPDIGLQYAETLQNVW 354
Query: 329 TLET--KFQGKLTAKVDGHSCKMYEL 352
ET + ++ V+ H + +L
Sbjct: 355 ADETVNGVKTSYSSVVEAHGVMLVQL 380
>gi|297204458|ref|ZP_06921855.1| carbohydrate binding family 6 protein [Streptomyces sviceus ATCC
29083]
gi|197715799|gb|EDY59833.1| carbohydrate binding family 6 protein [Streptomyces sviceus ATCC
29083]
Length = 548
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 117/415 (28%), Positives = 171/415 (41%), Gaps = 119/415 (28%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C + E +R+ DDCW + RD
Sbjct: 41 LTPPLGWNSWNSFGCGVTEAQVRQAADAMVSSGMRDAGYRYVVVDDCWFDPQRDGA---- 96
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL AN T FPSG KAL DY+H
Sbjct: 97 ----------------------GNLRANPTKFPSGMKALGDYIHGKGLKFGIYQVPGDRT 134
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C + TR R+ M DAL+
Sbjct: 135 CAQTSGAYPGSTGSRGHETQDAATFASWGVDYLKYDWCSSSGTRDEQVARFTLMRDALRA 194
Query: 160 AGRPIFFSM------------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT----- 202
GRPI +S+ WG+ + + WRTT+D+ D W T
Sbjct: 195 TGRPIVYSINPNSFHAITGATYNWGE---------VADLWRTTEDLLDIWQNGNTNSYPM 245
Query: 203 ----IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
+ D+ A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+
Sbjct: 246 GVGNVLDITAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWALLSAPLMAGNDIR 305
Query: 259 NLTAETMPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSS 307
++A+ I+ N ++AVNQ D +++A PLS + V L +R ++
Sbjct: 306 TMSADVSAILRNPRLLAVNQDALGAGGRRVRDDGDTEVFAKPLSDGSVAVGLFNRGGGTA 365
Query: 308 MMTAHWDDIGIKSN----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+T +G+ L+ T T G+++A V H ++ ++ + L
Sbjct: 366 TVTTTAAQVGLSGGPFTLTDLWSGGTSSTS--GQISASVPAHGVAVFRVSGGSPL 418
>gi|330844389|ref|XP_003294110.1| hypothetical protein DICPUDRAFT_58841 [Dictyostelium purpureum]
gi|325075485|gb|EGC29366.1| hypothetical protein DICPUDRAFT_58841 [Dictyostelium purpureum]
Length = 420
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 169/404 (41%), Gaps = 94/404 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ---INEKIIRET-------------------DDCWGES 57
L NGL LTP M WN WN F C+ INE + E DDCW
Sbjct: 20 LDNGLALTPIMAWNPWNKFGCETSLINETMFMEMAYAMASNGMANAGYQYINLDDCWFAK 79
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
+RD+ G L+A+ FP G LA Y+H L
Sbjct: 80 TRDN-------------------------VTGQLIADPVRFPRGIGFLATYIHSLGLKFG 114
Query: 113 ISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
I + +++ E G+DY+K D C + + Y + LK
Sbjct: 115 IYGDIGTETCMGYPGSAGYLELDAKTFAEWGVDYVKMDGCNYPEDKMQEAYTQLGQYLKS 174
Query: 160 AGRPIFFSMCEWGDM-------RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY-- 210
RP+ +S C W P + I N WR DI+D++ + I D E+
Sbjct: 175 TNRPMVYS-CSWPTYAYVQNISMPFNYIEGICNLWREFQDITDNFDEWVKIIDEMEIMKP 233
Query: 211 --ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+ A PG WNDPDMLE+GNG EY FS+WAI APL+ G D+ + ET+ I+
Sbjct: 234 DRSGFAGPGHWNDPDMLEIGNGNQTNTEYKSMFSLWAILAAPLVAGNDLRTMDQETLDIL 293
Query: 269 GNEEVIAVNQ--------AIN---DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
N +VIAVNQ +N + +IW PL N I V L +R SS +T D +
Sbjct: 294 INTDVIAVNQDPLGIQGSRVNKNGNLEIWKRPLVNNSIAVALFNRGPTSSNITITNDILN 353
Query: 318 IKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
I +N L+ H T F TA V H + +L+P+
Sbjct: 354 ITNNQNYNIMDLWTH-TSNGTFYNSFTAMVPSHGTVLIKLSPIV 396
>gi|302408715|ref|XP_003002192.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
gi|261359113|gb|EEY21541.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
Length = 501
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 154/322 (47%), Gaps = 86/322 (26%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIR-----------ETDDCWGESSRDSQISLSCSFH 70
+GL TPPMGW H +E+++ DDCW +
Sbjct: 22 DGLADTPPMGW----HLLLSTSERVVSLGLRDLGYNTVVLDDCWQD-------------- 63
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI-----------------LI 113
P +G + + FP G KA++D +H
Sbjct: 64 -----------PAGRDAKGKVQPDLAKFPRGMKAISDALHAQNLKFGMYSSAGELTCARF 112
Query: 114 SNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFS 167
+ SLD+ + +F G+D+LKYDNCF+ P I R++AMSDALK +GR I +
Sbjct: 113 AGSLDHERDDADSFAAWGVDFLKYDNCFHMGRMGTPEISFNRFKAMSDALKASGRDIALN 172
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWAR------MLTIAD---------------M 206
+C WG+ WG+++ ++WR +DDI DS+ R ++AD +
Sbjct: 173 LCNWGEDYVHTWGASLAHAWRMSDDIYDSFTRPDDLCSCASVADPFCVAPGTQCSVLFIL 232
Query: 207 NEV--YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
N+V +AD A PGGWND DMLEVG GGM EY HF++WA K+PL+LG D+ + +
Sbjct: 233 NKVAPFADRAIPGGWNDLDMLEVGQGGMTDEEYKAHFALWAALKSPLMLGNDLRIMDSAA 292
Query: 265 MPIIGNEEVIAVNQAINDQQIW 286
+ II N +IA++Q + + ++
Sbjct: 293 LSIINNPAIIALSQDPHGRAVY 314
>gi|383810261|ref|ZP_09965757.1| alpha-galactosidase domain protein [Prevotella sp. oral taxon 306
str. F0472]
gi|383357006|gb|EID34494.1| alpha-galactosidase domain protein [Prevotella sp. oral taxon 306
str. F0472]
Length = 409
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 118/380 (31%), Positives = 170/380 (44%), Gaps = 64/380 (16%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + INE++IR+ D + + D + D Q
Sbjct: 31 DSLALTPPMGFMTWNKYGEDINEQLIRQIAD---KMAADGYADAGYQYIFIDDAWQGGRD 87
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ 129
+ N++ + FPSG KALADYVH S L I S+ A FE+
Sbjct: 88 K-----RNNIIPDPVKFPSGMKALADYVH---SRGLKLGIYSDAAPLTCAGYTASYNFEE 139
Query: 130 ---------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
G+DYLKYD C +D RY+ M DAL+K+GR I +CEWG + P L
Sbjct: 140 QDAKTFAEWGMDYLKYDYCHAPSDSAVAHERYKRMGDALEKSGRKIALGVCEWGQLNPEL 199
Query: 179 WGSNIRNS-WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDML---- 225
W S WR + D+ D W +L I D+ E A PG WND DML
Sbjct: 200 WARQAGGSLWRISYDVRDMWKDIVKQGGMGILDIIDITEPLYSFAGPGHWNDMDMLIVGL 259
Query: 226 --------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
++G G Y EY S+W + +PL + D+ N A T I+ N+E+IA+N
Sbjct: 260 EGKGGPSSDLGGIGCTYTEYQTQMSMWCMFASPLAMSHDILNENAATRRILLNKEIIAIN 319
Query: 278 QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK 337
Q + R+V + + + +T + I I + + + +L F GK
Sbjct: 320 QDALGK--------AARLVQRIRECRVYHRQLTNNRQAIAIMNPSDKAQRISLPLSFLGK 371
Query: 338 LTAKVDGHSCKMYELTPVTK 357
+AK D ++ T V K
Sbjct: 372 -SAKYDFRDVWEHKNTSVKK 390
>gi|395333174|gb|EJF65552.1| glycoside hydrolase [Dichomitus squalens LYAD-421 SS1]
Length = 432
Score = 153 bits (387), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 144/293 (49%), Gaps = 43/293 (14%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
S+ +G P +GWNSWN + C INE I + D + ISL + ++ +
Sbjct: 51 SHQVGRLPALGWNSWNAYSCNINETKILDAADLF--------ISLGLADAGYEYVNIDDC 102
Query: 81 KPLPNQ--CQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRI 121
PL + G +V + + FPSG +AD VH L SL N
Sbjct: 103 WPLETRDASTGRIVPDPSKFPSGISGVADQVHALGLKLGIYSDAGTNTCAGFPGSLGNET 162
Query: 122 ESNIAF-EQGIDYLKYDNC-----FNDDTRP----------TIRYRAMSDALKKAGRPIF 165
AF E G+DYLKYDNC + D P IRYR M+ AL + G+P+
Sbjct: 163 VDAEAFAEWGVDYLKYDNCNVPSNWTDAATPPDNDWYNSNSAIRYRQMTAALNQTGKPVH 222
Query: 166 FSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML 225
FS+C WGD WG + +SWR T D+S SW+ + +I N + D ND DM+
Sbjct: 223 FSLCIWGDANVWEWGDRVGHSWRMTGDVSASWSSISSIIAANAQHLDSVDFFSHNDMDMM 282
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
E+GNG + E HF+ WA K+P+LLG D+ NL + + II N +++A +Q
Sbjct: 283 EIGNGDLTLEEQRTHFAAWAFLKSPILLGTDLNNLNSTQLDIIKNAQLLAFHQ 335
>gi|336427937|ref|ZP_08607928.1| hypothetical protein HMPREF0994_03934 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336008686|gb|EGN38699.1| hypothetical protein HMPREF0994_03934 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 376
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 144/319 (45%), Gaps = 85/319 (26%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPPMGWNSWN F INE++I ET DDCW RD
Sbjct: 4 LTPPMGWNSWNTFGENINEELIFETADAMVNTGLLEKGYEYLVIDDCWSLKERDG----- 58
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------------- 110
NQ LV + FP G KA+ADYVH
Sbjct: 59 ------------------NQ---RLVPDPVKFPHGMKAVADYVHSKGLKFGMYSCAGNLT 97
Query: 111 -ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIR----YRAMSDALKKAGRPI 164
S ++ F E G+D+LKYD C++ P I YR M AL+ GR I
Sbjct: 98 CAGYPGSFEHEFTDARTFAEWGVDFLKYDYCYHS---PIIHGQYLYRRMGLALENCGRDI 154
Query: 165 FFSMCEWGDMRPALW-GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPD 223
FS C WG + W ++ + WR+T DI D+W + + E + G +ND D
Sbjct: 155 LFSACSWGADQTQEWIKTSAASMWRSTGDIFDTWESVKDLVKQQEKLHPYNGVGCFNDMD 214
Query: 224 MLEV-----GNGGMKY---NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
ML V GN G+K +Y HFSIWA+ +PL++GCD+ ++ ET I+GNEE+I
Sbjct: 215 MLIVGMYGKGNVGLKGCSDTQYKTHFSIWALLGSPLMIGCDIRSMNKETKAILGNEELIR 274
Query: 276 VN------QAINDQQIWAG 288
+N Q + Q IWAG
Sbjct: 275 INQDSACRQPVKLQGIWAG 293
>gi|443623413|ref|ZP_21107913.1| putative Carbohydrate binding family 6 protein [Streptomyces
viridochromogenes Tue57]
gi|443343031|gb|ELS57173.1| putative Carbohydrate binding family 6 protein [Streptomyces
viridochromogenes Tue57]
Length = 548
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 174/405 (42%), Gaps = 99/405 (24%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RD+
Sbjct: 40 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMKDAGYQYVVVDDCWFDPQRDAA---- 95
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------------- 110
GNL AN T FPSG KAL DY+H
Sbjct: 96 ----------------------GNLRANPTKFPSGMKALGDYIHGKGLKFGIYQAPNEVT 133
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C T R+ M DAL+
Sbjct: 134 CAQGVGTYPGATGSKGHEAQDAATFASWGVDYLKYDWCSGSGTLSEQIARFTIMRDALRA 193
Query: 160 AGRPIFFSMCEWGDMRPAL----WGSNIRNSWRTTDDISDSWARMLT---------IADM 206
GRPI +S+ P WG + + WRTT+D+ D W T + D+
Sbjct: 194 TGRPIVYSINPNSFHAPTGDKYNWG-EVADLWRTTEDLLDIWQNNNTNSYPMGVGNVLDV 252
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 253 TAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWALMSAPLMAGNDIRTMSADVSA 312
Query: 267 IIGNEEVIAVNQ--------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
I+ N ++AVNQ + D +++A PLS + V L +R ++ +T
Sbjct: 313 ILRNPRLLAVNQDPLGAGGRRVRDDGSTEVFAKPLSDGSVAVGLFNRGEGTATVTTTAAQ 372
Query: 316 IGIKSNN-TLFEHRTLETK-FQGKLTAKVDGHSCKMYELTPVTKL 358
+G+ + TL + T T G+++A V H ++++T + L
Sbjct: 373 VGLSGGSFTLTDLWTGATSNTSGQISASVPAHGVAVFKVTGGSPL 417
>gi|33391229|gb|AAQ17218.1| alpha-galactosidase precursor [Saccharomyces pastorianus]
Length = 417
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/332 (32%), Positives = 152/332 (45%), Gaps = 75/332 (22%)
Query: 35 WNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVAN 94
WN F C ++E+++ +T D R S+I L + + L + G LVA+
Sbjct: 1 WNTFACDVSEQLLLDTAD------RISEIGLKDLGYTYVILDDCWSSG--RTANGTLVAD 52
Query: 95 KTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAF-EQGIDYLKY 136
K FP+G +AD++H SL + E F G+DYLKY
Sbjct: 53 KEKFPNGMSHVADHLHNNNFLFGMYSSAGEYTCAGYPGSLGHEEEDAEFFASNGVDYLKY 112
Query: 137 DNCFNDDT-----RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DNC+N RY+AMSDAL K GRPIF+S+C WG WGS+I NSWR +
Sbjct: 113 DNCYNKGQFGAPETSYKRYKAMSDALNKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSG 172
Query: 192 DISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
DI + R ++ I + +A GGWND D LEVG
Sbjct: 173 DIYPQFTRPDSRCPCDGDQFDCAYAGFHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGV 232
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G + +E HFS+WA+ K+PL++G DV +L A + I VIA+NQ
Sbjct: 233 GNLTDDEEKAHFSMWAMVKSPLVIGADVNHLKASSYSIYSQASVIAINQDPKGVPATRVW 292
Query: 279 ----------AINDQQIWAGPL-SGNRIVVLL 299
+ Q W+GPL +G++++ LL
Sbjct: 293 RHQVPQTDKYGQGEIQFWSGPLDNGDQVIALL 324
>gi|440704247|ref|ZP_20885117.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
gi|440274128|gb|ELP62755.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
Length = 548
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 178/385 (46%), Gaps = 69/385 (17%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQIS-LSCSFHMFDSLTQIAAKPL 83
LTPP+GWNSWN F C I E +R+ D S RD+ L FD
Sbjct: 41 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMRDAGYRYLVVDDCWFD---------- 90
Query: 84 PNQ-CQGNLVANKTTFPSGTKALADYVH------------------------ILISNSLD 118
PN+ GNL +N T FPSG KAL D++H + S
Sbjct: 91 PNRDAAGNLRSNPTKFPSGMKALGDHIHNKGLKFGIYQVPGERTCAQTSGGYPGSTGSRG 150
Query: 119 NRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
+ + F G+DYLKYD C + TR R+ M DAL+ GRPI +S+ +
Sbjct: 151 HEAQDATTFASWGVDYLKYDWCSSSGTRDEQVARFTLMRDALRATGRPIVYSINP--NSF 208
Query: 176 PALWGSN-----IRNSWRTTDDISDSWARMLT---------IADMNEVYADHAKPGGWND 221
A+ GS + + WRTT+D+ D W T + D+ A + PG WND
Sbjct: 209 HAITGSTYNWGEVADLWRTTEDLLDIWQNGNTNSYPMGVGNVLDVTAPLAAQSGPGHWND 268
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDML VG G+ E HF++WA++ APL+ G D+ ++A+ I+ N ++AVNQ
Sbjct: 269 PDMLVVGRPGLSLTESRSHFALWALTGAPLMAGNDIRTMSADVSAILRNPRLLAVNQDSL 328
Query: 279 -----AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN-TLFEHRT 329
+ D +++A PLS + V L +R ++ + +G+ + TL + T
Sbjct: 329 GAGGRRVRDDGSTEVFAKPLSDGSVAVGLFNRGGSTATIATTAAQVGLSGGSFTLTDLWT 388
Query: 330 LET-KFQGKLTAKVDGHSCKMYELT 353
T G+++A V H ++ +T
Sbjct: 389 GGTSSTSGQISASVPAHGVAVFRVT 413
>gi|449544301|gb|EMD35274.1| glycoside hydrolase family 27 protein [Ceriporiopsis subvermispora
B]
Length = 494
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 141/302 (46%), Gaps = 73/302 (24%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQIS 64
+G P MGWN+WN F C I+E ++ E DDC+ E RD+
Sbjct: 83 VGRLPAMGWNTWNSFWCDIDETLVLEHAQLMKTLGLADAGYNHVNIDDCYSEKKRDAD-- 140
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ 112
GN+VA+ FPSG K L D +H +
Sbjct: 141 ------------------------GNIVASAIKFPSGMKNLTDQIHAMGFKAGIYSDSGW 176
Query: 113 -----ISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRP--TIRYRAMSDALKKAG 161
S N F+ G DYLKYD+C F+ + +++ M++A+++
Sbjct: 177 FTCQMYPGSFQNEARDAKLFQDWGFDYLKYDSCTVPFDGLLKEGQVGKFKRMANAIQELA 236
Query: 162 R-----PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKP 216
+ P+ FS+C+WG +P LW SWRTT DI +W + I + N A +
Sbjct: 237 KTSGKPPLQFSLCQWGWEQPWLWAREFGQSWRTTGDIEANWRSVANIINRNSFIAWASDF 296
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G ND DMLE+GNG + Y E HF++WA+ K+PLL+G D+ T +T+ I+ N E+IA+
Sbjct: 297 YGHNDLDMLEIGNGELTYEESKTHFTVWALLKSPLLIGTDLRRATNQTIEILTNREIIAI 356
Query: 277 NQ 278
NQ
Sbjct: 357 NQ 358
>gi|407924773|gb|EKG17800.1| Glycoside hydrolase clan GH-D [Macrophomina phaseolina MS6]
Length = 960
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 99/295 (33%), Positives = 142/295 (48%), Gaps = 62/295 (21%)
Query: 20 LSNGLGLTPPMGWNSWN--------------HFHCQINEKIIR----ETDDCWGESSRDS 61
L NG+G+TP +GWN++N + Q+ K + DD W ++RD+
Sbjct: 585 LDNGVGVTPALGWNNYNAGLSASADSALAAANAFIQLGLKDLGYEYINLDDGWSTTTRDA 644
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-ISNSL--D 118
GNLVA+ FP+G K ++D +H L + L D
Sbjct: 645 D--------------------------GNLVADPNKFPNGVKNVSDQIHALGLKFGLYGD 678
Query: 119 NRIESNIAF-------EQ--------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRP 163
+ + F EQ G+DY KYDNC RY M DAL G
Sbjct: 679 SGTATCAGFPGSQGYEEQDAKLLASWGVDYWKYDNCNTPSGDSQPRYETMRDALLATGHD 738
Query: 164 IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPD 223
I +S+C+WG WG+++ NSWR DI+++W + +IA N ++ PGG+ND D
Sbjct: 739 IVYSLCQWGVDSVWTWGASVGNSWRVGGDITNNWNSVASIAASNAGITSYSAPGGFNDYD 798
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
MLE+GNG + E HF +WAI K+PLLLG D+ + ++ +I N VIAVNQ
Sbjct: 799 MLEIGNGKLTAAEERAHFGLWAICKSPLLLGTDLTKIANTSLAVIRNSAVIAVNQ 853
>gi|116191885|ref|XP_001221755.1| hypothetical protein CHGG_05660 [Chaetomium globosum CBS 148.51]
gi|88181573|gb|EAQ89041.1| hypothetical protein CHGG_05660 [Chaetomium globosum CBS 148.51]
Length = 400
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 153/347 (44%), Gaps = 85/347 (24%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET----------- 50
++ + +A S+ S L NG+G TP +GWNSWN C ++ T
Sbjct: 8 LLGAVSAVSSRSVPEKRLDNGVGRTPALGWNSWNQGGCNAATAAVVLNTAQAFIDLGLKE 67
Query: 51 --------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGT 102
DDCW R+S G+LVA+ FP G
Sbjct: 68 LGYTYVNIDDCWSLKQRNSS--------------------------GHLVADPAKFPQGI 101
Query: 103 KALADYVH------------------ILISNSLDNRIESNIAFEQGIDYLKYDNCF---- 140
LA VH + + + +++ G+DY K+DNC
Sbjct: 102 DGLARAVHAKGLKLGLYGDAGTLTCALYPGSYGSEQKDADTIAAWGVDYWKFDNCLTEQV 161
Query: 141 --NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA 198
N + Y M DAL K G+PI FS+C+WG WG + NSWR ++DI++ WA
Sbjct: 162 YTNKGIKSPEYYPVMRDALLKTGKPILFSICQWGRDEVWTWGGKVGNSWRMSEDITNDWA 221
Query: 199 RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
+ +IA ++A PG +ND DM+E+GNG + E HF +WAI K+P+++G D+
Sbjct: 222 SVSSIAARAATMHEYAAPGEFNDLDMMELGNGVLTEAEERAHFGLWAIMKSPIIMGTDMT 281
Query: 259 NLTAETMPIIGN--EEVIAVNQ----------AINDQQIWAGPLSGN 293
L T+ +I N + ++A+NQ DQ AGP+SG
Sbjct: 282 KLKESTLKVIKNKTQGILAINQDPLGKAATTFTPKDQ---AGPVSGK 325
>gi|134054663|emb|CAK43504.1| unnamed protein product [Aspergillus niger]
Length = 391
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 162/362 (44%), Gaps = 88/362 (24%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
L P MGWNSWN F +N I++E DD W +R S
Sbjct: 31 LAQKPQMGWNSWNAFKATVNYTIVQEVISLFDTLGLKEAGYEYVLLDDGWASYNRTSD-- 88
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNS-- 116
G L AN T+FP G KALA VH L +S
Sbjct: 89 ------------------------GYLQANATSFPQGIKALAQEVHGKGLKLGLYGDSGH 124
Query: 117 -----------LDNRIESNIAFEQGIDYLKYDNC--FNDDTR-PTIRYRAMSDALKKAGR 162
+ R A G+DYLKYDNC F T P IR+ AM +AL +GR
Sbjct: 125 YTCAWRPGSWGYEERDAQTFA-GWGVDYLKYDNCGGFQSMTEAPQIRFGAMKNALTLSGR 183
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY----ADHAKPGG 218
IF+S+C WG P WG +I +S+R + DI+ S+ T Y + PG
Sbjct: 184 DIFYSVCGWGYQFPWHWGGDIGHSYRMSGDITTSFTNE-TECQCKTAYCLNTGQYQTPGH 242
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
W D DMLEVGN N+ HF+ WA K+PL++G D+ L+ +++ ++ N+ +I++NQ
Sbjct: 243 WLDMDMLEVGNANFTLNQQQTHFAFWAALKSPLIIGADLSKLSNDSLAVLTNKAIISINQ 302
Query: 279 -AINDQ-------------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
A+ + Q+WAG + +V+LL + K++ ++ + +G+ S +
Sbjct: 303 DALGEPVTYREAHSKEGLFQVWAGKVEDGYVVLLL-NEKSYPQTVSLSFASLGLGSPQKV 361
Query: 325 FE 326
E
Sbjct: 362 TE 363
>gi|389740218|gb|EIM81409.1| glycoside hydrolase [Stereum hirsutum FP-91666 SS1]
Length = 442
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 159/361 (44%), Gaps = 97/361 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
LSNG+ P +G+N+WN F C I++ +I +T DDCW E +R
Sbjct: 30 LSNGVAKVPVLGYNTWNFFQCNIDQDLIIDTANLMVSLGLQDAGYNYVNIDDCWAEKNRS 89
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILI------- 113
S G L + F G K L D VH L
Sbjct: 90 ST--------------------------GELQPDSVRFSRGMKNLTDQVHALGFKAGIYG 123
Query: 114 ------------SNSLDNRIESNIAFEQGIDYLKYDNC---FNDDTRPTI--RYRAMSDA 156
S +++ + + G DYLKYDNC F+ + +Y MS+A
Sbjct: 124 DSGWFTCAGYPGSFEHEDQDAQTLFADWGFDYLKYDNCAIPFDTIIQQGTMGKYVPMSNA 183
Query: 157 LKKAG------RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
L+ P+ FS+C+WG + LWG+ + +SWRTT DIS W+ + +I + N
Sbjct: 184 LQNLATNGSLSEPVVFSLCQWGWSQVWLWGARLGHSWRTTGDISSQWSSIASIINFNSFI 243
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
G ND DML++GNG M ++E HF+ WA+ K+PLL+G ++ +T ET+ I+ N
Sbjct: 244 TQATGFYGRNDMDMLQLGNGNMTFDEAKTHFTAWALMKSPLLIGTNLSAITNETLSILTN 303
Query: 271 EEVIAVNQ-------------AINDQ---------QIWAGPLSGNRIVVLLESRKTFSSM 308
+E++A+NQ IN Q W+GP + +L+ + T S M
Sbjct: 304 KEILAINQDSVEPDAVSPFRWGINPDWTSNSSFPAQYWSGPSENGTVFMLINTLDTPSDM 363
Query: 309 M 309
Sbjct: 364 F 364
>gi|290954985|ref|YP_003486167.1| alpha-galactosidase [Streptomyces scabiei 87.22]
gi|260644511|emb|CBG67596.1| probable secreted alpha-galactosidase [Streptomyces scabiei 87.22]
Length = 548
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/406 (29%), Positives = 177/406 (43%), Gaps = 102/406 (25%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RD+
Sbjct: 40 LTPPLGWNSWNSFGCGITEAQVRQATDAMVSSGMREAGYRYVVVDDCWFDPQRDAA---- 95
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI 121
GNL AN T FPSG KAL DY+H I + + +
Sbjct: 96 ----------------------GNLRANPTKFPSGMKALGDYIHGKGLKFGIYQAPNEKT 133
Query: 122 --------------------ESNIAFEQGIDYLKYDNCFNDDT--RPTIRYRAMSDALKK 159
++ I G+DYLKYD C T + ++ M DAL+
Sbjct: 134 CAQGVGTYPGSTGSKGHEAQDAAIFASWGVDYLKYDWCSGSGTLDQQIAQFTVMRDALRA 193
Query: 160 AGRPIFFSMCEWGDMRPAL----WGSNIRNSWRTTDDISDSWARMLT---------IADM 206
GRPI +S+ P WG + + WRTT+D+ D W T + D+
Sbjct: 194 TGRPIVYSINPNSFHAPTGHTYDWG-QVADLWRTTEDLLDIWQNGNTNSYPMGVGNVLDI 252
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
A + PG WNDPDML VG G+ E HF++W++ APL+ G D+ ++A+
Sbjct: 253 TAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWSLLSAPLMAGNDIRTMSADVSA 312
Query: 267 IIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
++ N ++AVNQ D +++A PLS + V L +R ++ +TA
Sbjct: 313 VLRNPRLLAVNQDSLGAGGRRVRDDGDTEVFAKPLSDGSVAVGLLNRGGSATTVTATAAQ 372
Query: 316 IGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELT---PVT 356
+G+ TL + T T G+++A V H ++++T PVT
Sbjct: 373 VGLTGGPFTLTDLWTGGTSSTSGQISASVPAHGVAVFKMTGGSPVT 418
>gi|33391227|gb|AAQ17217.1| alpha-galactosidase precursor [Saccharomyces bayanus]
Length = 417
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 154/343 (44%), Gaps = 75/343 (21%)
Query: 35 WNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVAN 94
WN F C ++E+++ +T D R S I L + + L + G LVA+
Sbjct: 1 WNTFACDVSEQLLLDTAD------RISDIGLKDLGYTYVILDDCWSSG--RSSNGTLVAD 52
Query: 95 KTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAF-EQGIDYLKY 136
K FP+G +AD++H SL + + F G+DYLKY
Sbjct: 53 KEKFPNGMSHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGHEEDDADFFASNGVDYLKY 112
Query: 137 DNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
DNC+N P RY+AMSDAL K GRPIF+S+C WG WGS+I NSWR +
Sbjct: 113 DNCYNKGQFGAPETSYNRYKAMSDALNKTGRPIFYSLCNWGQDLTFYWGSDIANSWRMSG 172
Query: 192 DISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
DI + R ++ I + +A GGWND D LEVG
Sbjct: 173 DIYPEFTRPDSRCPCDGDQFDCAYAGFHCSIMNILNKAAPMGQNAGIGGWNDLDNLEVGV 232
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G + +E HFS+WA+ K+PL++G DV L A + I +IA+NQ
Sbjct: 233 GNLTDDEEKAHFSMWAMVKSPLIIGADVNQLKASSYSIYSQASIIAINQDPKGVPATRVW 292
Query: 279 ----------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMT 310
+ Q W+GPL +G++++ LL SM T
Sbjct: 293 RHQVPQTDKYGQGEIQFWSGPLDNGDQVIALLNGGNKARSMNT 335
>gi|256377206|ref|YP_003100866.1| Ricin B lectin [Actinosynnema mirum DSM 43827]
gi|255921509|gb|ACU37020.1| Ricin B lectin [Actinosynnema mirum DSM 43827]
Length = 548
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 121/413 (29%), Positives = 169/413 (40%), Gaps = 115/413 (27%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RDS
Sbjct: 41 LTPPLGWNSWNSFGCGITEGQVRQAADAMASSGMRDAGYRYVVVDDCWFDPQRDSA---- 96
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------------- 110
GNL + T FPSG KAL DY+H
Sbjct: 97 ----------------------GNLRNHPTKFPSGMKALGDYIHGKGLKFGIYQAPNEKT 134
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + +F G+DYLKYD C TR R+ M DAL+
Sbjct: 135 CAQGTGAHPGATGSKGHEAQDARSFASWGVDYLKYDWCSGAGTRDEQIARFTIMRDALRA 194
Query: 160 AGRPIFFSM------------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT----- 202
GRPI +S+ +WGD + + WRTT+D+ D W T
Sbjct: 195 TGRPIVYSINPNSFHAITGDKHDWGD---------VADLWRTTEDLLDVWQNGNTNSYPM 245
Query: 203 ----IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
+ D+ A PG WNDPDML VG G+ E HF++WA+ APL+ G D+
Sbjct: 246 GVGNVLDVTAPLAAQTGPGNWNDPDMLVVGRPGLTLTESRAHFALWALMAAPLMAGNDIR 305
Query: 259 NLTAETMPIIGNEEVIAVNQ--------AINDQ---QIWAGPLSGNRIVVLLESRKTFSS 307
++ E ++ N +IAVNQ + D +++A PLS + V L +R ++
Sbjct: 306 TMSPEISAVLRNPGLIAVNQDPLGAGGRRVRDDGATEVFAKPLSDGSVAVGLFNRGGGAT 365
Query: 308 MMTAHWDDIGIKSNN-TLFEHRTLETKF-QGKLTAKVDGHSCKMYELTPVTKL 358
+ IG+ TL + T T G ++A V H + +T T L
Sbjct: 366 TVATTAAQIGLSGTGFTLTDLWTGGTSTSSGAISATVPAHGVAAFRVTGGTPL 418
>gi|310801358|gb|EFQ36251.1| alpha-galactosidase [Glomerella graminicola M1.001]
Length = 405
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/409 (27%), Positives = 184/409 (44%), Gaps = 109/409 (26%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET--------------------DDCWGESSRDSQI 63
+G P +GW+ WN C + T DDCW RDS
Sbjct: 29 VGQKPILGWSGWNQGGCNAASASVALTTAQNFISRGLKDAGYTYVNIDDCWSTKQRDSS- 87
Query: 64 SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-ISNSL--DNR 120
GNLV + +P+G KA+ D +H + + L DN
Sbjct: 88 -------------------------GNLVPDPAKWPNGIKAVTDQIHSMGLKFGLYGDNG 122
Query: 121 IESNIAF------EQ---------GIDYLKYDNCFND----------------DTRPTIR 149
I++ + EQ G+D+ KYDNC+ ++RP R
Sbjct: 123 IKTCAGYPGSQGNEQKDAKLLASWGVDFWKYDNCYTPCNLTGPPQTCPNNQAPNSRP--R 180
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEV 209
Y M DA+K GRPI +S+C WG + WG+ + WR + D W ++ IA+
Sbjct: 181 YETMRDAIKATGRPILYSLCNWGYDQVWTWGAQVGQMWRMSTDNWGGWQDVVNIANWAAP 240
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A ++KP G+ND DM+ +GNG + + HF+IWAI+K+P++LG D+ L++ + ++
Sbjct: 241 IAKYSKPYGFNDLDMMIIGNGKLTPAQERTHFAIWAIAKSPIILGTDISKLSSAQIALVT 300
Query: 270 NEEVIAVNQ-------------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMT 310
N++++AVNQ + N WAG LS +V L+ S T ++ M+
Sbjct: 301 NKDLLAVNQDSLGVAATTFTPRGATNPGSGNMPPYWAGALSDGTVVALVAS--TAAATMS 358
Query: 311 AHWDDI-GIKSNNTLFEHRTLETKFQG---KLTAKVDGHSCKMYELTPV 355
++ D+ G+ S F + L T QG ++A+++ + ++++T V
Sbjct: 359 VNFTDVPGLGSGT--FAWKELLTGKQGTGTSVSAQLEKNDVAVFKVTKV 405
>gi|198413786|ref|XP_002130547.1| PREDICTED: similar to MGC81044 protein [Ciona intestinalis]
Length = 419
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 117/390 (30%), Positives = 173/390 (44%), Gaps = 56/390 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
L+NGL TPPMGW +W + C N + DDC GE D +S ++ +
Sbjct: 17 LNNGLVRTPPMGWLAWERYRCNTN--CAQFPDDCIGEKLFMRMADRLVSDGWRDVGYEYV 74
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVHI------------------LISNS 116
P + G LV N T FPSG KALADYVH I++
Sbjct: 75 NIDDCWPSRERDANGRLVGNSTRFPSGIKALADYVHSKGLKLGIYSDCGKLTCGGYIASG 134
Query: 117 LDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW----G 172
+ +I++ G+D LKYD C++++T + Y M L K GRPI +S C W G
Sbjct: 135 GNEKIDAETFAAWGVDMLKYDGCYSNETSRKVNYPLMGAELNKTGRPIVYS-CSWPAYEG 193
Query: 173 DMRPALWGSN---IRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDML 225
+ P + + I N WR DI DSW ++ I D +V A PGGWNDPDML
Sbjct: 194 GLPPKVNYTELNAICNLWRNYGDIQDSWDDVVDIMNWWGDNQDVLIPAAGPGGWNDPDML 253
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI----- 280
G+ + ++ F +WA+ APL + D+ L + ++ N +VIAVNQ +
Sbjct: 254 IGGDYTLSLDQTKTQFGMWAMLAAPLFMSNDLSKLEPDIKTVLQNRDVIAVNQDVLGVQG 313
Query: 281 ------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW--DDIGIKSNNTLFEHRTLET 332
+ Q+++ PL V S +T + + +D+ I E+
Sbjct: 314 RRFIKQDSIQVFSKPLDKGEFAVAAFSTRTDGTPHAIKFSLNDLKISGAKFYSLFDLFES 373
Query: 333 KFQGK------LTAKVDGHSCKMYELTPVT 356
K GK L VD + M+ +P++
Sbjct: 374 KHLGKYSTAEELDIYVDPNGISMFRASPIS 403
>gi|358392928|gb|EHK42332.1| glycoside hydrolase family 27 protein [Trichoderma atroviride IMI
206040]
Length = 408
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 161/330 (48%), Gaps = 69/330 (20%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGWNSWN F INE +I+ + +S D+ L+ + + + +L +
Sbjct: 24 PQMGWNSWNTFKSNINETLIKSS----AKSLVDT--GLAQAGYKYVNLDD-GWQAFTRDS 76
Query: 88 QGNLVANKTTFPSGTKALADYVH---ILISNSLDNRIESNIAFEQG-------------- 130
G N T FPSG KALAD+VH + I D+ I + AF G
Sbjct: 77 SGRQQPNSTRFPSGMKALADFVHHMGLEIGIYSDSGIY-DCAFYPGSYGYEERDAATYAS 135
Query: 131 --IDYLKYDNC--FNDDT-RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
IDYLK+DNC F T P R+ M DAL ++GR IF+S+C+WG+ P W S +
Sbjct: 136 WTIDYLKFDNCGSFQAGTLSPQERFLRMGDALNRSGRSIFYSLCQWGNQFPWHWAS-FSD 194
Query: 186 SWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPGGWNDPDM 224
S+R + DI ++ + I M E+ + +PG W D DM
Sbjct: 195 SYRISGDIKSAFGEDSSGVCQSAYCLNTGYAGVSVLTMIRKMREL-SRFQRPGSWGDMDM 253
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----A 279
LEVG G M ++ HFS WA K+PL++G DV ++ ++ I+ N+E+IA++Q A
Sbjct: 254 LEVGTGTMNLHQEQTHFSFWAALKSPLIIGADVNTISKVSLNILMNKEIIAISQDDAGVA 313
Query: 280 IN---------DQQIWAGPLSG--NRIVVL 298
+N Q+W GPL+ +R VVL
Sbjct: 314 VNYLPDLSTEHKVQVWGGPLASGKSRYVVL 343
>gi|388583499|gb|EIM23801.1| glycoside hydrolase [Wallemia sebi CBS 633.66]
Length = 430
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 147/324 (45%), Gaps = 90/324 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NG+ +TP MGWNSWN F C+I++ +I ++ DDCW +RD
Sbjct: 21 LNNGVAVTPQMGWNSWNSFACEIDQDLITQSAQKLVDLGLRDLGYTYVGIDDCWQADARD 80
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISN 115
P N+ L N FP G K +AD VH L I +
Sbjct: 81 ---------------------PETNK----LSYNAEKFPDGIKGVADQVHGLNLKLGIYS 115
Query: 116 SLD----NRIESNIAFEQG---------IDYLKYDNCFNDDTRPTI-----RYRAMSDAL 157
S R+ +++ +E +D LKYDNCFN T RY AMS AL
Sbjct: 116 SAGTLTCGRMPASLGYETEDASSYAEWEVDLLKYDNCFNQGQSGTPKLSYDRYNAMSQAL 175
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA------------------- 198
GRPI ++MC WG+ P + + I NSWRTT DI+DS+
Sbjct: 176 NATGRPIVYAMCNWGEDGPWNFATTIANSWRTTGDITDSFTGEDDRCPCKGNEGLDCKLA 235
Query: 199 ----RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+ I + + + G WND D LEVG G + + HF++WA K+PL++G
Sbjct: 236 GYHCSITNILEKSVSLGQKSFSGAWNDLDGLEVGVGNLTATQSRSHFTMWAFMKSPLMIG 295
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ 278
++ + E++ I+ N+ VI VNQ
Sbjct: 296 ANLETIDDESLEILKNKAVIDVNQ 319
>gi|410724423|ref|ZP_11363614.1| alpha-galactosidase [Clostridium sp. Maddingley MBC34-26]
gi|410602123|gb|EKQ56611.1| alpha-galactosidase [Clostridium sp. Maddingley MBC34-26]
Length = 375
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 183/384 (47%), Gaps = 65/384 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNSWN F +I+E +I+ D + ES L + + + + +K ++
Sbjct: 5 TPPMGWNSWNTFGWEISEDLIKSVADIFVESG------LKDAGYEYIVIDDCWSKKQRDK 58
Query: 87 CQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNR-IESNIAFE 128
G LVA++ FP+G +ALADYVH S ++ I++
Sbjct: 59 -DGRLVADEEKFPNGMRALADYVHSKGLKFGMYSCAGTMTCGGYPGSFEHEFIDAETFAS 117
Query: 129 QGIDYLKYDNCFNDDTRPT-IRYRAMSDALKKAGRPIFFSMCEWGDMRPALW----GSNI 183
G+DYLKYD C+ D P I Y+ M+ AL+ GR I FS C WG+ W G++I
Sbjct: 118 WGVDYLKYDYCYKPDGIPGHILYKRMAMALRNCGRDILFSACNWGNDDSFNWMRASGAHI 177
Query: 184 RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN--------GGMKYN 235
+R+T DI D+W + +A ++ P +ND DML VG GG
Sbjct: 178 ---FRSTGDIQDNWESIKDLALKQLNKECYSGPYCYNDVDMLVVGMYGNGNVALGGCTDE 234
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPL----- 290
EY HFS+WA+ +PL++G D+ + T I+ N+E+I +NQ I +Q + P
Sbjct: 235 EYKTHFSLWAMMDSPLMIGSDIRKMNETTKNILTNKEIIEINQDIEGRQAYTIPQWNNSD 294
Query: 291 ---------SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN-------TLFEHRTLETKF 334
+G+ + + M W DIG+ +++ L++H L T F
Sbjct: 295 CMAYVKTLSNGDYAIGFFNFGDSAGEMSLQFW-DIGLPTSSGLGLSLRDLWKHEDLGT-F 352
Query: 335 QGKLTAKVDGHSCKMYELTPVTKL 358
+ T K++ H+C +++ V KL
Sbjct: 353 KESYTIKLEPHACAVFK-AKVIKL 375
>gi|242207094|ref|XP_002469401.1| hypothetical protein POSPLDRAFT_134790 [Postia placenta Mad-698-R]
gi|220731430|gb|EED85274.1| hypothetical protein POSPLDRAFT_134790 [Postia placenta Mad-698-R]
Length = 435
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 158/361 (43%), Gaps = 95/361 (26%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESS 58
N L+NG+ P +G+N+WN + C I+E +I ET DDCW E +
Sbjct: 22 NALANGVARLPVLGYNTWNAYACNIDENLILETANLMKSLGLQDAGYTHVNLDDCWAEKN 81
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
R ++ G LV N FPSG + +H L
Sbjct: 82 RSAE--------------------------GLLVPNSERFPSGFNNMTAQLHALGFEAGI 115
Query: 113 -----------ISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSD 155
S N + ++ G DYLKYDNC F+D R +Y+ MSD
Sbjct: 116 YGDSGWLTCAGYPGSYSNELLDATTYQNWGFDYLKYDNCYIPFDDVIREGTFGKYQRMSD 175
Query: 156 ALKKAGR-----PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
A+ + P+ FS+CEWG + +WG+ + NSWR DI +W + +I ++
Sbjct: 176 AIAELSLTSRQPPLIFSLCEWGWSQVWIWGAQVGNSWRVDGDIESNWPALSSIINLASFI 235
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
G ND D+LEVGNG + Y+E HF+ WA+ K+PLL+G ++ + E + I+ N
Sbjct: 236 TQGTDFYGRNDMDILEVGNGNLTYDENKSHFTAWALFKSPLLIGTNLSAASPEIVEILSN 295
Query: 271 EEVIAVNQ-------------------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSM 308
E++A+NQ ND Q W+GP +V+LL + ++M
Sbjct: 296 REILAINQDPVVGTSISPFRWGLNADWTSNDSYPAQYWSGPSENGTVVMLLNTLDEPANM 355
Query: 309 M 309
Sbjct: 356 F 356
>gi|392562204|gb|EIW55385.1| glycoside hydrolase family 27 protein [Trametes versicolor
FP-101664 SS1]
Length = 439
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 147/306 (48%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L +G+ P +G+N+WN +HC INE ++ + DDC+ E +R
Sbjct: 28 LDSGVAKLPVLGYNTWNAYHCDINETVVLQAAQLMVSLGLADVGYKHVNIDDCYAEKNRS 87
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
+ G++VA+K F +G L D +H
Sbjct: 88 AS--------------------------GDIVADKERFAAGMNDLTDKIHAMGLKAGIYS 121
Query: 112 --------LISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL 157
L S N F+ G DYLK+DNC ++ R I +Y+ M+DA+
Sbjct: 122 DSGWFTCQLYPGSYQNEARDAKLFQDWGFDYLKFDNCAVPYDAIIREGIVGKYKRMADAI 181
Query: 158 ----KKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
K +G+P + FS+CEWG+ +P LW SWRTT DIS W + +I + N A
Sbjct: 182 ADLAKSSGKPPLIFSLCEWGEEQPWLWARRFGQSWRTTGDISPDWGSIASIINQNSFIAW 241
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+ G ND D+LE+GNGG+ ++E HF+ WA+ K+PLL+G D+ ++ + I+ N E
Sbjct: 242 ASDFYGHNDMDILEIGNGGLTHDEAKTHFTAWALMKSPLLIGTDLSTISDADLAILKNTE 301
Query: 273 VIAVNQ 278
++A++Q
Sbjct: 302 LLALSQ 307
>gi|392589346|gb|EIW78677.1| glycoside hydrolase family 27 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 448
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 179/406 (44%), Gaps = 77/406 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L+NG+ P +G+N+WN +HC I+++++ + DDCWG +R
Sbjct: 17 LNNGVAKLPVLGFNTWNAYHCDISQELVLQQAQYMKTLGLLDAGYTQFNLDDCWGVKTRS 76
Query: 61 SQISLSCSFHMFDSLTQIAAK-----------PLPNQ--CQGNLVANKTTFP---SGTKA 104
S + + +F + A P C P SG +
Sbjct: 77 SSGEIQYNTTLFPDMNNYTATLNSMGLCVDFTPFRTHVICPDRFTVRSAAKPYSDSGWQT 136
Query: 105 LADYVHILISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDALK 158
A YV S D+ + +F+ G DYLKYDNC F+D + + +Y+ M AL+
Sbjct: 137 CAGYV-----GSFDHEDQDAASFQSWGFDYLKYDNCAIPFDDVVQQNVLGKYQRMQYALE 191
Query: 159 KA----GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA 214
+ G +S+CEWG + LWG+++ +SWR DI+ W+ + +I D +
Sbjct: 192 RVANSTGTTFVYSLCEWGWSQVWLWGASVAHSWRIDGDIAPYWSSLTSIIDQLSFINFGS 251
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
+ G ND D+LE+GNG + Y+E HF++WA +K+PLL+ D+ ++ E + I+ N E++
Sbjct: 252 QFYGHNDMDILEIGNGNLTYDESKTHFTVWAFAKSPLLISADLSTISQENVDILKNSEIL 311
Query: 275 AVNQ----------------------AINDQQIWAGPLSGNRIVVLL----ESRKTFSSM 308
A++Q A Q W+ P +++L+ E F ++
Sbjct: 312 AISQDNVYGQSVSPFRWGINPDWITNATYPAQYWSLPSENGTVIMLINTLDEPADMFFNL 371
Query: 309 MTAHWDDIGIKSN-NTLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
+ W GI+ N L+ H T + A V H LT
Sbjct: 372 TESPWLRAGIQYNVRDLWSHTDNGTAVRNYTAAGVPSHGVAALLLT 417
>gi|336398020|ref|ZP_08578820.1| Alpha-galactosidase [Prevotella multisaccharivorax DSM 17128]
gi|336067756|gb|EGN56390.1| Alpha-galactosidase [Prevotella multisaccharivorax DSM 17128]
Length = 411
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 97/291 (33%), Positives = 147/291 (50%), Gaps = 49/291 (16%)
Query: 24 LGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDS---QISLSCSFHMFDSLTQIA 79
L TP MGW+SWN F +I+E KI+ D +D+ +++ +H
Sbjct: 26 LADTPLMGWSSWNCFADKIDEEKIVGIIDAVVSSGLKDAGYVYVNIDDCWHG-------- 77
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
G + N+ FP G K LADY H + S + +
Sbjct: 78 ----KRDANGFITVNEQRFPHGMKWLADYAHAKGLKLGIYSCAGYQTCAGLPGSYGHEYQ 133
Query: 123 SNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
+ + + GID+LK D C D P Y+ MSDAL+ AGRPI+F++CEWG +P W
Sbjct: 134 DALQYARWGIDFLKEDWCNTPDLNPKAAYQLMSDALRTAGRPIYFNLCEWGRNQPWHWAR 193
Query: 182 NIRNSWRTTDDISDSWARML-----------TIA---DMNEVYADHAKPGGWNDPDMLEV 227
++ +SWRTT DI +A ++ T+ ++ + +A PG WNDPDMLEV
Sbjct: 194 DMAHSWRTTGDIGVGFAELIKHYEEGAWRPNTVMKNLELTDTLRQYAGPGHWNDPDMLEV 253
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
GN GM +E HF++W + APL+LG D+ ++ T I+ N+++IA++Q
Sbjct: 254 GN-GMTQSEDRAHFTMWCMLAAPLVLGNDIRTMSDATKGIVLNKDMIAIDQ 303
>gi|238059649|ref|ZP_04604358.1| alpha-galactosidase [Micromonospora sp. ATCC 39149]
gi|237881460|gb|EEP70288.1| alpha-galactosidase [Micromonospora sp. ATCC 39149]
Length = 509
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 142/310 (45%), Gaps = 90/310 (29%)
Query: 30 MGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQISLSCSFH 70
MGWNSWN F C INE +IR+ DDCW +RDS
Sbjct: 1 MGWNSWNTFFCNINEALIRQMADAMVSSGMRDAGYQYVVVDDCWMNPNRDSA-------- 52
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ 112
GNL A+ T FPSG KAL DY+H
Sbjct: 53 ------------------GNLQADPTRFPSGMKALGDYIHARGLKFGIYQAPLDKTCAQY 94
Query: 113 ------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAMSDALKKAGRP 163
+ ++ + + F G+DYLKYD C T R+ M DAL GRP
Sbjct: 95 FGSYPGATGAMGHEAQDARQFAAWGVDYLKYDWCSPTGTINEQVNRFAIMRDALAATGRP 154
Query: 164 IFFSMC------EWGDMRPALWGSNIRNSWRTTDDISDSWARMLT---------IADMNE 208
I +S+ + G MR WG ++ N WRTT+DI+++W T I ++N
Sbjct: 155 ILYSINPNSIHEKTGPMRN--WG-DVANIWRTTEDITNAWDTGQTNGYPMGIQNIINVNV 211
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
A +A+PG +NDPDM+EVG GGM E HF++WAI +PL+ G D+ N+ + T I+
Sbjct: 212 PLAGYARPGSFNDPDMMEVGRGGMNDTEMRSHFAMWAIMASPLIAGNDLRNMNSATQTIL 271
Query: 269 GNEEVIAVNQ 278
N +IA+NQ
Sbjct: 272 KNANLIAINQ 281
>gi|240144513|ref|ZP_04743114.1| alpha-galactosidase [Roseburia intestinalis L1-82]
gi|257203463|gb|EEV01748.1| alpha-galactosidase [Roseburia intestinalis L1-82]
Length = 377
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 99/283 (34%), Positives = 143/283 (50%), Gaps = 39/283 (13%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFH--MFDSLTQIAAKPL 83
LTPPMGWNSWN F INE +I+ET D E+ L C + + D + +
Sbjct: 4 LTPPMGWNSWNTFGEHINETMIKETADSMVENGL-----LECGYEYLVIDDCWSLRER-- 56
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNR-IESNI 125
G LVA+ FP G KA+A+YVH S ++ I++
Sbjct: 57 --DKNGRLVADPEKFPHGMKAVAEYVHQKGLKFGMYSCAGNLTCAGYPGSFEHEFIDAAT 114
Query: 126 AFEQGIDYLKYDNCFNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW-GSNI 183
E G+D+LKYD C++ + P YR M AL+ GR I FS C WG W ++
Sbjct: 115 FAEWGVDFLKYDYCYHSNIIPGEYLYRRMGLALENCGRDILFSACSWGADETHKWIKTSA 174
Query: 184 RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG-----NGGMKY---N 235
+ WR+T DI D+W + +A + G +ND DML VG N G++
Sbjct: 175 SSMWRSTGDIFDTWESIKDLAKQQPALFPYNGVGCFNDMDMLVVGMYGKGNVGLQGCNDT 234
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+Y HFS+W+ +PL++GCD+ ++T ET I+ N+E+IA+NQ
Sbjct: 235 QYRTHFSLWSFFGSPLMIGCDIRDMTPETKKILTNKELIAINQ 277
>gi|392570382|gb|EIW63555.1| glycoside hydrolase [Trametes versicolor FP-101664 SS1]
Length = 440
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 137/310 (44%), Gaps = 77/310 (24%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINE-KIIRET------------------DDCWGESSRDS 61
S+ +G P +GWN+WN + C INE KI+ DDCW E RD+
Sbjct: 61 SHSVGKLPALGWNAWNAYGCNINEDKILAAANQFVSLGLADLGYEYINIDDCWSEMKRDA 120
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--------- 112
G +V + T FP+G +AD VH L
Sbjct: 121 ST-------------------------GRIVPDPTKFPNGISGVADQVHALGLKMGIYSD 155
Query: 113 --------ISNSLDNRIESNIAF-EQGIDYLKYDNC-----FNDDTRP----------TI 148
SL N + F E G+DYLKYDNC ++D P I
Sbjct: 156 AGTATCAGFPGSLGNEMLDATTFAEWGVDYLKYDNCNVPGNWSDSGTPPGGDWYNSNSAI 215
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
RYR M+ AL RP F++C WG WG+ + +SWR + D S SW + +I N
Sbjct: 216 RYRQMTAALNGTSRPFQFNLCIWGAANVWDWGARVGHSWRMSGDSSASWNYITSILATNV 275
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+ ND DM+E+GNG + E HF++WA K+P+LLG D+ NL A + II
Sbjct: 276 QHLASIDFYSHNDMDMMEIGNGDLTIQEQRTHFAVWAFLKSPILLGTDLSNLNATQLAII 335
Query: 269 GNEEVIAVNQ 278
N E++A +Q
Sbjct: 336 KNTELLAFHQ 345
>gi|291005415|ref|ZP_06563388.1| putative alpha-galactosidase [Saccharopolyspora erythraea NRRL
2338]
gi|4091877|gb|AAC99325.1| alpha galactosidase precursor [Saccharopolyspora erythraea]
Length = 428
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 124/397 (31%), Positives = 177/397 (44%), Gaps = 77/397 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDSQIS-LSCSFHMFDSLTQIA 79
+G+ TPPMGWNSWN F C I+E++IR+T D G RD+ + FD
Sbjct: 41 DGVARTPPMGWNSWNSFGCDIDERLIRDTADALVGSGMRDAGYQYVVVDDCWFDPQ---- 96
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------------ISN 115
QGNL AN FPSG +ALADYVH +
Sbjct: 97 -----RDPQGNLRANPERFPSGIRALADYVHSRGLKFGIYQVPTEKTCAQRGGTYPGATG 151
Query: 116 SLDNRIESNIAF-EQGIDYLKYDNCFNDDT--RPTIRYRAMSDALKKAGRPIFFSMCEWG 172
SL + + F E G+DYLKYD C + T + M DAL GRPI +S+
Sbjct: 152 SLGHEEQDARTFAEWGVDYLKYDWCSPEGTLEDQIAAFTKMRDALAATGRPIVYSINS-N 210
Query: 173 DMRPALWG-----SNIRNSWRTTDDISDSWAR---------MLTIADMNEVYADHAKPGG 218
P G S + N WRTT+DI W ++ I D+N A A+PG
Sbjct: 211 SYHPDKNGATHDWSPVANMWRTTEDIKPVWDSGNENEYPMGVVNIIDVNRGLAAQARPGH 270
Query: 219 WNDPDMLEVGN------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
WNDPDMLEVG G+ E H S+WA+ +PL+ G +V + I+ N E
Sbjct: 271 WNDPDMLEVGVYDVEGFKGLTDTEARAHLSMWALMASPLIAGNNVTRMPDGIRDILTNRE 330
Query: 273 VIAVNQAI-----------NDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
V+AV+Q D+++W ++ G+R+V L + + + T + K+
Sbjct: 331 VVAVDQDPAGAQGVPVRDHGDREVWVKNMADGSRVVALFNRGERPAGIRTTAREAGLPKA 390
Query: 321 NN----TLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
+ L+ H T T G++ A+V H + ++
Sbjct: 391 SEYQVRDLWAHETSTT--DGEIRAEVPAHGVVLLRVS 425
>gi|378725852|gb|EHY52311.1| alpha-galactosidase [Exophiala dermatitidis NIH/UT8656]
Length = 482
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 164/367 (44%), Gaps = 76/367 (20%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEK-IIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
L NGL TP MGW+++N + NE I+ D RD L +FD
Sbjct: 20 LSYNGLVQTPQMGWDTYNAYALDYNETTILTNADRLVSLGFRD----LGYRVVIFDD--- 72
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----ILISNSLDNRIE-----SNIAFE 128
A G+L+ N T FPSG +++AD +H S R ++ +E
Sbjct: 73 -AMTERNRSANGSLIENHTKFPSGLRSIADRLHASGLYFGVYSSAGRFTCGGYPGSLGYE 131
Query: 129 Q---------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDM 174
G DYLKYDNC+N+ T RY +MS+AL GR +S+C WGD
Sbjct: 132 TTDAQWWAGLGADYLKYDNCYNEGLSGTPKLSQDRYASMSNALNSTGRQFVYSLCNWGDD 191
Query: 175 RPALWGSNIRNSWRTTDDISDSWA----------------------RMLTIADMNEVYAD 212
+P W S I NS R + DI DS++ ++ I
Sbjct: 192 KPWEWASTIANSARISGDIQDSFSMPTSSCPCGPEEYYCQLPGYGCSVVNILGKASHITS 251
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
+PG +ND DM+EVGNGGM Y+EY VHFS+WA K+PL+LG + L+ + ++ N
Sbjct: 252 KNQPGYFNDLDMIEVGNGGMSYDEYKVHFSMWAAIKSPLILGNKLDQLSPQDYALLINPA 311
Query: 273 VIAVNQ---------------------AINDQQIWAGPLS-GNRIVVLLESRKTFSSMMT 310
++A++Q + Q+W+G L+ G+++V L + +M
Sbjct: 312 ILAISQDPAGSAIQRRIRTEVDDKDQYGFGEVQVWSGSLANGDQVVAFLNAGNASRTMQY 371
Query: 311 AHWDDIG 317
+ D G
Sbjct: 372 SLVDVFG 378
>gi|428177476|gb|EKX46356.1| hypothetical protein GUITHDRAFT_70739 [Guillardia theta CCMP2712]
Length = 405
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 151/318 (47%), Gaps = 50/318 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF---DSLT 76
L NG+G P MG+N+WN C+ + IR+ + R + L+ + + D
Sbjct: 66 LDNGVGRLPAMGYNTWNDLLCKPTDSAIRKA------AVRLEGLGLAALGYTYVNIDDCW 119
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD--------------- 118
++ P N+ LV + + FP G K LAD++H + D
Sbjct: 120 AVSRDPKTNR----LVHDASAFPQGIKGLADFMHSKGFKLGIYTDRGQLTCAGRPGSLGT 175
Query: 119 NRIESNIAFEQGIDYLKYDNC--FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+++ + GIDYLK D+C + YR M DAL K GRPI+FS+C W
Sbjct: 176 EELDAQTFADWGIDYLKEDSCNATQEHEGALDEYRKMRDALNKTGRPIYFSLCGWHTWY- 234
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN- 235
A+ G +I NSWR D+ + W M +E+ A PGGWNDPDML GG +N
Sbjct: 235 AMPGKSIGNSWRIAGDVVN-WKTMYRAIRKSELVVKFAGPGGWNDPDMLIGSGGGSNFNL 293
Query: 236 ---EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AIND 282
+ FS+W++ APLL+G + NLTA + N +VIAV+Q A+ D
Sbjct: 294 LPHQSRTQFSLWSVLAAPLLIGAAINNLTAWDLETYSNADVIAVDQDVLGIQGSPYALTD 353
Query: 283 -QQIWAGPLSGNRIVVLL 299
Q+WA PLS V+L
Sbjct: 354 CVQVWARPLSSGSFAVIL 371
>gi|389750501|gb|EIM91672.1| glycoside hydrolase [Stereum hirsutum FP-91666 SS1]
Length = 307
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/257 (36%), Positives = 124/257 (48%), Gaps = 48/257 (18%)
Query: 89 GNLVANKTTFPSGTKALADYVHIL------------------------ISNSLDNRIESN 124
G N T PSG ALA +VH L + + + +++
Sbjct: 48 GGFTWNTTRIPSGVPALATFVHDLGLKFGVYSDAGYFSCDFVGGTAGWLGSLGNETLDAE 107
Query: 125 IAFEQGIDYLKYDNCFND------DTRPTI----RYRAMSDALKKAGRPIFFSMCEWGDM 174
G D LKYDNCF D P I Y AM +AL+ GRPI FS+CEWG
Sbjct: 108 TFTSWGADLLKYDNCFAVSPTDFVDNDPPISLEPHYVAMRNALEATGRPILFSICEWGLQ 167
Query: 175 RPALWGSNIRNSWRTTDDISD--SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
PA W S++ NSWR ++DI SW + I + A PGG+ND D+LEVGN G+
Sbjct: 168 DPARWASDVGNSWRISNDIGPPPSWDNLFRIINEVVPVTGFAGPGGFNDLDLLEVGNNGL 227
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------AI 280
E HF+ WA K+PL++ D+ N T +T+ I+GN +IA+NQ
Sbjct: 228 TVEEQKTHFAFWAAIKSPLMISTDLTNPTDDTLDILGNRRIIALNQDTLGASIAFKRRYT 287
Query: 281 NDQQIWAGPLSGNRIVV 297
ND +WAGPL+ VV
Sbjct: 288 NDHDVWAGPLADGSTVV 304
>gi|313234596|emb|CBY10551.1| unnamed protein product [Oikopleura dioica]
Length = 305
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 142/294 (48%), Gaps = 38/294 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
++NG+ TPPMGW +W F C + K +T C GE+ + + S +D +
Sbjct: 14 MNNGVARTPPMGWLAWERFRCNTDCKTFPDT--CIGENLFMEQAERLVKDGWSAVGYDLV 71
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLD------------ 118
PL ++ G + A+ FP G K LA+ +H + I +
Sbjct: 72 HIDDCWPLRDRVDGKITADPARFPHGIKWLAEQIHGKGLKLGIYGDMGTHTCGGFPGSMG 131
Query: 119 -NRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW----GD 173
++++N+ E GID LKYD C+++D + I Y AMS AL GRPI +S C W G
Sbjct: 132 YEKVDANMFAEWGIDMLKYDGCYSNDDQQKIGYPAMSQALNNTGRPIIYS-CSWPAYQGG 190
Query: 174 MRP----ALWGSNIRNSWRTTDDISDSWARMLTIA----DMNEVYADHAKPGGWNDPDML 225
+ P L G I N WR DI D W +L I+ D +V A PG WNDPDML
Sbjct: 191 LPPQVNYTLLGE-ICNVWRNYGDIQDDWDDILDISKWWGDHADVLVKAAGPGKWNDPDML 249
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA 279
GN + N+ V F IW+I APL L D+ ++ E + N VIAVNQ
Sbjct: 250 IGGNYALTVNQAQVQFGIWSIVAAPLFLSTDLRTMSKEMRDVYQNTAVIAVNQV 303
>gi|383641928|ref|ZP_09954334.1| alpha-galactosidase [Streptomyces chartreusis NRRL 12338]
Length = 549
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 172/405 (42%), Gaps = 99/405 (24%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RD+
Sbjct: 42 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMKEAGYQYVVVDDCWFDPQRDAA---- 97
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL AN T FPSG KAL DY+H
Sbjct: 98 ----------------------GNLRANPTKFPSGMKALGDYIHSKGLKFGIYQAPNEKT 135
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDT--RPTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C T R+ M DAL+
Sbjct: 136 CAQGVGTYPGATGSKGHEAQDAATFASWGVDYLKYDWCSGSGTLSEQIARFTIMRDALRA 195
Query: 160 AGRPIFFSMCEWGDMRPAL----WGSNIRNSWRTTDDISDSWARMLT---------IADM 206
GRPI +S+ P WG + + WRTT+D+ D W T + D+
Sbjct: 196 TGRPIVYSINPNSFHAPTGDKYNWG-EVADLWRTTEDLLDIWQNNNTNSYPMGVGNVLDV 254
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
A + PG WNDPDML VG G+ E HF++W++ APL+ G D+ ++A+
Sbjct: 255 TAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWSLMGAPLMAGNDIRTMSADVSA 314
Query: 267 IIGNEEVIAVNQ--------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
I+ N ++AVNQ + D +++A PLS + V L +R ++ +T
Sbjct: 315 ILRNPRLLAVNQDSLGAGGRRVRDDGSTEVFAKPLSDGSVAVGLFNRGGGTATVTTTAAQ 374
Query: 316 IGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELTPVTKL 358
+G+ TL + T T G+++A V H ++ ++ + L
Sbjct: 375 VGLSGGPFTLTDLWTGGTSSTSGQISASVPAHGVAVFRVSGGSPL 419
>gi|295665133|ref|XP_002793118.1| alpha-galactosidase [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278639|gb|EEH34205.1| alpha-galactosidase [Paracoccidioides sp. 'lutzii' Pb01]
Length = 489
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 164/368 (44%), Gaps = 81/368 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL LTP MGWN+WN F+C +NE+++ + D + + + L + + D A
Sbjct: 23 LDNGLALTPQMGWNTWNSFYCDLNEEVVLDAAD---KIVQLGFMDLGYEYIVLDDCWS-A 78
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN----RIESNIAFEQ- 129
+ + Q NL FPSG LA +H I I +S R E ++ +E+
Sbjct: 79 GRNSSDYLQPNL----EKFPSGIDGLAAKIHAMGLKIGIYSSAGTKTCARYEGSLGYEEK 134
Query: 130 --------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
GIDYLKYDNC+N+ T RY M AL GRPI +S+C WG P
Sbjct: 135 DAELWASWGIDYLKYDNCYNEGQEGTPLLSFNRYNVMGKALNATGRPILYSLCNWGIDGP 194
Query: 177 ALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEVYADH 213
+ I NSWR T D+ ++ R ++ + +
Sbjct: 195 WNFAPTIANSWRITGDLFLNYNRETPECPCAELGGLDCKLPGFRCSVMNVLNKAAYLPSK 254
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G WND DML VGNGG+ + V HFS+WA K+PLL+ + + A+++ I+ N V
Sbjct: 255 GFSGAWNDLDMLVVGNGGLTDDAMVAHFSLWAALKSPLLMSLVLSKIDAKSLSILQNIAV 314
Query: 274 IAVNQ----------------------AINDQQIWAGPLSGNRIVVLL-----ESRKTFS 306
+A++Q + Q+++G L+G VVL R+ S
Sbjct: 315 LAISQDSAGLSATRKWRKYVGDVDELGKQGEIQMFSGSLAGGDEVVLFLNAAASDRQMES 374
Query: 307 SMMTAHWD 314
S+ WD
Sbjct: 375 SLEDIFWD 382
>gi|449544297|gb|EMD35270.1| glycoside hydrolase family 27 protein [Ceriporiopsis subvermispora
B]
Length = 432
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 146/306 (47%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIR-------------------ETDDCWGESSRD 60
LSNG+ P +G+N+WN F C I++ ++ DDCW + +R
Sbjct: 23 LSNGVARLPVLGYNTWNAFACNIDQDLVLTQAKLMKSLGLQAVGYTHVHLDDCWADKNRS 82
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN----- 115
+ G LV ++T F G L +H + N
Sbjct: 83 AS--------------------------GTLVPDQTRFSRGMNNLTGELHAMGFNAGIYG 116
Query: 116 ------------SLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL 157
S ++ + F+ G DYLKYDNC F+D R + +Y+ MSDAL
Sbjct: 117 DSGWFTCAGYPGSFEHEAQDAQTFQDWGFDYLKYDNCNIPFDDIIREGMVGKYQRMSDAL 176
Query: 158 KKAGR-----PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
+ + PI FS+C+WG + LWG+++ +SWRTT DIS +W + +I + N
Sbjct: 177 TQLAQTSRKTPIIFSLCQWGWEQVWLWGASVGHSWRTTGDISPNWDALASIINFNSFITQ 236
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
G ND DM+ +GNGG+ Y+E HF+ WA+ K+PLL+G ++ + T+ I+ N E
Sbjct: 237 ATGFYGRNDMDMVRLGNGGLTYDEAKTHFTAWALMKSPLLIGTNLSAIDDATLGILKNTE 296
Query: 273 VIAVNQ 278
++A+NQ
Sbjct: 297 LLAINQ 302
>gi|156058748|ref|XP_001595297.1| hypothetical protein SS1G_03386 [Sclerotinia sclerotiorum 1980]
gi|154701173|gb|EDO00912.1| hypothetical protein SS1G_03386 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 648
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 106/362 (29%), Positives = 166/362 (45%), Gaps = 63/362 (17%)
Query: 43 NEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ-GNLVANKTTFPSG 101
NE II S+ + +SL S + +T LP++ G L N FP+G
Sbjct: 42 NETII--------HSNAQALVSLGLSALGYHYVTVDCGWTLPDRTSAGTLTWNPDRFPNG 93
Query: 102 TKALADYVHILISNSLDNRIESNIAFEQ--------------------GIDYLKYDNCFN 141
AL ++H L L + S+ + G D LKYDNC++
Sbjct: 94 YPALGTFIHGL---GLGFGVYSDAGVQMCMTGTPAQVGSLYAQTFASWGADLLKYDNCYS 150
Query: 142 D------------DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRT 189
D T P++RY+ M++AL +P+ F +C+WG P+ W + N+WR
Sbjct: 151 DGATGYPNTNYAPSTSPSLRYQNMTNALATLSKPMLFQICDWGVDFPSAWAPPMGNTWRI 210
Query: 190 TDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKA 249
T+DI S++ + I + A PG W D DMLEVGN + E HFS+WAI K+
Sbjct: 211 TNDIIPSYSTIPRILNQAVPQTSFAGPGHWLDLDMLEVGNNVLTTTEEQTHFSLWAILKS 270
Query: 250 PLLLGCDVGN----LTAETMPIIGNEEVIAVNQ-------------AINDQQIWAGPLSG 292
PL++G + + + + ++ I+ N++VI+ NQ + ++WAGPLSG
Sbjct: 271 PLVIGAALKDTFTSINSASLNILSNKDVISYNQDTLGVAASFRRRWTTDGYEVWAGPLSG 330
Query: 293 NRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL--FEHRTLETKFQGKLTAKVDGHSCKMY 350
R VV L + + + +T D+G++ +TL T T +A V H +
Sbjct: 331 GRTVVALINLQNTAKTLTLDLPDVGLQKASTLKNIWAGTSATNVLTSYSATVGAHGTMLL 390
Query: 351 EL 352
EL
Sbjct: 391 EL 392
>gi|291228138|ref|XP_002734022.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 409
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/416 (30%), Positives = 174/416 (41%), Gaps = 109/416 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRET------------------- 50
L NGL TPPMGWN W F C I+ EK+ R+
Sbjct: 21 LDNGLARTPPMGWNDWERFRCNIDCNNDPNNCIGEKLFRDMIDRIASGGYKDVGYEYVNL 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD L AN T FP+G K+LADY H
Sbjct: 81 DDCWMSHERDEN--------------------------NRLTANSTRFPNGIKSLADYAH 114
Query: 111 I------------LIS--------NSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
L++ N +D I++ + GIDYLK+D C + Y
Sbjct: 115 SKGLKLGIYEDYGLLTCAGYPGSLNYMD--IDAQTFADWGIDYLKFDGCNSLPWTMDKGY 172
Query: 151 RAMSDALKKAGRPIFFSMCEWG-DMRPALWGSN---IR---NSWRTTDDISDSWARMLTI 203
M+ AL K GRPIF+S CEW R L N IR N WR DDI+DSW +L++
Sbjct: 173 PEMTRALNKTGRPIFYS-CEWPLYQRAVLIKPNYNLIRENCNLWRNFDDINDSWDSVLSV 231
Query: 204 ADMNEVYADH----AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
+ D A PGGWNDPDML +G+ G+ Y++ ++W+I APL++ D+
Sbjct: 232 INFYTNEQDKLIAAAGPGGWNDPDMLVIGDFGLSYDQSKSQMAMWSIFAAPLMMSADLRT 291
Query: 260 LTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFS-- 306
++ E I+ N+EVIAV+Q + + +W PL V+ ++
Sbjct: 292 ISDEAKEILLNKEVIAVDQDALGKMGRRVISRGNIDVWCRPLVDGCYAVVFFNKAVGGGP 351
Query: 307 SMMTAHWDDIGIKSN------NTLFEHRTL-ETKFQGKLTAKVDGHSCKMYELTPV 355
S +T ++IG LF HR L +G V+ M +PV
Sbjct: 352 SNVTVTLEEIGFTKTVEKYALRDLFLHRDLGNYNVRGTFVTSVNPTGVVMVTASPV 407
>gi|402082237|gb|EJT77382.1| hypothetical protein GGTG_07294 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 577
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 163/348 (46%), Gaps = 68/348 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+N F C +E+ I+ + D +++ C ++ D
Sbjct: 45 TPPMGWNSYNTFGCSPSEEHIKTSARGLVDLGFRDLGYKTVTVDCGWNGRD--------- 95
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNSLDNRIESNIAFEQ 129
QG L N+T FPSG +AL D+VH L S D + +++ FE+
Sbjct: 96 --RDAQGRLQWNETRFPSGGRALGDFVHGLGLRFGLYSGAGYLQCGSTD--LPASLGFEE 151
Query: 130 ---------GIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
G D LKYDNC+ + + + R+ M LK+ GR I + +C
Sbjct: 152 VDAASFAEWGGDSLKYDNCYATSRTVMVDSDSAEAKSPARFLKMGRLLKETGRDIAYWLC 211
Query: 170 EWG--DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+WG + PA W + SWR ++DI DSW + I + +A H +PG + D DML V
Sbjct: 212 QWGIGENVPA-WAAATGQSWRMSNDIYDSWRSIWRIVNQAVPHARHTRPGAYADLDMLVV 270
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ------- 278
G GG+ E HF +W+I K+PL +G D G ++ I+ N EVIA+NQ
Sbjct: 271 GLGGLTLEEERFHFGMWSIMKSPLHIGAVVDQGRTRPASLDIMRNREVIALNQDPLSRAA 330
Query: 279 --AINDQQ----IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
A+ D + +W G LSG R VV + + + S + +G+ S
Sbjct: 331 RLALRDTEGEWDVWVGDLSGGRKVVGVANWRNESQTVPLDLPFVGVAS 378
>gi|238578562|ref|XP_002388761.1| hypothetical protein MPER_12185 [Moniliophthora perniciosa FA553]
gi|215450334|gb|EEB89691.1| hypothetical protein MPER_12185 [Moniliophthora perniciosa FA553]
Length = 302
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 123/261 (47%), Gaps = 50/261 (19%)
Query: 87 CQGNLVANKTTFPSGTKALADYVHIL-----------------------ISNSLDNRIES 123
G N FPSG ALA +VH L SLD+ +
Sbjct: 42 ASGGFTWNTERFPSGIPALASFVHNLGLKFGIYSDNGYYSCDDQGGNAHWQGSLDHEEQD 101
Query: 124 NIAF-EQGIDYLKYDNCF----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+F E G DYLKYDNCF N RY M DAL G PI +S C WG
Sbjct: 102 AASFAEWGADYLKYDNCFAVSKTDFVNFNPPFELKPRYTRMRDALAATGHPILYSACNWG 161
Query: 173 DMRPA-LW-GSNIRNSWRTTDDISD--SWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
PA W G + NSWR ++DI +WA + I + A+PGG+ND DML VG
Sbjct: 162 VQDPARTWPGPTVANSWRMSNDIGPPATWASVFRILNQVVPITGFAEPGGFNDLDMLYVG 221
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------- 278
N G+ + E HF+ WA +K+PLL+G D+ ++ T+ I+ NE VIA+NQ
Sbjct: 222 NNGLSFPEQQTHFAFWAAAKSPLLIGIDLTKASSNTLNILKNERVIAINQDSLGKSISFK 281
Query: 279 --AINDQQIWAGPLSGNRIVV 297
ND +WAGPLS VV
Sbjct: 282 RRYTNDHDVWAGPLSDGSTVV 302
>gi|409074341|gb|EKM74742.1| hypothetical protein AGABI1DRAFT_80761 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 501
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 143/306 (46%), Gaps = 76/306 (24%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
+ +G+G P MGWN+W + C+INE I+ + DDC+ E RD
Sbjct: 94 VEDGVGRLPIMGWNTWYAYMCEINETIVLDNAKLVKSLGLLDVGYNYINIDDCYSEKERD 153
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------- 110
S GN+VA+K FPSG + L D +H
Sbjct: 154 SN--------------------------GNIVASKERFPSGMRNLTDQLHEMGFKAGIYS 187
Query: 111 -------ILISNSLDNRIESNIAFEQ--GIDYLKYDNC---FNDDTRPTI--RYRAMSDA 156
L S N F + G D LKYDNC F++ + + +Y+ MSDA
Sbjct: 188 DSGWFTCQLYPGSYQNEDRDITLFSEDWGFDLLKYDNCAVPFDEVIKEGMVGKYKRMSDA 247
Query: 157 L----KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
+ K+ G+ I +S+C+WG +P LW + +WRTTDD W ++I + N Y+
Sbjct: 248 IERLRKRTGKSILYSLCQWGREQPWLWAKKLGQTWRTTDDTGSDWGHSVSILNQNSFYSW 307
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
G+ D D L+VGN GM + E HF+ WA+ K+PL++ + T ET+ I+ N+E
Sbjct: 308 ANDFYGYGDMDFLQVGNEGMTFEESKSHFTAWALMKSPLVI---LTKATEETITILKNKE 364
Query: 273 VIAVNQ 278
+I ++Q
Sbjct: 365 LIDIHQ 370
>gi|340380709|ref|XP_003388864.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Amphimedon
queenslandica]
Length = 405
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 125/397 (31%), Positives = 178/397 (44%), Gaps = 81/397 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NG+ LTPPMGW +W + C N K + D+C GE ++ + + H+ + A
Sbjct: 25 NGVALTPPMGWLAWERYACNTNCK--DDPDNCIGE-----KLFMRMADHIASDGFKDAGY 77
Query: 82 PLPN----------QCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-- 129
N QG L A+ FPSG ALA+YVH S L I ++
Sbjct: 78 QYINIDDCWASKERDSQGRLQADPDRFPSGIAALANYVH---SKGLKLGIYADYGTHTCG 134
Query: 130 -------------------GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
GID LK D C N D P Y+ +SD L GR I FS C
Sbjct: 135 GYPGSGPSMKLDIDTFASWGIDMLKMDGCNANIDGMPQ-GYKQVSDYLNATGRHIVFS-C 192
Query: 170 EWGDMRPALW---GSNIR--------NSWRTTDDISDSWARMLTIADMNEVYADH----A 214
W PA W G + N WR DISDSW + +I D D A
Sbjct: 193 SW----PAYWVGSGKTVNYTYAGETCNLWRNYHDISDSWDSVTSIIDYYAKEEDDLIPAA 248
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDML VG+ G+ +E +IW+I APLL+ D+ ++ E+ I+ N +VI
Sbjct: 249 GPGHWNDPDMLIVGDFGLSVDEQQAQMAIWSILSAPLLMSNDLSTISDESKAILQNSDVI 308
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRKTFSSM--MTAHWDDIGIKSN 321
+V+Q + ++++ LS N + V+ + +F+ +T + +G+ S+
Sbjct: 309 SVSQDKLGHQGKVVATVGKVRVFSKLLSDNSMAVVFFNSGSFAGPQNVTVTFQTVGLSSS 368
Query: 322 NT----LFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
LF+ + L T FQG T VD S M ++TP
Sbjct: 369 KATVKDLFQQKDLGT-FQGSFTTPVDPSSVVMVKMTP 404
>gi|395776531|ref|ZP_10457046.1| alpha galactosidase precursor [Streptomyces acidiscabies 84-104]
Length = 542
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 169/404 (41%), Gaps = 99/404 (24%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C + E +R+ DDCW + RD+
Sbjct: 37 LTPPLGWNSWNSFGCGVTEAQVRQAADAMVSSGMRDAGYRYVVVDDCWFDPQRDAS---- 92
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---------------- 110
G L A+ T FP G KAL DY+H
Sbjct: 93 ----------------------GALRAHPTKFPGGMKALGDYIHGKGLKFGIYQVPGTRT 130
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C ++ TR R+ M DAL+
Sbjct: 131 CAQTSGGFPGSTGSRGHEAQDAATFASWGVDYLKYDWCSSEGTRDEQVARFGLMRDALRA 190
Query: 160 AGRPIFFSMCEWGDMRPALWGS-----NIRNSWRTTDDISDSWARMLT---------IAD 205
GRPI +S+ + A+ GS + + WRTT+D+ D W T + D
Sbjct: 191 TGRPIVYSINP--NSFHAITGSTYDWGQVADLWRTTEDLLDIWQNGNTNSYPMGVGNVLD 248
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
+N A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 249 VNAPLAAQSGPGHWNDPDMLVVGRPGLTLAESRSHFALWALMAAPLMAGNDIRTMSADVS 308
Query: 266 PIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
I+ N ++AV+Q + +++A PL+ + V L +R + ++ ++
Sbjct: 309 AILRNPRLLAVDQDPLGAGGRRVRDDGNTEVFAKPLADGSVAVGLFNRGSATATISTTAA 368
Query: 315 DIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
IG+ G L+A V H + ++ T L
Sbjct: 369 QIGLSGGPFTLTDLWTGATSAGSLSASVPAHGVAAFRVSGGTPL 412
>gi|367013943|ref|XP_003681471.1| hypothetical protein TDEL_0E00170 [Torulaspora delbrueckii]
gi|359749132|emb|CCE92260.1| hypothetical protein TDEL_0E00170 [Torulaspora delbrueckii]
Length = 390
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/297 (34%), Positives = 140/297 (47%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAFEQG-ID 132
LV + T FP+G K +AD++H S SL + + F Q +D
Sbjct: 2 LVPDSTKFPNGMKHVADHLHENDFLFGMYSSAGEYTCAGYSGSLGHEEDDAAFFAQNEVD 61
Query: 133 YLKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N P I RYRAMS+AL K GRPIF+S+C WG WGS I NSW
Sbjct: 62 YLKYDNCYNRGQFGSPEISFNRYRAMSEALNKTGRPIFYSLCNWGQDLTFYWGSGIANSW 121
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI+ + R ++ I + +A GGWND D L
Sbjct: 122 RISGDITADFDRPDSRCPCDGDEYDCAYAGFHCSIMNILNKAAPMGQNAGTGGWNDLDCL 181
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G + +E HFS+W+I K+P+++G DV NL + I V+A+NQ
Sbjct: 182 EVGVGNLTDDEEKAHFSMWSIVKSPMVIGADVRNLKPSSFSIYSQASVLAINQDPAGIPA 241
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G+R+V LL M A+ +DI + S
Sbjct: 242 VRVWKRSVPETDQYGQGEIQLWSGPLDNGDRVVALLNGGMK-ERPMVAYLEDIFVDS 297
>gi|323456837|gb|EGB12703.1| hypothetical protein AURANDRAFT_35927 [Aureococcus anophagefferens]
Length = 377
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 166/371 (44%), Gaps = 56/371 (15%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MG+N+WN+FHC ++E+ + T I L + + + + G
Sbjct: 1 MGYNTWNYFHCNVDERAVVRTARL--------LIDLGLAAKGYAYVNVDDCWQVGRDGAG 52
Query: 90 NLVANKTTFPSGTKALADYVHIL----------------------ISNSLDNRIESNIAF 127
N+VA+ FP G KA+AD VH L + + D ++ AF
Sbjct: 53 NIVADPARFPGGMKAVADAVHALGMKYGLYTASHEFTCQGRPGSYLHETRD--ADAYCAF 110
Query: 128 EQGIDYLKYDNCFNDDTRPTIRYR-------AMSDALKKAGRPIFFSMCEWGDM-RPALW 179
E +DY+K D C R + A + + GR + S+ G + W
Sbjct: 111 E--VDYVKIDACGGRGYAKKDRRKSWQLFRDAFDECHARTGRAVVESVESCGTVDECGAW 168
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVV 239
++ + WRTT+DI ++WA +L AD E A PG WNDPDML VGN G+ +E
Sbjct: 169 VGDVAHLWRTTNDIQNNWASVLANADATERLWSLAGPGHWNDPDMLLVGNVGLDADEARA 228
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AINDQQIWAGP 289
HFSIW + APLL+G D+ L+AE + I+GNE +IA++Q +++W
Sbjct: 229 HFSIWCMLAAPLLIGTDLHALSAENLAILGNEHLIAIDQDPLGFQGRKVVRGAREVWMKQ 288
Query: 290 LSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK----FQGKLTAKVDGH 345
L+ + V+L +R ++ ++A D +G+ + K G A V H
Sbjct: 289 LANGDVAVMLLNRDARAARVSAKLDAVGLAAGRRYAVFDCWRGKAAGTAAGSYAADVPRH 348
Query: 346 SCKMYELTPVT 356
+ L+P T
Sbjct: 349 GAVVVRLSPDT 359
>gi|288803470|ref|ZP_06408902.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella melaninogenica D18]
gi|288334080|gb|EFC72523.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella melaninogenica D18]
Length = 409
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 118/390 (30%), Positives = 178/390 (45%), Gaps = 71/390 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + INE++IR+ D + + D + D Q
Sbjct: 31 DSLALTPPMGFMTWNKYKEDINEQLIRQIAD---KMAADGYAEAGYKYIFIDDAWQGG-- 85
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ 129
+ N++ + FPSG KALADYVH S L I S+ A FE+
Sbjct: 86 ---RDKRNNIIPDPKKFPSGIKALADYVH---SKGLLLGIYSDAAQLTCAGYTASYGFEE 139
Query: 130 ---------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
GIDYLKYD C +D RY+ M+DAL+ +GR I +CEWG + P L
Sbjct: 140 QDAKTFAEWGIDYLKYDYCHAPSDSAVAHERYKKMADALQNSGRKIALGVCEWGQLNPEL 199
Query: 179 WGSNIRNS-WRTTDDISDSWARMLT--------IADMNEVYADHAKPGGWNDPDML---- 225
W S WR + D+ D W ++ I ++ E +A PG W D DML
Sbjct: 200 WARQAGGSLWRVSFDVRDMWKDIVKQGGMGIIDIINITEPLYKYAGPGHWLDMDMLVVGL 259
Query: 226 --------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
++G G Y EY S+W + +PL + D+ N AET I+ N+E+IA+N
Sbjct: 260 DGKGGPSSDLGGVGCTYTEYQTQMSMWCMFASPLAVSHDILNENAETRRILLNKEIIAIN 319
Query: 278 Q-AINDQ----------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG-IKSNN--T 323
Q A+ + +++ LSGNR + + + + +G K N
Sbjct: 320 QDALGEAAHRVDFPGACRVYLRNLSGNRQAIAIMNPSDTPQRVQLPLSILGNAKEYNFRD 379
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
++EH+T T+ + A + H K++ +T
Sbjct: 380 VWEHKT--TRQRKAWQATLQPHETKVFTVT 407
>gi|358375244|dbj|GAA91829.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Aspergillus
kawachii IFO 4308]
Length = 373
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 108/353 (30%), Positives = 167/353 (47%), Gaps = 65/353 (18%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MGWNSWN F +N I+++ + R + L + + + L A G
Sbjct: 1 MGWNSWNAFKATVNYTIVQDVIE------RFYTLGLKEAGYEYVLLDDGWAS-YNRTSDG 53
Query: 90 NLVANKTTFPSGTKALADYVHI------LISNS-------------LDNRIESNIAFEQG 130
L AN T+FP G KALA+ VH L +S + R A G
Sbjct: 54 YLQANATSFPQGIKALAEEVHAEGLKLGLYGDSGHYTCAWRPGSWGYEERDAQTFA-GWG 112
Query: 131 IDYLKYDNC--FNDDTR-PTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
+DYLKYDNC F T P IR+ AM +AL +GR IF+S+C WG P WG +I +S+
Sbjct: 113 VDYLKYDNCGGFQSMTEAPQIRFGAMKNALALSGRDIFYSVCGWGYQFPWHWGGDIGHSY 172
Query: 188 RTTDDISDSWAR--------------------MLTIADMNEVYADHAKPGGWNDPDMLEV 227
R + DI+ S+ +LTI + + + + G W D DMLEV
Sbjct: 173 RMSGDITTSFTNETECQCKTAYCLNTGYAGCSVLTIINKMKEISQYQTRGHWLDMDMLEV 232
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ--- 283
GN N+ HF+ WA K+PL++G D+ L+ +++ ++ N +I+VNQ A+ +
Sbjct: 233 GNANFTLNQQQTHFAFWAALKSPLIIGADLSKLSNDSLGVLTNTAIISVNQDALGEPVTY 292
Query: 284 ----------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE 326
Q+WAG + +V+LL + K++ ++ + +G+ S + E
Sbjct: 293 REAHSKEGLFQVWAGNVEAGYVVLLL-NEKSYPQTVSLSFASLGLGSPQKVVE 344
>gi|409074255|gb|EKM74659.1| hypothetical protein AGABI1DRAFT_133014 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 410
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 142/294 (48%), Gaps = 39/294 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
S+ G P +GWN+WN +HC+I+E + + E + L + + + ++ +
Sbjct: 24 SHETGRLPALGWNTWNAYHCEIDEAKVLAAAHSFVE------LGLKDAGYEYVNIDDCWS 77
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIES 123
G +V + T FP+G +A+ VH L SL N +
Sbjct: 78 VKDSRDSSGRIVPDPTRFPNGIIGVANEVHDLGLKIGIYSDAGTQTCAGYPASLGNELID 137
Query: 124 NIAFEQ-GIDYLKYDNC-----FNDDTRPT----------IRYRAMSDALKKAGRPIFFS 167
F GIDYLKYDNC + D P IRYR M+DAL K RPI
Sbjct: 138 VQTFAGWGIDYLKYDNCNVPSNWTDAPNPPDGDWYNSNSGIRYRRMTDALSKVSRPIQLD 197
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+C WG + WG+ +SWR + D + +W+ ++ I +N + D G ND DM+E+
Sbjct: 198 VCIWGQAQVWTWGARTGHSWRMSGDATPTWSYIIDIIKINVDHLDTIDFFGHNDMDMMEI 257
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
GNG + E HF+ WA K+P+LLG D+ L +E + II N+E++A +Q N
Sbjct: 258 GNGDLTIQEERTHFAAWAFFKSPILLGTDLSQLNSEQVAIITNKELLAFSQDPN 311
>gi|386837140|ref|YP_006242198.1| alpha galactosidase [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374097441|gb|AEY86325.1| alpha galactosidase precursor [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451790498|gb|AGF60547.1| alpha galactosidase precursor [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 548
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/406 (29%), Positives = 175/406 (43%), Gaps = 101/406 (24%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RD+
Sbjct: 41 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMRDAGYRYVVVDDCWFDPQRDAA---- 96
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL + FPSG +AL DY+H
Sbjct: 97 ----------------------GNLRGDPAKFPSGMRALGDYIHGKGLKFGIYQVPGDRT 134
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C + TR R+ M DAL+
Sbjct: 135 CAQTTGAYPGSTGSRGHEAQDAATFASWGVDYLKYDWCSSSGTRDEQVARFTLMRDALRA 194
Query: 160 AGRPIFFSMCEWGDMRPALWGS-----NIRNSWRTTDDISDSWARMLT---------IAD 205
GRPI +S+ + A+ GS + + WRTT+D+ D W T + D
Sbjct: 195 TGRPIVYSINP--NSFHAITGSTYDWGQVADLWRTTEDLLDIWQNGNTNSYPMGVGNVLD 252
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
+ A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 253 VTAPLAAQSGPGHWNDPDMLVVGRPGLTLTESRSHFALWALLSAPLMAGNDIRTMSADVS 312
Query: 266 PIIGNEEVIAVNQ--------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
I+ N ++AVNQ + D +++A PLS + V L +R ++ +TA
Sbjct: 313 AILRNPRLLAVNQDPLGAGGRRVRDDGTTEVFAKPLSDGSVAVGLFNRGGGTATVTATAA 372
Query: 315 DIGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELTPVTKL 358
+G+ TL + T T G+++A V H ++ +T + L
Sbjct: 373 QVGLSGGPFTLTDLWTGTTSSTSGQISASVPAHGVAVFRVTGGSPL 418
>gi|426197061|gb|EKV46988.1| hypothetical protein AGABI2DRAFT_70106 [Agaricus bisporus var.
bisporus H97]
Length = 410
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 142/294 (48%), Gaps = 39/294 (13%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
S+ G P +GWN+WN +HC+I+E + + E + L + + + ++ +
Sbjct: 24 SHETGRLPALGWNTWNAYHCEIDEAKVLAAAHSFVE------LGLKDAGYEYVNIDDCWS 77
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIES 123
G +V + T FP+G +A+ VH L SL N +
Sbjct: 78 VKDSRDSSGRIVPDPTRFPNGIIGVANEVHDLGLKIGIYSDAGTQTCAGYPASLGNELID 137
Query: 124 NIAFEQ-GIDYLKYDNC-----FNDDTRPT----------IRYRAMSDALKKAGRPIFFS 167
F GIDYLKYDNC + D P IRYR M+DAL K RPI
Sbjct: 138 VQTFAGWGIDYLKYDNCNVPSNWTDAPNPPDGDWYNSNSGIRYRRMTDALSKVSRPIQLD 197
Query: 168 MCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+C WG + WG+ +SWR + D + +W+ ++ I +N + D G ND DM+E+
Sbjct: 198 VCIWGQAQVWTWGARTGHSWRMSGDATPTWSYIMDIIKINVDHLDTIDFFGHNDMDMMEI 257
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
GNG + E HF+ WA K+P+LLG D+ L +E + II N+E++A +Q N
Sbjct: 258 GNGDLTIQEERTHFAAWAFFKSPILLGTDLSQLNSEQVAIITNKELLAFSQDPN 311
>gi|167750142|ref|ZP_02422269.1| hypothetical protein EUBSIR_01111 [Eubacterium siraeum DSM 15702]
gi|167656885|gb|EDS01015.1| alpha-galactosidase [Eubacterium siraeum DSM 15702]
gi|291556563|emb|CBL33680.1| Alpha-galactosidase [Eubacterium siraeum V10Sc8a]
Length = 396
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 188/410 (45%), Gaps = 93/410 (22%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG+TPPMGWNSWN F +INE++I++ D + E+ L + + + + ++
Sbjct: 5 LGMTPPMGWNSWNTFTWEINEELIKQAADAFIENG------LKDAGYEYVVIDDCWSEKQ 58
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIA 126
N+ +G LV + FP+G K +ADYVH S ++ +
Sbjct: 59 RNE-KGELVPDHWKFPNGIKPVADYVHSKGLKFGIYSCAGTHTCAGHPGSFEHEFQDAET 117
Query: 127 F-EQGIDYLKYDNCFNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
F E G+DYLKYD C+ P I Y+ MS AL+ GRPI FS C WG+ W IR
Sbjct: 118 FAEWGVDYLKYDYCYKPVHIPGEILYKRMSTALRNCGRPIMFSACNWGNDDIYKW---IR 174
Query: 185 NS----WRTTDDISDSWA--RMLTIADM-NEVYADHAKPGGW---NDPDMLEVGN----- 229
S +R+T DI D+W + L ++ M NE Y GG ND DML VG
Sbjct: 175 ESGAHLFRSTGDIQDNWESIKRLALSQMGNECY------GGCFCHNDIDMLVVGMHGGSN 228
Query: 230 --------------------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
GG EY HFS+WA+ +PL++GCD+ ++T T I+
Sbjct: 229 NQYINGESDDQFGNKEGKGLGGCTDTEYKTHFSLWAMMNSPLMMGCDIRHMTDRTKEILT 288
Query: 270 NEEVIAVNQAI------------NDQQIWA--GPLSGNRIVVLLESRKTFSSMMTAHWDD 315
N+EVIA+NQ I N + +++ PL+ + + + + M+ + D
Sbjct: 289 NKEVIAINQDIECRGPYRISQWNNPENVFSLVKPLANGDYAIGMYNFSDVPAEMSLQFWD 348
Query: 316 IGIKSN-------NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
IG+ + + + H + +F + + H C +Y V KL
Sbjct: 349 IGLSTAAGRGLEMHDCWSHEDI-GRFSERYVTTLQPHDCAVY-TAKVVKL 396
>gi|393212672|gb|EJC98171.1| glycoside hydrolase [Fomitiporia mediterranea MF3/22]
Length = 444
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 142/306 (46%), Gaps = 73/306 (23%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
LSNG+ P +G+N+WN + C I++ +I T DDCW E SRD
Sbjct: 34 LSNGVAKLPVLGYNTWNAYQCNIDQDLITTTAKLMQSLGLQDAGYTHVNIDDCWSEKSRD 93
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN----- 115
S GNLV +K F SG +L +H + N
Sbjct: 94 SS--------------------------GNLVPDKVRFSSGMNSLTGQLHSMGFNAGIYG 127
Query: 116 ------------SLDNRIESNIAF-EQGIDYLKYDNC---FNDDTRPTI--RYRAMSDAL 157
S N F + G DYLKYDNC ++D R +Y+ M DA+
Sbjct: 128 DSGWFTCAGYPGSFQNEARDAKTFLDWGFDYLKYDNCAIPYDDILRENTMGKYQRMRDAI 187
Query: 158 K----KAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
+G+P + FS+CEWG + +WG SWRTT+DI +W + +I + N
Sbjct: 188 ADLAASSGKPPLIFSLCEWGWSQVWIWGKQFGESWRTTNDIQPTWESLASIINFNSFITM 247
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
G ND DM+++GNG M +E HF+ WA+ K+PLL+G ++ + + ++ N+E
Sbjct: 248 ATDFYGHNDMDMVQIGNGDMTVDEVKSHFTAWALMKSPLLIGANLSAIKDSDLEVLKNQE 307
Query: 273 VIAVNQ 278
++A+NQ
Sbjct: 308 ILAINQ 313
>gi|390960078|ref|YP_006423835.1| alpha-galactosidase [Terriglobus roseus DSM 18391]
gi|390414996|gb|AFL90500.1| alpha-galactosidase [Terriglobus roseus DSM 18391]
Length = 398
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 163/349 (46%), Gaps = 65/349 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWN F ++ + +++ D L S M D+ +
Sbjct: 29 LAATPPMGWNSWNWFAGKVTQDDVKQAAD------------LVVSTGMRDA--GYVYINI 74
Query: 84 PNQCQGNLVANKTTFPS----GTKALADYVH-----ILISNSLDN----RIESNIAFEQ- 129
+ QG A+ P+ K LADYVH + I +S + + E ++ EQ
Sbjct: 75 DDTWQGKRDASGVLHPNEKFPDMKGLADYVHSKGLKLGIYSSPGDLTCAKFEGSLGHEQQ 134
Query: 130 --------GIDYLKYDNCF----------NDDTRPTI----RYRAMSDALKKAGRPIFFS 167
GIDYLKYD C D + + Y M AL K GRPI +S
Sbjct: 135 DADLYASWGIDYLKYDLCGFRTKLDAEAPGDKLKQNVMMRDAYVKMHQALLKTGRPIVYS 194
Query: 168 MCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
+C++G WG + N WRTT D+ D++ + IA+ N +A PG WNDPDMLE
Sbjct: 195 LCQYGFDSVWQWGPEVGGNLWRTTGDVRDNFPSIALIAETNVGLGKYAGPGHWNDPDMLE 254
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ-- 283
VGNG + +E H +WA+ APLL G ++ LT E ++ N EV+A++Q ++ Q
Sbjct: 255 VGNGKLTMDENRTHMGMWAMLAAPLLAGNNLTKLTPEVTGVLTNREVVAIDQDSLGKQAE 314
Query: 284 --------QIWAGPLS-GNRIVVLLE--SRKTFSSMMTAHWDDIGIKSN 321
QIW+ PL+ G R + ++ TF + H + G+ S
Sbjct: 315 RVYQEGPVQIWSRPLADGGRALAVINFGEDTTFLRGIPLHLKEAGVTSG 363
>gi|386847959|ref|YP_006265972.1| ricin B lectin [Actinoplanes sp. SE50/110]
gi|359835463|gb|AEV83904.1| ricin B lectin [Actinoplanes sp. SE50/110]
Length = 548
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 143/320 (44%), Gaps = 90/320 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRD 60
L NG+ TPPMGWNSWN F C I+E +IR+ DDCW +RD
Sbjct: 37 LDNGVARTPPMGWNSWNTFGCNISESLIRQMADAVVSSGMRDLGYQYVVVDDCWFNPNRD 96
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LIS 114
S GNL + T FPSG KAL DY+H L
Sbjct: 97 SA--------------------------GNLQGDPTRFPSGMKALGDYLHAKGLKFGLYE 130
Query: 115 NSLDNRI----------------ESNIAFE---QGIDYLKYDNCFNDDT--RPTIRYRAM 153
LD E+ A + G+DYLKYD C T + M
Sbjct: 131 VPLDKTCAQYFNSYPGATGSQGHEAQDARQFAAWGVDYLKYDWCSPTGTINDQVTTFAKM 190
Query: 154 SDALKKAGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT----- 202
DAL GRPI +S+ + G R WG ++ N WRTT+DIS +W T
Sbjct: 191 RDALAATGRPIVYSINPNSVHAKTGPQRN--WG-DVANMWRTTEDISAAWDTGQTNGYPM 247
Query: 203 ----IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D+ A +A+PG +NDPDM+EVG G+ E HF++W + APL+ G D+
Sbjct: 248 GVKNIVDVTVPLAGYARPGQFNDPDMMEVGRSGLTDTEQRSHFALWTMLAAPLIAGNDLR 307
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
+++ T I+ N +IA++Q
Sbjct: 308 SMSTATQTILKNPRLIAIDQ 327
>gi|291531894|emb|CBK97479.1| Alpha-galactosidase [Eubacterium siraeum 70/3]
Length = 396
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 187/410 (45%), Gaps = 93/410 (22%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG+TPPMGWNSWN F +INE++I++ D + E+ L + + + + ++
Sbjct: 5 LGMTPPMGWNSWNTFTWEINEELIKQAADAFIENG------LKDAGYEYVVIDDCWSEKQ 58
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIA 126
N+ +G LV + FP+G K +ADYVH S ++ +
Sbjct: 59 RNE-KGELVPDHWKFPNGIKPVADYVHSKGLKFGIYSCAGTHTCAGHPGSFEHEFQDAET 117
Query: 127 F-EQGIDYLKYDNCFNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
F E G+DYLKYD C+ P I Y+ MS AL+ GRPI FS C WG+ W IR
Sbjct: 118 FAEWGVDYLKYDYCYKPVHIPGEILYKRMSTALRNCGRPIMFSACNWGNDDIYKW---IR 174
Query: 185 NS----WRTTDDISDSWA--RMLTIADM-NEVYADHAKPGGW---NDPDMLEVGN----- 229
S +R+T DI D+W + L ++ M NE Y GG ND DML VG
Sbjct: 175 ESGAHLFRSTGDIQDNWESIKRLALSQMGNECY------GGCFCHNDIDMLVVGMHGGSN 228
Query: 230 --------------------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
GG EY HFS+WA+ +PL++GCD+ ++T T I+
Sbjct: 229 NQYINGESDDQFGNKEGKGLGGCTDTEYKTHFSLWAMMNSPLMMGCDIRHMTDRTKEILT 288
Query: 270 NEEVIAVNQAI------------NDQQIWA--GPLSGNRIVVLLESRKTFSSMMTAHWDD 315
N+EVIA+NQ I N + +++ PL+ + + + + M+ + D
Sbjct: 289 NKEVIAINQDIECRGPYRISQWNNPENVFSLVKPLANGDYAIGMYNFSDVPAEMSLQFWD 348
Query: 316 IGIKSN-------NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
IG+ + + H + +F + + H C +Y V KL
Sbjct: 349 IGLSTAAGRGLEMRDCWSHEDI-GRFSERYVTTLQPHDCAVY-TAKVVKL 396
>gi|10863055|dbj|BAB16833.1| alpha-galactosidase [Lachancea kluyveri]
Length = 377
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 102/298 (34%), Positives = 135/298 (45%), Gaps = 66/298 (22%)
Query: 89 GNLVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-G 130
G+LV++ FP G K +AD +H +L SL + +F
Sbjct: 5 GHLVSDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWD 64
Query: 131 IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
IDYLKYDNCF+ T RY+AMSDAL K GRPIF+S+C WG P WGS I N
Sbjct: 65 IDYLKYDNCFSKGNFGTPQLSYDRYKAMSDALNKTGRPIFYSICNWGQDSPWYWGSAIAN 124
Query: 186 SWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPD 223
SWR + DI + R ++ I + A PGGWND D
Sbjct: 125 SWRISGDIYADFDRPDSRCPCSGDEYYCQLAGFHCSIMNILNKAAPLGQTAGPGGWNDLD 184
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
LEVG GG+ E HFS+WA K+PL++G ++ L A++ I N VIAVNQ
Sbjct: 185 ALEVGVGGLTDEEEKTHFSMWAAVKSPLVIGANLKTLDAKSFSIYANPAVIAVNQDAAGV 244
Query: 279 ----------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +V L + M A +DI + S
Sbjct: 245 PASRIWRYYVSETDEYGQGEIQLWSGPLKNGDQIVALVNGGNKPRPMNATLEDIFVDS 302
>gi|296128320|ref|YP_003635570.1| Ricin B lectin [Cellulomonas flavigena DSM 20109]
gi|296020135|gb|ADG73371.1| Ricin B lectin [Cellulomonas flavigena DSM 20109]
Length = 568
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 178/392 (45%), Gaps = 81/392 (20%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF----DSLTQIAAKPL 83
PP+GWNSWN F+C INE++IR+T D L+ + + + D Q P
Sbjct: 47 PPLGWNSWNTFYCNINEQMIRQTADAM------VSTGLAAAGYQYVVVDDCWMQDTRGP- 99
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHI-------------------------LISNSLD 118
GNL + + FPSG KAL DY+H SN +
Sbjct: 100 ----DGNLRPHTSRFPSGMKALGDYIHSKGLKFGLYHAPREKTCDQYFNNRPGTSSNGNE 155
Query: 119 NRIESNIAFEQGIDYLKYDNCFNDDTRPTIR-----YRAMSDALKKAGRPIFFSM----- 168
R ++ + G+DY+K+D C D R +I+ ++ DALK GRPI +S+
Sbjct: 156 TR-DAQLFASWGVDYVKHDWC---DPRGSIQEQVDLFKRFGDALKATGRPIVYSINPNSA 211
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWAR------------MLTIADMNEVYADHAKP 216
+ R + WG+ + WRT++D+ D+W+ + D+ E + +P
Sbjct: 212 HDNTAPRYSGWGA-FADMWRTSEDLKDAWSTGCPPSDQWCFVGITEALDVIEPMREWTRP 270
Query: 217 GGWNDPDMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
G +NDPDML VG G + E H S+WA+ APL++G DV N++A+ ++ N +V+A
Sbjct: 271 GQYNDPDMLMVGVRGTLSPTENRAHMSMWAMLSAPLIMGNDVRNMSADVRSVLTNRDVLA 330
Query: 276 VNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
++Q D ++WA PL+ V L +R + ++A + G+
Sbjct: 331 IDQDPLVRQADRVRDDGDAEVWAKPLADGSAAVALLNRGNSARSISATLAEAGLPGGTAS 390
Query: 325 FEH--RTLETKFQGKLTAKVDGHSCKMYELTP 354
+ + ++T V H +Y +TP
Sbjct: 391 YREVWSGATGQTSDRITTTVPAHGVALYRVTP 422
>gi|389750506|gb|EIM91677.1| glycoside hydrolase [Stereum hirsutum FP-91666 SS1]
Length = 310
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 90/257 (35%), Positives = 124/257 (48%), Gaps = 48/257 (18%)
Query: 89 GNLVANKTTFPSGTKALADYVHIL------------------------ISNSLDNRIESN 124
G N T PSG ALA +VH L + + + +++
Sbjct: 48 GGFTWNTTRIPSGVPALASFVHDLGLKFGVYSDAGYFSCDFVGGTAGWLGSLGNETLDAE 107
Query: 125 IAFEQGIDYLKYDNCFND------DTRPTI----RYRAMSDALKKAGRPIFFSMCEWGDM 174
G D LKYDNCF D P I Y AM +AL+ GRP+ FS+CEWG
Sbjct: 108 TFTSWGADLLKYDNCFAVSPMDFVDNDPPISLEPHYVAMRNALEATGRPVLFSICEWGLQ 167
Query: 175 RPALWGSNIRNSWRTTDDISD--SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
PA W S++ NSWR ++DI SW + I + A PGG+ND D+LEVGN G+
Sbjct: 168 DPARWASDVGNSWRISNDIGPPPSWDNLFRIINEVVPVTGFAGPGGFNDLDLLEVGNDGL 227
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------AI 280
E HF+ WA K+PL++ D+ + T +T+ I+ N+ +IA+NQ
Sbjct: 228 TVEEQKTHFAFWAAVKSPLMISTDLTSPTDDTLDILSNQRIIALNQDTLGASIAFKRRYT 287
Query: 281 NDQQIWAGPLSGNRIVV 297
ND +WAGPL+ VV
Sbjct: 288 NDHDVWAGPLADGSTVV 304
>gi|353243387|emb|CCA74935.1| related to alpha-galactosidase precursor [Piriformospora indica DSM
11827]
Length = 459
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 142/290 (48%), Gaps = 44/290 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE-SSRDSQI 63
+G P +GWNSWN + C INE I DDCW + RDS
Sbjct: 85 VGKLPALGWNSWNAYRCDINESKILSAANQMVSLGLKDAGYQYVNIDDCWSNINGRDSST 144
Query: 64 S-LSCSFHMF-DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRI 121
+ F+ F + + +A+K QG L+ + +GTK A Y L S+D
Sbjct: 145 QQIRPDFNKFPNGIASVASKV---HSQGLLLGIYSD--AGTKTCAGYPGSLGYESIDAAT 199
Query: 122 ESNIAFEQGIDYLKYDNCFN-------------DDTRPTIRYRAMSDALKKAGRPIFFSM 168
S+ GIDYLKYDNC+ ++ RYR MS AL RPI +S+
Sbjct: 200 FSS----WGIDYLKYDNCYVPSNWADSGSYSDWGQSKSATRYRQMSSALAAQSRPIQYSL 255
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
C WG+ + WG+++ SWR + D S +W+ + + + N + G ND DM+E+G
Sbjct: 256 CIWGNAQVWTWGASVGQSWRISGDSSPTWSYITGVINKNVGILGYTNFYGHNDMDMMEIG 315
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NG + E HF +WA K+P+LLG D+ L+ + + II N+E++A +Q
Sbjct: 316 NGALTLAEQRTHFLMWAALKSPILLGTDLSRLSTDQLNIIKNKELLAFSQ 365
>gi|347756013|ref|YP_004863576.1| alpha-galactosidase [Candidatus Chloracidobacterium thermophilum B]
gi|347588530|gb|AEP13059.1| Alpha-galactosidase [Candidatus Chloracidobacterium thermophilum B]
Length = 536
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 172/385 (44%), Gaps = 63/385 (16%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
N L LTPPMGWNSWN + Q++++ +R + R + + D Q
Sbjct: 140 NKLALTPPMGWNSWNVWGTQVSDEKVRAAAEA---LERTGLAACGYRYVCIDDGWQGRRT 196
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNSLDNRIES 123
P +G + N+ FP KAL D++H + + ++
Sbjct: 197 P-----EGVMQPNER-FPD-MKALGDWLHARGFLFGMYTSPGPFTCGRYLGSWRHEEADA 249
Query: 124 NIAFEQGIDYLKYDNC----FNDDTRPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ G+DYLK+D C P + Y M AL K R I +++C++G
Sbjct: 250 RLYASWGVDYLKHDWCSYEGIARQKTPEVLQQPYIVMRAALDKTDRDIVYAICQYGMGEV 309
Query: 177 ALWGSNIR---NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG-- 231
W N WRTT DI D+W + I + A A PGGWNDPDML +G G
Sbjct: 310 WTWARQPNIGGNLWRTTGDIEDTWQSVSEIGFRHSPLARFAGPGGWNDPDMLVLGVVGWG 369
Query: 232 -------MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
+ +E + H ++WA+ APL+LGCD+ L T ++ N EVI ++Q
Sbjct: 370 EKTRPTRLTPDEQITHMTLWALLAAPLILGCDLTRLDEFTRRLLTNPEVIGIDQDELGVP 429
Query: 279 ------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRT 329
A + ++WA PL+ R+ V L +R + +TA+W D+G++ T +++ R
Sbjct: 430 ATRRDTAQDGTEVWARPLADGRLAVGLFNRSNDTQTVTANWRDLGLRGRCTVRDVWQRRD 489
Query: 330 LETKFQGKLTAKVDGHSCKMYELTP 354
+ T F A V H ++ L P
Sbjct: 490 VGT-FDQVFAALVPPHGARLLLLRP 513
>gi|392389679|ref|YP_006426282.1| alpha-galactosidase [Ornithobacterium rhinotracheale DSM 15997]
gi|390520757|gb|AFL96488.1| alpha-galactosidase [Ornithobacterium rhinotracheale DSM 15997]
Length = 404
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 146/316 (46%), Gaps = 66/316 (20%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRE-TDDCWGESSRDSQISLSCSFHMFDSL 75
++LL + TPPMG+ +WN+F NE I+ DD RD L + D
Sbjct: 12 QSLLLSAQTETPPMGFMTWNYFGVDFNENDIKSLADDLVSTGLRD----LGYDYIFIDDG 67
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA--------- 126
Q + N++ + FPSG KALADYVH S L I S+ A
Sbjct: 68 WQGG-----RDNRNNIIPDPQKFPSGIKALADYVH---SKGLKIGIYSDAAPLTCAGYTA 119
Query: 127 ---FEQ---------GIDYLKYDNCFN--DDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
FE+ GIDYLKYD C D RY+ MSDALKK R I FS+CEWG
Sbjct: 120 SLNFEEQDAKTFADWGIDYLKYDYCGAPADSVTAKKRYKKMSDALKKTNRNIVFSICEWG 179
Query: 173 DMRPALWGSNIRNS-WRTTDDISDSW----------------ARMLTIADMNEVYADHAK 215
D +P W N + WRT+ DI D W A +L I ++N + A
Sbjct: 180 DRKPWHWAKNAGGTLWRTSADIRDKWKATEPYTTPEELHRWGAGILDILNINAPLDEFAG 239
Query: 216 PGGWNDPDMLEVGNGGMKY-------------NEYVVHFSIWAISKAPLLLGCDVGNLTA 262
G WNDPDML G G K EY S+W++ +PL++ D+ ++T
Sbjct: 240 NGYWNDPDMLVAGLYGKKGAPSTELNGHGCTDTEYQSQMSLWSLMASPLMISLDLKSMTP 299
Query: 263 ETMPIIGNEEVIAVNQ 278
+T I+ N +VIA++Q
Sbjct: 300 KTKEILMNPDVIAIDQ 315
>gi|322435356|ref|YP_004217568.1| alpha-galactosidase [Granulicella tundricola MP5ACTX9]
gi|321163083|gb|ADW68788.1| Alpha-galactosidase [Granulicella tundricola MP5ACTX9]
Length = 402
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 159/327 (48%), Gaps = 62/327 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKP 82
L TPPMGWNSWN F ++ +K +R+ D S RD+ ++
Sbjct: 28 LAATPPMGWNSWNWFAGKVTDKDVRQAADLLVSSGMRDAGYVYVNIDDTWEG-------- 79
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN----RIESNIAFEQ---- 129
G L N+ FP KALADYVH + I +S + R E + EQ
Sbjct: 80 -KRDSTGVLHTNEK-FPD-MKALADYVHSKGLKLGIYSSPGSQTCARFEGSFGHEQQDAD 136
Query: 130 -----GIDYLKYDNC-FNDD----TRP-------TIRYRAMSDA-------LKKAGRPIF 165
GIDYLKYD C F++ T P TI+Y+ M DA L K GRPI
Sbjct: 137 LYASWGIDYLKYDLCSFHNGNMRVTAPGDSAEAMTIQYKMMHDAYARMHQALLKTGRPIV 196
Query: 166 FSMCEWGDMRPALWGSNI-RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDM 224
+S+C++G WG + N WRTTDD++ ++ + I + +A PG WNDPDM
Sbjct: 197 YSLCQYGFDSVWQWGPEVGANLWRTTDDVNATFNSIALIGRDQAGLSRYAGPGHWNDPDM 256
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
LEVGNG + +E H +W++ APLL G ++ LT E I+ N E++A++Q
Sbjct: 257 LEVGNGKLTLDENRTHMGLWSMLAAPLLAGNNLSELTPEITAILTNREILAIDQDKLGHQ 316
Query: 279 -----AINDQQIWAGPLS-GNRIVVLL 299
A +IWA PL+ G+R + +
Sbjct: 317 ADRIYAEGPIEIWARPLADGSRALAIF 343
>gi|15020804|emb|CAC44626.1| alpha galactosidase a precursor [Takifugu rubripes]
Length = 429
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 155/340 (45%), Gaps = 71/340 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDC-----WGESSRDSQIS 64
L NGL LTP MGW W F C I+E++ + D W E+ +
Sbjct: 19 LDNGLALTPTMGWLHWERFMCNTDCDQDPDNCISERLFMQVADVMVKEGWKEAGYEYVCI 78
Query: 65 LSC-SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLD 118
C +H D+ QG L A+ FP G K LADYVH + I +
Sbjct: 79 DDCWPYHQRDN-------------QGRLQADPKRFPGGIKKLADYVHSRGLKLGIYADVG 125
Query: 119 NRI-------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIF 165
N+ ++ E G+D LK+D CF D T Y MS AL + GR I
Sbjct: 126 NKTCAGYPGSLGYYDTDAQTFAEWGVDLLKFDGCFMDWTLLGEGYMNMSRALNQTGRSIV 185
Query: 166 FS----MCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTI----ADMNEVYADHA 214
+S + EW +P + IR N WR + D+ DSW+ + +I AD + A
Sbjct: 186 YSCEWPLYEWRFQKPNY--TAIRETCNHWRNSADVLDSWSSIKSISAWTADYQDTIVPAA 243
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWNDPDML +GN G+ +++ ++WAI APL + D+ N+ + ++ NE VI
Sbjct: 244 GPGGWNDPDMLVIGNFGLSHDQQESQMALWAIMAAPLFMSNDLRNICPRSKELLQNEHVI 303
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+NQ ++ ++W PLS R+ + + +++
Sbjct: 304 GINQDPLGKQGYLSGKVDSFEVWERPLSKQRLAIAVLNKQ 343
>gi|410914371|ref|XP_003970661.1| PREDICTED: alpha-galactosidase A-like [Takifugu rubripes]
Length = 429
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 155/340 (45%), Gaps = 71/340 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDC-----WGESSRDSQIS 64
L NGL LTP MGW W F C I+E++ + D W E+ +
Sbjct: 19 LDNGLALTPTMGWLHWERFMCNTDCDQDPDNCISERLFMQVADVMVKEGWKEAGYEYVCI 78
Query: 65 LSC-SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLD 118
C +H D+ QG L A+ FP G K LADYVH + I +
Sbjct: 79 DDCWPYHQRDN-------------QGRLQADPKRFPGGIKKLADYVHSRGLKLGIYADVG 125
Query: 119 NRI-------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIF 165
N+ ++ E G+D LK+D CF D T Y MS AL + GR I
Sbjct: 126 NKTCAGYPGSLGYYDTDAQTFAEWGVDLLKFDGCFMDWTLLGEGYMNMSRALNQTGRSIV 185
Query: 166 FS----MCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTI----ADMNEVYADHA 214
+S + EW +P + IR N WR + D+ DSW+ + +I AD + A
Sbjct: 186 YSCEWPLYEWRFQKPNY--TAIRETCNHWRNSADVLDSWSSIKSISAWTADYQDTIVPAA 243
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWNDPDML +GN G+ +++ ++WAI APL + D+ N+ + ++ NE VI
Sbjct: 244 GPGGWNDPDMLVIGNFGLSHDQQESQMALWAIMAAPLFMSNDLRNICPRSKELLQNEHVI 303
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+NQ ++ ++W PLS R+ + + +++
Sbjct: 304 GINQDPLGKQGYLSGKVDSFEVWERPLSKQRLAIAVLNKQ 343
>gi|291228140|ref|XP_002734023.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 412
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 176/398 (44%), Gaps = 73/398 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L NGL TPPMGW W F C ++ EK+ R+ D + D + +
Sbjct: 12 LDNGLSRTPPMGWIDWERFRCNVDCNNDPNNCIGEKLFRDMID---RIAADGYKDVGYEY 68
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------LIS--- 114
D + N+ L AN T FP+G K+LADY H L++
Sbjct: 69 VNLDDCWMSHERDENNR----LTANSTRFPNGIKSLADYAHSKGLKLGIYEDYGLLTCAG 124
Query: 115 -----NSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
N +D I++ + GIDYLK+D C++ Y M+ AL K GRPI FS C
Sbjct: 125 YPGSLNYMD--IDAQTFADWGIDYLKFDGCYSLPWTMNKGYPEMTRALNKTGRPIVFS-C 181
Query: 170 EW------GDMRP--ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAKPG 217
EW ++P L N NSWR D+ DSW ++ I D D A PG
Sbjct: 182 EWPFHQQKASIKPNYKLIRKNC-NSWRNFKDVQDSWDSVMKIIDFYTKEQDTLIAAAGPG 240
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
GWNDPDML +G+ G+ Y++ ++W+I APL++ D+ ++ E I+ N+EVIAV+
Sbjct: 241 GWNDPDMLIIGDFGLSYDQSKSQMAMWSIFAAPLMMSTDLRTISDEAKEILLNKEVIAVD 300
Query: 278 Q-----------AINDQQIWAGPLSGNRIVVLLESRKT--FSSMMTAHWDDIGIKSN--- 321
Q + +W+ PL V+ ++ + S +T ++IG +
Sbjct: 301 QDALGKMGRRVISRGSVDVWSRPLVNGSYAVVFFNKGSDGGPSNVTVTLEEIGFTTTVQK 360
Query: 322 ---NTLFEHRTL-ETKFQGKLTAKVDGHSCKMYELTPV 355
LF HR L +G ++ M +PV
Sbjct: 361 YALRDLFLHRFLGNYDVRGTFETSINPTGVVMVTASPV 398
>gi|409045078|gb|EKM54559.1| glycoside hydrolase family 27 protein [Phanerochaete carnosa
HHB-10118-sp]
Length = 500
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 133/287 (46%), Gaps = 39/287 (13%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
G P +GWN+WN + C I+ II ++ + L + + + ++ A+
Sbjct: 119 GKLPALGWNAWNAYGCAISSDIIL------AAANDIVSLGLQAAGYQYVNIDDCWAEMQR 172
Query: 85 NQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRIESNIA 126
N +V + + FP G L +H L SL I++
Sbjct: 173 NSTTQRIVPDPSKFPDGIDGLVTQIHALELKAGIYSDAGTATCAGFPGSLGYEAIDAATF 232
Query: 127 FEQGIDYLKYDNCF-------------ND--DTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
E GIDYLKYDNC ND D+ IRYR M+ AL RP+ FS+C W
Sbjct: 233 TEWGIDYLKYDNCNVPANWTDAYTPQDNDWYDSNSAIRYRQMTAALAAQSRPVQFSLCIW 292
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
GD WGS + +SWR + D + SW + I N Y + ND DM+E+GNGG
Sbjct: 293 GDANVWQWGSRVGHSWRMSGDSTPSWGYITQILTTNAQYLSYVDFYAHNDMDMMEIGNGG 352
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+ E HF+ W K+P+LLG ++ NL + + I+ N E++A +Q
Sbjct: 353 LTIEEQRTHFAAWCFMKSPILLGTNLSNLNSTQLAIVTNPELLAFHQ 399
>gi|405950671|gb|EKC18643.1| Alpha-galactosidase A [Crassostrea gigas]
Length = 432
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 176/386 (45%), Gaps = 61/386 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGLGLTPPMGW +W F C N R+ D+C E + + + D+ +
Sbjct: 46 LDNGLGLTPPMGWLTWERFRC--NMDCDRDPDNCISEHLIKAMADIMAANGYRDAGYEYV 103
Query: 80 AKPLPNQC--------QGNLVANKTTFPSGTKALADYVHI------------------LI 113
+ + C G LV + FPSG K L+ YVH
Sbjct: 104 ---IIDDCWLGPERDTNGRLVPDPNRFPSGIKNLSAYVHSKNLKFGIYQDFGTKTCGGFP 160
Query: 114 SNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
+ + ++ + G+DYLK+D C ++ Y AM L K GR IF+S CEW
Sbjct: 161 GSEFYMQTDAETFADWGVDYLKFDGCNSNLYDFPSGYEAMGFFLNKTGRSIFYS-CEWPF 219
Query: 174 MRPA----LWGSNIR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDP 222
+ + + +R N WR D+ DSW+ +L I + ++++ +A PGGWNDP
Sbjct: 220 YKIVSGLQVDFAKVRKTCNLWRNYYDVQDSWSSVLDIVNFWGSNSKIFTKYAGPGGWNDP 279
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN----- 277
DM+ +GN G+ Y E V ++WAI APL++ D+ ++ + + ++ N +IA+N
Sbjct: 280 DMIILGNFGLSYEEERVQMAMWAILAAPLIMSNDLRHIRSSSKALLLNRNLIAINQDKLG 339
Query: 278 ------QAINDQQIWAGPLSGNRIVV--LLESRKTFSSMMTAHWDDIGIKSNNTLF-EHR 328
Q N QIW+ LS NR V L E+ ++ D+ + S+ F E
Sbjct: 340 IQGILLQKQNGIQIWSRDLSFNRTAVAFLNENSDGMPRFLSFSLVDLKLTSSTYNFTEAF 399
Query: 329 TLETKF----QGKLTAKVDGHSCKMY 350
++T Q L KV+ H +Y
Sbjct: 400 DMKTNILVTRQRPLKIKVNPHGINLY 425
>gi|374992690|ref|YP_004968185.1| Ricin B lectin [Streptomyces bingchenggensis BCW-1]
gi|297163342|gb|ADI13054.1| Ricin B lectin [Streptomyces bingchenggensis BCW-1]
Length = 535
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/402 (29%), Positives = 170/402 (42%), Gaps = 103/402 (25%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C + E +R+ DDCW + RD+
Sbjct: 28 LTPPLGWNSWNSFGCGVTEAQVRQAADAMVSSGMRDAGYRYVVVDDCWFDPQRDAA---- 83
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL AN T FP G KAL DY+H
Sbjct: 84 ----------------------GNLRANPTKFPGGMKALGDYIHQKGLKFGIYQVPNERT 121
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C + TR R+ M DAL+
Sbjct: 122 CAQGTGGYPGSTGSKGHETQDARTFASWGVDYLKYDWCSSAGTRDEQVARFTLMRDALRA 181
Query: 160 AGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT---------IA 204
GRPI +S+ GD WG + + WRTT+D+ D W T +
Sbjct: 182 TGRPIVYSINPNSFHAITGDRYN--WG-EVADLWRTTEDLLDIWQNGNTNSYPMGVGNVL 238
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
D+ A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 239 DVTAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWALMAAPLMAGNDIRTMSADV 298
Query: 265 MPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
I+ N ++AVNQ + +++A PLS + V L +R ++ +T
Sbjct: 299 SAILRNPRLLAVNQDPLGVGGRRVRDDGNTEVFAKPLSDGSVAVGLFNRGGGTATVTTTA 358
Query: 314 DDIGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELT 353
G+ + TL + T T G+++A V H + +T
Sbjct: 359 AQAGLSGGSFTLTDLWTGGTSSTTGQISASVPAHGVAAFRVT 400
>gi|358388302|gb|EHK25896.1| glycoside hydrolase family 27 protein [Trichoderma virens Gv29-8]
Length = 408
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 158/330 (47%), Gaps = 69/330 (20%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGWN+WN F INE +I+ + +S D+ L+ + + + +L +
Sbjct: 24 PQMGWNTWNTFKSNINETLIKSS----AKSLVDT--GLARAGYKYVNLDD-GWQAFTRDS 76
Query: 88 QGNLVANKTTFPSGTKALADYVHIL---ISNSLDNRIESNIAFEQG-------------- 130
G N T FPSG +ALAD+VH L I D I + AF G
Sbjct: 77 LGRQQPNSTRFPSGIRALADFVHDLGLKIGIYSDAGIY-DCAFYPGSYGYEERDANTYAS 135
Query: 131 --IDYLKYDNC--FNDDT-RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
IDYLK+DNC F+ T P R+ M DAL ++GR IF+S+C+WG+ P W S +
Sbjct: 136 WKIDYLKFDNCGGFHAGTVSPQERFLRMGDALNRSGRDIFYSLCQWGNQFPWHWAS-FSD 194
Query: 186 SWRTTDDISDSWAR---------------------MLTIADMNEVYADHAKPGGWNDPDM 224
S+R + DI ++ + I M E+ + PG W D DM
Sbjct: 195 SYRISGDIKSAFGEDSSGVCRSAYCLNTGYAGVSVLTMIRKMREL-SRFQTPGSWGDMDM 253
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
LE+G G M ++ HFS WA K+PL++G ++ + ++ I+ N+E+IA++Q
Sbjct: 254 LEIGTGTMNLHQEQTHFSFWAALKSPLIIGANINTINKASLDILLNKEIIAISQDDAGVA 313
Query: 279 --------AINDQQIWAGPLSG--NRIVVL 298
N+ Q+W GPL+ +R VVL
Sbjct: 314 VNYLPELSTENEVQVWGGPLASGKSRYVVL 343
>gi|317056235|ref|YP_004104702.1| alpha-galactosidase [Ruminococcus albus 7]
gi|315448504|gb|ADU22068.1| Alpha-galactosidase [Ruminococcus albus 7]
Length = 385
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 120/394 (30%), Positives = 179/394 (45%), Gaps = 72/394 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKP 82
L PPMGWNSWN F IN+++IR T D + ES +D+ + DS ++
Sbjct: 5 LAAVPPMGWNSWNTFGWNINDELIRTTADFFVESGLKDAGYEYIV---IDDSWSE----- 56
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNI 125
G LV +K FP+G KALADY+H S ++
Sbjct: 57 RQRDSSGRLVPDKNKFPNGMKALADYIHSKGLKFGMYSCAGTHTCGGFPGSFEHEFVDAE 116
Query: 126 AFEQ-GIDYLKYDNCFND-DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
F + +DYLKYD C+ YR MS AL+ GR I FS C WG+ W I
Sbjct: 117 TFAKWEVDYLKYDYCYKPLSADGATLYRRMSMALRSCGRDILFSACNWGNDNVLDW---I 173
Query: 184 RNS----WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN---------- 229
R+S +R+T DI D+W + +I D + ++ ND DML VG
Sbjct: 174 RSSGAHMFRSTGDIQDNWFSIKSIFDSQVSHQPYSGSYCHNDMDMLVVGMHGKGDNAEVL 233
Query: 230 ----GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI----- 280
GG EY HF++WAI +PL++GCD+ + A I+ N+++I++NQ I
Sbjct: 234 GSVVGGCTDIEYKTHFALWAIMNSPLMIGCDIRKMDAAAKEILTNKDIISINQDIECRGP 293
Query: 281 -------NDQQIWA--GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK--SNNTL----- 324
N + ++A PLS + L + + M+ + DIG+ S L
Sbjct: 294 YIIRQWNNPENVFALVKPLSTGDYAIGLFNLSDKRAEMSLQFYDIGLPVFSGRGLDMYDC 353
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ H ++ T + + ++D H C M+ + V KL
Sbjct: 354 YSHSSVGT-YTERYVTQLDSHDCAMF-IAKVVKL 385
>gi|219842113|dbj|BAH10648.1| beta-L-arabinopyranosidase/alfa-D- galactopyranosidase [Fusarium
oxysporum]
Length = 540
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 165/344 (47%), Gaps = 62/344 (18%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+NH++C+ +E II+ D + S +++ C + D
Sbjct: 22 TPPMGWNSYNHYNCRPSEDIIKINAKGLVDLGFKDLGYSIVTVDCGWPSRD--------- 72
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIE 122
+G L N+T FPSG KAL +Y+H L I SLD I+
Sbjct: 73 --RDSKGRLQWNETLFPSGPKALGEYIHDLGLEFGLYSGAGYLQCGSTDIPASLDYEEID 130
Query: 123 SNIAFEQGIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ E G D LKYDNC+ + + + R+ M+ A+ + R I + +C+W
Sbjct: 131 AKSFAEWGGDTLKYDNCYATSKTNMVDADSAEAKSPNRFIKMAAAINETDRDIKYFLCQW 190
Query: 172 GDMRPA-LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
G W + + NSWR ++DI ++W + I + +A + PG + D DML +G G
Sbjct: 191 GIGEDVPQWAAPLGNSWRMSNDIFNAWRAIWRITNQVVAHAKYNGPGAFADMDMLIIGLG 250
Query: 231 GMKYNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQ---------- 278
+ ++E HF W++ K+PL++G D + AE++ I+ N+EVIA+NQ
Sbjct: 251 ALSHDEERFHFGFWSMMKSPLIIGGVMDAKQIPAESLEIMSNKEVIAINQDPLAEAAKLV 310
Query: 279 ---AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK 319
+ IWAG LS NR V+ + + K + + IG++
Sbjct: 311 IRYTEEEWDIWAGNLSSNRKVLGVLNWKNETQTVKVDLSLIGVE 354
>gi|332981654|ref|YP_004463095.1| alpha-galactosidase [Mahella australiensis 50-1 BON]
gi|332699332|gb|AEE96273.1| Alpha-galactosidase [Mahella australiensis 50-1 BON]
Length = 377
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 121/386 (31%), Positives = 175/386 (45%), Gaps = 66/386 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
L TPPMGWNSWN F I+EK+I+E D W E + L + + + + A+
Sbjct: 2 LAHTPPMGWNSWNTFGENISEKLIKEIADDWIE------LGLLDAGYQYLVIDDCWAEKK 55
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIA 126
+ G LV + FP+G K ++DYVH SL++
Sbjct: 56 RGK-DGLLVPDHNKFPNGMKVVSDYVHSKGLKFGMYSCVGTMTCAGYPGSLEHEFVDAET 114
Query: 127 FEQ-GIDYLKYDNCFNDDTRPT-IRYRAMSDALKKAGRPIFFSMCEWGDMRPALW-GSNI 183
F G+DYLKYD C+ P I Y+ M ALK GR I FS C WG W S
Sbjct: 115 FAMWGVDYLKYDYCYKPHHIPGHILYKRMGMALKNCGRDILFSACNWGADDVYAWIRSTG 174
Query: 184 RNSWRTTDDISDSWARMLTIA----DMNEVYADHAKPGGWNDPDML--------EVGNGG 231
+ +R+T DI D W + IA D + A H +ND DML VG GG
Sbjct: 175 AHMYRSTGDIEDKWESIRDIALSQMDKDCYSASHC----YNDMDMLVVGMYGKGNVGRGG 230
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---IWAG 288
EY HFS+W + +PL++GCD+ + ET+ I+ N+E+IA+NQ +Q I G
Sbjct: 231 CSDEEYRTHFSLWCMMNSPLMIGCDIRAMNRETVNILTNKEIIALNQDPEGRQAYLIERG 290
Query: 289 PLSGNRIVVLLE--SRKTFS----------SMMTAHWDDIGIKS-------NNTLFEHRT 329
+ N + VL++ S +F + + + DIG+ S L+EH
Sbjct: 291 NIGYNNVKVLVKPISDGSFGIGFFNMGDTDARGSLQFWDIGLPSLTGYGFKMRDLWEHED 350
Query: 330 LETKFQGKLTAKVDGHSCKMYELTPV 355
+ ++ T + H CK++ V
Sbjct: 351 IGV-YKENYTCDLKPHHCKVFRAKLV 375
>gi|302346022|ref|YP_003814375.1| alpha-galactosidase [Prevotella melaninogenica ATCC 25845]
gi|302149805|gb|ADK96067.1| alpha-galactosidase [Prevotella melaninogenica ATCC 25845]
Length = 409
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 141/301 (46%), Gaps = 55/301 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + INE++IR+ D + + D + D Q
Sbjct: 31 DSLALTPPMGFMTWNKYKEDINEQLIRQIAD---KMAADGYAEAGYKYIFIDDAWQGG-- 85
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ 129
+ N++ + FPSG KALADYVH S L I S+ A FE+
Sbjct: 86 ---RDKRNNIIPDPKKFPSGIKALADYVH---SKGLLLGIYSDAAQLTCAGYTASYDFEE 139
Query: 130 ---------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
GIDYLKYD C +D RY+ M+DAL+ +GR I +CEWG + P +
Sbjct: 140 QDAKTFAEWGIDYLKYDYCHAPSDSAIAHKRYKKMADALQNSGRKIALGVCEWGQLNPEM 199
Query: 179 WGSNIRNS-WRTTDDISDSWARMLT--------IADMNEVYADHAKPGGWNDPDML---- 225
W S WR + D+ D W ++ I ++ E +A P W D DML
Sbjct: 200 WARQAGGSLWRVSFDVRDMWKDIVKQGGMGIIDIINITEPLYKYAGPSHWLDMDMLVVGL 259
Query: 226 --------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
++G G Y EY S+W + +PL + D+ N AET I+ N+E+IA+N
Sbjct: 260 DGKGGPSSDLGGVGCTYTEYQTQMSMWCMFASPLAVSHDILNENAETRHILLNKEIIAIN 319
Query: 278 Q 278
Q
Sbjct: 320 Q 320
>gi|255955485|ref|XP_002568495.1| Pc21g14820 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590206|emb|CAP96379.1| Pc21g14820 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 436
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 161/374 (43%), Gaps = 85/374 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
N +G P +GWN+WN F C I+ KII ++ +D L + D +
Sbjct: 24 NDVGKLPALGWNTWNAFGCDIDSTKIITAANEVINRGLKD----LGYEYINIDDCWSV-- 77
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNR-------------IE 122
K ++ ++ + T FP G +A VH L I +S ++
Sbjct: 78 KNTRDKSTNRMIPDPTKFPDGISGVAKEVHDLGLKIGIYSSAGEETCAHYPASLGYEAVD 137
Query: 123 SNIAFEQGIDYLKYDNC--------------------FNDDTRPTI-------------- 148
+ E GIDYLKYDNC + D T P +
Sbjct: 138 AQSFAEWGIDYLKYDNCGVPSNWTDEYTYCVPDGDGPYPDGTCPDLTNPAPEGYDWKTSN 197
Query: 149 ---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
RY AM DAL R IF+S+C WG WG + NSWR + DI+ WAR+ IA+
Sbjct: 198 TYKRYAAMRDALLDVERTIFYSLCGWGQANVNSWGMEVGNSWRMSGDITVDWARIAQIAN 257
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
N Y DH G DPDMLEVGNG + E HF++WA+ K+PL++G + ++ +
Sbjct: 258 ENSFYMDHVNFWGRADPDMLEVGNGDLTIEENRAHFALWAVMKSPLIIGTALDSIDDAHL 317
Query: 266 PIIGNEEVIAVNQ----------------------AINDQQIWAGPLSG-NRIVVLLESR 302
I+ N+ ++ NQ ++ + W+G S + VVL+ +
Sbjct: 318 AILKNKYLLDFNQDPVIGTPARPYKWGYNPDRTFDPVHPAEYWSGASSTLDGTVVLMLNS 377
Query: 303 KTFSSMMTAHWDDI 316
+ +S TA W +I
Sbjct: 378 ENVTSTRTATWREI 391
>gi|383777966|ref|YP_005462532.1| putative alpha-galactosidase [Actinoplanes missouriensis 431]
gi|381371198|dbj|BAL88016.1| putative alpha-galactosidase [Actinoplanes missouriensis 431]
Length = 540
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 167/385 (43%), Gaps = 69/385 (17%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPN 85
+ PP+GWNSWN F C I+E IR D S + + + D Q +
Sbjct: 33 VRPPLGWNSWNTFGCDISEAKIRAAADAMVSSGMKAA---GYQYVVVDDCWQATTR---- 85
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL------------------------ISNSLDNRI 121
GNL ++ FPSG KAL DY+H + +L +
Sbjct: 86 DAAGNLRSDPVRFPSGMKALGDYIHGKGLKFGIYQAPREETCAQYFNALGGATGALGHER 145
Query: 122 ESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIRYRAMSDALKKAGRPIFFSM---CEWG 172
+ F G+D+LKYD C ND + M DAL+ GRPI +S+
Sbjct: 146 QDATTFASWGVDFLKYDWCSPWGTLNDQI---AGFTTMRDALRATGRPIVYSINSNSAHT 202
Query: 173 DMRPAL-WGSNIRNSWRTTDDISDSWAR---------MLTIADMNEVYADHAKPGGWNDP 222
+ P+ WG I + WRTT+DI+D+W + I D+ A P WNDP
Sbjct: 203 NTGPSYDWGP-IADMWRTTEDITDTWTSGCRADCFMGVTEILDVQAPLYPRAGPQHWNDP 261
Query: 223 DMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
DMLEVG G E HFS+WAI APL+ G D+ ++A+ ++ N +V+A+NQ
Sbjct: 262 DMLEVGVRGTFTPTENRAHFSMWAIMAAPLIAGNDITTMSADVRSVLTNPDVLAINQDSA 321
Query: 282 DQQ-----------IWAGPLSGNRIVVLLESRKTFSSMM--TAHWDDIGIKSNNTLFEHR 328
+Q +WA LS V L +R ++ + TA +G S LF+
Sbjct: 322 GRQAQRVRDSGETEVWAKTLSDGSAAVALLNRSNSAATVSTTAAEAGLGAASGYQLFDVW 381
Query: 329 TLETK-FQGKLTAKVDGHSCKMYEL 352
T + ++A V H MY +
Sbjct: 382 TKAARNTSSAISATVPPHGVVMYRV 406
>gi|358367854|dbj|GAA84472.1| alpha-galactosidase (AglB) [Aspergillus kawachii IFO 4308]
Length = 442
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 162/375 (43%), Gaps = 86/375 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G TP +GWNSWN + C I+ +KI+ ++ +D L + D +
Sbjct: 21 DGVGRTPALGWNSWNAYSCDIDADKIVTAANEVVNLGLKD----LGYEYINIDDCWSV-- 74
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD-------------NRIE 122
K N ++ + FP+G +AD VH L I +S I+
Sbjct: 75 KSGRNTTTKRIIPDPDKFPNGISGVADQVHALGLKLGIYSSAGLTTCAGYPASLGYEEID 134
Query: 123 SNIAFEQGIDYLKYDNC-----FND------------------------DTRPT------ 147
+ E G+DYLKYDNC D D PT
Sbjct: 135 AQSFAEWGVDYLKYDNCGVPTNLTDQYTYCVPDSTDGSNYPNGTCVDLTDAAPTGYDWAT 194
Query: 148 ----IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY+ M DAL R I +S+C+WG WG+ NSWR + DI+ +W R+ I
Sbjct: 195 STTAKRYQRMRDALLSVNRTILYSLCDWGQADVNAWGNETGNSWRMSGDITATWTRIAEI 254
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A+ N Y ++A G+ DPDMLEVGNG + E HF++WA+ KAPL++G + ++
Sbjct: 255 ANENSFYMNYANFWGYPDPDMLEVGNGNLTLAENRAHFALWAMMKAPLIIGTPLDSIDTS 314
Query: 264 TMPIIGNEEVIAVNQ----------------------AINDQQIWAGPLSGNRIVVLLES 301
+ I+ N+ ++ +Q + + W+GP S + VL+ +
Sbjct: 315 HLSILLNKPLLTFHQDDVIGRPAYPYKWGYNPDWTFDPEHPAEYWSGPTSSGEVFVLMLN 374
Query: 302 RKTFSSMMTAHWDDI 316
+ TA W ++
Sbjct: 375 SEDGVKTRTALWGEV 389
>gi|373952836|ref|ZP_09612796.1| Glycosyl hydrolase family 98 putative carbohydrate binding module
[Mucilaginibacter paludis DSM 18603]
gi|373889436|gb|EHQ25333.1| Glycosyl hydrolase family 98 putative carbohydrate binding module
[Mucilaginibacter paludis DSM 18603]
Length = 678
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 127/444 (28%), Positives = 199/444 (44%), Gaps = 107/444 (24%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET------- 50
F V +A + G ++ + + LTPPMGWNSWN F Q++ EK+ R
Sbjct: 251 FQVVLAATNGRGKTKKTMRIVCGERIALTPPMGWNSWNCFADQVSAEKVKRAAKAMVQSG 310
Query: 51 -----------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFP 99
DD W +++RDS+ S + D GN+V N FP
Sbjct: 311 LINHGWTYINIDDFW-QNNRDSK-DPSLRGKLRDE-------------AGNIVPN-VRFP 354
Query: 100 SGTKALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF- 140
KALAD +H L + + + ++ + G DYLKYD C
Sbjct: 355 D-MKALADTIHSLGLKAGLYSSPGPWTCGGCVGSYGYEKPDAQNYAKWGFDYLKYDWCSY 413
Query: 141 ----------------------NDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMRPA 177
D + I+ Y+ M +ALK+ R I +S+C++G
Sbjct: 414 GNVIDGMPGNDPYKVSSLSYKGGDQLQTAIKPYQLMGEALKQQPRDIVYSLCQYGMSDVW 473
Query: 178 LWGSNIRNS-WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG----- 231
WG ++ + WRTT+DI+D+WA + +IA + A+ AKPG W+DPDML VG G
Sbjct: 474 KWGDSVGGTCWRTTNDITDTWASVKSIALAQDKTAEGAKPGNWSDPDMLVVGTVGWGNPH 533
Query: 232 ---MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------- 278
++ +E +HFS+W++ APLL+GCD+ L TM ++ N+EVIA++Q
Sbjct: 534 PSKLRPDEQYLHFSLWSLFAAPLLIGCDMEKLDDFTMNLLTNDEVIAIDQDPLGKQATCV 593
Query: 279 -AINDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE-----HRTLE 331
I D +I+ L G+R V +++ +D +GIK + + + +
Sbjct: 594 HTIGDLRIYVKELEDGSRAVGFCNFGLNITNISFHDFDKLGIKGRYNVRDVWRQKNVMVM 653
Query: 332 TKFQGKLTAKVDGHSCKMYELTPV 355
+ KL +V H +Y+ T V
Sbjct: 654 DSRKDKLPLRVPAHGVLLYKFTAV 677
>gi|392589367|gb|EIW78698.1| glycoside hydrolase family 27 protein [Coniophora puteana
RWD-64-598 SS2]
Length = 513
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 138/299 (46%), Gaps = 74/299 (24%)
Query: 28 PPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCS 68
P MG+N+WN ++C I+E + E DDC+ RD++
Sbjct: 97 PVMGYNTWNAYYCDISEARVLEAAYKMMDYGLTDVGYNYVNIDDCYSLKERDAK------ 150
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------I 111
GN+V + FP G + L + +H
Sbjct: 151 --------------------GNIVEDPDKFPHGMRTLTNKIHDLGLKTGIYSDSGWFTCQ 190
Query: 112 LISNSLDNRIESNIAF--EQGIDYLKYDNC---FNDDTRPTIRYR------AMSDALKKA 160
L S N F E G DYLKYDNC F++ TR I+ R A+++ +
Sbjct: 191 LYPGSFGNEDRDAQLFHEEWGFDYLKYDNCAVPFDNVTRENIKGRFQRMGRAINELAAQT 250
Query: 161 GR-PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGW 219
G+ P+ S+CEWG + +WG SWRTTDDI +W + +I + N + G
Sbjct: 251 GKEPMVLSLCEWGREQGWVWGRRYGQSWRTTDDIGANWPSVASIINQNSFITWASDFYGH 310
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
ND DMLE+GN G+ Y E HF+ WA+ K+PL +G ++ N+ +++ I+ NEE+IA+NQ
Sbjct: 311 NDMDMLEIGNAGLNYEEQKTHFTAWALMKSPLFIGTNLANIPNDSLAILKNEEIIAINQ 369
>gi|383115655|ref|ZP_09936410.1| hypothetical protein BSGG_2478 [Bacteroides sp. D2]
gi|423293779|ref|ZP_17271906.1| hypothetical protein HMPREF1070_00571 [Bacteroides ovatus
CL03T12C18]
gi|382948212|gb|EFS31778.2| hypothetical protein BSGG_2478 [Bacteroides sp. D2]
gi|392677737|gb|EIY71153.1| hypothetical protein HMPREF1070_00571 [Bacteroides ovatus
CL03T12C18]
Length = 650
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 122/419 (29%), Positives = 177/419 (42%), Gaps = 80/419 (19%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V SA + G ++L + + LTPPMGWNSWN F I++K IR
Sbjct: 246 YTVHISAKNQYGETTKDLTLEIGEKISLTPPMGWNSWNIFGADIDDKKIRRM-------- 297
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------- 111
D + L + + + + N + FP K L DYVH
Sbjct: 298 ADRMVELGLVNYGYAYINIDDGWQGVRGGKYNAIMPNDKFPD-MKGLVDYVHSKGLKIGI 356
Query: 112 -----------LISNSLDNR----IESNIAF--------------EQGIDYLKYDNCFND 142
I +S D R + S+ + E G DY+KYD ND
Sbjct: 357 YSSPWVQTFAGYIGSSADTRNGKVVNSSRRYGEFSFAKNDVKQWAEWGFDYIKYDWVTND 416
Query: 143 DTRPTIRYRA-MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARML 201
I + A +S L+++GR I +S+ A SN+ N WRTT DI DSW M
Sbjct: 417 -----IAHTAELSYLLRQSGRDILYSISNAAPFELAEDWSNLTNVWRTTGDIYDSWCSMT 471
Query: 202 TIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---------EYVVHFSIWAISKAPLL 252
TI + + + AKPG WNDPDML VG G N E H ++W+I APLL
Sbjct: 472 TIGFLQDKWQPFAKPGSWNDPDMLIVGKVGWGKNIHSTHLSPDEQYTHITLWSILAAPLL 531
Query: 253 LGCDVGNLTAETMPIIGNEEVIAVNQAI-------------NDQQIWAGPLSGNRIVVLL 299
+GCD+ + TM ++ N EVIA+NQ I + ++W+ PL I V L
Sbjct: 532 IGCDLEQMDDFTMNLLSNREVIAINQDIAGIQGSRVYADNNKEIEVWSKPLKDGSIAVGL 591
Query: 300 ESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+ ++ WD + I+ L+E + + + K + V H ++ PV
Sbjct: 592 FNLSDNKQDISIFWDQLNIQGKQKVRNLWEQKD-KGVYINKYQSDVPSHGVLFIKIEPV 649
>gi|342880562|gb|EGU81647.1| hypothetical protein FOXB_07844 [Fusarium oxysporum Fo5176]
Length = 540
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 155/322 (48%), Gaps = 62/322 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+NH++C+ +E II+ D + S +++ C + D
Sbjct: 22 TPPMGWNSYNHYNCRPSEDIIKINAKGLVDLGFKDLGYSIVTVDCGWPSRD--------- 72
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIE 122
+G L N+T FPSG KAL +Y+H L I SLD I+
Sbjct: 73 --RDSKGRLQWNETLFPSGPKALGEYIHDLGLEFGLYSGAGYLQCGSTDIPASLDYEEID 130
Query: 123 SNIAFEQGIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ E G D LKYDNC+ + + + R+ M+ A+ + R I + +C+W
Sbjct: 131 AKSFAEWGGDTLKYDNCYATSKTNMVDADSAEAKSPNRFIKMAAAINETDRDIKYFLCQW 190
Query: 172 GDMRPA-LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
G W + + NSWR ++DI ++W + I + +A + PG + D DML +G G
Sbjct: 191 GIGEDVPQWAAPLGNSWRMSNDIFNAWRAIWRITNQVVAHAKYNGPGAFADMDMLIIGLG 250
Query: 231 GMKYNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQ---------- 278
+ ++E HF W++ K+PL++G D + AE++ I+ N+EVIA+NQ
Sbjct: 251 ALSHDEERFHFGFWSMMKSPLIIGGVMDAKQIPAESLEIMSNKEVIAINQDPLAEAAKLV 310
Query: 279 ---AINDQQIWAGPLSGNRIVV 297
+ IW G LS NR V+
Sbjct: 311 IRYTEEEWDIWVGNLSSNRKVL 332
>gi|169771801|ref|XP_001820370.1| alpha-galactosidase B [Aspergillus oryzae RIB40]
gi|121803523|sp|Q2UJ97.1|AGALB_ASPOR RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|83768229|dbj|BAE58368.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 442
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 166/377 (44%), Gaps = 89/377 (23%)
Query: 22 NGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G P +GWN+WN F C I+ K++ ++ +D+ + D + +
Sbjct: 24 DGVGRLPALGWNTWNAFGCDIDASKVLTAAEETINLGLKDAGYE----YINIDDCWSVKS 79
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN-------------RIE 122
PN + ++ + FP G +A +H L I +S +I+
Sbjct: 80 GRDPNTKR--IIPDSAKFPDGISGVASKIHDLGLKVGIYSSAGTETCAGYPASLGYEKID 137
Query: 123 SNIAFEQGIDYLKYDNC----------------------FNDDTRPTI------------ 148
+ E GIDYLKYDNC F + T P I
Sbjct: 138 AESFAEWGIDYLKYDNCGVPTNWTDTYTHCVPDNSNGSKFPNGTCPDISNPAPTAYDWSS 197
Query: 149 -----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY AM DAL R I +S+CEWG WG+ NSWRTT DI+ W+R++ I
Sbjct: 198 SNTAQRYNAMRDALLGVNRTILYSLCEWGQADVNTWGNGTGNSWRTTGDITPDWSRIVEI 257
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A+ N ++A G+ DPDMLEVGNG + E HF++WA K+PL++G + ++ E
Sbjct: 258 ANENSFLMNYADFWGYPDPDMLEVGNGNLTLEENRAHFALWAAMKSPLIIGTALDSINEE 317
Query: 264 TMPIIGNEEVIAVNQ----------------------AINDQQIWAGPLS--GNRIVVLL 299
+ I+ N+ +++ +Q + + W+GP S G +V++
Sbjct: 318 HLAILKNKPLLSFHQDPVIGRPAYPYKWGYNPDWTFDPAHPAEYWSGPSSTLGGTLVLMF 377
Query: 300 ESRKTFSSMMTAHWDDI 316
S + + TA W +I
Sbjct: 378 NSEDS-AKHRTAVWSEI 393
>gi|238485606|ref|XP_002374041.1| alpha-galactosidase AglB [Aspergillus flavus NRRL3357]
gi|292495606|sp|B8N306.1|AGALB_ASPFN RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|220698920|gb|EED55259.1| alpha-galactosidase AglB [Aspergillus flavus NRRL3357]
gi|391874702|gb|EIT83547.1| alpha-D-galactosidase [Aspergillus oryzae 3.042]
Length = 442
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 166/377 (44%), Gaps = 89/377 (23%)
Query: 22 NGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G P +GWN+WN F C I+ K++ ++ +D+ + D + +
Sbjct: 24 DGVGRLPALGWNTWNAFGCDIDASKVLTAAEETINLGLKDAGYE----YINIDDCWSVKS 79
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN-------------RIE 122
PN + ++ + FP G +A +H L I +S +I+
Sbjct: 80 GRDPNTKR--IIPDSAKFPDGISGVASKIHDLGLKVGIYSSAGTETCAGYPASLGYEKID 137
Query: 123 SNIAFEQGIDYLKYDNC----------------------FNDDTRPTI------------ 148
+ E GIDYLKYDNC F + T P I
Sbjct: 138 AESFAEWGIDYLKYDNCGVPTNWTDTYTHCVPDNSNGSKFPNGTCPDISNPAPTAYDWSS 197
Query: 149 -----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY AM DAL R I +S+CEWG WG+ NSWRTT DI+ W+R++ I
Sbjct: 198 SNTAQRYNAMRDALLGVNRTILYSLCEWGQADVNTWGNGTGNSWRTTGDITPDWSRIVEI 257
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A+ N ++A G+ DPDMLEVGNG + E HF++WA K+PL++G + ++ E
Sbjct: 258 ANENSFLMNYADFWGYPDPDMLEVGNGNLTLEENRAHFALWAAMKSPLIIGTALDSINEE 317
Query: 264 TMPIIGNEEVIAVNQ----------------------AINDQQIWAGPLS--GNRIVVLL 299
+ I+ N+ +++ +Q + + W+GP S G +V++
Sbjct: 318 HLAILKNKPLLSFHQDPVIGRPAYPYKWGYNPDWTFDPAHPAEYWSGPSSTLGGTLVLMF 377
Query: 300 ESRKTFSSMMTAHWDDI 316
S + + TA W +I
Sbjct: 378 NSEDS-AKHRTAVWSEI 393
>gi|310699603|dbj|BAJ23877.1| alpha-D-galactopyranosidase [Fusarium oxysporum]
Length = 409
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 106/428 (24%), Positives = 177/428 (41%), Gaps = 105/428 (24%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET--------- 50
L +V+ + + R +L + +G P +GWN WN C +EK+ T
Sbjct: 8 LGVVALAGLATAVPSRNHLQTREVGKLPALGWNGWNQGQCNAASEKVALATAKTFINLGL 67
Query: 51 ----------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPS 100
DDCW RDS+ GNLV + +P
Sbjct: 68 KDAGYQYVNIDDCWSTKQRDSK--------------------------GNLVPDPAKWPR 101
Query: 101 GTKALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFND 142
G K +AD +H + + + ++N+ G+DY KYDNC+
Sbjct: 102 GIKPVADEIHAMGLKFGLYGDGGAKTCAGYPGSQGHEQQDANLLASWGVDYWKYDNCYTP 161
Query: 143 ------------------DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+RP RY M D L+ GR I +S+C WG WG+ +
Sbjct: 162 CNTGNGADIQTCPNNQAPSSRP--RYEKMRDLLRATGREILYSLCNWGYDEVWTWGAQVG 219
Query: 185 NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIW 244
+ WR + D WA ++ IA+ + PG +ND DM+ + NG + E HF+IW
Sbjct: 220 HMWRMSQDNWGKWADVVRIANQAAPILKYTVPGHYNDLDMMILANGALTPAEERTHFAIW 279
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------------AINDQQI--- 285
I+K+P++LG D+ L ++ + +I N+ ++AVNQ A +I
Sbjct: 280 CITKSPIILGTDMTKLNSDEVKLITNKGLLAVNQDSLSKPAVPFTPPNTPAKGSSEIYPY 339
Query: 286 WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK-LTAKVDG 344
W+GP+S +V ++ S+ ++ + D G+K + K +GK ++A +
Sbjct: 340 WSGPISTGTVVAIVASKGNLNTQLNLK-DVPGLKDMEYSWTELLTGAKGKGKTVSANLQT 398
Query: 345 HSCKMYEL 352
H ++ +
Sbjct: 399 HDVAVFRI 406
>gi|74626383|sp|Q9Y865.1|AGALB_ASPNG RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|5326775|emb|CAB46229.1| alpha-galactosidase [Aspergillus niger]
gi|350635004|gb|EHA23366.1| alpha-galactosidase [Aspergillus niger ATCC 1015]
Length = 443
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 163/375 (43%), Gaps = 86/375 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+GLTP +GWNSWN + C I+ +KI+ ++ +D L + D +
Sbjct: 21 DGVGLTPALGWNSWNAYSCDIDADKIVTAANEVVNLGLKD----LGYEYINIDDCWSV-- 74
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD-------------NRIE 122
K N ++ + FP+G +AD VH L I +S I+
Sbjct: 75 KSGRNTTTKRIIPDPDKFPNGISGVADQVHALGLKLGIYSSAGLTTCAGYPASLGYEEID 134
Query: 123 SNIAFEQGIDYLKYDNC-----FND------------------------DTRP------- 146
+ E GIDYLKYDNC D D P
Sbjct: 135 AQSFAEWGIDYLKYDNCGVPTNLTDQYTYCVPDSTDGSNYPNGTCVNLTDAAPQGYDWAT 194
Query: 147 ---TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY+ M DAL R I +S+C+WG WG+ NSWR + DI+ +W+R+ I
Sbjct: 195 STTAKRYQRMRDALLSVNRTILYSLCDWGQADVNAWGNATGNSWRMSGDITATWSRIAEI 254
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A+ N ++A G+ DPDMLEVGNG + E HF++WA+ KAPL++G + ++
Sbjct: 255 ANENSFLMNYANFWGYPDPDMLEVGNGNLTLPENRAHFALWAMMKAPLIIGTPLDSIDTS 314
Query: 264 TMPIIGNEEVIAVNQAI----------------------NDQQIWAGPLSGNRIVVLLES 301
+ I+ N+ ++ +Q + + W+GP S + VL+ +
Sbjct: 315 HLTILSNKPLLTFHQDAVIGRPAYPYKWGYNPDWTFDPEHPAEYWSGPTSSGEVFVLMLN 374
Query: 302 RKTFSSMMTAHWDDI 316
+ +A W+++
Sbjct: 375 SEGEVKTRSAVWEEV 389
>gi|403413918|emb|CCM00618.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/285 (34%), Positives = 143/285 (50%), Gaps = 38/285 (13%)
Query: 24 LGLTPPMGWNSWNHFHC-QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
+G P MG+NS +HC INE II ET SL + ++ +
Sbjct: 100 VGRLPVMGYNSKVDYHCCDINEGIILETAHL--------MKSLGLADIGYNYINIDGCYS 151
Query: 83 LPNQCQ-GNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESN 124
N+ + G++VA+ FPSG L +H L S N
Sbjct: 152 EKNRSESGDIVADHVRFPSGMHNLTTQIHNLGLKTGIYSDSGWFTCQLYPGSFQNEERDA 211
Query: 125 IAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL----KKAGR-PIFFSMCEWGD 173
F+ G D LKYDNC F++ + I +Y+ M+DA+ K +GR P FS+CEWG+
Sbjct: 212 KLFQDWGFDLLKYDNCAIPFDEILKEGIVGKYKRMADAIADLTKTSGRAPFVFSLCEWGE 271
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
+P LW SWRTT DI +W +++I + N + G ND D+LEVGNG +
Sbjct: 272 EQPWLWARRYGQSWRTTGDIEPNWRSVMSILNQNSFITWASDFYGHNDLDILEVGNGDLT 331
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
Y+E HF+ WA+ K+PLL+ D+ +T ET+ I+ N E+IA+NQ
Sbjct: 332 YDEAKSHFTAWALLKSPLLISTDLPTVTDETLEILSNRELIAINQ 376
>gi|11275342|dbj|BAB18273.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 136/297 (45%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILI--------------SNSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L S SL AF +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHSEGLLFGMYSSAGEYTCAGYSGSLGREEADAAAFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++S+C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYSLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGTPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSK-DRPMNATLEDIFIDS 341
>gi|384252626|gb|EIE26102.1| glycoside hydrolase [Coccomyxa subellipsoidea C-169]
Length = 495
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 171/384 (44%), Gaps = 56/384 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L N L LTP MG+N+WN F I+E++I +T G S + L + D
Sbjct: 23 LGNELALTPTMGFNTWNAFGSDISEQLILQTA---GFMRNMSLVQLGYDLIVLDDGWSTK 79
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ISNSLDNRI 121
+ + G L + FPSG K L++ + + ++ ++
Sbjct: 80 ERGI----DGKLQPDPKKFPSGLKELSNRLAGMGIKLGLFGDAGTRTCGGAAASYGQEKL 135
Query: 122 ESNIAFEQGIDYLKYDNCF---NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
++ GI YLKYDNC+ D R+ M DAL G PI +++ +WG
Sbjct: 136 DAATFASWGISYLKYDNCYAPKKDKESVRGRFAVMRDALNATGHPITYAIDDWGVTNTWT 195
Query: 179 WGSNIRNSWRTTDDISD----SWARMLTIADMNEVYADHAKPGGWNDPDMLEVG---NGG 231
+G+ + NSWRTT ++D +W +L + D + A PGGWN+ D+L VG +
Sbjct: 196 YGTTVANSWRTTAGLTDQLQATWEGILRVLDNSAGLGRFAAPGGWNNLDLLAVGEPVSED 255
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AI 280
+ E HF++WAI K+PL + D+ +T + I+ ++E+IAVNQ
Sbjct: 256 LTVEEMQSHFALWAIVKSPLFISADLRQITKTALDILKSDELIAVNQDPLGVAGDLIWKQ 315
Query: 281 NDQQIWAGPLSGN-RIVVLL-----ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK- 333
+IW LSG R V LL E + +S +T HW +G + + + K
Sbjct: 316 GANEIWGAGLSGGARAVALLNRHFDEDPQFDNSSITLHWHHLGWEGDMAVTVRDLYAKKD 375
Query: 334 ---FQGKLTAKVDGHSCKMYELTP 354
F TA + H +LTP
Sbjct: 376 LGLFTHNFTAVIPYHGVLALKLTP 399
>gi|260812559|ref|XP_002600988.1| hypothetical protein BRAFLDRAFT_232493 [Branchiostoma floridae]
gi|229286278|gb|EEN57000.1| hypothetical protein BRAFLDRAFT_232493 [Branchiostoma floridae]
Length = 410
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 145/307 (47%), Gaps = 56/307 (18%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
R + L NGL TPPMGW +W F C ++ + + +C IS M D L
Sbjct: 18 RSSTLDNGLARTPPMGWLAWERFRCNVD--CVDDPYNC---------ISEQLFMQMADRL 66
Query: 76 TQIAAKPLP------NQC--------QGNLVANKTTFPSGTKALADYVH---ILISNSLD 118
+ K L + C QGNL ANKT FP+G K LADYVH + + LD
Sbjct: 67 AEDGWKELGYNYINIDDCWMSMDRDDQGNLYANKTRFPNGIKKLADYVHSKGLKLGIYLD 126
Query: 119 ------NRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRP 163
+I + Q GID LK D C+ + + + Y M + K GRP
Sbjct: 127 FGTHTCGGYPGSIKYLQKDAETIAGWGIDMLKMDGCYANVSDMSWGYPDMEMYMNKTGRP 186
Query: 164 IFFSMCEW----GDMRPALWGSNIR----NSWRTTDDISDSWARMLTI----ADMNEVYA 211
I FS C W G ++ L ++ N+WR DDI DSW + I D +V A
Sbjct: 187 ILFS-CSWPAYAGGIKVDLVDYSVLAKHCNAWRNYDDIQDSWDSVTHIIDWFGDNQDVLA 245
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
A PG WNDPDML VG+ + Y E ++WAI APL + D+ +++ E I+ N+
Sbjct: 246 PAAGPGHWNDPDMLIVGDYSLSYEESKAQVAMWAILAAPLFMSNDLRSISPEARSILMNK 305
Query: 272 EVIAVNQ 278
EVI ++Q
Sbjct: 306 EVIRISQ 312
>gi|251798243|ref|YP_003012974.1| alpha-galactosidase [Paenibacillus sp. JDR-2]
gi|247545869|gb|ACT02888.1| Alpha-galactosidase [Paenibacillus sp. JDR-2]
Length = 386
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/291 (34%), Positives = 135/291 (46%), Gaps = 46/291 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
LG+TP MGWNSWN F INE++IR+ D D + + + D + +
Sbjct: 5 LGITPAMGWNSWNTFTWDINEQLIRDVAD---RFESDGYQAAGYEYIVIDDCWSLKQRDQ 61
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIA 126
Q NLVA+ FPSG KALADY+H S ++ +
Sbjct: 62 ----QRNLVADPAKFPSGMKALADYIHSKGLKFGMYSCVGTHTCAGFPGSFEHEFQDAQL 117
Query: 127 F-EQGIDYLKYDNCFND-DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
F E G+DYLKYD CF + Y+ MS ALK +GR I S C WG+ W IR
Sbjct: 118 FAEWGVDYLKYDYCFKPRHISGELLYKRMSLALKNSGRDILLSACNWGEDNVYQW---IR 174
Query: 185 NS----WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN----------- 229
S +R+T DI D W + +A + ND DML VG
Sbjct: 175 ESGAHMYRSTGDIQDHWDSIKNLALSQLDKECYTGSFCHNDLDMLVVGMYGGSNNTFIGS 234
Query: 230 --GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
GG NEY HFS+W + +PL++GCD+ ET I+ N ++IA+NQ
Sbjct: 235 KIGGCSDNEYKTHFSLWCLMGSPLMIGCDIRTANPETTNILLNPDLIAINQ 285
>gi|70984631|ref|XP_747822.1| alpha-galactosidase [Aspergillus fumigatus Af293]
gi|66845449|gb|EAL85784.1| alpha-galactosidase, putative [Aspergillus fumigatus Af293]
gi|159122604|gb|EDP47725.1| alpha-galactosidase, putative [Aspergillus fumigatus A1163]
Length = 512
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 139/295 (47%), Gaps = 67/295 (22%)
Query: 89 GNLVANKTTFPSGTKALADYVHIL-----ISNSLDN----RIESNIAFEQ---------G 130
G L+A+ FP+G LAD VH L I +S R E ++ +E+ G
Sbjct: 112 GYLIADSEKFPNGIAHLADKVHELGLKIGIYSSAGRWTCARYEGSLGYEEKDAALWASWG 171
Query: 131 IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
IDYLKYDNC+N+ T RY AM AL GRP+ +S+C WG P + I N
Sbjct: 172 IDYLKYDNCYNEGEEGTPKLSFDRYNAMFKALNATGRPMLYSLCNWGVDGPWNFAPTIAN 231
Query: 186 SWRTTDDISDSWAR-----------------------MLTIADMNEVYADHAKPGGWNDP 222
SWRTT D+S+ W R ++ + + Y A PG WND
Sbjct: 232 SWRTTGDLSNVWDRDDVNCPCSELDGLDCKTPGYKCSIMNVLNKAVYYPSKAIPGAWNDL 291
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DML+VGNGG+ +E + H S+WA K+PLL+ + + T+ I+ N V+AV+Q
Sbjct: 292 DMLQVGNGGLTDDESIAHMSLWAALKSPLLMTNVMTKIDPPTLSILQNPAVLAVSQDPLA 351
Query: 279 -----------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
+ Q+++GPLSG +VLL + + + M A DI
Sbjct: 352 STPVRQWRYFVDDVDENGKGEIQMYSGPLSGGDQLVLLLNAGSKAREMNATLVDI 406
>gi|295444972|ref|NP_001171396.1| alpha-galactosidase A [Sus scrofa]
gi|188998336|gb|ACD67895.1| alpha-galactosidase A [Sus scrofa]
gi|188998339|gb|ACD67896.1| alpha-galactosidase A [Sus scrofa]
Length = 433
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 140/317 (44%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L+NGL TP MGW W F C I+EK+ R+
Sbjct: 36 LNNGLARTPTMGWLHWERFMCNVDCQEDPDSCISEKLFRQIAELMASDGWKDAGYEYLCI 95
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDSQ G L A+ FP G + LADYVH
Sbjct: 96 DDCWMAPKRDSQ--------------------------GRLQADPKRFPGGIRRLADYVH 129
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ E G+D LK+D C+ D + Y+
Sbjct: 130 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAKTFAEWGVDLLKFDGCYCDSIKHLADGYK 189
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW + N IR N WR DI DSW + I D
Sbjct: 190 QMSLALNRTGRSIVYS-CEWPLYMQPFFKPNYTEIREYCNHWRNFADIYDSWQSIKNILD 248
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E A PGGWNDPDML +GN G+ +N+ V ++WAI APLL+ D+ +++
Sbjct: 249 WTSSNQEKIVPVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLLMSNDLRHIS 308
Query: 262 AETMPIIGNEEVIAVNQ 278
E ++ +++VIA+NQ
Sbjct: 309 PEAKALLQDKDVIAINQ 325
>gi|407918743|gb|EKG12009.1| Glycoside hydrolase clan GH-D [Macrophomina phaseolina MS6]
Length = 878
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 175/391 (44%), Gaps = 73/391 (18%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P +GWNS+N + C NE II+E + + L + + +T N+
Sbjct: 30 PQLGWNSYNAYACSPNETIIKE--------NAKGLVDLGFAEKGYTIVTTDCGWAASNRT 81
Query: 88 -QGNLVANKTTFPSGTKALADYVH-----------------------ILISNSLDNRIES 123
+G + N T FPSG AL DY+H I + SL + E
Sbjct: 82 TEGRITWNSTLFPSGFPALGDYIHDLGLLFGLYSGAGQWQCTTDSEHIFLVASLGHETED 141
Query: 124 NIAF-EQGIDYLKYDNCFNDDTR-----------PTIRYRAMSDALKKAGRPIFFSMCEW 171
F E G D LKYDNC+ + T P++R+ MS AL +PI + +C+W
Sbjct: 142 AQTFAEWGGDALKYDNCWANVTEGFVVYDPYEPDPSVRFAKMSQALDAVDKPILYQICQW 201
Query: 172 G---DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
G D+ W NSWR ++DI +SW+ + I + Y + PG + D DML +G
Sbjct: 202 GVGEDL--GEWAPRYGNSWRISNDIYNSWSSIWRITNEVVPYWKYTGPGRYPDMDMLIIG 259
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET----MPIIGNEEVIAVNQAINDQQ 284
+ E HF++WAI+K+PL +G + TA T + I+ N+E +A+NQ +Q
Sbjct: 260 LNALSLEEERFHFTMWAINKSPLTIGAPMS--TARTPQASLDIMLNDEALAINQDPLGEQ 317
Query: 285 -------------IWAGPLSGNRIVVLLESRKTFSSMMTAHW-DDIGIKSNNTLFEHRTL 330
IWAG LS +R+VV + + + S + +GI S + +
Sbjct: 318 ARLIRRYTEEEYDIWAGNLSASRLVVAVPNWRNASRTINLDLASTLGIASAGAVRDVWAA 377
Query: 331 E----TKFQGKLTAKVDGHSCKMYELTPVTK 357
+ T LT + GH K+ L+ +T+
Sbjct: 378 QDLGPTPSNSTLTLTLAGHEAKLLVLSSITR 408
>gi|242791515|ref|XP_002481773.1| alpha-galactosidase [Talaromyces stipitatus ATCC 10500]
gi|218718361|gb|EED17781.1| alpha-galactosidase [Talaromyces stipitatus ATCC 10500]
Length = 439
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 182/424 (42%), Gaps = 97/424 (22%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
+L +G+G P +GWNSWN + C ++E KI+ + +D L + D
Sbjct: 21 VLPDGVGRLPALGWNSWNAYGCDVDETKIVTAANKLNSTGLKD----LGYQYVNIDDCWS 76
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------------ 120
+ K + ++ N TFPSG A VH I I +S +
Sbjct: 77 V--KSGRDSVTNRIIPNPDTFPSGINGTAQQVHDLGLKIGIYSSAGYQTCAGYPASLGYE 134
Query: 121 -IESNIAFEQGIDYLKYDNC---------FN------------------DDTRP------ 146
I++ + GIDYLKYDNC +N D T P
Sbjct: 135 TIDAQTFADWGIDYLKYDNCNYPSEWDDQYNACIPDSDYPGVNPNGTCPDLTNPAPAGYD 194
Query: 147 ------TIRYRAMSDALKKAG--RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA 198
T R+ AM DAL R I +S+CEWG+ WG+ NSWR T DI+ +W
Sbjct: 195 WSMSNTTKRFNAMRDALASVQDQRVILYSLCEWGNADVPSWGNGTGNSWRVTGDINATWD 254
Query: 199 RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
R+ IA+MN G NDPDMLEVGNG + E HF++WAI K+PL++G D+
Sbjct: 255 RITAIANMNAHELSSVDFWGHNDPDMLEVGNGNLTIEENRAHFALWAIMKSPLIIGTDLS 314
Query: 259 NLTAETMPIIGNEEVIAVNQ-------------AIND--------QQIWAGPLSGNRIVV 297
+ + I+ N ++IA NQ N+ + W+G S +V
Sbjct: 315 TIPDTHLFILKNADLIAFNQDPIIGKPALPYKDGYNNGTYNPHYPPEYWSGATSHGWNLV 374
Query: 298 LLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK--------LTAKVDGHSCKM 349
LL + + ++ TA W +I S + +E + GK TA+++ H +
Sbjct: 375 LLFNSENVAASRTAVWSEIPQLSGHD--SKYRVEDVWTGKDLGCVHKEYTAQLEAHDVAV 432
Query: 350 YELT 353
++T
Sbjct: 433 LKVT 436
>gi|167537676|ref|XP_001750506.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771046|gb|EDQ84720.1| predicted protein [Monosiga brevicollis MX1]
Length = 403
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 158/342 (46%), Gaps = 78/342 (22%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ--------------INEKIIRETDDCWG----ESSRDS 61
L NGLGLTPPMG+N WN + + NE +I+ D + + +
Sbjct: 26 LDNGLGLTPPMGFNPWNCYGIESTGLPKLPGSKIVGFNETVIKAVADAFVTLGLKEAGYE 85
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------ILISN 115
++L C + QG LV N + +PSG KAL DY+H L S+
Sbjct: 86 YVNLDCGYSTGF-----------RDAQGQLVVNTSLYPSGLKALGDYIHDKGLKFGLYSD 134
Query: 116 -----------------SLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDAL 157
SL + F GIDYLK+DNC + P+ Y M DAL
Sbjct: 135 AGAVQCCSRIWPGANDGSLGHEAADAAWFASLGIDYLKHDNC---NPGPS-SYPDMRDAL 190
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG 217
K GRPI++S+ + ++ N WRTT DI+ +W A N Y A+PG
Sbjct: 191 NKTGRPIYYSI------HGPIANDSLANCWRTTTDINPNWDSFYQRAHENNQYIAAARPG 244
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD-VGNLTAET-MPIIGNEEVIA 275
+ND DMLEVGN + E HFS+W + KAPLL+G D V N + + + I+ N++ IA
Sbjct: 245 AFNDADMLEVGNAPLTMAENRAHFSLWCLMKAPLLIGTDLVANPPSRSVLEILTNKDAIA 304
Query: 276 VNQAINDQQ------------IWAGPLS-GNRIVVLLESRKT 304
+NQ QQ +W+G L+ G+ ++L + +
Sbjct: 305 INQDPLGQQGRFQNATADGCEVWSGALADGDTAAIILNVQNS 346
>gi|432877055|ref|XP_004073084.1| PREDICTED: alpha-galactosidase A-like [Oryzias latipes]
Length = 427
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 152/357 (42%), Gaps = 101/357 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC------------------QINEKIIRE-----------T 50
L NGL L P MGW W F C Q+ + +++E
Sbjct: 21 LDNGLALKPTMGWLHWERFTCNTDCDTDPRNCISEQLYMQMADVMVKEGWKEAGYEYVCI 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD+ +G L A+ FP G K LADY+H
Sbjct: 81 DDCWPSHKRDA--------------------------KGRLQADPNRFPGGIKKLADYIH 114
Query: 111 ILISNSLDNRIESNIA-------------FEQ--------GIDYLKYDNCFNDDTRPTIR 149
S L I +++ +E +D LK+D CF + T
Sbjct: 115 ---SKGLKLGIYADVGKNTCAGYPGSLGYYETDAQTFADWDVDLLKFDGCFMERTLIAEG 171
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTI 203
Y MS AL K GR I +S CEW A+ N IR N WR +DI DSW+ + TI
Sbjct: 172 YMNMSKALNKTGRSILYS-CEWPLYEWAIKQPNYTAIREYCNHWRNFNDIFDSWSSLKTI 230
Query: 204 ----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
A ++ A PGGWNDPDML +GN G+ +++ ++WAI APLL+ D+ +
Sbjct: 231 LEWTASNQKIIVPAAGPGGWNDPDMLVIGNFGLSHDQQQSQMALWAIMAAPLLMSNDLRD 290
Query: 260 LTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTF 305
+ + ++ N +IA+NQ QQ +W PL GNR+ + + +R+
Sbjct: 291 ICPRSKELLQNRMIIAINQDPLGQQGYHTARLMDFDVWERPLFGNRLAIAIMNRQEL 347
>gi|423223035|ref|ZP_17209504.1| hypothetical protein HMPREF1062_01690 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392640304|gb|EIY34106.1| hypothetical protein HMPREF1062_01690 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 673
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/430 (27%), Positives = 195/430 (45%), Gaps = 79/430 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G+ R L + + LTPPMGWNSWN F +++ + +++ ES
Sbjct: 246 FEVVLQAKNVKGTAERKLRIECGDRIALTPPMGWNSWNCFGHEVSAEKVKQAARAMVESG 305
Query: 59 RD----SQISLSCSF-HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL- 112
+ I++ S+ H D + L + QGN++ N FP K L DY+H L
Sbjct: 306 LVNYGWTYINIDDSWQHHRDPNDRTRGGRLRDD-QGNIIPN-AQFPD-MKGLTDYIHSLG 362
Query: 113 -----------------ISNSLDNRIESNIAFEQGIDYLKYDNCF--------------- 140
+ + + ++++ E G+DYLKYD C
Sbjct: 363 LKVGIYSSPGPWTCGGCVGSYGYEKQDADMYGEWGLDYLKYDWCSYGGVLDRDLDKDPYS 422
Query: 141 ---------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTT 190
D ++ M D L++ R I +++C++G WG + WRTT
Sbjct: 423 VSSLAFQGGGDSIAGRKPFKIMGDYLRQQPRDIVYNLCQYGMGDVWRWGDAVGGQCWRTT 482
Query: 191 DDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVHFS 242
+DI+D+W + IA + A AKPG WNDPDML +G G +K +E +HFS
Sbjct: 483 NDITDTWESVKGIALSQDRAAAWAKPGNWNDPDMLVLGIVGWGNPHQTKLKPDEQYLHFS 542
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
+W++ APLL+GCD+ + T+ ++ N EVIAVNQ +I + +I+ L
Sbjct: 543 LWSLFSAPLLIGCDLEKMDDFTLSLLTNNEVIAVNQDPLGKQATCVYSIGELRIYVKELE 602
Query: 292 -GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGK--LTAKVDGH 345
G++ V + + + ++D + I T L+ + + T G+ L+ V H
Sbjct: 603 DGSKAVGFCNFDREKADISFRNFDKLNITGKQTVRDLWRQKDIRTLDTGRKPLSLNVPAH 662
Query: 346 SCKMYELTPV 355
+Y+ TPV
Sbjct: 663 GVLLYKFTPV 672
>gi|300789211|ref|YP_003769502.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384152702|ref|YP_005535518.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399541091|ref|YP_006553753.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299798725|gb|ADJ49100.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340530856|gb|AEK46061.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398321861|gb|AFO80808.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 542
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/407 (28%), Positives = 171/407 (42%), Gaps = 103/407 (25%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
+TPP+GWNSWN F C ++E I + DDCW + RD+Q
Sbjct: 39 VTPPLGWNSWNSFGCNVSESTIHQAADAMVSSGMRDAGYQYVVVDDCWFDPQRDAQ---- 94
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL N + FP G KAL DY+H
Sbjct: 95 ----------------------GNLRGNASKFPGGMKALGDYIHARGLKFGIYQVPTDRT 132
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C TR R+ M DAL+
Sbjct: 133 CAQRGGAYPGSTGSQGHETQDARTFASWGVDYLKYDWCSPAGTRDEQVSRFGLMRDALRG 192
Query: 160 AGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT---------IA 204
GRPI +S+ GD WG + + WRTT+D+ D W T +
Sbjct: 193 TGRPIVYSINPNSYHAITGDKYD--WG-QVADLWRTTEDLLDIWQNGNTNSYPMGVGNVL 249
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
D+ A A PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 250 DVTAPLAAQAGPGHWNDPDMLVVGRPGLSLTESRAHFTLWALMAAPLMAGNDIRTMSADI 309
Query: 265 MPIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
++ N ++AV+Q D +++A PL+ + V L +R ++ ++
Sbjct: 310 SAVLRNPRLLAVDQDRLGAGGRRVRDDGDVEVFAKPLADGSVAVGLLNRGGTTTAISTTA 369
Query: 314 DDIGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELTPVTKL 358
IG+ + TL + T T G ++A V H Y ++ T L
Sbjct: 370 AQIGLSGTSFTLTDLWTGGTSASSGAISASVPAHGAAAYRVSGGTPL 416
>gi|426196092|gb|EKV46021.1| hypothetical protein AGABI2DRAFT_144304 [Agaricus bisporus var.
bisporus H97]
Length = 536
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 154/365 (42%), Gaps = 85/365 (23%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCW---GESSRDSQ-ISLSCSFHMFDSLTQIAAKPLPN 85
MGWN +N F C E+ R G S Q ++L C + Q A+
Sbjct: 1 MGWNPYNAFLCSTTEQQYRTAAQILIDLGLSEVGYQFVNLDCGW-------QGKAR---- 49
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL-----------------------ISNSLDNRIE 122
G + T FPSG AL+ +VH L SL +
Sbjct: 50 NASGGFTWDTTAFPSGIPALSSFVHNLGLKFGVYSDGGFFSCDFVGGTAHYIGSLGHETS 109
Query: 123 SNIAFEQ-GIDYLKYDNCFNDD---TRPTIRYRA-------------------------M 153
+F G DYLKYDNC++ T P ++R+ M
Sbjct: 110 DANSFAAWGADYLKYDNCYSGTVSYTPPNFKFRSPLHAVSPTDFVNFNPPIQIEPHYVTM 169
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALW-GSNIRNSWRTT---DDISD--SWARMLTIADMN 207
DAL RP+ FS+CEWG PA W S + +SWR + +DI SW + I +
Sbjct: 170 RDALAATNRPVIFSICEWGVQDPARWPASAVGHSWRISCCSNDIGPPASWDNLFRIINQV 229
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
A PGGWND DMLEVGN G+ E HF+ WA K+PLL+ D+ + ++ I
Sbjct: 230 VPLTQFAGPGGWNDLDMLEVGNSGLTTVEQQTHFAFWAAVKSPLLISTDLTRIATNSLNI 289
Query: 268 IGNEEVIAVNQ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
+ N +IA+NQ ND +W+GPL+ V ++ + + +T + D
Sbjct: 290 LKNNRIIALNQDRLGKSISFKRRYTNDHDVWSGPLANGAFVAVVINWQNTRRSLTFNLAD 349
Query: 316 IGIKS 320
+G+ S
Sbjct: 350 VGLSS 354
>gi|224537196|ref|ZP_03677735.1| hypothetical protein BACCELL_02073 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521251|gb|EEF90356.1| hypothetical protein BACCELL_02073 [Bacteroides cellulosilyticus
DSM 14838]
Length = 677
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/430 (27%), Positives = 195/430 (45%), Gaps = 79/430 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G+ R L + + LTPPMGWNSWN F +++ + +++ ES
Sbjct: 250 FEVVLQAKNVKGTAERKLRIECGDRIALTPPMGWNSWNCFGHEVSAEKVKQAARAMIESG 309
Query: 59 RD----SQISLSCSF-HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL- 112
+ I++ S+ H D + L + QGN++ N FP K L DY+H L
Sbjct: 310 LVNYGWTYINIDDSWQHHRDPNDRTRGGRLRDD-QGNIIPN-AQFPD-MKGLTDYIHSLG 366
Query: 113 -----------------ISNSLDNRIESNIAFEQGIDYLKYDNCF--------------- 140
+ + + ++++ E G+DYLKYD C
Sbjct: 367 LKVGIYSSPGPWTCGGCVGSYGYEKQDADMYGEWGLDYLKYDWCSYGGVLDRDLDKDPYS 426
Query: 141 ---------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTT 190
D ++ M D L++ R I +++C++G WG + WRTT
Sbjct: 427 VSSLAFQGGGDSIAGRKPFKIMGDYLRQQPRDIVYNLCQYGMGDVWKWGDAVGGQCWRTT 486
Query: 191 DDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVHFS 242
+DI+D+W + IA + A AKPG WNDPDML +G G +K +E +HFS
Sbjct: 487 NDITDTWESVKGIALSQDRAAAWAKPGNWNDPDMLVLGIVGWGNPHQTKLKPDEQYLHFS 546
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
+W++ APLL+GCD+ + T+ ++ N EVIAVNQ +I + +I+ L
Sbjct: 547 LWSLFSAPLLIGCDLEKMDDFTLSLLTNNEVIAVNQDPLGKQATCVYSIGELRIYVKELE 606
Query: 292 -GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGK--LTAKVDGH 345
G++ V + + + ++D + I T L+ + + T G+ L+ V H
Sbjct: 607 DGSKAVGFCNFDREKADISFRNFDKLNITGKQTVRDLWRQKDIRTLDTGRKPLSLNVPAH 666
Query: 346 SCKMYELTPV 355
+Y+ TPV
Sbjct: 667 GVLLYKFTPV 676
>gi|408397831|gb|EKJ76969.1| hypothetical protein FPSE_02844 [Fusarium pseudograminearum CS3096]
Length = 567
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 173/384 (45%), Gaps = 66/384 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+NH++C +E+II+ D S +++ C + D
Sbjct: 49 TPPMGWNSYNHYNCNPSEEIIKINAKGLVDLGFLDLGYSIVTVDCGWPSRD--------- 99
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIE 122
QG L N+T FPSG KAL DY+H L I SL I+
Sbjct: 100 --RDSQGRLQWNETLFPSGPKALGDYIHDLGLEFGLYSGAGYLQCGSQDIPASLGYEEID 157
Query: 123 SNIAFEQGIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ E G D LKYDNC+ + + + R+ M+ A+ + R I + +C+W
Sbjct: 158 AKSFAEWGGDTLKYDNCYATSKTDMVDSDSAEAKSPDRFIKMAAAINETDRDIKYFLCQW 217
Query: 172 GDMRPA-LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
G W + + NSWR ++DI ++W + I + ++ + PG + D DML +G G
Sbjct: 218 GIGEDVPQWATKVGNSWRMSNDIFNAWRAIWRITNQAVAHSKYNHPGAFADLDMLIIGLG 277
Query: 231 GMKYNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQAINDQ----- 283
+ Y E HF W++ K+PL++G D + + ++ ++ N+E IA+NQ Q
Sbjct: 278 ALSYEEERFHFGFWSMMKSPLIIGGVMDAKQIPSHSLEVMSNKEAIAINQDALGQAAELV 337
Query: 284 --------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETK 333
+W+G L+ NR V+ + + K + + IG+ ++ H L
Sbjct: 338 IRYTEEQWDVWSGNLTSNRKVLGVANWKNETQTVEVDLSLIGVGKAKARDVWAHEDLTIS 397
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
K K H ++ L+ +T+
Sbjct: 398 GTQKFVLKP--HELRLLVLSDITQ 419
>gi|348515581|ref|XP_003445318.1| PREDICTED: alpha-galactosidase A-like [Oreochromis niloticus]
Length = 430
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 114/403 (28%), Positives = 177/403 (43%), Gaps = 82/403 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL L P MGW W F C I DC ++ ++ IS M D + +
Sbjct: 21 LDNGLALKPTMGWLHWERFMCNI---------DC--DTDPNNCISERLYMQMADVMVKEG 69
Query: 80 AKPLPNQ--------------CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNI 125
K + QG L A+ FP G K LADY+H S L I +++
Sbjct: 70 WKEAGYEYVCIDDCWPSHQRDAQGRLQADPKRFPGGIKKLADYIH---SKGLKLGIYADV 126
Query: 126 A-------------FEQG--------IDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPI 164
+E+ +D LK+D CF + Y MS AL GRPI
Sbjct: 127 GDKTCAGYPGSLGYYEKDAQTFADWDVDLLKFDGCFMNRALLGEGYINMSKALNNTGRPI 186
Query: 165 FFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADMNEVYAD----HA 214
+S CEW L N IR N WR DD+ DSW+ + +I D + D A
Sbjct: 187 LYS-CEWPLYEWPLKQPNYTAIRETCNHWRNFDDVFDSWSSVKSILDWTAAHQDIIVPAA 245
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PGGWNDPDML +GN G+ +++ ++WAI +PLL+ D+ ++ + ++ N+ +I
Sbjct: 246 GPGGWNDPDMLVIGNFGLSHDQQESQMALWAIMASPLLMSNDLRDICPRSKELLQNKRII 305
Query: 275 AVNQ-----------AINDQQIWAGPLSGNRIVVLLESR------KTFSSMMTAHWDDIG 317
++Q ND ++W PLS NR+ + + ++ + F+ W G
Sbjct: 306 DISQDSMGKQGYRTAKGNDFEVWEKPLSKNRLAIAVLNKLETGGPRGFNIGAAPGWKICG 365
Query: 318 IKSNNT----LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
+ N T ++ ++T+ Q K+ V+ + +TP++
Sbjct: 366 PQCNVTQILPQYKEMGVQTQ-QSKIIVSVNPSGTALLTVTPIS 407
>gi|399026408|ref|ZP_10728217.1| alpha-galactosidase [Flavobacterium sp. CF136]
gi|398076118|gb|EJL67204.1| alpha-galactosidase [Flavobacterium sp. CF136]
Length = 672
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 125/444 (28%), Positives = 192/444 (43%), Gaps = 107/444 (24%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET------- 50
+++ A ++ G ++L + + LTPPMGWNSWN F +++ EKI R
Sbjct: 245 YVIQLKAKNAKGIAEKSLRIICGDKIALTPPMGWNSWNCFAHEVSGEKIKRAAEVMVKTG 304
Query: 51 -----------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFP 99
DD W +++RDS+ + L A +PN+ G++
Sbjct: 305 LINYGWSYVNIDDYW-QNNRDSK-----DPSLQGKLRDEAGNIVPNKKFGDM-------- 350
Query: 100 SGTKALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF- 140
KAL DYVH L + + R ++ + G DYLKYD C
Sbjct: 351 ---KALTDYVHGLGLKIGIYSSPGPWTCGGCVGSYGHERQDAETYAKWGFDYLKYDWCSY 407
Query: 141 --------NDDTRPTIR---------------YRAMSDALKKAGRPIFFSMCEWGDMRPA 177
++D I Y+ M + +++ R I FS+C++G
Sbjct: 408 GGAVNGIPDNDPNKVISISYNGGYQLSTAVKPYKLMGEYIRQQPRDIVFSLCQYGMSDVW 467
Query: 178 LWGSNI-RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG----- 231
WG ++ NSWRTT+DI D+W+ + IA + A AKPG WND DML VG G
Sbjct: 468 KWGDSVGGNSWRTTNDILDTWSNVKVIALAQDQTAAWAKPGNWNDADMLVVGTVGWGSLH 527
Query: 232 ---MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN----------- 277
+K +E +H S+W++ PLL+GCD+ L A T+ ++ N+EVIAVN
Sbjct: 528 PSKLKPDEQYLHVSLWSLFSTPLLIGCDMEKLDAFTLNLLTNDEVIAVNQDALGKQATCV 587
Query: 278 QAINDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETK 333
Q I D +I+ L G+R+ + +GI L+ + + T
Sbjct: 588 QTIGDLRIYVKELEDGSRVAGFCNFGLEKVDIPYKDLKKLGISGKQKVRDLWRQKDIATI 647
Query: 334 FQGK--LTAKVDGHSCKMYELTPV 355
K + KV H +Y+ TPV
Sbjct: 648 HADKEAFSVKVPMHGVALYKFTPV 671
>gi|345885382|ref|ZP_08836759.1| hypothetical protein HMPREF0666_02935 [Prevotella sp. C561]
gi|345045306|gb|EGW49238.1| hypothetical protein HMPREF0666_02935 [Prevotella sp. C561]
Length = 409
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 98/301 (32%), Positives = 140/301 (46%), Gaps = 55/301 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + INE++IR+ + + + D + D Q
Sbjct: 31 DSLALTPPMGFMTWNKYKEDINEQLIRQIAN---KMAADGYAEAGYKYIFIDDAWQGG-- 85
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ 129
+ N++ + FPSG KALADYVH S L I S+ A FEQ
Sbjct: 86 ---RDKRNNILPDPEKFPSGMKALADYVH---SKGLRLGIYSDAALLTCAGYTASYGFEQ 139
Query: 130 ---------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
GIDYLKYD C +D RY+ M+DAL+ +GR I +CEWG + P
Sbjct: 140 QDAKTFAEWGIDYLKYDYCHAPSDSAVAHERYKKMADALQNSGRKIVLGVCEWGQLNPEK 199
Query: 179 WGSNIRNS-WRTTDDISDSWARMLT--------IADMNEVYADHAKPGGWNDPDML---- 225
W S WR + D+ D W ++ I ++ E +A PG W D DML
Sbjct: 200 WARQAGGSLWRVSFDVRDMWKDIVKQGGMGIIDIINVTEPLYKYAGPGHWLDMDMLVVGL 259
Query: 226 --------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
++G G Y EY S+W + +PL + D+ + ET I+ N+E+IA+N
Sbjct: 260 DGKGGPSSDLGGIGCTYTEYQTQMSMWCMFASPLAMSHDILHENEETRRILLNKEIIAIN 319
Query: 278 Q 278
Q
Sbjct: 320 Q 320
>gi|260593413|ref|ZP_05858871.1| alpha-galactosidase [Prevotella veroralis F0319]
gi|260534689|gb|EEX17306.1| alpha-galactosidase [Prevotella veroralis F0319]
Length = 371
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 171/384 (44%), Gaps = 75/384 (19%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
MG+ +WN + INE++IR+ D + D + D Q +
Sbjct: 1 MGFMTWNKYGEDINEQLIRQIAD---KMVADGYADAGYQYIFIDDAWQGGRDK-----RN 52
Query: 90 NLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------FEQ-------- 129
N++ + FPSG KALADYVH S L I S+ A FE+
Sbjct: 53 NIIPDPVKFPSGMKALADYVH---SRGLKLGIYSDAAPLTCAGYTASYNFEEQDAKTFAE 109
Query: 130 -GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS 186
G+DYLKYD C +D RY+ M DAL+K+GR I +CEWG + P LW S
Sbjct: 110 WGMDYLKYDYCHAPSDSAVAHERYKRMGDALEKSGRKIALGVCEWGQLNPELWARQAGGS 169
Query: 187 -WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDML------------ 225
WR + D+ D W +L I D+ E A PG WND DML
Sbjct: 170 LWRISYDVRDMWKDIVKQGGMGILDIIDITEPLYSFAGPGHWNDMDMLVVGLEGKGGPSS 229
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN-------- 277
++G G Y EY S+W + +PL + D+ N A T I+ N+E+IA+N
Sbjct: 230 DLGGIGCTYTEYQTQMSMWCMFASPLAMSHDILNENAATRQILLNKEIIAINQDALGKAA 289
Query: 278 ---QAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLE 331
Q I + +++ L+ NR + + + + ++ +G + ++EH+ +
Sbjct: 290 RLVQRIGECRVYHRQLTNNRQAIAIMNPSDKAQRISLPLSFLGKSAKYDFRDVWEHKNIS 349
Query: 332 TK--FQGKLTAKVDGHSCKMYELT 353
K ++GKL A H K++ +T
Sbjct: 350 VKKAWEGKLEA----HETKVFIIT 369
>gi|427384925|ref|ZP_18881430.1| hypothetical protein HMPREF9447_02463 [Bacteroides oleiciplenus YIT
12058]
gi|425728186|gb|EKU91045.1| hypothetical protein HMPREF9447_02463 [Bacteroides oleiciplenus YIT
12058]
Length = 675
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 118/432 (27%), Positives = 192/432 (44%), Gaps = 83/432 (19%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V+ A + G + + + LTPPMGWNSWN F +++ + +++ ES
Sbjct: 248 FEVTLQAKNDKGMAEKKFRIECGDKIALTPPMGWNSWNCFGHEVSAEKVKQAARAMVESG 307
Query: 59 RDSQISLSCSFHMFDSLTQIAAKP-------LPNQCQGNLVANKTTFPSGTKALADYVHI 111
I+ ++ D Q P +GN++ N FP K LADY+H
Sbjct: 308 L---INYGWTYINIDDSWQHHRDPNDRTRSGKFRDARGNILTN-AQFPD-MKGLADYIHS 362
Query: 112 L------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF------------- 140
L + + + ++++ + GIDYLKYD C
Sbjct: 363 LGMRVGIYSSPGPWTCGGCVGSYGYEKQDADMYAKWGIDYLKYDWCSYGGVLDRDLKKDP 422
Query: 141 -----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS-WR 188
D ++ M D L++ R I +++C++G WG + + WR
Sbjct: 423 YSVSSLAFQGGGDSIAGRKPFKIMGDYLRQQPRDIVYNLCQYGMGDVWEWGDAVGSQCWR 482
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVH 240
TT+DI+D+W + +IA + A AKPG WNDPD+L VG G +K +E +H
Sbjct: 483 TTNDITDTWESVKSIALAQDKAAAWAKPGNWNDPDILVVGIVGWGNPHQTKLKPDEQYLH 542
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGP 289
FS+W++ APLL+GCD+ + T ++ N EVIAVNQ AI + +I+
Sbjct: 543 FSLWSLFSAPLLIGCDLEKMDDFTFSLLTNNEVIAVNQDPLGKQATCVHAIGELRIYVKE 602
Query: 290 LS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGK--LTAKVD 343
L G++ V + + + +D +GI T L+ + ++T G+ L+ V
Sbjct: 603 LEDGSKAVGFCNFDREKTDISFRDFDKLGITGKQTVRDLWRQKDIKTLDAGRETLSLNVP 662
Query: 344 GHSCKMYELTPV 355
H +Y+ PV
Sbjct: 663 AHGVLLYKFMPV 674
>gi|10863053|dbj|BAB16832.1| alpha-galactosidase [Lachancea kluyveri]
Length = 377
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 137/297 (46%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 7 LIPDSKKFPRGMKHVADSLHDKGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 66
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 67 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 126
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 127 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 186
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 187 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGAPV 246
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + S M A +DI I S
Sbjct: 247 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRS-MNATLEDIFIDS 302
>gi|325268526|ref|ZP_08135156.1| alpha-galactosidase [Prevotella multiformis DSM 16608]
gi|324989054|gb|EGC21007.1| alpha-galactosidase [Prevotella multiformis DSM 16608]
Length = 409
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 101/328 (30%), Positives = 149/328 (45%), Gaps = 60/328 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + I+E++IR D +S + + +
Sbjct: 31 DSLALTPPMGFMTWNKYKDDISEQLIRRI--------ADRMVSAGYAKAGYKYIFIDDGW 82
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRIES 123
+ N++ + FPSG KALADYVH +L+ D + ++
Sbjct: 83 QGGRDKRNNIIPDPVKFPSGMKALADYVHSKGLLLGIYSDAARLTCAGYTGSYGFEKQDA 142
Query: 124 NIAFEQGIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
E GIDYLKYD C +D RYR M DAL+ +GR I +CEWG + P W
Sbjct: 143 RTFAEWGIDYLKYDYCHAPSDSAVAHQRYRKMGDALQDSGRKIALGVCEWGQLNPERWAR 202
Query: 182 NIRNS-WRTTDDISDSWARM--------LTIADMNEVYADHAKPGGWNDPDML------- 225
S WR + D+ D W + L I D+ E +A PG W D DML
Sbjct: 203 QAGGSLWRVSYDVRDMWKDIVGQGGMGILDIIDITEPLYKYAGPGHWLDMDMLIVGLDGK 262
Query: 226 -----EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-A 279
++G GG Y EY S+W + +PL + D+ N T I+ N+E+IA+NQ A
Sbjct: 263 GGPSSDLGGGGCTYTEYQTQMSMWCMFASPLAMSHDLLNENEATRRILLNKEMIAINQDA 322
Query: 280 INDQ----------QIWAGPLSGNRIVV 297
+ + +++ LSGNR+ +
Sbjct: 323 LGEAARRVDFPGVCRVYLRRLSGNRLAL 350
>gi|292495602|sp|A7XZT2.1|AGALB_TALEM RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|156763774|gb|ABU94728.1| alpha-galactosidase [Rasamsonia emersonii]
Length = 452
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 162/380 (42%), Gaps = 94/380 (24%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G P +GWNSWN F C I+E KI+ + +D L + D +
Sbjct: 29 DGVGKLPALGWNSWNAFGCDIDEEKILTAANQIVNLGLKD----LGYEYVNIDDCWSV-- 82
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNR-IE 122
K N G ++ + T FP G LA+ +H L SL N I+
Sbjct: 83 KSGRNATTGRIMPDLTKFPDGISGLAEKIHNLGLKIGIYSSAGWTTCAGYPASLGNETID 142
Query: 123 SNIAFEQGIDYLKYDNC-------------FNDDTRPTI--------------------- 148
+ E GIDYLKYDNC D P
Sbjct: 143 AETFAEWGIDYLKYDNCGVPPDWQDQYSYCVPDSGDPATNPNGTCPNLQNPAPAVYDWRT 202
Query: 149 -----RYRAMSDAL--KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARML 201
RYR M DAL + R I FS+C+WG WG+ NSWR + DIS +W R+
Sbjct: 203 SKTAERYRRMRDALLGVQDKRTILFSLCDWGQADVNEWGAETGNSWRMSGDISPNWPRIS 262
Query: 202 TIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
TIA++N + G NDPDMLEVGNG + E HF++WA K+PL++G + +
Sbjct: 263 TIANLNSFELNSVDFWGHNDPDMLEVGNGNLTLAENRAHFALWAAMKSPLIIGTALDKID 322
Query: 262 AETMPIIGNEEVIAVNQAINDQQI-------------------------WAGPLSGNRIV 296
+ + I+ N+ ++ +Q D QI W+GP S ++
Sbjct: 323 QDHLSILSNKYLLTFHQ---DPQIGRPAYPYKWGYNPDWTFDPGHPAEYWSGPTSSGDVL 379
Query: 297 VLLESRKTFSSMMTAHWDDI 316
VL+ + ++ + TA W ++
Sbjct: 380 VLMLNTESGPANRTAVWSEV 399
>gi|395327306|gb|EJF59707.1| glycoside hydrolase [Dichomitus squalens LYAD-421 SS1]
Length = 496
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 131/286 (45%), Gaps = 73/286 (25%)
Query: 40 CQINEKIIRET-------------------DDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
C INE I+ E DDC+ E SR
Sbjct: 92 CNINESIVLEAAEYLDNLGLKDVGYRRVNLDDCYSEKSRSPA------------------ 133
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIES 123
G++VA+ FPSG +L D +H L S N
Sbjct: 134 --------GDIVASAARFPSGMNSLTDRIHALGLDAGIYSDSGWFTCQLYPGSYQNEYRD 185
Query: 124 NIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDALKK-----AGRPIFFSMCEWG 172
F+ G DYLKYDNC ++D R I +Y+ ++DA+ + P+F S+CEWG
Sbjct: 186 AKLFQDWGFDYLKYDNCAIPYDDVIRQGIVGKYQRIADAIAQLASDTGKSPLFLSLCEWG 245
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
+ +P LW SWRTT DIS +W + I + N A + G ND D+LEVGNG +
Sbjct: 246 EEQPWLWARRYGQSWRTTGDISPNWGSVTDIINRNSFIAWASDFYGHNDMDILEVGNGDL 305
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
Y E HF++WA+ K+PLL+ D+ +++ + I+ NEE++A+NQ
Sbjct: 306 TYEEARTHFTVWALMKSPLLISADLSSISEADLSILANEEILAINQ 351
>gi|49614964|dbj|BAD26725.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 137/297 (46%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGTPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + S M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRS-MNATLEDIFIDS 341
>gi|449283280|gb|EMC89957.1| Alpha-N-acetylgalactosaminidase, partial [Columba livia]
Length = 300
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/298 (33%), Positives = 131/298 (43%), Gaps = 52/298 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN-----EKIIRETDDCWGESSRDSQISLSCSFHMFDS 74
L NGL LTPPMGW +W F C ++ IRE C GE D L + D
Sbjct: 4 LENGLALTPPMGWLAWERFRCNVDCREDPRNCIRE-HGCVGE---DGWRELGYEYINIDD 59
Query: 75 LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------LISNS 116
+ G LVA+ FP G KALADYVH +
Sbjct: 60 CWSAKKR----DAAGRLVADPERFPRGIKALADYVHARGLKLGIYGDLGTLTCGGYPGTT 115
Query: 117 LDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
LD + F E G+D LK D C++ Y M+ AL GRPI +S C W
Sbjct: 116 LDLVEQDAQTFAEWGVDMLKLDGCYSSGEEQAKGYPEMARALNSTGRPIVYS-CSW---- 170
Query: 176 PALWG-----------SNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWN 220
PA G + + N WR DDI DSW +L+I D +V A PG WN
Sbjct: 171 PAYQGGLPPKVNYTVLAEVCNLWRNYDDIQDSWDSVLSILDWFSANQDVLQPAAGPGHWN 230
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
DPDML +GN G+ Y + ++W + APLL+ D+ ++ I+ N +I +NQ
Sbjct: 231 DPDMLIIGNFGLSYEQSRSQMALWTVMAAPLLMSTDLRTISPSAKEILQNRLMIQINQ 288
>gi|145233743|ref|XP_001400244.1| alpha-galactosidase B [Aspergillus niger CBS 513.88]
gi|292495600|sp|A2QEJ9.1|AGALB_ASPNC RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|134057178|emb|CAK44445.1| alpha-galactosidase aglB-Aspergillus niger
Length = 443
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 162/375 (43%), Gaps = 86/375 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQIN-EKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G TP +GWNSWN + C I+ +KI+ ++ +D L + D +
Sbjct: 21 DGVGRTPALGWNSWNAYSCDIDADKIVTAANEVVNLGLKD----LGYEYINIDDCWSV-- 74
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLD-------------NRIE 122
K N ++ + FP+G +AD VH L I +S I+
Sbjct: 75 KSGRNTTTKRIIPDPDKFPNGISGVADQVHALGLKLGIYSSAGLTTCAGYPASLGYEEID 134
Query: 123 SNIAFEQGIDYLKYDNC-----FND------------------------DTRP------- 146
+ E GIDYLKYDNC D D P
Sbjct: 135 AQSFAEWGIDYLKYDNCGVPTNLTDQYTYCVPDSTDGSNYPNGTCVNLTDAAPQGYDWAT 194
Query: 147 ---TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY+ M DAL R I +S+C+WG WG+ NSWR + DI+ +W+R+ I
Sbjct: 195 STTAKRYQRMRDALLSVNRTILYSLCDWGQADVNAWGNATGNSWRMSGDITATWSRIAEI 254
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A+ N ++A G+ DPDMLEVGNG + E HF++WA+ KAPL++G + ++
Sbjct: 255 ANENSFLMNYANFWGYPDPDMLEVGNGNLTLPENRAHFALWAMMKAPLIIGTPLDSIDTS 314
Query: 264 TMPIIGNEEVIAVNQAI----------------------NDQQIWAGPLSGNRIVVLLES 301
+ I+ N+ ++ +Q + + W+GP S + VL+ +
Sbjct: 315 HLTILSNKPLLTFHQDAVIGRPAYPYKWGYNPDWTFDPEHPAEYWSGPTSSGEVFVLMLN 374
Query: 302 RKTFSSMMTAHWDDI 316
+ +A W+++
Sbjct: 375 SEGEVKTRSAVWEEV 389
>gi|49614944|dbj|BAD26715.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614948|dbj|BAD26717.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614962|dbj|BAD26724.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 137/297 (46%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDKGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGAPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + S M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRS-MNATLEDIFIDS 341
>gi|49614942|dbj|BAD26714.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 137/297 (46%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGTPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + S M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRS-MNATLEDIFIDS 341
>gi|49614938|dbj|BAD26712.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614950|dbj|BAD26718.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614952|dbj|BAD26719.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614954|dbj|BAD26720.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614956|dbj|BAD26721.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614958|dbj|BAD26722.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/297 (33%), Positives = 137/297 (46%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGTPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + S M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRS-MNATLEDIFIDS 341
>gi|373456249|ref|ZP_09548016.1| Alpha-galactosidase, NEW1 domain-containing protein [Caldithrix
abyssi DSM 13497]
gi|371717913|gb|EHO39684.1| Alpha-galactosidase, NEW1 domain-containing protein [Caldithrix
abyssi DSM 13497]
Length = 779
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 153/322 (47%), Gaps = 52/322 (16%)
Query: 1 LFIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
++ V +A ++ G DR L + N + LTPPMGWNSWN + + ++ + + E
Sbjct: 375 VYHVKITARNALGEDRATLKIVIGNQIALTPPMGWNSWNVWGLSVTQERVYAAARAFVEK 434
Query: 58 SRDSQISLSCSFHMFDSLTQI-----AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL 112
++ F D +I AK PN G + NK FP K LAD +H L
Sbjct: 435 GL---VNHGWQFVNIDDGWEIIGSSDEAKRHPN---GEIRTNKK-FPD-IKRLADDIHAL 486
Query: 113 ------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIR----- 149
++ +++ G+D+LKYD C +
Sbjct: 487 GLKLGIYSSPGPLTCGGYTASYGYEELDAQTFARWGVDFLKYDLCSYRKMMKDLHSAEEL 546
Query: 150 ---YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIAD 205
Y+ M+ AL+K R I +S+CE+G + WG+ + N WRTT DI D W RM +I
Sbjct: 547 IPPYKKMNQALQKVDRDIVYSICEYGLGKVWEWGARVGGNLWRTTGDIWDDWERMASIGF 606
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGG---------MKYNEYVVHFSIWAISKAPLLLGCD 256
E A +A PG WNDPDML VG G + +E H S+WA+ APLLLGCD
Sbjct: 607 NQEQAAPYAGPGHWNDPDMLVVGWVGWGDQLHYTKLTPDEQYTHISLWALLSAPLLLGCD 666
Query: 257 VGNLTAETMPIIGNEEVIAVNQ 278
+ L T+ ++ N+EV+AVNQ
Sbjct: 667 LQRLDDFTLNLLTNDEVLAVNQ 688
>gi|224098608|ref|XP_002187248.1| PREDICTED: alpha-galactosidase A [Taeniopygia guttata]
Length = 415
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 141/302 (46%), Gaps = 55/302 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-TDDCWGESSRDSQISLSCS 68
L NGL TPPMGW W F C ++E++ E D E RD+ C
Sbjct: 24 LDNGLARTPPMGWLHWERFLCGTDCAAEPRRCVSEQLFVEMADRMAAEGWRDAGYEFICI 83
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFE 128
+ + T+ QG L A+ FP G + LADYVH S L I S++ +
Sbjct: 84 DDCWMAPTRDG--------QGRLQADPKRFPGGIRKLADYVH---SKGLKLGIYSDVGSK 132
Query: 129 Q---------------------GIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFF 166
G+D LK+D C +D YR MS AL K GR I +
Sbjct: 133 TCAGFPGSYNHYDLDAQTFASWGVDLLKFDGCNSDSLEQLAEGYRLMSLALNKTGRSIVY 192
Query: 167 SMCEWG-DMRPALWGSNIR-----NSWRTTDDISDSWARMLTIADMNEVYADH----AKP 216
S CEW +RP + N WR D+ DSW+ + +I D ++ D A P
Sbjct: 193 S-CEWPFYLRPVQQPNYTEIKQYCNHWRNFYDVYDSWSSIKSILDWTALHQDTIVKIAGP 251
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
GGWNDPDML +GN G+ + + V ++WAI APL + D+ +++ E ++ N+EVIA+
Sbjct: 252 GGWNDPDMLVIGNFGLSWEQSVTQMAMWAIMAAPLFMSNDLRHISPEAKWLLQNKEVIAI 311
Query: 277 NQ 278
NQ
Sbjct: 312 NQ 313
>gi|224094705|ref|XP_002192599.1| PREDICTED: alpha-N-acetylgalactosaminidase [Taeniopygia guttata]
Length = 432
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 132/302 (43%), Gaps = 56/302 (18%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
NGL TPPMGW SW F C + DC ++ + IS F M D L +
Sbjct: 23 ENGLARTPPMGWMSWERFRCNV---------DC--QADPRNCISEQLFFEMADRLAEDGW 71
Query: 81 KPLPNQ--------------CQGNLVANKTTFPSGTKALADYVH-------------ILI 113
+ L + G LV + FP G KALADYVH I
Sbjct: 72 RELGYEYINLDDCWAAKQRDVAGQLVPDPKRFPRGIKALADYVHARGLKLGIYGDLGIFT 131
Query: 114 SNSLDNRIESNIAFEQ------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
+ N+ + G+D LK D C++ Y M+ AL GRPI +S
Sbjct: 132 CGGYPGTMLENVKQDAQTFAAWGVDMLKLDGCYSSAEEQAKGYPEMARALNATGRPIVYS 191
Query: 168 MCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKP 216
C W G + P + + I N WR DDI DSW +L+I D +V A P
Sbjct: 192 -CSWPAYQGGLPPQVNYTILAEICNLWRNYDDIQDSWDSVLSIVDWFFTNQDVLQPAAGP 250
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WNDPDML +GN G+ Y + ++W + APLL+ D+ ++ E I+ N +I +
Sbjct: 251 GHWNDPDMLIIGNFGLSYEQSRSQMALWTVMAAPLLMSTDLRTISPEAKEILQNRLMIQI 310
Query: 277 NQ 278
NQ
Sbjct: 311 NQ 312
>gi|20151048|pdb|1KTB|A Chain A, The Structure Of Alpha-N-Acetylgalactosaminidase
gi|20151049|pdb|1KTC|A Chain A, The Structure Of Alpha-N-Acetylgalactosaminidase
Length = 405
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 132/318 (41%), Gaps = 86/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C +N E + E
Sbjct: 1 LENGLARTPPMGWLAWERFRCNVNCREDPRQCISEMLFMEMADRIAEDGWRELGYKYINI 60
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G LV + FP G KALADYVH
Sbjct: 61 DDCWAAKQRDAE--------------------------GRLVPDPERFPRGIKALADYVH 94
Query: 111 -----ILISNSLD------------NRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYR 151
+ I L +R+E + E G+D LK D C++ Y
Sbjct: 95 ARGLKLGIYGDLGRLTCGGYPGTTLDRVEQDAQTFAEWGVDMLKLDGCYSSGKEQAQGYP 154
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIA 204
M+ AL GRPI +S C W G + P + I N WR DDI DSW +L+I
Sbjct: 155 QMARALNATGRPIVYS-CSWPAYQGGLPPKVNYTLLGEICNLWRNYDDIQDSWDSVLSIV 213
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V A PG WNDPDML +GN G+ Y + ++W I APLL+ D+ +
Sbjct: 214 DWFFTNQDVLQPFAGPGHWNDPDMLIIGNFGLSYEQSRSQMALWTIMAAPLLMSTDLRTI 273
Query: 261 TAETMPIIGNEEVIAVNQ 278
+ I+ N +I +NQ
Sbjct: 274 SPSAKKILQNRLMIQINQ 291
>gi|67461064|sp|Q90744.1|NAGAB_CHICK RecName: Full=Alpha-N-acetylgalactosaminidase; AltName:
Full=Alpha-galactosidase B
gi|435037|gb|AAA16614.1| alpha-N-acetylgalactosaminidase, partial [Gallus gallus]
Length = 405
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 129/318 (40%), Gaps = 86/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C +N E + E
Sbjct: 1 LENGLARTPPMGWLAWERFRCNVNCREDPRQCISEMLFMEMADRIAEDGWRELGYKYINI 60
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G LV + FP G KALADYVH
Sbjct: 61 DDCWAAKQRDAE--------------------------GRLVPDPERFPRGIKALADYVH 94
Query: 111 I------------------LISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYR 151
+LD + F E G+D LK D C++ Y
Sbjct: 95 ARGLKLDIYGDLGRLTCGGYPGTTLDRVEQDAQTFAEWGVDMLKLDGCYSSGKEQAQGYP 154
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIA 204
M+ AL GRPI +S C W G + P + I N WR DDI DSW +L+I
Sbjct: 155 QMARALNSTGRPIVYS-CSWPAYQGGLPPKVNYTLLGEICNLWRNYDDIQDSWDSVLSIV 213
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V A PG WNDPDML +GN G+ Y + ++W I APLL+ D+ +
Sbjct: 214 DWFFTNQDVLQPFAGPGHWNDPDMLIIGNFGLSYEQSRSQMALWTIMAAPLLMSTDLRTI 273
Query: 261 TAETMPIIGNEEVIAVNQ 278
+ I+ N +I +NQ
Sbjct: 274 SPSAKKILQNRLMIQINQ 291
>gi|268370285|ref|NP_001161200.1| alpha-N-acetylgalactosaminidase precursor [Gallus gallus]
Length = 425
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 132/318 (41%), Gaps = 86/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C +N E + E
Sbjct: 21 LENGLARTPPMGWLAWERFRCNVNCREDPRQCISEMLFMEMADRIAEDGWRELGYKYINI 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G LV + FP G KALADYVH
Sbjct: 81 DDCWAAKQRDTE--------------------------GRLVPDPERFPRGIKALADYVH 114
Query: 111 -----ILISNSLD------------NRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYR 151
+ I L +R+E + E G+D LK D C++ Y
Sbjct: 115 ARGLKLGIYGDLGRLTCGGYPGTTLDRVEQDAQTFAEWGVDMLKLDGCYSSGKEQAQGYP 174
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIA 204
M+ AL GRPI +S C W G + P + I N WR DDI DSW +L+I
Sbjct: 175 QMARALNATGRPIVYS-CSWPAYQGGLPPKVNYTLLGEICNLWRNYDDIQDSWDSVLSIV 233
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V A PG WNDPDML +GN G+ Y + ++W I APLL+ D+ +
Sbjct: 234 DWFFTNQDVLQPFAGPGHWNDPDMLIIGNFGLSYEQSRSQMALWTIMAAPLLMSTDLRTI 293
Query: 261 TAETMPIIGNEEVIAVNQ 278
+ I+ N +I +NQ
Sbjct: 294 SPSAKKILQNRLMIQINQ 311
>gi|389747570|gb|EIM88748.1| glycoside hydrolase family 27 protein [Stereum hirsutum FP-91666
SS1]
Length = 470
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 147/333 (44%), Gaps = 89/333 (26%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKII-------------------RETDDCWGESS 58
N L +G+ P MG+N+WN ++C I+E I+ DDCW E +
Sbjct: 32 NKLGSGVARLPFMGYNTWNAYYCDIDEDIVLTNAKRMVELGFVDLGYNYMNVDDCWAEKN 91
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------- 110
R + G +ANK TFPSG K+L D +H
Sbjct: 92 RSAD--------------------------GFQLANKKTFPSGMKSLTDKIHEMGMKAGI 125
Query: 111 ---------ILISNSLDNRIESNIAF-EQGIDYLKYDNC---FNDDTRPTI--RYRAMSD 155
L S + + F E G D LK+DNC F+ + + ++ M++
Sbjct: 126 YSDSGWFTCQLYPGSFQHEEQDAETFMEWGFDLLKFDNCAIPFDTIVKEGVVGKFHRMAE 185
Query: 156 ALKKAGR-----PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
A+ K P+ +S+C+WG +P LW I SWRTT DI +W + +I + N
Sbjct: 186 AINKVSTKLQRTPLVYSLCQWGWGQPWLWAREISQSWRTTGDIEANWKSVTSILNANSFI 245
Query: 211 ADHAKPGGWNDPDML-------EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
A G ND D++ GNG + Y E HF+ WA+ K+PLL+ CD+ ++ E
Sbjct: 246 AWANNFYGHNDMDIVLTMRGRPHSGNGDLSYEEAKSHFTAWALMKSPLLISCDLTTISDE 305
Query: 264 TMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIV 296
+ I+ N+E+IA+NQ P+ G IV
Sbjct: 306 SFAILSNKEIIAINQ---------DPVEGKSIV 329
>gi|223938226|ref|ZP_03630122.1| autotransporter-associated beta strand repeat protein [bacterium
Ellin514]
gi|223893098|gb|EEF59563.1| autotransporter-associated beta strand repeat protein [bacterium
Ellin514]
Length = 1353
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/378 (30%), Positives = 169/378 (44%), Gaps = 56/378 (14%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF 72
G L+NGL LTPPMG+N+W INE I+ D ++ + + +M
Sbjct: 28 GVQSAQALTNGLALTPPMGYNTWYARGSSINEDYIKSIADTM--ATNGMKAAGYEYVNMD 85
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISN 115
D G ++AN FPSG KALADYVH +
Sbjct: 86 DGWAGY------RDTNGVMIANTNKFPSGIKALADYVHGKGLKLGLYTVFGPTTCAHLPG 139
Query: 116 SLDNRIESNIAFEQ-GIDYLKYDNCFNDD--TRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
S + ++ + Q GIDYLKY+ C D T + M DAL GRPI F+M
Sbjct: 140 SYGHEVQDAQTYAQWGIDYLKYEGCSFPDPLAHETEKAVQMRDALAATGRPIVFTMST-- 197
Query: 173 DMRPA-LWGSNIRNSWR-TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
PA W ++ N WR D++ W L D + A PG WNDPD++++G G
Sbjct: 198 --GPAESWMPDVLNMWRGAGDNVPHGWNTFLRHMDFVAQTPELAGPGHWNDPDVMDLGFG 255
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AI 280
G + ++ ++ +++ I APLL +++ + I+ N E + VNQ A
Sbjct: 256 GTEQDKAIL--TMYCIVAAPLL----SPTVSSGYLNILTNAEALQVNQDPAGIQGACVAT 309
Query: 281 N-DQQIWAGPLSG--NRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE---HRTLETKF 334
N D Q+W+ PLS N V L +R T ++ +TA+W D+G + R E F
Sbjct: 310 NGDLQVWSKPLSEGVNVRAVALLNRGTNTADITANWGDLGFPAGVAKVRDLWARAYEGNF 369
Query: 335 QGKLTAKVDGHSCKMYEL 352
TA V G S K ++
Sbjct: 370 TNSFTATVPGQSVKFLKI 387
>gi|134099681|ref|YP_001105342.1| alpha-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133912304|emb|CAM02417.1| putative alpha-galactosidase [Saccharopolyspora erythraea NRRL
2338]
Length = 380
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 171/388 (44%), Gaps = 75/388 (19%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCW-GESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
MGWNSWN F C I+E++IR+T D G RD+ + D P Q
Sbjct: 1 MGWNSWNSFGCDIDERLIRDTADALVGSGMRDAGYQYVV---VDDCWFDPQRDP-----Q 52
Query: 89 GNLVANKTTFPSGTKALADYVHIL------------------------ISNSLDNRIESN 124
GNL AN FPSG +ALADYVH + SL + +
Sbjct: 53 GNLRANPERFPSGIRALADYVHSRGLKFGIYQVPTEKTCAQRGGTYPGATGSLGHEEQDA 112
Query: 125 IAF-EQGIDYLKYDNCFNDDT--RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG- 180
F E G+DYLKYD C + T + M DAL GRPI +S+ P G
Sbjct: 113 RTFAEWGVDYLKYDWCSPEGTLEDQIAAFTKMRDALAATGRPIVYSINS-NSYHPDKNGA 171
Query: 181 ----SNIRNSWRTTDDISDSWAR---------MLTIADMNEVYADHAKPGGWNDPDMLEV 227
S + N WRTT+DI W ++ I D+N A A+PG WNDPDMLEV
Sbjct: 172 THDWSPVANMWRTTEDIKPVWDSGNENEYPMGVVNIIDVNRGLAAQARPGHWNDPDMLEV 231
Query: 228 GN------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI- 280
G G+ E H S+WA+ +PL+ G +V + I+ N EV+AV+Q
Sbjct: 232 GVYDVEGFKGLTDTEARAHLSMWALMASPLIAGNNVTRMPDGIRDILTNREVVAVDQDPA 291
Query: 281 ----------NDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNN----TLF 325
D+++W ++ G+R+V L + + + T + K++ L+
Sbjct: 292 GAQGVPVRDHGDREVWVKNMADGSRVVALFNRGERPAGIRTTAREAGLPKASEYQVRDLW 351
Query: 326 EHRTLETKFQGKLTAKVDGHSCKMYELT 353
H T T G++ A+V H + ++
Sbjct: 352 AHETSTT--DGEIRAEVPAHGVVLLRVS 377
>gi|10863057|dbj|BAB16834.1| alpha-galactosidase [Lachancea kluyveri]
Length = 377
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 136/297 (45%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 7 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 66
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 67 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 126
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 127 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 186
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 187 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGAPV 246
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + M A +DI I S
Sbjct: 247 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSK-DRPMNATLEDIFIDS 302
>gi|383316644|ref|YP_005377486.1| alpha-galactosidase [Frateuria aurantia DSM 6220]
gi|379043748|gb|AFC85804.1| alpha-galactosidase [Frateuria aurantia DSM 6220]
Length = 400
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 148/322 (45%), Gaps = 60/322 (18%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKPLPN 85
TPPMGW+SWNHF + IR T D S RD+ + D Q P
Sbjct: 30 TPPMGWSSWNHFAEHVTAAEIRATADAMVSSGMRDA----GYVYVNIDDGWQGGRNP--- 82
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIAFE 128
G++ N FP +ALADYVH + SL + + ++
Sbjct: 83 --DGSIRPN-AGFPD-MRALADYVHARGLKLGIYSSPGRHTCAGYTGSLGHEQQDAASYA 138
Query: 129 Q-GIDYLKYDNC--FNDDTRPTI-------------RYRAMSDALKKAGRPIFFSMCEWG 172
G+DYLKYD C D +P Y+ M AL + RPI +++C++G
Sbjct: 139 AWGVDYLKYDLCSYIQDVMQPQAPHDMGRQMALMHQAYQRMRVALDRQSRPIIYALCQYG 198
Query: 173 DMRPALWGSNIR---NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
P W + WRTT DI W R+ IA+ + AD+A PG WNDPDMLEVGN
Sbjct: 199 WGAPWQWATRPDVDGQLWRTTSDIEPDWDRIYAIAESQDGLADYAGPGHWNDPDMLEVGN 258
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G + E HFS WA+ APL+ G D+ +++ + I+ VIA++Q
Sbjct: 259 GHLSDAENRAHFSWWAMLAAPLIAGNDLRHMSLKVRDILTLSAVIAIDQDPLGRPATRTY 318
Query: 279 AINDQQIWAGPLS-GNRIVVLL 299
A ++W+ PL+ G+R V +
Sbjct: 319 ADGQVEVWSRPLADGDRAVAIF 340
>gi|49614940|dbj|BAD26713.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614946|dbj|BAD26716.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614960|dbj|BAD26723.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 95/285 (33%), Positives = 131/285 (45%), Gaps = 67/285 (23%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCSGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGTPA 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSM 308
+ Q+W+GPL +G++IV LL + SM
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSKDRSM 330
>gi|189464740|ref|ZP_03013525.1| hypothetical protein BACINT_01084 [Bacteroides intestinalis DSM
17393]
gi|189437014|gb|EDV05999.1| alpha-galactosidase [Bacteroides intestinalis DSM 17393]
Length = 677
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/430 (27%), Positives = 193/430 (44%), Gaps = 79/430 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G+ R L + + LTPPMGWNSWN F +++ + +++ ES
Sbjct: 250 FEVVLQAKNVKGTAERKLRIECGDRIALTPPMGWNSWNCFGHEVSAEKVKQAARAMVESG 309
Query: 59 RD----SQISLSCSF-HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL- 112
+ I++ S+ H D + L + QGN++ N FP K L DY+H L
Sbjct: 310 LVNYGWTYINIDDSWQHHRDPNDRTRGGRLRDD-QGNIIPN-AQFPD-MKGLTDYIHSLG 366
Query: 113 -----------------ISNSLDNRIESNIAFEQGIDYLKYDNCF--------------- 140
+ + + ++++ + G DYLKYD C
Sbjct: 367 LKVGIYSSPGPWTCGGCVGSYGYEKQDADMYGKWGFDYLKYDWCSYGGVLDRDLDKDPYS 426
Query: 141 ---------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS-WRTT 190
D ++ M D L++ R I +++C++G WG ++ WRTT
Sbjct: 427 VSSLAFQGGGDSIAGRKPFKIMGDYLRQQPRDIVYNLCQYGMGDVWKWGDDVGGQCWRTT 486
Query: 191 DDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVHFS 242
+DI+D+W + IA + A AKPG WNDPDML +G G +K +E +HFS
Sbjct: 487 NDITDTWESVKGIALSQDRAAAWAKPGNWNDPDMLVLGIVGWGNPHQTKLKPDEQYLHFS 546
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
+W++ APLL+GCD+ + T+ ++ N EVIAVNQ +I + +I+ L
Sbjct: 547 LWSLFSAPLLIGCDLEKMDDFTLSLLTNNEVIAVNQDPLGKQATCIYSIGELRIYVKELE 606
Query: 292 -GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGK--LTAKVDGH 345
G++ V + + + +D + I T L+ + + T G+ L V H
Sbjct: 607 DGSKAVGFCNFDREKADISFRDFDKLNITGKQTVRDLWRQKDIRTLDTGRKPLPLNVPAH 666
Query: 346 SCKMYELTPV 355
+Y+ TPV
Sbjct: 667 GVLLYKFTPV 676
>gi|169623138|ref|XP_001804977.1| hypothetical protein SNOG_14799 [Phaeosphaeria nodorum SN15]
gi|160704923|gb|EAT77991.2| hypothetical protein SNOG_14799 [Phaeosphaeria nodorum SN15]
Length = 547
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 174/379 (45%), Gaps = 59/379 (15%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT-QIAAKPLPNQ 86
PPMGWNS+N ++C +E+ I+E S + IS+ +D +T
Sbjct: 26 PPMGWNSYNSWNCLPSEERIKE--------STEGLISMGLDKLGYDFITVDCGWNSKYRD 77
Query: 87 CQGNLVANKTTFPSGTKALADYVHIL-ISNSL----------DNRIESNIAFEQ------ 129
+G + N T FPSG KAL DY+H +S L I +++ FE+
Sbjct: 78 SEGRMQWNTTLFPSGGKALGDYIHERGLSFGLYSGAGYLQCGSTEIPASLGFEKLDAESF 137
Query: 130 ---GIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-DM 174
G D LKYDNC+ + +++ R++ M L R I + +C+WG +
Sbjct: 138 AEWGGDSLKYDNCYATSNTTMVDSSSVESQSPARFQHMGAELDAVDRDIEYYICQWGIGV 197
Query: 175 RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
W S + +WR ++DI ++W + I + Y H K G + D DML VG +
Sbjct: 198 DVGKWASELGTTWRMSNDIYNAWRSIWRITNQIVPYFRHTKVGAFADMDMLIVGLNALSA 257
Query: 235 NEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQAINDQQ-------- 284
E HF +WAI+K+PL++G D G L+ ++ I+ N EVIAVNQ +Q
Sbjct: 258 EEERFHFGMWAINKSPLIMGAALDPGRLSNVSLQIMSNAEVIAVNQDSLAKQAQLIIRAT 317
Query: 285 -----IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETKFQGK 337
+W G LSG R ++ + + K S + +GI S L+ + + G
Sbjct: 318 EEEWDVWQGELSGARQILGIANWKNDSQSIQVDLAALGIGSAQARDLWAEKD-QGVVSGS 376
Query: 338 LTAKVDGHSCKMYELTPVT 356
+ GH +++ L+ +T
Sbjct: 377 QNLTLAGHELRLWLLSNIT 395
>gi|49614966|dbj|BAD26726.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614968|dbj|BAD26727.1| alpha-galactosidase [Lachancea kluyveri]
gi|49614970|dbj|BAD26728.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 136/297 (45%), Gaps = 68/297 (22%)
Query: 91 LVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESNIAFEQ-GID 132
L+ + FP G K +AD +H +L SL + +F +D
Sbjct: 46 LIPDSKKFPRGMKHVADSLHDEGLLFGMYSSAGEYTCAGYPGSLGHEEADAASFASWDVD 105
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N T RY+AMSDAL K GRPI++++C WG+ P W S I NSW
Sbjct: 106 YLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGRPIYYTLCNWGEDSPWHWASAIANSW 165
Query: 188 RTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPDML 225
R + DI + R ++ I + A PGGWND D L
Sbjct: 166 RISGDIYADFDRPDSRCPCNGDEYYCHLAGFHCSIMNILNKAAPLGQKAGPGGWNDLDAL 225
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
EVG G M E HFS+WA K+PL++G ++ L A++ I N +IAVNQ
Sbjct: 226 EVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNTLDAKSFSIYVNPAIIAVNQDAAGAPV 285
Query: 279 --------------AINDQQIWAGPL-SGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ Q+W+GPL +G++IV LL + M A +DI I S
Sbjct: 286 MRVWRYYVSDTDEHGQGEIQLWSGPLDNGDQIVALLNAGSK-DRPMNATLEDIFIDS 341
>gi|154308151|ref|XP_001553412.1| hypothetical protein BC1G_07821 [Botryotinia fuckeliana B05.10]
Length = 517
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/248 (32%), Positives = 127/248 (51%), Gaps = 31/248 (12%)
Query: 136 YDNCFND------------DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
YDNC++D T P+IRY+ M++AL +PI F +C+WG P+ W +
Sbjct: 14 YDNCYSDAATGYPNTNYAPSTSPSIRYQNMTNALATLSKPILFQICDWGVDFPSTWAPAM 73
Query: 184 RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSI 243
N+WR T+DI S+ + I + A PG W D DMLEVGN E HFS+
Sbjct: 74 GNTWRITNDIIPSYPTVSRILNQAVPQTSFAGPGHWLDLDMLEVGNNIFTTAEEQTHFSL 133
Query: 244 WAISKAPLLLGC----DVGNLTAETMPIIGNEEVIAVNQ-------------AINDQQIW 286
WAI K+PL++G + ++++ ++ I+ N++VI+ NQ + ++W
Sbjct: 134 WAILKSPLVIGAALKDSLTSISSASLAILSNKDVISYNQDSLGVAASFRRRWTTDGYEVW 193
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL--FEHRTLETKFQGKLTAKVDG 344
AGPLSG R VV + + + + +T ++ D+G++ +TL T +A VD
Sbjct: 194 AGPLSGGRTVVAVINLQNTAKTLTLNFPDVGLQKASTLKNIWANISTTNVLTSWSADVDA 253
Query: 345 HSCKMYEL 352
H + EL
Sbjct: 254 HGTMLLEL 261
>gi|440793257|gb|ELR14445.1| Aga27A, putative [Acanthamoeba castellanii str. Neff]
Length = 413
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 167/398 (41%), Gaps = 73/398 (18%)
Query: 20 LSNGLGLTPPMGWNSW----NHFHCQ---INEKIIRETDDCWGESSRDSQISLSCSFHMF 72
L NG+ PPMGW++W + C NE+ IR D + + L ++ +
Sbjct: 23 LDNGVARLPPMGWSTWCTENDVLPCYDDFCNEQEIRSVADA---MAANGLKDLGYNYIVL 79
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------ILISNSLDNRI 121
D Q N++ + + FPSG K L DY+H + + R
Sbjct: 80 DDCWGGG-----RDAQNNIIPDASRFPSGMKNLTDYIHARGLLFGVYTDVGTTTCRGGRP 134
Query: 122 ESNIAFEQ--------GIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWG 172
S +EQ G+DY+K D C Y MS AL K GRPIFFS+CEWG
Sbjct: 135 GSWPFYEQDARVFASWGLDYVKMDWCHEPSGFTAQELYTNMSQALNKTGRPIFFSICEWG 194
Query: 173 DMRPALWGSNIRNSWRTTDDISDSW---------ARMLTIADMNEVYA---DHAKPGGWN 220
P WG + NSWR D W + A++ E A +A PGGWN
Sbjct: 195 RSDPWTWGMKVGNSWRVGPDHLPVWWTPPFQQDPGQGQGTANIIEHMAGLSSYAGPGGWN 254
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPD L G + NE + FS W + +PL++ DV L+ + I N+E IAVNQ
Sbjct: 255 DPDFLMPGYWWLSENEQISEFSFWCLWASPLIVATDVRVLSDKKH--ILNKEAIAVNQDK 312
Query: 279 ----------AINDQQIWAGPLSGNRIVVLLESRKTFSSM-MTAHWDDI---------GI 318
+ Q+WA PLS V+L + F S+ + WD I G
Sbjct: 313 LGIPGDRIANYTDGGQVWAKPLSNGSWAVILYNPNFFESVDIPVSWDQIRGWPAMQQAGN 372
Query: 319 KSN--NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTP 354
K L++H G L +V+ H+ + +TP
Sbjct: 373 KKALVRDLWKHENQGVWAVGFLGKRVEAHAVRFITVTP 410
>gi|373958146|ref|ZP_09618106.1| glycoside hydrolase clan GH-D [Mucilaginibacter paludis DSM 18603]
gi|373894746|gb|EHQ30643.1| glycoside hydrolase clan GH-D [Mucilaginibacter paludis DSM 18603]
Length = 742
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/317 (32%), Positives = 158/317 (49%), Gaps = 50/317 (15%)
Query: 8 AADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS 64
A++S G ++L + + L LTPPMGWNSWN + ++ + ++ SS D+ I
Sbjct: 352 ASNSLGKYEKSLEIKVGDTLALTPPMGWNSWNCWGLSVSAEKVK--------SSADAMIQ 403
Query: 65 LSCSFHMFDSL-TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILIS-- 114
+ + ++ + + G + AN+ FP L DY+H I S
Sbjct: 404 KGLADYGWNYINVDDGWQATGRAGDGEIKANEK-FPD-MGGLGDYLHQQGLKFGIYSSPG 461
Query: 115 --------NSLDNRIESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIR---YRAMSDAL 157
SL + + + + Q G+DYLKYD C +DT +++ Y M + L
Sbjct: 462 TKTCGGFLGSLGHEGQDAVTYNQWGVDYLKYDLCSYTDVIGNDTSLSVQQKPYMLMRNYL 521
Query: 158 KKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKP 216
+K R I +S+C++G WGS++ N WRTT+DI+D+W + +I + +A P
Sbjct: 522 EKQPRDIIYSICQYGIHDVWKWGSSMNGNLWRTTEDITDTWESLYSIGFAQSNFYPYAHP 581
Query: 217 GGWNDPDMLEVGNGGMKYN---------EYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
GGWNDPDML VG G N E H S+W + APLL+GCD+ NL T+ +
Sbjct: 582 GGWNDPDMLIVGKVGWGENLHASRLTPYEQYTHISLWCLLSAPLLIGCDMSNLDEFTLNL 641
Query: 268 IGNEEVIAVNQAINDQQ 284
+ N EVIAV+Q +Q
Sbjct: 642 LKNNEVIAVDQDAAGKQ 658
>gi|326435640|gb|EGD81210.1| hypothetical protein PTSG_13159 [Salpingoeca sp. ATCC 50818]
Length = 463
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 136/320 (42%), Gaps = 93/320 (29%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ----------INEKIIRET-------------------DD 52
N L TPPMGW SW F CQ IN + + T DD
Sbjct: 34 NKLAQTPPMGWMSWEIFRCQMDCKTYPNSCINADLYKTTADAMKAKGFLDAGYDTVHIDD 93
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL 112
CW +RD PN G L AN FP G LADY+H L
Sbjct: 94 CWEADARD-----------------------PNT--GRLAANADRFPGGIAPLADYIHKL 128
Query: 113 ------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMS 154
+ + L+ +++ + G+DYLK D C + T Y AM
Sbjct: 129 GLKFGLYTAESKTTCTGHVGSYLNEILDAQTFADWGVDYLKVDGCNPNKTYYPTGYTAMG 188
Query: 155 DALKKAGRPIFFSMCEWGDMRPALWGSNIR------------NSWRTTDDISDSWARMLT 202
AL+ +GR I +S C W PA GSN N WR DI +W + +
Sbjct: 189 TALQISGRDIVYS-CSW----PAYLGSNETTKPYNTMIGIGCNLWRNWHDIQCNWKSLSS 243
Query: 203 I----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D +V A +A PG WNDPDML +GN + ++ +IW+I APL++G DV
Sbjct: 244 IIDHWGDYGQVLASYAGPGHWNDPDMLLIGNDCITDDQARTQMAIWSIVAAPLIMGNDVR 303
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
N+TA T I+ N + IAVNQ
Sbjct: 304 NITASTRAILLNRDAIAVNQ 323
>gi|325105285|ref|YP_004274939.1| alpha-galactosidase [Pedobacter saltans DSM 12145]
gi|324974133|gb|ADY53117.1| Alpha-galactosidase [Pedobacter saltans DSM 12145]
Length = 674
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/412 (29%), Positives = 191/412 (46%), Gaps = 82/412 (19%)
Query: 3 IVSSSAADSNGSDR-RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS 61
I+S+ A + S + R + + + LTP MGWNSWN F +++ + ++ D ++ +
Sbjct: 250 ILSAKNAKGSASKKLRIVCGDKIALTPTMGWNSWNCFGHEVSAEKVKRAADALIKTGLVN 309
Query: 62 Q----ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSL 117
I++ S+ + K G ++ N + FP K L DY+H SN L
Sbjct: 310 HGWNYINIDDSWQYNRDGKDTSFKGKMRDENGYILTN-SKFPD-MKGLTDYMH---SNGL 364
Query: 118 DNRIES------------NIAFEQ---------GIDYLKYDNCF----------NDDTR- 145
I S + +E+ G DYLKYD C ND +
Sbjct: 365 KAGIYSSPGPWTCGGCAGSYGYEKQDAESYAKWGFDYLKYDWCSYGGVIDGLPDNDPNKV 424
Query: 146 PTIR-------------YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI-RNSWRTTD 191
P++ ++ M D LKK R I +++C++G WG + SWRTT+
Sbjct: 425 PSLAFQGGADLDKGVKPFKVMGDLLKKQSRDIVYNLCQYGMGDVWKWGDDADAQSWRTTN 484
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVHFSI 243
DI+D+WA + +IA + A +AKPG WNDPDML VG G +K +E +H S+
Sbjct: 485 DITDTWASVKSIALAQDKAAPYAKPGNWNDPDMLVVGVVGWGNAHQSRLKPDEQYLHISL 544
Query: 244 WAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN-----------QAINDQQIWAGPLS- 291
W+I APLL+GCD+ L T+ ++ N+EVIAVN Q I + +I+ L
Sbjct: 545 WSIFSAPLLIGCDLEKLDDFTINLLTNDEVIAVNQDALGKQGVCQQTIGELKIYVKELED 604
Query: 292 GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTAKVD 343
G + V + +M + +GI EH+T+ ++ K AK++
Sbjct: 605 GGKAVAFANFGREKVNMSYKDFQKLGIT------EHQTVRDLWRQKNIAKIN 650
>gi|342885960|gb|EGU85909.1| hypothetical protein FOXB_03576 [Fusarium oxysporum Fo5176]
Length = 409
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 159/384 (41%), Gaps = 103/384 (26%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET--------- 50
L +V+ + + R +L + +G P +GWN WN C +EK+ T
Sbjct: 8 LGVVALAGLATAVPSRNHLQTREVGKLPALGWNGWNQGQCNAASEKVALATAKTFINLGL 67
Query: 51 ----------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPS 100
DDCW RDS+ GNLV + +P
Sbjct: 68 KDAGYQYVNIDDCWSTKQRDSK--------------------------GNLVPDPAKWPR 101
Query: 101 GTKALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFND 142
G K + D +H + + + ++N+ G+DY KYDNC+
Sbjct: 102 GIKPVTDEIHAMGLKFGLYGDGGAKTCAGYPGSQGHEQQDANLLASWGVDYWKYDNCYTP 161
Query: 143 ------------------DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+RP RY M D L+ GR I +S+C WG WG+ +
Sbjct: 162 CNTGNGADIQTCPNNQAPSSRP--RYEKMRDLLRATGRDILYSLCNWGYDEVWTWGAQVG 219
Query: 185 NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIW 244
+ WR + D WA ++ IA+ + PG +ND DM+ + NG + E HF+IW
Sbjct: 220 HMWRMSQDNWGQWADVVRIANQAAPILKYTVPGHYNDLDMMILANGALTPAEERTHFAIW 279
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------------AINDQQI--- 285
I+K+P++LG D+ L ++ + +I N+ ++AVNQ A +I
Sbjct: 280 CITKSPIILGTDMTKLNSDEVKLITNKGLLAVNQDSLSKPAVPFTPPNTPAKGSSEIYPY 339
Query: 286 WAGPLSGNRIVVLLESRKTFSSMM 309
W+GP+S +V ++ S+ ++ +
Sbjct: 340 WSGPISTGTVVAIVASKGNLNTQL 363
>gi|409048486|gb|EKM57964.1| glycoside hydrolase family 27 protein, partial [Phanerochaete
carnosa HHB-10118-sp]
Length = 434
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 168/402 (41%), Gaps = 100/402 (24%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESS 58
N L++G+ P +G+N+WN + C I+E +I T DDC+ ES+
Sbjct: 22 NALNDGVARLPVLGYNTWNAYQCDIDEDLIITTAKLMRSLGLQDAGYNWVNLDDCYAESN 81
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN--- 115
R + G +V + F SG L D +H + N
Sbjct: 82 RSAD--------------------------GLIVPDNVRFKSGFNWLTDQLHSMGFNAGI 115
Query: 116 --------------SLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYR---- 151
S N F+ G DYLKYDNC F+D R +Y+
Sbjct: 116 YSDSGWFTCAGYPGSFQNEARDAKTFQDWGFDYLKYDNCAIPFDDIIREGTFGKYQRIAG 175
Query: 152 AMSDALKKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVY 210
A++D K + RP + S+CEWG + +WG I SWRTT DIS +W ++I + N
Sbjct: 176 AIADLAKSSSRPPMILSLCEWGWSQVWIWGKTIGQSWRTTGDISPNWDSAMSIINFNSFI 235
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
G ND DML+VGN G+ E HF+ WA+ K+PLL+G ++ +T + + I+ N
Sbjct: 236 TTATDFYGRNDLDMLQVGNEGLTPEEAKTHFTAWALMKSPLLIGTNLSAITPDIVEILTN 295
Query: 271 EEVIAVNQ----------------------AINDQQIWAGPLSGNRIVVLL----ESRKT 304
E++A+NQ + + Q W+GP + +LL E
Sbjct: 296 SEILAINQDPVEGTSISPFRWGINPDWTSNSTHPAQFWSGPSENGTVFMLLNTLDEPASL 355
Query: 305 FSSMMTAHWDDIGIK-SNNTLFEHRTLETKFQGKLTAKVDGH 345
F ++ + W G + S L+ H T + V H
Sbjct: 356 FFNLTESPWIRAGRQYSVRDLWTHTDNGTAIRNFTATNVPAH 397
>gi|392562193|gb|EIW55374.1| glycoside hydrolase [Trametes versicolor FP-101664 SS1]
Length = 737
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/196 (36%), Positives = 115/196 (58%), Gaps = 14/196 (7%)
Query: 89 GNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPT 147
G++VA+K FP+G L D +H + + +N F+ G DYLK+DNC
Sbjct: 402 GDIVADKERFPAGMNDLTDKIHGMGLKAGNNEARDAKLFQDWGFDYLKFDNC-------- 453
Query: 148 IRYRAMSDAL----KKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
Y+ M+DA+ + +G+P + S+CEWG+ +P LW SWRTT DIS W + +
Sbjct: 454 AEYKRMADAIAALAQSSGKPPLSLSLCEWGEEQPWLWARRFGQSWRTTGDISPDWGSIAS 513
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
I + N A + G ND D+LE+GNGG+ ++E +HF+ WA+ K+PLL+G D+ ++
Sbjct: 514 IINQNSFIAWASDLYGHNDKDILEIGNGGLTHDEAKMHFTAWALMKSPLLIGTDLSTISD 573
Query: 263 ETMPIIGNEEVIAVNQ 278
+ I+ N E++A++Q
Sbjct: 574 ADLAILKNTELLALSQ 589
>gi|354493447|ref|XP_003508853.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Cricetulus
griseus]
gi|344254579|gb|EGW10683.1| Alpha-N-acetylgalactosaminidase [Cricetulus griseus]
Length = 415
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 136/303 (44%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C IN ++ +C IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNINCD--QDPKNC---------ISERLFMEMADRLAQD 65
Query: 79 AAKPLP------NQC-------QGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L + C +G LV + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDSKGRLVPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ + Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKIELDAGTFAEWKVDMLKLDGCFSTSKERAVGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAK 215
S C W G + P + +NI N WR DDI DSW +L+I D Y D A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTLIANICNLWRNFDDIQDSWRSVLSIVDWFVKYQDILQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + E ++W + APL + D+ ++ + + I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNYGLSFEESRAQMALWTVLAAPLFMSTDLRTISPQNIDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|126342936|ref|XP_001374517.1| PREDICTED: alpha-galactosidase A [Monodelphis domestica]
Length = 462
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/332 (32%), Positives = 158/332 (47%), Gaps = 54/332 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L NGL LTP MGW W F C ++ I + ++C E + S D+ +
Sbjct: 30 LDNGLALTPTMGWLHWERFTCNVD--CIEDPENCISEGLFMQMADIMVSEGWKDAGYEYV 87
Query: 78 ------IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------- 119
+A+K N G+L A+ FPSG + LADYVH + I +
Sbjct: 88 CIDDCWLASKRNKN---GSLQADPKRFPSGIRHLADYVHSRGLKLGIYQDVGTLTCAGYP 144
Query: 120 ------RIESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWG 172
I++ + G+D LK+D C D + Y++MS AL K GR I +S CEW
Sbjct: 145 GSFGYYDIDAQTFADWGVDLLKFDGCNFKDVESLVEGYKSMSSALNKTGRSILYS-CEWP 203
Query: 173 -DMRP--ALWGSNIR---NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
+RP + IR N WR DI DSW + TI A E A PGGWNDP
Sbjct: 204 LYLRPFQEPHYTEIRQYCNHWRNYGDIFDSWNSVKTILAWTASKQESLVPAAGPGGWNDP 263
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DML +GN G+ +++ V ++WAI APL + D+ +++ ++ ++ N++VIA+NQ
Sbjct: 264 DMLVIGNFGLSWDQQVTQMALWAIMAAPLFMSNDLRHISPQSKNLLQNKDVIAINQDPLG 323
Query: 279 -------AINDQQIWAGPLSGNRIVVLLESRK 303
D ++W PLS V + ++K
Sbjct: 324 KQGYQLRKDKDFEVWERPLSNLAWAVAVLNQK 355
>gi|47226188|emb|CAG08335.1| unnamed protein product [Tetraodon nigroviridis]
Length = 377
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/331 (32%), Positives = 144/331 (43%), Gaps = 43/331 (12%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGE----SSRDSQISLSCSFHMFDSL 75
L NGL LTP MGW W F C N R+ D+C E D + ++ +
Sbjct: 20 LDNGLALTPTMGWLHWERFMC--NTDCDRDPDNCISERLYMQMADVMVKEGWKEAGYEYV 77
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA-------- 126
P P + G L A+ FP G K LA+YVH S L I +++
Sbjct: 78 CIDDCWPSPRRNIFGRLQADPKRFPGGIKKLAEYVH---SRGLKLGIYADVGSLTCAGYP 134
Query: 127 -------------FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
E G+D LK+D C+ + T Y MS AL K GR I +S CEW
Sbjct: 135 GSLGYYDTDAQTFAEWGVDLLKFDGCYMNLTMLGEGYMNMSKALNKTGRSILYS-CEWPL 193
Query: 174 MRPALWGSN---IR---NSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWNDPD 223
L N IR N WR + D+SDSW+ + +I Y D A PGGWNDPD
Sbjct: 194 YEWPLKKPNYTAIRETCNHWRNSQDVSDSWSSIKSILAWTAAYQDTIVPSAGPGGWNDPD 253
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AIND 282
ML +GN G+ +++ ++WAI APLL+ D+ ++ + ++ N VIA+NQ +
Sbjct: 254 MLVIGNFGLSHDQQESQMALWAIMAAPLLMSNDLRDICPRSKKLLQNRHVIAINQDPLGK 313
Query: 283 QQIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
Q G V R T W
Sbjct: 314 QGFLTAKAGGAGAQVTPTKRPTLKPDRLCRW 344
>gi|332668960|ref|YP_004451968.1| Ricin B lectin [Cellulomonas fimi ATCC 484]
gi|332337998|gb|AEE44581.1| Ricin B lectin [Cellulomonas fimi ATCC 484]
Length = 561
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/402 (25%), Positives = 175/402 (43%), Gaps = 106/402 (26%)
Query: 28 PPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCS 68
PP+GWNSWN F+C I+E++IR+T DDCW + +RD+
Sbjct: 54 PPLGWNSWNTFYCNIDEQMIRQTADAMVSSGMAAAGYQYVVVDDCWMQDTRDAN------ 107
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------ 110
GNL A FPSG KAL DY+H
Sbjct: 108 --------------------GNLRARTDRFPSGMKALGDYIHSKGLKFGIYHAPREKTCN 147
Query: 111 -----ILISNSLDNRIESNIAFEQ-GIDYLKYDNC--FNDDTRPTIRYRAMSDALKKAGR 162
+ ++S + F G+DY+K+D C T ++ ALK GR
Sbjct: 148 QYFANLPGTSSYGREAQDATLFASWGVDYVKHDWCDPRGSATEQAAVFQRFGAALKATGR 207
Query: 163 PIFFSM---CEWGDMRPAL--WGSNIRNSWRTTDDISDSWAR-----------MLTIADM 206
PI +S+ + P WG+ + WRT++D+S SW+ + D+
Sbjct: 208 PIVYSINPNSAHSNTAPTYSGWGA-FSDMWRTSEDLSASWSTGCAPSADCFVGITEALDI 266
Query: 207 NEVYADHAKPGGWNDPDMLEVG-NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
E + +PG +NDPDML VG + E H S+WA+ APL+ G D+ +++A+
Sbjct: 267 VEPMREWTRPGQYNDPDMLMVGVRNALTATENRAHMSMWAMLGAPLIAGNDLRSMSADVR 326
Query: 266 PIIGNEEVIAVNQAI-----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWD 314
++ N +V+A++Q D ++WA PL+ + V L +R + + ++
Sbjct: 327 AVLTNRDVLAIDQDPLVRQADRVRDDGDAEVWAKPLADGSVAVALLNRGSGTRQISTTLG 386
Query: 315 DIGIKSNNTLFEHRTLET----KFQGKLTAKVDGHSCKMYEL 352
+G+ + +++R + T G+++A+V H ++ +
Sbjct: 387 QVGLGAGT--YQYREVWTGATGSTTGQISAQVAQHGVALFRV 426
>gi|114051916|ref|NP_001040191.1| alpha-N-acetylgalactosaminidase precursor [Bombyx mori]
gi|87248331|gb|ABD36218.1| alpha-N-acetylgalactosaminidase [Bombyx mori]
Length = 428
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 166/389 (42%), Gaps = 107/389 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L+NGL +TPPMGW +W C I+E +I+ T
Sbjct: 17 LNNGLAMTPPMGWLTWERLRCITDCEKYPNECISESLIKRTADLMVSEGYLDAGYEYLGI 76
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD + G LV ++ FP+G KA+ADY+H
Sbjct: 77 DDCWLEKTRDDK--------------------------GRLVPDRKRFPNGMKAIADYIH 110
Query: 111 ILISNSL---------------------DNRIESNIAFEQGIDYLKYDNCFNDDTRPTIR 149
S L + +I+ N E IDYLK D C+ D +
Sbjct: 111 ---SKGLKFGMYEDYGNLTCAGYPGVLGNEKIDINTFVEWEIDYLKLDGCYIDPIQMDKG 167
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPAL---------WGSNIR--NSWRTTDDISDSWA 198
Y L GR I +S C W PA + S N R DDI DSWA
Sbjct: 168 YPDFGKLLNATGRSILYS-CSW----PAYQEEKKILPNYASIAEHCNLGRNYDDIEDSWA 222
Query: 199 RMLTI----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+ I D + A HA PG WNDPDML +GN G+ ++ V ++WAI APLL+
Sbjct: 223 SLTKIMAWFGDNQDRLAQHAGPGHWNDPDMLLIGNFGLSVDQAKVQMAVWAILAAPLLIS 282
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ-AINDQQIWAGPL--SGNRIVVLLESRK----TFSS 307
D+ + E I+ N ++IAVNQ + Q + G SG+R + + R+ +F+
Sbjct: 283 TDLATIRPEFKQILLNRDIIAVNQDKLGKQGLRVGKKMGSGSRSKLEIWKRELYDGSFAM 342
Query: 308 MMTAHWDD-IGIKSNNTLFEHRTLETKFQ 335
++ DD I + T ++ + +++F+
Sbjct: 343 AFVSYRDDGIPYAAKFTYYDMQLPQSEFE 371
>gi|410056735|ref|XP_003954083.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase A [Pan
troglodytes]
Length = 429
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 151/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR D+ DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADVDDSWKSIKSILD 244
Query: 206 MNEVY----ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
Y D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFYQERIVDVAXPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ ++ ++W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQADNFEVWERPLSGLAWAVAMINRQ 357
>gi|429849781|gb|ELA25124.1| alpha-galactosidase precursor [Colletotrichum gloeosporioides Nara
gc5]
Length = 550
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 163/348 (46%), Gaps = 62/348 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGWNS+N+++C NE II+E + + + + + ++T P ++
Sbjct: 25 TPQMGWNSYNYYNCYPNETIIKE--------NAHAVVDTGLAEAGYSTVTTDCGWPAKDR 76
Query: 87 -CQGNLVANKTTFPSG-TKALADYVHIL-------------------ISNSLDNRIESNI 125
G LV N FPSG K L DY+H L SLD+
Sbjct: 77 DANGELVWNPALFPSGGAKELGDYLHNLGLKFGVYSGGGYFQCGSTDQPASLDHEFTDAK 136
Query: 126 AFEQ-GIDYLKYDNCFNDDTRPTI-------------RYRAMSDALKKAGRPIFFSMCEW 171
+F G D LKYDNC+ D PT+ R+ M++A+ R + + +C+W
Sbjct: 137 SFAAWGADILKYDNCYPID--PTVMVDYVSEEAISPDRFVTMAEAMNTTDRDMVYQVCQW 194
Query: 172 G-DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
G +W I NSWR ++DI +SW + I + + + PG + D DML VG
Sbjct: 195 GTGTDLGVWVPKIGNSWRISNDIYNSWRSIWRITNQVVPFYKYTGPGAFPDMDMLIVGLN 254
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCD-VGNLTAE-TMPIIGNEEVIAVNQAINDQQ---- 284
+ E H +W+I+K+PL LG + L E T+ I+ N+EVIA+NQ +Q
Sbjct: 255 ALSVEEERFHMGMWSINKSPLTLGAPAIPALVPEHTLSIVANKEVIAINQDPLAKQAQLV 314
Query: 285 ---------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI-GIKSNN 322
+WAG LSG+R+VV + + K S ++ D+ G+ S N
Sbjct: 315 RRFTEEEWDVWAGELSGDRLVVGIANWKNDSQAVSFDVADVLGMASAN 362
>gi|431900022|gb|ELK07957.1| Alpha-N-acetylgalactosaminidase, partial [Pteropus alecto]
Length = 394
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 171/398 (42%), Gaps = 68/398 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L NGL TPPMGW +W F C +N E++ E D ++D L ++
Sbjct: 1 LENGLLRTPPMGWLAWERFRCNVNCDEDPKNCISERLFMEMAD---RLAQDGWRDLGYTY 57
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----------------- 112
D I + G+LV + FP+G LADY H L
Sbjct: 58 LNIDD-CWIGGR----DDNGHLVPDAKRFPNGIAFLADYAHSLGLKLGIYEDVGNFTCMG 112
Query: 113 -ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+LD + F Q +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 113 YPGTTLDKVTQDAQTFAQWKVDMLKLDGCFSTPEERAKGYPMMAAALNATGRPIAFS-CS 171
Query: 171 W----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
W G + P + S + N WR DDI DSW +L+I D ++ A PG W
Sbjct: 172 WPAYEGGLPPKVNYSLLIDTCNLWRNYDDIQDSWRSVLSILDWFVDHQDILQPVAGPGHW 231
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I +NQ
Sbjct: 232 NDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTMSAQNMDILQNPLMIKINQD 291
Query: 279 --AINDQQI---------WAGPLSGNRIVVLLESRKT------FSSMMTAHWDDIGIKSN 321
I ++I + PL+G ++ SR+T +S+ ++ G+
Sbjct: 292 PLGIQGRRILKEKSYIEVYMRPLAGYSYALVFFSRRTDMPYHYRTSLTQLNFTGSGLHEA 351
Query: 322 NTLFEHRTLE-TKFQGKLTAKVDGHSCKMYELTPVTKL 358
++ + + + T ++ M+ L PV KL
Sbjct: 352 QNVYTGDVISGLRTETNFTVVINPSGVVMWYLYPVRKL 389
>gi|187607418|ref|NP_001120606.1| galactosidase, alpha precursor [Xenopus (Silurana) tropicalis]
gi|171847318|gb|AAI61735.1| LOC100145767 protein [Xenopus (Silurana) tropicalis]
Length = 408
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 159/347 (45%), Gaps = 54/347 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
L NGL LTPPMGW W F C+ N + R +C E D +S + L
Sbjct: 22 LQNGLALTPPMGWLHWERFLCETNCE--RNPKNCISEQLFMDMADKMVSEGWLDAGYQYL 79
Query: 76 TQIAAKPLPNQCQGN-LVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
P + + N L A+ FP G K LADYVH + I + ++
Sbjct: 80 CIDDCWLAPERDEKNRLQADPVRFPGGIKKLADYVHSRGLLLGIYEDVGSKTCEGFPGSQ 139
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDT-RPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ E G+D LK+D C+ + YR MS AL + GR I S CEW
Sbjct: 140 GYYDIDAQTFAEWGVDLLKFDGCYYGTLDKLEDGYRQMSMALNRTGRKIVLS-CEWPLYE 198
Query: 176 PALWGSNIR------NSWRTTDDISDSWARMLTIADMNEVYADH----AKPGGWNDPDML 225
+ N NSWR DI+DSW + + D++ + A PG WNDPDML
Sbjct: 199 RGIKKINYSEVAEYCNSWRNFGDITDSWGSVKAVMDLSSAVQEEIVPVAGPGSWNDPDML 258
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQ- 283
+GN G+ +N+ V ++W+I APL++ D+ ++ AE+ ++ ++ VI++NQ + Q
Sbjct: 259 VIGNFGLSWNQQVTQMALWSIMAAPLMMSNDLRDIPAESKALLQDKNVISINQDPLGAQG 318
Query: 284 ---------QIWAGPLSGNRIVVLLESR------KTFSSMMTAHWDD 315
++W PL + V + +R + F ++ W+D
Sbjct: 319 YRVFKADYFEVWERPLIKGSLAVAVTNRNEIGGPRNFLLSLSLLWED 365
>gi|340372681|ref|XP_003384872.1| PREDICTED: alpha-galactosidase-like [Amphimedon queenslandica]
Length = 412
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 114/405 (28%), Positives = 165/405 (40%), Gaps = 99/405 (24%)
Query: 21 SNGLGLTPPMGWNSWNHF------HCQINEKI----------IRET-------DDCWGES 57
+NG+GL PPMGWN+W +C E + ++E DDCWG+
Sbjct: 36 NNGVGLKPPMGWNTWCSLGRCGRDYCDAKELMAIADAMATNGMKEAGYEYINMDDCWGDH 95
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYV-------- 109
D +GN+V +K FP G + YV
Sbjct: 96 RDD---------------------------KGNIVPDKDRFPDGLVPVVKYVNSKGFKFG 128
Query: 110 -------HILISNSLDNRIESNIA-FEQ--------GIDYLKYDNCFND----DTRPTIR 149
+ S +I + +EQ GI+Y+K D C + P +
Sbjct: 129 LYTDAGLYTCSSGGRKYKIPGSYGHYEQDANTYASWGIEYVKMDWCNTKINGTELDPHKQ 188
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEV 209
Y+ MSDAL K G+PIFF+ CEWG P W NSWRT D D W I ++N
Sbjct: 189 YQEMSDALNKTGKPIFFNSCEWGVDNPWEWMHQYANSWRTGPDHHDDWKTTSKIIEVNAD 248
Query: 210 YADHAKPG-GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
D+A G GWNDPD L GM EY F++W + APLL+ DV N+T+ ++
Sbjct: 249 LGDYAGTGKGWNDPDFLMTHCPGMTDTEYRTEFTLWCLMSAPLLVVTDVRNMTSIMKEVL 308
Query: 269 GNEEVIAVNQ----------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
N+++I +NQ N QIWA ++ + L + + S +T
Sbjct: 309 LNKDLIEINQDTTGPGGKRIGFDKTCGENACQIWAKNITNGEKAIALYNADSVSHNITLD 368
Query: 313 WDDIGIK--SNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
+ G K S L+++ L T +V+ H + Y V
Sbjct: 369 FSLFGWKVVSMQNLWKNENLNA--TNSYTVEVESHGVQAYRAASV 411
>gi|393214676|gb|EJD00169.1| glycoside hydrolase [Fomitiporia mediterranea MF3/22]
Length = 432
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 149/351 (42%), Gaps = 95/351 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRD 60
L NG+ P MG+N+W + C I++ +I T DDCW E +R
Sbjct: 22 LDNGVARLPSMGYNTWYAYQCNIDQDLIITTAKLMQSLGLQDAGYEFINLDDCWAEKNRS 81
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------- 112
G+LV +KT F SG L D VH +
Sbjct: 82 ES--------------------------GDLVPDKTRFSSGFNWLTDQVHSMGFKSGIYS 115
Query: 113 ---------ISNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDAL 157
S +N F+ G ++LKYDNC ++D R +++ M+DA+
Sbjct: 116 DSGWFTCAGYPGSFENEARDAKTFQDWGFNFLKYDNCAIPYDDILRQNTMGKFQRMADAI 175
Query: 158 ----KKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
K +G+P + F++CEWG + LWG N+ SWRT +DI +W + + N
Sbjct: 176 ADLAKSSGKPPLVFNLCEWGWSQVWLWGKNLGQSWRTVNDIGPTWPLLTNTINFNSFITM 235
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
G ND D L++GNGG+ ++E HF+ WA+ K+PL + ++ L + I+ N E
Sbjct: 236 ATDFNGHNDLDYLQIGNGGLTFDEAKSHFTAWALMKSPLFISTNLSALDLSLLEIMKNRE 295
Query: 273 VIAVNQ----------------------AINDQQIWAGPLSGNRIVVLLES 301
+IA+NQ N W+GP IV+LL +
Sbjct: 296 LIAINQDPVVGMSISPFRWGVNPDWTNDPNNPAGYWSGPTQNGTIVMLLNT 346
>gi|11275340|dbj|BAB18272.1| alpha-galactosidase [Lachancea kluyveri]
Length = 416
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 93/280 (33%), Positives = 131/280 (46%), Gaps = 67/280 (23%)
Query: 89 GNLVANKTTFPSGTKALADYVH---ILI--------------SNSLDNRIESNIAFEQ-G 130
G LV + FP G K + D +H +L S SL + AF G
Sbjct: 44 GYLVPDSKKFPRGMKHVGDSLHDEGLLFGMYSSAGEYTCAGYSGSLGHEEADAAAFASWG 103
Query: 131 IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRN 185
+DYLKYDNC+N T RY+AMSDAL K G+ I++++C WG+ P W S I N
Sbjct: 104 VDYLKYDNCYNQGNYGTAQLSYERYKAMSDALNKTGKNIYYTLCNWGEDSPWHWASAIAN 163
Query: 186 SWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKPGGWNDPD 223
SWR + D+ + R ++ I + A PGGWND D
Sbjct: 164 SWRISGDVYAHFDRPDSRCPCDGDEYYCHLAGFDCSIMNILNKAAPLGQKAGPGGWNDLD 223
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
LEVG G M E HFS+WA K+PL++G ++ +L A++ I N +IAVNQ
Sbjct: 224 SLEVGVGNMTDEEEKTHFSMWAAVKSPLVIGANLNSLDAKSFSIYVNPAIIAVNQDPAGV 283
Query: 279 ----------------AINDQQIWAGPL-SGNRIVVLLES 301
+ Q+W+GPL +G++IV L+ +
Sbjct: 284 PATRVWRYYVSDTDEYGQGEIQLWSGPLDNGDQIVALVNA 323
>gi|443898540|dbj|GAC75875.1| alpha-d-galactosidase [Pseudozyma antarctica T-34]
Length = 386
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 135/267 (50%), Gaps = 59/267 (22%)
Query: 89 GNLVANKTTFPSGTKALADYVH-------ILISNS-LDNRIESNIAFEQ---------GI 131
G AN T FP+G +A + H I ++N +D+ + +E+ G+
Sbjct: 61 GRPQANVTKFPNGIPPVAQHAHNVGLQFGIYMTNGYVDSADPGSWGYEEIDAQEFADWGV 120
Query: 132 DYLKYDNC--FNDDTRP-TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
DYLKYDNC F T P R+R M +AL++ GR I FS+C+WGD P W I +S+R
Sbjct: 121 DYLKYDNCGTFEAGTHPPQSRFRVMGNALRETGRHILFSLCQWGDQFPWFWADQIAHSYR 180
Query: 189 TTDDISDSWAR----------------------MLTIADMNEV-YADHAKPGGWNDPDML 225
+ DI+ +++ M I M E+ Y H PG + D DML
Sbjct: 181 ISGDITATFSDTGKDCACKTAYCLNTGYAGCSVMTIIRKMREISYFQH--PGSFADMDML 238
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
E+G M +E HFS WA K+PL++G D+ ++ E++ ++ NE++IAVNQ
Sbjct: 239 EIGIANMTLDEERTHFSFWAALKSPLIIGADLRTISNESLSVLRNEKMIAVNQDKLGAAV 298
Query: 279 ----AINDQ---QIWAGPLSGNRIVVL 298
I+ + Q WAG LSGNR +VL
Sbjct: 299 EYLADISKERAFQTWAGRLSGNRTIVL 325
>gi|357044104|ref|ZP_09105788.1| hypothetical protein HMPREF9138_02260 [Prevotella histicola F0411]
gi|355367654|gb|EHG15082.1| hypothetical protein HMPREF9138_02260 [Prevotella histicola F0411]
Length = 409
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 138/298 (46%), Gaps = 49/298 (16%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L TPPMG+ +WN + INE++IR+ D + + D + + D Q
Sbjct: 31 DSLAPTPPMGFMTWNKYEENINEQLIRQIAD---KMATDGYAAAGYKYIFIDDAWQGG-- 85
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILI--------------SNSLDNRIESN 124
+ N++ + FPSG KALADYVH +L+ + S D +
Sbjct: 86 ---RDKRNNIIPDPKKFPSGMKALADYVHSRGLLLGIYSDAAQLTCAGYTGSYDFEEQDA 142
Query: 125 IAF-EQGIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
F E GIDYLKYD C +D RY+ M DAL+ +GR + +CEWG + P W
Sbjct: 143 KTFAEWGIDYLKYDYCHAPSDSAVAHKRYKEMGDALQNSGRKMALGICEWGQLNPEKWAR 202
Query: 182 NIRNS-WRTTDDISDSWARM--------LTIADMNEVYADHAKPGGWNDPDML------- 225
S WR + D+ D W + L I ++ E +A PG W D DML
Sbjct: 203 QAGGSLWRISYDVRDMWKDIVKQGGMGILDIINIAEPLYTYAGPGHWLDMDMLIVGLDGK 262
Query: 226 -----EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
++G G Y EY S+W + +PL + D+ N T I+ N+E+IA+NQ
Sbjct: 263 GGPSSDLGGIGCSYTEYQTQMSMWCMFASPLAMSHDILNENEATRRILLNKEIIAINQ 320
>gi|302886970|ref|XP_003042374.1| hypothetical protein NECHADRAFT_102022 [Nectria haematococca mpVI
77-13-4]
gi|256723284|gb|EEU36661.1| hypothetical protein NECHADRAFT_102022 [Nectria haematococca mpVI
77-13-4]
Length = 543
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 173/380 (45%), Gaps = 58/380 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWN++N ++C E+I++ ++ + L + + ++ + + N+
Sbjct: 24 TPPMGWNTYNRYNCNPTEEIVK--------ANAKGLVDLGFA-ELGYTIVAVDCGWMTNE 74
Query: 87 CQGN--LVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIESN 124
N L N FPSG KAL +Y+H L + SL I++
Sbjct: 75 RDENNRLQWNPKIFPSGPKALGEYIHDLGLEFGLYSGAGYYQCGSTDLPGSLGFEEIDAQ 134
Query: 125 IAFEQGIDYLKYDNCFN-------DDTRPTI----RYRAMSDALKKAGRPIFFSMCEWG- 172
E G D LKYDNC++ D T P R+ M AL R I +C+WG
Sbjct: 135 SFVEWGGDALKYDNCYSTSKTVMVDSTSPEAQSPDRFIKMGAALNDTDRDIKLFLCQWGI 194
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
W + + N+WR ++DI +W + I + +A H +PG + D DMLE+G G +
Sbjct: 195 GENVPDWAAPLGNTWRMSNDIFGAWRAIWRITNQVVGHAKHTRPGAFADMDMLEIGLGFL 254
Query: 233 KYNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQ-----------A 279
+ E HF W++ K+PLL+G D + +E++ I+ N+EVIA+NQ
Sbjct: 255 SFEEERFHFGFWSMMKSPLLIGGVLDKKEMPSESIEIMSNKEVIAINQDPLGKAAELVIR 314
Query: 280 INDQQ--IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK 337
++Q +WAG LS NR V+ + + K S + IG+ +T
Sbjct: 315 YTEEQWDVWAGDLSSNRKVLAVANWKNESQTVDVDLSLIGVGKAKARDVWAHADTSISDI 374
Query: 338 LTAKVDGHSCKMYELTPVTK 357
T ++ H ++ L+ + +
Sbjct: 375 ETFELKAHELRLLVLSDIEE 394
>gi|449546879|gb|EMD37848.1| glycoside hydrolase family 27 protein [Ceriporiopsis subvermispora
B]
Length = 489
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 89/288 (30%), Positives = 137/288 (47%), Gaps = 41/288 (14%)
Query: 25 GLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
G TP +GWNSWN + C+I+E K+I + +D+ F D + A+
Sbjct: 108 GKTPALGWNSWNAYGCEISEEKVIAAANQFISLGLKDAGYE----FVNIDDCWSLQAR-- 161
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLD-NRIESNI 125
N ++ + + FP+G ++A VH L SL ++++
Sbjct: 162 -NSTTQQIIPDSSKFPNGISSVASQVHALGLKLGIYGDAGTETCSGFPGSLGYENLDAST 220
Query: 126 AFEQGIDYLKYDNC-----FNDDTRP----------TIRYRAMSDALKKAGRPIFFSMCE 170
+ G+DYLKYDNC ++D P IRYR M AL P+ F++C
Sbjct: 221 FADWGVDYLKYDNCNVPGNWSDSGTPQGNDWYNSNSAIRYRQMGAALAVQSNPMQFNLCI 280
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
WG+ WG+ + +SWR D S SW+ + +I N Y + ND DM+E+GNG
Sbjct: 281 WGNANVWQWGARVGHSWRMGSDSSASWSYITSILTQNVQYLSYIDFWAHNDMDMMEIGNG 340
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+ E HF+ W K+P+LLG D+ L+ E + II N E++A +Q
Sbjct: 341 DLTIEEQRTHFAAWCFLKSPILLGTDLSLLSTEQIAIITNTELLAFHQ 388
>gi|395546395|ref|XP_003775073.1| PREDICTED: alpha-galactosidase A [Sarcophilus harrisii]
Length = 441
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 146/344 (42%), Gaps = 102/344 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC------------------QINEKIIRE-----------T 50
L NGL LTP MGW W F C Q+ E ++ E
Sbjct: 43 LDNGLALTPTMGWLHWERFTCNVDCMEDPENCISEQLFMQMAEVMVSEGWKDVGYEYVCI 102
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW S RD G L A+ FP G + LA+YVH
Sbjct: 103 DDCWLASQRDKD--------------------------GRLQADPKRFPRGIRHLANYVH 136
Query: 111 ILISNSLDNRIESNIA---------------------FEQGIDYLKYDNCFNDDTRPTIR 149
S L I ++ + G+D LK+D C+ D + +
Sbjct: 137 ---SKGLKLGIYQDVGTLTCAGYPGSFGYYDIDAETFADWGVDLLKFDGCYAKDVKSLVE 193
Query: 150 -YRAMSDALKKAGRPIFFSMCEWG-DMRPALWG--SNIR---NSWRTTDDISDSWARMLT 202
Y+ MS AL K GR I +S CEW MRP + IR N WR DI DSW+ +
Sbjct: 194 GYKYMSYALNKTGRSILYS-CEWPLYMRPHQQPNYTEIRQYCNHWRNYGDIFDSWSSVKN 252
Query: 203 I----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I A + A PGGWNDPDML +GN G+ +++ + ++WAI APL + D+
Sbjct: 253 ILAWTASHQKSLVPAAGPGGWNDPDMLVIGNFGLSWDQQITQMALWAIMAAPLFMSNDLR 312
Query: 259 NLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
N++ ++ ++ N++VIA+NQ D ++W PLS
Sbjct: 313 NISPQSKALLQNKDVIAINQDPLGKQGYQLRKDEDFEVWQRPLS 356
>gi|451996249|gb|EMD88716.1| carbohydrate-binding module family 35 protein [Cochliobolus
heterostrophus C5]
Length = 545
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 178/385 (46%), Gaps = 71/385 (18%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+N ++C +E+ I E+ D +S + +++ C ++ D
Sbjct: 26 TPPMGWNSYNTWNCLPDEEKIHESARGLVDLGFQSVGYNFVTVDCGWNSKD--------- 76
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLD--------------NRIESNIAFE 128
G L NKT FPSG KAL D++H L LD + +++ FE
Sbjct: 77 --RDYNGKLQWNKTLFPSGGKALGDFLHDL---GLDFGLYSGAGYLQCGSEALPASLGFE 131
Query: 129 Q---------GIDYLKYDNCFND-----------DTRPTIRYRAMSDALKKAGRPIFFSM 168
Q G D LKYDNC++ + + R++ M+ L R I + +
Sbjct: 132 QLDAESFAEWGGDSLKYDNCYSTSNTTMVDSSSAEAQSPARFQHMAAELDAVNRDIQYYV 191
Query: 169 CEWG-DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
C+WG W ++I +WR ++DI ++W + I + Y H G + D DML V
Sbjct: 192 CQWGIGTNVGDWAADIGTTWRMSNDIYNAWRSIWRITNQIVPYFRHTTTGAFADMDMLIV 251
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ------- 278
G + E HF +WAI+K+PL++G D L ++ I+ N+EVIA+NQ
Sbjct: 252 GLKALSEEEERFHFGMWAINKSPLIMGAALDPKRLGQSSIDIMTNKEVIAINQDPLVKPA 311
Query: 279 ------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTL 330
++ +W G LSG++ V+ + + + S + + +GI S N ++ + L
Sbjct: 312 KLIQRNTESEWDVWLGELSGSKQVLGVANWRNDSQTVELNLASLGIASANARDVWAAKDL 371
Query: 331 ETKFQGKLTAKVDGHSCKMYELTPV 355
G T + GH +++ L+ V
Sbjct: 372 -GAVSGSQTLDLAGHELRLWVLSDV 395
>gi|126339069|ref|XP_001371622.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Monodelphis
domestica]
Length = 412
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 152/340 (44%), Gaps = 66/340 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL TPPMGW +W + C IN + D+C E + D L +
Sbjct: 17 LENGLMRTPPMGWLAWERYRCSINCH--EDPDNCINERLFMAMADRLAEDGWRELGYVYV 74
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------------- 110
D L G LV + FPSG +ALADYVH
Sbjct: 75 NIDDCWMA----LQRDEGGVLVPDPERFPSGIRALADYVHSRGLKLGIYGDMGSMTCMGY 130
Query: 111 ---ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
L + +LD + ++ G+D LK D C++ + Y AM+ AL GRPI +S
Sbjct: 131 PGTTLATVALDAQTFAS----WGVDMLKLDGCYSSSLERALGYPAMAKALNNTGRPIGYS 186
Query: 168 MCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKP 216
C W G + P + ++I N WR DDI DSW +L+I D +V A P
Sbjct: 187 -CSWPAYDGGLPPEVNYTLLADICNLWRNFDDIEDSWQSVLSIVDWFSEHQDVLQPMAGP 245
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WNDPDML +GN G+ + + ++W + APL + D+ ++ + + I+ N+ +I +
Sbjct: 246 GHWNDPDMLLIGNYGLSFEQAKAQMALWTVLAAPLFMSTDLRAISPQNVDILQNKLMIRI 305
Query: 277 NQ------------AINDQQIWAGPLSGNRIVVLLESRKT 304
NQ + +++ PLS + V+ SR+T
Sbjct: 306 NQDPLGIQGRRVLKTKHHIEVFVRPLSKDASAVVFFSRRT 345
>gi|397478237|ref|XP_003810459.1| PREDICTED: alpha-galactosidase A [Pan paniscus]
Length = 429
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 151/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ ++ ++W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQADNFEVWERPLSGLAWAVAMINRQ 357
>gi|62896813|dbj|BAD96347.1| galactosidase, alpha variant [Homo sapiens]
Length = 429
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 150/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGVRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 357
>gi|414865929|tpg|DAA44486.1| TPA: hypothetical protein ZEAMMB73_777822 [Zea mays]
Length = 224
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 69/129 (53%), Positives = 81/129 (62%), Gaps = 20/129 (15%)
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWN----------- 220
G PALW + NSWRTTDDI+D+W M IAD N +A +A P GWN
Sbjct: 96 GQYDPALWAGKVGNSWRTTDDITDTWKSMTDIADKNNKWASYAGPSGWNVLCGAVQYTIT 155
Query: 221 ---------DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
DPDML+VGNGGM EY HFSIWA+ KAPLL+GCDV N+T+ETM I+ N+
Sbjct: 156 LLLNLCFYADPDMLKVGNGGMTLAEYRSHFSIWALMKAPLLIGCDVRNMTSETMEILSNK 215
Query: 272 EVIAVNQAI 280
EVI VNQ I
Sbjct: 216 EVIQVNQVI 224
>gi|355690247|gb|AER99093.1| galactosidase, alpha [Mustela putorius furo]
Length = 419
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 149/341 (43%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL +TP MGW W F C I+EK+ +
Sbjct: 32 LDNGLAMTPTMGWLHWERFMCNTDCQGEPDSCISEKLFMQMAELMESDGWKDVGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FPSG + LADYVH
Sbjct: 92 DDCWMAPKRDSK--------------------------GRLQADPKRFPSGIRGLADYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D + Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGHYDIDAKTFADWGVDLLKFDGCYCDSVEHLEKGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTI-- 203
MS AL + GR I +S CEW N IR N WR +D+ DSW + +I
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFRKPNYTEIRQYCNHWRNFEDVYDSWQSVKSILA 244
Query: 204 --ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ + D A PGGWNDPDML +GN G+ +++ V ++WAI APLL+ D+ +++
Sbjct: 245 WTSSNQKRIVDVAGPGGWNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS 291
+ ++ N++VIA+NQ +Q +W PLS
Sbjct: 305 PQAKALLQNKDVIAINQDPLGKQGYRLRKEDNIEVWERPLS 345
>gi|328704370|ref|XP_001944081.2| PREDICTED: alpha-N-acetylgalactosaminidase-like [Acyrthosiphon
pisum]
Length = 433
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 138/297 (46%), Gaps = 45/297 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
L NGL LTPPMGW +W + C + ++ DDC E + D +S S ++ +
Sbjct: 24 LDNGLALTPPMGWLAWQRYRCITDCEMY--PDDCVNEQLFMKAADLLVSEGYSELGYNYV 81
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH----------------------IL 112
N+ G L A+K FPSG KAL+DYVH +L
Sbjct: 82 IVDDCWLAKNRSADGKLQADKIRFPSGIKALSDYVHSKGLKFGIYEDWGTKTCAGYPGVL 141
Query: 113 ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
LD + + E +DY+K D C+++ Y L GRP+ +S C W
Sbjct: 142 GHEELDAKTFA----EWEVDYVKLDGCYSNVRHMDKGYPEFGRHLNSTGRPMVYS-CSWP 196
Query: 173 DMRPAL-----WGSNIR--NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
+ + S + N WR DDI DSW M+ IAD E +A +A PG WND
Sbjct: 197 AYQEEKGMLIDYASMAKHCNLWRNYDDIDDSWESMIKIADYFAQKQEFWAKYAGPGHWND 256
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
PDML +GN G+ Y++ +IWAI APLL+ + + I+ N++VI VNQ
Sbjct: 257 PDMLLIGNFGLTYDQSKTQMAIWAILAAPLLMSNKLAEVQPHFKDILQNKKVIEVNQ 313
>gi|395325342|gb|EJF57766.1| glycoside hydrolase [Dichomitus squalens LYAD-421 SS1]
Length = 421
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 132/310 (42%), Gaps = 88/310 (28%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDS 61
S+ +G P +GWNSWN C INE I + DDCW +RD+
Sbjct: 51 SHQVGRLPALGWNSWNGHACNINETIFLDAANQFIFLGLADAGYEYINIDDCWQSETRDA 110
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--------- 112
G +V + FPSG +AD VH L
Sbjct: 111 S-------------------------TGRIVPEPSKFPSGINGIADQVHALGLKLGIYSD 145
Query: 113 --------ISNSLDNR-IESNIAFEQGIDYLKYDNC-----FNDDTRP----------TI 148
+ SL N +++ E G+DY NC + D P I
Sbjct: 146 AGAKTCDGLPASLGNETLDAETFIEWGVDY----NCNIPSNWTDAATPPDDDWYNSNSAI 201
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
RYR M+ AL + G+PI FS+C WG WG+ + +SWR T+DIS I N
Sbjct: 202 RYRQMTAALNQTGKPIHFSLCIWGTANVWEWGARVGHSWRMTNDIS-------AILAANA 254
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+ D ND DM+E+GNG + E HF+ WA K+P+LLG D+ NL + + II
Sbjct: 255 QHLDSVDFFSHNDMDMIEIGNGDLTLQEQRTHFAAWAFMKSPILLGTDLNNLNSTQLDII 314
Query: 269 GNEEVIAVNQ 278
N E++A +Q
Sbjct: 315 KNAELLAFHQ 324
>gi|431895712|gb|ELK05133.1| Alpha-galactosidase A [Pteropus alecto]
Length = 427
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 146/324 (45%), Gaps = 62/324 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L+NGL +TP MGW W F C I+ K E D C E D + +
Sbjct: 37 LNNGLAMTPTMGWLHWERFMCNIDCK--EEPDSCISEKLFMQMADLMDSDGWKKVGYEYL 94
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ- 129
D A + +G L A+ FPSG + LA+YVH S L I +++ +
Sbjct: 95 CIDDCWMAAQR----DSKGRLQADPIRFPSGIRHLANYVH---SKGLKLGIYADVGKKTC 147
Query: 130 --------------------GIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSM 168
G+D LK+D C+ D Y+ MS AL K GR I +S
Sbjct: 148 AGYPGSFGYYDIDAKTFASWGVDLLKFDGCYCDSVEQLADGYKHMSLALNKTGRSIVYS- 206
Query: 169 CEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
CEW N IR N WR + DI DSW + +I D + A PGG
Sbjct: 207 CEWPLYMWPFRKPNYTEIREYCNHWRNSGDIYDSWQSVKSILDWTSSNQKTIVSVAGPGG 266
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ +++ + ++WAI APLL+ D+ +++ + ++ N++VIA+NQ
Sbjct: 267 WNDPDMLVIGNFGLSWDQQITQMALWAIMAAPLLMSNDLRHISPQAKALLQNKDVIAINQ 326
Query: 279 AINDQQ-----------IWAGPLS 291
+Q +W PLS
Sbjct: 327 DPLGKQGYLLRKEDNIEVWERPLS 350
>gi|4504009|ref|NP_000160.1| alpha-galactosidase A precursor [Homo sapiens]
gi|113499|sp|P06280.1|AGAL_HUMAN RecName: Full=Alpha-galactosidase A; AltName:
Full=Alpha-D-galactosidase A; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; AltName: INN=Agalsidase; Flags:
Precursor
gi|31756|emb|CAA32617.1| alpha-D-galactosidase A [Homo sapiens]
gi|757912|emb|CAA29232.1| alpha-galactosidase [Homo sapiens]
gi|1684916|gb|AAB64203.1| alpha-D-galactosidase A [Homo sapiens]
gi|12803707|gb|AAH02689.1| Galactosidase, alpha [Homo sapiens]
gi|30582567|gb|AAP35510.1| galactosidase, alpha [Homo sapiens]
gi|60655717|gb|AAX32422.1| galactosidase alpha [synthetic construct]
gi|60655719|gb|AAX32423.1| galactosidase alpha [synthetic construct]
gi|119623267|gb|EAX02862.1| galactosidase, alpha, isoform CRA_a [Homo sapiens]
gi|123979636|gb|ABM81647.1| galactosidase, alpha [synthetic construct]
gi|123994447|gb|ABM84825.1| galactosidase, alpha [synthetic construct]
gi|158255626|dbj|BAF83784.1| unnamed protein product [Homo sapiens]
gi|226967|prf||1612342A alpha galactosidase
Length = 429
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 150/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 357
>gi|30584509|gb|AAP36507.1| Homo sapiens galactosidase, alpha [synthetic construct]
gi|60652625|gb|AAX29007.1| galactosidase alpha [synthetic construct]
Length = 430
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 150/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 357
>gi|327312731|ref|YP_004328168.1| alpha-galactosidase [Prevotella denticola F0289]
gi|326944797|gb|AEA20682.1| alpha-galactosidase [Prevotella denticola F0289]
Length = 409
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 148/328 (45%), Gaps = 60/328 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + I+E++IR D +S + + +
Sbjct: 31 DSLELTPPMGFMTWNKYKDDISEQLIRRI--------ADRMVSAGYAEAGYKYIFIDDGW 82
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRIES 123
+ N++ + FPSG KALADYVH +L+ D + ++
Sbjct: 83 QGGRDKRNNIIPDPVKFPSGMKALADYVHSKGLLLGIYSDAARLTCAGYTGSYGFEKQDA 142
Query: 124 NIAFEQGIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
E GIDYLKYD C +D RYR M DAL+ +GR I +CEWG + P W
Sbjct: 143 KTFAEWGIDYLKYDYCHAPSDSAVAHHRYRKMGDALQDSGRKIALGVCEWGQLNPERWAR 202
Query: 182 NIRNS-WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDML------- 225
S WR + D+ D W +L I D+ E +A PG W D DML
Sbjct: 203 QAGGSLWRVSYDVRDMWKDIVGQGGMGILDIIDITEPLYKYAGPGHWLDMDMLIVGLDGK 262
Query: 226 -----EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-A 279
++G G Y EY S+W + +PL + D+ N T I+ N+E+IA+NQ A
Sbjct: 263 GGPSSDLGGVGCTYTEYQTQMSMWCMFASPLAMSHDLLNENEATRRILLNKEMIAINQDA 322
Query: 280 INDQ----------QIWAGPLSGNRIVV 297
+ + +++ LSGNR+ +
Sbjct: 323 LGEAARRVDFPGVCRVYLRRLSGNRLAL 350
>gi|426257855|ref|XP_004022537.1| PREDICTED: alpha-galactosidase A [Ovis aries]
Length = 439
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/317 (30%), Positives = 141/317 (44%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRET------------------- 50
L NGL +TP MGW W F C +N EK+ ++
Sbjct: 37 LDNGLAMTPTMGWLHWERFMCNVNCQEEPDSCISEKLFQQIAEIMASDGWKDVGYEYICI 96
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LADYVH
Sbjct: 97 DDCWMAPERDSE--------------------------GRLQADPKRFPGGIRHLADYVH 130
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D + Y+
Sbjct: 131 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSIQHLAEGYK 190
Query: 152 AMSDALKKAGRPIFFSMCEWG-DMRPALWG--SNIR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW MRP + IR N WR DI DSW + D
Sbjct: 191 QMSLALNRTGRSIVYS-CEWPLYMRPIFKPNYTEIREYCNHWRNFPDIYDSWESIRNTLD 249
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
++ A PGGWNDPDML +GN G+ ++ + ++WAI APLL+ D+ +++
Sbjct: 250 WTSSNQKIIVPVAGPGGWNDPDMLVIGNFGLSRDQQITQMALWAIMAAPLLMSNDLRSIS 309
Query: 262 AETMPIIGNEEVIAVNQ 278
E ++ +++VIA+NQ
Sbjct: 310 PEAKALLQDKDVIAINQ 326
>gi|194226911|ref|XP_001502759.2| PREDICTED: alpha-N-acetylgalactosaminidase [Equus caballus]
Length = 411
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 116/406 (28%), Positives = 176/406 (43%), Gaps = 74/406 (18%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQIS 64
D LL NGL PPMGW +W F C I+E++ E D ++D
Sbjct: 13 DHVLLLENGLLQKPPMGWLAWERFRCNTDCDEDPKNCISERLFMEMAD---RLAQDGWRD 69
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ 112
L ++ D I + +G LV + FP+G LADY H L
Sbjct: 70 LGYTYLNIDD-CWIGGR----DAKGRLVPDPKRFPNGIDFLADYAHSLGLKLGIYSDLGN 124
Query: 113 ------ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIF 165
+LD ++ F E +D LK D CF+ Y M+ AL GRPI
Sbjct: 125 FTCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAKGYPKMAAALNATGRPIA 184
Query: 166 FSMCEW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHA 214
FS C W G + P + S +I N WR DDI DSW +L+I + ++ A
Sbjct: 185 FS-CSWPAYEGGLPPQVNYSLLADICNLWRNYDDIQDSWRSVLSILNWFVEHQDILQPVA 243
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDML VGN G+ + + ++W + APL + D+ ++A+ M I+ N +I
Sbjct: 244 GPGHWNDPDMLLVGNYGLSFEQAQSQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMI 303
Query: 275 AVNQ---------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN 322
+NQ + D+ +++ PL+ ++ SR+T M H+ + N
Sbjct: 304 KINQDPLGIQGRRILMDKSYIEVFMRPLANEASALVFFSRRT---DMPYHYHSSLAQLNF 360
Query: 323 T---LFEHRTLET-------KFQGKLTAKVDGHSCKMYELTPVTKL 358
T ++E + + T + + T ++ M+ L PV KL
Sbjct: 361 TGSNVYEAQNVYTGDIISGLRAETNFTVVINPSGVVMWYLYPVRKL 406
>gi|393770708|ref|ZP_10359186.1| alpha-galactosidase [Novosphingobium sp. Rr 2-17]
gi|392723815|gb|EIZ81202.1| alpha-galactosidase [Novosphingobium sp. Rr 2-17]
Length = 441
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 104/211 (49%), Gaps = 28/211 (13%)
Query: 6 SSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQIS 64
S+AA G+ L NGL TPPMGWNSWN F CQ++EK+IRE D S RD+
Sbjct: 9 SAAALVAGTTAVGALDNGLARTPPMGWNSWNKFGCQVSEKLIREVADAIVASGMRDA--- 65
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ 112
+ + D Q L G++ + FPSG KALADYVH
Sbjct: 66 -GYRYVVIDDCWQ-----LGRNADGSIQVDAQHFPSGIKALADYVHSRGLKFGLYSAAGP 119
Query: 113 -----ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
S SL + + G+DYLKYD C + Y+ M DAL GRPI F
Sbjct: 120 RTCQGRSGSLGHEARDAQTYASWGVDYLKYDWCTFEKMDAPAAYQKMRDALSATGRPILF 179
Query: 167 SMCEWGDMRPALWGSNIRNSWRTTDDISDSW 197
S+CEWG PA W + N WRTT DI+D++
Sbjct: 180 SICEWGQSEPAKWARPVGNMWRTTGDITDAF 210
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 66/114 (57%), Gaps = 11/114 (9%)
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
+L I D HA PGGWNDPDMLEVGNGGM +EY HF++WAI +PL+ G D
Sbjct: 271 VLQIIDRQAGLGRHAGPGGWNDPDMLEVGNGGMTADEYRAHFALWAIMGSPLIAGHDPRA 330
Query: 260 LTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESR 302
+TAET I+ N E+IAV+Q D IW PL+G + V L +R
Sbjct: 331 MTAETRAILLNRELIAVSQDSLGKAGDRLKREGDADIWGRPLAGGAMAVALLNR 384
>gi|46139309|ref|XP_391345.1| hypothetical protein FG11169.1 [Gibberella zeae PH-1]
Length = 544
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 173/385 (44%), Gaps = 67/385 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+NH++C +E+II+ D S +++ C + D
Sbjct: 25 TPPMGWNSYNHYNCNPSEEIIKINAKGLVDLGFLDLGYSIVTVDCGWPSRD--------- 75
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIE 122
+G L N+T FPSG KAL DY+H L I SL I+
Sbjct: 76 --RDSEGRLQWNETLFPSGPKALGDYIHDLGLEFGLYSGAGYLQCGSQDIPASLGYEEID 133
Query: 123 SNIAFEQGIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+ E G D LKYDNC+ + + + R+ M+ A+ + R I + +C+W
Sbjct: 134 AKSFAEWGGDTLKYDNCYATSKTDMVDSDSAEAKSPDRFIKMAAAINETDRDIKYFLCQW 193
Query: 172 GDMRPA-LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML-EVGN 229
G W + + NSWR ++DI ++W + I + ++ + PG + D DML +G
Sbjct: 194 GIGEDVPQWATKVGNSWRMSNDIFNAWRAIWRITNQAVAHSKYNHPGAFADLDMLISIGL 253
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLG--CDVGNLTAETMPIIGNEEVIAVNQAINDQ---- 283
G + Y E HF W++ K+PL++G D + + ++ ++ N+E IA+NQ Q
Sbjct: 254 GALSYEEERFHFGFWSMMKSPLIIGGVMDAKQIPSHSLEVMSNKEAIAINQDALGQAAEL 313
Query: 284 ---------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLET 332
+W+G L+ NR V+ + + K + + IG+ ++ H L
Sbjct: 314 VIRYTEEQWDVWSGNLTSNRKVLGVANWKNETQTVEVDLSLIGVAKAKARDVWAHEDLTI 373
Query: 333 KFQGKLTAKVDGHSCKMYELTPVTK 357
K K H ++ L+ +T+
Sbjct: 374 SGTQKFELKP--HELRLLVLSDITQ 396
>gi|330947689|ref|XP_003306936.1| hypothetical protein PTT_20251 [Pyrenophora teres f. teres 0-1]
gi|311315272|gb|EFQ84969.1| hypothetical protein PTT_20251 [Pyrenophora teres f. teres 0-1]
Length = 544
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 178/380 (46%), Gaps = 59/380 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNS+N ++CQ E+ I++ S + I L + ++ +T L +
Sbjct: 24 TPPMGWNSYNTWNCQPTEEKIKK--------SANGLIELGFNKVGYNFVTIDCGWNLRKR 75
Query: 87 -CQGNLVANKTTFPSGTKALADYVHIL-ISNSL----------DNRIESNIAFEQ----- 129
G L N T FPSG KAL D++H L + L D+ + +++ +E+
Sbjct: 76 DANGALQWNTTRFPSGGKALGDFLHDLGLGFGLYSGAGYLQCGDDDLPASLGYEKIDAES 135
Query: 130 ----GIDYLKYDNCFN-------DDTRPT----IRYRAMSDALKKAGRPIFFSMCEWG-D 173
G D LKYDNC++ D + P R++ M++ L+ R I + +C+WG
Sbjct: 136 FAGWGGDSLKYDNCYSTSNTTLVDSSSPESQSPARFQRMAEELEAVDRGIRYYVCQWGIG 195
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
W + I N+WR ++DI W + I + Y H G + D DML VG +
Sbjct: 196 TDVGEWAAKIGNTWRISNDIYGKWRSIWRITNQVVPYFRHTTTGAYADMDMLIVGLNSLS 255
Query: 234 YNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQAINDQQ------- 284
E HF +WAI+K+PL++G D L +M I+ N+E+IA+NQ +Q
Sbjct: 256 EEEERFHFGMWAINKSPLIMGAALDPSRLKNTSMEIMLNKEIIAINQDSLAKQAQLVRRD 315
Query: 285 ------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETKFQG 336
IW G LS +R V+ + + + S ++ +GI S ++ + T G
Sbjct: 316 TEGEWDIWMGELSESRHVLGIANWRNDSRTVSVDLKSLGIASATARDVWAAKDAGT-VSG 374
Query: 337 KLTAKVDGHSCKMYELTPVT 356
T + GH +++ L+ V
Sbjct: 375 SQTLHLAGHELRIWVLSNVV 394
>gi|332231370|ref|XP_003264870.1| PREDICTED: alpha-N-acetylgalactosaminidase [Nomascus leucogenys]
Length = 411
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 152/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTADERAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPKVNYSLLADICNLWRNYDDIQDSWWSVLSILSWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
Query: 279 --------AINDQ----QIWAGPLSGNRIVVLLESRKT 304
I+++ +++ PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHEEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|114052322|ref|NP_001039814.1| alpha-N-acetylgalactosaminidase precursor [Bos taurus]
gi|67460991|sp|Q58DH9.1|NAGAB_BOVIN RecName: Full=Alpha-N-acetylgalactosaminidase; AltName:
Full=Alpha-galactosidase B; Flags: Precursor
gi|61553826|gb|AAX46465.1| alpha-N-acetylgalactosaminidase precursor [Bos taurus]
Length = 411
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 135/303 (44%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL PPMGW +W F C I DC E ++ IS M D L Q
Sbjct: 17 VLENGLLRKPPMGWLAWERFRCNI---------DC-SEDPKNC-ISEQLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L +GNLV ++ FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDAKGNLVPDRKRFPHGIAFLADYAHSLGLKLGIYEDLGNF 125
Query: 113 -----ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD ++ F E +D LK D C++ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCYSTPQERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + ++I N WR DDI DSW +L++ D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTLLADICNLWRNFDDIQDSWRSVLSVLDWFVTHQDVLQPIAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFEQAQAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|92096920|gb|AAI14814.1| NAGA protein [Bos taurus]
gi|296486966|tpg|DAA29079.1| TPA: alpha-N-acetylgalactosaminidase precursor [Bos taurus]
Length = 411
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 135/305 (44%), Gaps = 59/305 (19%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQ--ISLSCSFHMFDSLT 76
+L NGL PPMGW +W F C I DC S D + IS M D L
Sbjct: 17 VLENGLLRKPPMGWLAWERFRCNI---------DC----SEDPKNCISEQLFMEMADRLA 63
Query: 77 QIAAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL----------- 112
Q + L +GNLV ++ FP G LADY H L
Sbjct: 64 QDGWRDLGYVYLNIDDCWIGGRDAKGNLVPDRKRFPHGIAFLADYAHSLGLKLGIYEDLG 123
Query: 113 -------ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPI 164
+LD ++ F E +D LK D C++ Y M+ AL GRPI
Sbjct: 124 NFTCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCYSTPQERAEGYPKMAAALNATGRPI 183
Query: 165 FFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADH 213
FS C W G + P + ++I N WR DDI DSW +L++ D +V
Sbjct: 184 AFS-CSWPAYEGGLPPKVNYTLLADICNLWRNFDDIQDSWRSVLSVLDWFVTHQDVLQPI 242
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +
Sbjct: 243 AGPGHWNDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLM 302
Query: 274 IAVNQ 278
I +NQ
Sbjct: 303 IKINQ 307
>gi|47168708|pdb|1R46|A Chain A, Structure Of Human Alpha-galactosidase
gi|47168709|pdb|1R46|B Chain B, Structure Of Human Alpha-galactosidase
gi|47168710|pdb|1R47|A Chain A, Structure Of Human Alpha-Galactosidase
gi|47168711|pdb|1R47|B Chain B, Structure Of Human Alpha-Galactosidase
gi|229597937|pdb|3GXN|A Chain A, Crystal Structure Of Apo Alpha-Galactosidase A At Ph 4.5
gi|229597938|pdb|3GXN|B Chain B, Crystal Structure Of Apo Alpha-Galactosidase A At Ph 4.5
gi|229597939|pdb|3GXP|A Chain A, Crystal Structure Of Acid-alpha-galactosidase A Complexed
With Galactose At Ph 4.5
gi|229597940|pdb|3GXP|B Chain B, Crystal Structure Of Acid-alpha-galactosidase A Complexed
With Galactose At Ph 4.5
gi|238537965|pdb|3GXT|A Chain A, Crystal Structure Of Alpha-Galactosidase A At Ph 4.5
Complexed With 1- Deoxygalactonijirimycin
gi|238537966|pdb|3GXT|B Chain B, Crystal Structure Of Alpha-Galactosidase A At Ph 4.5
Complexed With 1- Deoxygalactonijirimycin
gi|269914453|pdb|3HG2|A Chain A, Human Alpha-Galactosidase Catalytic Mechanism 1. Empty
Active Site
gi|269914454|pdb|3HG2|B Chain B, Human Alpha-Galactosidase Catalytic Mechanism 1. Empty
Active Site
gi|269914457|pdb|3HG4|A Chain A, Human Alpha-Galactosidase Catalytic Mechanism 3. Covalent
Intermediate
gi|269914458|pdb|3HG4|B Chain B, Human Alpha-Galactosidase Catalytic Mechanism 3. Covalent
Intermediate
gi|269914459|pdb|3HG5|A Chain A, Human Alpha-Galactosidase Catalytic Mechanism 4. Product
Bound
gi|269914460|pdb|3HG5|B Chain B, Human Alpha-Galactosidase Catalytic Mechanism 4. Product
Bound
gi|367460152|pdb|3S5Y|A Chain A, Pharmacological Chaperoning In Human Alpha-Galactosidase
gi|367460153|pdb|3S5Y|B Chain B, Pharmacological Chaperoning In Human Alpha-Galactosidase
gi|367460154|pdb|3S5Z|A Chain A, Pharmacological Chaperoning In Human Alpha-Galactosidase
gi|367460155|pdb|3S5Z|B Chain B, Pharmacological Chaperoning In Human Alpha-Galactosidase
Length = 398
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 150/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 1 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 60
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 61 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 94
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 95 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 154
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 155 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 213
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 214 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 273
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 274 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 326
>gi|357622905|gb|EHJ74264.1| alpha-N-acetylgalactosaminidase [Danaus plexippus]
Length = 410
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 170/414 (41%), Gaps = 112/414 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL LTPPMGW +W F C I+E +I+
Sbjct: 18 LDNGLALTPPMGWLTWQRFRCITDCDKYPNECISESLIKRMADIMVNEGYSHAGYKYVGI 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD+ G LV ++ FP+G KA+ADY+H
Sbjct: 78 DDCWLEKTRDAN--------------------------GRLVPDRKRFPNGMKAVADYLH 111
Query: 111 IL-ISNSL-----------------DNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
L + +L ++ E +DY+K D C + ++ Y
Sbjct: 112 DLGLKFALYQDYGTKTCAGYPGVLGHEAVDVQTFAEWEVDYIKLDGCNVNVSKMDTGYPE 171
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTI----AD 205
+ ++GRP+ +S C W + ++I N WR DDI DSWA + TI A+
Sbjct: 172 FGKLMNESGRPMVYS-CSWPAYQNKPDYASISKHCNMWRNWDDIQDSWASLTTIMSWFAE 230
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
E A +A PG WNDPDML +GN G+ ++ V ++W+I APLL+ D+ + E
Sbjct: 231 KQEEIAKYAGPGRWNDPDMLLIGNFGLSLDQARVQMAVWSILAAPLLMSVDLATIRPEFK 290
Query: 266 PIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK---------TF 305
++ N+++IA++Q +Q IW L V + R T
Sbjct: 291 EVLLNKDIIAIDQDELGKQGLMVWNKAKCEIWTRELVDGIAVAFVSKRDDGAPHTVDVTT 350
Query: 306 SSM----MTAHWDDIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPV 355
M T H D+ +N F+ + G T +++ + Y+ P+
Sbjct: 351 EDMKIPPTTYHIQDLYKDGHNFKFDCK-------GNFTTRINPSGVRFYKFIPI 397
>gi|410252378|gb|JAA14156.1| galactosidase, alpha [Pan troglodytes]
gi|410333475|gb|JAA35684.1| galactosidase, alpha [Pan troglodytes]
Length = 429
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 151/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR D+ DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADVDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ ++ ++W PLSG V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQADNFEVWERPLSGLAWAVAMINRQ 357
>gi|440895953|gb|ELR48007.1| Alpha-N-acetylgalactosaminidase, partial [Bos grunniens mutus]
Length = 407
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 135/303 (44%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL PPMGW +W F C I DC E ++ IS M D L Q
Sbjct: 13 VLENGLLRKPPMGWLAWERFRCNI---------DC-SEDPKNC-ISEQLFMEMADRLAQD 61
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L +GNLV ++ FP G LADY H L
Sbjct: 62 GWRDLGYVYLNIDDCWIGGRDAKGNLVPDRKRFPHGIAFLADYAHSLGLKLGIYEDLGNF 121
Query: 113 -----ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD ++ F E +D LK D C++ Y M+ AL GRPI F
Sbjct: 122 TCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCYSTPQERAEGYPKMAAALNATGRPIAF 181
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + ++I N WR DDI DSW +L++ D +V A
Sbjct: 182 S-CSWPAYEGGLPPKVNYTLLADICNLWRNFDDIQDSWRSVLSVLDWFVTHQDVLQPIAG 240
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I
Sbjct: 241 PGHWNDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIK 300
Query: 276 VNQ 278
+NQ
Sbjct: 301 INQ 303
>gi|178246|gb|AAA51676.1| alpha-galactosidase A precursor (EC 3.2.1.22), partial [Homo
sapiens]
gi|3808178|dbj|BAA34059.1| alpha-galactosidase A [Homo sapiens]
Length = 403
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 150/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 6 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 65
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 66 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 99
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 100 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 159
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 160 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 218
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 219 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 278
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 279 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 331
>gi|348570592|ref|XP_003471081.1| PREDICTED: alpha-galactosidase A-like [Cavia porcellus]
Length = 426
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 150/324 (46%), Gaps = 62/324 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TP MGW W F C +N K E D C E L S D +
Sbjct: 32 LDNGLARTPTMGWLHWERFLCNLNCK--EEPDSCISEQLFMQMAELMVS----DGWKDVG 85
Query: 80 AKPL--------PNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
+ L P + + L A+ FP G + LA+YVH + I + N+
Sbjct: 86 YEYLCIDDCWMAPQRDSENRLQADPQRFPGGIRRLANYVHSKGLKLGIYADVGNKTCAGF 145
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
+++ + G+D LK+D C+ D Y+ MS AL + GR I +S CEW
Sbjct: 146 PGSFGHYDLDAQTFADWGVDLLKFDGCYCDSINHLENGYKYMSLALNRTGRSIVYS-CEW 204
Query: 172 GDMRPALWG------SNIR---NSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGG 218
LW ++IR N WR DD+SDSW + I D E D A PGG
Sbjct: 205 P---LYLWPFHKPNYTDIRQYCNHWRNFDDVSDSWQSINDILDWTSSRQEEIVDVAGPGG 261
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ +++ V ++WAI APLL+ D+ ++ + ++ N++VIA+NQ
Sbjct: 262 WNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISPQAKALLQNKDVIAINQ 321
Query: 279 -----------AINDQQIWAGPLS 291
++ ++W PLS
Sbjct: 322 DPLGKQGYCLRKEDNTEVWERPLS 345
>gi|397487189|ref|XP_003814689.1| PREDICTED: alpha-N-acetylgalactosaminidase [Pan paniscus]
Length = 411
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 151/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I+ + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWISGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + S +I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHKEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|148229665|ref|NP_001085328.1| alpha-N-acetylgalactosaminidase [Xenopus laevis]
gi|49256018|gb|AAH71089.1| Nagaa protein [Xenopus laevis]
Length = 400
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/300 (34%), Positives = 143/300 (47%), Gaps = 50/300 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS----- 74
L NGL TPPMGW +W + C I+ K + D+C E+ + S + M DS
Sbjct: 7 LDNGLVRTPPMGWMTWQRYRCNIDCK--SDPDNCISEN-----LIKSMADKMADSGWRDV 59
Query: 75 -LTQIAAKPLPNQCQ----GNLVANKTTFPSGTKALADYVHI------------------ 111
T I+ +Q Q G L + FPSG KALADYVH
Sbjct: 60 GYTYISIDDCWSQKQRDSNGRLQPDLERFPSGMKALADYVHAKGLKLGIYSDMGTYTCGG 119
Query: 112 LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+LD +I++ +D LK+D C+++ T + Y MS+AL GRPI +S C
Sbjct: 120 YPGTTLDTIKIDAETFASWEVDMLKFDGCYSNSTEKALGYPKMSEALNGTGRPILYS-CS 178
Query: 171 W----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + GS I N WR DI DSW +L I A +V A PG
Sbjct: 179 WPAYEGGLPPKVNYTQLGS-ICNMWRNYGDIQDSWDSVLDIIEWYAKNQDVLQPAAGPGR 237
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML G+ G+ Y + +IWAI APL++ D+ ++ + ++ N +I +NQ
Sbjct: 238 WNDPDMLITGDFGLSYEQSKSQLAIWAILAAPLIMSNDLRTISQDAKDLLQNRLLIYINQ 297
>gi|342890317|gb|EGU89146.1| hypothetical protein FOXB_00331 [Fusarium oxysporum Fo5176]
Length = 435
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/270 (36%), Positives = 130/270 (48%), Gaps = 62/270 (22%)
Query: 89 GNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------------IAFEQ------- 129
G L AN T FP G A A YVH S L I S+ +E+
Sbjct: 107 GKLAANHTRFPGGISATASYVH---SKGLKVGIYSDAGIFTCGKYPGSYGYEEIDAQTFA 163
Query: 130 --GIDYLKYDNC--FNDDTRPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
G+DYLKYDNC F +T R+ MS AL +GR IF+S+CEWG+ P LW I
Sbjct: 164 GWGVDYLKYDNCGGFQSNTLSVQERFLKMSYALAASGRQIFYSLCEWGNQFPWLWADQIG 223
Query: 185 NSWRTTDDISDSWAR-----MLTIADMNEVYAD---------------HAKPGGWNDPDM 224
S+R + DI S+A+ T MN+ YA +KPG W D DM
Sbjct: 224 ESYRMSGDIYSSFAKDRASICKTAYCMNQGYAGVSVLTMIRKMREISPFSKPGSWADMDM 283
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
LE+G M E H S WA K+PL++G D+ N++ ++ I N+++IA+NQ
Sbjct: 284 LEIGTWTMTELEEQTHLSFWAALKSPLIIGADLKNISDTSLAIYKNKDMIALNQDDAGKP 343
Query: 279 AI--------NDQQIWAGPL-SGNRIVVLL 299
A+ Q+WAGPL SG R V+L
Sbjct: 344 AVYLPKLSEEGSYQVWAGPLSSGKRRHVIL 373
>gi|313204099|ref|YP_004042756.1| alpha-galactosidase [Paludibacter propionicigenes WB4]
gi|312443415|gb|ADQ79771.1| Alpha-galactosidase [Paludibacter propionicigenes WB4]
Length = 680
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 127/447 (28%), Positives = 192/447 (42%), Gaps = 117/447 (26%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------- 50
++VS A ++ G R + + LTPPMGWNSWN F +++ ++
Sbjct: 253 YLVSLEAKNAKGKAVRKFKIECGDKIALTPPMGWNSWNCFAQEVSADKVKRAANAMVKSG 312
Query: 51 -----------DDCWGESSRDSQ-ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTF 98
DD W E++RDS+ S+ F GN+V N
Sbjct: 313 LINHGWTYINIDDFW-ENNRDSKDQSIRGKFR---------------DEAGNIVPNSRFV 356
Query: 99 PSGTKALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF 140
K LADYVH L + ++++ + G DYLKYD C
Sbjct: 357 D--MKGLADYVHGLGLKIGLYSSPGPWTCGGCAGSYGYEKLDAESYAKWGFDYLKYDWCS 414
Query: 141 ---------NDD---------------TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
N+D + ++ M D LK+ R I FS+C++G
Sbjct: 415 YGNVINGLPNNDPLKVSSLSYNGGSVLSTAMKPFQLMGDLLKQQPRDIVFSVCQYGMSDV 474
Query: 177 ALWGSNI-RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG---- 231
WG ++ N WRTT+DI+D+WA + +I + A +AKPG WNDPDML VG+ G
Sbjct: 475 WKWGGSVGGNLWRTTNDITDTWASVKSIILDQDKSAAYAKPGNWNDPDMLVVGHVGWGNP 534
Query: 232 ----MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN---------- 277
++ +E +H S+W++ APLL+GCD+ L T+ ++ N+EVI +N
Sbjct: 535 HPSKLRPDEQYLHISLWSLFAAPLLIGCDMEKLDDFTLNLLTNDEVIEINQDPLGKQATC 594
Query: 278 -QAINDQQIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHR------- 328
Q I + +I+ L G+R V + +D IG+ N F R
Sbjct: 595 IQTIGELRIYVKELEDGSRAVGFCNLGADIIDISYKDFDKIGL---NGKFNVRDVWRQKN 651
Query: 329 --TLETKFQGKLTAKVDGHSCKMYELT 353
T+ETK +L KV H +Y+ T
Sbjct: 652 ISTIETK-TSQLALKVPVHGVLLYKFT 677
>gi|291410288|ref|XP_002721425.1| PREDICTED: NAGA protein-like [Oryctolagus cuniculus]
Length = 410
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 148/341 (43%), Gaps = 67/341 (19%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
LL NGL TPPMGW +W F C I DC + ++ IS M D L Q
Sbjct: 17 LLENGLLRTPPMGWLAWERFRCNI---------DC--DEDPENCISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNR 120
+ L +G LV + FP G LADY H L I + N
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDAEGRLVPDSKRFPHGIGFLADYAHSLGLKLGIYEDMGNL 125
Query: 121 --------------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+++ +D LK D CF+ + Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVDLDAQTFASWKVDMLKLDGCFSTSEERALGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S +I N WR DD+ DSW +L+I D ++ A
Sbjct: 186 S-CSWPAYEGGLPPKVNYSLLADICNLWRNYDDMQDSWRDVLSILDWFVKHQDILQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML VGN G+ + + ++W I APL + D+ ++ + M I+ N +I
Sbjct: 245 PGRWNDPDMLLVGNFGLSFEQARSQMALWTILAAPLFMSTDLRTISPQNMDILQNPLMIK 304
Query: 276 VNQ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
+NQ I ++I + PLSG ++ SR+T
Sbjct: 305 INQDPLGIQGRRILKEKSHIEVFKRPLSGEASALVFFSRRT 345
>gi|387016048|gb|AFJ50143.1| Alpha-galactosidase A-like [Crotalus adamanteus]
Length = 396
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 146/320 (45%), Gaps = 50/320 (15%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGE------SSRDSQISLSCSFHMFDSLTQIAAKPL 83
MGW W F CQ + + RE DC E + R + + F + P
Sbjct: 1 MGWLHWERFLCQTDCE--REPRDCISERLFMQMADRMASEGWKEVGYHFLCIDDCWMAPT 58
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRIESNIA-----FEQ---- 129
++ QG L + FPSG K LADYVH + I + NR + +EQ
Sbjct: 59 RDK-QGRLQPDPKRFPSGIKKLADYVHSKGLKLGIYADIGNRTCAGFPGSYGHYEQDAET 117
Query: 130 ----GIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
G+D LK+D C F Y+ MS AL K GR I +S CEW + N
Sbjct: 118 FASWGVDLLKFDGCDFGTLDEMAEGYKKMSAALNKTGRNIVYS-CEWPLYQRPFQKVNYT 176
Query: 185 ------NSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
N WR DISD+W + I D ++ D A PGGWNDPDML +GN G+ +
Sbjct: 177 EIKQYCNYWRNYADISDTWISIKNILDWTSSHQDILVDIAGPGGWNDPDMLVIGNFGLSW 236
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------AINDQ-- 283
++ + + WAI APLL+ D+ +++ ++ N+EVIA+NQ DQ
Sbjct: 237 DQQITQIAFWAIMAAPLLMSNDLRQISSRAKALLQNKEVIAINQDPLGKQGYRITKDQSF 296
Query: 284 QIWAGPLSGNRIVVLLESRK 303
++W PLS + + +RK
Sbjct: 297 ELWERPLSDGAFAIAVINRK 316
>gi|2570148|dbj|BAA22992.1| alpha-galactosidase [Talaromyces purpurogenus]
Length = 439
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/422 (27%), Positives = 179/422 (42%), Gaps = 93/422 (22%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L + +G P +GWNSWN + C +NE I +++ + L + + ++
Sbjct: 21 ILPDDVGRLPALGWNSWNAYGCNVNETKIVTA------ATKLNTTGLQALGYQYVNIDDC 74
Query: 79 -AAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLD-------------N 119
+ K + ++ N TFP+G A VH I I +S
Sbjct: 75 WSVKSGRDNVTNRIIPNPDTFPNGINGTAQQVHDLGLKIGIYSSAGYETCAGYPASLGYE 134
Query: 120 RIESNIAFEQGIDYLKYDNC-----FNDD----------------------TRP------ 146
I++ E GIDYLKYDNC ++D+ T P
Sbjct: 135 TIDAQTFAEWGIDYLKYDNCNYPSEWDDEYNACIPDSDYPGVNPNGTCPGLTNPAPAGYD 194
Query: 147 ------TIRYRAMSDALKKAG--RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA 198
T R+ M DAL R I +S+CEWG WG+ NSWR T DI+ +W
Sbjct: 195 WSTSNTTKRFNIMRDALVAVQDQRVILYSLCEWGYADVPSWGNGTGNSWRVTGDINATWD 254
Query: 199 RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
R+ IA+MN G NDPDMLEVGNG + E HF++WAI K+PL++G D+
Sbjct: 255 RITAIANMNAHELSSVDFWGHNDPDMLEVGNGDLTIEENRAHFALWAIMKSPLIIGTDLS 314
Query: 259 NLTAETMPIIGNEEVIAVNQ-------AINDQQ--------------IWAGPLSGNRIVV 297
L + I+ N ++IA NQ A+ +Q W+G S +V
Sbjct: 315 TLPDTHLSILKNADLIAFNQDPVIGKPALPYKQGYNNGTYNPNYPPEYWSGATSYGWNLV 374
Query: 298 LLESRKTFSSMMTAHWDDI---GIKSNNTLFEHRTLETKF---QGKLTAKVDGHSCKMYE 351
LL + + ++ +A W +I G SN E + TA+++ H + +
Sbjct: 375 LLFNSEDVAASRSAVWSEIPQLGKHSNRYRVEDVWTGKDLGCVHKEYTAQLEAHDVAVLK 434
Query: 352 LT 353
+T
Sbjct: 435 VT 436
>gi|118089381|ref|XP_420183.2| PREDICTED: alpha-galactosidase A [Gallus gallus]
Length = 409
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 153/333 (45%), Gaps = 60/333 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCW-GESSRDSQISLSCSFH 70
NG+ TPPMGW W F C ++E++ E D E +++ C
Sbjct: 23 NGVARTPPMGWLHWERFLCGTDCAAEPDRCVSERLFTEMADVMVAEGWKEAGYEFVC--- 79
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------ 119
+ P ++ +G L A+ FPSG +ALADYVH + I + + N
Sbjct: 80 ----IDDCWMAPTRDE-RGRLRADPRRFPSGIRALADYVHSKGLKLGIYSDVGNTTCAGF 134
Query: 120 -------RIESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
+++ G+D LK+D C D YR MS AL K GRPI +S CEW
Sbjct: 135 PGSYGHYELDAQTFASWGVDLLKFDGCNADSLELLAEGYRNMSLALNKTGRPIVYS-CEW 193
Query: 172 G-DMRPALWGSNIR-----NSWRTTDDISDSWARMLTIADMNEVYADH----AKPGGWND 221
+RP + N WR D+ DSW + +I + ++ D A PGGWND
Sbjct: 194 PFYLRPMQQPNYTEIKQYCNHWRNFYDVYDSWNSIKSIMEWTALHQDSIVKIAGPGGWND 253
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ + + V ++WAI APL + D+ ++ E ++ N+EVIA+NQ
Sbjct: 254 PDMLVIGNFGLSWEQAVTQMAMWAIMAAPLFMSNDLRHMKPEAKWLLQNKEVIAINQDPL 313
Query: 282 DQQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + +++
Sbjct: 314 GKQGYRITKDKNFELWERPLSDRAYAVAVLNQQ 346
>gi|302549411|ref|ZP_07301753.1| carbohydrate binding family 6 protein [Streptomyces
viridochromogenes DSM 40736]
gi|302467029|gb|EFL30122.1| carbohydrate binding family 6 protein [Streptomyces
viridochromogenes DSM 40736]
Length = 531
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 120/407 (29%), Positives = 175/407 (42%), Gaps = 103/407 (25%)
Query: 26 LTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLS 66
LTPP+GWNSWN F C I E +R+ DDCW + RD+
Sbjct: 24 LTPPLGWNSWNSFGCGITEAQVRQAADAMVSSGMRDAGYRYVVVDDCWFDPQRDAA---- 79
Query: 67 CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-------------- 112
GNL AN T FP G KAL DY+H
Sbjct: 80 ----------------------GNLRANPTKFPGGMKALGDYIHGKGLKFGIYQVPGERT 117
Query: 113 ----------ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
+ S + + F G+DYLKYD C + TR R+ M DAL+
Sbjct: 118 CAQTSGAYPGSTGSRGHEAQDAATFASWGVDYLKYDWCSSSGTRDEQVARFTLMRDALRA 177
Query: 160 AGRPIFFSM------CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT---------IA 204
GRPI +S+ GD WG + + WRTT+D+ D W T +
Sbjct: 178 TGRPIVYSINPNSFHAITGDTYN--WG-EVADLWRTTEDLLDIWQNGNTNSYPMGVGNVL 234
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
D+ A + PG WNDPDML VG G+ E HF++WA+ APL+ G D+ ++A+
Sbjct: 235 DVTAPLAAQSGPGHWNDPDMLVVGRPGLSLTESRSHFALWALLAAPLMAGNDIRTMSADV 294
Query: 265 MPIIGNEEVIAVNQ--------AINDQ---QIWAGPLSGNRIVVLLESRKTFSSMMTAHW 313
I+ N ++AVNQ + D +++A PLS + V L +R ++ ++A
Sbjct: 295 SAILRNPRLLAVNQDPLGAGGRRVRDDGGTEVFAKPLSDGSVAVGLFNRGGDTATVSATA 354
Query: 314 DDIGIKSNN-TLFEHRTLET-KFQGKLTAKVDGHSCKMYELTPVTKL 358
+G+ TL + T T G+++A V H ++ ++ + L
Sbjct: 355 AQVGLSGGPFTLTDLWTGGTSSTSGQISASVPAHGVAVFRVSGGSPL 401
>gi|114686801|ref|XP_525662.2| PREDICTED: alpha-N-acetylgalactosaminidase [Pan troglodytes]
Length = 411
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 150/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHKEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|91086279|ref|XP_973373.1| PREDICTED: similar to
alpha-galactosidase/alpha-n-acetylgalactosaminidase
[Tribolium castaneum]
Length = 407
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/317 (32%), Positives = 139/317 (43%), Gaps = 46/317 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL LTPPMGW W F C+ + + + DC E+ + S D+ +
Sbjct: 17 LDNGLALTPPMGWLHWERFRCETDCR--QFPFDCISENLFKAMADKMASDGYLDAGYEYV 74
Query: 80 AK-----PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSL 117
+ +G L A+ FPSG KALADYVH SL
Sbjct: 75 IMDDCWLAMDRDSEGRLQADPDRFPSGIKALADYVHAKGLKFGIYEDYGTKTCAGYPGSL 134
Query: 118 DN-RIESNIAFEQGIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW---- 171
D+ I++ E G+DYLK D C D + M+ L + GR I FS CE+
Sbjct: 135 DHLEIDAKTFAEWGVDYLKMDGCNVTPDEAMEAGHLEMARYLNETGREIVFS-CEFPLYR 193
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDMLEV 227
GD N WR +DI DSW + I + + Y A PG WNDPDML +
Sbjct: 194 GDKANYSVAIEACNLWRNYNDIEDSWVSVTNIVNHYKKNQDKYVAVAGPGHWNDPDMLII 253
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
GN G+ ++ +IWAI APL++ D+ + E I+ N+ I +NQ
Sbjct: 254 GNFGLSLDQSKAQMTIWAIWAAPLIMSVDLRTIKPEFKEILLNKHAIKINQDPLGIQGHL 313
Query: 279 --AINDQQIWAGPLSGN 293
IND +W P+S N
Sbjct: 314 QTTINDVDVWLKPISPN 330
>gi|115764848|ref|XP_792642.2| PREDICTED: alpha-N-acetylgalactosaminidase-like [Strongylocentrotus
purpuratus]
Length = 415
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/299 (33%), Positives = 135/299 (45%), Gaps = 49/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C I+ K + +C E Q+ + + M + + A
Sbjct: 26 LDNGLARTPPMGWMAWERFRCNIDCK--NDPKNCISE-----QLFMDMADRMVEDGYRDA 78
Query: 80 AKPLPN----------QCQGNLVANKTTFPSGTKALADYVHIL----------------- 112
N QG L A+ FP+G KALADYVH
Sbjct: 79 GYEYVNIDDCWSSKQRDAQGRLQADPDRFPNGIKALADYVHKRGLKLGTYGDYGNLTCGG 138
Query: 113 ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
SL + I++ + GID K D C+ D Y M+ L + GRPI FS C W
Sbjct: 139 YPGSLGHLEIDAQTFADWGIDMFKMDGCYADPKTMDKGYPQMTKYLNQTGRPILFS-CSW 197
Query: 172 GDMRPA--------LWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGW 219
D + A L N N WR DDI DSW +L I D + A PG +
Sbjct: 198 PDYQRASGIDVNYKLIAENC-NMWRNYDDIQDSWDSVLGIIDYYAKEQDTLAPVQGPGSF 256
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDM+ VG+ G+ Y + ++W I AP+L+ D+ + E ++ N+EVIA+NQ
Sbjct: 257 NDPDMIIVGDFGLSYEQAKAQIAMWTIFAAPMLMSNDLRTIRPEFRELLLNKEVIAINQ 315
>gi|396486712|ref|XP_003842464.1| hypothetical protein LEMA_P082240.1 [Leptosphaeria maculans JN3]
gi|312219041|emb|CBX98985.1| hypothetical protein LEMA_P082240.1 [Leptosphaeria maculans JN3]
Length = 1497
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 163/342 (47%), Gaps = 56/342 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNS+N + C ++E+ I++ S + + L +DS+T P ++
Sbjct: 25 TPPMGWNSYNTWGCSVSEEKIKQ--------SSEGIVRLGLDKVGYDSVTVDCGWPARDR 76
Query: 87 -CQGNLVANKTTFPSGTKALADYVH-------------ILISNSLD-------NRIESNI 125
QG L N T FPSG KAL DY+H L S D ++++
Sbjct: 77 DEQGRLQWNATLFPSGGKALGDYLHGMGLAFGLYSGAGYLQCGSTDLPASLGYEKLDAES 136
Query: 126 AFEQGIDYLKYDNCFN-----------DDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-D 173
E G D LKYDNC++ ++++ R++ M+ L+ R I + +C+WG
Sbjct: 137 FAEWGGDSLKYDNCYSTSNTTMVDSSSEESQSPARFQHMAAELEAVQRDIRYYICQWGIG 196
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
W + I N+WR ++DI ++W + I + Y H G + D DM+ +G +
Sbjct: 197 TDVGKWSAEIGNTWRISNDIYNAWRSIWRITNQVVPYFRHTTVGAFADMDMMIIGLNALS 256
Query: 234 YNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ------------- 278
E HF +WAI+K+PL++G D ++ ++ I+ N+EVI++NQ
Sbjct: 257 AEEERFHFGMWAINKSPLIIGAALDESRMSKTSLDIMMNKEVISINQDALAKSAQLVRRY 316
Query: 279 AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
+ + +W G LS +R V+ + + + + + +GI S
Sbjct: 317 SEEEWDVWLGDLSDSRKVLGIANWRNDTQSVKVDLKSLGIAS 358
>gi|126273087|ref|XP_001373509.1| PREDICTED: alpha-galactosidase A-like [Monodelphis domestica]
Length = 454
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 153/332 (46%), Gaps = 54/332 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L NGL LTP MGW W F C +N I++ ++C E L S D+ +
Sbjct: 20 LDNGLALTPTMGWLPWERFTCNVN--CIQDPENCISEQLFMQMADLMVSEGWKDAGYEYV 77
Query: 78 ------IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------ 120
+A+K G L A+ FPSG + LA+YVH + I +
Sbjct: 78 CIDDCWLASK---RNKDGTLKADPKRFPSGIRHLANYVHSRGLKLGIYQDVGTLTCAGYP 134
Query: 121 -------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWG 172
I++ + G+D LK+D C+ D + Y+ MS AL K GR I +S CEW
Sbjct: 135 GSFGYYDIDAQTFADWGVDLLKFDGCYYKDIESLVEGYKHMSIALNKTGRSILYS-CEWP 193
Query: 173 DMRPALWGSN---IR---NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
N IR N WR DI DSW + +I A + A PGGWNDP
Sbjct: 194 LYTRPFQEPNYTEIRQYCNHWRNYVDILDSWRSVKSILAWTASKQDSLVPAAGPGGWNDP 253
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIND 282
DML +GN G+ +++ V ++WAI APL + D+ +++ + ++ N++VIA+NQ
Sbjct: 254 DMLVIGNFGLSWDQQVTQMALWAIMAAPLFMSNDLRHISCRSKALLQNKDVIAINQDPLG 313
Query: 283 QQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + ++K
Sbjct: 314 KQGYLLRKDKNFEVWERPLSNLAWAVAVLNQK 345
>gi|291212901|dbj|BAI82608.1| modified alpha-N-acetylgalactosaminidase [eukaryotic synthetic
construct]
Length = 425
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 136/299 (45%), Gaps = 49/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L NGL TPPMGW +W F C IN E++ E D ++D + ++
Sbjct: 32 LDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYTY 88
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----------------- 112
D I + G L+ + FP G LADYVH L
Sbjct: 89 LNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCMG 143
Query: 113 -ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS CE
Sbjct: 144 YPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-CE 202
Query: 171 W----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
W G + P + ++I N WR DDI DSW +L+I + ++ A PG W
Sbjct: 203 WPLYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGHW 262
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 263 NDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 321
>gi|426394638|ref|XP_004063597.1| PREDICTED: alpha-N-acetylgalactosaminidase [Gorilla gorilla
gorilla]
Length = 411
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 150/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RLAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHKEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|91086281|ref|XP_973404.1| PREDICTED: similar to CG7997 CG7997-PA [Tribolium castaneum]
Length = 411
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 141/328 (42%), Gaps = 64/328 (19%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISL 65
R L NGL TPPMGW W F C I+EK+ + D + + + ++
Sbjct: 13 RALALDNGLARTPPMGWMHWQRFRCLVDCDAYPDECISEKLFKTMAD---KMASEGYLAA 69
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNI 125
+ + D + QG L N T FPSG KAL+DYVH S L I S+
Sbjct: 70 GYEYLIIDDCWMAKNR----DAQGRLQPNATRFPSGIKALSDYVH---SKGLKFGIYSDY 122
Query: 126 A---------------------FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPI 164
E G+DYLK D C+ D Y M L + GRPI
Sbjct: 123 GTKTCAGYPGSHGHLETDAQTFAEWGVDYLKLDGCYADLDDLEPGYIQMGKFLNQTGRPI 182
Query: 165 FFSMCEW-GDMRPALWGSNIR------NSWRTTDDISDSWARMLTI----ADMNEVYADH 213
+S C W P N N WR DDI D+W+ + +I + + A+
Sbjct: 183 VYS-CSWPAYQEPKGVKPNYTALRETCNLWRNWDDIDDTWSNVTSILKWFSTNQDRIAEF 241
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
+ PG WNDPDML +GN G+ Y + ++WAI APL++ D+ + + ++ ++EV
Sbjct: 242 SGPGHWNDPDMLIIGNFGLSYEQSKAQMALWAILAAPLIMSVDLRTIEPKFRDVLLHQEV 301
Query: 274 IAVNQ-----------AINDQQIWAGPL 290
I +NQ IN IW P+
Sbjct: 302 IKINQDALGIQGRLITTINKIDIWTKPI 329
>gi|46110355|ref|XP_382235.1| hypothetical protein FG02059.1 [Gibberella zeae PH-1]
Length = 407
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/426 (23%), Positives = 178/426 (41%), Gaps = 105/426 (24%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET----------- 50
+V+ + + +R+ L + +G P +GWN WN C +EK+ T
Sbjct: 9 VVALATLTTAAPERKTLATREVGRLPALGWNGWNQGQCNAASEKVALATAKTFIDLGLKD 68
Query: 51 --------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGT 102
DDCW RDS+ GNLV + + +P G
Sbjct: 69 AGYQYINIDDCWSTRQRDSK--------------------------GNLVPDPSKWPRGI 102
Query: 103 KALADYVHIL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF---- 140
K + D +H + + + ++++ G+DY KYDNC+
Sbjct: 103 KPVVDEIHKMGLKFGLYGDRGVKTCAGFPGSQGHEKQDADLLASWGVDYWKYDNCYTPCY 162
Query: 141 --------------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS 186
+RP Y M D L+ G+ I +S+C WG WG+++ +
Sbjct: 163 NGNQADIQTCPIGKGPSSRPG--YELMRDMLRNTGKDILYSLCNWGWDEVWTWGASVGHM 220
Query: 187 WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAI 246
WR + D W ++ IA+ + +PG +ND DM+ + NG + E HF+IWAI
Sbjct: 221 WRMSVDNWGKWDDVVRIANQAAPILKYTQPGRYNDLDMMILANGALTPAEERTHFAIWAI 280
Query: 247 SKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------------AINDQQI---WA 287
+K+P++LG D+ + + + +I N+ ++AVNQ A +I W+
Sbjct: 281 TKSPIILGTDMTKINSAEIKLITNKGLLAVNQDSLSKPAVPFTPPGTPAKASNEIYPYWS 340
Query: 288 GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK-LTAKVDGHS 346
GPL+ +V ++ S+ + ++ G+K + + K +G+ + AK++ H
Sbjct: 341 GPLASGTVVAIVASKGNLDTTLSLS-QVPGLKDQDYSWTELLTGAKGRGRTVQAKLEKHD 399
Query: 347 CKMYEL 352
++ +
Sbjct: 400 IAVFRI 405
>gi|443900230|dbj|GAC77556.1| alpha-d-galactosidase [Pseudozyma antarctica T-34]
Length = 574
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/420 (28%), Positives = 177/420 (42%), Gaps = 77/420 (18%)
Query: 5 SSSAADSNGSDRRNLLSNGLGL--TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQ 62
SSS A S S+ R S+G + TP MG+N++N C E +T D SQ
Sbjct: 23 SSSDAKSLRSNVRARQSSGASVQPTPGMGFNTYNQVSCSPTEASSHKTMDIMA-----SQ 77
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIE 122
++ + MF AK GNL+ + FPSG K+L Y S L+ +
Sbjct: 78 GYIAAGYGMFGIDCGWVAKSSSRDSAGNLMIDTNNFPSGMKSLGAYA---TSKGLEFGLY 134
Query: 123 SNIAF--------------------------EQGIDYLKYDNCFND-----DTRPT---- 147
S+ F + YLKYDNC+ D P
Sbjct: 135 SDGGFLACDPDVPSKRLGSLNHEAQDAALLKSFNVSYLKYDNCYAGGSTAGDNAPKAART 194
Query: 148 ---IRYRAMSDALKKAGRPIFFSMCEWGD---------MRPALWGSNIRNSWRTTDDISD 195
R+ AMS+AL K G +C+WG + P W + S+R +DDI+
Sbjct: 195 DFPTRFGAMSNALAKVGIHKML-VCQWGVPQRQSDGSLVGPDQWTKGLSTSYRLSDDIAQ 253
Query: 196 SWARMLTIADMNEVYA----DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
W ++ I +N+ + PG + D D+LEVGN GM +E HF+ WA+ K+PL
Sbjct: 254 GWVNVVRI--LNQAIPISLNGRSGPGHFADGDLLEVGNAGMSIDEQATHFAFWAMIKSPL 311
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ------------AINDQQIWAGPLSGNRIVVLL 299
++ D+ +++E I+ N+ +IAVNQ ND + AG LS + VL
Sbjct: 312 MISTDLSTISSEAKAILLNKGLIAVNQDKLGQPVKLVERRTNDFDLHAGNLSNGDLAVLA 371
Query: 300 ESRKTFSSMMTAHWDDIGIKSNNTL-FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ + +G++S + + T +T QG + KV+GH L V L
Sbjct: 372 IDWTNSQRTIKIDFAQLGVQSADVMDLWSGTTQTNVQGSFSKKVNGHGSVALRLANVKAL 431
>gi|295789486|pdb|3LX9|A Chain A, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789487|pdb|3LX9|B Chain B, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789488|pdb|3LXA|A Chain A, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789489|pdb|3LXA|B Chain B, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789490|pdb|3LXB|A Chain A, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789491|pdb|3LXB|B Chain B, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789492|pdb|3LXC|A Chain A, Interconversion Of Human Lysosomal Enzyme Specificities
gi|295789493|pdb|3LXC|B Chain B, Interconversion Of Human Lysosomal Enzyme Specificities
Length = 404
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 153/357 (42%), Gaps = 104/357 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 1 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 60
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 61 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 94
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 95 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 154
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPA-LWG------SNIR---NSWRTTDDISDSWARML 201
MS AL + GR I +S C W PA +W + IR N WR DI DSW +
Sbjct: 155 HMSLALNRTGRSIVYS-CSW----PAYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIK 209
Query: 202 TIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDV 257
+I D E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+
Sbjct: 210 SILDWTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDL 269
Query: 258 GNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+++ + ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 270 RHISPQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 326
>gi|344296168|ref|XP_003419781.1| PREDICTED: alpha-N-acetylgalactosaminidase [Loxodonta africana]
Length = 411
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 148/340 (43%), Gaps = 67/340 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C I DC E + IS M D L Q
Sbjct: 18 LENGLLQTPPMGWLAWERFRCNI---------DC--EEDPKNCISERLFVEMADRLAQDG 66
Query: 80 AKPLP------NQC-------QGNLVANKTTFPSGTKALADYVHIL-------------- 112
+ L + C GNLV + FPSG LADY H L
Sbjct: 67 WRDLGYTYLNIDDCWIGGRDDSGNLVPDPKRFPSGIAYLADYAHSLGLKLGIYADMGNFT 126
Query: 113 ----ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
+LD ++ F + +D LK D C++ Y MS AL GRPI FS
Sbjct: 127 CMGYPGTTLDRVVQDAQTFAKWKVDMLKLDGCYSTPLERAYGYPKMSAALNATGRPIAFS 186
Query: 168 MCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKP 216
C W G + P + +N+ N WR DDI DSW +L+I + ++ A P
Sbjct: 187 -CSWPAYEGGLPPKVNYTLLANVCNLWRNYDDIQDSWKSVLSILHWFVENQDILQPVAGP 245
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WNDPDML VGN G+ + E ++W + APL + D+ ++ + + I+ N +I +
Sbjct: 246 GHWNDPDMLLVGNFGLSFEEAQSQMALWTVLAAPLFMSTDLRTISPKDVDILQNPLMIKI 305
Query: 277 NQ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
+Q I ++I + PLS ++ SR+T
Sbjct: 306 HQDPLGIQGRRILKEKSYIDVFMRPLSNGASALVFFSRRT 345
>gi|195434174|ref|XP_002065078.1| GK15268 [Drosophila willistoni]
gi|194161163|gb|EDW76064.1| GK15268 [Drosophila willistoni]
Length = 412
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 135/300 (45%), Gaps = 47/300 (15%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS--------- 68
N L NGL TPPMGW SW F C N + D+C E + L +
Sbjct: 16 NCLDNGLARTPPMGWLSWERFRC--NTDCTNDPDNCISEQLFQTMTDLLVAEGYASVGYE 73
Query: 69 -FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-- 120
++ D + P +GNLV ++ FPSG KAL+DYVH I N
Sbjct: 74 YINVDDCWLERNRSP-----EGNLVPDRKRFPSGIKALSDYVHSRGLKFGIYEDYGNYTC 128
Query: 121 --------IESNIAF---EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
E +A E +DY+K D C+ Y + L + G+P+ +S C
Sbjct: 129 AGYPGIIGYEKEVALQFAEWDVDYVKLDGCYALPIDMDKGYSEFGNLLNRTGKPMVYS-C 187
Query: 170 EW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W ++P N WR DDI DSW+ + I D ++ + +A PG
Sbjct: 188 SWPVYQIYAGIQPNFSAVQTHCNLWRNYDDIQDSWSSVENIIDYYGNNQDIISANAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + E I+ N ++I V+Q
Sbjct: 248 WNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPEFKQILLNRKIIEVDQ 307
>gi|301782613|ref|XP_002926720.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Ailuropoda
melanoleuca]
Length = 411
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/400 (28%), Positives = 171/400 (42%), Gaps = 70/400 (17%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCS 68
+L NGL PPMGW +W F C I+E++ E D ++D L +
Sbjct: 17 VLENGLLRKPPMGWLAWERFRCNTNCDEDPKNCISERLFMEMAD---HLAQDGWRDLGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + +G LV + FP+G LADY H L
Sbjct: 74 YLNIDD-CWIGGR----DAKGRLVPDPKRFPNGIAFLADYAHSLGLKLGIYEDMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD I+ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVIQDAQTFAEWKVDMLKLDGCFSTPEERAKGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + + I N WR DDI DSW+ +L+I D ++ A PG
Sbjct: 188 SWPAYEGGLPPKVNYTLLAEICNLWRNYDDIQDSWSSVLSILDWFVDNQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML VGN G+ + + ++W + APL + D+ ++A+ I+ N +I +NQ
Sbjct: 248 WNDPDMLLVGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNTDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT-FSSMMTAHWDDIGIKSNNTLF 325
I ++I + PL+ ++ SR+T + + S+NT +
Sbjct: 308 DPLGIQGRRILREKSHIEVFVRPLANEASALVFFSRRTDMPYRYHSSLAQLNFNSSNT-Y 366
Query: 326 EHRTLET-------KFQGKLTAKVDGHSCKMYELTPVTKL 358
E + + T Q T ++ M+ L P+ KL
Sbjct: 367 EAQNVYTGDVISGLHTQTNFTVIINPSGVVMWYLYPIRKL 406
>gi|389625703|ref|XP_003710505.1| hypothetical protein MGG_16425 [Magnaporthe oryzae 70-15]
gi|351650034|gb|EHA57893.1| hypothetical protein MGG_16425 [Magnaporthe oryzae 70-15]
Length = 583
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 176/383 (45%), Gaps = 61/383 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA-KPLPN 85
TPPMGWNS+NH++C E I++ S + L + ++ +T
Sbjct: 58 TPPMGWNSYNHYNCLPTEDDIKQ--------SAQGIVDLGFADLGYNIVTTDCGWNGRDR 109
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIESNI 125
QG N T FPSG KAL D++H L + SL I+S
Sbjct: 110 DEQGRQQWNATLFPSGGKALGDFIHSLGLKFGLYSGAGYFQCGSTDLPASLGFEEIDSKS 169
Query: 126 AFEQGIDYLKYDNCFND-----------DTRPTIRYRAMSDALKKAGRPIFFSMCEWG-- 172
E G D LKYDNC++ + + R+ M LK++GR I + +C+WG
Sbjct: 170 FAEWGGDTLKYDNCYSTSRTVMVDSDSAEAKSPERFLKMGRLLKESGRQIDYFLCQWGIG 229
Query: 173 -DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
++ + + SWR ++DI + W + I + +A H + G + D DML VG G
Sbjct: 230 ENVPEWQVATTVGKSWRMSNDIYNGWRSIWRITNQVIPHAKHTREGSYADMDMLTVGLGA 289
Query: 232 MKYNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ---------AI 280
M Y+E HF +W++ K+PL +G D + E++ I+ N+EVIA+NQ A+
Sbjct: 290 MSYDEERFHFGMWSMMKSPLHIGAPIDKSITSQESLDIMANKEVIAINQDPLSEAAQLAL 349
Query: 281 NDQQ----IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETKF 334
+ + +W G LS +R +V + + + S + +GI S + ++ L +
Sbjct: 350 RNTEGEWDVWIGNLSEDRKIVGVANWRNESQSVALDLSFLGIGSADARDVWAASDLGS-I 408
Query: 335 QGKLTAKVDGHSCKMYELTPVTK 357
G ++ GH K+ L+ V K
Sbjct: 409 SGTRQVELKGHELKLLILSNVQK 431
>gi|408395502|gb|EKJ74682.1| hypothetical protein FPSE_05150 [Fusarium pseudograminearum CS3096]
Length = 407
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 175/414 (42%), Gaps = 105/414 (25%)
Query: 15 DRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET-------------------DDCW 54
+R+ L + +G P +GWN WN C +EK+ T DDCW
Sbjct: 21 ERKTLATREVGRLPALGWNGWNQGQCNAASEKVALATAKTFINLGLKDAGYQYINIDDCW 80
Query: 55 GESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-- 112
RDS+ GNLV + + +P G K + D +H +
Sbjct: 81 STRQRDSK--------------------------GNLVPDPSKWPRGIKPVVDEIHKMGL 114
Query: 113 ----------------ISNSLDNRIESNIAFEQGIDYLKYDNCF---------------- 140
+ + ++++ G+DY KYDNC+
Sbjct: 115 KFGLYGDRGVKTCAGFPGSQGHEKQDADLLASWGVDYWKYDNCYTPCYNGNQADIQTCPI 174
Query: 141 --NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA 198
+RP Y M D L+ G+ I +S+C WG WG+++ + WR + D W
Sbjct: 175 GKGPSSRPG--YELMRDMLRNTGKDILYSLCNWGWDEVWTWGASVGHMWRMSVDNWGKWD 232
Query: 199 RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
++ IA+ + +PG +ND DM+ + NG + E HF+IWAI+K+P++LG D+
Sbjct: 233 DVVRIANQAAPILKYTQPGRYNDLDMMILANGALTPAEERTHFAIWAITKSPIILGTDMT 292
Query: 259 NLTAETMPIIGNEEVIAVNQ----------------AINDQQI---WAGPLSGNRIVVLL 299
+ ++ + +I N+ ++AVNQ A + +I W+GPL+ +V ++
Sbjct: 293 KINSDEIKLITNKGLLAVNQDSLSKPAVPFTPPGTPAKANNEIYPYWSGPLASGTVVAIV 352
Query: 300 ESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK-LTAKVDGHSCKMYEL 352
S+ + ++ G+K + + K +G+ + AK++ H ++ +
Sbjct: 353 ASKGNLDTTLSLS-QVPGLKDQDYSWTELLTGAKGRGRTVRAKLEKHDVAVFRI 405
>gi|325860245|ref|ZP_08173370.1| alpha-galactosidase [Prevotella denticola CRIS 18C-A]
gi|325482332|gb|EGC85340.1| alpha-galactosidase [Prevotella denticola CRIS 18C-A]
Length = 409
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 170/390 (43%), Gaps = 71/390 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
+ L LTPPMG+ +WN + I E++IR D +S + + +
Sbjct: 31 DSLELTPPMGFMTWNKYKDDIREQLIRRI--------ADRMVSAGYAEAGYKYIFIDDGW 82
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLD---------------NRIES 123
+ N++ + FPSG KALADYVH +L+ D + ++
Sbjct: 83 QGGRDKRNNIIPDPVKFPSGMKALADYVHSKGLLLGIYSDAARLTCAGYTGSYGFEKQDA 142
Query: 124 NIAFEQGIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGS 181
E GIDYLKYD C +D RYR M DAL+ +GR I +CEWG + P W
Sbjct: 143 KTFAEWGIDYLKYDYCHAPSDSAVAHQRYRKMGDALQDSGRKIALGVCEWGQLNPERWAR 202
Query: 182 NIRNS-WRTTDDISDSWAR--------MLTIADMNEVYADHAKPGGWNDPDML------- 225
S WR + D+ D W +L I D+ E +A PG W D DML
Sbjct: 203 QAGGSLWRVSYDVRDMWKDIVGQGGMGILDIIDITEPLYKYAGPGHWLDMDMLIVGLDGK 262
Query: 226 -----EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-A 279
++G G Y EY S+W + +PL + D+ + T I+ N+E+IA+NQ A
Sbjct: 263 GGPSSDLGGVGCTYTEYQTQMSMWCMFASPLAMSHDLLDENEATRRILLNKEMIAINQDA 322
Query: 280 INDQ----------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF---- 325
+ + +++ LSGNR+ + + + + IG K+ F
Sbjct: 323 LGEAARRVDFPGVCRVYLRRLSGNRLALAVMNPSDKPQSIQLPLSVIG-KAQKYAFRDVW 381
Query: 326 EHRTL--ETKFQGKLTAKVDGHSCKMYELT 353
EH+T + +QG L H K++ +T
Sbjct: 382 EHKTALWQKTWQGDLQP----HETKVFTVT 407
>gi|74225459|dbj|BAE31642.1| unnamed protein product [Mus musculus]
Length = 415
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C I+ + + +C IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNID--CVEDPKNC---------ISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDASGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVELDAETFAEWKVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + N WR DI DSW +L+I D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTEVSRVCNLWRNYKDIQDSWKSVLSILDWFVRHQDVLQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ Y+E ++W + APLL+ D+ ++ + M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSYDESRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|73969405|ref|XP_538347.2| PREDICTED: alpha-N-acetylgalactosaminidase [Canis lupus familiaris]
Length = 411
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 137/300 (45%), Gaps = 49/300 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCS 68
+L NGL PPMGW +W F C I+E++ E D ++D L +
Sbjct: 17 VLENGLLRKPPMGWLAWERFRCNTDCDEDPKNCISERLFLEMAD---HLAQDGWRDLGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + +G+L+ + FP+G LADY H L
Sbjct: 74 YLNIDD-CWIGGR----DAKGHLIPDPKRFPNGIAFLADYAHSLGLKLGIYEDMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD I+ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVIQDAQTFAEWKVDMLKLDGCFSTPEERAKGYPMMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + + I N WR DDI DSW+ +L+I D ++ A PG
Sbjct: 188 SWPAYEGGLPPKVNYTLLTEICNVWRNYDDIQDSWSSVLSILDWFVDNQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIKINQ 307
>gi|159155308|gb|AAI54864.1| naga protein [Xenopus (Silurana) tropicalis]
Length = 403
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 139/299 (46%), Gaps = 48/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS---------LSCSFH 70
L NGL TPPMGW +W + C I+ K + D+C GE+ S L +
Sbjct: 10 LDNGLVRTPPMGWMTWQRYRCNIDCK--SDPDNCIGENLIKSMADKMADSGWRDLGYVYV 67
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------ 119
D + G L + FPSG KALADYVH + I + + N
Sbjct: 68 CIDDCWSRKQR----DSNGRLEPDPERFPSGMKALADYVHAKGLKLGIYSDMGNYTCGGY 123
Query: 120 --------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+I+++ +D LK+D C+++ T + Y MS+AL GRPI +S C W
Sbjct: 124 PGTTLDTIKIDADTFASWEVDMLKFDGCYSNSTEKALGYPKMSEALNATGRPILYS-CSW 182
Query: 172 ----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
G + P + GS I N WR DI DSW +L I A +V A PG W
Sbjct: 183 PAYEGGLPPKVNYTQLGS-ICNMWRNYGDIQDSWDSVLDIIEWYAKNQDVLQPAAGPGRW 241
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDML G+ G+ Y + +IWAI AP ++ D+ ++ + ++ N +I +NQ
Sbjct: 242 NDPDMLITGDFGLSYEQSKSQLAIWAILAAPFIMSNDLRTISQDAKDLLQNRLLIYINQ 300
>gi|112419099|gb|AAI22060.1| naga protein [Xenopus (Silurana) tropicalis]
Length = 432
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 142/304 (46%), Gaps = 58/304 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W + C I+ K + D+C GE+ S M D +
Sbjct: 39 LDNGLVRTPPMGWMTWQRYRCNIDCK--SDPDNCIGENLIKS---------MADKMADSG 87
Query: 80 AKPLP------NQC--------QGNLVANKTTFPSGTKALADYVH-----ILISNSLDN- 119
+ L + C G L + FPSG KALADYVH + I + + N
Sbjct: 88 WRDLGYVYVCIDDCWSRKQRDSNGRLEPDPERFPSGMKALADYVHAKGLKLGIYSDMGNY 147
Query: 120 -------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+I+++ +D LK+D C+++ T + Y MS+AL GRPI +
Sbjct: 148 TCGGYPGTTLDTIKIDADTFASWEVDMLKFDGCYSNSTEKALGYPKMSEALNATGRPILY 207
Query: 167 SMCEW----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHA 214
S C W G + P + GS I N WR DI DSW +L I A +V A
Sbjct: 208 S-CSWPAYEGGLPPKVNYTQLGS-ICNMWRNYGDIQDSWDSVLDIIEWYAKNQDVLQPAA 265
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDML G+ G+ Y + +IWAI AP ++ D+ ++ + ++ N +I
Sbjct: 266 GPGRWNDPDMLITGDFGLSYEQSKSQLAIWAILAAPFIMSNDLRTISQDAKDLLQNRLLI 325
Query: 275 AVNQ 278
+NQ
Sbjct: 326 YINQ 329
>gi|189201571|ref|XP_001937122.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984221|gb|EDU49709.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 544
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 174/380 (45%), Gaps = 59/380 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TPPMGWNS+N ++CQ E I+ +S + I L +D +T L +
Sbjct: 24 TPPMGWNSYNTWNCQPTEDKIK--------TSANGLIELGFKDVGYDFVTIDCGWNLRER 75
Query: 87 -CQGNLVANKTTFPSGTKALADYVHILISNSL-----------DNRIESNIAFEQ----- 129
G L N T FPSG KAL DY+H L D+ + +++ +E+
Sbjct: 76 DAAGQLQWNTTRFPSGGKALGDYLHGLGLGFGLYSGAGYLQCGDDDLPASLGYEKIDAES 135
Query: 130 ----GIDYLKYDNCFN-------DDTRPT----IRYRAMSDALKKAGRPIFFSMCEWG-D 173
G D LKYDNC++ D + P R++ M++ L R I + +C+WG
Sbjct: 136 IAGWGGDSLKYDNCYSTSNTTLVDSSSPESQSPARFQRMAEELDAVDRDIRYYVCQWGIG 195
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
W + I N+WR ++DI W + I + Y H G + D DML VG +
Sbjct: 196 TDVGEWAAKIGNTWRISNDIYGKWRSIWRITNQVVPYFRHTTTGAFADMDMLIVGLNSLS 255
Query: 234 YNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQAINDQQ------- 284
E HF +WAI+K+PL++G D L ++ I+ N+E+IA+NQ +Q
Sbjct: 256 EEEERFHFGMWAINKSPLIMGAALDPSRLKNSSVEIMLNKEIIAINQDSLAKQAQLIRRD 315
Query: 285 ------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETKFQG 336
IW G LSG+R V+ + + + S + +GI S + ++ + T G
Sbjct: 316 TEGEWDIWMGELSGSRQVLGVANWRNDSQTVNVDLKSLGIASASARDVWAAKDAGT-VSG 374
Query: 337 KLTAKVDGHSCKMYELTPVT 356
T + GH +++ L+ +
Sbjct: 375 SQTLNLAGHELRIWVLSDIV 394
>gi|355706261|gb|AES02586.1| N-acetylgalactosaminidase [Mustela putorius furo]
Length = 410
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL PPMGW +W F C N C E + IS M D L Q
Sbjct: 17 VLENGLLRKPPMGWLAWERFRCNTN---------C--EEDPKNCISERLFMEMADHLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L +GNLV + FP+G LADY H L
Sbjct: 66 GWRDLGYIYLNIDDCWIGGRDAKGNLVPDPKRFPNGIAFLADYAHSLGLKLGIYEDMGNF 125
Query: 113 -----ISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD ++ F +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVVQDAQTFAAWKVDMLKLDGCFSTPKERAQGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAK 215
S C W G + P + + I N WR DDI DSW+ +L+I D ++ A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTLLAEICNLWRNYDDIQDSWSSVLSILDWFVDNQDILQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|49115142|gb|AAH72931.1| Nagab protein [Xenopus laevis]
Length = 431
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 138/319 (43%), Gaps = 88/319 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W + C I+ E +I+
Sbjct: 40 LDNGLVRTPPMGWMTWQRYRCNIDCKSDPDNCISENLIKIMADEMADSGWRDLGYVYVCI 99
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW + RDS G L + FPSG KALADYVH
Sbjct: 100 DDCWSQKQRDSN--------------------------GRLQPDPERFPSGMKALADYVH 133
Query: 111 -----ILISNSLDN--------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYR 151
+ I + + N + ++ +D LK+D C+++ T + Y
Sbjct: 134 AKGLKLGIYSDMGNYTCGGYPGTTLDTIKTDAETFASWEVDMLKFDGCYSNSTEKALGYP 193
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI 203
MS+AL GRPI +S C W G + P + GS I N WR DDI DSW +L I
Sbjct: 194 KMSEALNATGRPILYS-CSWPAYEGGLPPKVNYTQLGS-ICNMWRNYDDIDDSWDSVLDI 251
Query: 204 ----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
A +V A PG WNDPDML G+ G+ Y + +IWAI APL++ D+
Sbjct: 252 IEWYAKNQDVLQPAAGPGRWNDPDMLITGDFGLSYEQSKSQLAIWAILAAPLIMSNDLRA 311
Query: 260 LTAETMPIIGNEEVIAVNQ 278
++ + ++ N +I +NQ
Sbjct: 312 ISQDAKDLLQNRLLIYINQ 330
>gi|219283066|ref|NP_001107689.1| alpha-N-acetylgalactosaminidase precursor [Xenopus (Silurana)
tropicalis]
Length = 412
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 139/299 (46%), Gaps = 48/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS---------LSCSFH 70
L NGL TPPMGW +W + C I+ K + D+C GE+ S L +
Sbjct: 19 LDNGLVRTPPMGWMTWQRYRCNIDCK--SDPDNCIGENLIKSMADKMADSGWRDLGYVYV 76
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------ 119
D + G L + FPSG KALADYVH + I + + N
Sbjct: 77 CIDDCWSRKQR----DSNGRLEPDPERFPSGMKALADYVHAKGLKLGIYSDMGNYTCGGY 132
Query: 120 --------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+I+++ +D LK+D C+++ T + Y MS+AL GRPI +S C W
Sbjct: 133 PGTTLDTIKIDADTFASWEVDMLKFDGCYSNSTEKALGYPKMSEALNATGRPILYS-CSW 191
Query: 172 ----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
G + P + GS I N WR DI DSW +L I A +V A PG W
Sbjct: 192 PAYEGGLPPKVNYTQLGS-ICNMWRNYGDIQDSWDSVLDIIEWYAKNQDVLQPAAGPGRW 250
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDML G+ G+ Y + +IWAI AP ++ D+ ++ + ++ N +I +NQ
Sbjct: 251 NDPDMLITGDFGLSYEQSKSQLAIWAILAAPFIMSNDLRTISQDAKDLLQNRLLIYINQ 309
>gi|187607477|ref|NP_001119817.1| alpha-N-acetylgalactosaminidase precursor [Ovis aries]
gi|182382508|gb|ACB87536.1| N-acetylgalactosaminidase alpha [Ovis aries]
Length = 411
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/402 (28%), Positives = 171/402 (42%), Gaps = 80/402 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL PPMGW +W F C I DC S IS M D L Q +
Sbjct: 20 NGLLRKPPMGWLAWERFRCNI---------DC--NEDPKSCISEELFMEMADRLAQDGWR 68
Query: 82 PLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
L +GNLV ++ FP G LADY H L
Sbjct: 69 ELGYVYLNIDDCWIGGRDAKGNLVPDRKRFPHGIAFLADYAHSLGLKLGIYEDLGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D C++ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCYSSPQERAEGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I D +V A PG
Sbjct: 188 SWPAYEGGLPPKVNYTLLADICNLWRNFDDIQDSWRSVLSILDWFVAHQDVLQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPD+L +GN G+ + + ++W + APLL+ D+ ++A+ + I+ N +I +NQ
Sbjct: 248 WNDPDVLLIGNFGLSFEQARAQMALWTVLAAPLLMSTDLRTISAQNVDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK---SNNT 323
I ++I + PL ++ SR+T M H+ + S++
Sbjct: 308 DPLGIQGRRILKEKSYIEVYLRPLVNEASAIVFFSRRT---DMPYHYHSSLARLNFSSSV 364
Query: 324 LFEHRTLET-----KFQGK--LTAKVDGHSCKMYELTPVTKL 358
++E + + T Q K T ++ M+ L P++KL
Sbjct: 365 VYEAQDVYTGEIISGLQDKTNFTVIINPSGVVMWYLYPISKL 406
>gi|301788124|ref|XP_002929478.1| PREDICTED: alpha-galactosidase A-like [Ailuropoda melanoleuca]
gi|281341596|gb|EFB17180.1| hypothetical protein PANDA_019662 [Ailuropoda melanoleuca]
Length = 420
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 148/341 (43%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL +TP MGW W F C I+EK+ +
Sbjct: 32 LDNGLAMTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMQMAELMESEGWKDVGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS G L A+ FPSG + LADYVH
Sbjct: 92 DDCWMAPERDSN--------------------------GRLQADPKRFPSGIRGLADYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D + Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGHYDIDAKTFADWGVDLLKFDGCYCDSVEHLEKGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTI-- 203
MS AL + GR I +S CEW N IR N WR +D+ DSW + +I
Sbjct: 186 RMSLALNRTGRSIVYS-CEWPLYMWPFRKPNYTEIRQYCNHWRNFEDVYDSWQSVKSILA 244
Query: 204 --ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ + D A PGGWNDPDML +GN G+ +++ V ++WAI APLL+ D+ +++
Sbjct: 245 WTSSNQKRIVDVAGPGGWNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS 291
+ ++ +++VIA+NQ +Q +W PLS
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYRLRKEDNIEVWERPLS 345
>gi|444723793|gb|ELW64423.1| Alpha-N-acetylgalactosaminidase [Tupaia chinensis]
Length = 411
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 148/341 (43%), Gaps = 67/341 (19%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C DC E+ + IS M D L Q
Sbjct: 17 MLDNGLLRTPPMGWLAWERFRCNT---------DC--EADPKNCISEQLFMEMADRLAQD 65
Query: 79 AAKPLP------NQC-------QGNLVANKTTFPSGTKALADYVHIL------------- 112
+ + + C G LV + FP G LADY H L
Sbjct: 66 GWRDMGYTYLNIDDCWIGGRDDNGRLVPDLKRFPHGIAFLADYAHSLGLKLGIYEDLGNM 125
Query: 113 -----ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD + F E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVVPDAQTFAEWKVDMLKLDGCFSTRKERAQGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPALWGSNIRNS---WRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + +S WR DDI DSW+ +L+I D ++ A
Sbjct: 186 S-CSWPAYEGGLPPKVNYSLLADSCNLWRNYDDIQDSWSSVLSILDWFVKHQDILQPAAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNTDILQNPLMIK 304
Query: 276 VNQ------------AINDQQIWAGPLSGNRIVVLLESRKT 304
+NQ ++ +++ PLSG ++ SR+T
Sbjct: 305 INQDPLGIQGRRILKEKSNIEVFMRPLSGEASALVFFSRRT 345
>gi|353242838|emb|CCA74446.1| related to alpha-galactosidase precursor [Piriformospora indica DSM
11827]
Length = 490
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 145/290 (50%), Gaps = 44/290 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF----DSLTQIA 79
+G P +GWNSWN + C INE + +++ + L + + + D + I
Sbjct: 116 VGKLPALGWNSWNAYRCDINETKLLNA------ANQMVSLGLKAAGYQYVNIDDCWSNIN 169
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDNRIES------NIAFEQ- 129
+ Q ++ + FP+G ++A +H +L+ D ++ ++ +E
Sbjct: 170 GRDPSTQ---QILPDLNKFPNGMASVASKIHSLGLLLGIYSDAGTKTCSGYPGSLGYEAI 226
Query: 130 --------GIDYLKYDNC-----FNDD-TRP-------TIRYRAMSDALKKAGRPIFFSM 168
GIDYLKYDNC + D+ T P IRYR M AL RPI +S+
Sbjct: 227 DAATFSSWGIDYLKYDNCNVPSEWQDNWTYPDWGQSNSAIRYRQMGTALAAQSRPIQYSL 286
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
C WG + WG+++ SWR + D + +W+ + ++ D N D+ G +D DM+E+G
Sbjct: 287 CIWGAAQVWTWGASVGQSWRISGDSAPTWSYITSVIDRNVAIIDYTNFYGRSDMDMMEIG 346
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NG + E HF +WA K+P+LLG D+ L+ + + II N+E++A +Q
Sbjct: 347 NGDLTLAEERTHFLMWAALKSPILLGTDLSLLSTDQLNIIKNKELLAFSQ 396
>gi|297709035|ref|XP_002831245.1| PREDICTED: alpha-N-acetylgalactosaminidase [Pongo abelii]
Length = 411
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 149/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTTEEQAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPKVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++ + M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHKEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|189053|gb|AAA59902.1| alpha-N-acetylgalactosaminidase [Homo sapiens]
Length = 358
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 135/300 (45%), Gaps = 49/300 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ L GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEEAAQGYPKMAAGLNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLQADICNLWRNYDDIQDSWRSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
>gi|323343838|ref|ZP_08084065.1| alpha galactosidase [Prevotella oralis ATCC 33269]
gi|323095657|gb|EFZ38231.1| alpha galactosidase [Prevotella oralis ATCC 33269]
Length = 772
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 173/378 (45%), Gaps = 76/378 (20%)
Query: 8 AADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------- 50
A +S G+D R + + + LTPPMGWNSWN + ++++ + +
Sbjct: 380 AENSKGADERKFTICVGDKIALTPPMGWNSWNCWGLSVSQEKVMSSAQAMIDKGLTDYGY 439
Query: 51 -----DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKAL 105
DD W R++ +++ + + +A L GN + G +
Sbjct: 440 AYINVDDAWEAPQRNTDGTIAVN----EKFPDMAG--LGRWLHGNGLKFGIYSSPGDRTC 493
Query: 106 ADYVHILISNSLDNRIESNIAFEQGIDYLKYDNC-----FNDD----TRPTIR-YRAMSD 155
Y+ L LD R ++ G+DYLKYD C F+ + T +R Y M
Sbjct: 494 GGYLGSLDHEELDARTYNS----WGVDYLKYDWCGYSRVFDAEKDHSTAAYVRPYLKMER 549
Query: 156 ALKKAGRPIFFSMCEWGDMRPALWGSNI-RNSWRTTDDISDSWARMLTIA--DMNEVYAD 212
L+ R IF+S+C++G + WG + NSWRTT DI+D+W + I E+Y
Sbjct: 550 YLRLMPRDIFYSLCQYGMAKVWEWGHAVDANSWRTTGDITDTWESLYDIGFVQQAELYP- 608
Query: 213 HAKPGGWNDPDMLEVGNGGMKYN---------EYVVHFSIWAISKAPLLLGCDVGNLTAE 263
+A PG WNDPDML VG G N E H S+W + A +L+GCD+ +
Sbjct: 609 YAGPGHWNDPDMLIVGKVGWSANLRDTRLTPDEQYTHISLWTLLSANMLIGCDISQIDDF 668
Query: 264 TMPIIGNEEVIAVNQAI-----------NDQQIWAGPLS-GNRIVVLL-----ESRKTFS 306
T+ ++ N EV A+NQ + D QIWA PL+ G+ V + + R F+
Sbjct: 669 TLNLLCNNEVNAINQDVLGKQARRVVVDGDIQIWAKPLNDGSHAVGIFNVGTDDRRVDFA 728
Query: 307 SMMTAHWDDIGIKSNNTL 324
++D++GIK T+
Sbjct: 729 K----YFDEMGIKQLKTV 742
>gi|327287984|ref|XP_003228708.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Anolis
carolinensis]
Length = 407
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 136/312 (43%), Gaps = 49/312 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL TPPMGW W F C N + D+C E D L +
Sbjct: 19 LENGLLRTPPMGWVPWERFRCNTN--CAHDPDNCISEGLIKAMADRLVEDGWKELGYEYV 76
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------ 119
D A K P QG L + FPSG K+LADYVH I + L N
Sbjct: 77 NLDD-CWAAKKRDP---QGKLQPDPERFPSGIKSLADYVHSKGLKFGIYSDLGNATCAGY 132
Query: 120 --------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
++ G+D LK D CF+D + Y MS AL K GRPI FS C W
Sbjct: 133 PGTTLETIETDAQTFASWGVDMLKLDGCFSDSATKAVGYPKMSAALNKTGRPIAFS-CSW 191
Query: 172 ----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWN 220
G + P + I N WR DI DSW + I + + A PG WN
Sbjct: 192 PAYEGGLPPKVNYTLLGKICNLWRNYIDIEDSWDSLFRIIEWYGNNQDTLQPAAGPGRWN 251
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDML +G+ G+ + V +IWAI AP + C++ N++ E ++ N +++ ++Q
Sbjct: 252 DPDMLILGDFGLSLEQSKVQVAIWAILAAPFFMSCNLRNISDEAKGLLQNPQLLNISQDP 311
Query: 279 -AINDQQIWAGP 289
I +I+ P
Sbjct: 312 RGIQGSRIYKSP 323
>gi|292495625|sp|Q4WE86.2|AGALB_ASPFU RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|226536997|gb|ACO72591.1| alpha galactosidase precursor [Aspergillus fumigatus]
Length = 447
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 137/311 (44%), Gaps = 62/311 (19%)
Query: 24 LGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
+G P +GWN+WN F C I+ KI+ ++ +D L + D + K
Sbjct: 29 VGKLPALGWNTWNAFGCDIDATKIMTAANEVVNLGLKD----LGYEYINIDDCWSV--KS 82
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN-----------RIESN 124
+ ++ + FP G +AD +H I S L I++
Sbjct: 83 GRDASTQRIIPDPDKFPDGISGVADQIHDLGLKIGIYSSAGLTTCAGYPASLGYEDIDAQ 142
Query: 125 IAFEQGIDYLKYDNC-------------------------FNDDTRPTI----------- 148
E GIDYLKYDNC D+ P
Sbjct: 143 TFAEWGIDYLKYDNCGVPSNWTDTYTYCVPDPGSKATNGTCPDNKNPAPAGYDWRTSLTA 202
Query: 149 -RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
RYR M DAL R I +S+CEWG WG+ NSWRTT DI+ SW R+ IA+ N
Sbjct: 203 ERYRRMRDALVSVDRTILYSLCEWGQANVNDWGNETGNSWRTTGDITPSWPRIAAIANEN 262
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
+H G+ DPDMLEVGNG + E HF++WA K+PL++G + +++ + + I
Sbjct: 263 SFLMNHVDFWGYPDPDMLEVGNGNLTLAENRAHFALWAAMKSPLIIGTALDSISQDHLAI 322
Query: 268 IGNEEVIAVNQ 278
+ N+ ++ +Q
Sbjct: 323 LSNKILLKFHQ 333
>gi|413938382|gb|AFW72933.1| hypothetical protein ZEAMMB73_085808 [Zea mays]
Length = 596
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 143/304 (47%), Gaps = 50/304 (16%)
Query: 88 QGNLVANKTTFPSG--TKALADYVH-----ILISNSLDNRIESNIAFEQGIDYLKYDNCF 140
+GNL NK S +AL + H IL+ N+ + ++ N+A + K
Sbjct: 303 RGNLDDNKENVSSALSEEALDNSSHYGSENILLENNENVPLKENVALKVAKLQKKV---- 358
Query: 141 NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARM 200
++ + LKK P + + G PALW S + NSWRTTDDI+D+W M
Sbjct: 359 ---------HQEQAGKLKKTTNPRV--LWDGGQYDPALWASKVGNSWRTTDDITDTWKSM 407
Query: 201 LTIADMNEVYADHAKPGGWN--------------------DPDMLEVGNGGMKYNEYVVH 240
IAD N +A +A PGGWN DPDMLEVGNGGM EY H
Sbjct: 408 TDIADKNNKWASYAGPGGWNVLCGAVQYTVTLLLNLCFSVDPDMLEVGNGGMTLAEYRSH 467
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPI--IGNEEVIAVNQA-INDQQIWAGPLSGNRIVV 297
FSIWA+ K N ++ M I +G + + Q ++WAGPLSGNR+VV
Sbjct: 468 FSIWALMKRVDQRVLKNKNCSSLKMMIDPLGVQGRKNLGQGKYGCCEVWAGPLSGNRLVV 527
Query: 298 LLESRKTFSSMMTAHWDDIGIKSNNT-----LFEHRTLETKFQGKLTAKVDGHSCKMYEL 352
L +R + ++ +T +G+ L++H TL G A+VD H KMY
Sbjct: 528 ALWNRCSDTANVTMKLPAVGLDGYAAYYVRDLWKHETLSENVVGTFGAQVDVHDTKMYIF 587
Query: 353 TPVT 356
+P T
Sbjct: 588 SPAT 591
>gi|4557781|ref|NP_000253.1| alpha-N-acetylgalactosaminidase precursor [Homo sapiens]
gi|127801|sp|P17050.2|NAGAB_HUMAN RecName: Full=Alpha-N-acetylgalactosaminidase; AltName:
Full=Alpha-galactosidase B; Flags: Precursor
gi|178248|gb|AAA51677.1| alpha-N-acetylgalactosaminidase [Homo sapiens]
gi|189055|gb|AAA36351.1| alpha-N-acetylgalactosaminidase [Homo sapiens]
gi|1513067|gb|AAB06718.1| alpha-N-acetylgalactosaminidase [Homo sapiens]
gi|12652693|gb|AAH00095.1| N-acetylgalactosaminidase, alpha- [Homo sapiens]
gi|47678585|emb|CAG30413.1| NAGA [Homo sapiens]
gi|109451394|emb|CAK54558.1| NAGA [synthetic construct]
gi|109451990|emb|CAK54857.1| NAGA [synthetic construct]
gi|119580888|gb|EAW60484.1| N-acetylgalactosaminidase, alpha-, isoform CRA_a [Homo sapiens]
gi|119580889|gb|EAW60485.1| N-acetylgalactosaminidase, alpha-, isoform CRA_a [Homo sapiens]
gi|119580890|gb|EAW60486.1| N-acetylgalactosaminidase, alpha-, isoform CRA_a [Homo sapiens]
gi|123981942|gb|ABM82800.1| N-acetylgalactosaminidase, alpha- [synthetic construct]
gi|123996773|gb|ABM85988.1| N-acetylgalactosaminidase, alpha- [synthetic construct]
gi|261859664|dbj|BAI46354.1| N-acetylgalactosaminidase, alpha- [synthetic construct]
Length = 411
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/300 (31%), Positives = 136/300 (45%), Gaps = 49/300 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + ++I N WR DDI DSW +L+I + ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 307
>gi|219279757|ref|NP_001136428.1| N-acetylgalactosaminidase, alpha- precursor [Xenopus laevis]
Length = 410
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 138/319 (43%), Gaps = 88/319 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W + C I+ E +I+
Sbjct: 19 LDNGLVRTPPMGWMTWQRYRCNIDCKSDPDNCISENLIKIMADEMADSGWRDLGYVYVCI 78
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW + RDS G L + FPSG KALADYVH
Sbjct: 79 DDCWSQKQRDSN--------------------------GRLQPDPERFPSGMKALADYVH 112
Query: 111 -----ILISNSLDN--------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYR 151
+ I + + N + ++ +D LK+D C+++ T + Y
Sbjct: 113 AKGLKLGIYSDMGNYTCGGYPGTTLDTIKTDAETFASWEVDMLKFDGCYSNSTEKALGYP 172
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL----WGSNIRNSWRTTDDISDSWARMLTI 203
MS+AL GRPI +S C W G + P + GS I N WR DDI DSW +L I
Sbjct: 173 KMSEALNATGRPILYS-CSWPAYEGGLPPKVNYTQLGS-ICNMWRNYDDIDDSWDSVLDI 230
Query: 204 ----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
A +V A PG WNDPDML G+ G+ Y + +IWAI APL++ D+
Sbjct: 231 IEWYAKNQDVLQPAAGPGRWNDPDMLITGDFGLSYEQSKSQLAIWAILAAPLIMSNDLRA 290
Query: 260 LTAETMPIIGNEEVIAVNQ 278
++ + ++ N +I +NQ
Sbjct: 291 ISQDAKDLLQNRLLIYINQ 309
>gi|261824882|pdb|3H53|A Chain A, Crystal Structure Of Human Alpha-N-Acetylgalactosaminidase
gi|261824883|pdb|3H53|B Chain B, Crystal Structure Of Human Alpha-N-Acetylgalactosaminidase
gi|261824884|pdb|3H54|A Chain A, Crystal Structure Of Human
Alpha-N-Acetylgalactosaminidase,Complex With Galnac
gi|261824885|pdb|3H54|B Chain B, Crystal Structure Of Human
Alpha-N-Acetylgalactosaminidase,Complex With Galnac
gi|261824886|pdb|3H55|A Chain A, Crystal Structure Of Human
Alpha-N-Acetylgalactosaminidase, Complex With Galactose
gi|261824887|pdb|3H55|B Chain B, Crystal Structure Of Human
Alpha-N-Acetylgalactosaminidase, Complex With Galactose
gi|261824995|pdb|3IGU|A Chain A, Crystal Structure Of Human
Alpha-n-acetylgalactosaminidase, Covalent Intermediate
gi|261824996|pdb|3IGU|B Chain B, Crystal Structure Of Human
Alpha-n-acetylgalactosaminidase, Covalent Intermediate
gi|408535901|pdb|4DO4|A Chain A, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
gi|408535902|pdb|4DO4|B Chain B, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
gi|408535903|pdb|4DO5|A Chain A, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
gi|408535904|pdb|4DO5|B Chain B, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
gi|408535905|pdb|4DO6|A Chain A, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
gi|408535906|pdb|4DO6|B Chain B, Pharmacological Chaperones For Human
Alpha-N-Acetylgalactosaminidase
Length = 400
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 135/299 (45%), Gaps = 49/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L NGL TPPMGW +W F C IN E++ E D ++D + ++
Sbjct: 1 LDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYTY 57
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----------------- 112
D I + G L+ + FP G LADYVH L
Sbjct: 58 LNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCMG 112
Query: 113 -ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+LD ++ F E +D LK D CF+ Y M+ AL GRPI FS C
Sbjct: 113 YPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTPEERAQGYPKMAAALNATGRPIAFS-CS 171
Query: 171 W----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
W G + P + S +I N WR DDI DSW +L+I + ++ A PG W
Sbjct: 172 WPAYEGGLPPRVQYSLLADICNLWRNYDDIQDSWWSVLSILNWFVEHQDILQPVAGPGHW 231
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+ N +I +NQ
Sbjct: 232 NDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQNPLMIKINQ 290
>gi|440704554|ref|ZP_20885392.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
gi|440273754|gb|ELP62457.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
Length = 678
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 106/343 (30%), Positives = 155/343 (45%), Gaps = 62/343 (18%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN F +I+ +I+ D + S L + + + ++ + +
Sbjct: 60 PMGWASWNSFAAKIDYSVIKGQVDAFVASG------LPAAGYKYVNIDEGWWQGT-RDSA 112
Query: 89 GNLVANKTTFPSGTKALADYVH---------------------------ILISNSLDNRI 121
GN+ +++ +P G KA+ADY+H S S +
Sbjct: 113 GNITIDESEWPGGMKAIADYIHSKGLKAGIYTDAGKDGCGYYYPTGRPAAPGSGSEGHYD 172
Query: 122 ESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKA----GRPIFFSMCEWGDM 174
+ + F Q G D++K D C D P Y+++SDA+ KA GRP+ S+C WG
Sbjct: 173 QDMLQFSQWGFDFVKVDWCGGDAEGLDPATTYQSISDAITKATATTGRPMTLSLCNWGRQ 232
Query: 175 RPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
P W WRT DDI S + +LT D N V+ G +NDPDML VG
Sbjct: 233 NPWNWAPGQGAMWRTNDDIIIYGNKPSMSNLLTNYDRN-VHPTAQHTGYYNDPDMLMVGM 291
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G + H +WAIS APLL G ++ +T ET I+ N E++AV+Q
Sbjct: 292 DGFTAAQNRTHMDLWAISGAPLLAGNNLATMTTETANILKNPEIVAVDQDARGLQGVKVA 351
Query: 279 -AINDQQIWAGPLS--GNRIVVLLESRKTFSSMMTAHWDDIGI 318
Q++ LS GNR VVLL R + + MT W D+G+
Sbjct: 352 EDTTGLQVYGKVLSGTGNRAVVLLN-RTSSAQNMTVRWSDLGL 393
>gi|348569314|ref|XP_003470443.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Cavia porcellus]
Length = 413
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 149/341 (43%), Gaps = 67/341 (19%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C IN C E + IS M D L +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNIN---------C--EEDPKNCISEHLFMEMADRLAKD 65
Query: 79 AAKPLP------NQC-------QGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN- 119
+ L + C +G LV + FP G LADY H L I + N
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDTEGRLVPDSKRFPHGIAFLADYAHSLGLKLGIYADMGNL 125
Query: 120 -------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+ ++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLNKVKQDAQTFAEWKVDMLKLDGCFSTQEDRAKGYPMMASALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + +NI N WR DDI DSW +L+I D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTLLANICNLWRNYDDIQDSWQDVLSILDWFVRHQDVLQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ + I+ N +I
Sbjct: 245 PGHWNDPDMLIIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNVEILQNPLMIK 304
Query: 276 VNQ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
+NQ I ++I + PLS ++ SR+T
Sbjct: 305 INQDPLGIQGRRILKEKSHIEVFMRPLSEEASALVFLSRRT 345
>gi|148228315|ref|NP_001089687.1| galactosidase, alpha precursor [Xenopus laevis]
gi|76779733|gb|AAI06338.1| MGC130872 protein [Xenopus laevis]
Length = 408
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 147/348 (42%), Gaps = 98/348 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN------------------EKIIRE-----------T 50
L NGL LTPPMGW W F C+ N ++++ E
Sbjct: 22 LQNGLALTPPMGWLHWERFLCETNCERNPKNCISERLFMDMADQMVSEGWLDAGYRYLCI 81
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW + RD L A+ FP G K LADYVH
Sbjct: 82 DDCWLDRERDEN--------------------------NRLQADPVRFPGGIKKLADYVH 115
Query: 111 ---ILISNSLDNRIESNIAF---------------EQGIDYLKYDNCFNDDTRPTIR--Y 150
+L+ D ++ F E +D LK+D C N T + Y
Sbjct: 116 SRGLLLGIYQDVGTKTCEGFPGSQGYYDLDAQTFAEWEVDLLKFDGC-NYGTLDQLEDGY 174
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR------NSWRTTDDISDSWARMLTIA 204
R MS AL + GR I FS CEW + N NSWR DISDSW + +
Sbjct: 175 RQMSVALNRTGRKIVFS-CEWPLYERGIKKINYTEVAEYCNSWRNFGDISDSWGSVKAVM 233
Query: 205 DMNEVYADH----AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
++ + A PG WNDPDML +GN G+ +N+ V ++WAI APL++ D+ ++
Sbjct: 234 TLSSAVQEELVPVAGPGSWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLMMSNDLRDI 293
Query: 261 TAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVV 297
+AE+ ++ ++ VI++NQ ++ ++W PL + V
Sbjct: 294 SAESKALLQDKHVISINQDPLGAQGYRVFKADNFEVWEKPLIKGSLAV 341
>gi|351712920|gb|EHB15839.1| Alpha-galactosidase A, partial [Heterocephalus glaber]
Length = 423
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 150/324 (46%), Gaps = 62/324 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL TP MGW W F C +N + E D C E L S D +
Sbjct: 32 LNNGLAKTPTMGWLHWERFMCNLNCQ--EEPDSCISEQLFMQMAELMVS----DGWKDVG 85
Query: 80 AKPL--------PNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
+ L P + + L A+ FP G + LA+YVH + I + N+
Sbjct: 86 YEYLCIDDCWMAPQRDSEDRLQADPQRFPGGIRHLANYVHSKGLKLGIYADVGNKTCAGF 145
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
+++ + G+D LK+D C+ D + Y+ MS AL + GR I +S CEW
Sbjct: 146 PGSFGHYDLDAQTFADWGVDLLKFDGCYCDSIKHLENGYKYMSLALNRTGRSIVYS-CEW 204
Query: 172 GDMRPALWG------SNIR---NSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGG 218
LW + IR N WR D+SDSW + +I D E D A PGG
Sbjct: 205 PFY---LWPFHKPNYTTIRQYCNHWRNFVDVSDSWESIKSILDWTSFHQEKIVDVAGPGG 261
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ +++ V ++WAI APLL+ D+ ++ + ++ N++VIA+NQ
Sbjct: 262 WNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISPQAKDLLQNKDVIAINQ 321
Query: 279 AINDQQ-----------IWAGPLS 291
+Q +W PLS
Sbjct: 322 DPLGKQGYRLRKEDNIEVWERPLS 345
>gi|403282880|ref|XP_003932863.1| PREDICTED: alpha-N-acetylgalactosaminidase [Saimiri boliviensis
boliviensis]
Length = 411
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 149/338 (44%), Gaps = 61/338 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C I+E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNTNCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + F G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFSHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD I+ F E +D LK D CF+ D + Y M+ AL GRPI FS C
Sbjct: 129 GYPGTTLDKVIQDAQTFAEWKVDMLKLDGCFSTDEERALGYPKMAAALNATGRPIAFS-C 187
Query: 170 EW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W G + P + S +I N WR DDI DSW +L+I ++ A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLVDICNLWRNYDDIQDSWQSVLSILNWFVQHQDILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ + ++W + APL + D+ ++A+ M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIKINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DPLGIQGRRIHKEKSLIEVYMRPLSNKASALVFFSRRT 345
>gi|19547877|gb|AAL87527.1| alpha-N-acetyl-galactosaminidase [Mus musculus]
Length = 423
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C I+ + + +C IS M D L Q
Sbjct: 25 MLENGLLRTPPMGWLAWERFRCNID--CVEDPKNC---------ISERLFMEMADRLAQD 73
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 74 GWRDLGYVYLNIDDCWIGGRDASGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 133
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 134 TCMGYPGTTLDKVELDAETFAEWKVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAF 193
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + N WR DI DSW +L+I D +V A
Sbjct: 194 S-CSWPAYEGGLPPKVNYTEVSRVCNLWRNYKDIQDSWKSVLSILDWFVRHQDVLQPVAG 252
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ ++E ++W + APLL+ D+ ++ + M I+ N +I
Sbjct: 253 PGHWNDPDMLLIGNFGLSFDESRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIK 312
Query: 276 VNQ 278
+NQ
Sbjct: 313 INQ 315
>gi|291408079|ref|XP_002720395.1| PREDICTED: alpha-galactosidase A [Oryctolagus cuniculus]
Length = 426
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 150/320 (46%), Gaps = 54/320 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ-- 77
L NGL TP MGW W F C ++ + E + C E L S D+ +
Sbjct: 32 LDNGLARTPTMGWLHWERFLCNLDCQA--EPESCISEHLFMQMAELMVSDGWKDAGYEYL 89
Query: 78 ------IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------ 120
+A + P +G L A+ FP G + LADYVH + I + N+
Sbjct: 90 CIDDCWMAPERDP---EGRLQADPQRFPGGIRRLADYVHSKGLKLGIYADVGNKTCAGYP 146
Query: 121 -------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWG 172
I++ + G+D LK+D C D + I Y+ MS AL + GR I +S CEW
Sbjct: 147 GSFGHYDIDAQTFADWGVDLLKFDGCHCDSVKHLIEGYQYMSLALNRTGRSIVYS-CEWP 205
Query: 173 DMRPALWGSN---IR---NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
N IR N WR + DI DSW + +I + E D A PGGWNDP
Sbjct: 206 LYMWPFHKPNYTKIRTYCNHWRNSADIYDSWESIKSILAWTSSNQEKIVDVAGPGGWNDP 265
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIND 282
DML +GN G+ +N+ V ++WAI APL++ D+ +++ + ++ N++VIA+NQ
Sbjct: 266 DMLVIGNFGLSWNQQVTQMALWAIMAAPLIMSNDLRHISPQAKSLLQNKDVIAINQDPLG 325
Query: 283 QQ-----------IWAGPLS 291
+Q +W PLS
Sbjct: 326 KQGYLLRKESSIEVWERPLS 345
>gi|451851087|gb|EMD64388.1| carbohydrate-binding module family 35 protein [Cochliobolus sativus
ND90Pr]
Length = 1451
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 178/383 (46%), Gaps = 71/383 (18%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETD----DCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
TPPMGWNS+N ++C +E+ I E+ D +S + +++ C ++ D
Sbjct: 26 TPPMGWNSYNTWNCLPDEEKIHESARGLVDLGFQSVGYNFVTVDCGWNSKD--------- 76
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLD--------------NRIESNIAFE 128
G L N+T FPSG KAL D++H L LD + +++ FE
Sbjct: 77 --RDSNGKLRWNETLFPSGGKALGDFLHDL---GLDFGLYSGAGYLQCGSEALPASLGFE 131
Query: 129 Q---------GIDYLKYDNCFND-----------DTRPTIRYRAMSDALKKAGRPIFFSM 168
Q G D LKYDNC++ + + R++ M+ L R I + +
Sbjct: 132 QLDAESFAEWGGDRLKYDNCYSTSNTTMVDSSSAEAQSPARFQHMAAELDAVNRDIQYYV 191
Query: 169 CEWG-DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
C+WG W ++I +WR ++DI ++W + I + Y H G + D DML +
Sbjct: 192 CQWGIGTNVGDWAADIGTTWRMSNDIYNAWRSIWRITNQIVPYFRHTTTGAFADMDMLII 251
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ------- 278
G + E HF +WAI+K+PL++G D L+ ++ I+ N+EVIA+NQ
Sbjct: 252 GLKALSEEEERFHFGMWAINKSPLIMGAALDPNRLSQSSIDIMTNKEVIAINQDPLAKPA 311
Query: 279 ------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTL 330
++ IW G LSG++ ++ + + + S + + +GI S N ++ + L
Sbjct: 312 RLIQRNTESEWDIWLGELSGSKQILGVANWRNDSQTVELNLASLGIASANARDVWAAKDL 371
Query: 331 ETKFQGKLTAKVDGHSCKMYELT 353
G T + GH +++ L+
Sbjct: 372 -GAVSGSQTLDLAGHELRLWVLS 393
>gi|269914455|pdb|3HG3|A Chain A, Human Alpha-Galactosidase Catalytic Mechanism 2. Substrate
Bound
gi|269914456|pdb|3HG3|B Chain B, Human Alpha-Galactosidase Catalytic Mechanism 2. Substrate
Bound
gi|367460173|pdb|3TV8|A Chain A, Pharmacological Chaperoning In Human Alpha-Galactosidase
gi|367460174|pdb|3TV8|B Chain B, Pharmacological Chaperoning In Human Alpha-Galactosidase
Length = 404
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 149/353 (42%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 1 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI 60
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 61 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 94
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+ C+ D Y+
Sbjct: 95 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFAGCYCDSLENLADGYK 154
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 155 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 213
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 214 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 273
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLSG V + +R+
Sbjct: 274 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSGLAWAVAMINRQ 326
>gi|168693635|ref|NP_032695.3| alpha-N-acetylgalactosaminidase precursor [Mus musculus]
gi|67461084|sp|Q9QWR8.2|NAGAB_MOUSE RecName: Full=Alpha-N-acetylgalactosaminidase; AltName:
Full=Alpha-galactosidase B; Flags: Precursor
gi|3396057|gb|AAC28851.1| alpha-N-acetylgalactosaminidase [Mus musculus]
gi|19547879|gb|AAL87528.1| alpha-N-acetyl-galactosaminidase [Mus musculus]
gi|26345938|dbj|BAC36620.1| unnamed protein product [Mus musculus]
gi|148672569|gb|EDL04516.1| N-acetyl galactosaminidase, alpha, isoform CRA_c [Mus musculus]
Length = 415
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C I+ + + +C IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNID--CVEDPKNC---------ISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDASGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVELDAETFAEWKVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + N WR DI DSW +L+I D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTEVSRVCNLWRNYKDIQDSWKSVLSILDWFVRHQDVLQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ ++E ++W + APLL+ D+ ++ + M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFDESRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|70985246|ref|XP_748129.1| alpha-galactosidase [Aspergillus fumigatus Af293]
gi|292495605|sp|B0Y224.1|AGALB_ASPFC RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|66845757|gb|EAL86091.1| alpha-galactosidase [Aspergillus fumigatus Af293]
gi|159125948|gb|EDP51064.1| alpha-galactosidase [Aspergillus fumigatus A1163]
Length = 426
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 136/310 (43%), Gaps = 62/310 (20%)
Query: 25 GLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
G P +GWN+WN F C I+ KI+ ++ +D L + D + K
Sbjct: 9 GKLPALGWNTWNAFGCDIDATKIMTAANEVVNLGLKD----LGYEYINIDDCWSV--KSG 62
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN-----------RIESNI 125
+ ++ + FP G +AD +H I S L I++
Sbjct: 63 RDASTQRIIPDPDKFPDGISGVADQIHDLGLKIGIYSSAGLTTCAGYPASLGYEDIDAQT 122
Query: 126 AFEQGIDYLKYDNC-------------------------FNDDTRPTI------------ 148
E GIDYLKYDNC D+ P
Sbjct: 123 FAEWGIDYLKYDNCGVPSNWTDTYTYCVPDPGSKATNGTCPDNKNPAPAGYDWRTSLTAE 182
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
RYR M DAL R I +S+CEWG WG+ NSWRTT DI+ SW R+ IA+ N
Sbjct: 183 RYRRMRDALVSVDRTILYSLCEWGQANVNDWGNETGNSWRTTGDITPSWPRIAAIANENS 242
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+H G+ DPDMLEVGNG + E HF++WA K+PL++G + +++ + + I+
Sbjct: 243 FLMNHVDFWGYPDPDMLEVGNGNLTLAENRAHFALWAAMKSPLIIGTALDSISQDHLAIL 302
Query: 269 GNEEVIAVNQ 278
N+ ++ +Q
Sbjct: 303 SNKILLKFHQ 312
>gi|109131558|ref|XP_001093625.1| PREDICTED: alpha-galactosidase A [Macaca mulatta]
gi|355704999|gb|EHH30924.1| Alpha-galactosidase A [Macaca mulatta]
Length = 429
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 149/333 (44%), Gaps = 56/333 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS---------FH 70
L NGL TP MGW W F C ++ + E D C E L S +
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDSCISEKLFMEMAELMVSDGWKDAGYEYL 89
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + L +G L A+ FP G + LA+YVH + I + N+
Sbjct: 90 CIDDCWMAPQRDL----EGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGF 145
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D Y+ MS AL + GR I +S CEW
Sbjct: 146 PGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLEKLADGYKHMSLALNRTGRSIVYS-CEW 204
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
N IR N WR DI DSW + +I D E D A PGGWND
Sbjct: 205 PLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWND 264
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ +N+ V ++WAI APL + D+ +++ + ++ +++VIA+NQ
Sbjct: 265 PDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPL 324
Query: 282 DQQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + +R+
Sbjct: 325 GKQGYQLRQGDNFEVWERPLSDLAWAVAMINRQ 357
>gi|393234953|gb|EJD42511.1| glycoside hydrolase [Auricularia delicata TFB-10046 SS5]
Length = 431
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 152/347 (43%), Gaps = 77/347 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
NG+ P +G+N+WN F C ++E++I T DDCW R++
Sbjct: 21 NGVARLPVLGYNAWNAFGCNVDEELIIATAHAMIDKGLLKAGYNHINLDDCWALRERNAA 80
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTF--------PSGTKALADYVHILIS 114
+ F PN G LV N T +G + A Y
Sbjct: 81 GDIVPDPAKF-----------PNGIDG-LVKNLTALGFKVGIYSDAGLQTCAGY-----P 123
Query: 115 NSLDNRIESNIAFEQ-GIDYLKYDNCFNDD---TRPTI--RYRAMSDALKKAGR-----P 163
S +N AF + GI LKYDNC + TR I RY AM A++ + P
Sbjct: 124 GSWNNEARDIRAFNKWGISLLKYDNCNVPEDSLTRQNIMGRYEAMRKAIEAEAKRSRKPP 183
Query: 164 IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPD 223
I FS+CEWG + LWG ++ +SWRTT DI +W + +I + N G ND D
Sbjct: 184 IIFSLCEWGLQQVWLWGKDMGHSWRTTFDIGPNWGSLTSIINANSFITMATDFYGRNDLD 243
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
ML++GNG + +E HF+ WA+ K+ LL+G D+ N++ + + I+ N E++A+NQ
Sbjct: 244 MLQLGNGNLTLDEAKSHFTAWALMKSTLLIGTDLRNVSQDIVDILTNPEILAINQDPVVG 303
Query: 279 -----------------AINDQQIWAGPLSGNRIVVLLESRKTFSSM 308
+ Q W GP + +LL T + M
Sbjct: 304 TSIAPFRWGLNPDWTSDPAHPAQYWTGPTRDGVVFMLLNVLDTPAKM 350
>gi|355757548|gb|EHH61073.1| Alpha-galactosidase A [Macaca fascicularis]
Length = 429
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 149/333 (44%), Gaps = 56/333 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS---------FH 70
L NGL TP MGW W F C ++ + E D C E L S +
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDSCISEKLFMEMAELMVSDGWKDAGYEYL 89
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + L +G L A+ FP G + LA+YVH + I + N+
Sbjct: 90 CIDDCWMAPQRDL----EGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGF 145
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D Y+ MS AL + GR I +S CEW
Sbjct: 146 PGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLEKLADGYKHMSLALNRTGRSIVYS-CEW 204
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
N IR N WR DI DSW + +I D E D A PGGWND
Sbjct: 205 PLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWND 264
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ +N+ V ++WAI APL + D+ +++ + ++ +++VIA+NQ
Sbjct: 265 PDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPL 324
Query: 282 DQQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + +R+
Sbjct: 325 GKQGYQLRQGDNFEVWERPLSDLAWAVAMINRQ 357
>gi|321459356|gb|EFX70410.1| hypothetical protein DAPPUDRAFT_309434 [Daphnia pulex]
Length = 427
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 177/409 (43%), Gaps = 87/409 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C + K + ++C E + S + I
Sbjct: 19 LENGLARTPPMGWLAWERFRCNTDCK--NDPENCISEKLFMKMADIMVSEGYLSAGYNII 76
Query: 80 AKPLPNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLD------------- 118
+ + C G L A+ + FPSG KALADYVH S L
Sbjct: 77 SL---DDCWLDHNRSESGQLQADASRFPSGIKALADYVH---SKGLQFGMYEDYGTLTCG 130
Query: 119 ------NRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
N +E++ E G+DY+K D C+ D + Y L GRP+ +S C
Sbjct: 131 GYPGILNHLETDAKTFAEWGVDYVKLDGCYADPSEMDKGYPEFGTLLNLTGRPMVYS-CS 189
Query: 171 WGD------MRPAL-WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH-------AKP 216
W D ++P + + N WR DDI DSW + +I D Y D+ A P
Sbjct: 190 WPDYQLDKGIKPNYTFIGTVCNLWRNFDDIDDSWVSVSSIIDY---YGDNQDTLIPVAGP 246
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WNDPDML +GN G+ +++ ++WA+ +PL + D+ + E I+ N+ VIA+
Sbjct: 247 GHWNDPDMLIIGNFGLSFDQSRAQMAMWAVLASPLFMSVDLRTIRPEMKAILLNKNVIAI 306
Query: 277 NQ---AINDQQIW-----------AGPLSG---NRIVVLLESRKTFS-SMMTAHWDDIGI 318
NQ + ++I+ P+SG + VV L R + + ++ DIG+
Sbjct: 307 NQDPLGVQGRRIYKKSGVEIWIKSVEPISGADSSAAVVFLNQRTDGTPTRVSVILRDIGL 366
Query: 319 KSNNTLFEHRTLETKFQGK----------LTAKVDGHSCKMYELTPVTK 357
S LF + +E F GK LT V+ M + T TK
Sbjct: 367 TS---LFGY-NVEELFDGKNLGVYRPDDSLTVTVNPSGVVMIKCTVQTK 411
>gi|313204122|ref|YP_004042779.1| alpha-galactosidase [Paludibacter propionicigenes WB4]
gi|312443438|gb|ADQ79794.1| Alpha-galactosidase [Paludibacter propionicigenes WB4]
Length = 544
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 153/327 (46%), Gaps = 62/327 (18%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW SWN F +I+E +I+ G++ D+ +S + F +
Sbjct: 31 PLMGWASWNQFGPKIDEGVIK------GQA--DAMVSSGLAAVGFQYVNIDDGFFNKRYA 82
Query: 88 QGNLVANKTTFPSGTKALADYVHI------LISNSLDN---------------------R 120
G L + FP G K LADY+H S + +N +
Sbjct: 83 DGTLKIDSIKFPKGMKELADYIHSKGLKAGFYSEAGENTCGSQYSGQPGGVGGGLYGHDQ 142
Query: 121 IESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
++++ F+ G D+LK D C + RY A+ A+ GR + F++C W P
Sbjct: 143 QDADLFFKTWGFDFLKVDYCGGRVQKLDEQTRYTAIRKAIDNTGRDVNFNVCRW--QFPG 200
Query: 178 LWGSNIRNSWRTTDDIS------DSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
W + + +SWR ++DI+ W ++ I ++N+ A +A PG +ND DMLEVG G
Sbjct: 201 TWVTKLADSWRMSNDINFVPGSKAKWKSVIDIINLNKYLAQYASPGHYNDMDMLEVGRG- 259
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI----------- 280
M + E HF++W I +PL LGCD+ ++ ET I+ N+E+IA+NQ +
Sbjct: 260 MSFEEDKSHFTMWCILSSPLALGCDMTKMSNETKTILTNKEIIALNQDVAGLQAHLIYEG 319
Query: 281 NDQQIWAGPL----SGNRIVVLLESRK 303
+ QIWA L S +V +L +K
Sbjct: 320 DSVQIWAKNLKDRQSKEFVVAMLNLKK 346
>gi|74626990|sp|O94221.1|AGALB_PENSI RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|3821271|emb|CAA08915.1| alpha-galactosidase 1 [Penicillium simplicissimum]
Length = 435
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 160/374 (42%), Gaps = 85/374 (22%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G P +GWNSWN F C ++ KI+ ++ +D L + D +
Sbjct: 23 DGVGRLPALGWNSWNAFGCDVDSTKIMTAANEMVHLGLKD----LGYEYVNIDDCWSV-- 76
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN-------------RIE 122
K N ++ + FP G +AD VH L I +S +++
Sbjct: 77 KNTRNSTTQRIIPDTQKFPDGISGVADQVHQLGLKIGIYSSAGETTCAGYPASLGYEKVD 136
Query: 123 SNIAFEQGIDYLKYDNCF--------------------NDDTRPTI-------------- 148
+ E GIDYLKYDNC + T P +
Sbjct: 137 AEAFAEWGIDYLKYDNCGVPSNWTDQYSSCVPDGSNEPANGTCPGLSNPAPAGYDWTKSN 196
Query: 149 ---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
RY M DAL R I +S+C+WG WG+ NSWR + DIS +WAR+ IA+
Sbjct: 197 TFTRYTMMRDALLGQTRTILYSLCDWGQADVNTWGNETGNSWRMSGDISANWARIAQIAN 256
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
N ++ G DPDMLEVGNG + E HF++WAI K+PL++G + ++ +
Sbjct: 257 ENTFRMNYVGFWGHPDPDMLEVGNGDLTAAENRAHFALWAIMKSPLIIGTALDGISDANL 316
Query: 266 PIIGNEEVIAVNQ----------------------AINDQQIWAGPLSGNR-IVVLLESR 302
++ N+ +I NQ + + W+GP S + +VL+ +
Sbjct: 317 AVLKNKYLIEFNQDPIIGRSAHPYKWGYNPDWTFDPAHPAEYWSGPSSTLKGTLVLMLNS 376
Query: 303 KTFSSMMTAHWDDI 316
+ +S TA W +I
Sbjct: 377 ENSTSTRTAVWKEI 390
>gi|3327016|emb|CAA11703.1| alpha-N-acetylgalactosaminidase [Mus musculus]
Length = 415
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C I+ + + +C IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNID--CVEDPKNC---------ISDGLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDASGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVELDAETFAEWKVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + N WR DI DSW +L+I D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTEVSRVCNLWRNYKDIQDSWKSVLSILDWFVRHQDVLQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ ++E ++W + APLL+ D+ ++ + M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFDESRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|380792137|gb|AFE67944.1| alpha-galactosidase A precursor, partial [Macaca mulatta]
Length = 426
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 149/333 (44%), Gaps = 56/333 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS---------FH 70
L NGL TP MGW W F C ++ + E D C E L S +
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDSCISEKLFMEMAELMVSDGWKDAGYEYL 89
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + L +G L A+ FP G + LA+YVH + I + N+
Sbjct: 90 CIDDCWMAPQRDL----EGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGF 145
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D Y+ MS AL + GR I +S CEW
Sbjct: 146 PGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLEKLADGYKHMSLALNRTGRSIVYS-CEW 204
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
N IR N WR DI DSW + +I D E D A PGGWND
Sbjct: 205 PLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWND 264
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ +N+ V ++WAI APL + D+ +++ + ++ +++VIA+NQ
Sbjct: 265 PDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPL 324
Query: 282 DQQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + +R+
Sbjct: 325 GKQGYQLRQGDNFEVWERPLSDLAWAVAMINRQ 357
>gi|320167774|gb|EFW44673.1| melibiase [Capsaspora owczarzaki ATCC 30864]
Length = 405
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 104/307 (33%), Positives = 140/307 (45%), Gaps = 41/307 (13%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L I++ AA S + + L+NGL TPPMGW+SWN F C I+E +I + D S
Sbjct: 5 LAIIACVAALS--ATTVSALNNGLAKTPPMGWSSWNTFECDISETLIHQIADTMVSSGLA 62
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI--------- 111
+ D P G LV + T FPSG AL++Y+H
Sbjct: 63 KA---GFQYINLDDCWMSGRDP----TTGRLVPDATKFPSGMSALSEYIHSKGLKFGMYV 115
Query: 112 ---------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDD-TRPTIRYRAMSDALKKAG 161
+I++ E +DYLK D C D Y AMSDAL
Sbjct: 116 SAGDITCMGFAGTKGHEQIDAETLAEWNVDYLKMDCCNQTDLVEAHGVYVAMSDALLSTR 175
Query: 162 RPIFFSMCEWGDM--------RPALWGSNIRNSWRTTDDISDSWARMLTIADM--NEVYA 211
I FS C+ ++ P W N R DI D+W ++ I D N +YA
Sbjct: 176 HDILFS-CDTDELLMRMNNHEAPWDWAPGRCNVARIWLDIKDNWPNLMDIVDHASNVMYA 234
Query: 212 DHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNE 271
+ PG WND D+L VG GG +Y HFS+W + +PLLLG D+ N+T T+ I+
Sbjct: 235 --SGPGYWNDLDILTVGMGGQTDAQYRSHFSLWCLLGSPLLLGNDIRNMTLATLNILTAT 292
Query: 272 EVIAVNQ 278
EVIAV+Q
Sbjct: 293 EVIAVSQ 299
>gi|417400841|gb|JAA47343.1| Putative alpha-d-galactosidase melibiase [Desmodus rotundus]
Length = 432
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 145/321 (45%), Gaps = 56/321 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL +TP MGW W F C + K E D C E D + +
Sbjct: 37 LDNGLAMTPTMGWLHWERFMCNTDCK--EEPDSCISEKLFMQMADLMDSDGWKEVGYEYL 94
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D A + +G L A+ FPSG + LA+YVH + I + +
Sbjct: 95 CIDDCWMAAER----DSKGRLQADPIRFPSGIRHLANYVHSKGLKLGIYADVGKKTCAGF 150
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D Y+ MS AL K GR I +S CEW
Sbjct: 151 PGSFGYYDIDAETFADWGVDLLKFDGCYCDSVEQLADGYKHMSLALNKTGRSIVYS-CEW 209
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
N IR N WR + D+ DSW + +I D + A PGGWND
Sbjct: 210 PLYMWPFRKPNYTEIRQYCNHWRNSGDVYDSWQSVKSILDWTSSNQKTIVSAAGPGGWND 269
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ +++ + ++WA+ APLL+ D+ +++ + ++ N++VIA+NQ
Sbjct: 270 PDMLVIGNFGLSWDQQITQMALWAVMAAPLLMSNDLRHISLQAKMLLQNKDVIAINQDPL 329
Query: 282 DQQ-----------IWAGPLS 291
+Q +W PLS
Sbjct: 330 GKQGYLLRKEDNFEVWERPLS 350
>gi|333379204|ref|ZP_08470928.1| hypothetical protein HMPREF9456_02523 [Dysgonomonas mossii DSM
22836]
gi|332885472|gb|EGK05721.1| hypothetical protein HMPREF9456_02523 [Dysgonomonas mossii DSM
22836]
Length = 664
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 121/431 (28%), Positives = 182/431 (42%), Gaps = 100/431 (23%)
Query: 2 FIVSSSAADSNGSDRRNLL---SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWG--- 55
+ V A++ G D + +L + + LTPPMGWNSWN CWG
Sbjct: 254 YKVLLKASNDKGRDEKEVLFKIGDEIALTPPMGWNSWN----------------CWGLSV 297
Query: 56 --ESSRDSQISLSCSFHMFD-SLTQI--AAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
E RD+ ++ H + + I + QG +++N+ FP+ KAL DY+H
Sbjct: 298 DDEKVRDAARMMNDKLHAYGWTYVNIDDGWEAKERTKQGEILSNEK-FPN-FKALTDYIH 355
Query: 111 IL------------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPT 147
L + + I++ I + G+DYLKYD+C D +
Sbjct: 356 SLGLKFGIYSSPGHITCGGHVGSYQHEEIDAKIWEKWGVDYLKYDHCGYLEIQKDSEEKS 415
Query: 148 IR--YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLT 202
I+ Y M AL K R I + + G P +W + N WRTT DI+D W +
Sbjct: 416 IQEPYIVMRKALDKVNRDIVYCV---GYGAPNVWNWGEQAGGNQWRTTRDITDEWNVVTA 472
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV-----------VHFSIWAISKAPL 251
I +V A PG +NDPDML +G G + E V H S+W+I APL
Sbjct: 473 IGFFQDVCAPATAPGKYNDPDMLVIGKLGKGWGEKVHDSYLTADEQYSHLSLWSILSAPL 532
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ----------AINDQQIWAGPLSGNRIVVLL-- 299
L+GCD+ N+ T+ ++ N EVIAV+Q + Q+W L V L
Sbjct: 533 LIGCDMANIDDFTLNLLTNREVIAVDQDPLVAPAVKIMTENGQVWYKKLYDGSYAVGLFH 592
Query: 300 -------------ESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTAKVD 343
E+ + + M + +GI+ T L+ + L F+ K A +
Sbjct: 593 VDPYFILWDQDSAEAIQNETYKMQLDFSKLGIQGEVTVRDLWRQKDL-GNFKNKFQADIP 651
Query: 344 GHSCKMYELTP 354
H K ++TP
Sbjct: 652 YHGVKFVKVTP 662
>gi|195146804|ref|XP_002014374.1| GL18985 [Drosophila persimilis]
gi|194106327|gb|EDW28370.1| GL18985 [Drosophila persimilis]
Length = 411
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 91/297 (30%), Positives = 138/297 (46%), Gaps = 45/297 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISL------SCSFHMFD 73
L NGL TPPMGW SW F C N + + ++C E + L + + + F
Sbjct: 18 LDNGLAKTPPMGWLSWERFRC--NTDCVNDPENCISEKLFQTMTDLVVADGYASAGYEFI 75
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN--------- 124
++ + + G L+A++ FP+G KAL+DY+H S L I +
Sbjct: 76 NIDDCWLEKHRSH-DGKLLADRQRFPNGIKALSDYIH---SRGLKFGIYEDYGNFTCAGY 131
Query: 125 ---IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW- 171
I +EQ +DY+K D C++ + Y L G+P+ +S C W
Sbjct: 132 PGIIGYEQEDAFQFAEWNVDYVKLDGCYSLPLEMDLGYSTFGKLLNSTGKPMVYS-CSWP 190
Query: 172 -----GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
++P N WR DDI DSW + I D ++ A +A PG WND
Sbjct: 191 VYQIYAGIQPNFSAVKTYCNLWRNYDDIQDSWTSVENIIDYYGNNQDLIAPNAGPGHWND 250
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
PDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV+Q
Sbjct: 251 PDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKRILQNRKIIAVDQ 307
>gi|300794919|ref|NP_001179665.1| alpha-galactosidase A [Bos taurus]
gi|296471014|tpg|DAA13129.1| TPA: alpha-galactosidase A-like [Bos taurus]
Length = 439
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 141/306 (46%), Gaps = 46/306 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL +TP MGW W F C +N + E D C E + D + +
Sbjct: 37 LDNGLAMTPTMGWLHWERFMCNVNCQ--EEPDSCISEKLFLQIAEIMASDGWKDVGYEYI 94
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + +G L A+ FP G LADYVH + I + N+
Sbjct: 95 CIDDCWMAPER----DSKGRLQADPKRFPGGIHRLADYVHSKGLKLGIYADVGNKTCAGF 150
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D + Y+ MS AL + GR I +S CEW
Sbjct: 151 PGSFGYYDIDAQTFADWGVDLLKFDGCYCDSIQHLAEGYKQMSLALNRTGRSIVYS-CEW 209
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
++ N IR N WR DI DSW + D ++ A PGGWND
Sbjct: 210 PLYVWPIFKPNYTEIREYCNHWRNFADIYDSWESIRKTLDWTSSNQKIIVPVAGPGGWND 269
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AI 280
PDML +GN G+ ++ + ++WAI APLL+ D+ +++ E ++ +++VIA+NQ +
Sbjct: 270 PDMLVIGNFGLSRDQQITQMALWAIMAAPLLMSNDLRHISPEAKALLQDKDVIAINQDRL 329
Query: 281 NDQQIW 286
Q W
Sbjct: 330 GKQGYW 335
>gi|440901706|gb|ELR52598.1| Alpha-galactosidase A, partial [Bos grunniens mutus]
Length = 423
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 141/306 (46%), Gaps = 46/306 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL +TP MGW W F C +N + E D C E + D + +
Sbjct: 36 LDNGLAMTPTMGWLHWERFMCNVNCQ--EEPDSCISEKLFLQIAEIMASDGWKDVGYEYI 93
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + +G L A+ FP G LADYVH + I + N+
Sbjct: 94 CIDDCWMAPER----DSKGRLQADPKRFPGGIHRLADYVHSKGLKLGIYADVGNKTCAGF 149
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C+ D + Y+ MS AL + GR I +S CEW
Sbjct: 150 PGSFGYYDIDAQTFADWGVDLLKFDGCYCDSIQHLAEGYKQMSLALNRTGRSIVYS-CEW 208
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
++ N IR N WR DI DSW + D ++ A PGGWND
Sbjct: 209 PLYVWPIFKPNYTEIREYCNHWRNFADIYDSWESIRKTLDWTSSNQKIIVPVAGPGGWND 268
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AI 280
PDML +GN G+ ++ + ++WAI APLL+ D+ +++ E ++ +++VIA+NQ +
Sbjct: 269 PDMLVIGNFGLSRDQQITQMALWAIMAAPLLMSNDLRHISPEAKALLQDKDVIAINQDRL 328
Query: 281 NDQQIW 286
Q W
Sbjct: 329 GKQGYW 334
>gi|348605144|ref|NP_001102290.2| alpha-galactosidase A [Rattus norvegicus]
gi|149055434|gb|EDM07018.1| galactosidase, alpha (mapped), isoform CRA_a [Rattus norvegicus]
Length = 421
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 144/321 (44%), Gaps = 56/321 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TP MGW W F C ++ + E D C E L S D+ +
Sbjct: 34 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDACISEQLFMQMAELMVSDGWRDAGYEYL 91
Query: 80 A-----KPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
+G L A+ FPSG K LA+YVH + I + +
Sbjct: 92 CIDDCWMAPERDSKGRLQADPKRFPSGIKHLANYVHSKGLKLGIYADVGKKTCAGFPGSF 151
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + GID LK+D C D Y+ MS AL + GR I +S CEW
Sbjct: 152 GSYDIDAQTFADWGIDLLKFDGCHCDSVVSLANGYKYMSLALNRTGRSIVYS-CEWPLYL 210
Query: 176 PALWGSNIR------NSWRTTDDISDSWARMLTI--------ADMNEVYADHAKPGGWND 221
N N WR DD+ DSW + +I D+ EV A PGGWND
Sbjct: 211 RPFHKPNYTDIQYYCNHWRNFDDVYDSWESIKSILAWTVTHQKDIVEV----AGPGGWND 266
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDML +GN G+ +++ V ++WAI APLL+ D+ ++++ ++ NE+VIA+NQ
Sbjct: 267 PDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISSQAKALLQNEDVIAINQDPL 326
Query: 279 --------AINDQQIWAGPLS 291
N+ ++W PLS
Sbjct: 327 GKQGYCFRKENEIEVWERPLS 347
>gi|367055814|ref|XP_003658285.1| glycoside hydrolase family 27 protein [Thielavia terrestris NRRL
8126]
gi|347005551|gb|AEO71949.1| glycoside hydrolase family 27 protein [Thielavia terrestris NRRL
8126]
Length = 402
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 114/408 (27%), Positives = 180/408 (44%), Gaps = 102/408 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET-------------------DDCWGESSR 59
L NGLG TP MGW+SWN C + K +T DDCW SR
Sbjct: 21 LDNGLGRTPLMGWSSWNVAQCNAASAKYALDTANKFVSLGLKDLGYQYINIDDCWSTKSR 80
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-ISNSL- 117
++ G LV + + +P G K +AD +H + + L
Sbjct: 81 NAS--------------------------GKLVPDPSKWPDGIKPVADKIHSMGLKFGLY 114
Query: 118 --------------DNRIESNIA--FEQGIDYLKYDNCFN---DDTRP----------TI 148
++ S++A E G+D+ K+DNC+ D+ RP
Sbjct: 115 GDAGQMTCAGYPGSESHEASDVAQLVEWGVDFWKFDNCYTPCLDNPRPQTCSRPAGSTKT 174
Query: 149 RYRAMSDALKKAG--RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM 206
Y M DA+ A R I+F++C WG WG+ NSWR ++D +A + IA +
Sbjct: 175 WYTPMRDAIVGAQKLRNIYFNLCNWGRDNVWTWGAQYGNSWRISEDNWGDFASVARIASI 234
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
++ PGG+ND DML +G+ + NE +HF +WAI+K+PL+LG D+ N++ +
Sbjct: 235 AAGIYQYSAPGGFNDLDMLYIGSPKLTTNEERLHFGLWAITKSPLVLGLDLDNISDTRLA 294
Query: 267 IIGNEEVIAVNQ----------------AINDQQI---WAGPLSGNRIVVLLESRKTFSS 307
II N+ +I +NQ A Q+ WAGPLS ++ L T +
Sbjct: 295 IIRNKGIIDINQDPLGKAATTFRPPGAPAPVSGQVYPYWAGPLSDGVVIGLCAG--TAAG 352
Query: 308 MMTAHWDDI-GIKSNN-TLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
T ++ D+ G+ + T E T +T ++ +D H ++ ++T
Sbjct: 353 TYTVNFTDVPGLGAGTYTWKEMYTGQTGNGTSVSFNIDLHDMRVIKVT 400
>gi|344306872|ref|XP_003422107.1| PREDICTED: alpha-galactosidase A [Loxodonta africana]
Length = 436
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 103/332 (31%), Positives = 149/332 (44%), Gaps = 54/332 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL +TP MGW W F C ++ E D C E L S D+ Q
Sbjct: 35 LDNGLAMTPTMGWLHWERFMCNVD--CTEEPDSCISEQLFMQMADLMVSDGWKDAGYQYL 92
Query: 80 A-----KPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
+G L A+ FP G LA+YVH + I + +
Sbjct: 93 CIDDCWMAPTRDSKGRLQADPERFPGGIHHLANYVHSKGLKLGIYADVGQKTCAGYPGSF 152
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + +D LK+D CF D + Y+ MS AL + GR I +S CEW
Sbjct: 153 GYYDIDAKTFADWEVDLLKFDGCFCDSVKHLADGYKHMSLALNRTGRSIVYS-CEW---P 208
Query: 176 PALWG------SNIR---NSWRTTDDISDSWARMLTIADMN----EVYADHAKPGGWNDP 222
LW + IR N WR DI DSW + I D + + D A PGGWNDP
Sbjct: 209 LYLWPFHKPNYTEIRQYCNHWRNFGDIFDSWQSVKGILDWSSSNQDTIVDVAGPGGWNDP 268
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIND 282
DML +GN G+ +++ V ++WAI APLL+ D+ +++ + ++ +++VIA+NQ I
Sbjct: 269 DMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRHISPQAKALLQDKDVIAINQDILG 328
Query: 283 QQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + S +
Sbjct: 329 KQGYLLRKEDKFEVWERPLSNLAWAVAIRSLR 360
>gi|115670835|ref|XP_786834.2| PREDICTED: alpha-N-acetylgalactosaminidase-like [Strongylocentrotus
purpuratus]
Length = 430
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/334 (29%), Positives = 148/334 (44%), Gaps = 59/334 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS---------FH 70
L NGL TPPMGW SW F C + +R+ D+C E + L S F
Sbjct: 25 LDNGLARTPPMGWCSWERFRCDTD--CVRDPDNCVSEHLYKTMADLVVSEGYKDLGYEFI 82
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ- 129
D + + G L + T FP+G KALADYVH S L I ++ +
Sbjct: 83 NMDDCWMASTR----DSNGRLYGDPTRFPNGMKALADYVH---SKGLKLGIYESMGYATC 135
Query: 130 --------------------GIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSM 168
GID +K D C T + S AL GRPI +S
Sbjct: 136 QKLPGTFGHIETDAQTFADWGIDMVKMDTCHTPSVELTGEGFMNFSRALNGTGRPIVYS- 194
Query: 169 CEWGDMRPALWG--SNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
CEW ++ + + + N++R DI DSW ++TI A +++ + + PG ++DP
Sbjct: 195 CEWAHVQSSNFSIIAETCNTFRNYIDIQDSWTNVMTIIEFFASKQDIFTNVSGPGSYSDP 254
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DML VG+ + ++ ++W+I A L++ D+ L I+ N+EVIAVNQ
Sbjct: 255 DMLIVGDYSLSIDQSKAQMALWSIMAAQLMMSNDLRTLAPWAKEILQNKEVIAVNQDPLG 314
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKT 304
++WA PLSG ++ S++T
Sbjct: 315 IMGKRVFTTAAKIEVWAKPLSGESFAAVVFSKRT 348
>gi|444512124|gb|ELV10037.1| Alpha-galactosidase A [Tupaia chinensis]
Length = 408
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/317 (30%), Positives = 145/317 (45%), Gaps = 49/317 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS----- 74
L NGL +TP MGW W F C ++ + E D C E L S DS
Sbjct: 18 LDNGLAMTPTMGWLHWERFVCNLDCQ--EEPDSCISEQLFMQMADLMVSDGWKDSGYTYL 75
Query: 75 -LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-------- 120
+ P +G L A+ FP G + LA+YVH + I + N+
Sbjct: 76 CIDDCWMAP-ERDSKGRLQADPQRFPGGIRRLANYVHSKGLKLGIYADVGNKTCEGFPGS 134
Query: 121 -----IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + +D LK+D CF D Y+ MS AL + GR I +S CEW
Sbjct: 135 FGYYDIDAQTFADWEVDLLKFDGCFCDMEHLVYGYKYMSLALNRTGRSIVYS-CEWPFYL 193
Query: 176 PALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDML 225
+ N IR N WR DISDSW + + D + D A PGGWNDPDML
Sbjct: 194 WPYYKPNYTEIRQYCNHWRNFADISDSWQSVKSTLDWTSSNQDKIVDFAGPGGWNDPDML 253
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ- 284
+GN G+ +++ V ++WAI APL + D+ +++ + ++ +++VIA+ + + +Q
Sbjct: 254 VIGNFGLSWDQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAIKKDVLGKQG 313
Query: 285 ----------IWAGPLS 291
+W PLS
Sbjct: 314 YRLRKEDNVEVWERPLS 330
>gi|18203750|gb|AAH21631.1| N-acetyl galactosaminidase, alpha [Mus musculus]
Length = 415
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 132/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C I+ + + +C IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNID--CVEDPKNC---------ISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDASGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVELDAETFAEWKVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + S + N WR DI DSW +L+I D +V A
Sbjct: 186 S-CSWPAYEGGLPPKVNYTEVSRVCNLWRNYKDIQDSWKSVLSILDWFVRHQDVPQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ ++E ++W + APLL+ D+ ++ + M I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFDESRAQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|53711562|ref|YP_097554.1| alpha-galactosidase [Bacteroides fragilis YCH46]
gi|375356663|ref|YP_005109435.1| putative exported alpha-galactosidase [Bacteroides fragilis 638R]
gi|383116573|ref|ZP_09937321.1| hypothetical protein BSHG_1354 [Bacteroides sp. 3_2_5]
gi|423259385|ref|ZP_17240308.1| hypothetical protein HMPREF1055_02585 [Bacteroides fragilis
CL07T00C01]
gi|423263643|ref|ZP_17242646.1| hypothetical protein HMPREF1056_00333 [Bacteroides fragilis
CL07T12C05]
gi|423282466|ref|ZP_17261351.1| hypothetical protein HMPREF1204_00889 [Bacteroides fragilis HMW
615]
gi|52214427|dbj|BAD47020.1| alpha-galactosidase precursor [Bacteroides fragilis YCH46]
gi|251948153|gb|EES88435.1| hypothetical protein BSHG_1354 [Bacteroides sp. 3_2_5]
gi|301161344|emb|CBW20884.1| putative exported alpha-galactosidase [Bacteroides fragilis 638R]
gi|387776965|gb|EIK39065.1| hypothetical protein HMPREF1055_02585 [Bacteroides fragilis
CL07T00C01]
gi|392707065|gb|EIZ00185.1| hypothetical protein HMPREF1056_00333 [Bacteroides fragilis
CL07T12C05]
gi|404582034|gb|EKA86729.1| hypothetical protein HMPREF1204_00889 [Bacteroides fragilis HMW
615]
Length = 530
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 48/286 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INE++I+ET D+ ++ + +
Sbjct: 29 TPMMGWSSWNTFRVHINEELIKET--------ADAMVNRGLKDVGYGYVNIDDGYFGGRN 80
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----------------- 129
+G L ANK FP+G + L+DY+H S L I S+
Sbjct: 81 SEGRLFANKKKFPNGMRVLSDYIH---SKGLKAGIYSDAGSNTCGSIYDADTLGIGVGLW 137
Query: 130 --------------GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWG 172
G D++K D C + T + RY + A+++ GR + +++C W
Sbjct: 138 KHDDIDCQTFLKDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAIRRTGRTDVRYNICRW- 196
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W + + SWR DI+ + + I + N A +A PG +ND DMLEVG G +
Sbjct: 197 -QFPGTWATQLAGSWRIHTDINPRFTTIDRIIERNLYLAPYASPGHYNDMDMLEVGRG-L 254
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+E HF IW+I +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 255 TEDEEKTHFGIWSILSSPLMIGCDLRTIPEKTLSIITNKEVIALNQ 300
>gi|60679821|ref|YP_209965.1| alpha-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60491255|emb|CAH06003.1| putative exported alpha-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 530
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 48/286 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INE++I+ET D+ ++ + +
Sbjct: 29 TPMMGWSSWNTFRVHINEELIKET--------ADAMVNRGLKDVGYGYVNIDDGYFGGRN 80
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----------------- 129
+G L ANK FP+G + L+DY+H S L I S+
Sbjct: 81 SEGRLFANKKKFPNGMRVLSDYIH---SKGLKAGIYSDAGSNTCGSIYDADTLGIGVGLW 137
Query: 130 --------------GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWG 172
G D++K D C + T + RY + A+++ GR + +++C W
Sbjct: 138 KHDDIDCQTFLKDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAIRRTGRTDVRYNICRW- 196
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W + + SWR DI+ + + I + N A +A PG +ND DMLEVG G +
Sbjct: 197 -QFPGTWATQLAGSWRIHTDINPRFTTIDRIIERNLYLAPYASPGHYNDMDMLEVGRG-L 254
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+E HF IW+I +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 255 TEDEEKTHFGIWSILSSPLMIGCDLRTIPEKTLSIITNKEVIALNQ 300
>gi|347832799|emb|CCD48496.1| glycoside hydrolase family 27 protein [Botryotinia fuckeliana]
Length = 454
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 180/405 (44%), Gaps = 80/405 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS--LSCSFHMFDSLTQ 77
L+N L LTPPMGW+SWN F I+E +I+ S+ D+ IS L + ++ +L
Sbjct: 55 LNNSLALTPPMGWSSWNQFGDLIDENLIK--------STIDAMISNGLKDAGFVYVNLDD 106
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNR 120
+ + L A+ FP+G KA+ADY H + SL +
Sbjct: 107 GWQRYKGPRSANPLEADPVKFPNGIKAVADYAHERGFKLGIYSGPGDETCAGYTGSLGHE 166
Query: 121 IE-SNIAFEQGIDYLKYDNCFNDDTRPTIRYR----AMSDALKKAGRPIFFSMCEWGDMR 175
E +N+ GID+LKYD+C + + P + MS AL +GRPI + C G
Sbjct: 167 EEDANMFASWGIDHLKYDSCCSHEDAPKSVVQEIVLKMSKALLASGRPIVYHACHCGWAD 226
Query: 176 PALWGSNI-RNSWRTTDDISDS----------WARMLTIADMNEVYADHAKPGGWNDPDM 224
W ++ N WR DISD + +L + D ++ PG WND DM
Sbjct: 227 IWEWAADEGANQWRIGQDISDEFNYPGNREKYYFDVLDMIDRGNELVQYSGPGHWNDYDM 286
Query: 225 LEVG-NG--------GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
L +G NG G EY HFS+WA+ +PLL+G DV L + + N+E+I
Sbjct: 287 LIIGLNGNSTQLVGTGASSIEYRTHFSMWAMVASPLLIGSDVRILDTYELQTLTNKEIIE 346
Query: 276 VNQ---------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMT--------AH 312
+NQ A D Q++A ++ + L +R TF++ M+
Sbjct: 347 INQDPLGISAQTVGVGYEADGDLQVYAKEMADGSYAIALLNRGTFTAEMSFSPRRDILMT 406
Query: 313 WDDIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
WD I+ L++H+ E + T +V H K+ + V K
Sbjct: 407 WDHYRIRD---LWKHK--EGLYDVPYTVEVMPHEAKVLRVWEVQK 446
>gi|154319279|ref|XP_001558957.1| hypothetical protein BC1G_02591 [Botryotinia fuckeliana B05.10]
Length = 454
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 180/405 (44%), Gaps = 80/405 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS--LSCSFHMFDSLTQ 77
L+N L LTPPMGW+SWN F I+E +I+ S+ D+ IS L + ++ +L
Sbjct: 55 LNNSLALTPPMGWSSWNQFGDLIDENLIK--------STIDAMISNGLKDAGFVYVNLDD 106
Query: 78 IAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNR 120
+ + L A+ FP+G KA+ADY H + SL +
Sbjct: 107 GWQRYKGPRSANPLEADPVKFPNGIKAVADYAHERGFKLGIYSGPGDETCAGYTGSLGHE 166
Query: 121 IE-SNIAFEQGIDYLKYDNCFNDDTRPTIRYR----AMSDALKKAGRPIFFSMCEWGDMR 175
E +N+ GID+LKYD+C + + P + MS AL +GRPI + C G
Sbjct: 167 EEDANMFASWGIDHLKYDSCCSHEDAPKSVVQEIVLKMSKALLASGRPIVYHACHCGWAD 226
Query: 176 PALWGSNI-RNSWRTTDDISDS----------WARMLTIADMNEVYADHAKPGGWNDPDM 224
W ++ N WR DISD + +L + D ++ PG WND DM
Sbjct: 227 IWEWAADEGANQWRIGQDISDEFNYPGNREKYYFDVLDMIDRGNELVQYSGPGHWNDYDM 286
Query: 225 LEVG-NG--------GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
L +G NG G EY HFS+WA+ +PLL+G DV L + + N+E+I
Sbjct: 287 LIIGLNGNSTQLVGTGASNIEYRTHFSMWAMVASPLLIGSDVRILDTYELQTLTNKEIIE 346
Query: 276 VNQ---------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMT--------AH 312
+NQ A D Q++A ++ + L +R TF++ M+
Sbjct: 347 INQDPLGISAQTVGVGYEADGDLQVYAKEMADGSYAIALLNRGTFTAEMSFSPRRDILMT 406
Query: 313 WDDIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
WD I+ L++H+ E + T +V H K+ + V K
Sbjct: 407 WDHYRIRD---LWKHK--EGLYDVPYTVEVMPHEAKILRVWEVQK 446
>gi|336407757|ref|ZP_08588253.1| hypothetical protein HMPREF1018_00268 [Bacteroides sp. 2_1_56FAA]
gi|335944836|gb|EGN06653.1| hypothetical protein HMPREF1018_00268 [Bacteroides sp. 2_1_56FAA]
Length = 530
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 135/286 (47%), Gaps = 48/286 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INE++I+ET D+ ++ + +
Sbjct: 29 TPMMGWSSWNTFRVHINEELIKET--------ADAMVNRGLKDVGYGYVNIDDGYFGGRN 80
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----------------- 129
+G L ANK FP+G + L+DY+H S L I S+
Sbjct: 81 SEGRLFANKKKFPNGMRVLSDYIH---SKGLKAGIYSDAGSNTCGSIYDADTLGIGVGLW 137
Query: 130 --------------GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWG 172
G D++K D C + T + RY + A+++ GR + +++C W
Sbjct: 138 KHDDIDCQTFLKDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAIRRTGRTDVRYNICRW- 196
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W + + SWR DI+ + + I + N A +A PG +ND DMLEVG G +
Sbjct: 197 -QFPGTWATQLAGSWRIHTDINPRFTTIDRIIERNLYLAPYASPGHYNDMDMLEVGRG-L 254
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+E HF IW+I +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 255 TEDEEKTHFGIWSILSSPLMIGCDLRTIPEKTLSIITNKEVIALNQ 300
>gi|119499125|ref|XP_001266320.1| alpha-galactosidase [Neosartorya fischeri NRRL 181]
gi|292495601|sp|A1D0A3.1|AGALB_NEOFI RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|119414484|gb|EAW24423.1| alpha-galactosidase [Neosartorya fischeri NRRL 181]
Length = 447
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 136/311 (43%), Gaps = 62/311 (19%)
Query: 24 LGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP 82
+G P +GWNSWN F C I+ KI+ ++ +D L + D + K
Sbjct: 29 VGKLPALGWNSWNAFGCDIDAAKIMTAANEVVNLGLKD----LGYEYINIDDCWSV--KS 82
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN-----------RIESN 124
+ +V + FP G LAD +H I S L I++
Sbjct: 83 GRDASTQRMVPDPEKFPDGISGLADQIHDLGLKVGIYSSAGLTTCAGYPASLGYEDIDAQ 142
Query: 125 IAFEQGIDYLKYDNC-------------------------FNDDTRPTI----------- 148
E GIDYLKYDNC D+ P
Sbjct: 143 TFAEWGIDYLKYDNCGVPSNWTDAYTYCVPDPGSKSTNGTCPDNKNPAPAGYDWRTSLTA 202
Query: 149 -RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
RYR M DAL R I +S+C WG WG+ NSWRTT DI+ SW R+ IA+ N
Sbjct: 203 ERYRRMRDALVSVDRTILYSLCNWGQADVNDWGNETGNSWRTTGDITPSWPRIAAIANEN 262
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
++ G+ DPDMLEVGNG + E HF++WA K+PL++G + +++ + + I
Sbjct: 263 SFLMNYVDFWGYPDPDMLEVGNGNLTLAENRAHFALWAAMKSPLIIGTALDSISQDHLAI 322
Query: 268 IGNEEVIAVNQ 278
+ N+ ++ +Q
Sbjct: 323 LSNKILLKFHQ 333
>gi|270010255|gb|EFA06703.1| hypothetical protein TcasGA2_TC009634 [Tribolium castaneum]
Length = 404
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/396 (27%), Positives = 162/396 (40%), Gaps = 68/396 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL LTPPMGW W F C I+ DC E+ + D + +
Sbjct: 20 LDNGLALTPPMGWMHWERFRCIIDCD--NYPTDCISENLFMDMADKMASDGYLDAGYEYI 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
+ D + +G L A+ FPSG K LADYVH
Sbjct: 78 IVDD----CWLAMERDSEGRLQADPDRFPSGMKFLADYVHAKGLKFGIYEDYGTHTCGGY 133
Query: 114 SNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
S+D+ +++ + G+DYLK D C+ + Y M L K RPI FS C +
Sbjct: 134 PGSIDHLELDAQTFADWGVDYLKLDGCYAELDGMEDGYAEMEAYLNKTQRPIVFS-CSF- 191
Query: 173 DMRPALWGSNIR--------NSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWN 220
PA G + N WR DI DSW ++ I + Y D +A PG WN
Sbjct: 192 ---PAYKGLDANYSAAVEHCNLWRNYGDIQDSWDSVVKITNWFSYYQDKLLPYAGPGHWN 248
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
DPDML +GN G+ Y++ V +IW++ APL++ D+ + E I+ N++ I +NQ +
Sbjct: 249 DPDMLIIGNYGLSYDQSVAQMTIWSVMAAPLIMSVDLRTIKPEFRDILLNKDAIGINQDV 308
Query: 281 -----------NDQQIWAGPLSGNRIVVLLESRKTFSS-------MMTAHWDDIGIKSNN 322
N +IW L + + S + + D+ G NN
Sbjct: 309 LGKPGVVKYNKNKIKIWVKELVDGSYAIGIASFRVDGKPYQIEIVLKDYGLDNEGYLINN 368
Query: 323 TLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ + + ++ +++L P TK+
Sbjct: 369 VFEDVEPISVSVTEPIKVLINPTGGVLWKLIPATKV 404
>gi|189238968|ref|XP_973339.2| PREDICTED: similar to AGAP005846-PA [Tribolium castaneum]
Length = 406
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 134/300 (44%), Gaps = 50/300 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL LTPPMGW W F C I+ DC E+ + D + +
Sbjct: 20 LDNGLALTPPMGWMHWERFRCIIDCD--NYPTDCISENLFMDMADKMASDGYLDAGYEYI 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
+ D + +G L A+ FPSG K LADYVH
Sbjct: 78 IVDD----CWLAMERDSEGRLQADPDRFPSGMKFLADYVHAKGLKFGIYEDYGTHTCGGY 133
Query: 114 SNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
S+D+ +++ + G+DYLK D C+ + Y M L K RPI FS C +
Sbjct: 134 PGSIDHLELDAQTFADWGVDYLKLDGCYAELDGMEDGYAEMEAYLNKTQRPIVFS-CSF- 191
Query: 173 DMRPALWGSNIR--------NSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWN 220
PA G + N WR DI DSW ++ I + Y D +A PG WN
Sbjct: 192 ---PAYKGLDANYSAAVEHCNLWRNYGDIQDSWDSVVKITNWFSYYQDKLLPYAGPGHWN 248
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
DPDML +GN G+ Y++ V +IW++ APL++ D+ + E I+ N++ I +NQ +
Sbjct: 249 DPDMLIIGNYGLSYDQSVAQMTIWSVMAAPLIMSVDLRTIKPEFRDILLNKDAIGINQDV 308
>gi|121719230|ref|XP_001276327.1| alpha-galactosidase [Aspergillus clavatus NRRL 1]
gi|292495604|sp|A1C5D3.1|AGALB_ASPCL RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|119404525|gb|EAW14901.1| alpha-galactosidase [Aspergillus clavatus NRRL 1]
Length = 448
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 139/313 (44%), Gaps = 62/313 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA 80
+G+G P +GWNSWN F C I++ KI+ + +D L + D +
Sbjct: 28 DGVGKLPALGWNSWNAFGCDIDDAKIMTAAKEIVNLGLKD----LGYEYINIDDCWSV-- 81
Query: 81 KPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILISNSLDN-----------RIE 122
K ++ +V + FP G +AD +H I S L I+
Sbjct: 82 KSGRDKTTKRIVPDPAKFPDGIAGVADRIHDLGLKVGIYSSAGLTTCAGYPASLGYEEID 141
Query: 123 SNIAFEQGIDYLKYDNCF---------------------------NDDTRP--------- 146
+ E GIDYLKYDNC N++ P
Sbjct: 142 AQTFAEWGIDYLKYDNCGVPSNWTDAYTFCVPDPGSASTNGTCPDNENPAPQGYDWSTSL 201
Query: 147 -TIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
R++ M DAL IF+S+CEWG + WG+ NSWR + DI+ SW R+ IA+
Sbjct: 202 TAQRHQRMRDALLGVEHTIFYSLCEWGQADVSAWGNATGNSWRMSGDITPSWDRIAAIAN 261
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
N +H G +DPDMLEVGNG + E HF++WA K+PL++G + + +
Sbjct: 262 ENSFLLNHVDFWGHSDPDMLEVGNGDLTLAENRAHFALWAAMKSPLIIGTALDGIDPAHL 321
Query: 266 PIIGNEEVIAVNQ 278
I+ N+ +IA +Q
Sbjct: 322 EILLNKYLIAFHQ 334
>gi|332254770|ref|XP_003276505.1| PREDICTED: alpha-galactosidase A [Nomascus leucogenys]
Length = 429
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 148/353 (41%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSDGWKDAGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLKADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLENLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ ++
Sbjct: 245 WTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIR 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLS V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSDLAWAVAMINRQ 357
>gi|268557346|ref|XP_002636662.1| C. briggsae CBR-GANA-1 protein [Caenorhabditis briggsae]
Length = 452
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 132/317 (41%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQI----------NEKIIRE-------------------T 50
L NGLG TPPMGW SW F+C+I NE++ ++
Sbjct: 17 LDNGLGRTPPMGWMSWTAFYCEIDCQKHPTGCINEQLYKDMADELVAGGYDKLGYKSVHI 76
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD+ G LVANKT FPSG KALA Y+H
Sbjct: 77 DDCWSEMERDAN--------------------------GILVANKTRFPSGMKALAKYMH 110
Query: 111 I------------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ R ++ +DYLK D C D I Y
Sbjct: 111 DRGLKFGIYEDYGTKTCGGYPGSYQHERTDAQTFAAWDVDYLKLDGCNIDQALMPIGYPL 170
Query: 153 MSDALKKAGRPIFFSMCEWGDM---RPALWGSNIR----NSWRTTDDISDSWARMLTIAD 205
L GRPI +S C W P L N+ N+WR DDI+ SW +++I
Sbjct: 171 FEKELNDTGRPIMYS-CSWPAYLIDHPELVDYNLIGKYCNTWRNFDDINSSWKSIVSIIS 229
Query: 206 ----MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
M + + PG W+DPDML +GN G+ + + ++W I APL++ D+ +
Sbjct: 230 YYDKMQDKHIPTHGPGKWHDPDMLVIGNKGITVDMSISQMTVWCIWSAPLIMSNDLRIIG 289
Query: 262 AETMPIIGNEEVIAVNQ 278
E I+ N+E I +NQ
Sbjct: 290 DEFKAILQNQEAIRINQ 306
>gi|193664693|ref|XP_001951977.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Acyrthosiphon
pisum]
Length = 426
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 145/343 (42%), Gaps = 63/343 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L+NGL LTPPMGW +W F C I+ E++ T D D + +
Sbjct: 24 LNNGLALTPPMGWLAWQRFRCTIDCDTFPDECVSEQLFMTTADLLVSEGYDK---VGYKY 80
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR---- 120
+ D + + G L A++ FP G KALADYVH I S+ +
Sbjct: 81 VVIDDCWMMHNR----TADGKLQADEKRFPHGIKALADYVHSKGLKFGIYESVGTKSCEG 136
Query: 121 ---------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
I++ E G+DY+K D C+ D+ + L + GRP+ +S C W
Sbjct: 137 YAGIKGYEKIDAQTFAEWGVDYVKLDGCYTDERNMDTNFPQFGKYLNETGRPMIYSCC-W 195
Query: 172 -------GDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWN 220
G S N WR DI +S+A + + D +++A +A PG WN
Sbjct: 196 PFYQEGKGMQVNYTLVSQSCNLWRNYGDIQESYASLFDVLDNFAVKQDIWAQYAGPGQWN 255
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDML +GN + + + ++WAI APL + + + I+ N EVI VNQ
Sbjct: 256 DPDMLLIGNFALSHEQSQTQMALWAILAAPLFMSNKLSTVQPNFKKILQNTEVIGVNQDK 315
Query: 279 -------AINDQ--QIWAGPLSGNRI-----VVLLESRKTFSS 307
DQ +IW P+ I + SR+T +
Sbjct: 316 LGIQGKRIYKDQGIEIWTKPVEPKEIEHYSYAIAFVSRRTMGT 358
>gi|403298787|ref|XP_003940187.1| PREDICTED: alpha-galactosidase A [Saimiri boliviensis boliviensis]
Length = 429
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 157/387 (40%), Gaps = 100/387 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELLVSDGWKDAGYDYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPRGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCSGFPGSFGYYDIDAQTFADWGVDLLKFDGCHCDSLESLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNYADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSSNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTF----S 306
+ ++ +++VIA+NQ +Q +W PLS V + +R+ S
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSDLAWAVAVINRQEIGGPRS 364
Query: 307 SMMTAHWDDIGIKSNNTLFEHRTLETK 333
M G+ N F + L K
Sbjct: 365 YTMAVASLGKGVACNPACFITQHLPVK 391
>gi|256393580|ref|YP_003115144.1| alpha-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359806|gb|ACU73303.1| Alpha-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 727
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 163/409 (39%), Gaps = 105/409 (25%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ--INEKIIRET-------------------DDCWGESSRD 60
NGL LTPPMG+N W F C +NE + +T DDCW + +RD
Sbjct: 49 NGLSLTPPMGFNDWAGFECNSDMNEALFTKTADEIVKLGLNKLGYDYVNIDDCWMQKTRD 108
Query: 61 SQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNR 120
+ G+L + T FP G K L DY+H S L
Sbjct: 109 A--------------------------NGDLQVDATRFPHGLKWLGDYIH---SKGLKFG 139
Query: 121 IESNIA-------------FEQ--------GIDYLKYDNCFND-DTRP--------TIRY 150
I + F+Q G+DYLK D C+ D P I Y
Sbjct: 140 IYEDAGYYTCQGAAGSYGHFQQDADLYASWGVDYLKLDYCYEPMDQFPGKTESQVAQIVY 199
Query: 151 RAMSDALKKAGRPIFFS-------MCEWGDMRPAL-WGSNIRNSWRTTDDISDSWARMLT 202
S AL RP+ FS C D L W N WR DI D+W +L
Sbjct: 200 TEASQALLNTHRPMLFSESAPAYVCCSGSDFTDELTWLYQHGNLWRFGSDIYDAWPSVLE 259
Query: 203 --IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D A A PG WND DMLE+GNGG+ E ++WA +P+LL D+ L
Sbjct: 260 NYSEDNTPGLAQWAGPGHWNDADMLEIGNGGLTPTEEQTQMTLWAEMASPILLSTDLSKL 319
Query: 261 TAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMM 309
T + I+ N +V+AV+Q Q + A PL+ + V+L ++ + +
Sbjct: 320 TPAEVGIVSNPDVVAVDQDRLGAQGTIVQSGTGYDVLAKPLADGDVSVVLFNKGDTAQTV 379
Query: 310 TAHWDDIGIKSNNTLFEHRTLETKF----QGKLTAKVDGHSCKMYELTP 354
T IG+ S F+ L +K G ++A + HS +Y + P
Sbjct: 380 TTTAAKIGLPSRGAPFQLTDLVSKATSASDGTISASLAPHSTVIYRVHP 428
>gi|313147907|ref|ZP_07810100.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
gi|313136674|gb|EFR54034.1| alpha-galactosidase [Bacteroides fragilis 3_1_12]
Length = 518
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 137/283 (48%), Gaps = 42/283 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INEK+I+ET D+ ++ + +
Sbjct: 17 TPMMGWSSWNTFRVHINEKLIQET--------ADAMVTRGLKDAGYKYVNIDDGYFGGRD 68
Query: 87 CQGNLVANKTTFPSGTKALADYVH-------------------ILISNSL--------DN 119
G L NK FP+G KALADY+H I +++L +
Sbjct: 69 SNGYLFTNKKKFPNGMKALADYIHSKGLKAGIYSDAGSNTCGSIYDADTLGVGVGLWQHD 128
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWGDMR 175
I+ F+ G D++K D C + T + RY + A+ + GR + +++C W
Sbjct: 129 DIDCKTFFQDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAINRTGRTDVRYNICRW--QF 186
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
P W + + +SWR DI+ + + I + N + + G +ND DMLEVG G + +
Sbjct: 187 PGTWATRLASSWRIHTDINPRFKTIDQIIEKNLYLSPYVSLGHYNDMDMLEVGRG-LSED 245
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
E HF IWAI +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 246 EEKTHFGIWAIMSSPLMIGCDLRTIPEKTLSIITNQEVIALNQ 288
>gi|414888110|tpg|DAA64124.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
Length = 231
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 96/188 (51%), Gaps = 25/188 (13%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLT 76
R L+NGL LTP MGWNSWN F C INE +IRET D + +L +F D
Sbjct: 51 RLQLNNGLALTPQMGWNSWNFFACNINETVIRETADALVSTGL---AALGYNFVNIDDCW 107
Query: 77 QIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDN 119
+ +Q L+ + TFPSG KALADYVH + SL +
Sbjct: 108 SYVKRGKQDQ----LLPDPKTFPSGIKALADYVHGKGLKLGIYSDAGKFTCQVRPGSLGH 163
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
+ F GIDYLKYDNC+N +P RY M DAL GR IF+S+CEWG PAL
Sbjct: 164 ENDDAAIFASWGIDYLKYDNCYNLGIKPQKRYPPMRDALNSTGRQIFYSLCEWGQYDPAL 223
Query: 179 WGSNIRNS 186
W + NS
Sbjct: 224 WAGKVGNS 231
>gi|66802866|ref|XP_635276.1| hypothetical protein DDB_G0291524 [Dictyostelium discoideum AX4]
gi|60463561|gb|EAL61746.1| hypothetical protein DDB_G0291524 [Dictyostelium discoideum AX4]
Length = 466
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/310 (32%), Positives = 137/310 (44%), Gaps = 78/310 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ---INEKIIRET-------------------DDCWGES 57
LSNGL LTPPMG+N WN C+ INE ++ +T DDCW
Sbjct: 52 LSNGLALTPPMGFNYWNFDGCKTNLINETMMMDTAYAMSTNGMSKVGYEYVNLDDCWMAK 111
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
+RD GNL A+ FP G K L+DY+H L
Sbjct: 112 TRDEN--------------------------GNLRADPIRFPHGIKYLSDYIHSLGLKFG 145
Query: 113 ISNSL-----------DNRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
I + +N +E + E G+D++K D C + Y + L+
Sbjct: 146 IYGDIGSETCQGYPGSENYLEQDAKQFAEWGVDFVKMDGCNMQVSDMKKAYTDLGQYLQN 205
Query: 160 AGRPIFFSMCEWGDM-------RPALWGSNIRNSWRTTDDISDSWARMLTIADMNE---- 208
GRP+ +S C W P + I N WR DI+D++ I D E
Sbjct: 206 TGRPMVYS-CSWPTYAYVQNITMPFDYIEGICNLWREFQDITDNFTEWTQILDEMEDSVP 264
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+ A PG +NDPDMLE+GNG EY FS+W+I APL+ G D+ +++ E + I+
Sbjct: 265 KRSQFAAPGSFNDPDMLEIGNGLENAVEYKSMFSLWSIIAAPLIAGNDIISMSKEALEIL 324
Query: 269 GNEEVIAVNQ 278
NEEVI VNQ
Sbjct: 325 INEEVIQVNQ 334
>gi|293395343|ref|ZP_06639627.1| alpha galactosidase [Serratia odorifera DSM 4582]
gi|291422027|gb|EFE95272.1| alpha galactosidase [Serratia odorifera DSM 4582]
Length = 397
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 164/360 (45%), Gaps = 59/360 (16%)
Query: 4 VSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQI 63
V ++ AD ++ L TPPMGW+SWN++ +INEKII +T D +
Sbjct: 17 VHAATADPTSAEAAAERVTTLAQTPPMGWSSWNYYGDRINEKIIIDTID------KMVAS 70
Query: 64 SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------ 111
L + +++ ++ K ++ L ++ FP G K LADY H
Sbjct: 71 GLRDAGYIYVNIDDGWQKYKGSRADHPLTYDEKKFPRGIKYLADYAHQKGMKLGIYSGPG 130
Query: 112 ------LISNSLDNRIESNIAFEQGIDYLKYDNC--FNDDTRPTIR--YRAMSDALKKAG 161
+ + + ++ + G+D+LKYD+C ++ ++ + MS AL+ G
Sbjct: 131 NSTCAGYVGSEGHEKQDAAMFASWGVDHLKYDSCCSHQQASQAVLKTVFGDMSKALQATG 190
Query: 162 RPIFFSMCEWGDMRPALWGSNI-RNSWRTTDDISDS----------WARMLTIADMNEVY 210
RPI + C G WG+++ N WR DISD + +L + D
Sbjct: 191 RPIVYHACHCGWQNIWQWGNSVGANQWRIGQDISDDFNYPGNREGYYFDVLDMIDRGVGL 250
Query: 211 ADHAKPGGWNDPDMLEVG---------NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
++ G WND DML +G G EY HFS+WAI +PLL+G + +L
Sbjct: 251 EAYSGKGHWNDYDMLVIGLHGHSKELVGTGASDTEYRTHFSMWAILSSPLLIGASLDDLN 310
Query: 262 AETMPIIGNEEVIAVNQ---------AIN--DQQIWAGPLSGNRIVVLLESRKTFSSMMT 310
A M + N+E+IA+NQ +N DQQI+A PL+ V L +R ++ MT
Sbjct: 311 AADMVTLKNKEIIALNQDPLGLQAKRYVNNGDQQIFAKPLADGSWAVALLNRSGQTATMT 370
>gi|307095410|gb|ADN30030.1| alpha-galactosidase [Bispora sp. MEY-1]
Length = 426
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/295 (31%), Positives = 143/295 (48%), Gaps = 44/295 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES-SRDSQISLSCSFHMFDSLTQI 78
L+NG+G PPMG++++N + C N + GE+ R + + + D +
Sbjct: 18 LNNGVGKVPPMGYDTFNAYGCDYNASSVLAQ----GEAMKRTGLVDAGYNIFILDDCYAL 73
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDNRIESNIAFEQ------ 129
+ G +VA+ FP+G AL+ ++ I ++ DN + +
Sbjct: 74 KER----NATGYMVADPKKFPNGIPALSKQMNDLGISLAAYGDNGYYTCAGYPGSYGHEM 129
Query: 130 ---------GIDYLKYDNCF---NDDTRPTI--RYRAMSDALK----KAGRPIF-FSMCE 170
G+ YLKYDNC+ ++ T+ + RY MSDA+ K RP F F +CE
Sbjct: 130 KDLETWHSWGMSYLKYDNCYIPADNITQENMFGRYTRMSDAIAAFAAKIHRPPFIFYLCE 189
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN- 229
WG +P +WG I WR DI W+ + +I D + G ND D+LEVGN
Sbjct: 190 WGWQQPWIWGRRISQGWRIDGDIKPYWSAIASIIDQASFQYWASDFYGRNDMDILEVGNT 249
Query: 230 ------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G + Y E HF+ WA+ K+PL++G D+ N T ET+ I+GN ++I +NQ
Sbjct: 250 GQGTPPGNLTYEESKTHFTAWALMKSPLIIGTDLTNATQETIDILGNRDLIKINQ 304
>gi|296235998|ref|XP_002763141.1| PREDICTED: alpha-galactosidase A [Callithrix jacchus]
Length = 439
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 148/353 (41%), Gaps = 96/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+EK+ E
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELLVSDGWKDAGYDYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCSGFPGSFGYYDIDAQTFADWGVDLLKFDGCHCDSLESLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DI DSW + +I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNYADIDDSWKSIKSILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D+ +++
Sbjct: 245 WTSSNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ +++VIA+NQ +Q +W PLS V + +R+
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWERPLSDLAWAVAVINRQ 357
>gi|410965731|ref|XP_003989395.1| PREDICTED: alpha-N-acetylgalactosaminidase [Felis catus]
Length = 414
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 149/335 (44%), Gaps = 61/335 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSFHM 71
NGL PPMGW +W F C I+E++ E D ++D L ++
Sbjct: 20 NGLLRKPPMGWLAWERFRCNTDCDEDPKNCISERLFMEMAD---HLAQDGWRDLGYTYLN 76
Query: 72 FDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISN------- 115
D I + +G L+ + FP G LADY H L + N
Sbjct: 77 MDD-CWIGGR----DAKGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGNFTCMGYP 131
Query: 116 --SLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW- 171
+LD + F E +D LK D CF+ + Y M+ AL GRPI FS C W
Sbjct: 132 GITLDKVTQDAQTFAEWKVDMLKLDGCFSTPEERAMGYPKMAAALNATGRPIAFS-CSWP 190
Query: 172 ---GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWND 221
G + P + S I N WR DDI DSW+ +L+I D ++ A PG WND
Sbjct: 191 AYEGGLPPKVNYSLLAEICNVWRNYDDIQDSWSSVLSILDWFVDHQDILQPVAGPGHWND 250
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDML +GN G+ + + ++W + APL + D+ ++A+ M I+ N +I +NQ
Sbjct: 251 PDMLLIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIKINQDPL 310
Query: 279 AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PL+ ++ SR+T
Sbjct: 311 GIQGRRILKEKSHIEVYVRPLADEASALVFFSRRT 345
>gi|395146542|gb|AFN53695.1| putative tonoplast intrinsic protein [Linum usitatissimum]
Length = 426
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/148 (45%), Positives = 100/148 (67%), Gaps = 15/148 (10%)
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
DPDMLEVGNGGM +EY+ HFSIWA++K+PL++GCDV ++ ET I+ N EVIAVNQ
Sbjct: 279 DPDMLEVGNGGMTNDEYIAHFSIWALAKSPLIIGCDVRAMSNETYQILSNSEVIAVNQDR 338
Query: 279 --------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN----NTLFE 326
D ++WAGPL+ NR+ V+L +R + + +TA + DIG++++ L+
Sbjct: 339 LGVQGRKVKEGDLEVWAGPLTNNRVAVVLWNRGSSRATVTAPFSDIGLRADLADARDLWV 398
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTP 354
H T+ + + +++A +D H+CKMY LTP
Sbjct: 399 HMTIPS-VKEQISADLDSHACKMYILTP 425
>gi|423281182|ref|ZP_17260093.1| hypothetical protein HMPREF1203_04310 [Bacteroides fragilis HMW
610]
gi|404583346|gb|EKA88027.1| hypothetical protein HMPREF1203_04310 [Bacteroides fragilis HMW
610]
Length = 529
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/283 (31%), Positives = 137/283 (48%), Gaps = 42/283 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INEK+I+ET D+ ++ + +
Sbjct: 28 TPMMGWSSWNTFRVHINEKLIQET--------ADAMVTRGLKDAGYKYVNIDDGYFGGRD 79
Query: 87 CQGNLVANKTTFPSGTKALADYVH-------------------ILISNSL--------DN 119
G L NK FP+G KALADY+H I +++L +
Sbjct: 80 SNGYLFTNKKKFPNGMKALADYIHSKGLKAGIYSDAGSNTCGSIYDADTLGVGVGLWQHD 139
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWGDMR 175
I+ F+ G D++K D C + T + RY + A+ + GR + +++C W
Sbjct: 140 DIDCKTFFQDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAINRTGRTDVRYNICRW--QF 197
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
P W + + +SWR DI+ + + I + N + + G +ND DMLEVG G + +
Sbjct: 198 PGTWATRLASSWRIHTDINPRFKTIDQIIEKNLYLSPYVSLGHYNDMDMLEVGRG-LSED 256
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
E HF IWAI +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 257 EEKTHFGIWAIMSSPLMIGCDLRTIPEKTLSIITNQEVIALNQ 299
>gi|322693203|gb|EFY85072.1| alpha-galactosidase [Metarhizium acridum CQMa 102]
Length = 543
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 102/402 (25%), Positives = 161/402 (40%), Gaps = 125/402 (31%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDC 53
G+ RN+ + G P +GWNSWN +HC INE+ + DDC
Sbjct: 23 GTLVRNVPHDPSGKLPTLGWNSWNAYHCDINEQHFLDAAQALVDTGLRDAGYKYVNIDDC 82
Query: 54 WGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL- 112
W E + + G++ NKT FP+G LA +H +
Sbjct: 83 WSEKT--------------------------GRVDGHIAVNKTRFPAGIDGLAKKIHDMK 116
Query: 113 -------ISNSLD-NRIESNIAFEQ---------GIDYLKYDNCF--------------- 140
+ +L +++ +E G+DYLKYDNC+
Sbjct: 117 LKFGIYSTAGTLTCAGYPASLGYEDVDAADFAKWGVDYLKYDNCYIPPEWQDEYIFCEED 176
Query: 141 ----------NDDTRPTI--------------RYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ P++ R+ M DAL K R + +++C WG
Sbjct: 177 GAKIGPNGTCSRSRNPSLAPDGYDWSKSKSARRFNRMRDALAKQDREMLYNLCIWGTADV 236
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
WG N SWR + DIS +W ++ I +MN + ND DMLEVGNG + E
Sbjct: 237 FSWGRNTAISWRMSGDISPNWRSVMHILNMNSFKMNAVGFYAHNDADMLEVGNGDLSPAE 296
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------------ 278
HF+ WA K+PLL+G D+ L+ + + ++ N ++A NQ
Sbjct: 297 TRSHFAFWAAMKSPLLVGTDLRKLSRDNVDLLKNRHLLAFNQDSRHGKPAAPYKWGVNPD 356
Query: 279 ----AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
+ N + WAG + +V++L + + ++ TA W +I
Sbjct: 357 WTYNSTNPAEYWAGQSTDGHLVLMLNTLGS-TARKTAAWGEI 397
>gi|194761672|ref|XP_001963052.1| GF14131 [Drosophila ananassae]
gi|190616749|gb|EDV32273.1| GF14131 [Drosophila ananassae]
Length = 413
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 139/304 (45%), Gaps = 55/304 (18%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
N L NGL TPPMGW SW F C N + + D+C E + + + D
Sbjct: 16 NALDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISEQLFQTMTDIVVA----DGYAS 69
Query: 78 IAAKPLP-NQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN---- 124
+ + + + C G LVA++ FPSG KAL+DYVH S L I +
Sbjct: 70 VGYEYINIDDCWLEKHRSHDGKLVADRKRFPSGMKALSDYVH---SRGLKFGIYEDYGNY 126
Query: 125 --------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
I +E+ +DY+K D C+ Y L G+ + +S
Sbjct: 127 TCAGYPGIIGYEEKDALLFAEWNVDYVKLDGCYALPYDMDQGYSNFGRLLNSTGKSMVYS 186
Query: 168 MCEW------GDMRPALWGSNIR---NSWRTTDDISDSWARMLTIADM----NEVYADHA 214
C W ++P S I+ N WR DDI DSWA + I D ++ A +A
Sbjct: 187 -CSWPVYQIYAGIQPNY--SAIQSHCNLWRNYDDIQDSWASVENIIDYYGNNQDLIAPNA 243
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++I
Sbjct: 244 GPGHWNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKHILQNRKII 303
Query: 275 AVNQ 278
AV+Q
Sbjct: 304 AVDQ 307
>gi|133778924|ref|NP_038491.2| alpha-galactosidase A [Mus musculus]
gi|26333581|dbj|BAC30508.1| unnamed protein product [Mus musculus]
gi|26344325|dbj|BAC35819.1| unnamed protein product [Mus musculus]
gi|74211253|dbj|BAE37692.1| unnamed protein product [Mus musculus]
Length = 421
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 143/317 (45%), Gaps = 48/317 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHM----FDSL 75
L NGL TP MGW W F C ++ + E D C E L S +D L
Sbjct: 34 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDACISEQLFMQMAELMVSDGWRDAGYDYL 91
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
P + +G L A+ FPSG K LA+YVH + I + N+
Sbjct: 92 CIDDCWMAPERDSKGRLQADPQRFPSGIKHLANYVHSKGLKLGIYADVGNKTCAGFPGSF 151
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + G+D LK+D C D Y+ M+ AL + GR I +S CEW
Sbjct: 152 GSYDIDAQTFADWGVDLLKFDGCHCDSVVSLENGYKYMALALNRTGRSIVYS-CEWPLYL 210
Query: 176 PALWGSNIR------NSWRTTDDISDSWARMLTIADMNEVY----ADHAKPGGWNDPDML 225
N N WR DD+ DSW + I VY + A PG WNDPDML
Sbjct: 211 RPFHKPNYTDIQYYCNHWRNFDDVYDSWESIKNILSWTVVYQKEIVEVAGPGSWNDPDML 270
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
+GN G+ +++ V ++WAI APLL+ D+ ++++ ++ N++VIA+NQ
Sbjct: 271 VIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISSQAKALLQNKDVIAINQDPLGKQG 330
Query: 279 ----AINDQQIWAGPLS 291
N ++W PLS
Sbjct: 331 YCFRKENHIEVWERPLS 347
>gi|91080949|ref|XP_974398.1| PREDICTED: similar to
alpha-galactosidase/alpha-n-acetylgalactosaminidase
[Tribolium castaneum]
gi|270005370|gb|EFA01818.1| hypothetical protein TcasGA2_TC007420 [Tribolium castaneum]
Length = 442
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 131/318 (41%), Gaps = 87/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I+E + R
Sbjct: 21 LENGLARTPPMGWLAWERFRCNTDCKNDPENCISENLFRTMADIVISEGYAAAGYEYINV 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV ++ FP G KALADYVH
Sbjct: 81 DDCWLEKERDRF--------------------------GQLVPDRQRFPRGMKALADYVH 114
Query: 111 -----ILISNSLDN-------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
I N R ++ +DY+K D C+ + Y
Sbjct: 115 SKGLKFGIYEDFGNYTCAGYPGVLGYLRTDAETFASWDVDYVKLDGCYAHPSEMDRGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
L + GR + +S C W M+P S I N WR DDI DSWA + +I
Sbjct: 175 FGFYLNQTGRAMMYS-CSWPVYQIYAGMQPNF-TSIIEHCNLWRNFDDIQDSWASVESII 232
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V +A PG WNDPDML +GN G+ Y + +IWAI APLL+ D+ ++
Sbjct: 233 DYYGNNQDVIVANAGPGHWNDPDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRSI 292
Query: 261 TAETMPIIGNEEVIAVNQ 278
E I+ N+++IAV+Q
Sbjct: 293 RPEYKAILQNKKIIAVDQ 310
>gi|1703210|sp|P51569.1|AGAL_MOUSE RecName: Full=Alpha-galactosidase A; AltName:
Full=Alpha-D-galactosidase A; AltName:
Full=Alpha-D-galactoside galactohydrolase; AltName:
Full=Melibiase; Flags: Precursor
gi|945409|gb|AAA74453.1| alpha-D-galactosidase A [Mus musculus]
gi|1141788|gb|AAA96749.1| alpha-galactosidase A [Mus musculus]
gi|1335898|gb|AAC52583.1| alpha-galactosidase A [Mus musculus]
gi|1335900|gb|AAC52584.1| alpha-galactosidase A [Mus musculus]
gi|1666701|gb|AAB47244.1| alpha-D-galactosidase A [Mus musculus]
gi|14290506|gb|AAH09021.1| Galactosidase, alpha [Mus musculus]
gi|148688442|gb|EDL20389.1| galactosidase, alpha, isoform CRA_a [Mus musculus]
Length = 419
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 143/317 (45%), Gaps = 48/317 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHM----FDSL 75
L NGL TP MGW W F C ++ + E D C E L S +D L
Sbjct: 32 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDACISEQLFMQMAELMVSDGWRDAGYDYL 89
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
P + +G L A+ FPSG K LA+YVH + I + N+
Sbjct: 90 CIDDCWMAPERDSKGRLQADPQRFPSGIKHLANYVHSKGLKLGIYADVGNKTCAGFPGSF 149
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + G+D LK+D C D Y+ M+ AL + GR I +S CEW
Sbjct: 150 GSYDIDAQTFADWGVDLLKFDGCHCDSVVSLENGYKYMALALNRTGRSIVYS-CEWPLYL 208
Query: 176 PALWGSNIR------NSWRTTDDISDSWARMLTIADMNEVY----ADHAKPGGWNDPDML 225
N N WR DD+ DSW + I VY + A PG WNDPDML
Sbjct: 209 RPFHKPNYTDIQYYCNHWRNFDDVYDSWESIKNILSWTVVYQKEIVEVAGPGSWNDPDML 268
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
+GN G+ +++ V ++WAI APLL+ D+ ++++ ++ N++VIA+NQ
Sbjct: 269 VIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISSQAKALLQNKDVIAINQDPLGKQG 328
Query: 279 ----AINDQQIWAGPLS 291
N ++W PLS
Sbjct: 329 YCFRKENHIEVWERPLS 345
>gi|198475823|ref|XP_002132504.1| GA27712 [Drosophila pseudoobscura pseudoobscura]
gi|198137975|gb|EDY69906.1| GA27712 [Drosophila pseudoobscura pseudoobscura]
Length = 411
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 133/320 (41%), Gaps = 91/320 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW SW F C I+EK+ +
Sbjct: 18 LDNGLAKTPPMGWLSWERFRCNTDCVFFLENCISEKLFQTMTDLVVADGYASAGYEFINI 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E R G L+A++ FP+G KAL+DY+H
Sbjct: 78 DDCWLEKHRSHD--------------------------GKLLADRQRFPNGIKALSDYIH 111
Query: 111 ILISNSLDNRIESN------------IAFEQ---------GIDYLKYDNCFNDDTRPTIR 149
S L I + I +EQ +DY+K D C++ +
Sbjct: 112 ---SRGLKFGIYEDYGNFTCAGYPGIIGYEQEDAFQFAEWNVDYVKLDGCYSLPLEMDLG 168
Query: 150 YRAMSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLT 202
Y L G+P+ +S C W ++P N WR DDI DSW +
Sbjct: 169 YSTFGKLLNSTGKPMVYS-CSWPVYQIYAGIQPNFSAVKTYCNLWRNYDDIQDSWTSVEN 227
Query: 203 IADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D ++ A +A PG WNDPDML +GN G+ Y + F+IW+I APLL+ D+
Sbjct: 228 IIDYYGNNQDLIAPNAGPGHWNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLR 287
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
+ + I+ N ++IAV+Q
Sbjct: 288 TIRPQFKRILQNRKIIAVDQ 307
>gi|340520926|gb|EGR51161.1| glycoside hydrolase family 27 [Trichoderma reesei QM6a]
Length = 444
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 148/375 (39%), Gaps = 124/375 (33%)
Query: 25 GLTPPMGWNSWNHFHCQINE-------KIIRET------------DDCWGESSRDSQISL 65
G P +GWNSWN +HC I+E ++I + DDCW S +D ++
Sbjct: 36 GKVPSLGWNSWNAYHCDIDESKFLSAAEVIVSSGLLDAGYNYVNIDDCW--SMKDGRV-- 91
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNS 116
G++ N T FP G LA VH L +
Sbjct: 92 ----------------------DGHIAVNTTRFPDGIDGLAKKVHDLGLKLGIYSTAGTA 129
Query: 117 LDNRIESNIAFEQ---------GIDYLKYDNC-----FNDD------------------- 143
+++ +E G+DYLKYDNC + D+
Sbjct: 130 TCAGYPASLGYEDVDAADFADWGVDYLKYDNCNVPSDWQDEYVACAPDAVQTGPNGTCST 189
Query: 144 ---------------TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
++ R+ AM DAL K R I S+C WG WG+ SWR
Sbjct: 190 ALEPNLAPPGYDWSTSKSAERFNAMRDALAKQSREIVLSLCIWGVADVFSWGNETGISWR 249
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK 248
+ DIS W + I +MN + G ND DMLEVGNG + E HF++WA K
Sbjct: 250 MSGDISPEWGSVTHIINMNSFKMNSVGFWGHNDADMLEVGNGNLTAAETRTHFALWAAMK 309
Query: 249 APLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------------------AINDQQIW 286
+PLL+G D+ L+ E + ++ N+ ++A NQ N + W
Sbjct: 310 SPLLIGTDLAQLSQENIELLKNKHLLAFNQDSVYGQPATPYKWGVNPDWTFNYTNPAEYW 369
Query: 287 AGPLSGNRIVVLLES 301
AGP S +V+++ +
Sbjct: 370 AGPSSKGHLVLMMNT 384
>gi|350597119|ref|XP_003484358.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Sus scrofa]
Length = 411
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 133/303 (43%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL PPMGW +W F C I DC + + IS M D L Q
Sbjct: 17 MLENGLLRKPPMGWLAWERFRCNI---------DC--DEDPKNCISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L +GNL+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDAKGNLMPDPKRFPHGIAFLADYAHSLGLKLGIYEDLGSF 125
Query: 113 -----ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD ++ F E +D LK D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTHKERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAK 215
S C W G + P + S +I N WR DDI DSW +L+I D + D A
Sbjct: 186 S-CSWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILDWFVAHQDILQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ + I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFEQAQSQMALWTVLAAPLFMSTDLRTISAQNVDILQNTLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|265764954|ref|ZP_06093229.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
gi|263254338|gb|EEZ25772.1| alpha-galactosidase [Bacteroides sp. 2_1_16]
Length = 530
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 134/286 (46%), Gaps = 48/286 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN F INE++I+ET D+ ++ + +
Sbjct: 29 TPMMGWSSWNTFRVHINEELIKET--------ADAMVNRGLKDVGYGYVNIDDGYFGGRN 80
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----------------- 129
+G L ANK FP+G + L+DY+H S L I S+
Sbjct: 81 SEGRLFANKKKFPNGMRVLSDYIH---SKGLKAGIYSDAGSNTCGSIYDADTLGIGVGLW 137
Query: 130 --------------GIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRP-IFFSMCEWG 172
G D++K D C + T + RY + A+++ GR + +++C W
Sbjct: 138 KHDDIDCQTFLKDWGYDFIKIDWCGGEATGQSEQQRYTDIYKAIRRTGRTDVRYNICRW- 196
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W + + SWR DI+ + + I + N A + PG +ND DMLEVG G +
Sbjct: 197 -QFPGTWATQLAGSWRIHTDINPRFTTIDRIIERNLYLAPYVSPGHYNDMDMLEVGRG-L 254
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+E HF IW+I +PL++GCD+ + +T+ II N+EVIA+NQ
Sbjct: 255 TEDEEKTHFGIWSILSSPLMIGCDLRTIPEKTLSIITNKEVIALNQ 300
>gi|241176422|ref|XP_002399573.1| alpha-D-galactosidase, putative [Ixodes scapularis]
gi|215495176|gb|EEC04817.1| alpha-D-galactosidase, putative [Ixodes scapularis]
Length = 526
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 136/298 (45%), Gaps = 47/298 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C ++ R+ ++C E ++ ++ + M Q
Sbjct: 22 LDNGLARTPPMGWLSWERFLCNVD--CARDPENCISE-----RLYMTMANIMHKEGYQEV 74
Query: 80 AKPLPN----------QCQGNLVANKTTFPSGTKALADYVH-----------------IL 112
N Q G+L+AN T FPSG K L D++H
Sbjct: 75 GYQYVNVDDCWMANSRQPDGSLMANSTRFPSGMKKLGDFIHSKGLKFGIYQDCGTQTCAG 134
Query: 113 ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
SL + ++ F E G+D LK D C + Y A++ AL GR I FS C W
Sbjct: 135 YPGSLCHYVQDARTFAEWGVDMLKLDGCNINPIFMDRLYPAVTSALNATGRKIVFS-CSW 193
Query: 172 ------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWN 220
M+P + N WR DDI+DSW +L I D + + + PG W+
Sbjct: 194 PAYQVDAGMKPNYPAIARSCNMWRNFDDITDSWVSVLHIVDYYASVQDTLTAVSGPGRWS 253
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
DPDML +GN G+ ++ ++WAI APL + D+ N+ E I+ N VIA+NQ
Sbjct: 254 DPDMLIIGNYGLSLDQSRAQMALWAILAAPLFMSADLRNIRPEHKDILTNRLVIAINQ 311
>gi|395772378|ref|ZP_10452893.1| alpha-galactosidase [Streptomyces acidiscabies 84-104]
Length = 644
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 170/390 (43%), Gaps = 75/390 (19%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN F +I+ +I++ D + LS + + + ++ + +
Sbjct: 45 PMGWASWNSFAAKIDSGVIKQQTDAF------VAAGLSEAGYTYVNIDEGWWQGT-RDAA 97
Query: 89 GNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA---------------------- 126
GN+ +++ +P G A+ADY+H S L I ++
Sbjct: 98 GNITVDESEWPGGMAAIADYIH---SKGLKAGIYTDAGKDGCGYYYPTGRPAAPGSGSEG 154
Query: 127 -FEQ--------GIDYLKYDNCFND--DTRPTIRYRAMSDALKKA----GRPIFFSMCEW 171
++Q G D++K D C D P Y+++SDA+ +A GRP+ S+C W
Sbjct: 155 HYDQDMLQFSKWGFDFVKVDWCGGDAEGLDPATTYQSISDAITRATATTGRPLTLSLCNW 214
Query: 172 GDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
G P W WRT DDI S +LT D N ++ G +NDPDML
Sbjct: 215 GRQNPWNWAPGQGAMWRTNDDIIFYGNQPSMTNLLTNFDRN-LHPTAQHTGYYNDPDMLM 273
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-- 284
G G + H ++WAIS APLL G ++ +T+ET I+ N EVIAV+Q Q
Sbjct: 274 TGMDGFTAAQNRTHMNLWAISGAPLLAGNNLATMTSETAAILKNPEVIAVDQDPRGLQGV 333
Query: 285 --------------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEH--- 327
+ AG +GNR VLL +R + ++ +T W D+G+ +
Sbjct: 334 KVAEDTTGLQAYGKVLAG--TGNRAAVLL-NRTSAAANITVRWSDLGLTDASATVRDLWT 390
Query: 328 RTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
R+ + G TA V M +T T+
Sbjct: 391 RSNIGSYSGSYTANVPAGGSVMLTVTGGTE 420
>gi|74141775|dbj|BAE38627.1| unnamed protein product [Mus musculus]
gi|74147510|dbj|BAE38657.1| unnamed protein product [Mus musculus]
Length = 421
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/317 (31%), Positives = 143/317 (45%), Gaps = 48/317 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHM----FDSL 75
L NGL TP MGW W F C ++ + E D C E L S +D L
Sbjct: 34 LDNGLARTPTMGWLHWERFMCNLDCQ--EEPDACISEQLFMQMAELMVSDGWRDAGYDYL 91
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
P + +G L A+ FPSG K LA+YVH + I + N+
Sbjct: 92 CIDDCWMAPERDSKGRLQADPQRFPSGIKHLANYVHSKGLKLGIYADVGNKTCAGFPGSF 151
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEWGDMR 175
I++ + G+D LK+D C D Y+ M+ AL + GR I +S CEW
Sbjct: 152 GSYDIDAQTFADWGVDLLKFDGCHCDSVVSLENGYKYMALALNRTGRSIVYS-CEWPLYL 210
Query: 176 PALWGSNIR------NSWRTTDDISDSWARMLTIADMNEVY----ADHAKPGGWNDPDML 225
N N WR DD+ DSW + I VY + A PG WNDPDML
Sbjct: 211 RPFHKPNYTDIQYYCNHWRNFDDVYDSWESIKNILSWTVVYQKEIVEVAGPGSWNDPDML 270
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
+GN G+ +++ V ++WAI APLL+ D+ ++++ ++ N++VIA+NQ
Sbjct: 271 VIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISSQDKALLQNKDVIAINQDPLGKQG 330
Query: 279 ----AINDQQIWAGPLS 291
N ++W PLS
Sbjct: 331 YCFRKENHIEVWERPLS 347
>gi|157109986|ref|XP_001650906.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Aedes aegypti]
gi|108878843|gb|EAT43068.1| AAEL005460-PA [Aedes aegypti]
Length = 463
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 133/318 (41%), Gaps = 87/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW SW F C I+E + R
Sbjct: 55 LENGLARTPPMGWLSWERFRCNTDCEGDPENCISENLFRTMADLVVSEGYAALGYEYINV 114
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E SR + G LVA++ FPSG KALADYVH
Sbjct: 115 DDCWLEKSRGPR--------------------------GELVADRRRFPSGMKALADYVH 148
Query: 111 -----ILISNSLDNRIESN----IAFEQG---------IDYLKYDNCFNDDTRPTIRYRA 152
I N + + F Q +DY+K D C++ Y
Sbjct: 149 AKGLKFGIYEDYGNYTCAGYPGILGFSQNDAAQFASWDVDYVKLDGCYSLPIDMDHGYPE 208
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
L GR + +S C W M P + S I+ N WR DDI DSWA + +I
Sbjct: 209 FGRHLNATGRAMVYS-CSWPVYQIYAGMNPN-YSSIIQHCNLWRNYDDIQDSWASLESII 266
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D + +A PG WNDPDML +GN G+ Y + ++WAI APL++ D+ +
Sbjct: 267 DYYGNNQDAIIPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAIMAAPLMMSVDLRTI 326
Query: 261 TAETMPIIGNEEVIAVNQ 278
E I+ N+++IAV+Q
Sbjct: 327 RPEFKAILQNKKIIAVDQ 344
>gi|332373270|gb|AEE61776.1| unknown [Dendroctonus ponderosae]
Length = 443
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 141/297 (47%), Gaps = 45/297 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS------FHMFD 73
L NGL TPPMGW +W F C + K + D+C E+ + + S + +
Sbjct: 22 LENGLARTPPMGWLAWERFRCNTDCK--NDPDNCISENLFRTMADIVVSEGYANVGYEYI 79
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--------------------I 113
++ + NQ G LV ++T FP G KAL+DYVH +
Sbjct: 80 NVDDCWLEKERNQ-YGELVPDRTRFPRGMKALSDYVHAKGLKFGIYEDFGNYTCAGYPGV 138
Query: 114 SNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW-- 171
+L E+ +++ +DY+K D C+ + Y L + GR + +S C W
Sbjct: 139 IGNLQQDAETFASWD--VDYVKLDGCYAHPSEMDRGYPEFGFYLNRTGRAMIYS-CSWPV 195
Query: 172 ----GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
M P S I+ N WR DDI DSWA + +I D +V +A PG WND
Sbjct: 196 YQIYAGMAPNF-TSIIQSCNMWRNFDDIQDSWASVESIIDYYGNNQDVIVPNAGPGHWND 254
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
PDML +GN G+ Y + +IWAI APLL+ D+ + +E I+ N+++I+V+Q
Sbjct: 255 PDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRTIRSEYKAILQNKKIISVDQ 311
>gi|297204332|ref|ZP_06921729.1| secreted alpha-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197716866|gb|EDY60900.1| secreted alpha-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 655
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 171/380 (45%), Gaps = 70/380 (18%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN F +I+ +I+ D + S L + + + ++ + + +
Sbjct: 56 PMGWASWNSFAAKIDYSVIKAQVDAFVASG------LPAAGYKYINIDEGWWQGTRDSA- 108
Query: 89 GNLVANKTTFPSGTKALADYVH---------------------------ILISNSLDNRI 121
GN+ +++ +P G KA+ADY+H S S +
Sbjct: 109 GNITVDESEWPGGMKAIADYIHGKGLKAGIYTDAGKDGCGYYYPTGRPAAPGSGSEGHYD 168
Query: 122 ESNIAFEQ-GIDYLKYDNCFND----DTRPTIRYRAMSDALKKA----GRPIFFSMCEWG 172
+ + F Q G D++K D C D D + T Y+++SDA+ +A GRP+ S+C WG
Sbjct: 169 QDMLQFSQWGFDFVKVDWCGGDAEGLDAKTT--YQSISDAITRATATTGRPMTLSLCNWG 226
Query: 173 DMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
P W WRT DDI S +LT D N ++ G +NDPDML V
Sbjct: 227 RQNPWNWAPGQGAMWRTNDDIIIYGNKPSMTNLLTNYDRN-LHPTAQHTGYYNDPDMLMV 285
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
G G + H ++WA+S APLL G ++ +T ET I+ N EV+AV+Q
Sbjct: 286 GMDGFTAAQNRTHMNLWAVSGAPLLAGNNLATMTTETANILKNPEVVAVDQDPRGLQGVK 345
Query: 279 ---AINDQQIWAGPLS--GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----LFEHRT 329
Q++ LS GNR VVLL +R + + +T W D+G+ + N L+
Sbjct: 346 VAEDTTGLQVYGKVLSGTGNRAVVLL-NRTSAAQNITVRWSDLGLTNANATVRDLWARSN 404
Query: 330 LETKFQGKLTAKVDGHSCKM 349
L + G T+ G S +
Sbjct: 405 LGSYGTGYTTSVPAGGSVML 424
>gi|405970312|gb|EKC35227.1| Alpha-N-acetylgalactosaminidase [Crassostrea gigas]
Length = 545
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 101/328 (30%), Positives = 144/328 (43%), Gaps = 72/328 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGLG TPPMGWNSW F C ++ + ++C GE ++ + + M + A
Sbjct: 17 LDNGLGRTPPMGWNSWERFRCNVDCD--NDPENCIGE-----KLYMQMADRMAADGYKDA 69
Query: 80 AKPLPN--QC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ 129
N C G L A+ FPSG KAL DY+H S L I + +
Sbjct: 70 GYEYVNVDDCWMAKERGPDGKLQADPKRFPSGMKALGDYIH---SKGLKFGIYEDFGTQT 126
Query: 130 ---------------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
G+D LK D C+++ T Y M L K GRPI +S
Sbjct: 127 CGGFPGSKFFMETDAQTFADWGVDLLKLDGCYSNIEDMTSGYPIMEFFLNKTGRPILYS- 185
Query: 169 CEWGDMRPALWGSNIR-----------NSWRTTDDISDSWARMLTIADMNEV----YADH 213
C W PA + + + N WR DDI DSW + I + + +
Sbjct: 186 CSW----PAYFVAYKKIPDYKAIAKSCNIWRNYDDIQDSWDSVTEIVNYYAKNKGNFFEV 241
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A PG +NDPDML +GN G+ ++ ++WAI APLL+ D+ + + I+ N++V
Sbjct: 242 AGPGSFNDPDMLIIGNFGLSRDQQRAQMAMWAIMAAPLLMSADLRKMDPYSKSILLNKDV 301
Query: 274 IAVNQ-----------AINDQQIWAGPL 290
IA+NQ +N QIW P+
Sbjct: 302 IAINQDPMGQPGSIILDMNQIQIWFRPV 329
>gi|270010256|gb|EFA06704.1| hypothetical protein TcasGA2_TC009635 [Tribolium castaneum]
Length = 432
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 141/337 (41%), Gaps = 70/337 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL LTPPMGW W F C+ + + + DC E+ + S D+ +
Sbjct: 26 LDNGLALTPPMGWLHWERFRCETDCR--QFPFDCISENLFKAMADKMASDGYLDAGYEYV 83
Query: 80 AK-----PLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSL 117
+ +G L A+ FPSG KALADYVH SL
Sbjct: 84 IMDDCWLAMDRDSEGRLQADPDRFPSGIKALADYVHAKGLKFGIYEDYGTKTCAGYPGSL 143
Query: 118 DN-RIESNIAFEQGIDYLKYDNC-FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
D+ I++ E G+DYLK D C D + M+ L + GR I FS CE+
Sbjct: 144 DHLEIDAKTFAEWGVDYLKMDGCNVTPDEAMEAGHLEMARYLNETGREIVFS-CEF---- 198
Query: 176 PALWGSNIRNS------------------------WRTTDDISDSWARMLTIADM----N 207
P G + NS WR +DI DSW + I +
Sbjct: 199 PLYRGDKVINSIMKFDKKLIVFQANYSVAIEACNLWRNYNDIEDSWVSVTNIVNHYKKNQ 258
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
+ Y A PG WNDPDML +GN G+ ++ +IWAI APL++ D+ + E I
Sbjct: 259 DKYVAVAGPGHWNDPDMLIIGNFGLSLDQSKAQMTIWAIWAAPLIMSVDLRTIKPEFKEI 318
Query: 268 IGNEEVIAVNQ-----------AINDQQIWAGPLSGN 293
+ N+ I +NQ IND +W P+S N
Sbjct: 319 LLNKHAIKINQDPLGIQGHLQTTINDVDVWLKPISPN 355
>gi|195473523|ref|XP_002089042.1| GE26351 [Drosophila yakuba]
gi|194175143|gb|EDW88754.1| GE26351 [Drosophila yakuba]
Length = 413
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 137/302 (45%), Gaps = 51/302 (16%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
N L NGL TPPMGW SW F C N + + D+C E+ + + + D
Sbjct: 16 NCLDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISENLFQTMTDIVVA----DGYAS 69
Query: 78 IAAKPLP-NQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN---- 124
+ + + + C G LVA++ FP+G KAL+DY+H S L I +
Sbjct: 70 VGYEYINIDDCWLEKHRSHDGKLVADRKRFPNGIKALSDYIH---SRGLKFGIYEDYGNY 126
Query: 125 --------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
I +E+ +DY+K D C+ Y L G+ + +S
Sbjct: 127 TCAGYPGIIGYEEKDALQFADWNVDYVKLDGCYALPYDMDHGYSTFGRLLNSTGKSMVYS 186
Query: 168 MCEW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKP 216
C W ++P N WR DDI DSWA + I D ++ A +A P
Sbjct: 187 -CSWPVYQIYAGIQPNYSAIQTHCNLWRNYDDIQDSWASVENIIDYYGNNQDLIAPNAGP 245
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV
Sbjct: 246 GHWNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKQILQNRKIIAV 305
Query: 277 NQ 278
+Q
Sbjct: 306 DQ 307
>gi|115387155|ref|XP_001211083.1| predicted protein [Aspergillus terreus NIH2624]
gi|114195167|gb|EAU36867.1| predicted protein [Aspergillus terreus NIH2624]
Length = 409
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 152/363 (41%), Gaps = 104/363 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC----------------QINEKIIRET----DDCWGESSR 59
L++ LGLTP MGW+SWN C ++ K + T DDCW R
Sbjct: 28 LNDSLGLTPHMGWSSWNVAQCDAASETYALGAANKFIELGLKDLGYTYVNIDDCWSTKER 87
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
D+ GNLV + +P+G +A+AD +H L
Sbjct: 88 DAS--------------------------GNLVPDPAKWPNGIRAVADQIHGLGLKFGLY 121
Query: 113 -----------ISNSLDNRIESNIAFEQGIDYLKYDNCFND---------------DTRP 146
+ ++ + G+D+ K+DNC+ DTRP
Sbjct: 122 GCAGEMTCAGYPGSQGHEAQDAELLASWGVDFWKHDNCYTPCVSDPPPQTCWEASVDTRP 181
Query: 147 TIRYRAMSDALKKAG--RPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
Y M DA K +PI ++C+WG WG+ +SWR D WA ++ I
Sbjct: 182 --WYATMHDAFKSVTSVKPILLNICQWGRNEVWTWGAQYGHSWRIEADNWGDWASVVRIG 239
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
++ PGG+ND DML VGNG + + +HF +WAI+K+PL++G D+ ++ +
Sbjct: 240 AKASEIPQYSGPGGFNDLDMLFVGNGVLTEAQERLHFGLWAIAKSPLVIGADLNTISESS 299
Query: 265 MPIIGNEEVIAVNQ-------------------AINDQQIWAGPLSGNRIVVLLESR--K 303
+ I+ N+++I++NQ A WAGPLS ++ +
Sbjct: 300 LNILKNKDIISINQDELGIAAGIFQPPNAPAPVAGEIYPYWAGPLSDGVVIAFAAASGGG 359
Query: 304 TFS 306
TFS
Sbjct: 360 TFS 362
>gi|294675015|ref|YP_003575631.1| alpha-galactosidase/carbohydrate binding protein [Prevotella
ruminicola 23]
gi|294473546|gb|ADE82935.1| alpha-galactosidase/carbohydrate binding module, family 6 domain
protein [Prevotella ruminicola 23]
Length = 541
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 163/378 (43%), Gaps = 59/378 (15%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN F I+E II+ D + + + D Q P
Sbjct: 37 PTMGWSSWNTFAVNISEDIIKGQADAMVNQGLKA---VGYQYINIDDGFQYGRTP----- 88
Query: 88 QGNLVANKTTFPSGTKALADYVHI-------------LISNSLDN--------------R 120
+G + + FP+G K ++DY+H L S+ N +
Sbjct: 89 EGKVCIHPERFPNGLKVVSDYIHSKGLKAGIYSDAGDLTCGSISNGDVRNTNVGLYGYEQ 148
Query: 121 IESNIAF-EQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
++++ F E D++K D C + +Y +S+A+K GR + F++C W P
Sbjct: 149 VDADFYFKELEFDFIKVDYCGGNHMSLNEQEQYTRISNAIKNTGRDVVFNICRW--RYPG 206
Query: 178 LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
W NSWRTT DI +SW + + + N + + +ND DMLEVG + E
Sbjct: 207 DWCHYAANSWRTTGDIHESWQSVKDLVNENLYMSAYCYNDTYNDMDMLEVGRS-LTAEED 265
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWA---------- 287
HF +W I +PLL+GC++ + + ++ N+E+IA+NQ + Q +
Sbjct: 266 KTHFGLWCIMASPLLIGCNMATINERALELLKNKELIALNQDLLHLQAYVIQHIGETYVL 325
Query: 288 ----GPLSGNRIVVLLESRKTFSSMMTAHWDDI---GIKSNNTLFEHRTLETKFQGKLTA 340
L GN V L + M + ++ G S LFEH+ L K Q LT
Sbjct: 326 VKDLLKLHGNTRAVALYNPSDKPVEMCVDFSELELGGKVSVRDLFEHKDL-GKMQNSLTV 384
Query: 341 KVDGHSCKMYELTPVTKL 358
+V H+ ++Y L +L
Sbjct: 385 QVPAHATRIYRLKGEKRL 402
>gi|290958857|ref|YP_003490039.1| hydrolase [Streptomyces scabiei 87.22]
gi|260648383|emb|CBG71494.1| putative secreted hydrolase [Streptomyces scabiei 87.22]
Length = 647
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 156/362 (43%), Gaps = 101/362 (27%)
Query: 29 PMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSF 69
PMGW SWN F +++ +I++ D+ W + +RDS
Sbjct: 49 PMGWASWNAFAAKVDYDVIKKQVDAFVAAGLPEAGYEYINIDEGWWQGTRDSA------- 101
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------- 110
GN+ ++ +P G KA+ADY+H
Sbjct: 102 -------------------GNITVDEAEWPGGMKAIADYIHSKGLKAGIYTDAGKDGCGY 142
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKK 159
S S + ++ F Q G D++K D C D R P Y+A+SDA+
Sbjct: 143 YFPTGRPAAPGSGSEGHYVQDMTRFSQWGFDFVKVDWCGGDAERLDPKTTYQAISDAVAT 202
Query: 160 A----GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVY 210
A GRP+ S+C WG P W + WRT DI ++A MLT D N V+
Sbjct: 203 ATATTGRPLALSLCNWGYSNPWNWAPGMGPMWRTNTDIFYHGQGLTYAGMLTNYDRN-VH 261
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
G +NDPDML +G G + + H ++WAIS APLL G D+ +T ET I+ N
Sbjct: 262 PTAQHTGYYNDPDMLMIGMG-LTAAQDRTHMNLWAISGAPLLAGNDLSTMTTETANILKN 320
Query: 271 EEVIAVNQ------------AINDQQIWAGPLS--GNRIVVLLESRKTFSSMMTAHWDDI 316
EVIAV+Q Q++ LS GNR VVLL +R + + MT W D+
Sbjct: 321 SEVIAVDQDPRGLQGVKVAEDTTGLQVYGKVLSGTGNRAVVLL-NRTSTTQNMTVRWSDL 379
Query: 317 GI 318
G+
Sbjct: 380 GL 381
>gi|410989030|ref|XP_004000772.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase A [Felis catus]
Length = 483
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 145/341 (42%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL +TP MGW W F C I+EK+ +
Sbjct: 32 LDNGLAMTPTMGWLHWERFMCNVDCQEEPDSCISEKLFMQMAELMDSDGWKDVGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LA+YVH
Sbjct: 92 DDCWMAPKRDSK--------------------------GRLQADPKRFPGGIQGLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGYYDIDAQTFADWGVDLLKFDGCHCDSMEHLENGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N IR N WR DD+ DSW + I D
Sbjct: 186 HMSLALNRTGRSIVYS-CEWPLYMWPFRKPNYTEIRQYCNHWRNFDDVYDSWQSIKNILD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ D A PGGWNDPDML +GN G+ +++ V ++WAI APLL+ D+ +++
Sbjct: 245 WTSSNQKRIIDVAGPGGWNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS 291
+ ++ +++VIA+NQ +Q +W PLS
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYRLRKEDNIEVWERPLS 345
>gi|21064529|gb|AAM29494.1| RE47112p [Drosophila melanogaster]
Length = 427
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 135/300 (45%), Gaps = 51/300 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C N + + D+C E + + + D +
Sbjct: 18 LDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISEQLFQTMTDIVVA----DGYASVG 71
Query: 80 AKPL-PNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------ 124
+ + + C G LVA++ FP+G KAL+DY+H S L I +
Sbjct: 72 YEYINIDDCWLEKHRSHDGKLVADRKRFPNGIKALSDYIH---SRGLKFGIYEDYGNYTC 128
Query: 125 ------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
I +E+ +DY+K D C+ Y L G+ + +S C
Sbjct: 129 AGYPGIIGYEEKDALQFADWNVDYVKLDGCYALPYDMDHGYSTFGRLLNSTGKSMVYS-C 187
Query: 170 EW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W ++P N WR DDI DSWA + I D +V A +A PG
Sbjct: 188 SWPVYQIYAGIQPNYSAIQTHCNLWRNYDDIQDSWASVENIIDYYGNNQDVIAPNAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV+Q
Sbjct: 248 WNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKHILQNRKIIAVDQ 307
>gi|24583253|ref|NP_609354.1| CG5731 [Drosophila melanogaster]
gi|22946094|gb|AAF52871.2| CG5731 [Drosophila melanogaster]
gi|201065479|gb|ACH92149.1| FI01815p [Drosophila melanogaster]
Length = 413
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 135/300 (45%), Gaps = 51/300 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C N + + D+C E + + + D +
Sbjct: 18 LDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISEQLFQTMTDIVVA----DGYASVG 71
Query: 80 AKPL-PNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------ 124
+ + + C G LVA++ FP+G KAL+DY+H S L I +
Sbjct: 72 YEYINIDDCWLEKHRSHDGKLVADRKRFPNGIKALSDYIH---SRGLKFGIYEDYGNYTC 128
Query: 125 ------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
I +E+ +DY+K D C+ Y L G+ + +S C
Sbjct: 129 AGYPGIIGYEEKDALQFADWNVDYVKLDGCYALPYDMDHGYSTFGRLLNSTGKSMVYS-C 187
Query: 170 EW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W ++P N WR DDI DSWA + I D +V A +A PG
Sbjct: 188 SWPVYQIYAGIQPNYSAIQTHCNLWRNYDDIQDSWASVENIIDYYGNNQDVIAPNAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV+Q
Sbjct: 248 WNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKHILQNRKIIAVDQ 307
>gi|195339677|ref|XP_002036443.1| GM11946 [Drosophila sechellia]
gi|195578061|ref|XP_002078884.1| GD22296 [Drosophila simulans]
gi|194130323|gb|EDW52366.1| GM11946 [Drosophila sechellia]
gi|194190893|gb|EDX04469.1| GD22296 [Drosophila simulans]
Length = 413
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 135/300 (45%), Gaps = 51/300 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C N + + D+C E + + + D +
Sbjct: 18 LDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISEQLFQTMTDIVVA----DGYASVG 71
Query: 80 AKPL-PNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------ 124
+ + + C G LVA++ FP+G KAL+DY+H S L I +
Sbjct: 72 YEYINIDDCWLEKHRSHDGKLVADRKRFPNGIKALSDYIH---SRGLKFGIYEDYGNYTC 128
Query: 125 ------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
I +E+ +DY+K D C+ Y L G+ + +S C
Sbjct: 129 AGYPGIIGYEEKDALQFADWNVDYVKLDGCYALPYDMDHGYSTFGRLLNSTGKSMVYS-C 187
Query: 170 EW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W ++P N WR DDI DSWA + I D +V A +A PG
Sbjct: 188 SWPVYQIYAGIQPNYSAIQTHCNLWRNYDDIQDSWASVENIIDYYGNNQDVIAPNAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV+Q
Sbjct: 248 WNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKHILQNRKIIAVDQ 307
>gi|195053245|ref|XP_001993537.1| GH13861 [Drosophila grimshawi]
gi|193900596|gb|EDV99462.1| GH13861 [Drosophila grimshawi]
Length = 413
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 144/327 (44%), Gaps = 68/327 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS----------F 69
L NGL TPPMGW SW F C N + + D+C E L S
Sbjct: 18 LENGLARTPPMGWLSWERFRC--NTDCVNDPDNCISERLFQVMADLLVSEGYASVGYEYV 75
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN----- 124
++ D + P G L+A+ FP G K+L+DY+H S L I +
Sbjct: 76 NIDDCWLERHRAP-----DGKLLADHKRFPHGIKSLSDYIH---SRGLKFGIYEDYGNFT 127
Query: 125 -------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
I FE+ +DY+K D C+ Y L G+P+ +S
Sbjct: 128 CAGYPGIIGFEKLDAHQFAEWNVDYVKLDGCYALPYDMDRGYSEFGRHLNSTGKPMVYS- 186
Query: 169 CEW------GDMRPALWGSNIR---NSWRTTDDISDSWARMLTIADM----NEVYADHAK 215
C W ++P S I+ N WR DDI DSWA + I D +V A +A
Sbjct: 187 CSWPVYQIYAGIQPNF--SAIQSHCNLWRNFDDIQDSWASVENIIDYYGNNQDVIAPNAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ Y + F++WAI APLL+ D+ + E I+ N ++I+
Sbjct: 245 PGHWNDPDMLIIGNFGLSYEQAKTQFAVWAILAAPLLMSVDLRTIRPEFKEILLNRKIIS 304
Query: 276 VNQ---AINDQ--------QIWAGPLS 291
V+Q I + +IW+ P+S
Sbjct: 305 VDQDPLGIQGRRIYKHKGIEIWSKPIS 331
>gi|320164692|gb|EFW41591.1| alpha-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 402
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 139/324 (42%), Gaps = 53/324 (16%)
Query: 20 LSNGLGLTPPMGWNSW------NHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFD 73
L NGL P MGWN+W F E+I++ D +D L + D
Sbjct: 25 LDNGLAELPVMGWNTWCTDDYCGLFDFCYEEEIMQMADAMVSSGMQD----LGYQYINLD 80
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKA-LADYVHILI-----------SNSLDNRI 121
P G L A+ + FPSG+ A + YVH L R
Sbjct: 81 DCWGG-----PRSANGTLTADTSRFPSGSLAKVTSYVHSLGLKMGLYLCAGNETCKYKRP 135
Query: 122 ESNIAFEQ--------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
S F+Q GID++K D C + + P Y M D+L K GRPI FS CEWG+
Sbjct: 136 GSWGYFDQDAQTVADWGIDFVKLDWCNHPNLPPPTVYGMMRDSLNKTGRPIVFSACEWGE 195
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA---DHAKPGGWNDPDMLEVGNG 230
P WG N WR D W A++ + A +A PGGWNDPD L
Sbjct: 196 DEPWTWGMETANMWRVHKDHLPLWGSEQGTANIIQSMAHLSKYAGPGGWNDPDFLMTMLP 255
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------AIN 281
+ E F+ WA+ APL++ D+ N+T I+ N EVIA+NQ A+N
Sbjct: 256 PLTERESRTEFAFWALFAAPLIVATDIRNMTDVKQSILLNPEVIAINQDAFAIAGDIALN 315
Query: 282 D---QQIWAGPLS---GNRIVVLL 299
+ Q+W PLS NR V+L
Sbjct: 316 NTDGTQVWTKPLSPSVDNRYAVIL 339
>gi|328704372|ref|XP_001944031.2| PREDICTED: alpha-N-acetylgalactosaminidase-like [Acyrthosiphon
pisum]
Length = 435
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 136/300 (45%), Gaps = 51/300 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL LTPPMGW +W + C + + D+C E L S D+ +
Sbjct: 20 LDNGLALTPPMGWLAWQRYRCITDCETY--PDECVSEKLFMKAADLLVSEGYADAGYKYL 77
Query: 80 AKPLPNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLD------------- 118
+ + C G L A+KT FPSG KALADYVH S L
Sbjct: 78 ---IVDDCWLAKNRSADGKLEADKTRFPSGIKALADYVH---SKGLKFGLYQDWGEKTCA 131
Query: 119 --------NRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+++ E +DY+K D C+++ Y L + GRP+ +S C
Sbjct: 132 GYPGVRGKEEMDAKQFAEWEVDYVKLDGCYSNVRDMDRGYVEFGQHLNQTGRPMVYS-CS 190
Query: 171 WGDMRP--------ALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W + AL + N WR DI DSW + I+D E +A +A PG
Sbjct: 191 WPAYQEDKGMRINYALMAKHC-NLWRNYGDIDDSWTSVTAISDYFAMKQEFWAQYAGPGH 249
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ +++ +IW + APL++ ++ N+ E I+ N+E+I ++Q
Sbjct: 250 WNDPDMLLIGNFGLSFDQSKSQMAIWTVLAAPLMMSNNLSNVCPEFKEILQNKEIIKISQ 309
>gi|409198456|ref|ZP_11227119.1| alpha-galactosidase [Marinilabilia salmonicolor JCM 21150]
Length = 412
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 164/382 (42%), Gaps = 74/382 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD----SQISLSCSFHMFDSLTQIAAKP 82
TP MGW+SWN++H I+E+II+ D S S I++ F F +
Sbjct: 30 TPIMGWSSWNNYHVNISEEIIQSQADAMVASGMKEAGYSYINIDDGF--FGGRDE----- 82
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF--------------- 127
G L+A+K FPSG KALADY+H L I S+
Sbjct: 83 -----NGRLLAHKKRFPSGMKALADYIH---KKGLKAGIYSDAGINTCASYWDQDTIGSG 134
Query: 128 ----------------EQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMC 169
E G D++K D C D RY + ++ + +++C
Sbjct: 135 MGLYGHEWADLNLMLKEWGYDFIKIDWCGGQWLDLDEETRYSLLGKMIRSIRPDVVYNIC 194
Query: 170 EWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
W P W +SWR + DI++++ ++ I D N ++ PG ND DML+VG
Sbjct: 195 RW--EFPGKWALTTADSWRISGDITNTFESVMHIVDKNADLWKYSGPGHVNDMDMLQVGR 252
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI--------- 280
G M Y E HFS+W + +PLL G D+ ++ ET+ I+ N+E+I++NQ
Sbjct: 253 G-MSYEEDKTHFSMWCMLNSPLLAGNDLAKMSNETLEILTNKELISINQDPLVYQARKLE 311
Query: 281 --NDQQIWAGPLSGN---RIVVLLESRKTFSSMMTAHWDDIGIKSNNTL-----FEHRTL 330
D ++WA PL ++ V L +R + +T + + I +EH T
Sbjct: 312 DHGDLELWARPLLSTISGKVAVALLNRSNQTQTITIELEKVAIMPGEAYTVRDCWEHTTS 371
Query: 331 ETKFQGKLTAKVDGHSCKMYEL 352
+ L+ V H + ++
Sbjct: 372 SLTRKPTLSYDVPPHGVVVLKI 393
>gi|118790535|ref|XP_318652.3| AGAP009621-PA [Anopheles gambiae str. PEST]
gi|116117992|gb|EAA14548.3| AGAP009621-PA [Anopheles gambiae str. PEST]
Length = 432
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 132/318 (41%), Gaps = 87/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW SW F C I+E + R
Sbjct: 24 LENGLARTPPMGWLSWERFRCNTDCEGDPENCISEHLFRTMADLVVSEGYAAVGYEYINV 83
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E SR + G LVA++ FPSG KALA+YVH
Sbjct: 84 DDCWLEKSRGPR--------------------------GELVADRRRFPSGMKALANYVH 117
Query: 111 I--LISNSLDNRIESNIAFEQGI----------------DYLKYDNCFNDDTRPTIRYRA 152
L ++ A GI DY+K D C++ Y
Sbjct: 118 AKGLKFGIYEDYGNYTCAGYPGILGFSANDAAQFASWDVDYVKLDGCYSLPIDMDHGYPE 177
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
L GRP+ +S C W M P + S I+ N WR DDI DSWA + +I
Sbjct: 178 FGRNLNATGRPMIYS-CSWPVYQIYAGMNPN-YSSIIQHCNLWRNYDDIQDSWASLESII 235
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D + +A PG WNDPDML +GN G+ Y + ++WAI APL++ D+ +
Sbjct: 236 DYYGNNQDAIIPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAIMAAPLMMSVDLRTI 295
Query: 261 TAETMPIIGNEEVIAVNQ 278
E I+ N ++IAV+Q
Sbjct: 296 RPEFKAILQNRKIIAVDQ 313
>gi|361125706|gb|EHK97738.1| putative alpha-galactosidase B [Glarea lozoyensis 74030]
Length = 422
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/312 (29%), Positives = 132/312 (42%), Gaps = 65/312 (20%)
Query: 26 LTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
+ P +GWNSWN F+C I E KI+ +D+ ++ + A K
Sbjct: 20 ILPALGWNSWNAFYCDITEDKIMTAAKQVVSLGLKDA------GYNYINVDDCWAVKDGR 73
Query: 85 NQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIESNIAF 127
++ G ++ + FP+G + LAD +H L SLD F
Sbjct: 74 DKVTGKIIPDPVKFPTGIRGLADKIHGLGLKVGIYSSAGTKTCAGYPASLDKEGLDAATF 133
Query: 128 EQ-GIDYLKYDNCFNDD------------------------------TRPT--------- 147
GIDYLKYDNC+ D T P+
Sbjct: 134 ASWGIDYLKYDNCYVPDNWQDPYDFCVPDDYRAYGPYHNGTCGQSARTPPSGYNWTTSLE 193
Query: 148 -IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM 206
RY+ MSDAL R I +S+C WG WG + +SWR T DIS W R+ I +
Sbjct: 194 YKRYKQMSDALLAQNRTILYSLCNWGHAAVEEWGHQLGSSWRMTGDISPQWWRVSEILNQ 253
Query: 207 NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMP 266
+ +A ND DML +GNG + E HF+ WA K+PL++G + L AE +
Sbjct: 254 MSFKSAYADFWEHNDADMLHIGNGNLTMEESRSHFAFWAAIKSPLIIGTALDRLAAEKVA 313
Query: 267 IIGNEEVIAVNQ 278
I+ N ++A +Q
Sbjct: 314 ILKNAHLLAFHQ 325
>gi|157109025|ref|XP_001650490.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Aedes aegypti]
gi|108879142|gb|EAT43367.1| AAEL005188-PA [Aedes aegypti]
Length = 426
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 137/343 (39%), Gaps = 102/343 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TPPMGW SW F C I+E++ +
Sbjct: 18 LENGLARTPPMGWMSWERFRCITDCTKYPDECISEQLFKRMADLMVSEGYLQAGYNFVNI 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RDS G LV +K FP+G K LADY+H
Sbjct: 78 DDCWSEMKRDS--------------------------SGRLVPDKKRFPNGIKHLADYIH 111
Query: 111 ILISNSLDNRIESNIA-------------FEQ--------GIDYLKYDNCFNDDTRPTIR 149
S L + +I FE +D++K D C+ D+ +
Sbjct: 112 ---SKGLKFGLYQDIGTKTCAGYPGMKDYFEVDAQTFADWDVDFIKIDGCYADERKMVDD 168
Query: 150 YRAMSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
Y D + K GRPI +S C W G + N WR DI DS + + +
Sbjct: 169 YILFGDLMNKTGRPILYS-CSWPVYQEYNGIIPDYEILKKTCNMWRNWGDIDDSHSSVES 227
Query: 203 I----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I +D H+ PG WNDPD L +GN G+ Y + ++W + APLLL D+
Sbjct: 228 ITRYFSDNQARIQPHSGPGHWNDPDTLVLGNFGLSYEQSKSQLAVWTVLAAPLLLSNDLA 287
Query: 259 NLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPL 290
+T E ++ N+E+IA NQ IN +IW P+
Sbjct: 288 TVTPEIRELLLNKEIIAANQDPLGIQGLLVKTINRIEIWKRPI 330
>gi|332374436|gb|AEE62359.1| unknown [Dendroctonus ponderosae]
Length = 407
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 145/335 (43%), Gaps = 61/335 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL LTPPMGW SW F C ++ + D+C E+ + D + +
Sbjct: 19 LDNGLALTPPMGWLSWERFRCLVDCDLY--PDECISEALFKRTADRLASDGYLEAGYEYI 76
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
+ D + N+ L A+ FP+G LA+YVH L
Sbjct: 77 IIDDCWADKQRDADNR----LRADPERFPNGIAPLAEYVHNLGLKMGIYGDYGTLTCGGY 132
Query: 114 SNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
S+D+ ++++ E GIDYLK D C+ Y MS L + GRPI FS C +
Sbjct: 133 PGSIDHLQLDAETFAEWGIDYLKLDGCYASTAGMEEGYAKMSRYLNETGRPIVFS-CSF- 190
Query: 173 DMRPAL--------WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWN 220
PA +I N WR DDI DSW + I +D + A PG WN
Sbjct: 191 ---PAYEQLDTNYSRAVDICNLWRNYDDIDDSWESVSDIFQWFSDNQDYLRQFAGPGHWN 247
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA- 279
DPDML +GN G+ + +IW+I APL++ D+ + I+ N I +NQ
Sbjct: 248 DPDMLIIGNFGLSLEQSKAQMTIWSIMAAPLIMSVDLDTVKPRFKNILLNRNAIRINQDP 307
Query: 280 ----------INDQQIWAGPLSGNRIVVLLESRKT 304
I+D +W LSGN + + + S +
Sbjct: 308 LGIPGELVSNISDILLWRKSLSGNGVALGVVSTRV 342
>gi|395819650|ref|XP_003783194.1| PREDICTED: alpha-N-acetylgalactosaminidase [Otolemur garnettii]
Length = 411
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 140/338 (41%), Gaps = 67/338 (19%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL TPPMGW +W F C N K + +C IS M D L Q +
Sbjct: 20 NGLLRTPPMGWLAWERFRCNTNCK--EDPKNC---------ISERLFLEMADRLAQDGWR 68
Query: 82 PLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
L G LV + FP G LADY H L
Sbjct: 69 DLGYIYLNMDDCWIGGRDASGRLVPDPKRFPHGIAFLADYAHSLGLKLGIYGDMGNLTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
+LD + F + +D LK D CF+ Y M+ AL GRPI +S C
Sbjct: 129 GYPGTTLDKVVLDAQTFADWKVDMLKLDGCFSTPEERAEGYPKMAAALNATGRPIAYS-C 187
Query: 170 EW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGG 218
W G + P + S I N WR DDI DSW +L+I D + A PG
Sbjct: 188 SWPAYEGGLPPRVNYSLLAEICNLWRNYDDIQDSWGSVLSILDWFVEHQNILQPVAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ ++ ++W + APL + D+ +T + M I+ N +I +NQ
Sbjct: 248 WNDPDMLLIGNFGLSLDQAQAQMALWTVLAAPLFMSTDLRTITPQNMDILQNPLIIRINQ 307
Query: 279 ---AINDQQI---------WAGPLSGNRIVVLLESRKT 304
I ++I + PLS ++ SR+T
Sbjct: 308 DRLGIQGRRIRKEKSHIEVFVKPLSNEASAIVFFSRRT 345
>gi|74008237|ref|XP_538109.2| PREDICTED: alpha-galactosidase A [Canis lupus familiaris]
Length = 420
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 141/317 (44%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL +TP MGW W F C I+EK+ +
Sbjct: 32 LDNGLAMTPTMGWLHWERFMCNVDCQEEPDSCISEKLFMQMAELMDSDGWKDAGYKYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP+G ++LA+YVH
Sbjct: 92 DDCWMAPKRDSK--------------------------GRLQADPKRFPNGIQSLANYVH 125
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C+ D Y+
Sbjct: 126 SKGLKLGIYADVGNKTCAGFPGSFGHYDIDAKTFADWGVDLLKFDGCYCDSVEHLENGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTI-- 203
MS AL + GR I +S CEW N IR N WR D+ DSW + +I
Sbjct: 186 RMSLALNRTGRSIVYS-CEWPLYMWPFHEPNYTEIRQYCNHWRNFQDVYDSWQSIKSILA 244
Query: 204 -ADMNEVYADH-AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
N+ H A PGGWNDPDML +GN G+ +++ V ++WAI APLL+ D+ +++
Sbjct: 245 WTSSNQKRIVHVAGPGGWNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRHIS 304
Query: 262 AETMPIIGNEEVIAVNQ 278
+ ++ N++VIA+NQ
Sbjct: 305 PQAKALLQNKDVIAINQ 321
>gi|52695597|pdb|1SZN|A Chain A, The Structure Of Alpha-Galactosidase
gi|56553811|pdb|1T0O|A Chain A, The Structure Of Alpha-Galactosidase From Trichoderma
Reesei Complexed With Beta-D-Galactose
Length = 417
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 154/390 (39%), Gaps = 125/390 (32%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISL 65
G P +GWNSWN +HC I+E DDCW S +D ++
Sbjct: 9 GKVPSLGWNSWNAYHCDIDESKFLSAAELIVSSGLLDAGYNYVNIDDCW--SMKDGRV-- 64
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNS 116
G++ N T FP G LA VH L +
Sbjct: 65 ----------------------DGHIAPNATRFPDGIDGLAKKVHALGLKLGIYSTAGTA 102
Query: 117 LDNRIESNIAFEQ---------GIDYLKYDNC-----FNDD------------------- 143
+++ +E G+DYLKYDNC + D+
Sbjct: 103 TCAGYPASLGYEDVDAADFADWGVDYLKYDNCNVPSDWQDEYVACNPDFVKTGPNGTCTT 162
Query: 144 -TRPTI--------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
PT+ R+ AM +AL K I SMC WG WG++ SWR
Sbjct: 163 ALDPTLAPPGYDWSTSKSAERFGAMRNALAKQSHEIVLSMCIWGQADVFSWGNSTGISWR 222
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK 248
+DDIS +W + I ++N + G ND DMLEVGNG + E HF++WA K
Sbjct: 223 MSDDISPNWGSVTRILNLNSFKLNSVDFWGHNDADMLEVGNGNLTAAETRTHFALWAAMK 282
Query: 249 APLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------------AINDQ---------QIW 286
+PLL+G D+ L+ + ++ N+ ++A NQ IN + W
Sbjct: 283 SPLLIGTDLAQLSQNNINLLKNKHLLAFNQDSVYGQPATPYKWGINPDWTFNVTYPAEFW 342
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
AGP S +V+++ + ++ A W++I
Sbjct: 343 AGPSSKGHLVLMVNTLD-ITATKEAKWNEI 371
>gi|443727169|gb|ELU14039.1| hypothetical protein CAPTEDRAFT_154474 [Capitella teleta]
Length = 405
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 135/296 (45%), Gaps = 43/296 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES-SRDSQISLSCSFHMFDSLTQI 78
L NGL TPPMGW W F C N + D C GE +D +S ++ +
Sbjct: 19 LDNGLARTPPMGWLDWERFRC--NTDCVNYPDTCIGEKLFKDMADEMSSGGYLEAGYEYV 76
Query: 79 -------AAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR------ 120
A + P+ G L A+ FPSG K LADYVH + I +
Sbjct: 77 NIDDCWLAKERGPD---GRLRADPDRFPSGIKGLADYVHSKGLKLGIYEDFGTKTCAGYP 133
Query: 121 -------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW-- 171
+++ + G+DYLK D C++D + Y AMS L + G PI +S C W
Sbjct: 134 GSEYYLQMDAQTFADWGVDYLKLDGCYSDPHQYKDAYPAMSFWLNQTGVPILYS-CSWPA 192
Query: 172 -----GDMRPALWGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
GD + N WR DI DSW + +I D ++D A PG +NDP
Sbjct: 193 YVVGAGDTPEYPLIAKYCNVWRNYGDIQDSWDSVSSIITFYGDDKGNFSDVAAPGSFNDP 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
DML VGN G+ + V ++W I +PL++ D+ + E+ ++ N+ IA+NQ
Sbjct: 253 DMLIVGNFGLSRTQERVQMAMWCIMASPLIMSTDLRTINDESKALLLNKRAIAINQ 308
>gi|387789752|ref|YP_006254817.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
gi|379652585|gb|AFD05641.1| alpha-galactosidase [Solitalea canadensis DSM 3403]
Length = 407
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/339 (27%), Positives = 154/339 (45%), Gaps = 61/339 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN+F INE+II+ S D +S + +
Sbjct: 31 TPIMGWSSWNNFRININEEIIK--------SQADYMVSTGMKDAGYSYVNIDDGFFGGRD 82
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAF------------------- 127
G L+ + FP+G +++A+Y+H S L I S+
Sbjct: 83 GNGQLLVHPVRFPNGMQSIANYIH---SKGLKAGIYSDAGINTCGSYWDKDTISTGSGLF 139
Query: 128 ------------EQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
+ G D++K D C + + RY A+S ++ + +++C W
Sbjct: 140 GHDQQDLELMLNKWGYDFIKVDWCGGEWMELDEQTRYTAISTIIRAINPKVVYNICRW-- 197
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMK 233
P W ++I +SWR + DI++ + +L I D+N +A PG ND DML+VG GM
Sbjct: 198 KFPGAWATHIADSWRISGDINNQYESILKIIDLNADLWKYASPGHVNDMDMLQVGR-GMS 256
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------ND 282
Y E HFS+W + +PLL G D+ ++ +T+ I+ N+EVIA+NQ D
Sbjct: 257 YEEDKSHFSMWCMMNSPLLAGNDLRKMSKQTIEILTNKEVIALNQDPLVYQARRLVDNGD 316
Query: 283 QQIWAGPLSGN---RIVVLLESRKTFSSMMTAHWDDIGI 318
++WA PL G + + L +R + + + + +GI
Sbjct: 317 LELWAKPLYGTMSGMVAIALLNRSSKNDTIVLDLNAVGI 355
>gi|256394017|ref|YP_003115581.1| ricin B lectin [Catenulispora acidiphila DSM 44928]
gi|256360243|gb|ACU73740.1| Ricin B lectin [Catenulispora acidiphila DSM 44928]
Length = 558
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 160/346 (46%), Gaps = 66/346 (19%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
PP+GW SWN F QIN +I+ D S + + + + ++ + + +
Sbjct: 49 PPIGWASWNTFAAQINYNVIKGQADALASSG------MEAAGYQYVNIDEGWWQGTRD-A 101
Query: 88 QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN----------------------- 124
GN+ + +P G KA+ADY+H S L I ++
Sbjct: 102 SGNITVDSADWPGGMKAIADYIH---SKGLKAGIYTDAGKNGCGYYYPTGRPAAPGSGSE 158
Query: 125 -------IAFEQ-GIDYLKYDNCFNDDTRPTIR--YRAMSDALKKA----GRPIFFSMCE 170
+ F Q G DY+K D C + + Y+A+SDA+ +A GRP+ S+C+
Sbjct: 159 GHYDQDFLQFSQWGFDYVKVDWCGGNAEGLNAQNTYQAISDAIGRATAQTGRPMVLSICD 218
Query: 171 WGDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDML 225
WG+ P W + WRT+ DI + S +L D + A + PG +NDPDML
Sbjct: 219 WGNQSPWNWAPGMSALWRTSGDIIYYGQAPSMTNVLANFDAAQHPAAQS-PGHYNDPDML 277
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQ-- 283
G G + H S+WAIS APLL G ++ ++++T ++ N E IA++Q Q
Sbjct: 278 IAGMPGFTAAQNRTHLSLWAISGAPLLAGNNLSTMSSDTRAVLTNPEAIAIDQDSRGQQG 337
Query: 284 ----------QIWAGPLSGN-RIVVLLESRKTFSSMMTAHWDDIGI 318
Q+++ LSG+ R V+L +R ++ +TA W +G+
Sbjct: 338 VKVAEAQSGLQVYSKVLSGSGRRAVVLLNRTGSTATITAPWSALGL 383
>gi|74203157|dbj|BAE26260.1| unnamed protein product [Mus musculus]
Length = 421
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 142/341 (41%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TP MGW W F C I+E++ +
Sbjct: 34 LDNGLARTPTMGWLHWERFMCNLDCQEEPGACISEQLFMQMAELMVSDGWRDAGYDYLCI 93
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FPSG K LA+YVH
Sbjct: 94 DDCWMAPERDSK--------------------------GRLQADPQRFPSGIKHLANYVH 127
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YR 151
+ I + N+ I++ + G+D LK+D C D Y+
Sbjct: 128 SKGLKLGIYADVGNKTCAGFPGSFGSYDIDAQTFADWGVDLLKFDGCHCDSVVSLENGYK 187
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR------NSWRTTDDISDSWARMLTIAD 205
M+ AL + GR I +S CEW N N WR DD+ DSW + I
Sbjct: 188 YMALALNRTGRSIVYS-CEWPLYLRPFHKPNYTDIQYYCNHWRNFDDVYDSWESIKNILS 246
Query: 206 MNEVY----ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
VY + A PG WNDPDML +GN G+ +++ V ++WAI APLL+ D+ ++
Sbjct: 247 WTVVYQKEIVEVAGPGSWNDPDMLVIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQIS 306
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
++ ++ N++VIA+NQ N ++W PLS
Sbjct: 307 SQDKALLQNKDVIAINQDPLGKQGYCFRKENHIEVWERPLS 347
>gi|224539233|ref|ZP_03679772.1| hypothetical protein BACCELL_04135 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519156|gb|EEF88261.1| hypothetical protein BACCELL_04135 [Bacteroides cellulosilyticus
DSM 14838]
Length = 664
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 146/322 (45%), Gaps = 61/322 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G + + + + + LTPPMGWNSWN + ++++ +RE E
Sbjct: 254 FKVLLKAMNEKGVAEKEITLKIGDRIALTPPMGWNSWNCWGLSVDDEKVREAARMMNEKL 313
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ ++ D + A + QG L++N+ FP K LADY+H L
Sbjct: 314 H----AYGWTYVNIDDGWEAAER----TKQGELLSNEK-FPD-FKGLADYIHSLGLKFGI 363
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
+ + I++ G+DYLKYD+C D TIR Y M
Sbjct: 364 YSSPGPTTCGDYLGSYQHEEIDARTWGRWGVDYLKYDHCGYHAVQKDSEEKTIREPYIVM 423
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALW------GSNIRNSWRTTDDISDSWARMLTIADMN 207
DAL K R I + + G P +W G + WRTT DI+D W + I
Sbjct: 424 RDALDKVNRDIVYCV---GYGAPNVWNWAREAGGEL---WRTTRDITDEWNVVTAIGCFQ 477
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYV-----------VHFSIWAISKAPLLLGCD 256
+V A PG +NDPDML VG G + E V H S+W I APLL+GCD
Sbjct: 478 DVCAQATAPGNYNDPDMLVVGKLGKAWREKVHESALTPDEQYSHISLWCILSAPLLIGCD 537
Query: 257 VGNLTAETMPIIGNEEVIAVNQ 278
+ ++ T+ ++ N EVIAVNQ
Sbjct: 538 MSDIDDFTLSLLTNNEVIAVNQ 559
>gi|194859594|ref|XP_001969411.1| GG23972 [Drosophila erecta]
gi|190661278|gb|EDV58470.1| GG23972 [Drosophila erecta]
Length = 413
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 135/300 (45%), Gaps = 51/300 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C N + + D+C E + + + D +
Sbjct: 18 LDNGLAKTPPMGWLSWERFRC--NTDCVNDPDNCISEQLFQTMTDIVVA----DGYASVG 71
Query: 80 AKPL-PNQC--------QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN------ 124
+ + + C G LVA++ FP+G KAL+DY+H S L I +
Sbjct: 72 YEYINIDDCWLEKHRSHDGKLVADRKRFPNGIKALSDYIH---SRGLKFGIYEDYGNYTC 128
Query: 125 ------IAFEQ---------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
I +E+ +DY+K D C+ Y L G+ + +S C
Sbjct: 129 AGYPGIIGYEEKDALQFADWNVDYVKLDGCYALPYDMDHGYSTFGRLLNSTGKSMVYS-C 187
Query: 170 EW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGG 218
W ++P N WR DDI DSWA + I D ++ A +A PG
Sbjct: 188 SWPVYQIYAGIQPNYSAIQTHCNLWRNYDDIQDSWASVENIIDYYGNNQDLIAPNAGPGH 247
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y + F+IW+I APLL+ D+ + + I+ N ++IAV+Q
Sbjct: 248 WNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLRTIRPQFKHILQNRKIIAVDQ 307
>gi|384099498|ref|ZP_10000584.1| putative exported alpha-galactosidase [Imtechella halotolerans K1]
gi|383832846|gb|EID72316.1| putative exported alpha-galactosidase [Imtechella halotolerans K1]
Length = 421
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/408 (27%), Positives = 172/408 (42%), Gaps = 73/408 (17%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPP-MGWNSWNHFHCQINEKIIRETDDCWGES-- 57
LF+ + + + + N + + PP MGW SWNH+ I E II+ + E
Sbjct: 13 LFLTACATTSNPVVNSENKEAPAITFNPPIMGWASWNHYRINITEDIIKAQTNAITEKGL 72
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSL 117
++ ++ F Q G L+ +K FP+G K+LA Y+ S L
Sbjct: 73 AKAGYTYVNIDDGFFGGRDQ----------NGQLLHHKERFPNGMKSLASYIK---SKGL 119
Query: 118 DNRIESNIAF-------------------------------EQGIDYLKYDNCFND--DT 144
I ++ + G +++K D C +
Sbjct: 120 KPGIYTDAGINTCASYWDKDTIGVGMGLYGHEYDDLKLFLNDWGYEFIKVDWCGGEWLGL 179
Query: 145 RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
Y + + +K+ +++C W P W + I +SWR + DIS+ + +L I
Sbjct: 180 DEETSYTRIGNLIKQLKPTAIYNVCRW--KFPGKWVTQIADSWRISGDISNDFNSILHII 237
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
D+N + PG +ND DML+VG G M Y E HFS+W++ +PLLLG D+ L T
Sbjct: 238 DLNADLWKYCSPGRYNDMDMLQVGRG-MTYEEDKTHFSMWSMMHSPLLLGNDLTQLDEVT 296
Query: 265 MPIIGNEEVIAVNQA-----------INDQQIWAGPLSGN---RIVVLLESRKTFSSMMT 310
+ II NEE+IA+NQ+ D +IWA PL I V +R + +T
Sbjct: 297 LGIITNEEIIALNQSPFVYQARRMVDFGDTEIWAKPLVSTMSGEIAVAFLNRSASTQSIT 356
Query: 311 AHWDDIGIKSNNTLFEHRTLETK------FQGKLTAKVDGHSCKMYEL 352
+ D IGI + F R L TK LT +V H + +L
Sbjct: 357 FNLDSIGIDATEG-FTLRDLWTKEDFPKSTNTHLTREVPAHGIVVLQL 403
>gi|219842115|dbj|BAH10649.1| alfa-D-galactopyranosidase/beta-L- arabinopyranosidase [Fusarium
oxysporum]
Length = 549
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 172/390 (44%), Gaps = 68/390 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP-N 85
TP MGWNS+N C N K I + + S+ + + F A++ +
Sbjct: 25 TPVMGWNSYNQVSCSPNNKTIATAIE-----ALSSRGFVDAGYKFFQIDCGWASRDTQRD 79
Query: 86 QCQGNLVANKTTFPSGTKALAD-----------------------YVHILISNSLDNRIE 122
G L + FP G K L+D Y ++ + I+
Sbjct: 80 PTTGALKVDSNAFPDGLKPLSDLARSKGMRWTMYSDAGVRMCDPQYPSPVLGSLGHEAID 139
Query: 123 SNIAFEQGIDYLKYDNCFND--------DTRPTI--RYRAMSDALKKAGRPIFFSMCEWG 172
+ +YLKYDNC+ D D R R+ M L+K G P +C+WG
Sbjct: 140 AAFFKSLNTEYLKYDNCYADSASNNAPKDPRTDFLTRFGTMWSELQKVGIPGML-ICQWG 198
Query: 173 DMR--------PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH---AKPGGWND 221
PA W +I S+R +DDI + W+ + I++ ++ H + PG D
Sbjct: 199 VPYSSSSGLEGPAEWTQSISTSFRLSDDIGEGWSNVYRISN-QAIHISHRNLSGPGHIAD 257
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA-- 279
D+LEVGN GM ++E HF++WA+ K+ L++ D+ L+AET+ ++ N+++I++NQ
Sbjct: 258 ADLLEVGNSGMTFDEQATHFALWAMLKSALMISTDITALSAETVAVLQNKDLISINQDAA 317
Query: 280 ----------INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEH 327
D+ +WAGPL+ I VL + + ++ +GI+S + L+
Sbjct: 318 VKPISLVQRWTGDRDLWAGPLANGDIAVLHVDQSDKARTLSLQLSALGIQSADVKDLWTG 377
Query: 328 RTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
+T + + +V+GH L+ + +
Sbjct: 378 KT--STGASSFSKQVNGHGSVALRLSNIKR 405
>gi|212534926|ref|XP_002147619.1| alpha-galactosidase [Talaromyces marneffei ATCC 18224]
gi|210070018|gb|EEA24108.1| alpha-galactosidase [Talaromyces marneffei ATCC 18224]
Length = 439
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 157/397 (39%), Gaps = 123/397 (30%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSR 59
+L +G+G P +GWNSWN + C ++E I DDCW S
Sbjct: 21 ILPHGVGRLPALGWNSWNAYGCDVDEDKILAAATQLNITGLQALGYEYVNIDDCWSNKSG 80
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----IS 114
D +T+ L+ N TFPSG +A+ VH L I
Sbjct: 81 R------------DPVTK------------RLLPNPDTFPSGISGIAEKVHALGLKLGIY 116
Query: 115 NSLD-------------NRIESNIAFEQGIDYLKYDNC---------FN----DDTRP-- 146
+S I++ E G+DYLKYDNC +N D P
Sbjct: 117 SSAGLKTCAGYPASLGYEEIDAETFAEWGVDYLKYDNCNYPPEWKDQYNFCVPDSIFPFV 176
Query: 147 ------------------------TIRYRAMSDALKKAGRP--IFFSMCEWGDMRPALWG 180
T R+ M DAL I +S+CEWG+ WG
Sbjct: 177 NPNGTCPYLKNQAPEGYDWSTSNTTKRFNLMRDALIAVQNKSVILYSLCEWGNADVTSWG 236
Query: 181 SNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVH 240
+ +SWR + DI+ W ++ +IA++N G NDPDMLE GNG + E H
Sbjct: 237 NATGSSWRVSGDINAGWFKITSIANLNAHALSTVDFWGHNDPDMLENGNGFLTIEENRSH 296
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---------------------A 279
F +WAI K+PL++G DV L + I+ N+++IA NQ
Sbjct: 297 FGLWAIMKSPLIIGTDVSTLPRTHLSILKNQDLIAFNQDPIFGKPALPYKAGYSNGTYNP 356
Query: 280 INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
+ + W G S +VLL + + + TA W +I
Sbjct: 357 EHPPEYWFGATSYGWSLVLLFNSEHITVNRTARWSEI 393
>gi|346979867|gb|EGY23319.1| alpha-galactosidase [Verticillium dahliae VdLs.17]
Length = 404
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 141/307 (45%), Gaps = 56/307 (18%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
P MGWNSWN F IN+ I+ T ++ RD+ + M D + P
Sbjct: 42 PQMGWNSWNTFKLGINQTIVEGTAQALVDTGLRDAGYTY---LVMDDGWQNLTRGP---- 94
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGIDYLKYDNC--FNDDT 144
G AN T FPSG K LAD +H + ++ + ++ +YDNC F +T
Sbjct: 95 -DGRQQANATRFPSGLKVLADELH-------EKGLKFGLYSQKRQLIRQYDNCGGFYGNT 146
Query: 145 RPTI-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW------ 197
RP R++ MS ALK GR IF+++C+WG P W +S+R + DI +
Sbjct: 147 RPVQERFQIMSRALKNTGRDIFYAVCQWGHQWPWYWADQFTDSYRMSGDIHAKFRDDGNS 206
Query: 198 ---------------ARMLTIADMNEVYADHAKPGGWNDPDMLEVG-NGGMKYNEYVVHF 241
+ + I M E+ + K G W D DMLEVG N ++ H
Sbjct: 207 VCKTAYCLNTGYAGVSVLTMIRKMREI-SGFQKKGSWADMDMLEVGVNRNFTLHQDQTHL 265
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----AI---------NDQQIWA 287
S WA K+PL++G D+ + ++ ++ +E+I +NQ A+ ++ Q+WA
Sbjct: 266 SFWAALKSPLIIGADITTIRRSSLDVLLKKEIIGINQDDLGVAVSYVAELSKEDEIQVWA 325
Query: 288 GPLSGNR 294
GP+ R
Sbjct: 326 GPVKYKR 332
>gi|283483326|emb|CAX20733.1| putative alpha-N-acetylgalactosaminidase [Chorthippus parallelus]
Length = 443
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/325 (31%), Positives = 142/325 (43%), Gaps = 64/325 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL TPPMGW SW F C + K + D+C E+ + +L F
Sbjct: 20 LENGLARTPPMGWLSWERFRCNTDCK--NDPDNCISENLFRTMADLVVSEGYAALGYEFI 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNI----- 125
D + L G LV ++T FPSG L++Y+H S L I +
Sbjct: 78 NIDDCWLEKERSL----TGQLVPDRTRFPSGLHDLSEYIH---SKGLKFGIYEDYGNYTC 130
Query: 126 ---------------AFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
AF +DY+K D C++ Y L + GRP+ +S C
Sbjct: 131 AGYPGILGYLETDAQAFADWNVDYVKIDGCYSHPRDMDRGYPEFGYYLNRTGRPMIYS-C 189
Query: 170 EW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIADM----NEVYADHAKPG 217
W M P + S I N WR DDI DSWA + +I D ++ +A PG
Sbjct: 190 SWPVYQIYSGMSPN-FTSIIEHCNLWRNFDDIQDSWASVESIIDYYGNNQDIIVPNAGPG 248
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPDML +GN G+ Y + +IWAI APLL+ D+ + E I+ N+++IAV+
Sbjct: 249 HWNDPDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRTIRPEYKAILQNKKIIAVD 308
Query: 278 Q-----------AINDQQIWAGPLS 291
Q N +IWA P +
Sbjct: 309 QDPLGIQGRRIYKHNGIEIWARPTT 333
>gi|423224777|ref|ZP_17211245.1| hypothetical protein HMPREF1062_03431 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392634527|gb|EIY28446.1| hypothetical protein HMPREF1062_03431 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 664
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 146/322 (45%), Gaps = 61/322 (18%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G + + + + + LTPPMGWNSWN + ++++ +RE E
Sbjct: 254 FKVLLKAMNEKGVAEKEITLKIGDRIALTPPMGWNSWNCWGLSVDDEKVREAARMMNEKL 313
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ ++ D + A + QG L++N+ FP K LADY+H L
Sbjct: 314 H----AYGWTYVNIDDGWEAAER----TKQGELLSNEK-FPD-FKGLADYIHSLGLKFGI 363
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
+ + I++ G+DYLKYD+C D TIR Y M
Sbjct: 364 YSSPGPTTCGDYLGSYQHEEIDARTWGRWGVDYLKYDHCGYHAVQKDSEEKTIREPYIVM 423
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALW------GSNIRNSWRTTDDISDSWARMLTIADMN 207
DAL K R I + + G P +W G + WRTT DI+D W + I
Sbjct: 424 RDALDKVDRDIVYCV---GYGAPNVWNWAREAGGEL---WRTTRDITDEWNVVTAIGCFQ 477
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYV-----------VHFSIWAISKAPLLLGCD 256
+V A PG +NDPDML VG G + E V H S+W I APLL+GCD
Sbjct: 478 DVCAQATAPGNYNDPDMLVVGKLGKAWREKVHESALTPDEQYSHISLWCILSAPLLIGCD 537
Query: 257 VGNLTAETMPIIGNEEVIAVNQ 278
+ ++ T+ ++ N EVIAVNQ
Sbjct: 538 MSDIDDFTLSLLTNNEVIAVNQ 559
>gi|297710567|ref|XP_002831942.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase A [Pongo
abelii]
Length = 430
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 146/358 (40%), Gaps = 97/358 (27%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE---------------- 49
R L NGL TP MGW W F C I+EK+ E
Sbjct: 28 RARALDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYE 87
Query: 50 ---TDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALA 106
DDCW RDS+ G L A+ FP G + LA
Sbjct: 88 YLCIDDCWMAPQRDSE--------------------------GRLQADPQRFPHGIRQLA 121
Query: 107 DYVH-------ILISNSLDNR------------IESNIAFEQGIDYLKYDNCFNDDTRPT 147
+YVH I N I++ + G+D K+D C+ D
Sbjct: 122 NYVHNKGLKLGIYADVGKXNPAQASPGSFGYYDIDAQTFADWGVDLPKFDGCYCDSLENL 181
Query: 148 IR-YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN---IR---NSWRTTDDISDSWARM 200
Y+ MS AL + GR I +S CEW N IR N WR DI DSW +
Sbjct: 182 ADGYKHMSLALNRTGRSIVYS-CEWPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSI 240
Query: 201 LTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
+I D E D A PGGWNDPDML +GN G+ +N+ V ++WAI APL + D
Sbjct: 241 KSILDWTSFNQERIVDVAGPGGWNDPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSND 300
Query: 257 VGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ +++ + ++ +++VIA+NQ +Q +W PLS V + +R+
Sbjct: 301 LRHISPQAKALLQDKDVIAINQDPLGKQGYQLRQGDNFEVWQRPLSDLAWAVAMINRQ 358
>gi|47205155|emb|CAF95912.1| unnamed protein product [Tetraodon nigroviridis]
Length = 410
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 133/303 (43%), Gaps = 56/303 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C I DC + + IS + M D L+Q
Sbjct: 19 LDNGLMRTPPMGWLAWERFRCDI---------DC--DQDPKNCISENLFTDMADRLSQDG 67
Query: 80 AKPLP------NQC--------QGNLVANKTTFPSGTKALADYVHI-------------- 111
K L + C +G L + FP G LA Y+H
Sbjct: 68 WKDLGYVYVNIDDCWSSKERDKKGRLQPDPKRFPGGIPKLARYMHDRGLKLGIYGDMGTL 127
Query: 112 ----LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
LD I++ E +D LK+D C++++ Y MS AL GRPI +
Sbjct: 128 TCGGYPGTPLDKIDIDAKTFAEWEVDMLKFDGCYSNEVEQQQGYPLMSKALNATGRPIAY 187
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAK 215
S C W G + P + I N WR DDI DSW +L I D +V A A
Sbjct: 188 S-CSWPAYSGGLPPKVNYTQLGEICNLWRNYDDIQDSWDSVLNIIDWFFDNQDVIAPAAG 246
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML VG+ G+ + ++WAI APL + D+ ++AE I+ N+ I
Sbjct: 247 PGRWNDPDMLIVGDFGLSLEQSRTQMALWAIMAAPLFMSNDLRTISAEARSILQNKIAIG 306
Query: 276 VNQ 278
+NQ
Sbjct: 307 INQ 309
>gi|237829883|ref|XP_002364239.1| alpha-N-acetylgalactosaminidase, putative [Toxoplasma gondii ME49]
gi|211961903|gb|EEA97098.1| alpha-N-acetylgalactosaminidase, putative [Toxoplasma gondii ME49]
gi|221507106|gb|EEE32710.1| alpha-N-acetylgalactosaminidase, putative [Toxoplasma gondii VEG]
Length = 904
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 152/345 (44%), Gaps = 63/345 (18%)
Query: 28 PPMGWNSWNHFHC---QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
PPMGW+SWN F C ++NEK++ +T +S + + M D I K
Sbjct: 448 PPMGWSSWNRFECDLSELNEKLVMDTALALEQSGLHA---AGYEYVMLDDCWSIK-KDEE 503
Query: 85 NQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAF 127
+ +V + FP+G ALA ++H + S D+ + +AF
Sbjct: 504 RGAREPIVWDAKRFPNGMPALASFLHSRGFRFGLYTDSGSRTCMGYIGSADHEEQDAMAF 563
Query: 128 EQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAG------RPIFFSMCEWG-------D 173
+ G+D+LK D C+ + + + Y S A + A + C W D
Sbjct: 564 QSWGVDFLKIDGCYAEPSEMGMLYSRWSPAFEYAAAANPDHKQTVVLSCSWPAYVHNPTD 623
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDMLEVGN 229
L G + ++WR DDI +W + I D ++A+ PG +NDPDMLEVGN
Sbjct: 624 FDFKLIG-EMCDTWRIFDDIQPTWESLSAIMKFWGDNQGIFANVVAPGRFNDPDMLEVGN 682
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G E S+W+I APL+LG DV N+T +T+ I+ N EVIAVNQ
Sbjct: 683 GNFSTAEGRTQMSVWSIVAAPLILGNDVRNMTPQTLQILANPEVIAVNQDDLILEGLRVF 742
Query: 279 -AINDQQIWAGPLSGNRIVVL---LESRK-----TFSSMMTAHWD 314
+ N IW PL+ V L S+ S + A+W+
Sbjct: 743 ESPNTMSIWMRPLAAGSTAVAFVNLSSKPRNVTINLSELQKAYWE 787
>gi|358380539|gb|EHK18217.1| glycoside hydrolase family 27 protein [Trichoderma virens Gv29-8]
Length = 444
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 152/376 (40%), Gaps = 97/376 (25%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA---- 80
G P +GWNSWN +HC I+E S S L S + D+ Q
Sbjct: 36 GKVPSLGWNSWNAYHCDIDE------------SKFLSAAELIVSSGLLDAGYQYVNIDDC 83
Query: 81 -KPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNSLDNRIESNIAFEQ- 129
+ G++ N T FP G LA VH + + +++ +E
Sbjct: 84 WSLKDGRVDGHIAPNTTRFPDGIDGLAQKVHGMGLKLGIYSTAGTATCAGYPASLGYEDV 143
Query: 130 --------GIDYLKYDNC------------FNDD-------------TRPTI-------- 148
G+DYLKYDNC N D PT+
Sbjct: 144 DATDFANWGVDYLKYDNCNVPTDWQDQYLACNPDFVKTGANGTCSTALEPTLAPPGYDWS 203
Query: 149 ------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLT 202
R+ AM DAL K I SMC WG WG++ SWR ++DIS W+ +
Sbjct: 204 TSKSAERFGAMRDALAKQSHEIVLSMCIWGQADVFSWGNSTGISWRMSNDISPEWSSVAH 263
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
I ++N + G ND DMLEVGNG + E HF++WA K+PLL+G D+ L+
Sbjct: 264 IINLNSFKLNSVGFWGHNDADMLEVGNGNLTPAETRTHFALWAAMKSPLLIGTDLAQLSQ 323
Query: 263 ETMPIIGNEEVIAVNQ----------------------AINDQQIWAGPLSGNRIVVLLE 300
+ + ++ N+ ++A NQ N + WAGP + +V+++
Sbjct: 324 DNINLLKNKHLLAFNQDSVYGSPATPYKWGVNPDWTFNYTNPAEFWAGPSAKGHLVLMMN 383
Query: 301 SRKTFSSMMTAHWDDI 316
+ + A+W +I
Sbjct: 384 TLD-HTVTKEANWSEI 398
>gi|1580816|emb|CAA93244.1| alpha-galactosidase [Trichoderma reesei]
Length = 444
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 148/375 (39%), Gaps = 124/375 (33%)
Query: 25 GLTPPMGWNSWNHFHCQINE-------KIIRET------------DDCWGESSRDSQISL 65
G P +GWNSWN +HC I+E ++I + DDCW S +D ++
Sbjct: 36 GKVPSLGWNSWNAYHCDIDESKFLSAAEVIVSSGLLDAGYNYVNIDDCW--SMKDGRV-- 91
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISNS 116
G++ N T FP G LA VH L +
Sbjct: 92 ----------------------DGHIAVNTTRFPDGIDGLAKKVHDLGLKLGIYSTAGTA 129
Query: 117 LDNRIESNIAFEQ---------GIDYLKYDNC-----FNDD------------------- 143
+++ +E G+DYLKYDNC + D+
Sbjct: 130 TCAGYPASLGYEDVDAADFADWGVDYLKYDNCNVPSDWQDEYVACAPDAVQTGPNGTCST 189
Query: 144 ---------------TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
++ R+ AM +AL K R I S+C WG WG+ SWR
Sbjct: 190 ALEPNLAPPGYDWSTSKSAERFNAMRNALAKQSREIVLSLCIWGVADVFSWGNETGISWR 249
Query: 189 TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK 248
+ DIS W + I +MN + G ND D+LEVGNG + E HF++WA K
Sbjct: 250 MSGDISPEWGSVTHIINMNSFKMNSVGFWGHNDADILEVGNGNLTAAETRTHFALWAAMK 309
Query: 249 APLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------------------AINDQQIW 286
+PLL+G D+ L+ E + ++ N+ ++A NQ N + W
Sbjct: 310 SPLLIGTDLAQLSQENIELLKNKHLLAFNQDSVYGQPATPYKWGVNPDWTFNYTNPAEYW 369
Query: 287 AGPLSGNRIVVLLES 301
AGP S +V+++ +
Sbjct: 370 AGPSSKGHLVLMMNT 384
>gi|332025551|gb|EGI65714.1| Alpha-N-acetylgalactosaminidase [Acromyrmex echinatior]
Length = 934
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 138/343 (40%), Gaps = 100/343 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 519 LENGLARTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMTDIVVAEGYAAVGYEYINV 578
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV ++ FP G K+L+DYVH
Sbjct: 579 DDCWLEKERD--------------------------LNGQLVPDRERFPYGMKSLSDYVH 612
Query: 111 IL--------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
I LD I++ +DY+K D C++ Y
Sbjct: 613 SKGLKFGIYEDYGNYTCAGYPGILGYLD--IDAATFASWDVDYVKLDGCYSHPVDMDRGY 670
Query: 151 RAMSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTI 203
L + G+ + +S C W M+P + N WR DDI DSWA + +I
Sbjct: 671 PEFGYLLNQTGKHMIYS-CSWPVYQIYAGMKPNYTSIAENCNLWRNFDDIQDSWASVESI 729
Query: 204 ADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
D + +A PG WNDPDML +GN G+ Y + +IWAI APLL+ D+
Sbjct: 730 IDYYGNNQDAIVSNAGPGHWNDPDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRT 789
Query: 260 LTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS 291
+ E I+ N+++IAV+Q + Q IWA P++
Sbjct: 790 IRPEYKAILQNKKIIAVDQDLLGIQGRRIYKHKGIEIWARPIT 832
>gi|300120341|emb|CBK19895.2| unnamed protein product [Blastocystis hominis]
Length = 395
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 167/395 (42%), Gaps = 73/395 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEK---------------IIRETDDCWGESSRDSQIS 64
L NGL LTPPMGW W F C ++ K I R DD W E D I+
Sbjct: 16 LDNGLALTPPMGWMQWEQFRCNVDCKNYPKDCISEQLFVDMIDRLADDGWLEYGYD-MIN 74
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN 119
+ + D G L + FP G K LADY H + I N
Sbjct: 75 IDDCWMTHD-----------RDENGRLYPDPERFPHGIKWLADYAHSKGVKLGIYNDYGT 123
Query: 120 R-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+ +++ E +DYLK D C++ Y AM+ L + GR I +
Sbjct: 124 KTCGGYMGSEGYLMLDAKTFAEWEVDYLKMDGCYSQLIDQADAYPAMTRFLNETGRKIVY 183
Query: 167 SMCEWG----DMRPALWGSNIRNSWRTTDDISDSWARMLTIADM--NEV-YADHAKPGGW 219
S C W +M A N N WR DDI +W + +I D NE +A++A PG W
Sbjct: 184 S-CSWPAYDMEMDYAPLPPNC-NLWRNWDDIECNWRSIKSIMDKFGNETKWAEYAGPGHW 241
Query: 220 NDPDMLEVG--NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
ND DM+ G G + +E HF+IW+I +PL++ D+ L + I+ N+E+IAV+
Sbjct: 242 NDADMIVAGLKGGSLTEDESKSHFAIWSILASPLIMSNDLRELPDWAVKILRNKEIIAVD 301
Query: 278 QAINDQQ------------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT-- 323
Q +Q +WA PL+ + V L + + + +G +
Sbjct: 302 QDKLGKQGIRLTPFEENRSVWARPLANGDVAVALHNHGDKVVDIPLDFAILGETQKYSIR 361
Query: 324 -LFEHRTLETKFQGKLTAK-VDGHSCKMYELTPVT 356
L+ H L F+G AK V H +M + PV
Sbjct: 362 DLYAHEELGV-FEGSYVAKEVPVHGVQMLRMKPVA 395
>gi|429860478|gb|ELA35214.1| alpha-galactosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 413
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 159/350 (45%), Gaps = 74/350 (21%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
L + ++AA + G+ R G+ P MGWNSWN F IN+ II T ++ D
Sbjct: 5 LCFLVTAAAPTLGAPR------GMASRPQMGWNSWNTFKANINQSIIETT----AKALVD 54
Query: 61 SQISLS-CSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNS 116
+ ++ + + + D Q + G N T FPSG AL +++H + +
Sbjct: 55 TGLAAAGYKYLIMDEGWQADERA----TDGRQEFNSTRFPSGGSALVNHIHDMGLKVGIY 110
Query: 117 LDNRI---------------ESNIAFEQGIDYLKYDNC--FNDDTRPTI-RYRAMSDALK 158
D+ I ++ + GIDYLKYDNC F+ T R++ MS+AL+
Sbjct: 111 SDSGIFTCGFAPGSWGYEDLDAKTYADWGIDYLKYDNCGGFHAGTHTQQERFQTMSNALR 170
Query: 159 KAGRPIFFSMCEWGDMRPALWGSNI-RNSWRTTDDISDSWAR------------------ 199
GR IF+S+C+WG P W + S+R + DI S+A+
Sbjct: 171 NTGRDIFYSLCQWGHQFPWYWADQVGAGSYRMSGDIHASFAQDKAGVCPTAYCLNTGYAG 230
Query: 200 ---MLTIADMNEVYADHAKPG-GWNDPDMLEVGNGG-MKYNEYVVHFSIWAISKAPLLLG 254
+ I M E+ K W D DMLEVG G M E HFS WA K+PL++G
Sbjct: 231 VSVLTMIRKMREISPFQEKGRMSWADMDMLEVGVGNVMSEVEEQTHFSFWAGLKSPLIIG 290
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQAINDQ--------------QIWAGPL 290
DV + +++ ++ N ++IA++Q + Q+WAGP+
Sbjct: 291 ADVTKIREQSLKVLLNRDIIAISQDQRGEAVKYLPELSTEGKVQVWAGPV 340
>gi|195397618|ref|XP_002057425.1| GJ18121 [Drosophila virilis]
gi|194141079|gb|EDW57498.1| GJ18121 [Drosophila virilis]
Length = 409
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 104/346 (30%), Positives = 151/346 (43%), Gaps = 74/346 (21%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
+F++S+ A N L NGL TPPMGW SW F C N + + D+C E
Sbjct: 5 IFLISALTAACN------CLENGLARTPPMGWLSWERFRC--NTDCLNDPDNCISERLFQ 56
Query: 61 SQISLSCS----------FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
L S ++ D + P G L++++ FP+G KAL+DY+H
Sbjct: 57 KMADLLVSEGYASVGYEYVNIDDCWLERYRGP-----DGKLLSDQRRFPNGIKALSDYIH 111
Query: 111 ILISNSLDNRIESN------------IAFEQ---------GIDYLKYDNCFNDDTRPTIR 149
S L I + I FE+ +DY+K D C+
Sbjct: 112 ---SKGLKFGIYEDYGNYTCAGYPGIIGFEKIDAYQFAEWNVDYVKLDGCYALPYDMDRG 168
Query: 150 YRAMSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR---NSWRTTDDISDSWARM 200
Y L G+ + +S C W + P S I+ N WR DDI DSW +
Sbjct: 169 YTEFGKLLNSTGKHMVYS-CSWPVYQIYAGIPPNF--SAIQSHCNLWRNFDDIQDSWLSV 225
Query: 201 LTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
I D +V A +A PG WNDPDML +GN G+ Y + F++WAI APLL+ D
Sbjct: 226 ENIIDYYGNNQDVIAPYAGPGHWNDPDMLIIGNFGLSYEQAKTQFAVWAILAAPLLMSVD 285
Query: 257 VGNLTAETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
+ + E I+ N ++I+V+Q I + +IW+ P+S
Sbjct: 286 LRTIRPEFKEILLNRKIISVDQDPLGIQGRRIYKHKGIEIWSKPIS 331
>gi|391346667|ref|XP_003747591.1| PREDICTED: alpha-N-acetylgalactosaminidase-like isoform 2
[Metaseiulus occidentalis]
Length = 420
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/329 (29%), Positives = 143/329 (43%), Gaps = 64/329 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL TPPMGW SW + C + K +DC E D + +
Sbjct: 20 LENGLARTPPMGWLSWQRYRCITDCK--NRPNDCISEGLYKRMADRLIADGYRDVGYEYV 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + + + G LVA+ FP+G ALA+Y+H + I +
Sbjct: 78 NVDDCWSLLER---DNVTGRLVADPARFPNGIAALAEYMHDRGLKLGIYGDAGTKTCGGY 134
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+++ E G+D LK D C+ D Y M+ AL + GRPI +S C W
Sbjct: 135 PGSEGYFDVDAQTFAEWGVDMLKLDGCYLDTDAMADVYPKMTRALNETGRPILYS-CSW- 192
Query: 173 DMRPA-LWGSNIR-----------NSWRTTDDISDSWARMLTIADM----NEVYADHAKP 216
PA L G +R N WR +DI D+W + +I D + AKP
Sbjct: 193 ---PAYLIGKKVRANFRDSIQEHCNIWRNFNDIDDAWTSVKSIIDFYKTNQRTFVQVAKP 249
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G ++DPDML +GN G+ ++ V ++WAI +PLL+ D+ + I+ N VIAV
Sbjct: 250 GAFHDPDMLIIGNFGLSLDQARVQMALWAIFASPLLMSNDLEKIGRAEREILLNRHVIAV 309
Query: 277 NQ-----------AINDQQIWAGPLSGNR 294
NQ D ++++ P+ NR
Sbjct: 310 NQDSDGIMGSMVEKSGDFEVYSRPVRPNR 338
>gi|291190554|ref|NP_001167136.1| Alpha-galactosidase A precursor [Salmo salar]
gi|223648312|gb|ACN10914.1| Alpha-galactosidase A precursor [Salmo salar]
Length = 408
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 148/353 (41%), Gaps = 97/353 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC------------------QINEKIIRE-----------T 50
L NGL L P MGW W F C Q+ + ++ E
Sbjct: 21 LDNGLALKPTMGWLHWERFMCNVDCDTDPNNCISENLYMQMADVMVTEGWKEAGYEYVCI 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G L A+ FP G K LADYVH
Sbjct: 81 DDCWPSHQRDAK--------------------------GRLQADPKRFPRGIKKLADYVH 114
Query: 111 -----ILISNSLDN-------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ I L I++ + G+D LK+D C+ T Y
Sbjct: 115 SKGLKLGIYADLGTFTCGGFPGSLGYYDIDAQTFADWGVDLLKFDGCYMKWTLLGEGYTN 174
Query: 153 MSDALKKAGRPIFFS----MCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTI-- 203
MS AL + GR I +S + EW +P + IR N WR D+ DSW + TI
Sbjct: 175 MSIALNQTGRSILYSCEWPLYEWPYHQPDY--AAIRKACNHWRNFADVYDSWDSVKTILD 232
Query: 204 --ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
AD +V A PGGWNDPDML +GN G+ + + ++WAI APLL+ D+ ++
Sbjct: 233 WTADHQDVIVPAAGPGGWNDPDMLVIGNFGLSHTQQESQMALWAIMAAPLLMSNDLRDIC 292
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVVLLESRK 303
+ ++ N +IA++Q ++ ++W LSG R+ + + +++
Sbjct: 293 PRSKQLLQNTRIIAISQDPLGRQGYRTAKVDSFEVWERHLSGGRLALAVMNKQ 345
>gi|354474935|ref|XP_003499685.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase A-like
[Cricetulus griseus]
Length = 422
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 104/317 (32%), Positives = 150/317 (47%), Gaps = 48/317 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
L NGL TP MGW F C ++ + E D C E D +S ++ L
Sbjct: 34 LDNGLARTPTMGWLHXESFMCNLDCQ--EEPDACISEQLFMQMADLMVSEGWKDAGYEYL 91
Query: 76 TQIAAKPLPNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR--------- 120
P + +G L A+ F SG + LA+YVH + I + N+
Sbjct: 92 CIDDCWMAPERDSKGGLQADPQRFLSGIQHLANYVHSKGLKLGIYADVGNKTCAGFPGSF 151
Query: 121 ----IESNIAFEQGIDYLKYDNCFNDD-TRPTIRYRAMSDALKKAGRPIFFSMCEWG-DM 174
I++ + G+D LK+D C D T Y+ MS AL + GR I +S CEW M
Sbjct: 152 GYYDIDAQTFADWGVDLLKFDGCHCDSVTSLADGYKYMSLALNRTGRSIVYS-CEWPLYM 210
Query: 175 RPALWG--SNIR---NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDML 225
RP ++I+ N WR DDI DSW + I D + + A PGGWNDPDML
Sbjct: 211 RPFHKPNYTDIQYYCNYWRNYDDIYDSWESIKNILAWTTDNQKEIVEVAGPGGWNDPDML 270
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
+GN G+ +++ V ++WAI APLL+ D+ ++++ ++ NE+VIAVNQ
Sbjct: 271 VIGNFGLSWDQQVTQMALWAIMAAPLLMSNDLRQISSQAKALLQNEDVIAVNQDPLGKQG 330
Query: 279 ----AINDQQIWAGPLS 291
N ++W PLS
Sbjct: 331 YCFRKENKIEVWERPLS 347
>gi|189466324|ref|ZP_03015109.1| hypothetical protein BACINT_02698 [Bacteroides intestinalis DSM
17393]
gi|189434588|gb|EDV03573.1| alpha-galactosidase [Bacteroides intestinalis DSM 17393]
Length = 664
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 117/427 (27%), Positives = 181/427 (42%), Gaps = 90/427 (21%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G + + + + + LTPPMGWNSWN + ++++ +RE E
Sbjct: 254 FKVLLKAMNEKGVAEKEITLKIGDRIALTPPMGWNSWNCWGLSVDDEKVREAARMMNEKL 313
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ ++ D + + QG L++N+ FP K LADY+H L
Sbjct: 314 H----AYGWTYVNIDDGWEATER----TKQGELLSNEK-FPD-FKGLADYIHSLGLKFGI 363
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
+ + I++ G+DYLKYD+C D TIR Y M
Sbjct: 364 YSSPGPTTCGDYLGSYQHEEIDARTWGRWGVDYLKYDHCGYHAVQKDSEEKTIREPYIVM 423
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALW------GSNIRNSWRTTDDISDSWARMLTIADMN 207
DAL K R I + + G P +W G + WRTT DI+D W + I
Sbjct: 424 RDALDKVDRDIVYCV---GYGAPNVWNWAREAGGEL---WRTTRDITDEWNVVTAIGCFQ 477
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYV-----------VHFSIWAISKAPLLLGCD 256
+V A PG +NDPDML VG G + E V H S+W I APLL+GCD
Sbjct: 478 DVCAQATAPGNYNDPDMLVVGKLGKAWREKVHESALTPDEQYSHISLWCILSAPLLIGCD 537
Query: 257 VGNLTAETMPIIGNEEVIAVNQAI----------NDQQIW----------AGPLSGNRIV 296
+ ++ T+ ++ N EVIAVNQ + ++ QIW G +
Sbjct: 538 MSDIDDFTLSLLTNNEVIAVNQDLLATPATKLLTDNGQIWYKKLYDGSYAVGFFQIDPYF 597
Query: 297 VLLESRKTFSSMMTAHWDDIGIK--------SNNTLFEHRTLETKFQGKLTAKVDGHSCK 348
+L + + + M + ++ +K S L+ + L ++ G+ +V H
Sbjct: 598 ILWDQDEAEAIQMKTYDFELALKQLGITGKVSVRDLWRQKDL-GEYNGEFRTQVPYHGVS 656
Query: 349 MYELTPV 355
++TPV
Sbjct: 657 FVKITPV 663
>gi|340372683|ref|XP_003384873.1| PREDICTED: alpha-galactosidase-like [Amphimedon queenslandica]
Length = 397
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 159/382 (41%), Gaps = 52/382 (13%)
Query: 21 SNGLGLTPPMGWNSW-NHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+NG+GL PPMGW+ W + C+ + +E D D+ ++ ++ +
Sbjct: 20 NNGVGLKPPMGWDMWCTNGRCERDYCDAKEVMDV-----TDAMVTNGMKDAGYEYVIISD 74
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYV------------------------HILISN 115
GN+V +K FP G + ++ H + +
Sbjct: 75 CWADHRDENGNIVPDKDRFPDGLVPVIQHINSKGFKAGLYTDAGMYTCSPGGRNHTIPGS 134
Query: 116 SLDNRIESNIAFEQGIDYLKYDNCFND----DTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
++N G++Y+K D C P +Y+ MS+AL K G+PIF + CEW
Sbjct: 135 YGHYEQDANAYASWGVEYVKVDWCSTKVNGTQLDPHKQYQEMSEALNKTGKPIFLNSCEW 194
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPG-GWNDPDMLEVGNG 230
G +P W NSWRT D D W I ++N D+A G GWNDPD +
Sbjct: 195 GIDKPWEWMHQYANSWRTGPDHHDDWKTTSKIIELNADLGDYAGVGKGWNDPDFIMTHCP 254
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------ 278
GM EY F++W + APLL+ DV N+T+ ++ N+++I +NQ
Sbjct: 255 GMTDTEYRTEFTLWCLMSAPLLVVTDVRNMTSIMKEVLLNKDLIEINQDTTGPGGKRIGF 314
Query: 279 ----AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL-FEHRTLETK 333
N QIWA ++ + L + + S +T + +G K N T
Sbjct: 315 DKTCGENVCQIWAKNITNGEKAIALYNADSVSHNITLEFSLLGWKMVNIRNLWDGTKYYN 374
Query: 334 FQGKLTAKVDGHSCKMYELTPV 355
F +T V+ H + Y + V
Sbjct: 375 FTESVTVVVESHGVQAYRASAV 396
>gi|178056470|ref|NP_001116694.1| alpha-N-acetylgalactosaminidase precursor [Sus scrofa]
gi|169117928|gb|ACA43013.1| N-acetylgalactosaminidase alpha [Sus scrofa]
Length = 411
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 130/303 (42%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL PPMGW W F C I DC + + IS M D L Q
Sbjct: 17 MLENGLLRKPPMGWLGWERFRCNI---------DC--DEDPKNCISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L +GNL+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDAKGNLMPDPKRFPHGIAFLADYAHSLGLKLGIYDDLGSF 125
Query: 113 -----ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD + F E +D L D CF+ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVVHDAHTFAEWKVDMLNLDGCFSTHKERADGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAK 215
S C W G + P + S +I N WR DDI DSW +L+I D + D A
Sbjct: 186 S-CSWPAYEGGLPPRVNYSLLADICNLWRNYDDIQDSWWSVLSILDWFVAHQDILQPVAG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ + + ++W + APL + D+ ++A+ + I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFEQAQSQMALWTVLAAPLFMSTDLRTISAQNVDILQNTLMIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|221487311|gb|EEE25543.1| alpha-N-acetylgalactosaminidase, putative [Toxoplasma gondii GT1]
Length = 904
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 152/345 (44%), Gaps = 63/345 (18%)
Query: 28 PPMGWNSWNHFHC---QINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
PPMGW+SWN F C ++NEK++ +T +S + + M D I K
Sbjct: 448 PPMGWSSWNRFACDLSELNEKLVMDTALALEQSGLHA---AGYEYVMLDDCWSIK-KDEE 503
Query: 85 NQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNIAF 127
+ +V + FP+G ALA ++H + S D+ + +AF
Sbjct: 504 RGAREPIVWDAKRFPNGMPALASFLHSRGFRFGLYTDSGSRTCMGYIGSADHEEQDAMAF 563
Query: 128 EQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAG------RPIFFSMCEWG-------D 173
+ G+D+LK D C+ + + + Y S A + A + C W D
Sbjct: 564 QSWGVDFLKIDGCYAEPSEMGMLYSRWSPAFEYAAAANPDHKQTVVLSCSWPAYVHNPTD 623
Query: 174 MRPALWGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDMLEVGN 229
L G + ++WR DDI +W + I D ++A+ PG +NDPDMLEVGN
Sbjct: 624 FDFKLIG-EMCDTWRIFDDIQPTWESLSAIMKFWGDNQGIFANVVAPGRFNDPDMLEVGN 682
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------- 278
G E S+W+I APL+LG DV N+T +T+ I+ N EVIAVNQ
Sbjct: 683 GNFSTAEGRTQMSVWSIVAAPLILGNDVRNMTPQTLQILANPEVIAVNQDDLILEGLRVF 742
Query: 279 -AINDQQIWAGPLSGNRIVVL---LESRK-----TFSSMMTAHWD 314
+ N IW PL+ V L S+ S + A+W+
Sbjct: 743 ESPNTMSIWMRPLAAGSTAVAFVNLSSKPRNVTINLSELQKAYWE 787
>gi|2204224|emb|CAA74161.1| alpha-galactosidase [Hordeum vulgare subsp. vulgare]
Length = 159
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 92/145 (63%), Gaps = 17/145 (11%)
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
GNGGM +EY+VHFSIWAISKAPL++GCDV +++ +T I+ N+EVIAVNQ
Sbjct: 10 GNGGMTNDEYIVHFSIWAISKAPLIIGCDVRHMSQDTYDILANKEVIAVNQDPLGVQGKK 69
Query: 279 --AINDQQIWAGPLSGNRIVVLLESRKTF-SSMMTAHWDDIGIKSNNT-----LFEHRTL 330
+IWA PL+G R VLL +R + +TAHWDDIG+ + L+ H+TL
Sbjct: 70 VRMEGSSEIWAAPLTGYRTAVLLLNRHAKDEAQITAHWDDIGLPAGTAVEARDLWLHKTL 129
Query: 331 ETKFQGKLTAKVDGHSCKMYELTPV 355
E KF K++ V H+ +M+ LTP+
Sbjct: 130 EPKFTDKMSFNVTPHAARMFVLTPL 154
>gi|389608287|dbj|BAM17755.1| similar to CG5731 [Papilio xuthus]
Length = 439
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 134/318 (42%), Gaps = 87/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL LTPPMGW +W F C I++++ R
Sbjct: 23 LDNGLALTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMTDILVNEGYAAVGYEYINV 82
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD++ G LV ++ FP G K+L+DYVH
Sbjct: 83 DDCWPERERDAR--------------------------GRLVPDRERFPYGMKSLSDYVH 116
Query: 111 -----ILISNSLDN-------RIESNIAFEQG------IDYLKYDNCFNDDTRPTIRYRA 152
I N + ++A + +DY+K D C+ Y A
Sbjct: 117 SKGLKFGIYEDYGNFTCAGYPGVVGHLAGDAATFAAWDVDYVKLDGCYALPADMDHGYPA 176
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
L GR + +S C W ++P S I N WR DDI DSWA + +I
Sbjct: 177 FGRELNLTGRQMVYS-CSWPVYQIYAGIQPNF-TSIIEHCNLWRNFDDIQDSWASVESII 234
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V +A PG WNDPDML +GN G+ Y + F+IWAI APLL+ D+ +
Sbjct: 235 DYYGNHQDVIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQFAIWAILAAPLLMSVDLRTI 294
Query: 261 TAETMPIIGNEEVIAVNQ 278
E I+ N ++I V+Q
Sbjct: 295 RPEYKAILQNRKIIEVDQ 312
>gi|393218357|gb|EJD03845.1| glycoside hydrolase, partial [Fomitiporia mediterranea MF3/22]
Length = 400
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 130/319 (40%), Gaps = 98/319 (30%)
Query: 32 WNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFHMF 72
WNSWN + C INE+ I DDCW SRD
Sbjct: 1 WNSWNAYGCNINEQKILSAAQDIVSLGLKDAGYEYVNIDDCWATMSRDPS---------- 50
Query: 73 DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISN 115
G +V + T FPSG LA+ +H +
Sbjct: 51 ---------------TGRIVPDPTKFPSGIDGLAEQIHTMGLKMGIYSDAGTATCAGFPG 95
Query: 116 SLDNR-IESNIAFEQGIDYLKYDNC----------------------------------- 139
SL N I++ + GID + DNC
Sbjct: 96 SLGNETIDAQTFADWGIDCIT-DNCNVPANWTDSGVRAIHRCWKIMRVNLPDRRYHRTTI 154
Query: 140 FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWAR 199
++ + IRYR M+ AL RPI FS+CEWG WG + +SWR + D + +W+
Sbjct: 155 ISNYSNTGIRYRRMAGALASVSRPILFSLCEWGIDNVWDWGGRVGHSWRMSGDATPAWSY 214
Query: 200 MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
+ I +N + D + ND DM+EVGNG + E HF+ WA K+P+LLG D+
Sbjct: 215 ITEIIALNVQHLDSIQFFAHNDMDMMEVGNGNLTIEEQRTHFAAWAFLKSPILLGTDLSQ 274
Query: 260 LTAETMPIIGNEEVIAVNQ 278
L+A+ + II N+E++A +Q
Sbjct: 275 LSADQVAIISNKELLAFSQ 293
>gi|223648436|gb|ACN10976.1| Alpha-N-acetylgalactosaminidase [Salmo salar]
Length = 435
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 131/318 (41%), Gaps = 86/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C I+ E + R+
Sbjct: 19 LDNGLMRTPPMGWLAWERFRCDIDCQNDPKNCISEVLFRDMADRLAEDGWRELGYVYVNI 78
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS G L A+ FPSG LA Y+H
Sbjct: 79 DDCWASKDRDSN--------------------------GRLQADPKRFPSGIPNLASYIH 112
Query: 111 I------------------LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYR 151
LD I++ + +D LK+D C+++ T Y
Sbjct: 113 DRGLKLGIYGDMGTLTCGGYPGTPLDKITIDAQTFADWKVDMLKFDGCYSNATEQEQGYP 172
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIA 204
AMS AL GRPI +S C W G + P + I N WR DI DSW +L+IA
Sbjct: 173 AMSKALNATGRPIGYS-CSWPAYQGGLPPKVNYTQLGEICNLWRNYGDIEDSWNSVLSIA 231
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D +V A PG WNDPDML VG+ G+ ++ ++WAI APL + D+ +
Sbjct: 232 DWFFNNQDVLQPAAGPGRWNDPDMLVVGDFGLSMDQSRSQMALWAIMAAPLFMSNDLRTI 291
Query: 261 TAETMPIIGNEEVIAVNQ 278
++ I+ N+ I +NQ
Sbjct: 292 SSGARTILQNKVAIGINQ 309
>gi|399078812|ref|ZP_10752996.1| alpha-galactosidase [Caulobacter sp. AP07]
gi|398033101|gb|EJL26417.1| alpha-galactosidase [Caulobacter sp. AP07]
Length = 639
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 137/343 (39%), Gaps = 87/343 (25%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGE 56
R ++ G PPMGWNSWN FH + E + ++ DD W
Sbjct: 34 RWTAVATGQAAKPPMGWNSWNAFHTDVTEAKVLDSAKAIVDSGLAAKGYRYINIDDGWWL 93
Query: 57 SSRDSQISLSCSFHMFDSLT-----QIAAKPL------------------PNQCQGNLVA 93
R + + +F S Q + KP N C
Sbjct: 94 KRRTTDGRMVVRTSIFPSAAVGGAEQTSFKPFTDRIHAMGLKAGIYSDIGANNCSQAFAP 153
Query: 94 NKTTFPSGTKALADYVHILISNSLDNRIESNIAF---EQGIDYLKYDNC----FNDDTR- 145
N P GT A + L IE +I + DY+K D C F+ D
Sbjct: 154 NAPNLPEGTVAEREV-------GLYGHIEQDIELYFKDWAFDYIKVDACGIRAFDPDNAR 206
Query: 146 -----------------------PTIR--YRAMSDALKKAGRP--IFFSMCEWGDMRPAL 178
P +R Y+ ++DAL++A FS+C WG
Sbjct: 207 VKSGQYRALGPLIDIKEIRRTDIPAVRKLYQQVADALRRANPDGDYIFSICAWGSADVRA 266
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM---KYN 235
W ++ N RT+DDI+ SW RML D A +A PG WNDPDML +G+G
Sbjct: 267 WAKDVGNLSRTSDDITPSWTRMLANFDSAANRALYAHPGSWNDPDMLFIGHGDFDERHLT 326
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
E HF++W++ APLL+G D+ N M I+GN ++IAVNQ
Sbjct: 327 EAKSHFALWSMLSAPLLIGYDLRNAPQPLMDILGNSDLIAVNQ 369
>gi|300863115|ref|NP_001032465.2| alpha-N-acetylgalactosaminidase [Danio rerio]
Length = 437
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 117/419 (27%), Positives = 172/419 (41%), Gaps = 84/419 (20%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD 60
LF+ SSAA + L NGL TPPMGW +W F C I DC ++
Sbjct: 7 LFLAFSSAALA--------LDNGLMRTPPMGWLAWERFRCDI---------DC--QNDPY 47
Query: 61 SQISLSCSFHMFDSLTQIAAKPL--------------PNQCQGNLVANKTTFPSGTKALA 106
+ IS M D L++ K L QG L + FP G LA
Sbjct: 48 NCISEQLFMDMADRLSEDGWKELGYVYVNIDDCWSSKDRDAQGRLQPDPKRFPRGIPHLA 107
Query: 107 DYVH-----ILISNSLD------------NRIESNIA--FEQGIDYLKYDNCFNDDTRPT 147
YVH + I + ++IE++ + GID LK D C+++ +
Sbjct: 108 QYVHDRGLKLGIYGDMGTLTCGGYPGTTLDKIETDAQTFADWGIDMLKLDGCYSNSSYQE 167
Query: 148 IRYRAMSDALKKAGRPIFFSMCEW----GDMRP----ALWGSNIRNSWRTTDDISDSWAR 199
Y MS AL GRPI +S C W G + P L G I N WR DDI DSW
Sbjct: 168 QGYPMMSKALNATGRPIGYS-CSWPAYQGGLPPKVNYTLLG-QICNLWRNYDDIQDSWDS 225
Query: 200 MLTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I D + A PG WNDPDML +G+ + ++ ++W+I APL +
Sbjct: 226 VMGIVDWFFDNQDALQPAAAPGQWNDPDMLIIGDFSLSLDQSRAQMALWSIMAAPLFMSN 285
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---AINDQ---------QIWAGPLS-GNRIVVLLESR 302
D+ +++ ++ N+ VI++NQ I + Q++ PLS G +V R
Sbjct: 286 DLRTISSAARSVLQNKAVISINQDPLGIQGRRLLKEKSGIQVFQRPLSKGASALVFFSRR 345
Query: 303 KTFSSMMTAHWDDIGIKSN-----NTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
T +G + + E R E K + T ++ M+ + P
Sbjct: 346 SDMPYRYTTSLKTLGYQPGVFEVFDVFSEQRLPELKDSTQFTVSINPSGVVMWYIYPAA 404
>gi|194757301|ref|XP_001960903.1| GF11268 [Drosophila ananassae]
gi|190622201|gb|EDV37725.1| GF11268 [Drosophila ananassae]
Length = 417
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 134/319 (42%), Gaps = 88/319 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKI-----IRET------------------------ 50
L NGL TPPMGW SW F C + K+ I E
Sbjct: 24 LDNGLARTPPMGWMSWERFRCITDCKLYPNECISENLFSRHADLLVSEGYAAAGYEYVII 83
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RDS+ LVA+K FPSG ALAD++H
Sbjct: 84 DDCWLEKNRDSKTQ-------------------------KLVADKKRFPSGLNALADHIH 118
Query: 111 IL--------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
+ + N + ++++ E +DY+K D C+ + + + Y
Sbjct: 119 EIGLKFGLYQDYGTNTCAGYPGVINHM--KLDAQTFAEWDVDYVKLDGCYANISDMAVGY 176
Query: 151 RAMSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
L + GRP+ +S C W G+M N WR DDI DS ++ I
Sbjct: 177 PEFGRLLNETGRPMVYS-CSWPAYQEPAGEMPDYESLKKHCNLWRNWDDIDDSLESLMQI 235
Query: 204 ADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
D + A PG WNDPDML +GN G+ Y++ + +IWAI APL++ D+
Sbjct: 236 MDYFGKNQDRIQPQAGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWAIMAAPLIMSNDLEA 295
Query: 260 LTAETMPIIGNEEVIAVNQ 278
+ E I+ N VIAV+Q
Sbjct: 296 VRPEIKAILQNRAVIAVDQ 314
>gi|383857212|ref|XP_003704099.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Megachile
rotundata]
Length = 436
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 139/341 (40%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 18 LDNGLVKTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMADIIVAEGYAAVGYEYINV 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV ++ FP G K+LA+Y+H
Sbjct: 78 DDCWLEKDRD--------------------------IHGQLVPDRQRFPYGMKSLANYIH 111
Query: 111 -----ILISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPTIRYRA 152
I N + G +DY+K D C++ + Y
Sbjct: 112 SKGLKFGIYEDFGNYTCAGYPGILGYLETDALTFASWDVDYVKLDGCYSHPSEMDRGYPE 171
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD 205
L + GRP+ +S C W M+P + N WR DDI DSW + TI D
Sbjct: 172 FGFYLNQTGRPMVYS-CSWPVYQVYAGMQPNYTAITEHCNLWRNFDDIQDSWNSLETIID 230
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+V +A PG WNDPDML +GN G+ Y + ++WAI APLL+ D+ +
Sbjct: 231 YYGNNQDVIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAILAAPLLMSVDLRTIR 290
Query: 262 AETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
E I+ N+++IAV+Q I + +IWA P++
Sbjct: 291 PEYKAILQNKKIIAVDQDPLGIQGRRIYKHKGIEIWARPIT 331
>gi|189210181|ref|XP_001941422.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977515|gb|EDU44141.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 364
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 110/221 (49%), Gaps = 53/221 (23%)
Query: 134 LKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
LKYD+C++ P++ R++AMSDALK GR I ++C WG+ WG +I NSWR
Sbjct: 2 LKYDSCYHLGRVGTPSVSFNRFKAMSDALKATGRNILLNLCNWGEDLVHTWGMSISNSWR 61
Query: 189 TTDDISDSWARMLTIADMNEV-------------------------YADHAKPGGWNDPD 223
T DI DS+ R + N V +AD + PGGW+D D
Sbjct: 62 ITGDIYDSFTRPDDLCGCNSVSPGDVNCIAPGTHCSVLFILNKVAPFADRSIPGGWSDLD 121
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
MLEVG GGM EY HF++WA K+PL LG D+ + A + II N +IA++Q
Sbjct: 122 MLEVGQGGMTDEEYKAHFALWAALKSPLFLGNDLRAMPASALTIINNPAIIALSQDPHGR 181
Query: 279 -----------------AINDQQIWAGPL-SGNRIVVLLES 301
+ + +WAG L +G+ V+LL +
Sbjct: 182 SVTRVRRDTAGVAKDKWGVGETHVWAGHLHNGDEAVILLNA 222
>gi|456387271|gb|EMF52784.1| hydrolase [Streptomyces bottropensis ATCC 25435]
Length = 647
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 156/362 (43%), Gaps = 101/362 (27%)
Query: 29 PMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSF 69
PMGW SWN F +++ +I++ D+ W + +RDS
Sbjct: 49 PMGWASWNAFAAKVDYNVIKQQVDAFVAAGLPEAGYEYINIDEGWWQGTRDSA------- 101
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------- 110
GN+ ++ +P G KA+ADY+H
Sbjct: 102 -------------------GNITIDEAEWPGGMKAIADYIHSKGLKAGIYTDAGKDGCGY 142
Query: 111 --------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFND--DTRPTIRYRAMSDALKK 159
S S + ++ F + G D++K D C D P Y+A+SDA+
Sbjct: 143 YFPTGRPAAPGSGSEGHYVQDMTQFSKWGFDFVKVDWCGGDAEGLDPKTTYQAISDAVAT 202
Query: 160 A----GRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVY 210
A GRP+ S+C WG P W + WRT DI ++A MLT D N V+
Sbjct: 203 ATATTGRPLALSLCNWGYSNPWNWAPGMGPMWRTNTDIFYHGQGLTYAGMLTNYDRN-VH 261
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGN 270
G +NDPDML +G G + + H ++WA+S APLL G D+ +T +T I+ N
Sbjct: 262 PTAQHTGYYNDPDMLMIGMG-LSAAQDRTHMNLWAVSGAPLLAGNDLSTMTTQTANILKN 320
Query: 271 EEVIAVNQAINDQ------------QIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDI 316
EVIAV+Q Q Q++ L SGNR VVLL +R + + MT W D+
Sbjct: 321 PEVIAVDQDPRGQQGVKVAEDTTGLQVYGKVLSGSGNRAVVLL-NRTSTTQNMTVRWSDL 379
Query: 317 GI 318
G+
Sbjct: 380 GL 381
>gi|196002335|ref|XP_002111035.1| hypothetical protein TRIADDRAFT_23253 [Trichoplax adhaerens]
gi|190586986|gb|EDV27039.1| hypothetical protein TRIADDRAFT_23253 [Trichoplax adhaerens]
Length = 365
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/331 (30%), Positives = 144/331 (43%), Gaps = 55/331 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL LTPPMGW +W F C I+ K D+C + ++D + ++
Sbjct: 25 LDNGLALTPPMGWLAWERFRCNIDCK--NNPDNCISQKLFTDMADHLAKDGYRDVGYNYI 82
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------------L 112
D +A K N G L A+K FP G K LA+Y+H
Sbjct: 83 NVDD-CWMAKKRNKN---GRLYADKERFPKGIKYLAEYMHKRGLKLGIYGDFGTKTCGGY 138
Query: 113 ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW- 171
+ + ++ +D LK D C ++ + Y AM L K GRPI +S C W
Sbjct: 139 PGSEFHLQTDAETFANWKVDSLKLDGCNSNTSDYKKGYPAMGHYLNKTGRPILYS-CSWP 197
Query: 172 GDMRPALWGSNIR----NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPD 223
+ P+ I N WR DI DSW +L+I D N+V + PG WNDPD
Sbjct: 198 AYLNPSKVNYTIIAKACNLWRNYGDIQDSWDSILSIVDFYGNNNKVLSAAQGPGHWNDPD 257
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
L VG+ + Y + F++W+I APL + D+ + I+ N EVIAVNQ
Sbjct: 258 QLIVGDFSLSYEQSKSQFALWSIFGAPLYMSNDLRKIPTWAKDILQNREVIAVNQDKLGL 317
Query: 279 -------AINDQQIWAGPLSGNRIVVLLESR 302
N+ ++W LS V+L +R
Sbjct: 318 MGKRIVYKKNNYEVWMKNLSDVTTAVVLFNR 348
>gi|355785038|gb|EHH65889.1| hypothetical protein EGM_02750 [Macaca fascicularis]
Length = 420
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 137/309 (44%), Gaps = 58/309 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRP---------TIRYRAMSDALKKA 160
+LD ++ F E +D LK D CF+ ++ Y M+ AL
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTAEERAQGLKQIGLSLGYPKMAAALNAT 188
Query: 161 GRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEV 209
GRPI FS C W G + P + ++I N WR DI DSW +L+I + ++
Sbjct: 189 GRPIAFS-CSWPAYEGGLPPRVNYSLLADICNLWRNYADIQDSWQSVLSILNWFVKHQDI 247
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A PG WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+
Sbjct: 248 LQPVAGPGHWNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQ 307
Query: 270 NEEVIAVNQ 278
N +I +NQ
Sbjct: 308 NPLMIKINQ 316
>gi|195116661|ref|XP_002002870.1| GI17617 [Drosophila mojavensis]
gi|193913445|gb|EDW12312.1| GI17617 [Drosophila mojavensis]
Length = 413
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 141/344 (40%), Gaps = 102/344 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW SW F C I+E++ +
Sbjct: 18 LENGLARTPPMGWLSWERFRCNTDCANDPDNCISERLFQVMADLLVTEGYASVGYEYVNI 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G L+A+ FPSG KAL+DY+H
Sbjct: 78 DDCWLERRRD--------------------------IDGKLLADHKRFPSGIKALSDYIH 111
Query: 111 ILISNSLDNRIESN------------IAFEQ---------GIDYLKYDNCFNDDTRPTIR 149
S L I + I +E+ +DY+K D C+
Sbjct: 112 ---SRGLKFGIYEDYGNFTCAGYPGIIGYEETDALQFAEWDVDYVKLDGCYALPYDMDRG 168
Query: 150 YRAMSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLT 202
Y L + G+ + +S C W ++P N WR DDI DSWA +
Sbjct: 169 YTQFGRFLNRTGKAMVYS-CSWPVYQIYAGIQPNFSAIQTHCNLWRNFDDIQDSWASVEN 227
Query: 203 IADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D ++ A +A PG WNDPDML +GN G+ Y + F+IW+I APLL+ D+
Sbjct: 228 IIDYYGNNQDLIAPNAGPGHWNDPDMLIIGNFGLSYEQAKTQFAIWSILAAPLLMSVDLR 287
Query: 259 NLTAETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
+ E I+ N ++I+V+Q I + +IW+ P+S
Sbjct: 288 TIRPEFKEILLNRKIISVDQDPLGIQGRRIYKHKGIEIWSKPIS 331
>gi|364284978|gb|AEW47968.1| GHF27 protein [uncultured bacterium G2_7]
Length = 516
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/424 (26%), Positives = 182/424 (42%), Gaps = 95/424 (22%)
Query: 8 AADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------- 50
A ++ G ++R L + + L LTPPMGWN W + ++ +K+IRE
Sbjct: 108 AVNAKGKNKRKLKIVVGDKLSLTPPMGWNHWYAHYYRVTDKMIREAADVMISSGMADVGY 167
Query: 51 -----DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKAL 105
DDCW +S +Q ++ L+++ PL + QGN++ N+ FP K L
Sbjct: 168 QYVNIDDCWMNASPGAQ--------KYEDLSRVG--PLRDD-QGNILPNR-HFPD-MKEL 214
Query: 106 ADYVHI------------------LISNSLDNRIESNIAFEQGIDYLKYDNC-----FND 142
DY+H ++ +++ G D+LKYD C N
Sbjct: 215 TDYIHSKGLKAGIYTSPGELTCQGYAASYQHEELDTQQFANWGFDFLKYDWCSYNKIANG 274
Query: 143 DT------RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI-RNSWRTTDDISD 195
DT +P YR M L+ R I F++C++G WG+ + + WRT D+
Sbjct: 275 DTSLETYQKP---YRNMGMILQSLQRDIVFNLCQYGRGEVWKWGAEVGGHCWRTAGDLGF 331
Query: 196 SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG------------MKYNEYVVHFSI 243
+ +A N Y ++KPG WNDPD +++G G M N + S+
Sbjct: 332 RLNNIFDVAIRNSEYRSYSKPGEWNDPDYIQIGWIGDARGQGIPELTKMPANMQYAYMSL 391
Query: 244 WAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLE--- 300
W++ APL+ D+ L T+ ++ N EVI VNQ + W SG +++ E
Sbjct: 392 WSLMAAPLIYSGDMTKLDKFTLNVLCNPEVIEVNQDPLGKCGWIIHQSGEHFIMVKELHD 451
Query: 301 --------SRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTAKVDGHSCKM 349
+R + MT +WD + + T L+ + L F+GK A V M
Sbjct: 452 GSKAVGMFNRAENIAEMTINWDKLQLSERQTVRDLWRQKNLGV-FKGKFKALVPPQGVIM 510
Query: 350 YELT 353
+++
Sbjct: 511 VKIS 514
>gi|355563723|gb|EHH20285.1| hypothetical protein EGK_03105 [Macaca mulatta]
Length = 420
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 137/309 (44%), Gaps = 58/309 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C IN E++ E D ++D + +
Sbjct: 17 MLDNGLLQTPPMGWLAWERFRCNINCDEDPKNCISERLFMEMAD---RMAQDGWRDMGYT 73
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 74 YLNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGNFTCM 128
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRP---------TIRYRAMSDALKKA 160
+LD ++ F E +D LK D CF+ ++ Y M+ AL
Sbjct: 129 GYPGTTLDKVVQDAQTFAEWKVDMLKLDGCFSTAEERAQGLKQIGLSLGYPKMAAALNAT 188
Query: 161 GRPIFFSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEV 209
GRPI FS C W G + P + ++I N WR DI DSW +L+I + ++
Sbjct: 189 GRPIAFS-CSWPAYEGGLPPRVNYSLLADICNLWRNYADIQDSWQSVLSILNWFVKHQDI 247
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
A PG WNDPDML +GN G+ + ++W + APLL+ D+ ++A+ M I+
Sbjct: 248 LQPVAGPGHWNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLLMSTDLRTISAQNMDILQ 307
Query: 270 NEEVIAVNQ 278
N +I +NQ
Sbjct: 308 NPLMIKINQ 316
>gi|310796419|gb|EFQ31880.1| alpha-galactosidase [Glomerella graminicola M1.001]
Length = 550
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 171/391 (43%), Gaps = 76/391 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRE-----TDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
TP MGWNS+N+++C INE II+ D E+ + ++ C + T+ AA
Sbjct: 25 TPQMGWNSYNYYNCYINETIIKANAHALVDTGLAEAGY-TTVTTDCGWLS----TERAA- 78
Query: 82 PLPNQCQGNLVANKTTFPSGT-KALADYVHIL-------------------ISNSLDNRI 121
G L N FPSG K L DY+H L SL + +
Sbjct: 79 ------DGQLEWNPELFPSGGGKELGDYIHNLGLKFGVYSGGGYFQCGSTDQPASLHHEL 132
Query: 122 ESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYR-----------AMSDALKKAGRPIFFSMC 169
+F G D LKYDNC+ + + YR M++AL R I + +C
Sbjct: 133 TDAKSFAAWGADSLKYDNCYAVEPDVMVDYRHPEAISPDRFEIMAEALNTTDRDILYQVC 192
Query: 170 EWG-DMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
+WG +W I NS+R ++DI + W + IA+ Y H PG + D DML V
Sbjct: 193 QWGTGYDLGIWAPKIGGNSYRISNDIYNGWRSIWRIANQVVPYYKHTGPGAFPDMDMLLV 252
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCD--VGNLTAETMPIIGNEEVIAVNQAINDQQ- 284
G + E H +WAI+K+PL LG G + + I+ N+EVIA+NQ +Q
Sbjct: 253 GLNALSNEEERFHMGMWAINKSPLTLGSPAIAGLVPESSHEILINKEVIALNQDPLAKQA 312
Query: 285 ------------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI-GIKSNNTLFEHRTLE 331
+WAG LSG+ +VV L + K S ++ + GI S E R +
Sbjct: 313 QLVRRYTEEEWDVWAGELSGSHVVVGLANWKNDSQPVSVDLAAVLGINSA----EARDVW 368
Query: 332 T-----KFQGKLTAKVDGHSCKMYELTPVTK 357
G ++GH K+ L+ ++K
Sbjct: 369 AASDIGSVSGTYETTLNGHELKLLVLSDISK 399
>gi|58865810|ref|NP_001012120.1| alpha-N-acetylgalactosaminidase precursor [Rattus norvegicus]
gi|67460657|sp|Q66H12.1|NAGAB_RAT RecName: Full=Alpha-N-acetylgalactosaminidase; AltName:
Full=Alpha-galactosidase B; Flags: Precursor
gi|51859221|gb|AAH82084.1| N-acetyl galactosaminidase, alpha [Rattus norvegicus]
gi|149065788|gb|EDM15661.1| rCG59517, isoform CRA_b [Rattus norvegicus]
Length = 415
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 130/303 (42%), Gaps = 55/303 (18%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQI 78
+L NGL TPPMGW +W F C IN C E + IS M D L Q
Sbjct: 17 MLENGLLRTPPMGWLAWERFRCNIN---------C--EEDPKNCISERLFMEMADRLAQD 65
Query: 79 AAKPLP-------------NQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
+ L G L+ + FP G LADY H L
Sbjct: 66 GWRDLGYVYLNIDDCWIGGRDATGRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKM 125
Query: 113 -----ISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD +++ E +D LK D C++ Y M+ AL GRPI F
Sbjct: 126 TCMGYPGTTLDKVELDAATFAEWKVDMLKLDGCYSTPKERAEGYPKMAAALNATGRPIAF 185
Query: 167 SMCEW----GDMRPALWGSNIR---NSWRTTDDISDSWARMLTIAD----MNEVYADHAK 215
S C W G + P + + + N WR DI DSW +L+I D ++ +
Sbjct: 186 S-CSWPAYEGGLPPKVNYTEVAGTCNLWRNYKDIQDSWKSVLSILDWFVKHQDILQPVSG 244
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +GN G+ ++E ++W + APL + D+ ++ + + I+ N +I
Sbjct: 245 PGHWNDPDMLLIGNFGLSFDESRAQMALWTVLAAPLFMSTDLRTISPQNIDILQNPLLIK 304
Query: 276 VNQ 278
+NQ
Sbjct: 305 INQ 307
>gi|194882531|ref|XP_001975364.1| GG22272 [Drosophila erecta]
gi|190658551|gb|EDV55764.1| GG22272 [Drosophila erecta]
Length = 417
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 140/319 (43%), Gaps = 51/319 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL L PPMGW SW F C + K+ D+C E L S D+ +
Sbjct: 24 LDNGLALKPPMGWMSWERFRCITDCKLY--PDECISEQLFQRHADLLVSEGYADAGYEYV 81
Query: 80 -------AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDN------- 119
K N Q LVA++ FP+G AL+D++H L + N
Sbjct: 82 IIDDCWLEKNRDNDTQ-KLVADRKRFPNGLNALSDHIHKQGLKFGLYQDYGTNTCAGYPG 140
Query: 120 -----RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW--- 171
++++ E +DY+K D C+ + + Y L + GRP+ +S C W
Sbjct: 141 VIKHMKLDAQTFAEWDVDYVKLDGCYANISDMATGYPEFGRLLNETGRPMVYS-CSWPAY 199
Query: 172 ----GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWNDPD 223
G+M N WR DDI DS ++ I D D H PG WNDPD
Sbjct: 200 QEDAGEMPDYESLKQHCNLWRNWDDIDDSLESLMQIMDYFAKNQDRIQPHGGPGHWNDPD 259
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
ML +GN G+ Y++ + +IW+I APL++ D+ + E I+ N VIAV+Q
Sbjct: 260 MLLLGNYGLSYDQSKLQMAIWSIMAAPLIMSNDLAAVRPEIKDILQNRAVIAVDQDELGI 319
Query: 279 ------AINDQQIWAGPLS 291
+ N ++W P++
Sbjct: 320 QGRRVLSRNQIEVWKRPIT 338
>gi|195429918|ref|XP_002063004.1| GK21691 [Drosophila willistoni]
gi|194159089|gb|EDW73990.1| GK21691 [Drosophila willistoni]
Length = 417
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 139/347 (40%), Gaps = 103/347 (29%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE------------------ 49
N L NGL PPM W SW F C I+EK+
Sbjct: 21 NALDNGLARKPPMAWMSWQRFRCITDCETYPNECISEKLFSRHADLLVSEGYADAGYEYI 80
Query: 50 -TDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADY 108
DDCW E RD++ LVANK FPSG ALAD+
Sbjct: 81 IIDDCWLEKERDNKTQ-------------------------KLVANKERFPSGLNALADH 115
Query: 109 VH----------------------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRP 146
+H ++ +LD + + + +DY+K D C+ + T
Sbjct: 116 IHDKGLKFGLYQDFGTNTCAGYPGVINHMALDAQTFA----DWDVDYVKLDGCYANLTDM 171
Query: 147 TIRYRAMSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWAR 199
Y L GRP+ +S C W G M N WR +DI DS
Sbjct: 172 VDGYPEFGRLLNATGRPMVYS-CSWPAYQEDEGQMPDYESLKKHCNLWRNWNDIDDSLES 230
Query: 200 MLTIADMNEVYAD----HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I D D HA PG WNDPDML +GN G+ Y++ + +IW+I APL++
Sbjct: 231 VMQIMDYFSKNQDRIQPHAGPGHWNDPDMLILGNYGLSYDQSKLQMAIWSIMAAPLIMSN 290
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
D+ ++ E I+ N EVIAV+Q ND ++W P++
Sbjct: 291 DLAHVRPEIKEILQNREVIAVDQDELGIQGRRIILRNDIEVWRRPIT 337
>gi|226293587|gb|EEH49007.1| tRNA wybutosine-synthesizing protein [Paracoccidioides brasiliensis
Pb18]
Length = 1384
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 160/369 (43%), Gaps = 76/369 (20%)
Query: 16 RRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSL 75
R +NG + +N+WN F+C +NE+++ + D + + + L + + D
Sbjct: 914 RARYHTNGRTSIELVTYNTWNSFYCDLNEEVVLDAAD---KIVQLGFMDLGYEYIVLDDC 970
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNRI----ESNIA 126
A + + Q NL FPSG LA +H + I +S + E ++
Sbjct: 971 WS-AGRNSSDYLQPNL----EKFPSGIDGLAAKIHAMGLKIGIYSSAGTKTCAHYEGSLG 1025
Query: 127 FEQ---------GIDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWG 172
+E+ GIDYLKYDNC+N+ T RY M AL GRPI +S+C WG
Sbjct: 1026 YEEKDAELWASWGIDYLKYDNCYNEGQEGTPLLSFNRYNVMGKALNATGRPILYSLCNWG 1085
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWAR-----------------------MLTIADMNEV 209
P + I NSWR T D+ ++ R ++ + +
Sbjct: 1086 IDGPWNFAPTIANSWRITGDLFLNYNRETPECPCAELGGLDCKLPGFRCSVMNVLNKAAY 1145
Query: 210 YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIG 269
G WND DML VGNGG+ + V HFS+WA K+PLL+ + + A+++ I+
Sbjct: 1146 LPSKGFSGAWNDLDMLVVGNGGLTDDAMVAHFSLWAALKSPLLMTPVLSKVDAKSLSILQ 1205
Query: 270 NEEVIAVN-------------QAINDQ---------QIWAGPLSGNRIVVLLESRKTFSS 307
N V+A++ Q + D Q+++G L+G VVL +
Sbjct: 1206 NIAVLAISQDSAGLSATRKWRQYVGDADEFGREGEIQMFSGRLAGGDEVVLFLNAAATDR 1265
Query: 308 MMTAHWDDI 316
M + +DI
Sbjct: 1266 QMESSLEDI 1274
>gi|300777556|ref|ZP_07087414.1| alpha-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300503066|gb|EFK34206.1| alpha-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 674
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 167/360 (46%), Gaps = 71/360 (19%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V+ +A ++ GS + L + + LTP MGWNSWN F +++ ++ D +S
Sbjct: 246 YEVTLNAKNAKGSASKKLKIECGDRIALTPTMGWNSWNCFGHEVSADKVKRAADALIKSG 305
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNL------VANKTTFPSGTKALADYVHIL 112
I+ ++ D Q QG + + + FP K LADY+H
Sbjct: 306 L---INHGWNYINIDDSWQYNRDGKDPSFQGKMRDENGYILTNSKFPD-MKGLADYMH-- 359
Query: 113 ISNSLDNRIESN------------IAFEQ---------GIDYLKYDNCF----------N 141
+N L I S+ +E+ G+DYLKYD C N
Sbjct: 360 -NNGLKMGIYSSPGPWTCGGCAGSYGYEKQDAESYTKWGVDYLKYDWCSYGGVIDGLPDN 418
Query: 142 DDTR-PTIRYRA-------------MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NS 186
D + P++ ++ M LK+ R I +++C++G WG + S
Sbjct: 419 DPNKVPSLAFQGGGDPDKGVKPFKLMGSLLKQQPRDIVYNLCQYGMGDVWKWGDEVNAQS 478
Query: 187 WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG--------MKYNEYV 238
WRTT+DI+D+W+ + IA + A AKPG NDPDML VG G +K +E
Sbjct: 479 WRTTNDITDTWSSVKNIALAQDRAAPFAKPGNRNDPDMLVVGVVGWGNPHQSRLKPDEQY 538
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-AINDQQIWAGPLSGNRIVV 297
+H S+W+I APLL+GCD+ L T+ ++ N+EVIAVNQ A+ Q + + RI V
Sbjct: 539 LHMSLWSIFSAPLLIGCDLEKLDDFTLNLLTNDEVIAVNQDALGRQGVCLQTIGELRIYV 598
>gi|195029159|ref|XP_001987442.1| GH21921 [Drosophila grimshawi]
gi|193903442|gb|EDW02309.1| GH21921 [Drosophila grimshawi]
Length = 411
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 173/399 (43%), Gaps = 72/399 (18%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES--SRDSQISLSCSF----HMFDSL 75
NGL LTPPMGW SW F C + D+C E SR + + +S + + + +
Sbjct: 20 NGLALTPPMGWMSWQRFRCITDCDTY--PDECISEQLFSRHADLLVSEGYAAAGYEYVII 77
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----------------------ILI 113
+ +Q LV ++ FP G LAD++H ++
Sbjct: 78 DDCWLEKERDQKTNKLVPDRNRFPRGLNVLADHIHNKGLKFGLYQDYGTNTCAGYPGVIN 137
Query: 114 SNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD 173
LD + +N +DY+K D C+ + + + Y L GRP+ +S C W
Sbjct: 138 HMELDAQTFAN----WDVDYVKLDGCYANVSDMALGYPEFGRLLNSTGRPMVYS-CSWPA 192
Query: 174 MRPAL-----WGSNIR--NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDP 222
+ L + S + N WR DDI DS ++ I D E H PG WNDP
Sbjct: 193 YQSELNELPDFESLKKHCNLWRNWDDIDDSLESVMQIMDYFGKNQESMQQHGGPGHWNDP 252
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
DML +GN G+ Y++ + +IWA+ APL++ D+ + E I+ N +VIAVNQ
Sbjct: 253 DMLILGNYGLSYDQSKLQMAIWAVLAAPLIMSNDLAYVRPEIKEILQNRDVIAVNQDPLG 312
Query: 279 -------AINDQQIWAGPLS----GNR--IVVLLESRKTFSSMMTAHW--DDIGIKSNN- 322
ND ++W P++ GN V SR+T + + + ++ +K+ N
Sbjct: 313 IQGRRLLLKNDIEVWRRPITPQTDGNEYSYAVAFVSRRTDGAPYSIKFTLKELSLKNKNG 372
Query: 323 ----TLFEHRTLETKFQGK--LTAKVDGHSCKMYELTPV 355
LF+ T FQ + + +V + Y+ T +
Sbjct: 373 YAVKDLFDSNTNLGVFQSESYFSTRVIPNGVNFYKFTAL 411
>gi|427782429|gb|JAA56666.1| Putative alpha-d-galactosidase melibiase [Rhipicephalus pulchellus]
Length = 519
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 133/320 (41%), Gaps = 92/320 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C I+ E++ +E
Sbjct: 39 LENGLARTPPMGWMAWERFLCNIDCADDPENCVSERLFKEMGDAVVRQGYRKVGYQYVNI 98
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW + RD+ G L AN+T FP+ K LAD++H
Sbjct: 99 DDCWMANQRDAS--------------------------GRLQANRTRFPNDIKHLADFMH 132
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ I + + +++ E +D +K D CF + T Y
Sbjct: 133 ARGLKLGIYGDVGTKTCEKYPGSKNHLLLDAQTFAEWDVDMVKMDGCFANMTEFKTLYPD 192
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR----------NSWRTTDDISDSWARMLT 202
+A+ K GR + +S C W PA SN+ N WR DI+DSW + +
Sbjct: 193 FGNAINKTGRSMVYS-CSW----PAYLPSNVTPNYSLIGHHCNMWRNYVDIADSWRSVES 247
Query: 203 IADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D + A PG WNDPDML +GN G+ Y++ ++WAI APL + D+
Sbjct: 248 IIDYYAKNQDALVAAASPGRWNDPDMLVIGNFGLSYDQSRAQMALWAIMAAPLFMSHDLR 307
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
+ E I+ N +IAVNQ
Sbjct: 308 HTRREFRKILQNRAIIAVNQ 327
>gi|440467773|gb|ELQ36972.1| alpha-galactosidase 1 [Magnaporthe oryzae Y34]
gi|440490103|gb|ELQ69694.1| alpha-galactosidase 1 [Magnaporthe oryzae P131]
Length = 522
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 169/371 (45%), Gaps = 58/371 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAA-KPLPN 85
TPPMGWNS+NH++C E I++ S + L + ++ +T
Sbjct: 18 TPPMGWNSYNHYNCLPTEDDIKQ--------SAQGIVDLGFADLGYNIVTTDCGWNGRDR 69
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL-------------------ISNSLD-NRIESNI 125
QG N T FPSG KAL D++H L + SL I+S
Sbjct: 70 DEQGRQQWNATLFPSGGKALGDFIHSLGLKFGLYSGAGYFQCGSTDLPASLGFEEIDSKS 129
Query: 126 AFEQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI 183
E G D LK +D + + R+ M LK++GR I + +C+WG +
Sbjct: 130 FAEWGGDTLKTVMVDSDSAEAKSPERFLKMGRLLKESGRQIDYFLCQWG---------IV 180
Query: 184 RNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSI 243
SWR ++DI + W + I + +A H + G + D DML VG G M Y+E HF +
Sbjct: 181 GKSWRMSNDIYNGWRSIWRITNQVIPHAKHTREGSYADMDMLTVGLGAMSYDEERFHFGM 240
Query: 244 WAISKAPLLLGC--DVGNLTAETMPIIGNEEVIAVNQ---------AINDQQ----IWAG 288
W++ K+PL +G D + E++ I+ N+EVIA+NQ A+ + + +W G
Sbjct: 241 WSMMKSPLHIGAPIDKSITSQESLDIMANKEVIAINQDPLSEAAQLALRNTEGEWDVWIG 300
Query: 289 PLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--LFEHRTLETKFQGKLTAKVDGHS 346
LS +R +V + + + S + +GI S + ++ L + G ++ GH
Sbjct: 301 NLSEDRKIVGVANWRNESQSVALDLSFLGIGSADARDVWAASDLGS-ISGTRQVELKGHE 359
Query: 347 CKMYELTPVTK 357
K+ L+ V K
Sbjct: 360 LKLLILSNVQK 370
>gi|432101977|gb|ELK29798.1| Alpha-galactosidase A [Myotis davidii]
Length = 425
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 144/321 (44%), Gaps = 56/321 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL +TP MGW W F C N E D C E+ L S D ++
Sbjct: 33 LDNGLAMTPTMGWLHWERFMC--NTDCTEEPDSCISENLFMQMADLMDS----DGWKEVG 86
Query: 80 AKPL--------PNQ-CQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
+ L P + +G L A+ FPSG + L++YVH + I + +
Sbjct: 87 YEYLCIDDCWMAPQRDSKGRLQADPKRFPSGIRHLSNYVHSKGLKLGIYADVGKKTCAGY 146
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCEW 171
I++ + G+D LK+D C D Y+ MS +L K GR I +S CEW
Sbjct: 147 PGSLGHYDIDAETFADWGVDLLKFDGCHCDTLEQLADGYKHMSLSLNKTGRSIVYS-CEW 205
Query: 172 GDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
N IR N WR DI DSW + +I D A PGGWND
Sbjct: 206 PLYMWPFKKPNYTEIRQYCNHWRNFGDIYDSWQSVKSILDWTSSNQRTVVSAAGPGGWND 265
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAIN 281
PDML +GN G+ +++ + ++WAI APL + D+ +++ + ++ N++VIA+NQ
Sbjct: 266 PDMLVIGNFGLSWDQQITQMALWAIMAAPLFMSNDLRHISLQAKTLLQNKDVIAINQDPL 325
Query: 282 DQQ-----------IWAGPLS 291
+Q +W PLS
Sbjct: 326 GKQGYLLRKEDNIEVWERPLS 346
>gi|402910811|ref|XP_003918043.1| PREDICTED: LOW QUALITY PROTEIN: alpha-galactosidase A [Papio
anubis]
Length = 429
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 100/334 (29%), Positives = 147/334 (44%), Gaps = 56/334 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS---------F 69
+L GL MGW W F C ++ + E D C E L S +
Sbjct: 31 VLDXGLARMLTMGWLHWERFMCNLDCQ--EEPDSCISEKLFMEMAELMVSDGWKDAGYEY 88
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR---- 120
D + L +G L A+ FP G + LA+YVH + I + N+
Sbjct: 89 LCIDDCWMAPQRDL----EGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAG 144
Query: 121 ---------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFFSMCE 170
I++ + G+D LK+D C+ D Y+ MS AL + GR I +S CE
Sbjct: 145 FPGSFGYYDIDAQTFADWGVDLLKFDGCYCDSLEKLADGYKHMSLALNRTGRSIVYS-CE 203
Query: 171 WGDMRPALWGSN---IR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWN 220
W N IR N WR DI DSW + +I D E D A PGGWN
Sbjct: 204 WPLYMWPFQKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWN 263
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
DPDML +GN G+ +N+ V ++WAI APL + D+ +++ + ++ +++VIA+NQ
Sbjct: 264 DPDMLVIGNFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDP 323
Query: 281 NDQQ-----------IWAGPLSGNRIVVLLESRK 303
+Q +W PLS V + +R+
Sbjct: 324 LGKQGYQLRQGDNFEVWERPLSDLAWAVAMINRQ 357
>gi|167522441|ref|XP_001745558.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775907|gb|EDQ89529.1| predicted protein [Monosiga brevicollis MX1]
Length = 970
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 135/273 (49%), Gaps = 30/273 (10%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
PPMG+N+WN + C ++E I+R + R Q + ++ D Q+A Q
Sbjct: 605 PPMGFNTWNAYRCNLDENIVRSMAQAMQQ--RGLQAA-GYTYVNIDDCWQVA-----RQA 656
Query: 88 QGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDN-RIESNIAFEQ 129
G LV + FPSG ALA+ +H SL++ RI++ +
Sbjct: 657 NGTLVPDPVRFPSGMTALAEDLHAAGFGFGVYTARGTGTCQGRPGSLEHERIDAATFCDW 716
Query: 130 GIDYLKYDNC---FNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRP-ALWGSNIRN 185
+DY+K D C + + T + +D + I S+ GD W + + N
Sbjct: 717 DVDYVKIDVCEGAQDANASWTQFHTGFADCYAQKNHSIVMSVETCGDPNGCGTWVAGLAN 776
Query: 186 SWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWA 245
WRT+ D+ ++A +L A N+ A AKPG +NDPDMLEVGN G+ +E + HF++W
Sbjct: 777 LWRTSPDLEATFASVLANAYANDAMAAVAKPGHYNDPDMLEVGNPGLDADEALSHFALWC 836
Query: 246 ISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+ APLL+G D+ + T+ I+ E+IA++Q
Sbjct: 837 VMSAPLLIGTDLIRASNNTLAILSAPELIAIDQ 869
>gi|242005240|ref|XP_002423479.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
gi|212506567|gb|EEB10741.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
Length = 472
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/343 (28%), Positives = 144/343 (41%), Gaps = 100/343 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TPPMGW SW F C I++++ R+
Sbjct: 18 LENGLARTPPMGWLSWERFRCNTDCKNDPDNCISDRLFRKMADLVVSEGYAAAGYKYINI 77
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E R+ G LVA++ FP G K L++Y+H
Sbjct: 78 DDCWLEKERN--------------------------FAGQLVADRERFPYGIKDLSNYIH 111
Query: 111 IL--------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
I L+ ++ +++ +DY+K D C++ + Y
Sbjct: 112 SKGLKFGIYEDYGNYTCAGYPGILGHLEKDAQTFASWD--VDYVKLDGCYSHPAEMDLGY 169
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMR-PALWGSNIR------NSWRTTDDISDSWARMLTI 203
L + GRP+ +S C W + A N + N WR DDI DSWA + +I
Sbjct: 170 PEFGYYLNRTGRPMVYS-CSWPVYQIYAGIVPNFKSIIDSCNLWRNFDDIQDSWASLESI 228
Query: 204 ADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
D + +A PG +NDPDML VGN G+ Y + F++WAI APLL+ D+
Sbjct: 229 IDYYGNNQDAIVPNAGPGHFNDPDMLIVGNFGLSYEQSKTQFALWAILAAPLLMSVDLRT 288
Query: 260 LTAETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
+ E I+ N ++IAV+Q I + +IWA P++
Sbjct: 289 IRPEYKAILQNRKIIAVDQDPLGIQGRRIYKHRGIEIWARPIT 331
>gi|391346665|ref|XP_003747590.1| PREDICTED: alpha-N-acetylgalactosaminidase-like isoform 1
[Metaseiulus occidentalis]
Length = 417
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/326 (29%), Positives = 142/326 (43%), Gaps = 61/326 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES---------SRDSQISLSCSFH 70
L NGL TPPMGW SW + C + K +DC E D + +
Sbjct: 20 LENGLARTPPMGWLSWQRYRCITDCK--NRPNDCISEGLYKRMADRLIADGYRDVGYEYV 77
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR----- 120
D + + + G LVA+ FP+G ALA+Y+H + I +
Sbjct: 78 NVDDCWSLLER---DNVTGRLVADPARFPNGIAALAEYMHDRGLKLGIYGDAGTKTCGGY 134
Query: 121 --------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
+++ E G+D LK D C+ D Y M+ AL + GRPI +S C W
Sbjct: 135 PGSEGYFDVDAQTFAEWGVDMLKLDGCYLDTDAMADVYPKMTRALNETGRPILYS-CSW- 192
Query: 173 DMRPA-LWGSNIR--------NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGW 219
PA L G + N WR +DI D+W + +I D + AKPG +
Sbjct: 193 ---PAYLIGKKLDYESIQEHCNIWRNFNDIDDAWTSVKSIIDFYKTNQRTFVQVAKPGAF 249
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
+DPDML +GN G+ ++ V ++WAI +PLL+ D+ + I+ N VIAVNQ
Sbjct: 250 HDPDMLIIGNFGLSLDQARVQMALWAIFASPLLMSNDLEKIGRAEREILLNRHVIAVNQD 309
Query: 279 ----------AINDQQIWAGPLSGNR 294
D ++++ P+ NR
Sbjct: 310 SDGIMGSMVEKSGDFEVYSRPVRPNR 335
>gi|391338932|ref|XP_003743807.1| PREDICTED: alpha-galactosidase A-like isoform 1 [Metaseiulus
occidentalis]
Length = 415
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 138/318 (43%), Gaps = 88/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN--------------EKIIRE---------------T 50
L NGL L PPMGW SW F C+I+ ++I E
Sbjct: 20 LDNGLALKPPMGWISWERFRCEIDCDQYPNDCISENLYKRIADELVDGGYRDVGYVHVNI 79
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS G +VA+ FPSG K LA Y+H
Sbjct: 80 DDCWALKERDSN--------------------------GRMVADPKRFPSGMKGLAKYMH 113
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCF--NDDTRPTIRY 150
+ I + ++ I++ + ID LKYD CF N+ P + Y
Sbjct: 114 DRGLKLGIYSDAGSKTCAGYPGSRDYEDIDAQTYADWDIDMLKYDGCFIANEADIPNL-Y 172
Query: 151 RAMSDALKKAGRPIFFSMCEW----GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
M+ AL K G+ I +S CEW + P WG N WR DDI+D++ +
Sbjct: 173 MKMTQALNKTGKQIVYS-CEWPLYQKNTEPD-WGKIAASCNLWRNYDDIADTFESVKRTI 230
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D+ ++Y H KPG + DPDML +G+ G+ +E V +IWAI APL + D+ +
Sbjct: 231 DVFVKNQDLYVKHQKPGAFFDPDMLILGDYGLSKDEARVQMAIWAIWGAPLFMSNDLAKI 290
Query: 261 TAETMPIIGNEEVIAVNQ 278
++ ++ N VI +NQ
Sbjct: 291 DEDSKKLLLNRGVIGINQ 308
>gi|242019900|ref|XP_002430396.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
gi|212515526|gb|EEB17658.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
Length = 407
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 140/321 (43%), Gaps = 61/321 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCWGESSRDSQISLSCSF 69
L NGL TPPMGW SW F C N EK++RE D S D ++ +
Sbjct: 20 LDNGLAKTPPMGWMSWERFRCLTNCTLFPDDCLSEKLLREMAD---RMSSDGYLNAGYEY 76
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ 129
+ D + G L ++ FPSG K L+DY+H S L I + +
Sbjct: 77 LIIDDCWLERER----GSDGKLKEDRQRFPSGMKNLSDYIH---SKGLKFGIYEDYGTKT 129
Query: 130 ---------------------GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
+DY+K D C+++ Y +L RP+ +S
Sbjct: 130 CGGYPGIIGHLEKDAELFKSWDVDYVKLDGCYSEPFDMDEGYIEFGKSLLNTRRPMVYS- 188
Query: 169 CEWG------DMRPAL-WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPG 217
C W M P S N WR DDI DSW +++I D E+ ++ PG
Sbjct: 189 CSWPFYQELVGMAPNFTLISKHCNLWRNYDDIEDSWNSVVSIINYFGDRQELIGKYSGPG 248
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPDML +GN G+ Y++ +IW+I APL++ D+ N+ E I+ N++ I +N
Sbjct: 249 HWNDPDMLIIGNYGLSYSQAKAQMAIWSILSAPLIMSNDLRNIRPEFKEILLNKDAIEIN 308
Query: 278 Q---AINDQ-----QIWAGPL 290
Q I+ + ++W PL
Sbjct: 309 QHELGISGKRKYRAKVWTKPL 329
>gi|198456834|ref|XP_001360458.2| GA20753 [Drosophila pseudoobscura pseudoobscura]
gi|198135769|gb|EAL25033.2| GA20753 [Drosophila pseudoobscura pseudoobscura]
Length = 417
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/339 (27%), Positives = 142/339 (41%), Gaps = 95/339 (28%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------TDD 52
NGL LTPPMGW SW F C I+E++ R DD
Sbjct: 26 NGLALTPPMGWMSWERFRCLTDCKLYPNECISEELFRRHADLLVSEGYADAGYEYIIIDD 85
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-I 111
CW E +RD++ LVA+K FP+G LAD++H I
Sbjct: 86 CWLEKNRDNKTK-------------------------KLVADKKRFPNGLNPLADHIHNI 120
Query: 112 LISNSLDN-----------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMS 154
+ L ++++ + +DY+K D C+ + + + Y
Sbjct: 121 GLKFGLYQDYGTNTCAGFPGVIKHMQLDAQTFADWDVDYVKLDGCYANISDMAVGYPEFG 180
Query: 155 DALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM- 206
L GRP+ +S C W G+M N WR DDI DS + I D
Sbjct: 181 RLLNATGRPMVYS-CSWPAYQEDAGEMPDYTSLKQHCNLWRNWDDIDDSLESVAQIMDYF 239
Query: 207 ---NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
+ HA PG WNDPDML +GN G+ Y++ + +IW++ APL++ D+ + E
Sbjct: 240 GKNQDRIQPHAGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWSVMAAPLIMSNDLAVVRPE 299
Query: 264 TMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
I+ N +VIAV+Q + N ++W+ P++
Sbjct: 300 IKDILQNRKVIAVDQDELGIQGRRILSKNQIEVWSRPIT 338
>gi|195334911|ref|XP_002034120.1| GM20062 [Drosophila sechellia]
gi|194126090|gb|EDW48133.1| GM20062 [Drosophila sechellia]
Length = 417
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 139/341 (40%), Gaps = 95/341 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL L PPMGW SW F C I+EK+ R
Sbjct: 24 LDNGLALKPPMGWMSWERFRCITDCKLYPDECISEKLFRRHADLLVSEGYADAGYEYVII 83
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD+ LV ++ FP+G AL+D++H
Sbjct: 84 DDCWLEKNRDNDTQ-------------------------KLVPDRKRFPNGLNALSDHIH 118
Query: 111 I------LISNSLDN------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
L + N ++++ + +DY+K D C+ + + Y
Sbjct: 119 YQGLKFGLYQDYGTNTCAGYPGVIKHMKLDAQTFADWDVDYVKLDGCYANISDMATGYPE 178
Query: 153 MSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
L + GRP+ +S C W G+M N WR DDI DS ++ I D
Sbjct: 179 FGRLLNETGRPMVYS-CSWPAYQEDAGEMPDYESLKQHCNMWRNWDDIEDSLESLMQIMD 237
Query: 206 MNEVYAD----HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
D H PG WNDPDML +GN G+ Y++ + +IWAI APL++ D+ +
Sbjct: 238 YFAKNQDRIQPHGGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWAIMAAPLIMSNDLAAVR 297
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
E I+ N VIAV+Q + N ++W P++
Sbjct: 298 PEIKAILQNRAVIAVDQDELGIQGRRVLSRNQIEVWKRPIT 338
>gi|307183812|gb|EFN70460.1| Alpha-N-acetylgalactosaminidase [Camponotus floridanus]
Length = 435
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 140/343 (40%), Gaps = 100/343 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ +
Sbjct: 20 LENGLARTPPMGWLAWERFRCNTDCKNDPDNCISDRLFKTMTDIVVAEGYAAVGYEYINV 79
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV ++ FP G K+LADYVH
Sbjct: 80 DDCWLEKERD--------------------------VNGQLVPDRERFPYGIKSLADYVH 113
Query: 111 IL--------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
I LD + A++ +DY+K D C+ + Y
Sbjct: 114 SKGLKFGIYEDYGNYTCAGYPGILGYLDIDAATFAAWD--VDYVKLDGCYAHPSEMDRGY 171
Query: 151 RAMSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTI 203
L + G+ + +S C W M+P S N WR DDI DSWA + +I
Sbjct: 172 PEFGFHLNQTGKQMIYS-CSWPVYQIYAGMQPNYTAISENCNLWRNFDDIQDSWASVESI 230
Query: 204 ADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
D +V A PG WNDPDML +GN G+ Y + +IWAI APLL+ D+
Sbjct: 231 IDYYGNNQDVIVPIAGPGHWNDPDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRT 290
Query: 260 LTAETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
+ E I+ N+++IAV+Q I + +IWA P++
Sbjct: 291 IRPEYKAILQNKKIIAVDQDPMGIQGRRIYKHKGIEIWARPIT 333
>gi|149755215|ref|XP_001492699.1| PREDICTED: alpha-galactosidase A [Equus caballus]
Length = 421
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 140/341 (41%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL +TP MGW W F C ++EK+ +
Sbjct: 32 LDNGLAMTPTMGWLHWERFMCNTDCEEDPDSCVSEKLFMQMADLMDSEGWRDVGYEYLCI 91
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FP G + LADYVH
Sbjct: 92 DDCWMAPQRDSK--------------------------GRLQADPKRFPGGIRRLADYVH 125
Query: 111 ---ILISNSLD---------------NRIESNIAFEQGIDYLKYDNC-FNDDTRPTIRYR 151
+ + D I++ + G+D LK+D C F+ Y+
Sbjct: 126 GKGLKLGIYADVGKLTCAGFPGSYGYYDIDAKTFADWGVDLLKFDGCHFDTLVDLADGYK 185
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR------NSWRTTDDISDSWARMLTIAD 205
MS AL + GR I +S CEW N N WR DI D+W + ++ D
Sbjct: 186 YMSLALNRTGRSIVYS-CEWPLYMMPFRKPNYTEVREYCNHWRNFADIYDAWQSVKSVLD 244
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
E D A PGGWNDPDML +GN G+ +++ + ++WAI APL + D+ ++
Sbjct: 245 WTSSNQEKIVDAAGPGGWNDPDMLVIGNFGLSWDQQITQMALWAIMAAPLFMSNDLRQIS 304
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS 291
+ ++ +++VIA+NQ +Q +W PLS
Sbjct: 305 PQAKALLQDKDVIAINQDPLGKQGYRLRKEDNIEVWERPLS 345
>gi|315500301|ref|YP_004089104.1| alpha-galactosidase [Asticcacaulis excentricus CB 48]
gi|315418313|gb|ADU14953.1| Alpha-galactosidase [Asticcacaulis excentricus CB 48]
Length = 652
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 142/338 (42%), Gaps = 74/338 (21%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINE-------KIIRET------------DDCWGESSRDS 61
S G TPPMGW+SWN F + E +II ++ DD W R S
Sbjct: 51 SEGRAATPPMGWSSWNAFGTDLTEARVLDSARIIVDSGLAAKGYRYINIDDGWWLKRRQS 110
Query: 62 QISLSCSFHMFDSLT-----QIAAKPLPNQCQ------------GNLVANKTTFPSGTKA 104
+ +F S + + +P ++ G ++ T
Sbjct: 111 DGRMQVRIQLFPSAAVGGGEETSLRPFTDRLHKMGLKAGIYSDLGRNACSQAYGGPNTPN 170
Query: 105 LADYVHILISNSLDNRIESNIAF---EQGIDYLKYDNC----FNDDTR-----------P 146
L + + L IE +I+ + G D++K D C F D+ P
Sbjct: 171 LPEGTVLEREVGLYGHIEQDISLYFKDWGFDFIKVDGCGVRAFGADSERVKAGTYRELPP 230
Query: 147 TIRYRAMSDALKKAGRPIF-----------------FSMCEWGDMRPALWGSNIRNSWRT 189
I ++++S A R ++ FS+C WG WG ++ N RT
Sbjct: 231 LIDFQSISRTNIPAVRGLYTQIATSLKASNPDGDYVFSLCAWGSADVRAWGKDVGNLSRT 290
Query: 190 TDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV---VHFSIWAI 246
+DD++ SW+R+LT D A +A PG WNDPDML +G+G V HF++WA+
Sbjct: 291 SDDLTPSWSRLLTNFDSAATRALYAHPGSWNDPDMLFIGHGEFDAKHLVEARSHFALWAM 350
Query: 247 SKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ 284
APLL+G D+ + E M I GNE +IA+NQ Q
Sbjct: 351 INAPLLIGYDLSQASPELMQIFGNEAIIALNQDAGGHQ 388
>gi|348506638|ref|XP_003440865.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Oreochromis
niloticus]
Length = 435
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 148/341 (43%), Gaps = 68/341 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W + C I DC E+ + IS + M D L +
Sbjct: 19 LDNGLMRTPPMGWLAWERYRCDI---------DC--ENDPKNCISENLFMDMADRLAEDG 67
Query: 80 AKPLP------NQC--------QGNLVANKTTFPSGTKALADYVHI-------------- 111
K L + C QG L + FP G + LA Y+H
Sbjct: 68 WKELGYVYVNIDDCWSSKQRDDQGRLQPDPKRFPGGIQKLARYLHDRGLKLGIYGDMGKL 127
Query: 112 ----LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
LD +++ + +D KYD C+++ T Y MS AL GRPI +
Sbjct: 128 TCGGYPGTPLDKIELDAQTFADWEVDMFKYDGCYSNATEQEQGYPLMSKALNATGRPIGY 187
Query: 167 SMCEW----GDMRPALWGSN---IRNSWRTTDDISDSWARMLTIADM----NEVYADHAK 215
S C W G + P + + I N WR DI DSW +L I D +V A
Sbjct: 188 S-CSWPAYQGGLPPKVNYTQLGEICNLWRNYGDIQDSWDSVLNIVDWFFENQDVLTPAAG 246
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +G+ G+ + ++WAI APL + D+ +++ I+ N+ I+
Sbjct: 247 PGRWNDPDMLIIGDFGLSMEQSRSQMALWAIMAAPLFMSNDLRTISSGAQSILRNKMAIS 306
Query: 276 VNQ-AINDQ-----------QIWAGPLSGNRIVVLLESRKT 304
+NQ A+ Q +++ PLS N +L SR+T
Sbjct: 307 INQDALGIQGRRIVKEKSGIEVFWRPLSKNASALLFFSRRT 347
>gi|188995895|ref|YP_001930147.1| alpha-galactosidase [Porphyromonas gingivalis ATCC 33277]
gi|188595575|dbj|BAG34550.1| putative alpha-galactosidase [Porphyromonas gingivalis ATCC 33277]
Length = 481
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 152/318 (47%), Gaps = 51/318 (16%)
Query: 1 LFIVSSSAADSNGSDR---RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
+F SAA+ +G+D + +S GL P MGW++ +INE+ I + D
Sbjct: 93 VFEFEISAANESGADSCIWQVEVSRYNGLAPVMGWSTRWLKEKEINEQTILDVADA---M 149
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
++ + + +S Q + + G + A+ FP+G K LAD +H
Sbjct: 150 QSKGLVAAGYNHIIIESHWQTSVR----DSDGRIKADPLRFPNGIKHLADELHRRGMRLG 205
Query: 113 ------------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI------RYRAM 153
+S S + I++ E G+D++KYD C + P I RY AM
Sbjct: 206 LSSNASPLNVHGLSGSFEKEAIDAATFSEWGVDFIKYDYCCS----PPISIVAQQRYGAM 261
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWAR---------MLTI 203
AL GRP FFS+CEWG++ P WG +R SWRT+ DI D W +
Sbjct: 262 GRALAATGRPPFFSVCEWGELMPWSWGVTVRAGSWRTSFDIQDKWKAGRYSKRDNGIWDA 321
Query: 204 ADMNEVYADHAKPG-GWNDPDMLEVG--NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
A +N YA HA G +DPD+L +G + G EY +WA+ +APL+ D ++
Sbjct: 322 AMINLRYAQHANRGRAISDPDLLLIGSPSNGCTAEEYHAQMQMWAMMRAPLVFAADPRSV 381
Query: 261 TAETMPIIGNEEVIAVNQ 278
+AE + ++ + ++IAV+Q
Sbjct: 382 SAEAVALLTHPDLIAVDQ 399
>gi|387165376|gb|AFJ59927.1| Aga2 [Myceliophthora thermophila]
Length = 416
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 135/314 (42%), Gaps = 81/314 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ---------INEKIIR-----------ETDDCWGESSR 59
L NG+GLTP MGW+SWN C EK I DDCW SR
Sbjct: 21 LDNGVGLTPHMGWSSWNVAQCDAASAKYALDTAEKFISLGLKDLGYEYINIDDCWSLKSR 80
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
D G LV + +P G K +AD +H L
Sbjct: 81 DEN--------------------------GKLVPDPGKWPDGIKPVADRIHDLGLKFGLY 114
Query: 113 -----------ISNSLDNRIESNIA--FEQGIDYLKYDNCFN---DDTRPTI-------- 148
+ D S+++ E G+D+ KYDNC+ D+ P
Sbjct: 115 GCAGQKTCAGYPGSDGDKYAASDVSQLVEWGVDFWKYDNCYTPCLDNPPPQTCQRPAGNS 174
Query: 149 --RYRAMSDAL--KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
Y M DA+ + R I F++C WG WG + +SWR + D W + I
Sbjct: 175 QEWYAPMRDAILGVQETRKIHFNLCNWGRDDVWEWGDDYGHSWRMSVDNWGDWESVERIG 234
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
A+++ PGG+ND DML +G+ + NE +HF +WAI+K+PL+LG D+ ++ T
Sbjct: 235 SAAADIAEYSGPGGFNDLDMLYLGSPKLNANEERLHFGLWAIAKSPLVLGLDLATISNAT 294
Query: 265 MPIIGNEEVIAVNQ 278
+ II N+ +I +NQ
Sbjct: 295 LDIIRNKGIIDINQ 308
>gi|195583918|ref|XP_002081763.1| GD25544 [Drosophila simulans]
gi|194193772|gb|EDX07348.1| GD25544 [Drosophila simulans]
Length = 417
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 139/341 (40%), Gaps = 95/341 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL L PPMGW SW F C I+EK+ R
Sbjct: 24 LDNGLALKPPMGWMSWERFRCITDCKLYPDECISEKLFRRHADLLVSEGYADAGYEYVII 83
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD+ LV ++ FP+G AL+D++H
Sbjct: 84 DDCWLEKNRDNDTQ-------------------------KLVPDRKRFPNGLNALSDHIH 118
Query: 111 I------LISNSLDN------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
L + N ++++ + +DY+K D C+ + + Y
Sbjct: 119 YQGLKFGLYQDYGTNTCAGYPGVIKHMKLDAQTFADWDVDYVKLDGCYANISDMATGYPE 178
Query: 153 MSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLTIAD 205
L + GRP+ +S C W G+M N WR DDI DS ++ I D
Sbjct: 179 FGRLLNETGRPMVYS-CSWPAYQEDAGEMPDYESLKQHCNLWRNWDDIEDSLESLMQIID 237
Query: 206 MNEVYAD----HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
D H PG WNDPDML +GN G+ Y++ + +IWAI APL++ D+ +
Sbjct: 238 YFAKNQDRIQPHGGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWAIMAAPLIMSNDLAAVR 297
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
E I+ N VIAV+Q + N ++W P++
Sbjct: 298 PEIKAILQNRAVIAVDQDELGIQGRRVLSRNQIEVWKRPIT 338
>gi|377563635|ref|ZP_09792974.1| hypothetical protein GOSPT_034_00060 [Gordonia sputi NBRC 100414]
gi|377529082|dbj|GAB38139.1| hypothetical protein GOSPT_034_00060 [Gordonia sputi NBRC 100414]
Length = 406
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 100/405 (24%), Positives = 166/405 (40%), Gaps = 103/405 (25%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQI 63
GL PPMGWNSWN F C I E+I+R DDCW RD+
Sbjct: 29 GLAAVPPMGWNSWNTFGCGITEQIVRAQADALVSSGLRDAGYKYVIVDDCWAAPQRDAS- 87
Query: 64 SLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSL 117
G L A+ FPSG AL Y+H + S +
Sbjct: 88 -------------------------GRLQADPVRFPSGMAALGAYLHERGLLFGIYSGAR 122
Query: 118 DNR-------------------IESNIAFEQGIDYLKYDNCFNDDTR--PTIRYRAMSDA 156
D I++ G+DYLKYD C ++ + + AM +A
Sbjct: 123 DKTCTQYQGTYPGATGSGGHEVIDAQTFAGWGVDYLKYDWCSSNTSHDDQVSSFTAMRNA 182
Query: 157 LKKAGRPIFFSM---CEWGDMRPAL---WGSNIRNSWRTTDDISDSWAR---------ML 201
L+ GR I F++ + P WG + R T+D++ +W+ ++
Sbjct: 183 LRDTGRRIVFAINPNSGYAGSVPGAQFDWGGTATTT-RVTNDVTPAWSTANGPSGYQGIV 241
Query: 202 TIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
I D+ KPG + DPDML VG+ G+ E S+W++ APL+ G D+ N++
Sbjct: 242 NIVDVAGPLTARVKPGSFIDPDMLVVGSPGLTAAEQRTQMSMWSMMAAPLIAGNDLTNMS 301
Query: 262 AETMPIIGNEEVIAVNQ---------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
+T+ + N +IA++Q +D ++W+ + +V+ L +R + ++
Sbjct: 302 QQTLDALRNAAIIAIDQDSRVVAGAMVDDDPEVWSRAIGDKGLVISLTNRSDHARTLSVS 361
Query: 313 WDDIGIKSNNTL-----FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+G+ + ++ + R+ T G+L+ V H + E+
Sbjct: 362 LGSVGLVGDASVTGVDAWTGRSF-TAQHGELSVPVGVHDTAVLEI 405
>gi|156386780|ref|XP_001634089.1| predicted protein [Nematostella vectensis]
gi|156221168|gb|EDO42026.1| predicted protein [Nematostella vectensis]
Length = 386
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 162/396 (40%), Gaps = 74/396 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSF 69
+ +GL LTPPMGW +W F C I+E++ D E D + +
Sbjct: 1 MDDGLALTPPMGWMAWERFRCNVDCDNDPDECIHERLFMAMAD---EMMCDGYLEAGYKY 57
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI-----------------L 112
D + PN G L A+ FPSG K LA YVH
Sbjct: 58 VCIDDCWTSKDRG-PN---GELQADPNRFPSGIKELAKYVHSKGLKLGIYADYGTMTCER 113
Query: 113 ISNSLDNRIESNI-AFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
S+D R+E ++ F +DYLK D CF D ++ Y S L GRPI FS C
Sbjct: 114 YPGSID-RLEKDMRTFASWDVDYLKMDGCFADPSKMDTGYPLASKYLNATGRPIMFS-CS 171
Query: 171 W------GDMRPALWGSNIRNS---WRTTDDISDSWARMLTIADM----NEVYADHAKPG 217
W +P NI WR DI DSW + I D EV A PG
Sbjct: 172 WPFYQVLKGQKPDY--KNIAKHCHLWRNYIDIEDSWEWVTKIVDHYAENQEVLIPVAGPG 229
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPD L +GN G+ + ++WAI APL + D+ + ++ N EVIAVN
Sbjct: 230 HWNDPDQLIIGNFGLSPEQSKSQMALWAILAAPLFMSNDLRKIPQFAKDVLLNTEVIAVN 289
Query: 278 QAINDQQ----------------IWAGPLSGNRIVVLLESRKTFSSMMTAHW-DDIGIKS 320
Q +Q +WA L + V+L +R+ ++ + W D G+
Sbjct: 290 QDPLGKQGWRVWKGQNGWEKYTEVWARHLVNGNVAVVLFNRRVDKAVQISTWFKDYGMGK 349
Query: 321 NNT---LFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
L+ H+ L F +AKV + M ++T
Sbjct: 350 QAAVRDLYLHQDLGV-FTSGYSAKVYPNGVVMVKIT 384
>gi|189207006|ref|XP_001939837.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975930|gb|EDU42556.1| alpha-galactosidase precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 447
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 122/447 (27%), Positives = 167/447 (37%), Gaps = 142/447 (31%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISL 65
G P MGWNSWN + C INE + E DDCW
Sbjct: 27 GRLPTMGWNSWNEYGCAINESVFLEVGELLNTLGLKKLGYTYVNIDDCW----------- 75
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
S H DS+T G + + FP+G K AD +H L
Sbjct: 76 SNKTHQRDSVT------------GQIRPDSNKFPNGIKHTADEIHKLDLKVGIYSDAGDT 123
Query: 113 ----ISNSLDNR-IESNIAFEQGIDYLKYDNC-----FNDDTR----------------- 145
+ SL++ +++ GIDYLKYDNC ++D+ R
Sbjct: 124 TCGGYAGSLEHEELDARTFAGWGIDYLKYDNCAVPDRWHDEYRWWPENWLGGPPAENQTA 183
Query: 146 -----------PT----------IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
PT RY+ MSDAL R I FS C WG WG+
Sbjct: 184 GGDGETKPVAAPTGYDWTTSKTFTRYKTMSDALLATNRTIEFSQCAWGHAHIDEWGNRTG 243
Query: 185 NSWRTTDDISDSWAR-------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
+SWR DI W ++ I + Y + G D DMLEVGNG E
Sbjct: 244 HSWRMWGDIYPQWEGNHQGSWGLMPILNHASFYNNDTNFWGHGDWDMLEVGNGNFTIEEN 303
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------------- 278
HF++WA K+PL++G + N+ E + I+ N E+I NQ
Sbjct: 304 RSHFALWAALKSPLIIGTPLSNIKPEILDILSNNELIDFNQDSVVGKAAKPYKWGVNPDF 363
Query: 279 ---AINDQQIWAGPLS-GNRIVVL----LESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL 330
+ + W+G S G + VL +E KT + D K+ +T+FE T
Sbjct: 364 TWNQTHPAEYWSGKSSKGVHVFVLNTLNIEQTKTINFAEVPELDP---KTEHTVFESWTG 420
Query: 331 ET--KFQGKLTAKVDGHSCKMYELTPV 355
+ +F G+ A V GH L V
Sbjct: 421 KERGRFTGQYEAAVAGHDTIAIRLIEV 447
>gi|169595744|ref|XP_001791296.1| hypothetical protein SNOG_00615 [Phaeosphaeria nodorum SN15]
gi|160701151|gb|EAT92110.2| hypothetical protein SNOG_00615 [Phaeosphaeria nodorum SN15]
Length = 359
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 114/221 (51%), Gaps = 53/221 (23%)
Query: 134 LKYDNCFNDDT--RPTI---RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWR 188
LKYD+C++ P++ R++ M+DALK G+ I F++C WG+ WG +I NSWR
Sbjct: 2 LKYDSCYHMGRVGTPSVSFNRFKTMADALKATGKNILFNLCNWGEDLVHTWGMSIANSWR 61
Query: 189 TTDDISDSWAR--------MLTIADMNEV-----------------YADHAKPGGWNDPD 223
T DI DS++R + D+N V +AD + PGGW+D D
Sbjct: 62 ITGDIYDSFSRPDDLCGCNTVAPGDVNCVAPGTHCSTLFILNKVAPFADRSIPGGWSDLD 121
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
MLEVG GG EY HF++WA K+PL LG D+ ++ A + II N +IA++Q
Sbjct: 122 MLEVGQGGQTDEEYKAHFALWAAIKSPLFLGNDLRSMPASALTIINNPAIIALSQDPHGR 181
Query: 279 -----------------AINDQQIWAGPL-SGNRIVVLLES 301
+ + +WAG L +G+ +V+LL +
Sbjct: 182 SVTRVRRDTEGVAKDEWGMGETHVWAGHLHNGDEVVILLNA 222
>gi|367036174|ref|XP_003667369.1| glycoside hydrolase family 27 protein [Myceliophthora thermophila
ATCC 42464]
gi|347014642|gb|AEO62124.1| glycoside hydrolase family 27 protein [Myceliophthora thermophila
ATCC 42464]
Length = 416
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 135/314 (42%), Gaps = 81/314 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ---------INEKIIR-----------ETDDCWGESSR 59
L NG+GLTP MGW+SWN C EK I DDCW SR
Sbjct: 21 LDNGVGLTPHMGWSSWNVAQCDAASAKYALDTAEKFISLGLKDLGYEYINIDDCWSLKSR 80
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
D G LV + +P G K +AD +H L
Sbjct: 81 DEN--------------------------GKLVPDPGKWPDGIKPVADRIHDLGLKFGLY 114
Query: 113 -----------ISNSLDNRIESNIA--FEQGIDYLKYDNCFN---DDTRPTI-------- 148
+ D S+++ E G+D+ KYDNC+ D+ P
Sbjct: 115 GCAGQKTCAGYPGSDGDKYAASDVSQLVEWGVDFWKYDNCYTPCLDNPPPQTCQRPAGNS 174
Query: 149 --RYRAMSDAL--KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
Y M DA+ + R I F++C WG WG + +SWR + D W + I
Sbjct: 175 QEWYAPMRDAILGVQETRKIHFNLCNWGRDDVWGWGDDYGHSWRMSVDNWGDWESVERIG 234
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
A+++ PGG+ND DML +G+ + NE +HF +WAI+K+PL+LG D+ ++ T
Sbjct: 235 SAAADIAEYSGPGGFNDLDMLYLGSPKLNANEERLHFGLWAIAKSPLVLGLDLATISNAT 294
Query: 265 MPIIGNEEVIAVNQ 278
+ II N+ +I +NQ
Sbjct: 295 LDIIRNKGIIDINQ 308
>gi|34539955|ref|NP_904434.1| alpha-galactosidase [Porphyromonas gingivalis W83]
gi|34396266|gb|AAQ65333.1| alpha-galactosidase [Porphyromonas gingivalis W83]
Length = 434
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 152/318 (47%), Gaps = 51/318 (16%)
Query: 1 LFIVSSSAADSNGSDR---RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
+F SAA+ +G+D + +S GL P MGW++ +INE+ I + D
Sbjct: 46 VFEFEISAANESGADSCIWQVEVSRYNGLAPVMGWSTRWLKEKEINEQTILDVADA---M 102
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
++ + + +S Q + + G + A+ FP+G K LAD +H
Sbjct: 103 QSKGLVAAGYNHIIIESHWQTSVR----DSDGRIKADPLRFPNGIKHLADELHRRGMRLG 158
Query: 113 ------------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI------RYRAM 153
+S S + I++ E G+D++KYD C + P I RY AM
Sbjct: 159 LSSNASPLNVHGLSGSFEKEAIDAATFSEWGVDFIKYDYCCS----PPISIVAQQRYGAM 214
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWAR---------MLTI 203
AL GRP FFS+CEWG++ P WG +R SWRT+ DI D W +
Sbjct: 215 GRALAATGRPPFFSVCEWGELMPWSWGITVRAGSWRTSFDIQDKWKAGRYSKRDNGIWDA 274
Query: 204 ADMNEVYADHAKPG-GWNDPDMLEVG--NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
A +N YA HA G +DPD+L +G + G EY +WA+ +APL+ D ++
Sbjct: 275 AMINLRYAQHANRGRAISDPDLLLIGSPSNGCTAEEYHAQMQMWAMMRAPLVFAADPRSV 334
Query: 261 TAETMPIIGNEEVIAVNQ 278
+AE + ++ + ++IAV+Q
Sbjct: 335 SAEAVALLTHPDLIAVDQ 352
>gi|334146171|ref|YP_004509098.1| alpha-galactosidase [Porphyromonas gingivalis TDC60]
gi|333803325|dbj|BAK24532.1| alpha-galactosidase [Porphyromonas gingivalis TDC60]
Length = 434
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 152/318 (47%), Gaps = 51/318 (16%)
Query: 1 LFIVSSSAADSNGSDR---RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
+F SAA+ +G+D + +S GL P MGW++ +INE+ I + D
Sbjct: 46 VFEFEISAANESGADSCIWQVEVSRYNGLAPVMGWSTRWLKEKEINEQTILDVADA---M 102
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
++ + + +S Q + + G + A+ FP+G K LAD +H
Sbjct: 103 QSKGLVAAGYNHIIIESHWQTSVR----DSDGRIKADPLRFPNGIKHLADELHRRGMRLG 158
Query: 113 ------------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI------RYRAM 153
+S S + I++ E G+D++KYD C + P I RY AM
Sbjct: 159 LSSNASPLNVHGLSGSFEKEAIDAATFSEWGVDFIKYDYCCS----PPISIVAQQRYGAM 214
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWAR---------MLTI 203
AL GRP FFS+CEWG++ P WG +R SWRT+ DI D W +
Sbjct: 215 GRALAATGRPPFFSVCEWGELMPWSWGITVRAGSWRTSFDIQDKWKAGRYSKRDNGIWDA 274
Query: 204 ADMNEVYADHAKPG-GWNDPDMLEVG--NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
A +N YA HA G +DPD+L +G + G EY +WA+ +APL+ D ++
Sbjct: 275 AMINLRYAQHANRGRAISDPDLLLIGSPSNGCTAEEYHAQMQMWAMMRAPLVFAADPRSV 334
Query: 261 TAETMPIIGNEEVIAVNQ 278
+AE + ++ + ++IAV+Q
Sbjct: 335 SAEAVALLTHPDLIAVDQ 352
>gi|419970338|ref|ZP_14485837.1| alpha-galactosidase domain protein [Porphyromonas gingivalis W50]
gi|392611009|gb|EIW93763.1| alpha-galactosidase domain protein [Porphyromonas gingivalis W50]
Length = 481
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 152/318 (47%), Gaps = 51/318 (16%)
Query: 1 LFIVSSSAADSNGSDR---RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES 57
+F SAA+ +G+D + +S GL P MGW++ +INE+ I + D
Sbjct: 93 VFEFEISAANESGADSCIWQVEVSRYNGLAPVMGWSTRWLKEKEINEQTILDVADA---M 149
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL----- 112
++ + + +S Q + + G + A+ FP+G K LAD +H
Sbjct: 150 QSKGLVAAGYNHIIIESHWQTSVR----DSDGRIKADPLRFPNGIKHLADELHRRGMRLG 205
Query: 113 ------------ISNSLDNR-IESNIAFEQGIDYLKYDNCFNDDTRPTI------RYRAM 153
+S S + I++ E G+D++KYD C + P I RY AM
Sbjct: 206 LSSNASPLNVHGLSGSFEKEAIDAATFSEWGVDFIKYDYCCS----PPISIVAQQRYGAM 261
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWAR---------MLTI 203
AL GRP FFS+CEWG++ P WG +R SWRT+ DI D W +
Sbjct: 262 GRALAATGRPPFFSVCEWGELMPWSWGITVRAGSWRTSFDIQDKWKAGRYSKRDNGIWDA 321
Query: 204 ADMNEVYADHAKPG-GWNDPDMLEVG--NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
A +N YA HA G +DPD+L +G + G EY +WA+ +APL+ D ++
Sbjct: 322 AMINLRYAQHANRGRAISDPDLLLIGSPSNGCTAEEYHAQMQMWAMMRAPLVFAADPRSV 381
Query: 261 TAETMPIIGNEEVIAVNQ 278
+AE + ++ + ++IAV+Q
Sbjct: 382 SAEAVALLTHPDLIAVDQ 399
>gi|195453629|ref|XP_002073870.1| GK12921 [Drosophila willistoni]
gi|194169955|gb|EDW84856.1| GK12921 [Drosophila willistoni]
Length = 424
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 92/344 (26%), Positives = 146/344 (42%), Gaps = 104/344 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL LTPPMGW + + C I+E++ R T
Sbjct: 27 LDNGLVLTPPMGWMPFERYRCMTDCVRFPRDCISERLFRRTADLLVSEGYAKVGYKYLII 86
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD+ L+ N+ FPSG L++Y+H
Sbjct: 87 DDCWMERQRDTTTD-------------------------ELLPNENRFPSGMNDLSNYIH 121
Query: 111 ILISNSLDNRIESNIAFEQ----------------------GIDYLKYDNCFNDDTRPTI 148
+ L + ++ ++ G+DY+K D C+ +
Sbjct: 122 ---NRGLKFGLYHDVGYKTCMYQGPGAIDHFELDAKTFASWGVDYIKMDGCYASERDLDK 178
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTIAD 205
Y A AL RP+ +S C W + I+ N WR +DI+DS+ +IA+
Sbjct: 179 GYPAFGQALNNTKRPMVYS-CSWPFYKSKPNYQLIKKHCNLWRFAEDINDSYE---SIAN 234
Query: 206 MNEVY-------ADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
+ ++Y A+H+ PG WNDPDML +GN + Y++ + F+IWA+ APL++ D+
Sbjct: 235 IMQIYHKQQHLLANHSGPGHWNDPDMLVLGNYFLSYDQSRLQFAIWAVIAAPLIMTNDLE 294
Query: 259 NLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
+ E ++ N ++IA+NQ ++ QIWA P+
Sbjct: 295 TVRPEIKALLQNRDIIAINQDRLGNPGKCILTKHNFQIWARPVE 338
>gi|391328645|ref|XP_003738796.1| PREDICTED: alpha-galactosidase A-like [Metaseiulus occidentalis]
Length = 483
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 142/322 (44%), Gaps = 65/322 (20%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGL TPPMGW SW F C++ DC + DS IS M D +T+ +
Sbjct: 29 NGLAKTPPMGWLSWERFLCEV---------DC--KKYPDSCISEKLYTQMADVMTEEGYR 77
Query: 82 PLP------NQC--------QGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI- 121
+ C G L AN + FP G K LADY+H + I +
Sbjct: 78 DAGYEYVNIDDCWMSSQRDFDGTLQANYSRFPHGIKWLADYMHARGLKMGIYQDCGTKTC 137
Query: 122 ------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
++N G+D LK D C+ D + Y M+ AL +GR + +S C
Sbjct: 138 GGYPGSEGFFKKDANTYAAWGVDMLKLDGCYADPNKMDEIYPQMTQALHDSGRSMVYS-C 196
Query: 170 EWG----DMRPALWGSNIR--NSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGW 219
W D R + S + N WR DDI+DSW + I D D ++ PG W
Sbjct: 197 SWPAYQFDKRKPDYNSISKHCNLWRNFDDIADSWQSVTKIMDYYAKVQDDLIPYSGPGAW 256
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
+DPDML +G+ G+ ++ +IW+I APL + D+ + + I+ N +VIAV+Q
Sbjct: 257 SDPDMLIIGDFGLSIDQAKTQMAIWSILAAPLFMSADLRKMDPKFKDILLNRDVIAVDQD 316
Query: 279 --------AINDQ--QIWAGPL 290
ND+ QIW P+
Sbjct: 317 EYGRMGRRIFNDKSIQIWIRPV 338
>gi|322799781|gb|EFZ20978.1| hypothetical protein SINV_01689 [Solenopsis invicta]
Length = 811
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 135/341 (39%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 21 LENGLARTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMTDIVVAEGYAAVGYEYINV 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV +K FP G K+LADYVH
Sbjct: 81 DDCWLEKERD--------------------------INGQLVPDKERFPYGMKSLADYVH 114
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
I N I++ +DY+K D C++ Y
Sbjct: 115 SKGLKFGIYEDYGNYTCAGYPGILGYLDIDAATFASWDVDYVKLDGCYSHPVDMDRGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIAD 205
L + G+ + +S C W M+P N WR DDI DSWA + +I D
Sbjct: 175 FGFHLNQTGKHMIYS-CSWPVYQIYAGMKPNYTSIAAHCNLWRNFDDIQDSWASVESIID 233
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ +A PG WNDPDML +GN G+ Y + +IWAI APLL+ D+ +
Sbjct: 234 YYGNNQDGIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMAIWAILAAPLLMSVDLRTIR 293
Query: 262 AETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLS 291
E I+ N+++IAV+Q +IWA P++
Sbjct: 294 PEYKAILQNKKIIAVDQDPLGIQGRRIYKYKGIEIWARPIT 334
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/305 (31%), Positives = 131/305 (42%), Gaps = 61/305 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C ++ K + D+C S R +I M D +
Sbjct: 416 LENGLARTPPMGWLAWERFRCNVDCK--NDPDNC--ISDRLFRI-------MADRVVSQG 464
Query: 80 AKPLPNQ--------------CQGNLVANKTTFPSGTKALADYVH--------------- 110
+ + Q G LV + FP G K+LADYVH
Sbjct: 465 YRAVGYQYIIMDDCWLARQRNSNGLLVPDSERFPYGIKSLADYVHSKGLKFGIYEDYGNF 524
Query: 111 --ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFS 167
L + + F G+DY+K D C + Y L + GRP+ +S
Sbjct: 525 TCAGFPGILGHLTGDAVTFASWGVDYVKLDGCHANPYDMNTGYPEFGYYLNQTGRPMVYS 584
Query: 168 MCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIADMNEVYADH------- 213
C W + P + N WR DI DSWA + +I D Y D+
Sbjct: 585 -CSWPVYQIYARIEPNYAAIAQTCNLWRNYHDIQDSWASVESIIDY---YGDNQDTLMPW 640
Query: 214 AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
A PG WNDPDML +GN G+ Y++ +IWAI APLL+ D+ + I+ N+++
Sbjct: 641 AGPGRWNDPDMLIIGNFGLSYDQSKTQMAIWAILAAPLLMSVDLRTIRPLYKAILQNKKI 700
Query: 274 IAVNQ 278
IAVNQ
Sbjct: 701 IAVNQ 705
>gi|427794437|gb|JAA62670.1| Putative alpha-d-galactosidase melibiase, partial [Rhipicephalus
pulchellus]
Length = 625
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 130/317 (41%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C I+ E++ +E
Sbjct: 143 LENGLARTPPMGWLAWERFQCNIDCSREPENCISERLFKEMAEALVYEGYRDVGYEYVNI 202
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD G L AN T FP+G K +AD++H
Sbjct: 203 DDCWMNKERD--------------------------VVGRLQANITRFPNGIKHIADFMH 236
Query: 111 -----ILISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPTIRYRA 152
+ I ++ + + A G +D +K D C+ + Y
Sbjct: 237 ARGLKLGIYGNVGTKTCAGYAGSLGNLYTDAQTFADWEVDMVKMDGCYANIQDYPRLYTD 296
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGS-NIR------NSWRTTDDISDSWARMLTIAD 205
DA+ + GRP+ +S C W + S N + N WR DI+D+W + ++ D
Sbjct: 297 FGDAINRTGRPMVYS-CSWPAYEVSYGASPNYKLIGHHCNLWRNYVDIADTWQSVESVID 355
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
A PG WNDPDML +GN G+ Y++ +IWAI APLL+ D+ +
Sbjct: 356 YYAANQNALTAAAAPGRWNDPDMLVIGNFGLSYDQSRAQMAIWAIMAAPLLMSNDLRRMR 415
Query: 262 AETMPIIGNEEVIAVNQ 278
E I+ N +IAVNQ
Sbjct: 416 PEFKQILQNRAIIAVNQ 432
>gi|323331075|gb|EGA72494.1| hypothetical protein AWRI796_5155 [Saccharomyces cerevisiae
AWRI796]
Length = 266
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 88/251 (35%), Positives = 116/251 (46%), Gaps = 53/251 (21%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
NGLGLTP MGW+SWN F C ++E+++ +T D R S + L + + L +
Sbjct: 24 NGLGLTPQMGWDSWNTFACDVSEQLLLDTAD------RISDLGLKDMGYKYVILDDCWSS 77
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH---ILIS--------------NSLDNRIESN 124
G LVA+K FP+G +AD++H L SL E
Sbjct: 78 G--RDSDGFLVADKHKFPNGMGHVADHLHNNSFLFGMYSSAGEYTCAGYPGSLGREEEDA 135
Query: 125 IAFEQG-IDYLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPAL 178
F +DYLKYDNC+N T RY+AMSDAL K GRPIF+S+C WG
Sbjct: 136 QFFANNRVDYLKYDNCYNKGQFGTPDVSYHRYKAMSDALNKTGRPIFYSLCNWGQDLTFY 195
Query: 179 WGSNIRNSWRTTDDISDSWAR----------------------MLTIADMNEVYADHAKP 216
WGS I NSWR + DI+ + R ++ I + +A
Sbjct: 196 WGSGIANSWRMSGDITAEFTRPDSRCPCDGDEYDCKYAGFHCSIMNILNKAAPMGQNAGV 255
Query: 217 GGWNDPDMLEV 227
GGWND D LEV
Sbjct: 256 GGWNDLDNLEV 266
>gi|225680261|gb|EEH18545.1| alpha-galactosidase [Paracoccidioides brasiliensis Pb03]
Length = 455
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 153/354 (43%), Gaps = 80/354 (22%)
Query: 33 NSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFHMFD 73
N+WN F+C +NE+++ + DDCW + R+S L + F
Sbjct: 2 NTWNSFYCDLNEEVVLDAADKIVQLGFMDLGYEYIVLDDCW-SAGRNSSDYLQPNLEKFP 60
Query: 74 S-LTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGID 132
S + +AAK + + ++ +GTK A Y L D + ++ GID
Sbjct: 61 SGIDGLAAKIHAMGLKIGIYSS-----AGTKTCAHYKGSLGYEEKDAELWAS----WGID 111
Query: 133 YLKYDNCFNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
YLKYDNC+N+ T RY M AL GRPI +S+C WG P + I NSW
Sbjct: 112 YLKYDNCYNEGQEGTPLLSFNRYNVMGKALNATGRPILYSLCNWGIDGPWNFAPTIANSW 171
Query: 188 RTTDDISDSWAR-----------------------MLTIADMNEVYADHAKPGGWNDPDM 224
R T D+ ++ R ++ + + G WND DM
Sbjct: 172 RITGDLFLNYNRETPECPCAELGGLDCKLPGFRCSVMNVLNKAAYLPSKGFSGAWNDLDM 231
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN------- 277
L VGNGG+ + V HFS+WA K+PLL+ + + A+++ I+ N V+A++
Sbjct: 232 LVVGNGGLTDDAMVAHFSLWAALKSPLLMTPVLSKVDAKSLSILQNIAVLAISQDSAGLS 291
Query: 278 ------QAINDQ---------QIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDI 316
Q + D Q+++G L+G VVL + M + +DI
Sbjct: 292 ATRKWRQYVGDADEFGREGEIQMFSGRLAGGDEVVLFLNAAATDRQMESSLEDI 345
>gi|402075966|gb|EJT71389.1| alpha-galactosidase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 408
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 147/355 (41%), Gaps = 100/355 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC-QINEKIIRET-------------------DDCWGESSR 59
L NGLG P +GWNSWN C K +T DDCW +R
Sbjct: 25 LENGLGRKPALGWNSWNVAQCNSATAKYALDTANLFVSLGLKDLGYEYVNIDDCWSTMNR 84
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
+ G LVA+ +P G K + D +H +
Sbjct: 85 NGS--------------------------GYLVADPNKWPQGVKPVVDKIHSMGLKFGLY 118
Query: 113 ----------ISNSLDNRIE-SNIAFEQGIDYLKYDNCFND-DTRPTIR----------- 149
S + E + + G+D K+DNCF + P +
Sbjct: 119 GCAGTKTCGGYPGSWGHETEDAKLLASWGVDLWKHDNCFTPCEGNPWPQTCWGKKAGGHT 178
Query: 150 ---YRAMSDALK--KAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
+ M DAL + + I+FSMC+WG WG + N+WR ++D + WA ++ IA
Sbjct: 179 QEWFGKMRDALMSVRNQKSIYFSMCQWGVDNVWTWGKDYGNAWRMSNDNWNDWASVVRIA 238
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
A +A PGG+ND DM+++GNG + + H +WAI+K+P++LG D+ ++ T
Sbjct: 239 STAGTIAQYAGPGGFNDLDMMQLGNGKLTAAQERTHMGLWAIAKSPIILGNDLSKISQAT 298
Query: 265 MPIIGNEEVIAVNQ-----------------AINDQ--QIWAGPLSGNRIVVLLE 300
+ ++ N+ +I +NQ +N Q WAG LS +V L+
Sbjct: 299 LSLVRNKGLIGINQDSLGKAAGYFRPPGKPAPVNGQIYNYWAGQLSDGVVVALVS 353
>gi|391338934|ref|XP_003743808.1| PREDICTED: alpha-galactosidase A-like isoform 2 [Metaseiulus
occidentalis]
Length = 451
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 138/324 (42%), Gaps = 94/324 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN--------------EKIIRE---------------T 50
L NGL L PPMGW SW F C+I+ ++I E
Sbjct: 50 LDNGLALKPPMGWISWERFRCEIDCDQYPNDCISENLYKRIADELVDGGYRDVGYVHVNI 109
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS G +VA+ FPSG K LA Y+H
Sbjct: 110 DDCWALKERDSN--------------------------GRMVADPKRFPSGMKGLAKYMH 143
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCF--NDDTRPTIRY 150
+ I + ++ I++ + ID LKYD CF N+ P + Y
Sbjct: 144 DRGLKLGIYSDAGSKTCAGYPGSRDYEDIDAQTYADWDIDMLKYDGCFIANEADIPNL-Y 202
Query: 151 RAMSDALKKAGRPIFFSMCEWG----------DMRPALWGSNIR--NSWRTTDDISDSWA 198
M+ AL K G+ I +S CEW +P WG N WR DDI+D++
Sbjct: 203 MKMTQALNKTGKQIVYS-CEWPLYQKNTETTLSFQPD-WGKIAASCNLWRNYDDIADTFE 260
Query: 199 RMLTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+ D+ ++Y H KPG + DPDML +G+ G+ +E V +IWAI APL +
Sbjct: 261 SVKRTIDVFVKNQDLYVKHQKPGAFFDPDMLILGDYGLSKDEARVQMAIWAIWGAPLFMS 320
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ 278
D+ + ++ ++ N VI +NQ
Sbjct: 321 NDLAKIDEDSKKLLLNRGVIGINQ 344
>gi|170030918|ref|XP_001843334.1| alpha-galactosidase A [Culex quinquefasciatus]
gi|167868814|gb|EDS32197.1| alpha-galactosidase A [Culex quinquefasciatus]
Length = 428
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 109/433 (25%), Positives = 163/433 (37%), Gaps = 128/433 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL PPMGW +W F C I+E +I T
Sbjct: 19 LENGLARLPPMGWMTWERFRCITDCKKYPTECISENLIMRTADLMVLEGYLDAGYEYVNI 78
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ G L A+ FPSG K L+DY+H
Sbjct: 79 DDCWMMRERDSE--------------------------GKLHADPERFPSGIKYLSDYIH 112
Query: 111 ILISNSLDNRIESNIA---------------------FEQGIDYLKYDNCFNDDTRPTIR 149
S L I +I E +D++K D C+ D+ +
Sbjct: 113 ---SKGLKFGIYQDIGTHTCAGYPGMKNYFEIDAKTFAEWEVDFIKIDGCYADEVKMVDD 169
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSN-----------IRNSWRTTDDISDSWA 198
Y + + + GRPI +S C W PA N N WR DI DS +
Sbjct: 170 YVYFGELMNQTGRPILYS-CSW----PAYQEYNGITPNYETLKKTCNMWRNWGDIEDSHS 224
Query: 199 RMLTI----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLG 254
+ +I +D + H+ PG WNDPD L +GN G+ Y + ++W I AP LL
Sbjct: 225 SVESITQYFSDNQDRIQPHSGPGHWNDPDTLVLGNYGLSYEQSKSQLAVWTILAAPFLLS 284
Query: 255 CDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPL------SGNRIVV 297
D+ +T + ++ N+E+IA+NQ +N +IW P+ +
Sbjct: 285 NDLARVTPDVKALLLNKEIIAMNQDSLGIQGLLVKTVNKIEIWKKPILPKVKDEMTHGIA 344
Query: 298 LLESRKTFS--SMMTAHWDDIGIKSNNT--------LF--EHRTLETKFQGKLTAKVDGH 345
+ R + S+ +D+G+ + LF EH+ K G +++
Sbjct: 345 FVSRRADGAPYSVSVKVIEDLGLGGSQYIKGYMVYDLFDAEHKPFFVKVNGTFNTRINPT 404
Query: 346 SCKMYELTPVTKL 358
+Y PV +
Sbjct: 405 GANLYRFEPVLNI 417
>gi|391339086|ref|XP_003743884.1| PREDICTED: alpha-galactosidase A-like [Metaseiulus occidentalis]
Length = 413
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 134/315 (42%), Gaps = 80/315 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL L PPMGW SW F C I DC + S D IS M D L
Sbjct: 20 LENGLALRPPMGWKSWERFRCNI---------DC--DKSPDECISEKLYKRMADELVAQG 68
Query: 80 AKPL-------------PNQCQGN-LVANKTTFPSGTKALADYVHILISNSLDNRIESNI 125
+ L P + QGN LVA+ FPSG KALA Y+H D ++ I
Sbjct: 69 YRDLGYVFVSIDDCWSMPERDQGNRLVADTKRFPSGIKALAKYMH-------DRGLKLGI 121
Query: 126 AFEQG-------------------------IDYLKYDNCFNDDTRPT-IRYRAMSDALKK 159
+ G ID LK+D C D + + Y AM+DAL K
Sbjct: 122 YADAGASTCRGYPGSMQYVSTDAQTFADWDIDMLKFDGCNVPDPKTAGLIYIAMTDALNK 181
Query: 160 AGRPIFFSMCEWGDMRPALWGSNIR------------NSWRTTDDISDSWARMLTIADMN 207
GR I +S CEW L+ + N++R D+ D + + + D
Sbjct: 182 TGRDILYS-CEW-----PLYQLEFKLEVNFETVAATCNTFRNYYDVQDKFESVQNVLDFY 235
Query: 208 EVYAD----HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
Y D H +PG + DPD L +GN G+ ++E ++WAI +PL + D+ +
Sbjct: 236 VRYQDDLTPHQRPGAFFDPDALVIGNFGLSHDESQTQMALWAIWGSPLFMSNDLTEVEPG 295
Query: 264 TMPIIGNEEVIAVNQ 278
I+ NE VIA+NQ
Sbjct: 296 FKAILQNEAVIALNQ 310
>gi|365875039|ref|ZP_09414569.1| alpha-galactosidase [Elizabethkingia anophelis Ag1]
gi|442589317|ref|ZP_21008125.1| alpha-galactosidase [Elizabethkingia anophelis R26]
gi|365757151|gb|EHM99060.1| alpha-galactosidase [Elizabethkingia anophelis Ag1]
gi|442560927|gb|ELR78154.1| alpha-galactosidase [Elizabethkingia anophelis R26]
Length = 385
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 135/284 (47%), Gaps = 47/284 (16%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN+F INE++I+E D S L + + + ++
Sbjct: 28 PIMGWSSWNNFRININEQMIKEQADALVSSG------LYAAGYRYINIDD--GYFGGRDE 79
Query: 88 QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA--------------FEQGI-- 131
+GNL+ + FPSG K LA Y+H S L I S+ F G+
Sbjct: 80 KGNLLTDNKKFPSGMKNLAAYIH---SKGLKAGIYSDAGKNTCGSIWDNDKQGFGVGLYG 136
Query: 132 ---------------DYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
D++K D C + + Y + + +K I F++C W
Sbjct: 137 HLNQDADLFFKDWKYDFIKVDWCGGEQMKLNEQEEYTKIINKVKTIDPNIVFNVCRW--Q 194
Query: 175 RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKY 234
P W I +SWR + DIS ++ +L I D+N+ +A G +ND DML+VG G M Y
Sbjct: 195 FPGEWAIKIADSWRVSGDISAKFSSILHIIDLNKNLYSYASAGHYNDMDMLQVGRG-MSY 253
Query: 235 NEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+E HFS+WA+ +PLL G D+ ++ T+ I+ N+E+IA+NQ
Sbjct: 254 DEDKTHFSMWAMLNSPLLAGNDLRTMSKATIEILTNKEIIALNQ 297
>gi|226469126|emb|CAX70042.1| N-acetyl galactosaminidase, alpha [Schistosoma japonicum]
gi|226469128|emb|CAX70043.1| N-acetyl galactosaminidase, alpha [Schistosoma japonicum]
Length = 434
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 93/329 (28%), Positives = 139/329 (42%), Gaps = 94/329 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQI----------NEKIIRET------------------- 50
L NGL TPPMGW +W F CQI NE +I+ T
Sbjct: 26 LDNGLARTPPMGWMTWQRFRCQIDCKEYPNDCINENLIKRTADKLVLNGWRDLGYKYVII 85
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RDS+ + LV + FP+G K + +Y+H
Sbjct: 86 DDCWPARKRDSKTN-------------------------ELVPDPDRFPNGMKNVGEYLH 120
Query: 111 ---ILISNSLDN---------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+L LD +++ E +DY+K D C++ Y
Sbjct: 121 SKNLLFGIYLDYGTLTCEGYPGSMNYLELDARSIAEWKVDYVKMDGCYSLPNIQPEGYEN 180
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPAL--WGSNIR-----------NSWRTTDDISDSWAR 199
S L GRP+ FS C + PA W +NI+ N WR DI DSW+
Sbjct: 181 FSRLLNTTGRPMVFS-CSY----PAYISWINNIKLIDWNRLKKNCNLWRVLGDIQDSWSS 235
Query: 200 MLTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
+++I + N++ A PG WNDPD L +GN G+ ++ VHF +W + APLL+
Sbjct: 236 VVSIINAYKIRNDILPKVAGPGHWNDPDTLLLGNYGLSNDQKRVHFGMWCMFAAPLLISA 295
Query: 256 DVGNLTAETMPIIGNEEVIAVNQAINDQQ 284
D+ N+ ++ ++ N ++A++Q Q
Sbjct: 296 DMDNIDGFSVSLLRNAHLLAIDQDKGGHQ 324
>gi|387165374|gb|AFJ59926.1| Aga1 [Myceliophthora thermophila]
Length = 435
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 138/295 (46%), Gaps = 44/295 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD-SQISLSCSFHMFDSLTQI 78
L++G+G P +G+N+WN + C ++ E ++ D + ++ M D +
Sbjct: 15 LNDGVGKLPALGFNTWNLYQCNYTADVLLEQ----AQAIVDRGLLKAGYNYFMLDDCYSL 70
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRI 121
+ G +V + FP+G K + + L S N
Sbjct: 71 KER----DENGRIVEDPEKFPNGMKNFTESLAKLGFRAGIYSDAGYRTCGGYPGSYGNEA 126
Query: 122 ESNIAF-EQGIDYLKYDNC---FNDDTRPTI--RYRAMSDALKKAGR-----PIFFSMCE 170
+ F E G +YLKYDNC F++ T+ + RY M++A++ P+ ++CE
Sbjct: 127 KDLETFAEWGFEYLKYDNCYIPFDNVTQENVYGRYERMAEAIRARAEETDSAPLQLALCE 186
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN- 229
WG +P W + SWR DI W+ + +I + A ND DMLEVGN
Sbjct: 187 WGWQQPWRWAGRLGQSWRIGGDIRPWWSALSSIINQASFIAGATGFYARNDLDMLEVGNT 246
Query: 230 ------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G + Y+E HF+ WA+ K+PLL+G D+ + + ETM I+GNE+++ +NQ
Sbjct: 247 GIGTPPGNLSYDEAKSHFTAWALLKSPLLIGSDMASASRETMEILGNEDILRINQ 301
>gi|374374115|ref|ZP_09631774.1| Alpha-galactosidase [Niabella soli DSM 19437]
gi|373233557|gb|EHP53351.1| Alpha-galactosidase [Niabella soli DSM 19437]
Length = 407
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 153/358 (42%), Gaps = 69/358 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCW----GESSRDSQISLSCSFHMFDSLTQIAAKP 82
TP MGW SWN++H INEKII+ D +++R I++ F F
Sbjct: 31 TPMMGWASWNNYHVHINEKIIKAQADALVATGMKAARYQFINIDDGF--FGG-------- 80
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-------------------------ILISNSL 117
G + + FP+G ++L Y+H I I L
Sbjct: 81 --RDINGTIKTHPARFPNGMRSLVTYIHSKGLKAGTYSDAGINTCGSYWDKDSIGIGMGL 138
Query: 118 DNRIESNIA---FEQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWG 172
++++ + G D++K D C + RY + + +++C W
Sbjct: 139 YGHEQTDLQRMLIDWGFDFIKVDWCGGEWLGLDEQTRYTEIGRLIHSIKPAALYNVCRW- 197
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W N+ +SWR + DI S+ ++ I D++ +A PG ND DML+VG G M
Sbjct: 198 -KFPGKWVVNVADSWRISGDIRASFESIMKIVDLDADLWPYAGPGHVNDMDMLQVGRG-M 255
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIN 281
Y E HFS+W + +PLL G D+ ++ ET+ I+ N E+I++NQ
Sbjct: 256 SYEEDKAHFSMWCMLNSPLLAGNDLTKMSKETLSILTNPEMISINQDPLVYQARRLKDYG 315
Query: 282 DQQIWAGPL----SGNRIVVLLESRKTFSSMMTAHW--DDIGIKSNNTLFEHRTLETK 333
DQ++WA PL SG V LL K + T H+ D + I S H K
Sbjct: 316 DQEVWARPLHHTMSGEVAVALLNRSKNKT---TIHFDLDSVSIDSKKGYLMHDVWNKK 370
>gi|367030263|ref|XP_003664415.1| glycoside hydrolase family 27 protein [Myceliophthora thermophila
ATCC 42464]
gi|347011685|gb|AEO59170.1| glycoside hydrolase family 27 protein [Myceliophthora thermophila
ATCC 42464]
Length = 428
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 138/295 (46%), Gaps = 44/295 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRD-SQISLSCSFHMFDSLTQI 78
L++G+G P +G+N+WN + C ++ E ++ D + ++ M D +
Sbjct: 6 LNDGVGKLPALGFNTWNLYQCNYTADVLLEQ----AQAIVDRGLLKAGYNYFMLDDCYSL 61
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRI 121
+ G +V + FP+G K + + L S N
Sbjct: 62 KER----DENGRIVEDPEKFPNGMKNFTESLAKLGFRAGIYSDAGYRTCGGYPGSYGNEA 117
Query: 122 ESNIAF-EQGIDYLKYDNC---FNDDTRPTI--RYRAMSDALKKAGR-----PIFFSMCE 170
+ F E G +YLKYDNC F++ T+ + RY M++A++ P+ ++CE
Sbjct: 118 KDLETFAEWGFEYLKYDNCYIPFDNVTQENVYGRYERMAEAIRARAEETDSAPLQLALCE 177
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN- 229
WG +P W + SWR DI W+ + +I + A ND DMLEVGN
Sbjct: 178 WGWQQPWRWAGRLGQSWRIGGDIRPWWSALSSIINQASFIAGATGFYARNDLDMLEVGNT 237
Query: 230 ------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G + Y+E HF+ WA+ K+PLL+G D+ + + ETM I+GNE+++ +NQ
Sbjct: 238 GIGTPPGNLSYDEAKSHFTAWALLKSPLLIGSDMASASRETMEILGNEDILRINQ 292
>gi|321459552|gb|EFX70604.1| hypothetical protein DAPPUDRAFT_309394 [Daphnia pulex]
Length = 418
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 129/317 (40%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL L+PPMGW +W F C I+E++ E
Sbjct: 21 LDNGLALSPPMGWLAWQRFRCNTDCVNDPKHCISERLFMEMADLLVSEGYAKAGYNLISL 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW + RDS G L A+ FPSG AL+DY+H
Sbjct: 81 DDCWLDKVRDSD--------------------------GRLKADPIRFPSGIPALSDYIH 114
Query: 111 -----ILISNSLDN-------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
I N ++++ + +DY+K D C++ ++ Y
Sbjct: 115 SRGLRFGIYEDYGNYTCAGYPGILNNMKLDAQTFADWKVDYVKLDGCYSFPSQMDKGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEWG------DMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD 205
L + GR + +S C W M P S N WR +DI DSW + +I D
Sbjct: 175 FGYYLNRTGRSMIYS-CSWPFYQLVTKMEPDYATISKTCNLWRNFEDIQDSWQSVTSIID 233
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ A PG WNDPDML +GN G+ Y + +WAI +PLL+ D+ +
Sbjct: 234 YYGDNQDTLIPLAGPGHWNDPDMLIIGNFGLSYEQSRAQMCMWAIMASPLLMSVDLRTIR 293
Query: 262 AETMPIIGNEEVIAVNQ 278
E I+ N E+IAVNQ
Sbjct: 294 PEFKAILLNTEIIAVNQ 310
>gi|392390775|ref|YP_006427378.1| alpha-galactosidase [Ornithobacterium rhinotracheale DSM 15997]
gi|390521853|gb|AFL97584.1| alpha-galactosidase [Ornithobacterium rhinotracheale DSM 15997]
Length = 529
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 172/385 (44%), Gaps = 81/385 (21%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRD----SQISLSCSFHMFDSLTQIAAKPL 83
P MGW SWN++ I++ II+ D + + I++ F FD
Sbjct: 26 PIMGWASWNNYRNHISDSIIKRQTDYLVKLGLNKIGYQYINIDDGF--FDG--------- 74
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA----------------- 126
G+L NK FP+G K +A+Y+H S L I S++
Sbjct: 75 -RNPDGSLNLNKERFPNGLKPIAEYIH---SKGLKAGIYSDVGPNTCASIWDKDKGGVGA 130
Query: 127 ---------FEQGI-----DYLKYDNCFN-------DDTRPTIRYRAMSDALKKAGRPIF 165
E I D++K D C D+ + + M D++ + R I
Sbjct: 131 GLYQNERKDLEYFINDLKFDFIKVDYCGASQQDLKLDEEQHYTYIKKMLDSI--SSRKIG 188
Query: 166 FSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML 225
F++C W P W +++ +SWR + DI++++ M I D+N A + PG +ND DML
Sbjct: 189 FNVCRW--QFPGSWVTSLADSWRISGDIANTFHSMTEIVDLNTFLAPYMSPGHYNDMDML 246
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQ-- 283
++G G+ Y E S+W+I +PL+LG D+ ++ ET+ +I N E+IAVNQ + +Q
Sbjct: 247 QIGR-GLSYEEDKAQMSMWSILTSPLMLGNDLSQISDETLKVISNTEIIAVNQDMTEQGK 305
Query: 284 ---------QIWAGPL----SGNRIVVLLESRKTFSSMMTAHWDDIGIK---SNNTLFEH 327
Q+W+ L SG R VVL K ++ ++ D+ +K S L+ H
Sbjct: 306 LISNYKDPLQVWSKKLNGINSGERAVVLFNRTKK-KEIIWFNFSDLELKGGVSLRDLWGH 364
Query: 328 RTLETKFQGKLTAKVDGHSCKMYEL 352
+ + K ++ V H +L
Sbjct: 365 KDIPGKKHNRIGVWVPPHGVVALKL 389
>gi|223938039|ref|ZP_03629937.1| Alpha-galactosidase [bacterium Ellin514]
gi|223893253|gb|EEF59716.1| Alpha-galactosidase [bacterium Ellin514]
Length = 603
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 166/377 (44%), Gaps = 67/377 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NG+ LTPPMG+NSW +++ I E+ ++ D + L + + +L
Sbjct: 27 LTNGVALTPPMGYNSWYYWYDGITEENMKAVADMVVTNG------LKAVGYEYINLDDGW 80
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH---------ILISNSLDNRIESNIAFEQ- 129
A G +VAN + FP G K LADY+H + + ++ +E
Sbjct: 81 AGY--RDSNGVMVAN-SKFPHGMKYLADYIHSKGLKIGLYTVFGPTTCAGFPGSMGYELL 137
Query: 130 --------GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPAL- 178
GIDY+KY+ C N + + M DAL GRPI F+M P L
Sbjct: 138 DAQTYASWGIDYVKYEGCSFPNPLSVQEEKCFQMRDALMATGRPIVFTMST----GPVLS 193
Query: 179 WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
W + NSWR T D +W +L D+ A PG WND D+L +G G E
Sbjct: 194 WFPDAMNSWRYTGDNDVNWESVLYHIDIVAQTPGMAGPGHWNDADVLNIGRGWSTKTEDK 253
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMP----IIGNEEVIAVNQAI-----------NDQ 283
FS++ + +PL LTA P ++ N E IA++Q + +
Sbjct: 254 AMFSMYCMLTSPL--------LTATPFPSQLDVLTNAEAIAIDQDVAGIQGICVRTNGEL 305
Query: 284 QIWAGPL---SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTL-----ETKFQ 335
Q+W PL + N V L +R T ++ +TA W D+G +N L R L E FQ
Sbjct: 306 QVWCKPLGSANSNVKAVALFNRGTNAANITATWSDLGF--SNGLVSVRDLWSKVDEGNFQ 363
Query: 336 GKLTAKVDGHSCKMYEL 352
TA V GH K+ ++
Sbjct: 364 TNYTANVPGHGVKLMKM 380
>gi|242019902|ref|XP_002430397.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
gi|212515527|gb|EEB17659.1| alpha-N-acetylgalactosaminidase, putative [Pediculus humanus
corporis]
Length = 422
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 134/299 (44%), Gaps = 48/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSF 69
L+N L LTPPMGW++WN F C I+E + R+ D + D +S +
Sbjct: 24 LNNNLALTPPMGWSAWNRFRCNTDCKNFPKECISEWLFRDMAD---KMLEDGYLSAGYQY 80
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILISNS 116
+ D + N+ L + FPSG K L+DY+H
Sbjct: 81 ILIDDCWMSKRRDKNNR----LTPDAQRFPSGLKNLSDYIHEKGLKFGIYADYGKKTCEG 136
Query: 117 LDNRIESNIA------FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+E N+A E G+D++K D CF D + + + K GRP+ ++ C
Sbjct: 137 YPGTLEINMATDAKTFVEWGVDFIKLDGCFTDPLDMELGFINFGYWMWKMGRPMVYA-CS 195
Query: 171 WGDMRPAL-------WGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGW 219
W + + + + NSWR DI DSW+ + + + +++++ PG W
Sbjct: 196 WPIYQNYMGISPNYTLAAKVCNSWRVYKDIQDSWSSVRDVITYFGEFQDIFSNFGGPGHW 255
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPD L +GN G+ Y + V IW+I APL++ D+ N I+ N +I++NQ
Sbjct: 256 NDPDSLIIGNFGLSYEQAKVQMVIWSILSAPLIMSHDLRNAHPNFRNILLNTALISINQ 314
>gi|19922382|ref|NP_611119.1| CG7997, isoform A [Drosophila melanogaster]
gi|24654142|ref|NP_725571.1| CG7997, isoform B [Drosophila melanogaster]
gi|17862190|gb|AAL39572.1| LD13649p [Drosophila melanogaster]
gi|21627129|gb|AAF58008.2| CG7997, isoform A [Drosophila melanogaster]
gi|21627130|gb|AAM68519.1| CG7997, isoform B [Drosophila melanogaster]
gi|220960316|gb|ACL92694.1| CG7997-PA [synthetic construct]
Length = 417
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 139/319 (43%), Gaps = 51/319 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL L PPMGW SW F C + K+ D+C E L S D+ +
Sbjct: 24 LDNGLALKPPMGWMSWERFRCITDCKLY--PDECISEKLFQRHADLLVSEGYADAGYEYV 81
Query: 80 -------AKPLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLDN------- 119
K N Q LV ++ FP+G AL+D++H L + N
Sbjct: 82 IIDDCWLEKNRDNDTQ-KLVPDRKRFPNGLNALSDHIHNQGLKFGLYQDYGTNTCAGYPG 140
Query: 120 -----RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW--- 171
++++ + +DY+K D C+ + + Y L + GRP+ +S C W
Sbjct: 141 VIKHMKLDAQTFADWDVDYVKLDGCYANISDMASGYPEFGRLLNETGRPMVYS-CSWPAY 199
Query: 172 ----GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWNDPD 223
G+M N WR DDI DS ++ I D D H PG WNDPD
Sbjct: 200 QEDAGEMPDYESLKQHCNLWRNWDDIEDSLESLMQIIDYFAKNQDRIQPHGGPGHWNDPD 259
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
ML +GN G+ Y++ + +IWAI APL++ D+ + E I+ N VIAV+Q
Sbjct: 260 MLLLGNYGLSYDQSKLQMAIWAIMAAPLIMSNDLAAVRPEIKAILQNRAVIAVDQDELGI 319
Query: 279 ------AINDQQIWAGPLS 291
+ N ++W P++
Sbjct: 320 QGRRVLSRNQIEVWKRPIT 338
>gi|156551900|ref|XP_001606799.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Nasonia
vitripennis]
Length = 438
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 133/320 (41%), Gaps = 91/320 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 21 LENGLARTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMADIVVNEGYAAVGYEYINV 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD GNLV ++ FP G K+L++YVH
Sbjct: 81 DDCWLEKERD--------------------------INGNLVPDRERFPYGMKSLSNYVH 114
Query: 111 IL--------------------ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRY 150
+ ++N A++ +DY+K D C+ + Y
Sbjct: 115 SKGLKFGIYEDYGNYTCAGYPGVIGYMENDAAQFAAWD--VDYVKLDGCYAHPSEMDQGY 172
Query: 151 RAMSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLT 202
+ + GR + +S C W M+P + I+ N WR DDI DSWA + T
Sbjct: 173 PEFGFHMNQTGRQMIYS-CSWPVYQIYAGMQPNF-TAIIQHCNLWRNFDDIQDSWASVET 230
Query: 203 IADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
I D + +A PG WNDPDML VGN G+ Y + +IWAI APLL+ D+
Sbjct: 231 IIDYYGNNQDAIVPNAGPGHWNDPDMLIVGNFGLSYEQSKTQMAIWAILAAPLLMSVDLR 290
Query: 259 NLTAETMPIIGNEEVIAVNQ 278
+ E I+ N+++I V+Q
Sbjct: 291 TIRPEYKAILQNKKIIDVDQ 310
>gi|340619487|ref|YP_004737940.1| carbohydrate esterase / alpha-galactosidase [Zobellia
galactanivorans]
gi|339734284|emb|CAZ97661.1| Carbohydrate esterase, family CE6 / Alpha-galactosidase, family
GH27 [Zobellia galactanivorans]
Length = 866
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 154/347 (44%), Gaps = 53/347 (15%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIA 79
S+ L TPPMGWNSWN F I+EK I D S RD+ + + D A
Sbjct: 340 SDYLTPTPPMGWNSWNAFEKDIDEKKIMNMADIMVTSGMRDA----GYEYLVIDDAWMAA 395
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNSLDNRIE 122
+ N+ G LVA+ FP G KA+ DY+H + S ++
Sbjct: 396 ER---NEA-GQLVADPVKFPGGMKAIGDYIHSKGLKYGIYECRGDLTCQNLPGSFEHEQT 451
Query: 123 SNIAFEQ-GIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALW 179
+F G+DY+K D CF + + A+ RP+ S+ ++G A
Sbjct: 452 DMDSFASWGVDYIKLDACFAIKNGRLSSEDLDVYHQAIVHTRRPMVLSISDFGSGAWAWG 511
Query: 180 GSNIRNSWRTTDDISDSWARMLTIADMN----------EVYADHAKPGGWNDPDMLEVGN 229
G N WRT+ DI + + A+ + + A P WNDPDML+VGN
Sbjct: 512 GKNYGQLWRTSGDIYPTIRSVYNCANTSGGDGSIHPAFQGLWQFAGPDSWNDPDMLQVGN 571
Query: 230 GGMKYN-EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
+K E VHFS+W+I AP++ G D+ +T ET I+ EVIA+NQ Q
Sbjct: 572 --LKTTLEDKVHFSLWSILAAPIMAGNDLSKMTEETKKILLAAEVIAINQDARAHQGYKV 629
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
I+ PLS VL+ ++ + + +T ++ IG++ +
Sbjct: 630 FDKDSVEIYNKPLSDGTTAVLMLNKGSKKTDITVQFNTIGLQGKQKV 676
>gi|229366154|gb|ACQ58057.1| Alpha-N-acetylgalactosaminidase [Anoplopoma fimbria]
Length = 436
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 144/342 (42%), Gaps = 70/342 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRET-----DDCWGESSRDSQIS 64
L NGL TPPMGW +W F C I+ E + R+T +D W E + I
Sbjct: 19 LDNGLLKTPPMGWMAWERFRCDIDCKDDPENCISEDLFRDTADRLAEDGWKELGYEYIIV 78
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------------- 111
C M G L + FP G LA Y+H
Sbjct: 79 DDCWMSML------------RDEHGRLQPEPSRFPGGIAKLARYIHDRGLKLGIYADMGT 126
Query: 112 -----LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIF 165
+LD I++ G+DYLK+D C+++ + Y MS AL GRP+
Sbjct: 127 HTCMGFPGTTLDKIEIDAQTFASWGVDYLKFDGCYSNPVEQILGYPRMSKALNATGRPMA 186
Query: 166 FSMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADH----A 214
+S C G + P + + +I + WR DI DSW + I + D A
Sbjct: 187 YS-CSLPVYVGGLPPNVNYSFLGDICHLWRNYYDIQDSWDSVQGIIEWFSNNQDDLQPAA 245
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPDML +GN G+ ++ ++WAI APL++ D+ NL I+ N+ I
Sbjct: 246 GPGRWNDPDMLIIGNFGLSVDQARSQMALWAIMAAPLIMSNDLRNLDNSARAILQNKVAI 305
Query: 275 AVN------------QAINDQQIWAGPLSGNRIVVLLESRKT 304
A+N Q N +++ PLS + ++ SR+T
Sbjct: 306 AINQDPMGIQGRRLLQEKNRIEVYWRPLSHSASALVFLSRRT 347
>gi|340722731|ref|XP_003399756.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Bombus terrestris]
Length = 439
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 136/341 (39%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 21 LENGLVKTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMADIVVAEGYAEVGYEYINV 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E R G LV ++ FP G K+LA+Y+H
Sbjct: 81 DDCWLEKDR--------------------------SVNGQLVPDRQRFPYGMKSLANYIH 114
Query: 111 -----ILISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPTIRYRA 152
I N + G +DY+K D C++ + Y
Sbjct: 115 SKGLKFGIYEDFGNYTCAGYPGILGYLETDALTFASWDVDYVKLDGCYSHPSEMDRGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD 205
L + GRP+ +S C W M+P + N WR DDI DSW + TI D
Sbjct: 175 FGFYLNQTGRPMVYS-CSWPVYQIYAGMQPNFTAITEHCNLWRNFDDIQDSWNSLETIID 233
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ +A PG WNDPDML +GN G+ Y + ++WAI APLL+ D+ +
Sbjct: 234 YYGNNQDAIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAILAAPLLMSVDLRTIR 293
Query: 262 AETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
E I+ N ++IAV+Q I + +IWA P++
Sbjct: 294 PEYKAILQNRKIIAVDQDPLGIQGRRIYKHKGIEIWARPIT 334
>gi|427782437|gb|JAA56670.1| Putative alpha-d-galactosidase melibiase [Rhipicephalus pulchellus]
Length = 528
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 130/317 (41%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRE-------------------T 50
L NGL TPPMGW +W F C I+ E++ +E
Sbjct: 46 LENGLARTPPMGWLAWERFQCNIDCSREPENCISERLFKEMAEALVYEGYRDVGYEYVNI 105
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD G L AN T FP+G K +AD++H
Sbjct: 106 DDCWMNKERD--------------------------VVGRLQANITRFPNGIKHIADFMH 139
Query: 111 -----ILISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPTIRYRA 152
+ I ++ + + A G +D +K D C+ + Y
Sbjct: 140 ARGLKLGIYGNVGTKTCAGYAGSLGNLYTDAQTFADWEVDMVKMDGCYANIQDYPRLYTD 199
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGS-NIR------NSWRTTDDISDSWARMLTIAD 205
DA+ + GRP+ +S C W + S N + N WR DI+D+W + ++ D
Sbjct: 200 FGDAINRTGRPMVYS-CSWPAYEVSYGASPNYKLIGHHCNLWRNYVDIADTWQSVESVID 258
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
A PG WNDPDML +GN G+ Y++ +IWAI APLL+ D+ +
Sbjct: 259 YYAANQNALTAAAAPGRWNDPDMLVIGNFGLSYDQSRAQMAIWAIMAAPLLMSNDLRRMR 318
Query: 262 AETMPIIGNEEVIAVNQ 278
E I+ N +IAVNQ
Sbjct: 319 PEFKQILQNRAIIAVNQ 335
>gi|443695656|gb|ELT96522.1| hypothetical protein CAPTEDRAFT_183492 [Capitella teleta]
Length = 408
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 154/344 (44%), Gaps = 61/344 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGE------SSRDSQISLSCSFHMFD 73
L NGL LTPPMGW SW F C N + + D+C GE + R +Q + + +
Sbjct: 19 LDNGLALTPPMGWLSWERFECNTNCR--EDPDNCIGERLVMQMADRMAQDGFLAAGYEYI 76
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNS 116
+L +GN++ + FP G KALADYVH L S
Sbjct: 77 ALDD-CWPARDRDPKGNILPDPERFPHGMKALADYVHSLGLKLGLYADVGRHTCAGFPGS 135
Query: 117 LDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-DM 174
LD+ +SN E G+D +K+D C D+ I Y L K RPI +S CEW
Sbjct: 136 LDHYEQDSNTFAEWGVDMVKFDGCNTDEQHFEIGYPLFGFYLNKTRRPIMYS-CEWALYA 194
Query: 175 RPALWGSNIR------NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDM 224
R + +N N++R DI D++ + +I AD ++ A PG +ND DM
Sbjct: 195 RAKGFKANYTAVAETCNTFRVYGDIWDNYESIQSISKWYADDEGNFSAVAAPGSFNDADM 254
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
L +GN G+ + WA+ +PLL+ D+ + + ++ N+ V+ +NQ
Sbjct: 255 LVIGNYGLSKDGAKSQMGFWAMVASPLLMSVDLRTIDTFSKELLQNKRVLKINQDPLGVQ 314
Query: 279 --AI-----NDQQIWAGPLSGN-----RIVVLLES----RKTFS 306
AI + Q+W PLS IV LL+ R +FS
Sbjct: 315 GKAIIQFLDGNLQVWTRPLSARGSFAIAIVYLLQVGQPLRTSFS 358
>gi|323456868|gb|EGB12734.1| hypothetical protein AURANDRAFT_2775, partial [Aureococcus
anophagefferens]
Length = 297
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 144/301 (47%), Gaps = 22/301 (7%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWG-ESSRDSQISLSCSFHMFDSLTQI 78
L NG+GLTPPMG+N + + + + + + G ++ + I+ + + T
Sbjct: 1 LDNGVGLTPPMGFNDYQTGLGVADLEAVADAFESLGLRAAGYAFINSDAGWQSGRNATSG 60
Query: 79 AAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL--ISNSLDNRIESNIAFEQGIDYLKY 136
A P PN F G + V +++ + Y+KY
Sbjct: 61 A--PTPNMASLAAALGARGFGLGLYSALSSVQCGGDPGGLYHEAVDAAAYAAWNVSYVKY 118
Query: 137 DNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDS 196
DNC PT R M AL GR + FS E D+ P L +++ N WRTT D++
Sbjct: 119 DNCAEYALEPTARVLPMRRALNATGRRVVFS-TEPFDLAPNLR-AHLANLWRTTTDVAAD 176
Query: 197 WARMLTIADMNEVYADHAKPGGWNDPDMLEVGN-GGMKYNEYVVHFSIWAISKAPLLLGC 255
++ D+N+ +A+ A PGG+NDPDML+VG GG ++ V +F +WA++KAPLLL
Sbjct: 177 AGKVYVNIDLNDKWAEFAGPGGFNDPDMLQVGKLGGNGGDDDVSNFVLWAVAKAPLLLSV 236
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ---AINDQQI-----------WAGPLSGNRIVVLLES 301
+V L + + N +++AVNQ + +++ WAGPL+G V+L +
Sbjct: 237 NVSTLAPRLLALAKNAKILAVNQDALGVQGRKVAVDGAATLLDAWAGPLAGGAFAVVLHN 296
Query: 302 R 302
R
Sbjct: 297 R 297
>gi|195402577|ref|XP_002059881.1| GJ14995 [Drosophila virilis]
gi|194140747|gb|EDW57218.1| GJ14995 [Drosophila virilis]
Length = 419
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 130/321 (40%), Gaps = 92/321 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL L PPMGW SW F C I+E++ R
Sbjct: 25 LDNGLALKPPMGWMSWQRFRCITDCKTYPDECISEQLFRRHADLIVSEGYADVGYEYVII 84
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD++ + LV ++ FP+G LAD++H
Sbjct: 85 DDCWLEKNRDNKTN-------------------------KLVPDRKRFPNGLNVLADHIH 119
Query: 111 ----------------------ILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTI 148
++ +LD +N +DY+K D C+ + +
Sbjct: 120 ERGLKFGLYQDFGTNTCAGYPGVINHMALDAATFAN----WDVDYVKLDGCYANISDMAA 175
Query: 149 RYRAMSDALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARML 201
Y L GRP+ +S C W G M N WR DDI DS ++
Sbjct: 176 GYPEFGRLLNSTGRPMVYS-CSWPAYQSEVGQMPEYESLKKHCNLWRNWDDIDDSLESLM 234
Query: 202 TIADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDV 257
I D + HA PG WNDPDML +GN G+ Y++ + +IWA+ APL++ D+
Sbjct: 235 QIIDYFGKNQDSIQPHAGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWAVLAAPLIMSNDL 294
Query: 258 GNLTAETMPIIGNEEVIAVNQ 278
+ E I+ N VIAVNQ
Sbjct: 295 AKVRPEIKDILQNRAVIAVNQ 315
>gi|345015173|ref|YP_004817527.1| glycoside hydrolase clan GH-D [Streptomyces violaceusniger Tu 4113]
gi|344041522|gb|AEM87247.1| glycoside hydrolase clan GH-D [Streptomyces violaceusniger Tu 4113]
Length = 554
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 148/344 (43%), Gaps = 62/344 (18%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN QI+ I+ D S + D + +
Sbjct: 73 PMGWASWNSLASQIDYGTIKAQVDALVSSG---MAQAGYEYVNIDEGWWWGTR----DRK 125
Query: 89 GNLVANKTTFPSGTKALADYVH----------------------------ILISNSLDNR 120
GN+ + +P G KA+ADY+H + +
Sbjct: 126 GNITVDSQQWPGGMKAIADYIHSKGLKAGIYTDAGKNGCGYFYPTPPGTPPAPGSGSEGH 185
Query: 121 IESNI-AFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKA----GRPIFFSMCEWG 172
E ++ AF++ G DY+K D C D YR++S A K A GR + S+CEWG
Sbjct: 186 YEQDLRAFQEWGFDYVKVDWCGGQDEGLDQEATYRSLSQANKAATAVTGRRLVLSLCEWG 245
Query: 173 DMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
RP W + + WRT+ DI + S ML+ D ++ G +NDPDML V
Sbjct: 246 SGRPWNWAPGVGDLWRTSTDIIYWGQTPSATAMLSNFDQG-LHPTAQHTGYYNDPDMLTV 304
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
G G H S+WAIS APLL G D+ +T ET + N EVIAV+Q
Sbjct: 305 GMKGSNATLDRTHMSLWAISGAPLLAGNDLTTMTRETAANLTNPEVIAVDQDPRGLQGVK 364
Query: 279 AINDQ---QIWAGPLSG-NRIVVLLESRKTFSSMMTAHWDDIGI 318
D Q++A LSG + VLL +R + + +TA W D+G+
Sbjct: 365 VAEDAAGLQVYAKVLSGTGKRAVLLLNRTSAAKPVTARWTDLGL 408
>gi|328697368|ref|XP_001951816.2| PREDICTED: alpha-N-acetylgalactosaminidase-like [Acyrthosiphon
pisum]
Length = 385
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 153/343 (44%), Gaps = 52/343 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGE----SSRDSQISLSCSFHMFDSL 75
L NGL TPPMGW +W F C + K + D+C E + D IS + ++ +
Sbjct: 22 LENGLARTPPMGWLAWERFRCNTDCK--NDPDNCISEKLFRTMTDLVISEGYAAAGYEYI 79
Query: 76 -TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN---------- 119
G L + FP G L++Y+H I N
Sbjct: 80 NVDDCWLDFARTYDGRLQPDAKRFPRGMADLSEYIHSRGLKFGIYEDYGNFTCAGYPGIL 139
Query: 120 ---RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW----- 171
+++ E +D++K D C++ Y L K GR + +S C W
Sbjct: 140 GSLEVDAFTFAEWNVDFVKLDGCYSLPKDMDQGYSEFGYHLNKTGRAMVYS-CSWPVYQT 198
Query: 172 -GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDML 225
++P R N WR DDI DSWA + +I D +V A +A PG WNDPDML
Sbjct: 199 YAGLQPNYSAITSRCNLWRNFDDIQDSWASVESIIDYYGDNQDVIAANAAPGHWNDPDML 258
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQI 285
+GN G+ Y + V +IWAI APLL+ D+ + E I+ N+++IA++Q D+
Sbjct: 259 IIGNFGLSYEQSKVQMAIWAILAAPLLMSTDLRTIRPEYKAILQNKKIIAIDQ---DKL- 314
Query: 286 WAGPLSGNRIVVLLESRKTFSSMMTAHWDD--IGIKSNNTLFE 326
+ G RI K S++ +H D+ I IKS++ L +
Sbjct: 315 ---GIQGRRIY------KNGDSVVNSHRDNFTIFIKSHDNLLK 348
>gi|373460855|ref|ZP_09552605.1| hypothetical protein HMPREF9944_00869 [Prevotella maculosa OT 289]
gi|371954680|gb|EHO72489.1| hypothetical protein HMPREF9944_00869 [Prevotella maculosa OT 289]
Length = 549
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 133/283 (46%), Gaps = 43/283 (15%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN FH I++ +IR + D+ + L + +
Sbjct: 31 PIMGWSSWNTFHVNISDSLIR--------TQADAMVRLGLKDVGYTHINIDDGFFGWRDA 82
Query: 88 QGNLVANKTTFPSGTKALADYVHI------LISNSLDNRI-------------------- 121
G + + FP+G K +ADY+H + S++ N
Sbjct: 83 SGEMHPHPQRFPNGLKVVADYIHSKGLMAGIYSDAGANTCGSKYDADDHGFGAGLYGHEE 142
Query: 122 -ESNIAFEQ-GIDYLKYDNCFND---DTRPTIRYRAMSDALKK-AGRPIFFSMCEWGDMR 175
++ + F G D++K D C D RY A+ +A+KK AG P+ ++C W
Sbjct: 143 QDARLYFRDWGFDFIKIDYCGAGTWLDLEEQKRYTAVCEAIKKEAGHPVSINICRWA--F 200
Query: 176 PALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN 235
P +W NI SWR + DI +W+ + I + N + A G +ND DMLE+G G+K +
Sbjct: 201 PGVWAKNIATSWRISRDILPNWSSIKYIIEKNMYLSAFAGDGHYNDMDMLEIGR-GLKPS 259
Query: 236 EYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
E VH +W + +P+L+GCD+ L ++ ++ N E++A+NQ
Sbjct: 260 EEEVHVGMWCMMSSPMLIGCDLNKLNESSLALLKNRELLAINQ 302
>gi|358339800|dbj|GAA36869.2| alpha-galactosidase [Clonorchis sinensis]
Length = 604
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 131/321 (40%), Gaps = 90/321 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL TPPMGW SW F CQ I+EK+I+ T
Sbjct: 25 LENGLVRTPPMGWMSWERFRCQTNCVEYPDDCISEKLIKRTVDRMISDGWKEVGYEYIIT 84
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD TQ +VA++ FPSG + L +Y+H
Sbjct: 85 DDCWPEKERDRN-------------TQ------------EIVADRKRFPSGIEGLGEYIH 119
Query: 111 ---ILISNSLDN---------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ LD I+S + +DY+K D C + Y
Sbjct: 120 DHGLKFGIYLDYGTLTCAGYPGSMNFLEIDSKSLAKWKVDYVKVDGCNSPVEAMADGYEK 179
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR-----------NSWRTTDDISDSWARML 201
L + GRPI +S C + P W SN N WR DI DSW +L
Sbjct: 180 FGRLLNETGRPIVYS-CSYPAYIP--WRSNPGRLDWGRLKTNCNLWRMFGDIDDSWDSVL 236
Query: 202 TIA----DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDV 257
TI D A PG WNDPDML +GN G+ ++ V +W + APLL+ D+
Sbjct: 237 TIINLMRDTQSTLQPIAGPGHWNDPDMLVIGNFGLSLDQERVQMGMWCLFAAPLLISADM 296
Query: 258 GNLTAETMPIIGNEEVIAVNQ 278
++ ++ I+ N +I++NQ
Sbjct: 297 DHMNPDSAEILKNPLLISINQ 317
>gi|195119798|ref|XP_002004416.1| GI19628 [Drosophila mojavensis]
gi|193909484|gb|EDW08351.1| GI19628 [Drosophila mojavensis]
Length = 417
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 135/300 (45%), Gaps = 50/300 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL LTPPMGW SW F C I+E++ R +
Sbjct: 24 LDNGLALTPPMGWMSWQRFRCITDCDTYPDECISEQLFRRSADLLVSEGYADVGYEYVII 83
Query: 51 DDCWGESSRDSQIS-LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYV 109
DDCW E +RD+ + L F + A + N+ G F GTK A Y
Sbjct: 84 DDCWLEWNRDNMTNKLVPDRKRFPRGLNVLADHIHNK--GLKFGLYQDF--GTKTCAGYP 139
Query: 110 HILISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMC 169
++ +LD + + +DY+K D C + + Y L + GRP+ +S C
Sbjct: 140 GVIDHMALD----AETFAKWDVDYVKLDGCNANVSDMAAGYPEFGRLLNETGRPMVYS-C 194
Query: 170 EWGDMRPAL-----WGSNIR--NSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGG 218
W + + S R N WR DDI DS+ ++ I D D HA PG
Sbjct: 195 SWPAYQSEFGEQPHFESLKRHCNLWRNWDDIDDSFESVMEIVDYFAKNQDSMQPHAGPGH 254
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WNDPDML +GN G+ Y++ V +IWA+ APL++ D+ + E I+ N +VIAVNQ
Sbjct: 255 WNDPDMLILGNYGLSYDQSKVQMAIWAVLAAPLIMSNDLAKVRPEIKEILQNRDVIAVNQ 314
>gi|307213390|gb|EFN88826.1| Alpha-N-acetylgalactosaminidase [Harpegnathos saltator]
Length = 1392
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 142/342 (41%), Gaps = 98/342 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ +
Sbjct: 902 LENGLARTPPMGWLAWERFRCNTDCKNDPDNCISDRLFKTMADIVVAEGYAAVGYNYINV 961
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD G LV ++ FP G + LADYVH
Sbjct: 962 DDCWLEKDRD--------------------------FNGQLVPDRQRFPYGMRNLADYVH 995
Query: 111 -----ILISNSLDNR-----------IESNIAFEQG--IDYLKYDNCFNDDTRPTIRYRA 152
I N +E++ A +DY+K D C++ + Y
Sbjct: 996 SKGLKFGIYEDYGNYTCAGYPGVLGYLETDAATFASWDVDYVKLDGCYSHPSDMDRGYPE 1055
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWGSNIR--NSWRTTDDISDSWARMLTIA 204
L + G+ + +S C W M+P + + I N WR DDI DSWA + +I
Sbjct: 1056 FGFHLNQTGKSMVYS-CSWPVYQIYAGMQPN-YTAIIEHCNLWRNFDDIQDSWASVESII 1113
Query: 205 DM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D + +A PG WNDPDML +GN G+ Y + ++WAI APLL+ D+ +
Sbjct: 1114 DYYGNNQDAIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAILAAPLLMSVDLRTI 1173
Query: 261 TAETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
E I+ N+++IAV+Q I + +IWA P++
Sbjct: 1174 RPEYKAILQNKKIIAVDQDPLGIQGRRIYKHKGIEIWARPIT 1215
>gi|423305441|ref|ZP_17283440.1| hypothetical protein HMPREF1072_02380 [Bacteroides uniformis
CL03T00C23]
gi|423311258|ref|ZP_17289227.1| hypothetical protein HMPREF1073_03977 [Bacteroides uniformis
CL03T12C37]
gi|392679305|gb|EIY72690.1| hypothetical protein HMPREF1073_03977 [Bacteroides uniformis
CL03T12C37]
gi|392681142|gb|EIY74503.1| hypothetical protein HMPREF1072_02380 [Bacteroides uniformis
CL03T00C23]
Length = 655
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 151/341 (44%), Gaps = 65/341 (19%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V+ A + G+ ++ + + + + LTP MGWNSWN + +N++ +R+ E
Sbjct: 245 YKVTLKATNDKGTAQKEVTVKIGDAIALTPSMGWNSWNCWGLSVNDEKVRDAARMMNEKL 304
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ + D + A++ QG +++N FP KAL DY+H L
Sbjct: 305 H----AYGWEYVNIDDGWEAASR----TKQGEILSNDK-FPD-FKALTDYIHGLGLKFGI 354
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
+ + I++ G+DYLKYD C D TIR Y M
Sbjct: 355 YSSPGHITCGGHVGSYQHEEIDAKTWERWGVDYLKYDYCGYLEIEKDSEEKTIREPYIVM 414
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWG---SNIRNSWRTTDDISDSWARMLTIADMNEVY 210
AL K R I + + G P +W N WRTT DI+D W + I +V
Sbjct: 415 RKALDKVNRDIVYCV---GYGAPNVWNWAPEAGGNQWRTTRDITDEWNVVTAIGTFQDVC 471
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKY-----------NEYVVHFSIWAISKAPLLLGCDVGN 259
AD PG NDPDML VG G + +E H S+W + +PLL+GCD+ N
Sbjct: 472 ADATAPGRNNDPDMLVVGKLGQGWGSKVHDSYLTADEQYSHISLWCLLSSPLLIGCDMAN 531
Query: 260 LTAETMPIIGNEEVIAVNQ----------AINDQQIWAGPL 290
+ T+ ++ N EVIAV+Q + + Q+W+ L
Sbjct: 532 MDDFTLNLLTNNEVIAVSQDPMVAPAKKMMVENGQVWSKKL 572
>gi|195488215|ref|XP_002092220.1| GE14068 [Drosophila yakuba]
gi|194178321|gb|EDW91932.1| GE14068 [Drosophila yakuba]
Length = 417
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/319 (29%), Positives = 139/319 (43%), Gaps = 51/319 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL L PPMGW SW F C + K+ D+C E L S D+ +
Sbjct: 24 LDNGLALKPPMGWMSWERFRCITDCKLY--PDECISEKLFQRHADLLVSEGYADAGYEYV 81
Query: 80 -------AKPLPNQCQGNLVANKTTFPSGTKALADYVH------ILISNSLDN------- 119
K N Q LV ++ FP+G AL+D++H L + N
Sbjct: 82 IIDDCWLEKNRDNDTQ-KLVPDRKRFPNGLNALSDHIHNQGLKFGLYQDYGTNTCAGYPG 140
Query: 120 -----RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW--- 171
++++ + +DY+K D C+ + + Y L + GRP+ +S C W
Sbjct: 141 VIKHMKLDAQTFADWDVDYVKLDGCYANISDMATGYPEFGRLLNETGRPMVYS-CSWPAY 199
Query: 172 ----GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD----HAKPGGWNDPD 223
G+M N WR DDI DS ++ I D D H PG WNDPD
Sbjct: 200 QEDAGEMPDYESLKQHCNLWRNWDDIDDSLESLMQIMDYFAKNQDRIQPHGGPGHWNDPD 259
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----- 278
ML +GN G+ Y++ + +IW+I APL++ D+ + E I+ N VIAV+Q
Sbjct: 260 MLLLGNYGLSYDQSKLQMAIWSIMAAPLIMSNDLAAVRPEIKDILQNRAVIAVDQDELGI 319
Query: 279 ------AINDQQIWAGPLS 291
+ N ++W P++
Sbjct: 320 QGRRVLSRNQIEVWKRPIT 338
>gi|167536212|ref|XP_001749778.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771705|gb|EDQ85367.1| predicted protein [Monosiga brevicollis MX1]
Length = 890
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 137/310 (44%), Gaps = 54/310 (17%)
Query: 24 LGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSFHMFD 73
L LTPPMGW SW F C+ IN+K+ + D E + + + H+ D
Sbjct: 26 LSLTPPMGWMSWEIFRCETDCASDPDNCINDKLYMTSADALAEGGYLA--AGYDTIHIDD 83
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNS 116
+ PN LV N T FP G KA+ADYVH S
Sbjct: 84 CWERQTPTRDPNT--DRLVPNATRFPDGMKAVADYVHGKGVKFGLYTAESSKTCGGYPAS 141
Query: 117 LDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMR 175
D+ + F E G+DYLK D C D + Y M AL+ R I +S C W
Sbjct: 142 KDHEVLDASTFAEWGVDYLKVDGC-GDTSYYNEGYPRMGSALQNQSRDIVYS-CSW---- 195
Query: 176 PALWGSNIR------------NSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGW 219
PA GS+ N WR +DI +WA + +I D E +A PG W
Sbjct: 196 PAYLGSDETTKPYERFIEIGCNLWRNWNDIQCNWASLSSIIDHWGQYGETLRKYAGPGHW 255
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA 279
NDPDML +GN + +E +IW+I APL++G DV N++AE I+ N++ IA
Sbjct: 256 NDPDMLLIGNDCINADEARTQMAIWSIVAAPLIMGNDVRNISAEHRNILLNKDAIASGGR 315
Query: 280 INDQQIWAGP 289
+ + GP
Sbjct: 316 LATCLVPLGP 325
>gi|380478648|emb|CCF43480.1| beta-L-arabinopyranosidase/alfa-D-galactopyranosidase, partial
[Colletotrichum higginsianum]
Length = 333
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 132/289 (45%), Gaps = 45/289 (15%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGWNS+N+++C NE II++ + + + + + ++T P ++
Sbjct: 29 TPQMGWNSYNYYNCYPNETIIKQ--------NAHALVDTGLAEAGYTTVTTDCGWPXKDR 80
Query: 87 C-QGNLVANKTTFPSGT-KALADYVHIL-------------------ISNSLDNRIESNI 125
G LV N FPSG K L DY+H L SL++ +
Sbjct: 81 AADGQLVWNPELFPSGGGKELGDYIHNLGLKYGVYSGGGYFQCGSTDQPASLNHELTDAK 140
Query: 126 AFEQ-GIDYLKYDNCF-----------NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG- 172
+F G D LKYDNC+ + D R+ M+DAL R I + +C+WG
Sbjct: 141 SFAAWGADSLKYDNCYAVKPDVMVDFVHPDAVSPDRFETMADALNTTDRDILYQVCQWGT 200
Query: 173 DMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
+W + NSWR ++DI + W + IA+ + H PG + D DML +G
Sbjct: 201 GTDLGIWAPKLGGNSWRISNDIYNGWRSIWRIANQVVPFYKHTGPGAFPDMDMLLIGLNA 260
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDV--GNLTAETMPIIGNEEVIAVNQ 278
+ E H +WAI+K+PL LG G + I+ N EVIA+NQ
Sbjct: 261 LSIEEEKFHMGMWAINKSPLTLGAPAIPGLVPDSAHAILTNREVIALNQ 309
>gi|193617883|ref|XP_001946391.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Acyrthosiphon
pisum]
Length = 442
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 130/298 (43%), Gaps = 47/298 (15%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCS----------F 69
L NGL TPPMGW +W F C + K + D+C E + L S
Sbjct: 22 LENGLARTPPMGWLAWERFRCNTDCK--NDPDNCISEKLFRTMTDLVISEGYAAAGYEYI 79
Query: 70 HMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDN----- 119
++ D A G L + FP G L++Y+H I N
Sbjct: 80 NVDDCWLDFA-----RTYDGRLQPDAKRFPRGMADLSEYIHSRGLKFGIYEDYGNFTCAG 134
Query: 120 --------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
+++ E +D++K D C++ Y L K GR + +S C W
Sbjct: 135 YPGILGSLEVDAFTFAEWNVDFVKLDGCYSLPKDMDQGYSEFGYHLNKTGRAMVYS-CSW 193
Query: 172 ------GDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWN 220
++P R N WR DDI DSWA + +I D +V A +A PG WN
Sbjct: 194 PVYQTYAGLQPNYSAITSRCNLWRNFDDIQDSWASVESIIDYYGDNQDVIAANAAPGHWN 253
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
DPDML +GN G+ Y + V +IWAI APLL+ D+ + E I+ N+++IA++Q
Sbjct: 254 DPDMLIIGNFGLSYEQSKVQMAIWAILAAPLLMSTDLRTIRPEYKAILQNKKIIAIDQ 311
>gi|377562096|ref|ZP_09791511.1| putative glycoside hydrolase [Gordonia otitidis NBRC 100426]
gi|377520766|dbj|GAB36676.1| putative glycoside hydrolase [Gordonia otitidis NBRC 100426]
Length = 409
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/406 (24%), Positives = 163/406 (40%), Gaps = 103/406 (25%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQ 62
GL TPPMGWNSWN F C + EK++RE DDCW +RD+
Sbjct: 31 GGLPATPPMGWNSWNTFGCGVTEKVVREQADALVSSGLRDAGYRYVVVDDCWAAPTRDAS 90
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDN 119
G L A+ FPSG +L Y+H +L
Sbjct: 91 --------------------------GRLQADPVRFPSGMASLGAYLHERGLLFGIYSGA 124
Query: 120 RIESNIAFEQ----------------------GIDYLKYDNCFNDDTR--PTIRYRAMSD 155
R ++ F+ G+DYLKYD C +D + AM D
Sbjct: 125 RDKTCTQFQGVYPGATGSGGHEELDATTFAAWGVDYLKYDWCSSDSDHDDQVSSFTAMRD 184
Query: 156 ALKKAGRPIFFSMCEWGDMRPAL------WGSNIRNSWRTTDDISDSWAR---------M 200
AL+ RPI +S+ + ++ WG + R T+DI+ +W+ +
Sbjct: 185 ALRDTRRPIVYSINPNSGVSGSVPGAEFDWGGTATMT-RVTNDITPAWSTPGGPSGLQGI 243
Query: 201 LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
+ I D +PG + DPDML VG G + + S+WA+ APL+ G D+ +
Sbjct: 244 VNIIDAAGPLTSRVQPGSFLDPDMLVVGMGRLTAAQQRTQMSMWAMMAAPLITGNDLTTM 303
Query: 261 TAETMPIIGNEEVIAVNQ---------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTA 311
+ ET ++ N + ++Q +D ++W+ + +V+ L +R ++
Sbjct: 304 SQETRDLLRNATAVDIDQDSRVVAGAMVDDDPEVWSRAIGDKGLVISLTNRADHPRTLSV 363
Query: 312 HWDDIGIKSNNTL-----FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+G+ + ++ + R+ T G+L+ V H M E+
Sbjct: 364 PLGSVGLVGDASVTGVDAWTGRSY-TAQHGELSVPVGVHDTVMLEI 408
>gi|195156874|ref|XP_002019321.1| GL12300 [Drosophila persimilis]
gi|194115912|gb|EDW37955.1| GL12300 [Drosophila persimilis]
Length = 424
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 144/318 (45%), Gaps = 52/318 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES--SRDSQISLSCSF----HMFD 73
L NGL TPPMGW + F C ++ I DC ES R + + +S + +
Sbjct: 27 LENGLAQTPPMGWMPFERFQCTVD--CIHFPRDCISESLFKRTADLLVSEGYAAVGFQYL 84
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-------- 120
+ + + + L+A++ FPSG AL+ Y+H I + + +
Sbjct: 85 IIDDCWMERIRDTSTQKLLADRKRFPSGMSALSRYIHGRGLKFGIYHDVGQQTCMFRGPG 144
Query: 121 ------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-- 172
+++ + G DY+K D C+ ++ Y A A+ + GR + +S C W
Sbjct: 145 ARGHFELDAQTFADWGADYVKLDGCYATESERNRGYPAFGKAMNRTGRTMVYS-CSWPFY 203
Query: 173 ----DMRPALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDM 224
D R N+ WR +DI+DS+ + I ++ E+ HA PG WNDPDM
Sbjct: 204 TEKPDYRLIAKHCNL---WRFAEDITDSYTSVFRIMELYRRNQELLLAHAGPGRWNDPDM 260
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
L +GN + Y+ + +IW++ APL++ D+ + E ++ N EVIAVNQ
Sbjct: 261 LVLGNFRLSYDASRLQLAIWSVIAAPLIMTNDLETVRPEIKELLQNREVIAVNQDLLGMP 320
Query: 279 -----AINDQQIWAGPLS 291
A Q+W P++
Sbjct: 321 GKCVRASKYFQVWVRPVT 338
>gi|350424081|ref|XP_003493682.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Bombus impatiens]
Length = 439
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 135/341 (39%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 21 LENGLVKTPPMGWLAWERFRCNTDCKNDPDNCISDRLFRTMADIVVAEGYAEVGYEYINV 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E R G LV ++ FP G K LA+Y+H
Sbjct: 81 DDCWLEKDR--------------------------SVNGQLVPDRQRFPYGMKNLANYIH 114
Query: 111 -----ILISNSLDNRIESNIAFEQG-------------IDYLKYDNCFNDDTRPTIRYRA 152
I N + G +DY+K D C++ + Y
Sbjct: 115 SKGLKFGIYEDFGNYTCAGYPGILGYLETDALTFASWDVDYVKLDGCYSHPSEMDRGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD 205
L + GRP+ +S C W M+P + N WR DDI DSW + TI D
Sbjct: 175 FGFYLNQTGRPMVYS-CSWPVYQIYAGMQPNFTAITEHCNLWRNFDDIQDSWNSLETIID 233
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ +A PG WNDPDML +GN G+ Y + ++WAI APLL+ D+ +
Sbjct: 234 YYGNNQDAIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAILAAPLLMSVDLRTIR 293
Query: 262 AETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
E I+ N ++IAV+Q I + +IWA P++
Sbjct: 294 PEYKAILQNRKIIAVDQDPLGIQGRRIYKHKGIEIWARPIT 334
>gi|227536734|ref|ZP_03966783.1| possible alpha-galactosidase precursor [Sphingobacterium
spiritivorum ATCC 33300]
gi|227243426|gb|EEI93441.1| possible alpha-galactosidase precursor [Sphingobacterium
spiritivorum ATCC 33300]
Length = 388
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 133/282 (47%), Gaps = 41/282 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
P MGW+SWN F I+EK+I+E D+ IS + +
Sbjct: 28 APIMGWSSWNSFRINIDEKLIKE--------QADALISSGLYKAGYRYINVDDGYFGGRD 79
Query: 87 CQGNLVANKTTFPSGTKALADYVH---------------------------ILISNSLDN 119
G L + T FP+G A+A YVH I +
Sbjct: 80 KNGKLYVDSTKFPNGMGAIAAYVHSKGLKAGLYSEGGKNTCGSIWDNDTKGIGVGMYGHE 139
Query: 120 RIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ ++ + F++ D++K D C + + +Y + A+K+A F++C W P
Sbjct: 140 KQDAELFFKEWNFDFIKVDWCGGQEMKLNEEEQYTKIVHAVKEAKPDAGFNLCRW--QFP 197
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
W + +SWR + DI +++A +L I D+N ++ PG +ND DML+VG GM Y E
Sbjct: 198 GEWALKLVDSWRISGDIRNNFASVLEIIDLNRNLYKYSSPGHYNDMDMLQVGR-GMSYEE 256
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
HFS+W + +PL+ G D+ ++ +T+ I+ N+E+IA+NQ
Sbjct: 257 DKTHFSMWCMLNSPLMAGNDLRTISEQTIEILTNKELIALNQ 298
>gi|357477641|ref|XP_003609106.1| Potassium channel, partial [Medicago truncatula]
gi|355510161|gb|AES91303.1| Potassium channel, partial [Medicago truncatula]
Length = 150
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/149 (47%), Positives = 92/149 (61%), Gaps = 22/149 (14%)
Query: 225 LEVGNGGM-KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI--- 280
L VGNGG K + + +FS+WA PLLLGCDV N+T ETM I+ N+EVIAVNQ +
Sbjct: 5 LRVGNGGQTKKRDILFNFSLWA----PLLLGCDVRNVTKETMEIVSNKEVIAVNQDLLGV 60
Query: 281 --------NDQQIWAGPLSGNRIVVLLESRKTFSSM-MTAHWDDIGIKSNNT-----LFE 326
D +IWAGPLSG R+ V+ +R + +TA+W+DIGI L+E
Sbjct: 61 QAKKVRMEGDAEIWAGPLSGYRVAVVFLNRGPQKHIDITANWNDIGIPPKTVVQARDLWE 120
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTPV 355
H+TL+T F KL A V+ H+CKMY L PV
Sbjct: 121 HKTLKTPFVNKLRATVESHACKMYVLKPV 149
>gi|347831055|emb|CCD46752.1| glycoside hydrolase family 27 protein [Botryotinia fuckeliana]
Length = 475
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 113/219 (51%), Gaps = 19/219 (8%)
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYAD 212
M++AL +PI F +C+WG P+ W + N+WR T+DI S+ + I +
Sbjct: 1 MTNALATLSKPILFQICDWGVDFPSTWAPAMGNTWRITNDIIPSYPTVSRILNQAVPQTS 60
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC----DVGNLTAETMPII 268
A PG W D DMLEVGN E HFS+WAI K+PL++G + ++++ ++ I+
Sbjct: 61 FAGPGHWLDLDMLEVGNNIFTTAEEQTHFSLWAILKSPLVIGAALKDSLTSISSASLAIL 120
Query: 269 GNEEVIAVNQ-------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDD 315
N++VI+ NQ + ++WAGPLSG R VV + + + + +T ++ D
Sbjct: 121 SNKDVISYNQDSLGVAASFRRRWTTDGYEVWAGPLSGGRTVVAVINLQNTAKTLTLNFPD 180
Query: 316 IGIKSNNTL--FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+G++ +TL T +A VD H + EL
Sbjct: 181 VGLQKASTLKNIWANISTTNVLTSWSADVDAHGTMLLEL 219
>gi|440479915|gb|ELQ60647.1| alpha-galactosidase [Magnaporthe oryzae P131]
Length = 785
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 149/368 (40%), Gaps = 87/368 (23%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQI-NEKIIRET---------- 50
F ++ A +N S L NGLG TP +GWNSWN C E +T
Sbjct: 8 FASTAILALANASPMEKRLENGLGRTPALGWNSWNVAQCNAATEAFALDTANRFISMGLK 67
Query: 51 ---------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSG 101
DDCW R+S L + KP+ ++ + +G
Sbjct: 68 DLGYTYVNIDDCWSTMQRNSSGYLVADPKKWPR----GIKPVVDEIHAKGLKFGLYGSAG 123
Query: 102 TKALADYVHILISNSLDNRIESNIAFEQGIDYLKYDNCFND------DTRPTIR------ 149
TK A Y S + + ++ + E G+DY K+DNC+ T P +
Sbjct: 124 TKTCAGYP---ASQGYEGK-DAQLLAEWGVDYWKHDNCYTPCRQGLPQTCPENQVAGNTR 179
Query: 150 --YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMN 207
Y M DA+ +PIFFS+C WG R WG + NSWR + DI + WA
Sbjct: 180 TWYGTMRDAVLATKKPIFFSLCNWGRDRVWEWGKDYGNSWRMSIDIWNDWAS-------- 231
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
++ NG + + H IWAI+K+P++LG D+ ++A ++ I
Sbjct: 232 ------------------QISNGALNPAQERTHMGIWAIAKSPIILGMDLSKISASSLAI 273
Query: 268 IGNEEVIAVNQ-----------------AINDQ--QIWAGPLSGNRIVVLLESRKTFSSM 308
I N+ +IA+NQ ++ Q WAGPLS +V L + T
Sbjct: 274 IKNKGLIAINQDKLGKAATYFQPPGKPAPVSGQLYPYWAGPLSDGVVVGLTNAMGTGRQT 333
Query: 309 MTAHWDDI 316
+ + D+
Sbjct: 334 LAVDFKDV 341
>gi|336397609|ref|ZP_08578409.1| Alpha-galactosidase [Prevotella multisaccharivorax DSM 17128]
gi|336067345|gb|EGN55979.1| Alpha-galactosidase [Prevotella multisaccharivorax DSM 17128]
Length = 540
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/394 (27%), Positives = 168/394 (42%), Gaps = 64/394 (16%)
Query: 2 FIVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDS 61
FI +A SN ++ L P MGW+SWN F INE +I C G++
Sbjct: 15 FIALIAATPSNAHAQKRAF-----LPPTMGWSSWNTFALNINEDVI-----C-GQADEMV 63
Query: 62 QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISN 115
+ L + + ++ GNL N T FP G K +ADY+H + S+
Sbjct: 64 KKGLDKVGYQYVNIDD--GYWDGRDANGNLRINATLFPKGMKYVADYIHSKGLKAGIYSD 121
Query: 116 SLDNRIESN---------------------IAFEQ-GIDYLKYDNCFN-----DDTRPTI 148
+ DN SN + F G D++K D C D+ +
Sbjct: 122 AGDNTCGSNGGKNAYGVNVGLAGHEYQDCKLYFNDWGYDFIKVDYCGGAHMGLDEQK--- 178
Query: 149 RYRAMSDALKK----AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
+Y +SDA+K+ GR I +++C W P W S + +SWRTT DI D+W I
Sbjct: 179 QYTKISDAIKRCEKETGRRIVYNVCRWA--YPGTWISRVADSWRTTGDIYDAWKSYEGII 236
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
N + G +ND DMLE+G +E H + W I+ +PLL+GCD+ L+ +
Sbjct: 237 RENLYIQAYTGGGHYNDMDMLELGR-SQSLDEDRTHMAYWCITSSPLLIGCDLRTLSDAS 295
Query: 265 MPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
+ ++ N ++IA+NQ + G+ VV + M T H + N
Sbjct: 296 LGLLKNTDLIAMNQDVLGLGAPVAQRQGDVYVVAKD-------MKTLHGSQRAVVVMNLT 348
Query: 325 FEHRTLETKFQG-KLTAKVDGHSCKMYELTPVTK 357
+T+ + + T +V H C T V++
Sbjct: 349 DRRQTINVDLRALQFTGQVAVHDCFTKADTKVSE 382
>gi|17562946|ref|NP_506031.1| Protein GANA-1 [Caenorhabditis elegans]
gi|3878996|emb|CAB00120.1| Protein GANA-1 [Caenorhabditis elegans]
Length = 451
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 134/317 (42%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQI----------NEKIIRE-------------------T 50
L NGLG TPPMGW SW F+C+I NE++ ++
Sbjct: 16 LDNGLGRTPPMGWMSWTAFYCEIDCVKHPTGCINEQLYKDMADQLVSGGYDKVGYKSVHI 75
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RDS G LVANKT FPSG KALA Y+H
Sbjct: 76 DDCWSEMERDSH--------------------------GILVANKTRFPSGMKALAKYMH 109
Query: 111 I------------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ ++++ +DYLK D C D I Y
Sbjct: 110 DRGLKFGIYEDYGTKTCGGYPGSYKHEKVDAQTFAAWDVDYLKLDGCNIDQAMMPIGYPL 169
Query: 153 MSDALKKAGRPIFFSMCEWGDM---RPALWGSNIR----NSWRTTDDISDSWARMLTIA- 204
L + GRPI +S C W P L N+ N+WR DDI+ SW +++I
Sbjct: 170 FEKELNETGRPIMYS-CSWPAYLIDHPELVNYNLIGKYCNTWRNFDDINSSWKSIISIIS 228
Query: 205 ---DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
M + + PG W+DPDML +GN G+ + + F++W I APL++ D+ +
Sbjct: 229 YYDKMQDKHIPTHGPGKWHDPDMLVIGNKGITLDMSISQFTVWCIWSAPLIMSNDLRIIG 288
Query: 262 AETMPIIGNEEVIAVNQ 278
++ N+E I +NQ
Sbjct: 289 DSFKDVLKNKEAIKINQ 305
>gi|241176424|ref|XP_002399574.1| alpha-D-galactosidase, putative [Ixodes scapularis]
gi|215495177|gb|EEC04818.1| alpha-D-galactosidase, putative [Ixodes scapularis]
Length = 425
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 146/330 (44%), Gaps = 71/330 (21%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRETDDCW-GESSRDS---QISL 65
L NGL TPPMGW +W F C I+ E++ +E + E RD+ +++
Sbjct: 32 LENGLARTPPMGWLAWERFQCNIDCARDPENCISERLFKEMAEILVSEGYRDAGYVYVNI 91
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI----------LISN 115
+ FD G L A+ FPSG K LAD++H + +N
Sbjct: 92 DDCWSTFD-----------RDANGRLQADPIRFPSGIKHLADFMHARGLKLGIYGDVGTN 140
Query: 116 SLDNR--------IESNIAFEQGIDYLKYDNCFNDDTRPTIR-YRAMSDALKKAGRPIFF 166
+ I++ + +D +K D C N D R R Y A+ + GRP+ +
Sbjct: 141 TCAGYPGSYSHLYIDAQTFADWDVDMVKMDGC-NLDFRDFERVYTEFGQAINRTGRPMVY 199
Query: 167 SMCEWGDMR----PALWGSNIR------NSWRTTDDISDSWARMLTIAD----MNEVYAD 212
S C W P+ N N WR DI+D+W + D + + A+
Sbjct: 200 S-CSWPAYEVTRVPSFSQPNYPKIGQHCNLWRNYMDIADTWVSVEHTIDFYASIQTMLAE 258
Query: 213 HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEE 272
A PG WNDPDML +GN G+ Y++ ++WAI APLL+ D+ + E I+ N++
Sbjct: 259 AAAPGRWNDPDMLIIGNFGLSYDQSKAQMALWAILAAPLLMSNDLRRMRPEFKKILLNKD 318
Query: 273 VIAVN-----------QAINDQQIWAGPLS 291
+IAVN QA N +IW P++
Sbjct: 319 IIAVNQDPLGRMGQKVQAQNGVEIWRRPIT 348
>gi|259489794|tpe|CBF90359.1| TPA: alpha-1,4-galactosidase (Eurofung) [Aspergillus nidulans FGSC
A4]
Length = 320
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/290 (33%), Positives = 133/290 (45%), Gaps = 52/290 (17%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDS-----L 75
+ L TP MGWNSWN F IN I+ E D L S + D+ L
Sbjct: 17 ATSLAQTPQMGWNSWNSFKLNINASILSEIAD------------LLVSLGLKDTGYNYLL 64
Query: 76 TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------------ILI------SNS 116
G L AN T FP G KALAD VH IL S
Sbjct: 65 LNEGWSSYERTADGYLQANTTGFPDGIKALADEVHDKGLKLGLYGDSGILTCAFRTGSWG 124
Query: 117 LDNRIESNIAFEQGIDYLKYDNC--FNDDTR-PTIRYRAMSDALKKAGRPIFFSMCEWGD 173
+ R IA GIDYLKYDNC F T P R+ AM +AL + GR IF+S+CEWG
Sbjct: 125 YEERDALTIA-GWGIDYLKYDNCGGFRAMTNAPQERFLAMQNALLRTGRDIFYSVCEWGY 183
Query: 174 MRPALWGSNIRNSWRTTDDISD---SWARMLTIADMNEVYADH-------AKPGGWNDPD 223
P WG+NI +S+R ++DI++ S R+ + D + + A P + P
Sbjct: 184 QFPWHWGANIGHSYRMSEDITNCILSEHRICWLLDYDHCAENERDERLSIALPLARHGP- 242
Query: 224 MLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
VGN M HF+ WA ++PL++ D+ +T E++ ++ N+++
Sbjct: 243 -ARVGNFNMTMYMQQTHFAFWAALESPLIISADLRKMTNESLAVLTNKDI 291
>gi|403738601|ref|ZP_10951202.1| putative glycoside hydrolase [Austwickia chelonae NBRC 105200]
gi|403191251|dbj|GAB77972.1| putative glycoside hydrolase [Austwickia chelonae NBRC 105200]
Length = 405
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 143/341 (41%), Gaps = 52/341 (15%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKP 82
+ TPPMGW +WN F I+E ++ D + RD+ + + D Q +
Sbjct: 9 IAATPPMGWCTWNAFRTAIDETMVIAMADAMASTGLRDA----GYRYLVIDDGWQAPRR- 63
Query: 83 LPNQCQGNLVANKTTFPSGTKALADYVH-----------------ILISNSLDNRIESNI 125
G L ++ FPSG AL + +I + +
Sbjct: 64 ---DRHGRLASDPHRFPSGIPALVEETERRGLTLGIYASPGRQTCAMIYDQYTGEGLGSY 120
Query: 126 AFEQ---------GIDYLKYDNCFNDDTRPTIRYRA----MSDALKKAGRPIFFSMCEWG 172
EQ GI YLKYD C + + YR+ M + G P +S+ E+G
Sbjct: 121 GHEQQDMTQFADWGIGYLKYDWCEAHRSGTGLDYRSAFTRMRSVIDDLGWPGIYSISEYG 180
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W +SWRTT DI +W +L IA + +A PGGWND DML+VGNG +
Sbjct: 181 RSNPWEWAPGTAHSWRTTPDILPTWESILGIAHNSRFIGRYAGPGGWNDLDMLQVGNGEL 240
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI-----------N 281
E H ++W++ +PL+ G D+ + ++ N VIA++Q +
Sbjct: 241 TAAECRTHLTLWSLLASPLMAGNDLRTMPEWVSSVLTNPGVIAIDQDPAGRPGNRLLRED 300
Query: 282 DQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNN 322
+IW PL+ R V + + T + WD +++ +
Sbjct: 301 THEIWTRPLARGRHAVAVIA--TGREPVHVTWDGTTLRTRD 339
>gi|390480954|ref|XP_003736041.1| PREDICTED: LOW QUALITY PROTEIN: alpha-N-acetylgalactosaminidase
[Callithrix jacchus]
Length = 576
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 132/301 (43%), Gaps = 50/301 (16%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCS 68
+L NGL TPPMGW +W F C I+E++ E D ++D + +
Sbjct: 221 MLDNGLLRTPPMGWLAWERFRCNTNCDEDPKNCISEQLFMEMAD---RMAQDGWRDMGYT 277
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------------- 112
+ D I + G L+ + FP G LADYVH L
Sbjct: 278 YVNIDD-CWIGGR----DASGRLMPDPKRFPHGIPFLADYVHSLGLKLGIYADMGKFTCM 332
Query: 113 --ISNSLDNRIESNIAF-EQGIDYLKYDNCFN-DDTRPTIRYRAMSDALKKAGRPIFFSM 168
+LD ++ F E +D LK D CF+ D R M+ A PI FS
Sbjct: 333 HYPGTTLDKVVQDAQTFSEWKVDMLKLDGCFSTDKERALGEVPRMAAAXMPQAGPIAFS- 391
Query: 169 CEW----GDMRPALWGS---NIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPG 217
C W G + P + S +I N WR DDI DSW +L+I ++ A PG
Sbjct: 392 CSWPAYEGGLPPRVNYSLLVDICNLWRNYDDIQDSWRSVLSILNWFVQHQDILQPVAGPG 451
Query: 218 GWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN 277
WNDPDML +GN G+ + ++W + APL + D+ ++A+ M I+ N +I +N
Sbjct: 452 HWNDPDMLLIGNFGLSLEQSRAQMALWTVLAAPLFMSTDLRTISAQNMDILQNPLMIKIN 511
Query: 278 Q 278
Q
Sbjct: 512 Q 512
>gi|441507581|ref|ZP_20989507.1| putative alpha-galactosidase [Gordonia aichiensis NBRC 108223]
gi|441448657|dbj|GAC47468.1| putative alpha-galactosidase [Gordonia aichiensis NBRC 108223]
Length = 415
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 164/406 (40%), Gaps = 103/406 (25%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQ 62
+GL PPMGWNSWN F C + E+I+R DDCW RD+
Sbjct: 37 SGLAAIPPMGWNSWNTFGCGVTEQIVRAQADALVSSGLRDVGYKYVIVDDCWAAPQRDAA 96
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDN 119
G L A+ FPSG AL Y+H +L
Sbjct: 97 --------------------------GRLQADPVRFPSGMAALGTYLHDRGLLFGVYSGA 130
Query: 120 RIESNIAFEQ----------------------GIDYLKYDNCFNDDTR--PTIRYRAMSD 155
R ++ F+ G+DYLKYD C +D + + AM D
Sbjct: 131 RDKTCTQFQGRYPGATGSGGHESVDAQTFARWGVDYLKYDWCSSDASHDDQVSAFTAMRD 190
Query: 156 ALKKAGRPIFFSMCEWGDMRPAL------WGSNIRNSWRTTDDISDSWAR---------M 200
AL++ R I F++ + ++ WG + R T+DI+ +W+ +
Sbjct: 191 ALRETARRIVFAINPNSGVAGSVPGAEFDWGGTATTT-RVTNDIAPAWSTTGGPSGSQGI 249
Query: 201 LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
+ I D KPG + DPDML G+G + + S+W++ APL++G D+ +
Sbjct: 250 VNIIDAAGPLTSRVKPGSFIDPDMLVAGSGSLTPAQQRTQLSMWSMMAAPLIMGNDLTTM 309
Query: 261 TAETMPIIGNEEVIAVNQ---------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTA 311
+ +T ++ N VI V+Q +D ++W+ + +V+ L +R ++
Sbjct: 310 SQDTRELLRNATVINVDQDSRVVAGAMVDDDPEVWSRAIGDKGLVISLTNRADHPRTLSV 369
Query: 312 HWDDIGIKSNNTL-----FEHRTLETKFQGKLTAKVDGHSCKMYEL 352
+G+ + ++ + R+ T G+L+ V H M E+
Sbjct: 370 PLGSVGLVGDASVTGVDAWTGRSY-TAQHGELSVPVGVHDTVMLEI 414
>gi|145244232|ref|XP_001394616.1| alpha-galactosidase B [Aspergillus niger CBS 513.88]
gi|134079306|emb|CAK96935.1| unnamed protein product [Aspergillus niger]
Length = 432
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 138/331 (41%), Gaps = 103/331 (31%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
+G P MGWN WN + C I+E+I+ T DDCW S
Sbjct: 25 MGKLPAMGWNPWNTYGCNISEEILLSTAEQLINLGLLEAGYNYVNIDDCWSVKSGR---- 80
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN 124
D++TQ ++ + +FP+G +A +H S L I S+
Sbjct: 81 --------DNVTQ------------QIIPDPASFPNGISGVASTLH---SQGLKLGIYSS 117
Query: 125 ------------IAFEQ---------GIDYLKYDNC-------------------FNDDT 144
+ +E G+DYLKYDNC +DT
Sbjct: 118 AGTTTCAGYPASLGYEDIDAATFASWGVDYLKYDNCGVPSNWTDQYNSCTDRWTNMQNDT 177
Query: 145 -----------------RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
+RY M DAL++ I +++C WG W + + S+
Sbjct: 178 CIGLTNPAPPGYDWSKSLTAVRYGRMKDALQRQNHSILYALCPWGFAEVQTWATGVGASF 237
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
R + DI SW +L I + N ++ G +D DMLEVGN G+ + E HF++WA+
Sbjct: 238 RMSKDIKASWDYVLLILNENSFLMNYNDFGIHSDADMLEVGNNGLTFPEQRSHFALWALM 297
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
K+PL++G + NL+A+ + ++ N ++A NQ
Sbjct: 298 KSPLIIGSKLSNLSADQLSLLTNPYLLAFNQ 328
>gi|198454540|ref|XP_002137888.1| GA26269 [Drosophila pseudoobscura pseudoobscura]
gi|198132839|gb|EDY68446.1| GA26269 [Drosophila pseudoobscura pseudoobscura]
Length = 424
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 144/318 (45%), Gaps = 52/318 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES--SRDSQISLSCSF----HMFD 73
L NG+ TPPMGW + F C ++ I DC ES R + + +S + +
Sbjct: 27 LENGVAQTPPMGWMPFERFQCTVD--CIHFPRDCISESLFKRTADLLVSEGYAAVGFQYL 84
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNR-------- 120
+ + + + L+A++ FPSG AL+ Y+H I + + +
Sbjct: 85 IIDDCWMERIRDTSTQKLLADRKRFPSGMSALSRYIHGRGLKFGIYHDVGQQTCMFRGPG 144
Query: 121 ------IESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-- 172
+++ + G DY+K D C+ ++ Y A A+ + GR + +S C W
Sbjct: 145 ARGHFELDAQTFADWGADYVKLDGCYATESERNRGYPAFGKAMNRTGRTMVYS-CSWPFY 203
Query: 173 ----DMRPALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDM 224
D R N+ WR +DI+DS+ + I ++ E+ HA PG WNDPDM
Sbjct: 204 TEKPDYRLIAKHCNL---WRFAEDITDSYTSVFRIMELYRRNQELLLAHAGPGRWNDPDM 260
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------ 278
L +GN + Y+ + +IW++ APL++ D+ + E ++ N EVIAVNQ
Sbjct: 261 LVLGNFRLSYDASRLQLAIWSVIAAPLIMTNDLETVRPEIKELLQNREVIAVNQDLLGMP 320
Query: 279 -----AINDQQIWAGPLS 291
A Q+W P++
Sbjct: 321 GKCVRASKYFQVWVRPVT 338
>gi|380025440|ref|XP_003696482.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Apis florea]
Length = 439
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 137/341 (40%), Gaps = 96/341 (28%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIR-------------------ET 50
L NGL TPPMGW +W F C I++++ R
Sbjct: 21 LENGLVKTPPMGWLAWERFRCNTDCKNDPENCISDRLFRTMADIVVAEGYAAVGYEYINI 80
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E R G LV ++ FP G K+LA+Y+H
Sbjct: 81 DDCWLEKDR--------------------------SVNGQLVPDRQRFPYGMKSLANYIH 114
Query: 111 -----ILISNSLDN-------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
I N +++ +DY+K D C++ + Y
Sbjct: 115 SKGLKFGIYEDFGNYTCAGYPGILGYLETDAHTFASWDVDYVKLDGCYSHPSEMDRGYPE 174
Query: 153 MSDALKKAGRPIFFSMCEW------GDMRPALWG-SNIRNSWRTTDDISDSWARMLTIAD 205
L + GR + +S C W M+P + N WR DDI DSW + TI D
Sbjct: 175 FGFYLNQTGRAMVYS-CSWPVYQIYAGMQPNYTAITEHCNLWRNFDDIQDSWNSLETIID 233
Query: 206 M----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+V +A PG WNDPDML +GN G+ Y + ++WAI APLL+ D+ +
Sbjct: 234 YYGNNQDVIVPNAGPGHWNDPDMLIIGNFGLSYEQSKTQMALWAILAAPLLMSVDLRTIR 293
Query: 262 AETMPIIGNEEVIAVNQ---AINDQ--------QIWAGPLS 291
E I+ N ++IAV+Q I + +IWA P++
Sbjct: 294 PEYKAILQNRKIIAVDQDPLGIQGRRIYKHKGIEIWARPIT 334
>gi|282878813|ref|ZP_06287580.1| alpha-galactosidase [Prevotella buccalis ATCC 35310]
gi|281299021|gb|EFA91423.1| alpha-galactosidase [Prevotella buccalis ATCC 35310]
Length = 724
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 153/347 (44%), Gaps = 82/347 (23%)
Query: 8 AADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQIS 64
A +S G D ++ + + + LTPPMGWNSWN CWG S ++
Sbjct: 332 AENSKGKDEKDFSIHVGHQIALTPPMGWNSWN----------------CWGTSVSQEKVM 375
Query: 65 LSCSFHMFDSLTQI---------AAKPLPNQCQGNLVANKTTFPSGTKALADYVH----- 110
S + L A + G + N+ FP+ K L D++H
Sbjct: 376 ASAKALIDRGLADYGYNYINVDDAWEAEKRNADGTIAVNEK-FPN-MKGLGDWLHNNGLR 433
Query: 111 --ILIS----------NSLDNRIESNIAF-EQGIDYLKYDNC-----FNDDTRPTIR--- 149
I S SLD+ + + E G+DYLKYD C F+ D ++
Sbjct: 434 FGIYSSPGDLTCGHYLGSLDHEEQDAKTYNEWGVDYLKYDWCGYSRKFDADGDLSVAAYV 493
Query: 150 --YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNI-RNSWRTTDDISDSWARMLTIADM 206
Y M + L+ R IF+S+C++G WG + NSWRTT DI+D+W + I +
Sbjct: 494 RPYLKMQEYLRAQPRDIFYSLCQYGMADVWKWGHAVDANSWRTTGDITDTWQSLYYIGFV 553
Query: 207 N--EVYADHAKPGGWNDPDMLEVGNGG---------MKYNEYVVHFSIWAISKAPLLLGC 255
E+Y +A PG WNDPDML VG G + +E H S+W + A +L+G
Sbjct: 554 RQAELYP-YAGPGHWNDPDMLVVGKVGWGPKLHDTRLTPDEQYTHISLWTLLAANMLMGG 612
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ-AINDQ----------QIWAGPLS 291
D+ + T ++ N EV A+NQ A+ Q QIW PL+
Sbjct: 613 DLSQMDDFTFGLLCNNEVNAINQDALGKQAKRDVLDGDIQIWQRPLA 659
>gi|410916273|ref|XP_003971611.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Takifugu rubripes]
Length = 431
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/303 (31%), Positives = 132/303 (43%), Gaps = 56/303 (18%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW +W F C I DC + + IS + M D L+Q
Sbjct: 19 LDNGLMRTPPMGWLAWERFRCDI---------DC--DQDPKNCISENLFTDMADRLSQDG 67
Query: 80 AKPLP------NQC--------QGNLVANKTTFPSGTKALADYVHI-------------- 111
K L + C +G L + FP G LA Y+H
Sbjct: 68 WKDLGYVYVNIDDCWSSKERDEKGRLQPDPKRFPGGIPKLARYMHDRGLKLGIYGDMGTL 127
Query: 112 ----LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
LD ++++ E +D LK D C++++ Y MS AL GRPI +
Sbjct: 128 TCGGYPGTPLDKIKMDAQTFAEWEVDMLKLDGCYSNEIEQQQGYPLMSKALNATGRPIAY 187
Query: 167 SMCEW----GDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAK 215
S C W G + P + I N WR DI DSW +L I D ++ A
Sbjct: 188 S-CSWPAYQGGLPPKVNYTQLGEICNLWRNYGDIQDSWDSVLNIIDWFFDNQDIIGPAAG 246
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML VG+ G+ ++ ++WAI APL + D+ +++E I+ N+ I
Sbjct: 247 PGRWNDPDMLIVGDFGLSTDQSRTQMALWAIMAAPLFMSNDLRTISSEARSILQNKLAIR 306
Query: 276 VNQ 278
+NQ
Sbjct: 307 INQ 309
>gi|270294333|ref|ZP_06200535.1| alpha-galactosidase [Bacteroides sp. D20]
gi|270275800|gb|EFA21660.1| alpha-galactosidase [Bacteroides sp. D20]
Length = 655
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 151/341 (44%), Gaps = 65/341 (19%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V+ A + G+ ++ + + + + LTP MGWNSWN + +N++ +R+ E
Sbjct: 245 YKVTLKATNDKGTAQKEVTVKIGDAIALTPSMGWNSWNCWGLSVNDEKVRDAARMMNEKL 304
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ + D + A++ QG +++N FP KAL DY+H L
Sbjct: 305 H----AYGWEYVNIDDGWEAASR----TKQGEILSNDK-FPD-FKALTDYIHGLGLKFGI 354
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
+ + I++ G+DYLKYD C D TI+ Y M
Sbjct: 355 YSSPGHITCGGHVGSYQHEEIDAKTWERWGVDYLKYDYCGYLEIEKDSEEKTIQEPYIVM 414
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWG---SNIRNSWRTTDDISDSWARMLTIADMNEVY 210
AL K R I + + G P +W N WRTT DI+D W + I +V
Sbjct: 415 RKALDKVNRDIVYCV---GYGAPNVWNWAPEAGGNQWRTTRDITDEWNVVTAIGTFQDVC 471
Query: 211 ADHAKPGGWNDPDMLEVGNGGMKY-----------NEYVVHFSIWAISKAPLLLGCDVGN 259
AD PG NDPDML VG G + +E H S+W + +PLL+GCD+ N
Sbjct: 472 ADATAPGRNNDPDMLVVGKLGQGWGSKVHDSYLTADEQYSHISLWCLLSSPLLIGCDMAN 531
Query: 260 LTAETMPIIGNEEVIAVNQ----------AINDQQIWAGPL 290
+ T+ ++ N EVIAV+Q + + Q+W+ L
Sbjct: 532 MDDFTLNLLTNNEVIAVSQDPMVAPAKKMMVENGQVWSKKL 572
>gi|270010257|gb|EFA06705.1| hypothetical protein TcasGA2_TC009636 [Tribolium castaneum]
Length = 385
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 137/316 (43%), Gaps = 68/316 (21%)
Query: 30 MGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
MGW W F C I+EK+ + D + + + ++ + + D
Sbjct: 1 MGWMHWQRFRCLVDCDAYPDECISEKLFKTMAD---KMASEGYLAAGYEYLIIDDCWMAK 57
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------- 126
+ QG L N T FPSG KAL+DYVH S L I S+
Sbjct: 58 NR----DAQGRLQPNATRFPSGIKALSDYVH---SKGLKFGIYSDYGTKTCAGYPGSHGH 110
Query: 127 --------FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW------G 172
E G+DYLK D C+ D Y M L + GRPI +S C W
Sbjct: 111 LETDAQTFAEWGVDYLKLDGCYADLDDLEPGYIQMGKFLNQTGRPIVYS-CSWPAYQEPK 169
Query: 173 DMRPALWGSNIR---NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDML 225
++P + +R N WR DDI D+W+ + +I + + A+ + PG WNDPDML
Sbjct: 170 GVKPNY--TALRETCNLWRNWDDIDDTWSNVTSILKWFSTNQDRIAEFSGPGHWNDPDML 227
Query: 226 EVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
+GN G+ Y + ++WAI APL++ D+ + + ++ ++EVI +NQ
Sbjct: 228 IIGNFGLSYEQSKAQMALWAILAAPLIMSVDLRTIEPKFRDVLLHQEVIKINQDALGIQG 287
Query: 279 ----AINDQQIWAGPL 290
IN IW P+
Sbjct: 288 RLITTINKIDIWTKPI 303
>gi|189461929|ref|ZP_03010714.1| hypothetical protein BACCOP_02596 [Bacteroides coprocola DSM 17136]
gi|189431323|gb|EDV00308.1| alpha-galactosidase [Bacteroides coprocola DSM 17136]
Length = 542
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 135/287 (47%), Gaps = 51/287 (17%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQ----ISLSCSFHMFDSLTQIAAKPL 83
P MGW+SWN + INE++I++ D D I++ F F
Sbjct: 32 PIMGWSSWNTYRVNINEELIKKQADAMISQGLDKVGYHFINIDDGFFGF----------- 80
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL-------------ISNSL----DNRI----- 121
+G L + FP+G K +ADY+H L SL N I
Sbjct: 81 -RDEKGILHTHPQRFPNGMKGIADYIHSLGLKAGIYSEAGANTCGSLWDGDKNGIGVGLY 139
Query: 122 -----ESNIAF-EQGIDYLKYDNCFND---DTRPTIRYRAMSDALKKA-GRPIFFSMCEW 171
++N+ F E G D++K D C D RY + +A+++ R I ++C W
Sbjct: 140 GFEHQDANLFFNEWGFDFIKTDYCGAGQQLDLEEQERYTEIVNAIREVCPRNISLNICRW 199
Query: 172 GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG 231
P W S++ SWR + DI+ SW + I D N + A G +ND DMLE+G G
Sbjct: 200 A--YPGTWVSSLARSWRISGDINPSWESVKYIIDKNLYLSAFAGNGHYNDMDMLEIGRG- 256
Query: 232 MKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+K E HF +W I +PLL+GCD+ + A ++ ++ N+E+IA+NQ
Sbjct: 257 LKPEEEETHFGMWCIMSSPLLIGCDLTAIPASSLQLLKNKELIALNQ 303
>gi|350631382|gb|EHA19753.1| alpha-galactosidase [Aspergillus niger ATCC 1015]
Length = 432
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 137/331 (41%), Gaps = 103/331 (31%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQIS 64
+G P MGWN WN + C I+E+I+ T DDCW S
Sbjct: 25 MGKLPAMGWNPWNTYGCNISEEILLSTAEQLINLGLLKAGYNYVNIDDCWSVKSGR---- 80
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESN 124
D++TQ ++ + FP+G +A +H S L I S+
Sbjct: 81 --------DNVTQ------------QIIPDPANFPNGISGVASTLH---SQGLKLGIYSS 117
Query: 125 ------------IAFEQ---------GIDYLKYDNC-------------------FNDDT 144
+ +E G+DYLKYDNC +DT
Sbjct: 118 AGTTTCAGYPASLGYEDIDAATFASWGVDYLKYDNCGVPSNWTDQYNSCTDRWTNMQNDT 177
Query: 145 -----------------RPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSW 187
+RY M DAL++ I +++C WG W + + S+
Sbjct: 178 CIGLTNPAPPGYDWSKSLTAVRYGRMKDALQRQNHSILYALCPWGFAEVQTWATGVGASF 237
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
R + DI SW +L I + N ++ G +D DMLEVGN G+ + E HF++WA+
Sbjct: 238 RMSKDIKASWDYVLLILNENSFLMNYNDFGIHSDADMLEVGNNGLTFPEQRSHFALWALM 297
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
K+PL++G + NL+A+ + ++ N ++A NQ
Sbjct: 298 KSPLIIGSKLSNLSADQLSLLTNPYLLAFNQ 328
>gi|453083490|gb|EMF11536.1| glycoside hydrolase family 27, carbohydrate-binding module family
35 protein [Mycosphaerella populorum SO2202]
Length = 567
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 163/354 (46%), Gaps = 67/354 (18%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL 83
G TP +G+N++N C N+ +R T S + I+ F D Q +
Sbjct: 28 FGHTPLLGYNTYNDVACSPNQTHLRTTIHAL---STHALIAAGYKFFQLDCGWQGFTRS- 83
Query: 84 PNQCQGNLVANKTTFPSGT---KALADYVHILIS-------NSLDNRIE------SNIAF 127
PN G+L +++ FP G ALA + ++ S S D R + ++ F
Sbjct: 84 PN---GSLTFDESVFPDGIAPLSALAREMGLVWSMYTNQGVYSCDTRSKEEGLRPGSLGF 140
Query: 128 EQ---------GIDYLKYDNCFND--------DTRPTI--RYRAMSDALKKAGRPIFFSM 168
E+ G Y+K DNC+ + D R R++ MS AL++ G P +
Sbjct: 141 EREDARVFEGWGTGYVKVDNCYIEGAENNAPKDPRTDFPSRFQKMSTALQQVGIPGIL-V 199
Query: 169 CEWGD----------MRPALWGSNIRNSWRTTDDISDSWARMLTIADMN--EVYADHAKP 216
C+WG PA W + NS+R +DDI+ WA +L I + N ++ D++ P
Sbjct: 200 CQWGVPYQNPTDGSLQGPATWTPPMANSFRVSDDIAPGWANVLRIMNQNLHVIWNDNSGP 259
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G W D D+LEVGN GM E HF+IWA+ K+ L++ +V +++ I+ N ++IA+
Sbjct: 260 GHWADMDLLEVGNSGMTIEEQASHFAIWAMFKSSLMISTNVPSMSEAVREILLNRDLIAI 319
Query: 277 NQAI------------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGI 318
NQ +D+ ++AGPL + VL+ S ++ +GI
Sbjct: 320 NQDAAGKPVKLIQRFSHDKDVFAGPLENGDLAVLVLDHSNTSRVLELELALLGI 373
>gi|441150599|ref|ZP_20965568.1| ricin B lectin [Streptomyces rimosus subsp. rimosus ATCC 10970]
gi|440619153|gb|ELQ82206.1| ricin B lectin [Streptomyces rimosus subsp. rimosus ATCC 10970]
Length = 437
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 103/407 (25%), Positives = 156/407 (38%), Gaps = 104/407 (25%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIR-------------------ETDDCWGESSRDSQIS 64
L PPMGWN+W+++ C INEK+I DDCW RD++
Sbjct: 57 LAPKPPMGWNNWSYYMCDINEKVILGNARALVKSGLAAKGYDTVTVDDCWMSKQRDAR-- 114
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---------ISN 115
GNLV + FP G + +H + + N
Sbjct: 115 ------------------------GNLVPDPARFPRGMAYVGRELHRMGLKFGIYEDVGN 150
Query: 116 SLDNRIESNIAFEQ---------GIDYLKYDNCFNDDTRPTIR--------YRAMSDALK 158
+ + Q +DYLK D C N T P YRA+S AL+
Sbjct: 151 LTCEKYPGSFGHYQQDADLFAKWKVDYLKMDGC-NLSTAPGKTKERTYHDAYRAVSRALR 209
Query: 159 KAGRPIFFSMCEW------GDM---RPALWGSNIRNSWRTTDDIS-------DSWARMLT 202
GR I FS+ GD + W S + N WR DI+ W+ +
Sbjct: 210 NTGRDIVFSVSAPAYFQYDGDQVWHQVIRWSSQVGNLWREGRDIAVEKHSGAKKWSSIKY 269
Query: 203 IADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTA 262
N A +PG WNDPD L G+ G+ + S+WA+ APL+ D+ ++
Sbjct: 270 NYAYNAKLAPFQRPGRWNDPDFLLAGDSGLSTRQMQSQMSLWAVMAAPLISSTDLTKISP 329
Query: 263 ETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMTA 311
+ ++GN ++IAV+Q Q + + PL G V L + + +T
Sbjct: 330 AALKVLGNRDIIAVDQDPKGVQGRVVQQGTGYDVLSKPLQGGDHAVALFNSGDSAQTLTT 389
Query: 312 HWDDIGIKSNNTLFEHRTLE----TKFQGKLTAKVDGHSCKMYELTP 354
G+ +E R L T+ GK++ +V H+ +Y + P
Sbjct: 390 TAKAAGLPQARA-YELRDLNTGRTTRTAGKISVRVPAHATVLYRVHP 435
>gi|312065148|ref|XP_003135649.1| hypothetical protein LOAG_00060 [Loa loa]
gi|307769180|gb|EFO28414.1| hypothetical protein LOAG_00060 [Loa loa]
Length = 415
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 139/352 (39%), Gaps = 90/352 (25%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TPPMGW SW FHCQ I+EK+ +
Sbjct: 20 LENGLARTPPMGWMSWTKFHCQMDCVHHPFTCISEKLYMDMADRMVDDGYLQAGYQYVHV 79
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD L+ ++ FP G ALADY+H
Sbjct: 80 DDCWMERKRDQ--------------------------HNRLIPDRQRFPGGMAALADYMH 113
Query: 111 I------------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ +I+++ E +DYLK D C D Y
Sbjct: 114 QRGLKFGIYEDYGTATCAGFPGSYQYTKIDADTFAEWQVDYLKLDGCNIDVNLMPSGYAE 173
Query: 153 MSDALKKAGRPIFFSMCEWG--------DMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
M L GRPI +S C W + L G + N WR DDI SWA + TI
Sbjct: 174 MGRMLNLTGRPIVYS-CSWPAYLINQPEKVDYQLIGHHC-NLWRNFDDIKRSWASVRTII 231
Query: 205 DMNEVYADH----AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D + + D PG W+DPDM+ VGN + ++ V SIW+I +PL++ D+ +
Sbjct: 232 DYYDYHQDKHIPAQGPGRWHDPDMIIVGNTELTVDQAKVQMSIWSIWSSPLIMSNDLRLI 291
Query: 261 TAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
I+ N VIA++Q D G L N + + + ++ +AH
Sbjct: 292 APVFRDILLNRRVIAIDQ---DSLGIMGRLVANTTDIGIYVKPVIPAVPSAH 340
>gi|334199180|gb|AEG73867.1| alpha-galactosidase [Aspergillus ustus]
Length = 438
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 93/314 (29%), Positives = 148/314 (47%), Gaps = 49/314 (15%)
Query: 3 IVSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET------------ 50
IV SS A G R + L G+G P +G+N++N F C +E ++
Sbjct: 6 IVISSLAALAG--RASALDAGVGKLPKLGYNTFNAFGCNYDEDVVLSQAKAMKALGLVDL 63
Query: 51 -------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTK 103
DDC E +RDS L F K L +Q + +++ +G
Sbjct: 64 GYKSFLFDDCMTEKTRDSNGRLVADAEKFPH----GLKQLTSQLKSLGISSSAYSDAGHW 119
Query: 104 ALADYVHILISNSLDNRIESNIAFEQ-GIDYLKYDNCF---NDDTRPTI--RYRAMSDAL 157
A Y S + + ++E G DYLKYDNCF ++ T+ + RY+ M+DA+
Sbjct: 120 TCAGY-----PGSYGHEAQDLESWENWGFDYLKYDNCFIPFDNVTQENVYGRYKRMADAI 174
Query: 158 ------KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
K + FS+CEWG +P +W + SWR DI W+ + +I + +
Sbjct: 175 ADRAARKPHSKSFQFSLCEWGWQQPWIWARQLGQSWRVNGDIKPWWSSIASIINTASFIS 234
Query: 212 DHAKPGGWNDPDMLEVGN-------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
G ND D+LEVGN G M Y+E HF+ WA+ K+PLL+ ++ N+T ++
Sbjct: 235 SATDFYGRNDFDILEVGNYGQGEPHGNMTYDEEKSHFTTWALLKSPLLISANLANITRQS 294
Query: 265 MPIIGNEEVIAVNQ 278
+ I+ N++++ +NQ
Sbjct: 295 LEILSNKDLLRINQ 308
>gi|330933449|ref|XP_003304179.1| hypothetical protein PTT_16647 [Pyrenophora teres f. teres 0-1]
gi|311319407|gb|EFQ87742.1| hypothetical protein PTT_16647 [Pyrenophora teres f. teres 0-1]
Length = 416
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 132/336 (39%), Gaps = 110/336 (32%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGWNSWN + C INE + E DDCW + H
Sbjct: 1 MGWNSWNEYGCAINETVFLEVGEFLNTLGLKNLGYTYVNIDDCWSNKT-----------H 49
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
D++T G + + T FP+G K AD +H L
Sbjct: 50 QRDNVT------------GQIRPDSTKFPNGIKHTADEIHKLGLKVGIYSDAGDTTCGGY 97
Query: 114 SNSLDNR-IESNIAFEQGIDYLKYDNCF-----------------------------NDD 143
+ SL++ +++ + GIDYLKYDNC N +
Sbjct: 98 AGSLEHEGLDARTFADWGIDYLKYDNCAVPDRWHDEYRWWPENWLGGPPAENQTAGDNGE 157
Query: 144 TRPT--------------IRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRT 189
T+P RY+ MSDAL R I FS C WG WG+ +SWR
Sbjct: 158 TKPVAAPAGYDWTTSNTFTRYKTMSDALLATNRTIEFSQCAWGHAHIDEWGNRTGHSWRM 217
Query: 190 TDDISDSWAR-------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFS 242
DI W ++ I + Y + G D DMLEVGNG E HF+
Sbjct: 218 WGDIYPQWEGNHQGSWGLMPILNHASFYNNDTDFWGHGDWDMLEVGNGNFTIEENRSHFA 277
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+WA K+PL++G + ++ ET+ I+ N+E+I NQ
Sbjct: 278 LWAALKSPLIIGTPLHDIKPETLEILSNKELIDFNQ 313
>gi|452000003|gb|EMD92465.1| glycoside hydrolase family 27 protein [Cochliobolus heterostrophus
C5]
Length = 423
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 94/331 (28%), Positives = 132/331 (39%), Gaps = 105/331 (31%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGW+SWN + C INE + E DDCW S +D Q
Sbjct: 1 MGWSSWNVYRCDINETLFVEVGNLMVSLGLKDLGYSYVNIDDCW--SDKDYQR------- 51
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
D +T G + + + FP G K AD +H L
Sbjct: 52 --DDIT------------GKIRPDYSKFPKGIKHTADEIHKLGLRFGIYSDSGTLTCAGY 97
Query: 114 SNSLDN-RIESNIAFEQGIDYLKYDNCFN----------------------DDTRPTI-- 148
+ SL N R+++ E G+DYLKYDNCF DD +
Sbjct: 98 AGSLGNERLDAETFAEWGVDYLKYDNCFVPQEWQDRYRWSPDDWVADPTTVDDAETEVIS 157
Query: 149 --------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDIS 194
RY+ M DAL RPI FS C WG+ WG+ +SWR DI
Sbjct: 158 APDGYDWRSSKTFTRYKRMGDALLATNRPIQFSQCAWGNAHIDQWGNETGHSWRMWGDIG 217
Query: 195 DSWAR-------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
W ++ I + + + G D DML+VGNG + + + HF++WA
Sbjct: 218 PRWEEFQNGDWGIMPILNHASFFINGTDFWGHGDWDMLQVGNGELTFQKSRTHFAMWAAL 277
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
K+PL++G + N+ E + I+ N E+I NQ
Sbjct: 278 KSPLIIGTPLHNIKPEILGILSNRELIDFNQ 308
>gi|326912062|ref|XP_003202373.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Meleagris
gallopavo]
Length = 458
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 87/259 (33%), Positives = 112/259 (43%), Gaps = 59/259 (22%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G LV + FP G KALADYVH
Sbjct: 71 DDCWAAKQRDAE--------------------------GRLVPDPERFPRGIKALADYVH 104
Query: 111 -------------ILI-----SNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTIRYR 151
IL +LD + F E G+D LK D C++ Y
Sbjct: 105 ARGLKLGIYGDLGILTCQGYPGTTLDRIGQDARTFAEWGVDMLKLDGCYSSGKEQAQGYP 164
Query: 152 AMSDALKKAGRPIFFSMCEW----GDMRP----ALWGSNIRNSWRTTDDISDSWARMLTI 203
M+ AL GRPI +S C W G + P L G I N WR DDI DSW +L+I
Sbjct: 165 EMARALNATGRPIIYS-CSWPAYQGGLPPKVNYTLLG-EICNLWRNYDDIQDSWDSVLSI 222
Query: 204 ADM----NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
D +V A PG WNDPDML +GN G+ Y + ++W I APLL+ D+
Sbjct: 223 VDWFFTNQDVLQLAAGPGHWNDPDMLIIGNFGLSYEQSRSQMALWTIMAAPLLMSTDLRT 282
Query: 260 LTAETMPIIGNEEVIAVNQ 278
+ I+ N +I +NQ
Sbjct: 283 ILPSAKEILQNRLMIQINQ 301
>gi|256052817|ref|XP_002569946.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase [Schistosoma
mansoni]
Length = 444
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 143/351 (40%), Gaps = 105/351 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NG+ PPMGW +W F CQ I+EK+I+ T
Sbjct: 33 LDNGMATVPPMGWMTWQRFRCQTDCQNYPDGCISEKLIKRTADRLVLDGWRDLGYRYVIV 92
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RDS+ + +VA+ FP+G K++ Y+H
Sbjct: 93 DDCWPEWKRDSRTN-------------------------EIVADNKRFPNGIKSVGQYLH 127
Query: 111 ---ILISNSLDN---------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ LD +++ + DY+K D C + + Y
Sbjct: 128 SKNLRFGIYLDYGTRTCEGYPGSMNYLELDAKSVAKWEADYVKMDGCNSPENIMPDGYGK 187
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPAL--WGSNIR-----------NSWRTTDDISDSWAR 199
S L GRPI FS C + PA W N N WR D+ DSW+
Sbjct: 188 FSKLLNATGRPIVFS-CSY----PAYISWMKNTSLIDWKKLQNNCNLWRMLGDVQDSWSS 242
Query: 200 MLTIAD----MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
+++I + N V A PG WNDPDML +GN G+ ++ V +W + APLL+
Sbjct: 243 VISIINAYKLQNAVLPKLAGPGHWNDPDMLLMGNFGLSDDQKRVQMGMWCMFSAPLLISA 302
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRI 295
D+ L ++ ++ N ++A++Q + ND Q+W PL G+ I
Sbjct: 303 DMDKLDNFSVSLLRNARLLAIDQDKGGHQAEFVKSQNDVQMWVRPLDGDPI 353
>gi|141795796|gb|AAI39644.1| Zgc:123333 protein [Danio rerio]
Length = 415
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/393 (27%), Positives = 160/393 (40%), Gaps = 76/393 (19%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPL--- 83
TPPMGW +W F C I DC ++ + IS M D L++ K L
Sbjct: 3 TPPMGWLAWERFRCDI---------DC--QNDPYNCISEQLFMDMADRLSEDGWKELGYV 51
Query: 84 -----------PNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLD--------- 118
QG L + FP G LA YVH + I +
Sbjct: 52 YVNIDDCWSSKDRDAQGRLQPDPKRFPRGIPHLAQYVHDRGLKLGIYGDMGTLTCGGYPG 111
Query: 119 ---NRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW-- 171
++IE++ + GID LK D C+++ + Y MS AL GRPI +S C W
Sbjct: 112 TTLDKIETDAQTFADWGIDMLKLDGCYSNSSYQEQGYPMMSKALNATGRPIGYS-CSWPA 170
Query: 172 --GDMRP----ALWGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWND 221
G + P L G I N WR DDI DSW ++ I D + A PG WND
Sbjct: 171 YQGGLPPKVNYTLLG-QICNLWRNYDDIQDSWDSVMGIVDWFFDNQDALQPAAAPGQWND 229
Query: 222 PDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--- 278
PDML +G+ + ++ ++W+I APL + D+ +++ ++ N+ VI++NQ
Sbjct: 230 PDMLIIGDFSLSLDQSRAQMALWSIMAAPLFMSNDLRTISSAARSVLQNKAVISINQDPL 289
Query: 279 AINDQ---------QIWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIKSN-----NT 323
I + Q++ PLS G +V R T +G + +
Sbjct: 290 GIQGRRLLKEKSGIQVFQRPLSKGASALVFFSRRSDMPYRYTTSLKTLGYQPGVFEVFDV 349
Query: 324 LFEHRTLETKFQGKLTAKVDGHSCKMYELTPVT 356
E R E K + T ++ M+ + P
Sbjct: 350 FSEQRLPELKDSTQFTVSINPSGVVMWYIYPAA 382
>gi|224024031|ref|ZP_03642397.1| hypothetical protein BACCOPRO_00748 [Bacteroides coprophilus DSM
18228]
gi|224017253|gb|EEF75265.1| hypothetical protein BACCOPRO_00748 [Bacteroides coprophilus DSM
18228]
Length = 571
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 134/290 (46%), Gaps = 57/290 (19%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGE----SSRDSQISLSCSFHMFDSLTQIAAKPL 83
P MGW+SWN + I+E +IR+ D E S +++ F +
Sbjct: 62 PVMGWSSWNTYRVNIDEALIRKQADAMVELGLKESGYRYVNIDDGFFGY----------- 110
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL------ISNSLDN------------------ 119
+G L + FP+G K +A Y+H L S + N
Sbjct: 111 -RDEKGRLHTHPQRFPNGLKGVAAYIHSLGLQAGIYSEAGANTCGSIWDADKHGIGVGLY 169
Query: 120 ---RIESNIAF-EQGIDYLKYDNC-------FNDDTRPTIRYRAMSDALKKAGRPIFFSM 168
R ++++ F E G D++K D C ++ R T RA+ + R I ++
Sbjct: 170 GFERQDADLFFNEWGFDFIKIDYCGAGQQLELDEQERYTEIVRAIREVCP---RNISLNI 226
Query: 169 CEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVG 228
C W P W N+ SWR + DI+ +WA + DMN + +A G +ND DMLE+G
Sbjct: 227 CRWA--YPGTWVRNLARSWRISPDIAPNWAAVKRCIDMNLYLSAYAGGGHYNDMDMLEIG 284
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+K E HF +W I +PLL+GCD+ + E++ ++ NEE+IA+NQ
Sbjct: 285 R-GLKPEEEETHFGMWCIMSSPLLIGCDLTRIPEESLKLLRNEELIALNQ 333
>gi|398788167|ref|ZP_10550391.1| ricin B lectin [Streptomyces auratus AGR0001]
gi|396992426|gb|EJJ03534.1| ricin B lectin [Streptomyces auratus AGR0001]
Length = 435
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 112/408 (27%), Positives = 160/408 (39%), Gaps = 105/408 (25%)
Query: 24 LGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQIS 64
L PPMGWN W+++ C+I+EK + + DDCW +R S+
Sbjct: 55 LAPRPPMGWNDWSYYMCRIDEKTVLDNARALVRTGLADKGYRTVTVDDCWMSKARGSR-- 112
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ 112
G LVA+ FP G L +H +
Sbjct: 113 ------------------------GELVADPQKFPHGMAYLGRQLHRMGLKFGIYEDVGT 148
Query: 113 -----ISNSLDNRIESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIR--YRAMSDALKK 159
SL + + F Q +DY+K D C T R YR MS AL+
Sbjct: 149 LTCEKYPGSLGHFPQDARQFAQWKVDYVKADGCNVPVAAGHTKEETYRDLYRQMSQALRD 208
Query: 160 AGRPIFFSMCEW------GD---MRPALWGSNIRNSWRTTDDIS-------DSWARMLTI 203
GRPI FS+ GD R W + + N WR D++ W+ +L
Sbjct: 209 TGRPITFSVSAPAYFQYDGDSVWHRVIGWSAEVGNLWREGRDVALQQHSPAAKWSSILYN 268
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
N AD KPG WNDPD L G+ G+ E S+WA+ APL+ D+ L+ +
Sbjct: 269 FRYNAKLADLQKPGRWNDPDFLLAGDTGLTREEIQSQMSLWAVMAAPLISSTDIARLSPQ 328
Query: 264 TMPIIGNEEVIAVNQ-AINDQ----------QIWAGPLSGNRIVVLL----ESRKTFS-S 307
++GN++VIAV+Q A+ Q + A PL + L ++ +T S +
Sbjct: 329 AREVLGNKKVIAVDQDALGVQGRVVQQDGGSTVLAKPLKNRDGAIALFNSGDTARTLSVT 388
Query: 308 MMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTAK-VDGHSCKMYELTP 354
A D G + L R T G + A V HS +Y +TP
Sbjct: 389 AAAAGLPDAGSYRLHDLVTGR--RTHSDGTIVAPDVPPHSTVLYRVTP 434
>gi|342876022|gb|EGU77686.1| hypothetical protein FOXB_11805 [Fusarium oxysporum Fo5176]
Length = 553
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 95/394 (24%), Positives = 172/394 (43%), Gaps = 70/394 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP-N 85
TP MGWNS+N C N K I + + S+ + + F A++ +
Sbjct: 25 TPVMGWNSYNQVSCSPNNKTIATAIE-----ALSSRGFVDAGYKFFQIDCGWASRDTQRD 79
Query: 86 QCQGNLVANKTTFPSGTKALAD-----------------------YVHILISNSLDNRIE 122
G L + FP G K L+D Y ++ + ++
Sbjct: 80 PTTGALKVDSNAFPDGLKPLSDLARSKGMKWTMYSDAGERMCDPQYPSPVLGSLGHEAVD 139
Query: 123 SNIAFEQGIDYLKY----DNCFNDD----------TRPTIRYRAMSDALKKAGRPIFFSM 168
+ +YLK+ DNC+ D T R+ M L+K G P +
Sbjct: 140 AAFFKSLNTEYLKWYDLDDNCYADSASNNAPKDPRTDFVTRFGTMWSELQKVGIPGML-I 198
Query: 169 CEWGDMR--------PALWGSNIRNSWRTTDDISDSWARMLTIAD--MNEVYADHAKPGG 218
C+WG PA W +I S+R +DDI + W+ + I++ ++ + + + PG
Sbjct: 199 CQWGVPYSSSSGLEGPAEWTQSISTSFRLSDDIGEGWSNVYRISNQAIHIAHRNLSGPGH 258
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
D D+LEVGN GM ++E HF++WA+ K+ L++ D+ L+A+T+ ++ N+++I++NQ
Sbjct: 259 IADADLLEVGNSGMTFDEQATHFALWAMLKSALMISTDITALSAQTVAVLQNKDLISINQ 318
Query: 279 A------------INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT--L 324
D+ +WAGPL+ I VL + + ++ +G +S + L
Sbjct: 319 DAAVKPISLVQRWTGDRDLWAGPLANGDIAVLYVDQSNSARTLSLQLSALGFQSADIKDL 378
Query: 325 FEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
+ +T + + +V+GH L+ + ++
Sbjct: 379 WTGKT--STGASSFSKQVNGHGSVALRLSNIKRV 410
>gi|432946011|ref|XP_004083764.1| PREDICTED: alpha-N-acetylgalactosaminidase-like [Oryzias latipes]
Length = 431
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 145/341 (42%), Gaps = 68/341 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L+NGL TPPMGW +W F C I DC ++ + IS + M D L++
Sbjct: 19 LNNGLMRTPPMGWLAWERFRCDI---------DC--QNDPKNCISENLFMDMADRLSEDG 67
Query: 80 AKPLP------NQC--------QGNLVANKTTFPSGTKALADYVHI-------------- 111
+ L + C G L + FP G + LA Y+H
Sbjct: 68 WRELGYVYVNIDDCWSSMERDENGRLQPDPKRFPGGIRKLARYMHDRGLKLGIYGDMGTH 127
Query: 112 ----LISNSLDN-RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFF 166
+LD I++ + +D K+D C+++ T Y MS AL GRPI +
Sbjct: 128 TCGGYPGTTLDKIEIDAQTIADWEVDMFKFDGCYSNATEQEQGYPLMSKALNATGRPIGY 187
Query: 167 SMCEWGDMRPAL-------WGSNIRNSWRTTDDISDSWARMLTIADM----NEVYADHAK 215
S C W R L I N WR DDI DSW +L I D + A
Sbjct: 188 S-CSWPAYRGGLPPKVNYTQLGEICNLWRNYDDIQDSWDSVLNIIDWFFDNQDALQPAAG 246
Query: 216 PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIA 275
PG WNDPDML +G+ G+ ++ ++W+I APL + D+ +++ I+ N+ I
Sbjct: 247 PGRWNDPDMLIIGDFGLSLDQSRSQMALWSIMAAPLFMSNDLRTISSGARSILQNKIAIG 306
Query: 276 VNQAI------------NDQQIWAGPLSGNRIVVLLESRKT 304
+NQ + +++ PLS N ++ SR+T
Sbjct: 307 INQDAMGMPGRRILKEKSGIEVFWRPLSNNASALVFFSRRT 347
>gi|195119800|ref|XP_002004417.1| GI19627 [Drosophila mojavensis]
gi|193909485|gb|EDW08352.1| GI19627 [Drosophila mojavensis]
Length = 413
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/318 (28%), Positives = 129/318 (40%), Gaps = 89/318 (27%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NGL LTPPMGW SW F C I+E++ R +
Sbjct: 24 LDNGLALTPPMGWMSWQRFRCITDCDTYPDECISEQLFRRSADLLVSEGYADVGYEYVII 83
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E +RD+ + LV ++ FP G LAD++H
Sbjct: 84 DDCWLEWNRDNMTN-------------------------KLVPDRKRFPRGLNVLADHIH 118
Query: 111 -----------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNCFNDDTRPTIRYRA 152
+D+ F + +DY+K D C + + Y
Sbjct: 119 DKGLKFGLYQDFGTKTCAGYPGVIDHMALDAETFAKWDVDYVKLDGCNANLSDMETGYPE 178
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR--------NSWRTTDDISDSWARMLTIA 204
L + GRP+ +S C W PA N WR D+ DS+ ++ I
Sbjct: 179 FGRLLNETGRPMVYS-CSW----PAYQSEQPHFESLKRHCNLWRNWADVQDSFQSVMAIT 233
Query: 205 DMNEVYAD----HAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
D D HA PG WNDPDML +GN G+ Y++ + +IWA+ APL++ D+ +
Sbjct: 234 DYFAKNQDSMQPHAGPGHWNDPDMLILGNYGLSYDQSKLQMAIWAVLAAPLIMSNDLAKV 293
Query: 261 TAETMPIIGNEEVIAVNQ 278
E I+ N +VIAVNQ
Sbjct: 294 RPEIKEILQNRDVIAVNQ 311
>gi|427782003|gb|JAA56453.1| Putative alpha-d-galactosidase melibiase [Rhipicephalus pulchellus]
Length = 421
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 136/300 (45%), Gaps = 49/300 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL LTPPMGW SW F C ++ ++ D+C E ++ + + H+ D + A
Sbjct: 28 LDNGLALTPPMGWMSWERFRCNVD--CYKDPDNCLSE-----KLFMKMADHLADDGYRDA 80
Query: 80 AKP--LPNQC--------QGNLVANKTTFPSGTKALADYVH-----------------IL 112
+ + C G L+A+ FP G L+DY+H +
Sbjct: 81 GYVYLVIDDCWSERQRTNDGYLMADLERFPRGLNFLSDYIHTKGLKFGMYTNYGHSTCMG 140
Query: 113 ISNSLDNRIESNIA--FEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
+ D+ +E + IDYLK D C + + Y S+ L K RPI +S C
Sbjct: 141 FPGTEDHDMEKDAERFASWKIDYLKVDGCHTTAKQQSKGYTRFSEYLNKTNRPIVYS-CS 199
Query: 171 WG--DMRPALWGSNIR------NSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGG 218
W D+ N N WR DI D+W + ++ D EV+A +A PG
Sbjct: 200 WPYYDLFLNKVEPNFPQLEKDCNLWRNYHDIRDNWFMIESVINFFGDNQEVFAKYAGPGH 259
Query: 219 WNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
WND DML +GN + + V ++WAI +PLL+ D+ + ++ ++E+IA+NQ
Sbjct: 260 WNDADMLMIGNFRLTQEQAEVQMALWAILASPLLMSNDLRKIPEAAKKLLQHKEIIAINQ 319
>gi|171691458|ref|XP_001910654.1| hypothetical protein [Podospora anserina S mat+]
gi|170945677|emb|CAP71790.1| unnamed protein product [Podospora anserina S mat+]
Length = 421
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/412 (25%), Positives = 165/412 (40%), Gaps = 120/412 (29%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ---------INEKIIR-----------ETDDCWGESSR 59
L NG+GL P MGW+SWN C +K I DDCW SR
Sbjct: 21 LDNGVGLKPHMGWSSWNVAQCNAASARYALATADKFISLGLKDLGYEYINIDDCWSTKSR 80
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
DS G LV + +P+G KA+ D +H +
Sbjct: 81 DSS--------------------------GRLVPDPAKWPNGIKAVTDRIHSMGLKFGLY 114
Query: 113 -----------ISNSLDNRIESNIAFEQGIDYLKYDNCFN---DDTRP----------TI 148
N R + + G+D+ KYDNC+ + RP
Sbjct: 115 GCAGDKTCAGYPGNEGHERGDVDQLVSWGVDFWKYDNCYTPCRQNPRPQTCTSPAGSTKT 174
Query: 149 RYRAMSDAL--KKAGRPIFFSMCEWGDMRPALWGSNIRNSWR------------------ 188
Y M DA+ + + + F++C WG WG++ +SWR
Sbjct: 175 WYAPMRDAILGVQNTKKLHFNLCNWGRDEVWTWGASYGHSWRYVVIAHHQHPLPRLTDSR 234
Query: 189 -TTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAIS 247
+ D+ D WA + I + ++ PGG+ND DML +G+ + N+ +HF +WAI+
Sbjct: 235 MSVDNWGD-WASVERIGNAAAGIYQYSAPGGFNDLDMLYLGSSKLNTNQEKLHFGLWAIT 293
Query: 248 KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------------AINDQ--QIWAG 288
K+PL+LG D+ ++ T+ II N+ +I +NQ +N + WAG
Sbjct: 294 KSPLVLGLDLEKISNSTLDIIRNKGLIDINQDSLGKAATTFRPPGAPAPVNGKIYPYWAG 353
Query: 289 PLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTA 340
PLS +V L T + + ++ D+ + +E R + T GK T+
Sbjct: 354 PLSDGVVVGLCAG--TSAGTYSVNFKDVPGLGGSGSYEWREMYTGQTGKGTS 403
>gi|384251074|gb|EIE24552.1| glycoside hydrolase, partial [Coccomyxa subellipsoidea C-169]
Length = 294
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 132/299 (44%), Gaps = 49/299 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRETDDCW-GESSRDSQISLSCS 68
L NG+GLTPPMGW SW F C I+E++ ET D + RD+
Sbjct: 1 LDNGVGLTPPMGWLSWEKFRCNVDCSHDPANCISERLFMETADSMVADGFRDAGYE---- 56
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-----ILISNSLDNRI-- 121
F + D + G L + FP+G A Y+H I + +
Sbjct: 57 FVIIDDCWLAKGR----DEGGKLQPDPDRFPTGMTAFGRYLHRRKLKFGIYGDIGTKTCG 112
Query: 122 -----------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCE 170
++ + G+DYLK D C+ D Y + AL GRPI +S C
Sbjct: 113 GYPGMAAHLKQDAQTYADWGVDYLKVDGCYADTDTYNETYPELGVALNATGRPIVYS-CS 171
Query: 171 WG----DMRPALWGSNIRNSWRTTDDISDSWARMLTI-------ADMNEVYADHAKPGGW 219
W D P N WR DI WA + +I + +++ + D A PG +
Sbjct: 172 WPAYLPDPIPYRALKKHCNLWRNWLDIQSDWASLKSIIVFWAEASLLSKGFTDIAGPGSF 231
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
NDPDML +GN G+ ++ + WA+ APLL+G DV NLT I+ N++VIA++Q
Sbjct: 232 NDPDMLIIGNEGISDDQGKLQMGAWAMFAAPLLMGNDVRNLTDAQQAILLNKDVIAIDQ 290
>gi|451854085|gb|EMD67378.1| glycoside hydrolase family 27 protein [Cochliobolus sativus ND90Pr]
Length = 428
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 131/336 (38%), Gaps = 110/336 (32%)
Query: 30 MGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISLSCSFH 70
MGW+SWN + C INE + E DDCW S +D Q
Sbjct: 1 MGWSSWNVYRCDINETLFVEVGNLMVSLGLKDLGYSYVNIDDCW--SDKDYQR------- 51
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------I 113
D +T G + + FP G K AD +H L
Sbjct: 52 --DDIT------------GKIRPDYNKFPKGIKHTADEIHKLGLRFGIYSDSGTLTCAGY 97
Query: 114 SNSLDN-RIESNIAFEQGIDYLKYDNCF---------------------------NDDTR 145
+ SL N R+++ E G+DYLKYDNCF DD
Sbjct: 98 AGSLGNERLDAETFAEWGVDYLKYDNCFVPQEWQDRYRWRPDEWMADPPTGDQKAGDDAE 157
Query: 146 PTI----------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRT 189
+ RY+ M DAL RPI FS C WG+ WG+ +SWR
Sbjct: 158 TEVIPAPDGYDWRSSKTYTRYKNMRDALLVTNRPIQFSQCVWGNAHIDQWGNETGHSWRM 217
Query: 190 TDDISDSWARM-------LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFS 242
DI W + + I + + + G D DML+VGNG + + E HF+
Sbjct: 218 WGDIGPRWEEVQNGDWGVMPILNHASFFINGTDFWGHGDWDMLQVGNGELTFQESRTHFA 277
Query: 243 IWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
+WA K+PL++G + N+ E + I+ N E+I NQ
Sbjct: 278 MWAALKSPLIIGTPLHNIRPEILEILSNRELIDFNQ 313
>gi|389772902|ref|ZP_10192251.1| alpha-galactosidase [Rhodanobacter sp. 115]
gi|388429450|gb|EIL86796.1| alpha-galactosidase [Rhodanobacter sp. 115]
Length = 305
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/239 (34%), Positives = 114/239 (47%), Gaps = 44/239 (18%)
Query: 103 KALADYVH-----------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNC----- 139
KALADYVH + S + ++ ++ + GIDYLKYD C
Sbjct: 2 KALADYVHSRGLKLGIYSSPGPTTCAGYAGSYGHVVQDARSYAKWGIDYLKYDLCGFRGV 61
Query: 140 ----FNDDTRPTI-----RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRT 189
+ D + + Y +M AL+ RPI +S+C++G WG+ + N WRT
Sbjct: 62 VKARYPKDRKAQMLMMRDAYESMHKALEDTRRPIVYSLCQYGWDAVWDWGAKVGGNLWRT 121
Query: 190 TDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKA 249
T DI +W + TIA M A PG WNDPDMLEVGNG + E HFS WA+ A
Sbjct: 122 TGDIKPTWKSIYTIASMQVGLERFAGPGHWNDPDMLEVGNGSLTLAENRSHFSWWAMLAA 181
Query: 250 PLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRIVV 297
PL+ G D+ ++ E I+ N VIAV+Q A + ++W L +VV
Sbjct: 182 PLIAGNDLTHMPREIRDILTNRAVIAVDQDPLGKQGTRAYASDQMEVWTRSLENGALVV 240
>gi|167522765|ref|XP_001745720.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776069|gb|EDQ89691.1| predicted protein [Monosiga brevicollis MX1]
Length = 423
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/413 (25%), Positives = 158/413 (38%), Gaps = 119/413 (28%)
Query: 22 NGLGLTPPMGWNSW----------NHFHCQINEKIIRET-------------------DD 52
NG+ L P +GWN+W N++ + NE +RE DD
Sbjct: 22 NGVALKPALGWNTWCTLSDCHNGDNNYFDRCNEWELREIAEAMLSNGMHDLGFQYINLDD 81
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-- 110
CW RDS GN+ + + FPSG KA+AD++H
Sbjct: 82 CWAAQERDSN--------------------------GNIQPDPSRFPSGMKAMADWLHEK 115
Query: 111 ---------------------ILISNSLDNRIESNIAF-EQGIDYLKYDNCFNDDTRPTI 148
+ I S + E F E G+DY+K D C +
Sbjct: 116 GLKFGLYTSMGTETCNHGGRPLPIPGSFGHYAEDAKTFAEWGMDYVKVDWCGGELNDAQT 175
Query: 149 RYRAMSDALKKAGRPIFFSMCEWGDMRPAL-WGSNIRNSWRTTDDISDSWARMLTIADMN 207
++ +S AL GRPI+ +C P + + + SWR T D D W+ T+ +
Sbjct: 176 QHTELSKALNATGRPIWLELCRGYSYDPIPDYVTEVAQSWRITGDHQDEWSNTKTVIEGF 235
Query: 208 EVYADHAKPGGWNDPDMLEVGNGGMKYN-----------EYVVHFSIWAISKAPLLLGCD 256
+ ++ A P WN D L G G N EY FS+W IS +PL++ D
Sbjct: 236 MIPSNQAGPNQWNYGDFLMTGGPGCNLNNSDHCPFSSDDEYRTSFSVWTISSSPLIVSTD 295
Query: 257 VGNLTAETMPIIGNEEVIAVNQ-----------------------AINDQQIWAGPLS-G 292
+ N+TA + N++ IA+NQ D I+ LS G
Sbjct: 296 IRNMTAVMKQCLLNKQAIAINQDHESTPGRLIGQAADANCIASDLRTGDCPIFGRKLSDG 355
Query: 293 NRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETK----FQGKLTAK 341
VLL + +T ++ +G+ SN T+ + LE K F G TAK
Sbjct: 356 TYAAVLLNFADQGTRTVTLPFNWLGVASNQTMAVYDILEQKSLGNFTGSFTAK 408
>gi|443722730|gb|ELU11490.1| hypothetical protein CAPTEDRAFT_171260 [Capitella teleta]
Length = 416
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 159/395 (40%), Gaps = 106/395 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET------------------- 50
L NG+ LTPPMGW SW F C I+E +++E
Sbjct: 25 LDNGVALTPPMGWLSWERFECNTDCDVDKNNCISEWLMKEMADVMVANGYRDAGYAYVAL 84
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD + G L A+ FPSG KALADY+H
Sbjct: 85 DDCWLAHERDER--------------------------GQLQADPKRFPSGMKALADYLH 118
Query: 111 -----ILISNSLDNRI-------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ I + + ++N E G+D LK+D C + A
Sbjct: 119 QRGMKLGIYEDIGTKTCAGFPGCMGNMQKDANTMAEWGVDMLKFDGCNAHIADYDYGFPA 178
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGS-------NIRNSWRTTDDISDSWARMLTIAD 205
M+ L GR I +S CEW + A G+ N +R DI DSW ++
Sbjct: 179 MARYLNSTGRHIVYS-CEWPMYKKAQGGTCNYTAIAESCNMFRQLTDIYDSW---YSVIG 234
Query: 206 MNEVYADH-------AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVG 258
M E + D A PG WNDPD + +GN G+ +++ V +++AI +PL++ D
Sbjct: 235 MIEYFGDDPGNFSQVAGPGAWNDPDQVVIGNFGLSHDQERVQMALYAILASPLMVSADFR 294
Query: 259 NLTAETMPIIGNEEVIAVNQ---AINDQQ--------IWAGPLS--GNRIVVLLESRKTF 305
+ + I+ N VIA+NQ AI ++ IW P+ G+ + L ++++
Sbjct: 295 KIRKSSQDILLNPGVIAINQDPLAIQGKRIRNVMKLSIWTRPVQPVGSYAIAFLYTQESG 354
Query: 306 SSMMTAH--WDDIGIKSNNTLFEHRTLETKFQGKL 338
S + + D+G+ + E GK
Sbjct: 355 SPIKASANLTSDLGLTDSRGYLFTEVFENDVIGKF 389
>gi|389793900|ref|ZP_10197061.1| alpha-galactosidase [Rhodanobacter fulvus Jip2]
gi|388433533|gb|EIL90499.1| alpha-galactosidase [Rhodanobacter fulvus Jip2]
Length = 635
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 150/368 (40%), Gaps = 96/368 (26%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDS-- 61
G TPPMGWNSWN F ++E + T DD W R S
Sbjct: 41 GRAATPPMGWNSWNAFRTDVDEAKVMGTAKVLVDSGLRDLGYIYVNIDDGWWLKRRTSDG 100
Query: 62 --QI------------SLSCSFHMF-DSLTQIAAKPL------PNQCQGNLVANKTTFPS 100
Q+ S + SF F D L ++ K N C + P
Sbjct: 101 RMQVRTAIFPSAKVGGSETSSFRPFTDKLHRMGLKAGIYTDIGGNSCSQAYDLHSPNLPE 160
Query: 101 GTKALADYVHILISNSLDNRIESNIAFEQGIDYLKYDNC--------------------- 139
GT A I + + I+ A + G DY+K D C
Sbjct: 161 GTTA---ERQIGLRGHVQQDIKLYFA-DWGFDYIKVDACGINVYGADSPVVKQHKYNVYP 216
Query: 140 -------FNDDTRPTIR--YRAMSDALKKAGRP---IFFSMCEWGDMRPALWGSNIRNSW 187
N P +R Y+ ++DAL A P +S+C WG WG ++ N W
Sbjct: 217 PLIDQASINRTDIPAVRALYQQVADALA-ASNPDGDYVYSICAWGAADVRSWGKDVGNLW 275
Query: 188 RTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---EYVVHFSIW 244
RT+ DI+ W RML D A +A PG WNDPDML VG+G + E HFS+W
Sbjct: 276 RTSGDITPQWGRMLHTFDSASTRALYAGPGHWNDPDMLFVGHGDFDVHHLTEARTHFSLW 335
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVN------QAI-----NDQQIWAGPL--S 291
A+ APL++G D+ N M I N++++ VN QA+ +D QI+ L S
Sbjct: 336 AMINAPLIIGYDLRNAPPALMSIWSNKDIVRVNQDRGGHQAVLAYDSDDVQIFVKTLSDS 395
Query: 292 GNRIVVLL 299
G + V+LL
Sbjct: 396 GTKAVLLL 403
>gi|194764495|ref|XP_001964364.1| GF23088 [Drosophila ananassae]
gi|190614636|gb|EDV30160.1| GF23088 [Drosophila ananassae]
Length = 419
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 133/325 (40%), Gaps = 81/325 (24%)
Query: 9 ADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRET-------- 50
D G+ N L NGL PPMGW + F C I+E+++R T
Sbjct: 11 CDCVGTHIVNGLENGLARKPPMGWMPFERFRCVTDCSRFPKDCISERLMRRTANLLVSEG 70
Query: 51 -----------DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFP 99
DDCW E+SRD + L+ ++ FP
Sbjct: 71 YAAAGYRYLIIDDCWMEASRD-------------------------KATHELLPSEDRFP 105
Query: 100 SGTKALADYVH-----ILISNSLDNR--------------IESNIAFEQGIDYLKYDNCF 140
SG + L +Y+H I + L + +++ G+DY+K D CF
Sbjct: 106 SGMRDLGNYIHNKGLLFGIYHDLGEKTCMFHGPGAARHFNLDAQTFANWGVDYVKMDGCF 165
Query: 141 NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR---NSWRTTDDISDSW 197
+ Y AL K GRP+ +S C W + I N WR DDI DS
Sbjct: 166 ASEIELDRGYPEFGRALNKTGRPMVYS-CSWPFYKAQPDFKLIAKHCNLWRFADDIQDSV 224
Query: 198 ARMLTIADMNEV----YADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLL 253
A + I HA PG WNDPDML +GN + Y+ + +IWA+ APL++
Sbjct: 225 ASVTNIMKQYSKSQIELTTHAGPGRWNDPDMLVLGNYHLSYDASRLQLAIWAVIAAPLIM 284
Query: 254 GCDVGNLTAETMPIIGNEEVIAVNQ 278
D+ ++ E ++ N ++I+VNQ
Sbjct: 285 TNDLESVRPEIKELLLNRDIISVNQ 309
>gi|256393779|ref|YP_003115343.1| ricin B lectin [Catenulispora acidiphila DSM 44928]
gi|256360005|gb|ACU73502.1| Ricin B lectin [Catenulispora acidiphila DSM 44928]
Length = 574
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 143/374 (38%), Gaps = 110/374 (29%)
Query: 28 PPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQISLSCS 68
PPMGWN W H+ C ++E + DDCW SSRDS
Sbjct: 57 PPMGWNDWAHYQCSVDESTVVANANALVSSGLAAKGYKTVTVDDCWMASSRDSG------ 110
Query: 69 FHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH------------------ 110
G LVAN T FP G L Y+H
Sbjct: 111 --------------------GTLVANSTKFPHGMAWLGSYLHSKGLNFGIYEDAGSSTCG 150
Query: 111 ------ILISNSLDNRIESNIAFEQ-GIDYLKYDNC-----FNDDTRPTIR--YRAMSDA 156
D+ F G+DYLK D C + T Y A S A
Sbjct: 151 GYPGSGQPQGGGADHFAHDAATFASWGVDYLKLDGCNVYIPSGESTEQAYHNAYTAESTA 210
Query: 157 LKKAGRPIFFSMC--------EWGD---MRPALWGSNIRNSWRTTDDIS---------DS 196
L AGRPI FS EWG+ W + WR DI+
Sbjct: 211 LANAGRPIVFSESAPAYFQSGEWGNPTWFDVLGWVGQLGQLWREGYDIATYNSGNPTASR 270
Query: 197 WARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCD 256
W+ +++ N A +A PG WNDPD L G+ G+ E ++WA+ AP++L D
Sbjct: 271 WSSVMSNYGYNRWIARYAHPGNWNDPDFLIAGDPGLTAEESRSQVALWAMMNAPMILSSD 330
Query: 257 VGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS-GNRIVVLLESRKT 304
V NL+A+ + +GN ++IA++Q +Q + A PL+ G+R V +L R +
Sbjct: 331 VANLSADGLAALGNTDLIALDQDSAGRQAGVVSTNGTTDVLAKPLANGDRAVAVLN-RGS 389
Query: 305 FSSMMTAHWDDIGI 318
S ++ IG+
Sbjct: 390 ASQNVSTTLASIGL 403
>gi|406863235|gb|EKD16283.1| putative alpha-galactosidase B [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 447
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 148/371 (39%), Gaps = 97/371 (26%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCW----G 55
L + +G P +GWNSWN F C ++E I DDCW G
Sbjct: 18 LQKDNVGKLPALGWNSWNAFGCNVDETKIMTAANQIVALGLKDAGYEYVNIDDCWSTKSG 77
Query: 56 ESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPS--GTKALADYVHILI 113
++ D QI L++ + + + + K S G A Y L
Sbjct: 78 RNTSDQQI--------IPDLSKFPTGIIGTAQKIHSLGLKIGIYSSAGVSTCAGYPASLG 129
Query: 114 SNSLDNRIESNIAFEQGIDYLKYDNC---------FND------------DTRPTI---- 148
++D + + A+E IDYLKYDNC +ND T P +
Sbjct: 130 HEAID--VATWAAWE--IDYLKYDNCGVPPRWNDQYNDCVPELSGGPYPNGTCPNLQNPA 185
Query: 149 -------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD 195
RY M DAL R I +S+C WG WG+ NSWR++ DI
Sbjct: 186 PRGYDWSQSNSFRRYATMRDALLLQNRTILYSLCNWGLAGVDSWGNATGNSWRSSGDIEP 245
Query: 196 SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
SW+R++ I + N G ND DMLEVGNGG+ E HF+ WA K+P+++G
Sbjct: 246 SWSRIMEIFNFNSFQLTSVGFWGHNDADMLEVGNGGLTDAECRSHFAFWAAMKSPIIIGT 305
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ----------------------AINDQQIWAGPLSGN 293
++ L + ++ N ++A NQ N + W+G SG
Sbjct: 306 ELERLKKGVVDVLKNGYLLAFNQDDVYGGPATPYKWGVNPDWTFNKTNPAEYWSGRSSGG 365
Query: 294 RIVVLLESRKT 304
+V+ S T
Sbjct: 366 TLVLAFNSLGT 376
>gi|300785302|ref|YP_003765593.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|384148591|ref|YP_005531407.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|399537186|ref|YP_006549847.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|299794816|gb|ADJ45191.1| alpha-galactosidase [Amycolatopsis mediterranei U32]
gi|340526745|gb|AEK41950.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
gi|398317956|gb|AFO76903.1| alpha-galactosidase [Amycolatopsis mediterranei S699]
Length = 635
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 157/345 (45%), Gaps = 64/345 (18%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
P MGW SWN F +I+ I+ D + +D+ S + T+ +A
Sbjct: 37 PAMGWASWNTFAAKIDYGTIKAQADALVSAGLKDAGYSYVNIDEGWWQGTRDSA------ 90
Query: 87 CQGNLVANKTTFPSGTKALADYVH---------------------------ILISNSLDN 119
GN+ + +P G KA+ADY+H S S +
Sbjct: 91 --GNITVDTAEWPGGMKAIADYLHSKGLKAGIYTDAGRDGCGYYFPTGRPAAPGSGSEGH 148
Query: 120 RIESNIAFEQ-GIDYLKYDNCFND--DTRPTIRYRAMSDALKKA----GRPIFFSMCEWG 172
++ + F++ G D++K D C D P YRA+SDA + A GR + S+C+WG
Sbjct: 149 YLQDMLQFQRWGFDFVKVDWCGGDAEGLDPQSTYRAISDANQAATAQTGRTLTLSICDWG 208
Query: 173 DMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
P WG+ WRT+ DI + + ++LT D + + G NDPDML V
Sbjct: 209 RKNPWNWGAGTAPMWRTSTDIIYYGQTANLGQVLTNFDQAQ-HPLSQHTGYVNDPDMLTV 267
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
G G+ + +W++S APLL G ++ ++A T I+ N EVIAV+Q
Sbjct: 268 GMPGLTDAQARTELGLWSVSGAPLLAGNNLATMSAATKSILTNREVIAVDQDPSGLQGVK 327
Query: 279 AINDQ---QIWAGPLSGN--RIVVLLESRKTFSSMMTAHWDDIGI 318
D Q+++ LSG+ R VVLL R + ++ MT W D+G+
Sbjct: 328 VAEDAAGLQVYSKVLSGSGKRAVVLLN-RTSAAAAMTVRWADLGL 371
>gi|395861495|ref|XP_003803019.1| PREDICTED: alpha-galactosidase A [Otolemur garnettii]
Length = 374
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 132/309 (42%), Gaps = 76/309 (24%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMF 72
G+ R L NGL +TP MGW W F C ++ + E C E L S
Sbjct: 26 GNPRARALDNGLAMTPTMGWLHWERFMCNVDCR--EEPYSCISEQLFMQMADLMVS---- 79
Query: 73 DSLTQIAAKPL---------PNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIES 123
D + + L +G L A+ FP G + LA YV
Sbjct: 80 DGWKDVGYEYLCIDDCWMAPKRDLEGKLQADPERFPGGIRRLASYVS------------- 126
Query: 124 NIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWG-DMRPALWGS- 181
Y+ MS AL K GR I +S CEW MRP +
Sbjct: 127 --------------------------YKHMSLALNKTGRSIVYS-CEWPLYMRPFKKPNY 159
Query: 182 -NIR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDMLEVGNGGMK 233
IR N WR+ DISDSW + +I D E A PGGWNDPDML +GN G+
Sbjct: 160 KEIRQYCNHWRSFADISDSWESVKSILDWTSSNQEKIVGVAGPGGWNDPDMLVIGNFGLS 219
Query: 234 YNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AIND 282
+++ V H ++WAI APL + D+ +++ + ++ +++VIA+NQ ++
Sbjct: 220 WDQQVTHMALWAIMAAPLFMSNDLRSISPQAKALLQDKDVIAINQDPLGKQGYLLRKVDS 279
Query: 283 QQIWAGPLS 291
++W PLS
Sbjct: 280 FEVWERPLS 288
>gi|375144026|ref|YP_005006467.1| alpha-galactosidase [Niastella koreensis GR20-10]
gi|361058072|gb|AEV97063.1| Alpha-galactosidase [Niastella koreensis GR20-10]
Length = 514
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 174/421 (41%), Gaps = 79/421 (18%)
Query: 2 FIVSSSAADSNGSDRRN---LLSNGLGLTPPMGWNSWNHFHCQINEKII----------- 47
+ ++ A + G+ +R+ ++ + LTPPMGWNSWN + ++ ++++
Sbjct: 99 YTITLHAKNKLGATKRSFKIVVGEAIALTPPMGWNSWNIYASKVTQELVLANAKAMASSG 158
Query: 48 --------RETDDCW-----GE--------SSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
DD W GE ++ + +L H I + P
Sbjct: 159 LIDHGWNYMNIDDVWQGKRGGEFGGILPDSTTFPNMQALVNDIHQLGLKAGIYSTPWVES 218
Query: 87 -----------CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGIDYLK 135
+G V P K L + I D + + + G DYLK
Sbjct: 219 YGHHIGGSAINAEGTFVRTTENIPRNKKQLPYAIGQYIFWDKDVQQWA----KWGFDYLK 274
Query: 136 YDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD 195
YD +N P +AM D L+ +GR + FS+ S I N+WRT DI D
Sbjct: 275 YD--WNPIEVP--ETKAMYDLLRNSGRDVVFSLSNSTPFAGINELSKIANTWRTGGDIRD 330
Query: 196 SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG---------MKYNEYVVHFSIWAI 246
SW + + + +A +A PG WNDPDM+ VG G + +E H S W +
Sbjct: 331 SWKSLKSRLLTQDKWAPYASPGHWNDPDMMIVGWVGWGKGPYPTHLTPDEQYAHMSAWCL 390
Query: 247 SKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLS-GNR 294
PLLLGCD+ L A T+ ++ N+EV+AVNQ +Q + A L+ G++
Sbjct: 391 QSVPLLLGCDLTKLDAFTLSLLTNDEVLAVNQDPLGKQATIVSKTDSCGVLAKDLADGSK 450
Query: 295 IVVLLESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLETKFQGKLTAKVDGHSCKMYE 351
L + + +T W D+GI+ L+ + L ++ + +A V H M
Sbjct: 451 AAGLFNVTDSIARKLTVKWSDLGIQGAYIVRDLWRQKDLGV-YKDEFSADVPPHGVIMIS 509
Query: 352 L 352
+
Sbjct: 510 I 510
>gi|327284822|ref|XP_003227134.1| PREDICTED: alpha-galactosidase A-like [Anolis carolinensis]
Length = 393
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 126/293 (43%), Gaps = 67/293 (22%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW +RD Q G L + FPSG LADYVH
Sbjct: 48 DDCWMAPTRDEQ--------------------------GRLQPDPKRFPSGIGKLADYVH 81
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNC-FNDDTRPTIRYR 151
+ I + N+ +++ G+D LK+D C F Y+
Sbjct: 82 SKGLKLGIYADVGNKTCAGFPGSFGSYQLDAETFANWGVDLLKFDGCSFGTLELLAEGYK 141
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR------NSWRTTDDISDSWARMLTIAD 205
MS AL K GR I +S CEW + N N WR DI D+W + + D
Sbjct: 142 TMSQALNKTGRSIVYS-CEWPLYERPMQEPNYTEIKQYCNYWRNYADIYDAWNSVKNVLD 200
Query: 206 MNEVYADH----AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ D A PGGWNDPDML +G+ G+ +++ V ++WAI APLL+ D+ +
Sbjct: 201 WTSYHQDSLVDIAGPGGWNDPDMLVIGDFGLSWDQQVTQMALWAIMAAPLLMSNDLRRIN 260
Query: 262 AETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRK 303
+ ++ N++VIA+NQ +Q +W PLSG + + +R+
Sbjct: 261 PQAKALLQNKDVIAINQDPLGKQGYRITKDNYFELWERPLSGGAFAIAVVNRQ 313
>gi|170594051|ref|XP_001901777.1| Melibiase family protein [Brugia malayi]
gi|158590721|gb|EDP29336.1| Melibiase family protein [Brugia malayi]
Length = 415
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/325 (29%), Positives = 129/325 (39%), Gaps = 98/325 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------T 50
L NGL TPPMGW SW F+CQ I+EK+ +
Sbjct: 19 LENGLARTPPMGWMSWTKFYCQTDCVLHPFTCISEKLYMDMADRMVEDGYLQAGYQYVHV 78
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD Q LV ++ FP+G LADY+H
Sbjct: 79 DDCWMERKRDQQ--------------------------DRLVPDRQRFPNGMAXLADYMH 112
Query: 111 I------------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ +I+++ E +DYLK D C D Y
Sbjct: 113 QRGLKFGIYEDYGTTTCAGFPGSYKHTKIDADTFAEWQVDYLKLDGCNIDVDLMPAGYAE 172
Query: 153 MSDALKKAGRPIFFSMCEWG-----------DMRPALWGSNIRNSWRTTDDISDSWARML 201
M L GRPI +S C W + L G + N WR DDI SWA +
Sbjct: 173 MGXMLNLTGRPIVYS-CSWPAYLINQPEKVISVNYQLIGQHC-NLWRNFDDIKRSWASVR 230
Query: 202 TIADMNEVYADHAK--------PGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLL 253
TI D Y DH + PG W+DPDM+ VGN + ++ V SIW++ +PL++
Sbjct: 231 TIID----YYDHHQDKHIPAQGPGRWHDPDMIIVGNTELTVDQAKVQMSIWSVWSSPLIM 286
Query: 254 GCDVGNLTAETMPIIGNEEVIAVNQ 278
D+ + I+ N VIAV+Q
Sbjct: 287 SNDLRLIAPVFRDILLNRRVIAVDQ 311
>gi|256840614|ref|ZP_05546122.1| melibiase [Parabacteroides sp. D13]
gi|423331208|ref|ZP_17308992.1| hypothetical protein HMPREF1075_01005 [Parabacteroides distasonis
CL03T12C09]
gi|256737886|gb|EEU51212.1| melibiase [Parabacteroides sp. D13]
gi|409230504|gb|EKN23366.1| hypothetical protein HMPREF1075_01005 [Parabacteroides distasonis
CL03T12C09]
Length = 530
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 161/384 (41%), Gaps = 65/384 (16%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN + I++K+I + D SR+ + + D
Sbjct: 22 PTMGWSSWNTYRVNISDKLIMKQADA---MSRNGLKEVGYKYINIDD-----GYFGGRDA 73
Query: 88 QGNLVANKTTFPSGTKALADYVHIL------ISNSLDNRIES---------NIAF----- 127
G L+ + FP+G K D++H L S++ N S N+ F
Sbjct: 74 SGELITHPVRFPNGLKQTVDHIHALGFKAGIYSDAGRNTCGSFWDKDSLGINVGFYGHDR 133
Query: 128 --------EQGIDYLKYDNCFNDDTR--------PTIRYRAMSDALKKAGRP-IFFSMCE 170
E G D++K D C D + RY A+ +A+ GR + ++C
Sbjct: 134 QDADYFFKEIGFDFIKIDFCGGDAKQNFDQLGLDEQERYTAIHNAILATGRKDVRMNVCR 193
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
W P W ++ SWR + DI+ SW + I N + +A G +ND DMLE+G G
Sbjct: 194 WN--FPGTWVHDVAFSWRISQDINPSWESVKNIIRQNLYLSAYASEGKYNDMDMLEIGRG 251
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWA--- 287
M E HF +W I +PLL+GCD+ ++ +++ ++ NEE+IA+NQ + Q +
Sbjct: 252 -MSEEEDKTHFGMWCIMSSPLLIGCDLTTISEKSLRLLKNEELIALNQDVLGLQAYVVKQ 310
Query: 288 -----------GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLETK 333
++GN V + + + + D ++ + LFE R L
Sbjct: 311 MDGVYILTKDLEEVNGNTCAVAVYNSTDEERTIHLDFADFYLRGDVSVRDLFERRDLGKY 370
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
+ + H +++ L + +
Sbjct: 371 KDRDFSVTIPAHGTRIFRLDGLER 394
>gi|300773380|ref|ZP_07083249.1| 30S ribosomal protein S32 [Sphingobacterium spiritivorum ATCC
33861]
gi|300759551|gb|EFK56378.1| 30S ribosomal protein S32 [Sphingobacterium spiritivorum ATCC
33861]
Length = 388
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 131/282 (46%), Gaps = 41/282 (14%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
P MGW+SWN F I+EK+I+E D+ IS + +
Sbjct: 28 APIMGWSSWNSFRVHIDEKLIKE--------QADALISSGLYKAGYRYINVDDGYFGGRD 79
Query: 87 CQGNLVANKTTFPSGTKALADYVHI------LISNSLDN--------------------- 119
G L + T FP+G A+A YVH L S N
Sbjct: 80 KNGKLYVDSTKFPNGMGAIAAYVHSKGLKAGLYSEGGKNTCGSMWDNDTKGVGVGMYGHE 139
Query: 120 RIESNIAF-EQGIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDMRP 176
+ ++ + F E D++K D C + + +Y + A+K+ F++C W P
Sbjct: 140 KEDAELFFGEWNFDFIKVDWCGGQEMKLNEEEQYTKIVKAVKEVKPEAGFNLCRW--QFP 197
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
W + +SWR + DI +++ +L I D+N ++ PG +ND DML+VG GM Y E
Sbjct: 198 GEWAIKLVDSWRISGDIRNNFDSVLEIIDLNRNLYKYSSPGHYNDMDMLQVGR-GMSYEE 256
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
HFS+W + +PL+ G D+ ++ +T+ I+ N+E+IA+NQ
Sbjct: 257 DKTHFSMWCMLNSPLMAGNDLRTISKQTIEILTNKELIALNQ 298
>gi|405965493|gb|EKC30862.1| Alpha-N-acetylgalactosaminidase [Crassostrea gigas]
Length = 425
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 135/331 (40%), Gaps = 102/331 (30%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQIN----------EKIIRET------------------- 50
L NGL + PPMGW +W F C IN EK+I +
Sbjct: 35 LDNGLAIQPPMGWMAWERFRCNINCTEDPDNCISEKLIVQMCDKMVAGGFRDAGYTYIAL 94
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RD PL G LV ++T FP G KALADYVH
Sbjct: 95 DDCWAEKRRD---------------------PL----TGKLVPDRTRFPRGMKALADYVH 129
Query: 111 -----ILISNSLDNRI-------------ESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ I + + + ++ + G+D LK D C+ I Y
Sbjct: 130 RQGMKLGIYSDMGTKTCKEYPGSEFYIQTDAQTFADWGVDMLKLDCCYGGSGM-EIGYET 188
Query: 153 MSDALKKAGRPIFFSMCEWGDMRPALWGSNIR--------NSWRTTDDISDSWARMLTIA 204
M L K GRP+ +S C + P G ++ N WR D++DSW R+ +
Sbjct: 189 MGFFLNKTGRPVLYS-CSY----PVCLGGHVIYERVAKTCNMWRNAIDLTDSWDRVYKVI 243
Query: 205 DMNEVYADH-------AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDV 257
+Y D+ A PG WND D L VG+ G+ + +WA+ PL + D+
Sbjct: 244 ---RIYGDNIGNFSAAAGPGHWNDADQLMVGDFGLSNGQQTAQIVMWAMWSVPLFMSVDL 300
Query: 258 GNLTAETMPIIGNEEVIAVNQAINDQQIWAG 288
++ ++ N+++I++NQ WAG
Sbjct: 301 RDIGVFAEELLLNKDIISINQD------WAG 325
>gi|225711828|gb|ACO11760.1| Alpha-N-acetylgalactosaminidase [Lepeophtheirus salmonis]
Length = 461
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/294 (31%), Positives = 130/294 (44%), Gaps = 43/294 (14%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL TPPMGW SW F C N K + ++C E + L S D +
Sbjct: 21 LDNGLARTPPMGWMSWQRFRCDTNCK--DDPENCISERLFKTMADLLVSQGFKDVGYEYI 78
Query: 80 AKPLPNQC--------QGNLVANKTTFPSGTKALADYVHIL------------------- 112
+ + C G L + FPSG KALADYVH L
Sbjct: 79 ---IIDDCWLSRTRDKDGKLQPDPERFPSGIKALADYVHNLGLKFGIYEDFGTHTCAGYP 135
Query: 113 -ISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW 171
I N+L + ++ E +DYLK D C+ + T Y L + GRPI +S C W
Sbjct: 136 GILNNL--KKDAFTIAEWEVDYLKVDGCYVNVTLMDKGYPEFGKYLNQTGRPILYS-CSW 192
Query: 172 GDMRPALWGSNIR---NSWRTTDDISDSWARMLTIADM----NEVYADHAKPGGWNDPDM 224
+ +I N WR DI DS+ +L I D + + A PG +NDPDM
Sbjct: 193 PACQKNPDYKSIAKYCNIWRNGGDIQDSFNSVLGITDFFGTNQDTFISVAGPGHFNDPDM 252
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
L +G+ + ++ ++WA APL++ D+ +L E I+ N ++I VNQ
Sbjct: 253 LIIGDFALSIDQSQYQMAVWATLAAPLIMSNDLRSLRPEFKEILQNRKIIRVNQ 306
>gi|396487065|ref|XP_003842549.1| similar to alpha-galactosidase [Leptosphaeria maculans JN3]
gi|312219126|emb|CBX99070.1| similar to alpha-galactosidase [Leptosphaeria maculans JN3]
Length = 449
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 133/341 (39%), Gaps = 110/341 (32%)
Query: 25 GLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQISL 65
G P MGWNSWN + C INE + + DDCW S RD +
Sbjct: 27 GRLPAMGWNSWNEYECNINETVFLDVGKLLVSLGLKDLGYTYVNIDDCW--SDRDKRR-- 82
Query: 66 SCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------- 112
D++T G +V + FP G K AD +H L
Sbjct: 83 -------DNVT------------GKIVPDYKKFPQGIKHTADEIHKLGLQLGIYSDAGTS 123
Query: 113 -----ISNSLDNRIESNIAFEQGIDYLKYDNCF--------------------------- 140
+ + +++ E G+DYLKYDNC
Sbjct: 124 TCGGYVGSLGHEELDAQTWAEWGVDYLKYDNCAVPDEWDDEYRWWPENWLGGPPNEDQTA 183
Query: 141 --NDDTRPTI--------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR 184
+ + RP RY+ MSDAL R I +S C WG WG++
Sbjct: 184 GGDGEARPVPAPAGYDWTTSKSFDRYKMMSDALLAINRTIEYSQCAWGHAHIEQWGNSTG 243
Query: 185 NSWRTTDDISDSWAR-------MLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEY 237
+SWR DI W+ ++ I + + + + G D DMLEVGNG + E
Sbjct: 244 HSWRMWGDIYPEWSGKWQWSWGLMPILNHASFFYNSSNFWGHADWDMLEVGNGNLTLEES 303
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
HF++WA K+PL++G + + E + I+ N E+IA NQ
Sbjct: 304 RSHFALWAALKSPLIIGTPLDGIKPEILKILSNPELIAFNQ 344
>gi|451855069|gb|EMD68361.1| carbohydrate-binding module family 35 protein [Cochliobolus sativus
ND90Pr]
Length = 551
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 147/352 (41%), Gaps = 64/352 (18%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK-PLPN 85
TP MGWNS+N C N + I T + S+ ++ + F A++ +
Sbjct: 25 TPVMGWNSYNQMSCSPNAQKITTTINALA-----SRGFVAAGYKFFQIDCGWASRDGQRS 79
Query: 86 QCQGNLVANKTTFPSGTKALADYVHIL----------------------ISNSLDNRIES 123
G L + FP+G + LAD + SL +
Sbjct: 80 STNGALKVDLKAFPNGLQPLADLTRSKGMKWTMYSDAGVRMCDPQSPSPVLGSLGHEAAD 139
Query: 124 NIAFEQ-GIDYLKYDNCFNDD------------TRPTIRYRAMSDALKKAGRPIFFSMCE 170
F+ G +Y+KYDNC+ D T R AM L+K G P +C+
Sbjct: 140 AAFFKSLGTEYVKYDNCYADGPNANQNAPKNARTDFVTRMGAMWKELQKVGIPGML-ICQ 198
Query: 171 WGD--------MRPALWGSNIRNSWRTTDDISDSWARMLTIAD--MNEVYADHAKPGGWN 220
WG P W I S+R +DDI+ WA M I + ++ + + PG
Sbjct: 199 WGVPYSTSAGLEGPDEWTKGISTSFRLSDDITSGWASMFRIYNQAIHIAKSGNVGPGHIA 258
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA- 279
D D+LEVGN M ++E HF+ WA+ K+ L++ DV L+ E + ++ N+++IA+NQ
Sbjct: 259 DADLLEVGNSAMSFDEQATHFAAWAMLKSALMISTDVAALSNELVTVLQNKDLIAINQDS 318
Query: 280 -----------INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
D+ +WAG L+ + VL +T D+GI S
Sbjct: 319 AVKPVTLRQRWTGDRDLWAGDLANGDMAVLAVDLSNTGRTLTVQLADLGIAS 370
>gi|449269964|gb|EMC80699.1| Alpha-galactosidase A, partial [Columba livia]
Length = 364
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 137/302 (45%), Gaps = 77/302 (25%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW +RD Q G L + FPSG + LADYVH
Sbjct: 28 DDCWMAPTRDQQ--------------------------GRLQPDPKRFPSGIRKLADYVH 61
Query: 111 -----ILISNSLDNR-------------IESNIAFEQGIDYLKYDNCFNDDTRPTIR--Y 150
+ I + + N+ +++ G+D LK+D C N T + Y
Sbjct: 62 SKGLKLGIYSDVGNKTCAGFPGSYDHYDLDAQTFASWGVDLLKFDGC-NSGTLELLAEGY 120
Query: 151 RAMSDALKKAGRPIFFSMCEWG-DMRPALWGSNIR-----NSWRTTDDISDSWARMLTIA 204
R MS AL K GR I +S CEW +RP + N WR D+ DSW+ + +I
Sbjct: 121 RRMSLALNKTGRSIVYS-CEWPFYLRPVQQPNYTEIKHYCNHWRNFFDVYDSWSSIKSIL 179
Query: 205 DMNEVYADH----AKPGGWNDPDM--------LEVGNGGMKYNEYVVHFSIWAISKAPLL 252
D ++ D A PGGWNDPDM L +GN G+ +++ V ++WAI APL
Sbjct: 180 DWTALHQDSIVKIAGPGGWNDPDMASIVFLSQLVIGNFGLSWDQAVTQMAMWAIMAAPLF 239
Query: 253 LGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLES 301
+ D+ +++ E ++ N++VIA+NQ + +Q +W PLSG V + +
Sbjct: 240 MSNDLRHISPEAKWLLQNKDVIAINQDVLGKQGYQITKDKHFELWERPLSGGAYAVAVLN 299
Query: 302 RK 303
++
Sbjct: 300 QQ 301
>gi|116205487|ref|XP_001228554.1| hypothetical protein CHGG_10627 [Chaetomium globosum CBS 148.51]
gi|88176755|gb|EAQ84223.1| hypothetical protein CHGG_10627 [Chaetomium globosum CBS 148.51]
Length = 417
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 103/409 (25%), Positives = 171/409 (41%), Gaps = 102/409 (24%)
Query: 20 LSNGLGLTPPMGWNSWNHFHC-QINEKIIRET-------------------DDCWGESSR 59
L NG+G P MGW+SWN C + K +T DDCW SR
Sbjct: 24 LDNGIGRKPHMGWSSWNVAQCNSASAKYAIDTANKFISLGLKDLGYEYINIDDCWTTRSR 83
Query: 60 DSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------- 112
D G LV + + +P+G K + D +H +
Sbjct: 84 DGN--------------------------GKLVPDPSKWPNGIKPVVDQIHGMGLKFGLY 117
Query: 113 -----------ISNSLDNRIESNIA--FEQGIDYLKYDNCFN---DDTRP---------- 146
+ S++A + G+D+ KYDNC+ D+ P
Sbjct: 118 GCAGQQTCAGFPGSDGSKHAASDVAQLADWGVDFWKYDNCYTPCLDNPPPQTCGRPNGNT 177
Query: 147 TIRYRAMSDAL--KKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIA 204
Y M DA+ + I F++C WG WG+ +SWR + D WA + I
Sbjct: 178 KTWYAPMRDAILAVQDKHKIHFNLCNWGRDEVWKWGAEYGHSWRMSTDNWGDWASVERIG 237
Query: 205 DMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAET 264
+ + ++ PGG+ND DML +G+ + ++ +HF +WAI+K+PL+LG D+ ++ T
Sbjct: 238 SSSAAISGYSGPGGFNDLDMLYLGSSKINADQERIHFGLWAIAKSPLVLGLDLNKISDWT 297
Query: 265 MPIIGNEEVIAVNQ-----------------AINDQ--QIWAGPLSGNRIVVLLESRKTF 305
+ II N+ +IA+NQ +N + WAGPLS + +V+ L + +
Sbjct: 298 LGIIRNKGIIAINQDSLGKPATTFTPPGKQGPVNGKIHPYWAGPLS-DGVVIGLCAGTSG 356
Query: 306 SSMMTAHWDDIGIKSNNTLFEHRTLETKFQGK-LTAKVDGHSCKMYELT 353
+ + D G+ + +E K GK ++ + H ++ ++T
Sbjct: 357 GTYGVSFKDVPGLSDSEYSWEEMYTGQKGTGKSISFDIKLHDMRVIKVT 405
>gi|351708354|gb|EHB11273.1| Alpha-N-acetylgalactosaminidase [Heterocephalus glaber]
Length = 454
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 126/289 (43%), Gaps = 49/289 (16%)
Query: 30 MGWNSWNHFHCQ----------INEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
MGW SW F C I+E++ E D ++D L + D I
Sbjct: 1 MGWLSWERFRCNTDCDKDPKNCISERLFMEMAD---RLAKDGWRDLGYIYLNIDD-CWIG 56
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------ISNSLDNRI 121
+ +G+L+ + FP G LADY H L +LD
Sbjct: 57 GR----DAKGHLMPDSKRFPHGIAFLADYAHSLGLKLGIYEDMGNLTCMGYPGTTLDKVE 112
Query: 122 ESNIAF-EQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW----GDMRP 176
+ F E +D LK D CF+ Y M+ AL GRPI FS C W G + P
Sbjct: 113 QDAQTFAEWKVDMLKLDGCFSTSEDRAKGYPMMAAALNATGRPIAFS-CSWPAYEGGLPP 171
Query: 177 AL---WGSNIRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWNDPDMLEVGN 229
+ ++I N WR DDI DSW +L++ D +V A PG WNDPDML +GN
Sbjct: 172 KVNYSLLADICNLWRNYDDIQDSWKDVLSVLDWFVLHQDVLQPVAGPGHWNDPDMLIIGN 231
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ + + ++W + APL + D+ ++ + M I+ N +I +NQ
Sbjct: 232 FGLSFEQARAQMALWTVLAAPLFMSTDLRTISPQNMDILQNPLIIKINQ 280
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 12/92 (13%)
Query: 225 LEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---AIN 281
L +GN G+ + + ++W + APL + D+ ++ + M I+ N +I +NQ I
Sbjct: 293 LIIGNFGLSFEQARAQMALWTVLAAPLFMSTDLRTISPQNMDILQNPLIIKINQDPLGIQ 352
Query: 282 DQQI---------WAGPLSGNRIVVLLESRKT 304
++I + PLS ++ SR+T
Sbjct: 353 GRRILKEKSQIEVFKRPLSDEASALVFFSRRT 384
>gi|195150075|ref|XP_002015980.1| GL11348 [Drosophila persimilis]
gi|194109827|gb|EDW31870.1| GL11348 [Drosophila persimilis]
Length = 427
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/349 (26%), Positives = 142/349 (40%), Gaps = 105/349 (30%)
Query: 22 NGLGLTPPMGWNSWNHFHCQ----------INEKIIRE-------------------TDD 52
NGL LTPPMGW SW F C I+E++ R DD
Sbjct: 26 NGLALTPPMGWMSWERFRCLTDCKLYPNECISEELFRRHADLLVSEGYADAGYEYIIIDD 85
Query: 53 CWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-I 111
CW E +RD++ LVA+K FP+G LAD++H I
Sbjct: 86 CWLEKNRDNKTK-------------------------KLVADKKRFPNGLNPLADHIHNI 120
Query: 112 LISNSLDN-----------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMS 154
+ L ++++ + +DY+K D C+ + + + Y
Sbjct: 121 GLKFGLYQDYGTNTCAGFPGVIKHMQLDAQTFADWDVDYVKLDGCYANISDMAVGYPEFG 180
Query: 155 DALKKAGRPIFFSMCEW-------GDMRPALWGSNIRNSWRTTDDISDSWARMLTIADM- 206
L GRP+ +S C W G+M N WR DDI DS + I D
Sbjct: 181 RLLNATGRPMVYS-CSWPAYQEDAGEMPDYTSLKQHCNLWRNWDDIDDSLESVAQIMDYF 239
Query: 207 ---NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
+ HA PG WNDPDML +GN G+ Y++ + +IW++ APL++ D+ + E
Sbjct: 240 GKNQDRIQPHAGPGHWNDPDMLLLGNYGLSYDQSKLQMAIWSVMAAPLIMSNDLAVVRPE 299
Query: 264 TMPIIGNE----------EVIAVNQ-----------AINDQQIWAGPLS 291
I+ N +VIAV+Q + N ++W+ P++
Sbjct: 300 IKDILQNRVCSAFEDLKIKVIAVDQDELGIQGRRILSKNQIEVWSRPIT 348
>gi|320169948|gb|EFW46847.1| alpha-N-acetylgalactosaminidase [Capsaspora owczarzaki ATCC 30864]
Length = 395
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 157/343 (45%), Gaps = 59/343 (17%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGES----SRDSQISLSCSFHMFDSL 75
L NGL TPPMGW +W F C +N + ++C E+ D +S +D++
Sbjct: 33 LENGLARTPPMGWLAWERFRCDVN--CTSDPNNCISETLFKQMADHMVSGGYLAAGYDTI 90
Query: 76 TQ----IAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------------ISNSLDN 119
++ L N G L+A+ FP+G K LADYVH I +D
Sbjct: 91 IMDDCWMSTSRLAN---GTLLADPVRFPNGVKGLADYVHSKDYGTNTCGGYPGIYGYID- 146
Query: 120 RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGD-MR--- 175
I++ GIDY+K D C D P + GRP+ +S C W D MR
Sbjct: 147 -IDTRDYASWGIDYVKMDGCNVD---PATMNKG------NTGRPMVYS-CSWPDYMRIAK 195
Query: 176 -PALWGSNIR--NSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWNDPDMLEVG 228
P W I+ N WR DDI DS + + +I + + + A PG WND DML +G
Sbjct: 196 IPIDWQLIIQYCNLWRLFDDIQDSLSSVSSIVEYWGSQQVLLSGIAGPGHWNDMDMLIIG 255
Query: 229 NGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ---- 284
+ G+ + + ++WAI+ +PL++ D+ N+++ ++ N+ +IAV+Q QQ
Sbjct: 256 DFGLNQAQSEMQMALWAINASPLIMSNDLRNISSWASSLLLNKAIIAVSQDPLGQQGIRL 315
Query: 285 -------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
W L+ + L + T ++ +T + D+G S
Sbjct: 316 ARNATSETWVRILADGSFAIALYNTATVAATLTFQFADLGFSS 358
>gi|429195825|ref|ZP_19187825.1| alpha-galactosidase [Streptomyces ipomoeae 91-03]
gi|428668482|gb|EKX67505.1| alpha-galactosidase [Streptomyces ipomoeae 91-03]
Length = 666
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 166/379 (43%), Gaps = 66/379 (17%)
Query: 21 SNGLGLTPP-MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
+N +G+TPP MGW SWN F I+ +I++ D S + + D
Sbjct: 48 TNQIGVTPPPMGWASWNTFFSSIDHNVIKQQADALVSSG---MAAAGYKYVNLDDGWWQG 104
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVH--------------------------ILI 113
A+ G++V ++T +P G KA+ADY+H
Sbjct: 105 AR----DANGDIVVDETLWPGGMKAIADYIHSKGLKAGIYTDAGKQGCGYYYPTTRPAAP 160
Query: 114 SNSLDNRIESNI-AFEQ-GIDYLKYDNC------FNDDTRPTIRYRAMSDALKKAGRPIF 165
+ ++ + ++ AF++ G DY+K D C + +T+ A A GR +
Sbjct: 161 NTGMEGHYQQDLEAFQRWGFDYVKIDWCGGRVEGLDQETQYKQIAAANEAASAVTGRKLV 220
Query: 166 FSMCEWGDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGW- 219
S CEWG P W + + WRT+DD+ + A+M + ++ A+ G+
Sbjct: 221 LSFCEWGTGLPWNWATGYGDLWRTSDDVLFFQQTPDLAKMYR--NFDQALQPAAQHTGYV 278
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ- 278
NDPDM+ VG GM +H S+W++ PLL G ++ ++ ET I+ N EVIA++Q
Sbjct: 279 NDPDMMMVGLNGMTAQRNRLHMSLWSLVGGPLLAGNNIATMSTETRDILTNPEVIAIDQD 338
Query: 279 -----------AINDQQIWAGPLSG-NRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFE 326
Q++ LSG + V+L +R ++ +T W D+G+ + +
Sbjct: 339 PRGLQGVKVAEDTRGLQVYGKVLSGTGKRAVMLFNRTGSAANITVRWADLGLTTASATVR 398
Query: 327 H---RTLETKFQGKLTAKV 342
+ RT F TA V
Sbjct: 399 NAWTRTDAGSFATSYTASV 417
>gi|123456302|ref|XP_001315888.1| Melibiase family protein [Trichomonas vaginalis G3]
gi|121898578|gb|EAY03665.1| Melibiase family protein [Trichomonas vaginalis G3]
Length = 525
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 134/289 (46%), Gaps = 48/289 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MGW+SWN + I+E++I++ E + + D N+
Sbjct: 16 TPTMGWSSWNTYRVHISEELIKKQATAMAEKGLKEK---GYKYINIDD----GYFGGRNE 68
Query: 87 CQGNLVANKTTFPSGTKALADYVHILI----------SNSLDNRIESNIAFEQGI----- 131
G L+ N FP+G + + ++H L N+ N ++++ +G+
Sbjct: 69 TTGQLLINPERFPNGLEPVVTHIHNLGFKAGIYSDGGRNTCGNYYDADV-LGKGVGLYAH 127
Query: 132 --------------DYLKYDNCFNDDTRPT--------IRYRAMSDALKKAGRPIFFSMC 169
D++K D C D + + RY ++ AL K R I ++C
Sbjct: 128 DDQDADLFFNTLQFDFIKLDYCGGDPNQNSERLNLDEQTRYTSIRKALNKITRSIRVNVC 187
Query: 170 EWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGN 229
W P W + + SWR + DI +++ + + ++N Y+ +A G +ND DMLEVG
Sbjct: 188 RWA--YPGSWVTKLATSWRISSDIENNFDAIKRLINVNMFYSAYAVEGHYNDMDMLEVGR 245
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G M +E HF +W I +PLL+GCD+ + T+ ++ NEE+IA+NQ
Sbjct: 246 G-MSDDEDKTHFGMWCIMSSPLLIGCDMTTIKDSTLELLKNEELIALNQ 293
>gi|332184907|ref|ZP_08386656.1| melibiase family protein [Sphingomonas sp. S17]
gi|332014631|gb|EGI56687.1| melibiase family protein [Sphingomonas sp. S17]
Length = 640
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/337 (29%), Positives = 134/337 (39%), Gaps = 87/337 (25%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGESSRDSQ 62
+G PPMGWNSWN F+ ++E+ + + DD W R
Sbjct: 45 DGQAALPPMGWNSWNAFNSDVDEEKVMASARLIVDKGLAAAGYRYINIDDGWWLRRRQPD 104
Query: 63 ISLSCSFHMFDSLT-----QIAAKPLP------------------NQCQGNLVANKTTFP 99
L F S Q + +PL N C N P
Sbjct: 105 GRLVIRADKFPSAAAGPNQQTSFRPLTDRLHAMGFKAGIYSDIGRNSCGQVYTPNFANQP 164
Query: 100 SGTKALADYVHILISNSLDNRIESNIAF---EQGIDYLKYDNC----------------- 139
GT A + L I+ +IA E G DY+K D C
Sbjct: 165 EGTIAEREV-------GLYGHIDQDIALYFREWGFDYIKVDGCGIRGLGRDSEHVRKGDY 217
Query: 140 --------FNDDTR---PTIR--YRAMSDALKKAGRP--IFFSMCEWGDMRPALWGSNIR 184
N R P +R Y ++ ALK+ FS+C WG W +
Sbjct: 218 RELAPLIDMNSLARTDIPAVRALYDHVAMALKRENPDGDYLFSICLWGSADVRSWAKQLG 277
Query: 185 NSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---EYVVHF 241
N RT+DDI+ W+RMLT D + A +A P WNDPDML +G+G N E HF
Sbjct: 278 NVSRTSDDITAHWSRMLTNFDTSATRALYAHPHSWNDPDMLFIGSGDFDANHMTEARSHF 337
Query: 242 SIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
++WA+ APLL+G D+ AE + ++GN +IA+NQ
Sbjct: 338 ALWAMMNAPLLIGFDLRKANAEQLALLGNRAIIALNQ 374
>gi|330930098|ref|XP_003302894.1| hypothetical protein PTT_14879 [Pyrenophora teres f. teres 0-1]
gi|311321451|gb|EFQ89002.1| hypothetical protein PTT_14879 [Pyrenophora teres f. teres 0-1]
Length = 553
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 164/389 (42%), Gaps = 65/389 (16%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK-PLPN 85
TP MGWNS+N C N + I T + S S+ + + F A++ N
Sbjct: 27 TPVMGWNSYNQMSCSPNTQKITTTIN-----SLSSRGFVDAGYKFFQIDCGWASRDGQRN 81
Query: 86 QCQGNLVANKTTFPSGTKALADYVH------ILISNS----LDNRIESNIAFEQG----- 130
G L + FP G K L+D + S++ D + S + G
Sbjct: 82 SSNGALKVDLNAFPGGLKPLSDLARSKGMKWTMYSDAGVRMCDPQSPSPVLGSLGHEAAD 141
Query: 131 --------IDYLKYDNCFNDD------------TRPTIRYRAMSDALKKAGRPIFFSMCE 170
+Y+KYDNC+ D T R AM L++ G P +C+
Sbjct: 142 ADFFKTLNTEYVKYDNCYADGPAASQNAPKNARTDFVTRMGAMWKELQRVGIPGLL-ICQ 200
Query: 171 WGD--------MRPALWGSNIRNSWRTTDDISDSWARMLTIAD--MNEVYADHAKPGGWN 220
WG P W + S+R +DDI+ WA M I + ++ + PG
Sbjct: 201 WGVPFTTSSGLQGPVQWTKGLSTSFRLSDDITSGWASMFRIYNQAIHIAQSGLTGPGHIA 260
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-- 278
D D+L VGN GM ++E HF+ WA+ K+ L++ DV NL+ E + ++ N+++IA+NQ
Sbjct: 261 DADLLMVGNNGMTFDEQATHFAAWAMLKSALMISTDVPNLSNELVAVLQNKDLIAINQDS 320
Query: 279 AI----------NDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHR 328
A+ D+ +WAG L+ + VL + ++ D+GI S T+ +
Sbjct: 321 AVKPVTLRQRWSGDRDLWAGDLANGDMAVLAVDLSNSARTLSVQLSDLGITS-ATIKDLW 379
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPVTK 357
T + +V H + L+ + +
Sbjct: 380 AGTTTTGSSFSKQVKAHGSVVLRLSNIVR 408
>gi|71020839|ref|XP_760650.1| hypothetical protein UM04503.1 [Ustilago maydis 521]
gi|46100152|gb|EAK85385.1| hypothetical protein UM04503.1 [Ustilago maydis 521]
Length = 572
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 147/356 (41%), Gaps = 72/356 (20%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
TP MG+N++N C E D SQ + ++ F ++
Sbjct: 44 TPGMGFNTYNQVSCSPTESKSHTIMDIMA-----SQGYIDAGYNFFQVDCGWVSRTTSRD 98
Query: 87 CQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ----------------- 129
GNL+ N FPSG K+L Y S L+ + S+ +
Sbjct: 99 ASGNLITNTDAFPSGMKSLGQYA---TSKGLEFGLYSDAGYRACDPQAPSPVLGSLGHEA 155
Query: 130 ---------GIDYLKYDNCFNDDTR-------------PTIRYRAMSDALKKAGRPIFFS 167
+ YLKYDNC+ D T PT R+ AMS AL G
Sbjct: 156 QDAQLLKSYNVSYLKYDNCYADGTTSGDNAPKHARTDFPT-RFGAMSKALSDVGINKLL- 213
Query: 168 MCEWGD---------MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAK--P 216
+C+WG + PA W + S+R +DDI+ W + I + + H + P
Sbjct: 214 VCQWGVPQKQSNGTLVGPAQWTQGLSTSYRLSDDIATGWINVERILNQGIQISLHGRSGP 273
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
+ D D+LEVGN GM +E HF+ WA+ K+PL++ D+ ++++ I+ N+ +IA+
Sbjct: 274 NHFADGDLLEVGNSGMSIDEQGTHFAFWAMIKSPLVISTDLSSISSNAKAILLNKGLIAI 333
Query: 277 NQ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKS 320
NQ + + AG LS + VL + +T ++ ++GI S
Sbjct: 334 NQDRLGEPVKLVERRTGNFDLHAGKLSNGDMAVLAFDWSNARNTITVNFSELGIAS 389
>gi|308503765|ref|XP_003114066.1| CRE-GANA-1 protein [Caenorhabditis remanei]
gi|308261451|gb|EFP05404.1| CRE-GANA-1 protein [Caenorhabditis remanei]
Length = 452
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 131/317 (41%), Gaps = 85/317 (26%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQI----------NEKIIRE-------------------T 50
L NGLG TPPMGW SW F+C+I NE++ ++
Sbjct: 17 LDNGLGRTPPMGWMSWTAFYCEIDCQKHPTGCINEQLYKDMADQLVSGGYDKLGYKSVHI 76
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW E RDS G LVANKT FPSG K LA Y+H
Sbjct: 77 DDCWSEMERDSN--------------------------GVLVANKTRFPSGMKTLAKYMH 110
Query: 111 I------------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRA 152
+ R ++ +DYLK D C D I Y
Sbjct: 111 DRGLKFGIYEDYGTKTCGGYPGSYKHERTDAQTFAAWDVDYLKLDGCNIDQAMMPIGYPL 170
Query: 153 MSDALKKAGRPIFFSMCEWGDM---RPALWGSNIR----NSWRTTDDISDSWARMLTIAD 205
L GRPI +S C W P L ++ N+WR DDI+ SW +++I
Sbjct: 171 FEKELNATGRPIMYS-CSWPAYLIDHPELVNYHLIGKYCNTWRNFDDINSSWKSIISIIS 229
Query: 206 MNEVYADHA----KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLT 261
+ D PG W+DPDML +GN G+ + + F++W I APL++ D+ +
Sbjct: 230 YYDKNQDKHIPTHGPGRWHDPDMLVIGNKGITLDMSISQFTVWCIWSAPLIMSNDLRIIG 289
Query: 262 AETMPIIGNEEVIAVNQ 278
E ++ N+E I +NQ
Sbjct: 290 DEFKGVLQNKEAIRINQ 306
>gi|301309686|ref|ZP_07215625.1| alpha-galactosidase [Bacteroides sp. 20_3]
gi|423340158|ref|ZP_17317897.1| hypothetical protein HMPREF1059_03822 [Parabacteroides distasonis
CL09T03C24]
gi|300831260|gb|EFK61891.1| alpha-galactosidase [Bacteroides sp. 20_3]
gi|409227593|gb|EKN20489.1| hypothetical protein HMPREF1059_03822 [Parabacteroides distasonis
CL09T03C24]
Length = 530
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 160/384 (41%), Gaps = 65/384 (16%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN + I++K+I + D SR+ + + D
Sbjct: 22 PTMGWSSWNTYRVNISDKLIMKQADA---MSRNGLKEVGYKYINIDD-----GYFGGRDA 73
Query: 88 QGNLVANKTTFPSGTKALADYVHIL------ISNSLDNRIES---------NIAF----- 127
G L+ + FP+G K D++H L S++ N S N+ F
Sbjct: 74 SGELITHPVRFPNGLKQTVDHIHALGFKAGIYSDAGRNTCGSFWDKDSLGINVGFYGHDR 133
Query: 128 --------EQGIDYLKYDNCFNDDTR--------PTIRYRAMSDALKKAGRP-IFFSMCE 170
E G D++K D C D + R+ A+ +A+ GR + ++C
Sbjct: 134 QDADYFFKEIGFDFIKIDFCGGDAKQNFDQLGLDEQERFTAIHNAILATGRKDVRMNVCR 193
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
W P W ++ SWR + DI+ SW + I N + +A G +ND DMLE+G G
Sbjct: 194 WN--FPGTWVHDVAFSWRISQDINPSWESVKNIIRQNLYLSAYASEGKYNDMDMLEIGRG 251
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWA--- 287
M E HF +W I +PLL+GCD+ ++ + + ++ NEE+IA+NQ + Q +
Sbjct: 252 -MSEEEDKTHFGMWCIMSSPLLIGCDLTTISEKALRLLKNEELIALNQDVLGLQAYVVKQ 310
Query: 288 -----------GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIK---SNNTLFEHRTLETK 333
++GN V + + + + D ++ S LFE R L
Sbjct: 311 MDGVYILTKDLEEVNGNTCAVAVYNSTDEERTIHLDFADFYLRGDVSVRDLFERRDLGKY 370
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
+ + H +++ L + +
Sbjct: 371 KDRDFSVTIPAHGTRIFRLDGLER 394
>gi|343425390|emb|CBQ68925.1| related to Alpha-N-acetylgalactosaminidase precursor [Sporisorium
reilianum SRZ2]
Length = 574
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 150/360 (41%), Gaps = 75/360 (20%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MG+N++N C E T D SQ L+ ++ F ++
Sbjct: 46 PGMGFNTYNQVSCSPTEAKSHTTMDLM-----QSQGYLAAGYNFFQVDCGWVSRTTARDS 100
Query: 88 QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQ------------------ 129
GNL+ N FPSG K+L Y S L+ + S+ +
Sbjct: 101 SGNLMVNTDAFPSGMKSLGSYA---TSKGLEFGLYSDAGYRACDPQSPSPVLGSLGHESQ 157
Query: 130 --------GIDYLKYDNCFND-----DTRPT-------IRYRAMSDALKKAGRPIFFSMC 169
+ YLKYDNC+ D D P R+ AM+ AL G +C
Sbjct: 158 DAALLKSFNVTYLKYDNCYADGATAADNAPKNARTDFPTRFGAMTKALSSVGINKLL-IC 216
Query: 170 EWGD---------MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA----DHAKP 216
+WG + PA W + + S+R +DDI+ W ++ I +N+ + P
Sbjct: 217 QWGVPQKQSDGSLVGPAQWTTGLSTSYRLSDDIAAGWINVVRI--LNQAIPISLNGRSGP 274
Query: 217 GGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAV 276
G + D D+LEVGN GM +E HF+ WA+ K+PL++ D+ +++ I+ N+ +IA+
Sbjct: 275 GHFADGDLLEVGNSGMTIDE-ATHFAFWAMIKSPLVISTDLSTISSAAKAILLNKGLIAI 333
Query: 277 NQ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
NQ + + AG L+ + VL +T ++ D+GI S N +
Sbjct: 334 NQDSLGEPVKLIERRTGNYDLHAGKLANGDMAVLAFDWTGTQRSITVNFSDLGIASANVV 393
>gi|334144846|ref|YP_004538055.1| alpha-galactosidase [Novosphingobium sp. PP1Y]
gi|333936729|emb|CCA90088.1| alpha-galactosidase [Novosphingobium sp. PP1Y]
Length = 630
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 159/373 (42%), Gaps = 92/373 (24%)
Query: 28 PPMGWNSWNHFHCQINE-KIIRETDDC-------------------WGESSRDSQISL-- 65
PPMGWNSWN F +INE KI+ + W D ++ +
Sbjct: 43 PPMGWNSWNAFALEINEAKILGAAETIVTSGLAAKGYRYVDLDEGWWARRGADGRLVIRT 102
Query: 66 -----------SCSFH-MFDSLTQIAAKPL------PNQCQGNLVANKTTFPSGTKALAD 107
+ SF + D L + K N C K + PSGT A +
Sbjct: 103 DRFPSAIRSDGTTSFRPLTDKLHDMGLKAGIYSEIGRNSCGQIYSEEKASLPSGTVAERE 162
Query: 108 YVHILISNSLDNRIESNIAFEQGIDYLKYDNC-----------------------FNDDT 144
+ + + +D I A E G D +K D C + D+
Sbjct: 163 ---VGLYDHVDRDIGLYFA-EWGFDLIKVDGCGIRGLPQGSPRVASGEFRAFPPLIDSDS 218
Query: 145 R-----PTIR--YRAMSDALKK--AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD 195
P +R + ++DAL++ FS+C WG W N+ N RT++DI
Sbjct: 219 LGRTNIPAVRALFGKVADALERYNPDNDFVFSLCIWGSADVRSWAKNVGNLSRTSEDIFP 278
Query: 196 SWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---EYVVHFSIWAISKAPLL 252
W RML D A +A+PG WNDPDML +G+G + E HFS+WA+ APL+
Sbjct: 279 IWGRMLFNMDSTSRRALYAQPGSWNDPDMLYIGSGEFDADHLTEARTHFSMWAMLNAPLI 338
Query: 253 LGCDVGNLTAETMPIIGNEEVIAV------NQAI-----NDQQIWAGPLS-GNRIVVLL- 299
+G D+ + E M I+GNE +IA+ NQA+ +D QI+A L+ G + V +L
Sbjct: 339 IGYDLRHAAPELMEILGNERIIALDQDPAGNQAVLAYDSDDVQIYAKTLADGTKGVAILN 398
Query: 300 ESRKTFSSMMTAH 312
+ +++TA
Sbjct: 399 RTSAPLDAVLTAE 411
>gi|365118956|ref|ZP_09337349.1| hypothetical protein HMPREF1033_00695 [Tannerella sp.
6_1_58FAA_CT1]
gi|363648935|gb|EHL88071.1| hypothetical protein HMPREF1033_00695 [Tannerella sp.
6_1_58FAA_CT1]
Length = 531
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/415 (24%), Positives = 176/415 (42%), Gaps = 82/415 (19%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPP-MGWNSWNHFHCQINEKIIRETDDCWGESSR 59
LFI+ G + N TPP MGW+SWN + INE++I++ D +
Sbjct: 7 LFIIVLCYVAMTGKAQNNRF------TPPIMGWSSWNTYRININEQLIKKQADAIIDRGL 60
Query: 60 DS----QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH----- 110
+ I++ F + + I L + FP+G + + Y+H
Sbjct: 61 KTVGYRYINIDDGFFGWRDESGI------------LHTHPKRFPNGLENIVKYIHDKGLK 108
Query: 111 ----------------------ILISNSLDNRIESNIAFEQ-GIDYLKYDNC-------F 140
I + + +++ F + G D++K D C
Sbjct: 109 AGIYSDAGSNTCGSIWDNDPNGIGVGLYGHEKQNADLFFNKWGFDFIKIDYCGAGQELAL 168
Query: 141 NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARM 200
++ R T YRA +D I ++C W P W +I SWR + DI+ WA +
Sbjct: 169 DEQKRYTEIYRAFNDV---CNHKISLNICRWA--FPGTWAEDIATSWRISADITPEWASI 223
Query: 201 LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
I + N + +A+ G +ND DMLE+G G +K E VHF +W I +PLL+GCD+ +
Sbjct: 224 KHIINKNLYLSAYARNGHYNDMDMLEIGRG-LKPEEEEVHFGMWCIMSSPLLIGCDLTTI 282
Query: 261 TAETMPIIGNEEVIAVNQAINDQQIWAGPLSGNRIVVL--LESRKTFSSMM--------- 309
+++ ++ N+E+IA+NQ Q + N V++ +E ++ + M
Sbjct: 283 PDKSLELLKNKELIAINQDSLGLQAYVVQHEKNGYVLVKDIEQKRGKARAMALYNPSDSI 342
Query: 310 ------TAHWDDIGIKSNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELTPVTKL 358
TA + G L EH+T++T + L ++ HS K + + +L
Sbjct: 343 CSFSVPTACLELGGTIKVRDLIEHKTIDTN-KEYLEYELPPHSAKFFRIEAEKRL 396
>gi|189468400|ref|ZP_03017185.1| hypothetical protein BACINT_04797 [Bacteroides intestinalis DSM
17393]
gi|189436664|gb|EDV05649.1| alpha-galactosidase [Bacteroides intestinalis DSM 17393]
Length = 536
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 50/286 (17%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDS---QISLSCSFHMFDSLTQIAAKPL 83
P MGW+SWN +H INE++I++ D +D+ I++ F + T
Sbjct: 34 PLMGWSSWNTYHVNINEELIKKQADALVTHGLKDAGYLYINVDDGFFGWRDET------- 86
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL-------------------------ISNSLD 118
G + A+ FP+G + ++DY+H L + + L
Sbjct: 87 -----GKMHAHPERFPNGMRPISDYIHSLGLKAGIYSDAGDNTCGSIYDDDANGVGSGLY 141
Query: 119 NRIESNIAF---EQGIDYLKYDNCFNDDT--RPTIRYRAMSDALKKAGRP-IFFSMCEWG 172
+ +I E D++K D C + RY + +A+K GR + ++C W
Sbjct: 142 GHEQQDIDLYLKEWNYDFIKIDYCGGRELGLDEEKRYATICEAIKNTGRTDVSINICRWA 201
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W ++ SWR + DI W + I N + +A G +ND DMLEVG G +
Sbjct: 202 --FPGTWAKSMARSWRISPDIRPRWESVKNIIRKNLYLSAYAGEGHYNDMDMLEVGRG-L 258
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
K E VHF +W I +PLL+GCD+ + ++ ++ N+E+IA+NQ
Sbjct: 259 KQEEEEVHFGMWCIMSSPLLIGCDMTTIPEASLALLKNKELIALNQ 304
>gi|254220898|pdb|3A21|A Chain A, Crystal Structure Of Streptomyces Avermitilis Beta-L-
Arabinopyranosidase
gi|254220899|pdb|3A21|B Chain B, Crystal Structure Of Streptomyces Avermitilis Beta-L-
Arabinopyranosidase
gi|254220900|pdb|3A22|A Chain A, Crystal Structure Of Beta-L-Arabinopyranosidase Complexed
With L- Arabinose
gi|254220901|pdb|3A22|B Chain B, Crystal Structure Of Beta-L-Arabinopyranosidase Complexed
With L- Arabinose
gi|254220902|pdb|3A23|A Chain A, Crystal Structure Of Beta-l-arabinopyranosidase Complexed
With D- Galactose
gi|254220903|pdb|3A23|B Chain B, Crystal Structure Of Beta-l-arabinopyranosidase Complexed
With D- Galactose
Length = 614
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 167/389 (42%), Gaps = 73/389 (18%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN F +I+ +I++ D + L + + + ++ + + +
Sbjct: 13 PMGWASWNSFAAKIDYSVIKKQVDAF------VAAGLPAAGYTYINIDEGWWQGTRDSA- 65
Query: 89 GNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA---------------------- 126
GN+ + +P G A+ Y+H S L I ++
Sbjct: 66 GNITVDTAEWPGGMSAITAYIH---SKGLKAGIYTDAGKDGCGYYYPTGRPAAPGSGSEG 122
Query: 127 -FEQ--------GIDYLKYDNCFND--DTRPTIRYRAMSDALKKA----GRPIFFSMCEW 171
++Q G D++K D C D Y+++SDA+ +A GRP+ S+C W
Sbjct: 123 HYDQDMLQFSTWGFDFVKVDWCGGDAEGLDAATTYKSISDAVGRAAATTGRPLTLSICNW 182
Query: 172 GDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
G P W + WRT+ DI S +L+ D ++ G +NDPDML
Sbjct: 183 GYQNPWNWAAGQAPLWRTSTDIIYYGNQPSMTSLLSNFDQT-LHPTAQHTGYYNDPDMLM 241
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------- 278
VG G + H ++WAIS APLL G D+ +T+ET I+ N EVIAV+Q
Sbjct: 242 VGMDGFTAAQNRTHMNLWAISGAPLLAGNDLTTMTSETAGILKNPEVIAVDQDSRGLQGV 301
Query: 279 ----AINDQQIWAGPLS--GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----LFEHR 328
Q + LS GNR VVLL R + + +T W D+G+ + + L+ +
Sbjct: 302 KVAEDTTGLQAYGKVLSGTGNRAVVLLN-RTSAAHDITVRWSDLGLTNASATVRDLWARQ 360
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPVTK 357
+ T G TA V M +T T+
Sbjct: 361 NVGTSATG-YTASVPAGGSVMLTVTGGTE 388
>gi|29828728|ref|NP_823362.1| alpha-galactosidase [Streptomyces avermitilis MA-4680]
gi|29605832|dbj|BAC69897.1| putative secreted alpha-galactosidase [Streptomyces avermitilis
MA-4680]
Length = 658
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 167/389 (42%), Gaps = 73/389 (18%)
Query: 29 PMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQ 88
PMGW SWN F +I+ +I++ D + L + + + ++ + + +
Sbjct: 57 PMGWASWNSFAAKIDYSVIKKQVDAF------VAAGLPAAGYTYINIDEGWWQGTRDSA- 109
Query: 89 GNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA---------------------- 126
GN+ + +P G A+ Y+H S L I ++
Sbjct: 110 GNITVDTAEWPGGMSAITAYIH---SKGLKAGIYTDAGKDGCGYYYPTGRPAAPGSGSEG 166
Query: 127 -FEQ--------GIDYLKYDNCFND--DTRPTIRYRAMSDALKKA----GRPIFFSMCEW 171
++Q G D++K D C D Y+++SDA+ +A GRP+ S+C W
Sbjct: 167 HYDQDMLQFSTWGFDFVKVDWCGGDAEGLDAATTYKSISDAVGRAAATTGRPLTLSICNW 226
Query: 172 GDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
G P W + WRT+ DI S +L+ D ++ G +NDPDML
Sbjct: 227 GYQNPWNWAAGQAPLWRTSTDIIYYGNQPSMTSLLSNFDQT-LHPTAQHTGYYNDPDMLM 285
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-------- 278
VG G + H ++WAIS APLL G D+ +T+ET I+ N EVIAV+Q
Sbjct: 286 VGMDGFTAAQNRTHMNLWAISGAPLLAGNDLTTMTSETAGILKNPEVIAVDQDSRGLQGV 345
Query: 279 ----AINDQQIWAGPLS--GNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT----LFEHR 328
Q + LS GNR VVLL R + + +T W D+G+ + + L+ +
Sbjct: 346 KVAEDTTGLQAYGKVLSGTGNRAVVLLN-RTSAAHDITVRWSDLGLTNASATVRDLWARQ 404
Query: 329 TLETKFQGKLTAKVDGHSCKMYELTPVTK 357
+ T G TA V M +T T+
Sbjct: 405 NVGTSATG-YTASVPAGGSVMLTVTGGTE 432
>gi|403418921|emb|CCM05621.1| predicted protein [Fibroporia radiculosa]
Length = 327
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 112/229 (48%), Gaps = 33/229 (14%)
Query: 114 SNSLDNRIESNIAFEQ-GIDYLKYDNC---FNDDTRPTI--RYRAMSDALKKAGR----- 162
+ S ++ + F+ G DYLKYDNC F+D R +Y+ M+DA+ +
Sbjct: 20 AGSFEHELVDATTFQNWGFDYLKYDNCYIPFDDVIRQGTYGKYQRMADAIAELAETSRAP 79
Query: 163 PIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDP 222
P+ +S+CEWG + +WG + NSWR DI +W + +I + G ND
Sbjct: 80 PLIYSLCEWGWSQVWIWGPQVGNSWRIDGDIEANWDSIASIINFASFITQSTNFYGRNDM 139
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
D+LEVGNG + Y E HF+ WA+ K+PLL+G ++ E + ++ N E++A+NQ
Sbjct: 140 DILEVGNGNLTYEENKSHFTAWALLKSPLLIGTNLSATPPEIIEVLSNTEILAINQDPVV 199
Query: 279 ---------AIN---------DQQIWAGPLSGNRIVVLLESRKTFSSMM 309
IN Q W+GP +V+LL + ++M
Sbjct: 200 GTNISPFRWGINADWTSNSSYPAQYWSGPSQNGTVVMLLNTLDEPANMF 248
>gi|345561280|gb|EGX44376.1| hypothetical protein AOL_s00193g104 [Arthrobotrys oligospora ATCC
24927]
Length = 466
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 133/323 (41%), Gaps = 71/323 (21%)
Query: 18 NLLSNGLGLTPPMGWNSWNHFHCQINE-KIIRETDDCWGESSRDSQISLSCSFHMFD--- 73
L S G P MG+NSWN F+ INE K + +D L ++ D
Sbjct: 23 QLESTTEGRLPAMGFNSWNCFNEHINEEKFLTAAQQLVDLGLKD----LGYTYVNIDDGW 78
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----------------ISNS 116
S + P + ++ + FP G LAD VH L S
Sbjct: 79 SDKDLWRDPETKR----IIVDTKKFPDGISGLADKVHELGLKLGIYSDRGTKTCASYPGS 134
Query: 117 LD-NRIESNIAFEQGIDYLKYDNCFNDD-------------------------------- 143
+D I++ + G+DYLKYDNCF D
Sbjct: 135 MDYEEIDAKTFADWGVDYLKYDNCFIPDEEEDEYPWAPEWWNTTHDGNKIHAPESYDFST 194
Query: 144 TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWA----- 198
+R RYRAM DALK+ R I F MC WG WG+ SWR DI W
Sbjct: 195 SRSAQRYRAMRDALKRQDRIIQFGMCNWGHAHVEKWGNETAQSWRIWGDILPQWTGHKDH 254
Query: 199 ---RMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGC 255
++ I + + G D DMLEVGN G+ E HF++WAI K+PLL+G
Sbjct: 255 IAWGVMPILNHALFHLSQTNFWGHADMDMLEVGN-GLTPAEDRSHFALWAILKSPLLIGT 313
Query: 256 DVGNLTAETMPIIGNEEVIAVNQ 278
+ ++ ET+ + N+E+IA NQ
Sbjct: 314 PLDKVSPETLAVFKNKELIAFNQ 336
>gi|302404982|ref|XP_003000328.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
gi|261360985|gb|EEY23413.1| alpha-galactosidase [Verticillium albo-atrum VaMs.102]
Length = 402
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 132/300 (44%), Gaps = 62/300 (20%)
Query: 31 GWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAKPLPNQCQG 89
GWNSWN F IN+ I+ T ++ RD+ + M D + P G
Sbjct: 48 GWNSWNTFKLGINQTIVEGTAQALVDTGLRDAGYTYLV---MDDGWQNLTRGP-----DG 99
Query: 90 NLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGIDYLKY--DNCFNDDTRPT 147
AN T FPSG K LAD VH + + F G+ L Y F +TRP
Sbjct: 100 RQQANATRFPSGLKVLADEVH-----------KKGLKF--GLYRLGYGLSEGFYGNTRPL 146
Query: 148 I-RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSW--------- 197
R++ MS ALK GR IF+++C+WG P W +S+R + DI +
Sbjct: 147 QERFQIMSHALKNTGRDIFYAVCQWGHQWPWYWADQFTDSYRMSGDIHAKFRDDGNSVCK 206
Query: 198 ------------ARMLTIADMNEVYADHAKPGGWNDPDMLEVG-NGGMKYNEYVVHFSIW 244
+ + I M E+ + K G W D DMLEVG N ++ H S W
Sbjct: 207 TAYCLNTGYAGVSVLTMIRKMREI-SGFQKKGSWADMDMLEVGVNRNFTLHQDQTHLSFW 265
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------AINDQQIWAGPL 290
A K+PL++G D+ + ++ ++ +++I +NQ ++ Q+WAGP+
Sbjct: 266 AALKSPLIIGADIRTIRKSSLDVLLKKDIIGINQDDLGVAVSYVAELSKEDEIQVWAGPV 325
>gi|328851067|gb|EGG00225.1| family 27 glycoside hydrolase [Melampsora larici-populina 98AG31]
Length = 525
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 154/358 (43%), Gaps = 74/358 (20%)
Query: 30 MGWNSWNHFHCQINEKIIRETDDCWGESS-RDS---QISLSCSFHMFDSLTQIAAKPLPN 85
MGWNS F C I+EK +R+ D S RDS L C + D
Sbjct: 1 MGWNSAVAFGCGIDEKSMRQQVDLMESSGLRDSGYVTFILGCGWQAIDRAKD-------- 52
Query: 86 QCQGNLVANKTTFPSGTKALADYVHI------LISNSLDN------RIESNIAFEQ---- 129
G AN+ FPSG +A YVH L S + D+ + S + +
Sbjct: 53 ---GTFAANQQAFPSGMNGMAKYVHGKKMQFGLTSTAGDHPCGTRAKAHSKMIGSRNHEV 109
Query: 130 ---------GIDYLKYDNCFNDDTRPTI----------RYRAMSDALKKAGRPIFFSMCE 170
GID+LK + C+ D+ + R++ DAL +G P+ +++
Sbjct: 110 ADVKTFVGWGIDFLKMEFCWADNPHTPVDYNPNFSVKDRFKVTRDALIASGAPVTYAISA 169
Query: 171 WGDMRPALWG-SNIRNSWRTTDDI---SDSWARMLTIADMNEVYADHAKPGGWNDPDMLE 226
WG P W + N WR DD+ SD+W+R++ + A +PG + D D+LE
Sbjct: 170 WGIQNPLSWPVKDYANLWRIGDDVAGASDTWSRVVRTINQFASEAWANEPGTFADLDLLE 229
Query: 227 VGNGGMKYNEYVVHFSIWAIS--------KAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
VG GG+ E + F+ WA + ++PL++ D+ L + N ++IAVNQ
Sbjct: 230 VGQGGLTRIEVITQFTFWAAANSSRHRKHRSPLIISTDLTRLDPSLIGHFKNPKLIAVNQ 289
Query: 279 ------------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
+ +W+GPL +VV++ + + ++ + D+G+ S N +
Sbjct: 290 DPLGASIRLRRRYTDQHDVWSGPLKDGSVVVVIVNFLADTLLIDFNLADLGLLSANIV 347
>gi|403724394|ref|ZP_10946032.1| putative glycoside hydrolase [Gordonia rhizosphera NBRC 16068]
gi|403205604|dbj|GAB90363.1| putative glycoside hydrolase [Gordonia rhizosphera NBRC 16068]
Length = 424
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 150/349 (42%), Gaps = 46/349 (13%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQ 62
GL TPPMGWNSWN F C I E++IR DDCWG R +
Sbjct: 45 TGLPPTPPMGWNSWNTFGCDITEELIRRQADALVESGLRDAGYRYVVVDDCWGADHRAAD 104
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIE 122
SL F S + L + + + + + T+ Y S + +
Sbjct: 105 GSLQADPQRFPSGMAALGRYLHARGLAFGLYSGASAQTCTQYQGTYPGSTGSRGHETQDA 164
Query: 123 SNIAFEQGIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWG 180
+ A Q +DY+KYD C +D + AM DAL+ GRPI +S+ + ++ G
Sbjct: 165 ATFADWQ-VDYIKYDWCSSDSDHDDQVAAFTAMRDALRATGRPIVYSINPNSGVSGSVPG 223
Query: 181 S-----NIRNSWRTTDDISDSWAR---------MLTIADMNEVYADHAKPGGWNDPDMLE 226
+ + R T+DI+ +W+ ++ I D PG + DPDM+
Sbjct: 224 TEFDWGGVATMTRATNDITPAWSTDAGPSGYQGIIDIVDAIAPLGARVAPGSFLDPDMMV 283
Query: 227 VGNG-GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------- 278
VG G G+ S+WA+ APL+ G D+ ++A+T+ + + ++A++Q
Sbjct: 284 VGIGDGLTAAMDRTQMSMWAMMAAPLIAGNDLTQMSAQTLATLTSPAILALDQDERVSAG 343
Query: 279 --AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLF 325
+D +IW+ + +VV L +R + M+ +G+ + +
Sbjct: 344 RPVDDDPEIWSRAVGDKGLVVSLTNRSDYPRTMSVSLASLGLVGDERVV 392
>gi|345328430|ref|XP_001513605.2| PREDICTED: alpha-galactosidase A-like [Ornithorhynchus anatinus]
Length = 394
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 122/255 (47%), Gaps = 41/255 (16%)
Query: 89 GNLVANKTTFPSGTKALADYVH-----ILISNSLDN-------------RIESNIAFEQG 130
G L A+ FP G + LA+YVH + I + N +++ + G
Sbjct: 111 GRLRADPDRFPGGIRRLANYVHSKGLKLGIYEDVGNHTCAGYPGSFGAYELDAQTFADWG 170
Query: 131 IDYLKYDNCFNDD-TRPTIRYRAMSDALKKAGRPIFFSMCEWG-DMRPALWG--SNIR-- 184
+D LK+D C+ D R YR MS AL GR I +S CEW +RP + IR
Sbjct: 171 VDLLKFDGCYCDSLERLAEGYRRMSLALNGTGRSIVYS-CEWPLYLRPFQKPNYTEIRQY 229
Query: 185 -NSWRTTDDISDSWARMLTIADMNEVYADH----AKPGGWNDPDMLEVGNGGMKYNEYVV 239
N WR DI DSW + +I D D A PGGWNDPDML +GN G+ +++ V
Sbjct: 230 CNHWRNFGDIFDSWDSIKSILDWTASNQDSIVGAAGPGGWNDPDMLVIGNFGLSWDQQVT 289
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAG 288
++WAI APL + D+ ++ + ++ N EVIA+NQ +Q +W
Sbjct: 290 QMALWAIMAAPLFMSNDLRQISPQAKALLQNREVIAINQDPLGRQGYRLSKEDNFDLWER 349
Query: 289 PLSGNRIVVLLESRK 303
PLS V + +R+
Sbjct: 350 PLSNLTWAVAMVNRQ 364
>gi|427386287|ref|ZP_18882484.1| hypothetical protein HMPREF9447_03517 [Bacteroides oleiciplenus YIT
12058]
gi|425726327|gb|EKU89192.1| hypothetical protein HMPREF9447_03517 [Bacteroides oleiciplenus YIT
12058]
Length = 664
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 146/343 (42%), Gaps = 69/343 (20%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
+ V A + G+ + + + + + LTP MGWNSWN + +N+ E
Sbjct: 254 YKVQLKATNDKGTAVKEVTIKIGDEIALTPSMGWNSWNCWGLSVND-----------EKV 302
Query: 59 RDSQISLSCSFHMF--DSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL---- 112
RD+ ++ H + + + P + Q + + FP KAL DY+H L
Sbjct: 303 RDAARMMNEKLHAYGWEYVNIDDGWEAPERTQKGEILSNDKFPD-FKALTDYIHGLGLKF 361
Query: 113 --------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YR 151
+ + I++ G+DYLKYD C + TI+ Y
Sbjct: 362 GIYSSPGHITCGGHVGSYQHEEIDAKTWERWGVDYLKYDYCGYLEIEKNSEEKTIQEPYI 421
Query: 152 AMSDALKKAGRPIFFSMCEWGDMRPALWG---SNIRNSWRTTDDISDSWARMLTIADMNE 208
M AL K R I + + G P +W N WRTT DI+D W + I +
Sbjct: 422 VMRKALDKVNRDIVYCV---GYGAPNVWNWAPEAGGNQWRTTRDITDEWNVVTAIGTFQD 478
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKY-----------NEYVVHFSIWAISKAPLLLGCDV 257
V A+ PG NDPDML VG G + +E H S+W + +PLL+GCD+
Sbjct: 479 VCAEATAPGRNNDPDMLVVGKLGQGWGSKVHDSYLTADEQYSHISLWCLLSSPLLIGCDM 538
Query: 258 GNLTAETMPIIGNEEVIAVNQ----------AINDQQIWAGPL 290
N+ T+ ++ N EVIAV+Q + + QIW+ L
Sbjct: 539 ANMDDFTLNLLTNNEVIAVSQDPLVAPARKMMVENGQIWSKKL 581
>gi|329956841|ref|ZP_08297409.1| alpha-galactosidase [Bacteroides clarus YIT 12056]
gi|328523598|gb|EGF50690.1| alpha-galactosidase [Bacteroides clarus YIT 12056]
Length = 555
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 116/431 (26%), Positives = 173/431 (40%), Gaps = 87/431 (20%)
Query: 2 FIVSSSAADSNGSDRRN---LLSNGLGLTPPMGWNSWNHF-HCQINEKIIRET------- 50
+IV+ A +S G +RN ++ + + LTPPMGWNSWN + H EK++
Sbjct: 123 YIVTFIAKNSLGEAKRNFKIVVGDKIALTPPMGWNSWNCWGHAVSQEKVLSSAKAMVEKG 182
Query: 51 -----------DDCW------------GESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
DD W S L+ + H I + P
Sbjct: 183 LINHGWQYINIDDGWQGLRGGKYNGVMTNSKFPDMKGLADAVHAMGLKIGIYSGPWVGTY 242
Query: 88 QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGIDYL--KYDNCFND--- 142
G++ A P GT D+V +N +++ + G++Y KY ND
Sbjct: 243 AGHIGAYCDN-PDGT---YDWVE-RYANEYYRYVDTTKQTKHGVNYHHGKYSFVKNDVRQ 297
Query: 143 -----------DTRPTIRYRA--MSDALKKAGRPIFFSM---CEWGDMRPALWGSNIRNS 186
D P Y M DAL+ R + FS+ WGD A+ + N
Sbjct: 298 WMEWGMDYLKYDWNPNDVYHVTEMHDALRSHNRDVVFSLSNSAPWGD---AIQWERLANC 354
Query: 187 WRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG---------MKYNEY 237
WRTT DI D+W M + +A PG W DPDML VG G + +E
Sbjct: 355 WRTTGDIKDTWESMSRLGFNQTKWAPFVGPGHWIDPDMLVVGMVGWGPKLHYTRLTPDEQ 414
Query: 238 VVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IW 286
H S+WA+ +PLL+GCD+ L T+ ++ N+EVI VNQ +Q +
Sbjct: 415 YTHISLWALLSSPLLIGCDMAQLDDFTLSLLTNDEVIEVNQDPMGKQGIVIAEQGNIVTY 474
Query: 287 AGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTAKVD 343
A PL + V L +R + T W +GI+ T L+ + + T + +
Sbjct: 475 AKPLEDGSMAVGLFNRGNTMAKGTLTWKSVGIRGEQTVRDLWRQQDIATS-DVEFVTDIA 533
Query: 344 GHSCKMYELTP 354
H + +L P
Sbjct: 534 PHGVRFIKLYP 544
>gi|440696831|ref|ZP_20879279.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
gi|440280862|gb|ELP68543.1| alpha-galactosidase [Streptomyces turgidiscabies Car8]
Length = 698
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 148/368 (40%), Gaps = 104/368 (28%)
Query: 27 TPPMGWNSWNHFHCQINEKIIR-------------------ETDDCWGESSRDSQISLSC 67
+ PMGW SWN F I+ I+ DD W + +RDS
Sbjct: 59 SAPMGWASWNSFASSIDYNTIKTQADALVSSGMAAAGYKYVNIDDGWWQGARDSN----- 113
Query: 68 SFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA- 126
G +V +++ FP G KA+ADY+H S L I ++
Sbjct: 114 ---------------------GKIVTDESLFPGGMKAMADYIH---SKGLKAGIYTDAGK 149
Query: 127 ----------------------FEQ--------GIDYLKYDNC------FNDDTRPTIRY 150
++Q G D++K D C + +T
Sbjct: 150 QGCGYYYPTTRPAAPNTGSEGHYQQDMQTFQEWGYDFVKVDWCGGNAEGLDQETTYKAMS 209
Query: 151 RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDI-----SDSWARMLTIAD 205
A A G + S CEWG P WG+ WRT+ DI S S A MLT D
Sbjct: 210 AANDAAAAVTGHKLVLSFCEWGTGNPWNWGAGTAPMWRTSTDILFWGNSPSTANMLTNFD 269
Query: 206 MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETM 265
++ G +NDPDML VG G + H +WAIS APLL G ++ +++ T
Sbjct: 270 -KTLHPSSQHTGYYNDPDMLMVGISGFTAAQNRTHMGLWAISGAPLLAGNNLATMSSTTA 328
Query: 266 PIIGNEEVIAVNQ------------AINDQQIWAGPLSG-NRIVVLLESRKTFSSMMTAH 312
I+ N EVIA++Q Q+++ LSG + VLL +R + ++ +TA
Sbjct: 329 SILKNPEVIAIDQDARGLQGVKVAEDTKGLQVYSKVLSGTGKRAVLLLNRTSSAASVTAR 388
Query: 313 WDDIGIKS 320
W D+G+ +
Sbjct: 389 WADLGLTT 396
>gi|298251572|ref|ZP_06975375.1| coagulation factor 5/8 type domain protein [Ktedonobacter racemifer
DSM 44963]
gi|297546164|gb|EFH80032.1| coagulation factor 5/8 type domain protein [Ktedonobacter racemifer
DSM 44963]
Length = 863
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 170/396 (42%), Gaps = 76/396 (19%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL +TP MGWN++ NE+ I D S ++ + D
Sbjct: 58 LDNGLAITPYMGWNTYYGLGSNFNEQTIVSIADAMVSSGLEAA---GYQYVWIDGGWWSG 114
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA------------- 126
+ N N+ + T +P G + +ADY+H S L I ++
Sbjct: 115 TRDNNN----NITVDATQWPHGMQWIADYIH---SKGLKAGIYTDAGSNGCGGANQGSYG 167
Query: 127 -FEQ--------GIDYLKYDNCFND--DTRPTIRYRAMSDAL--KKAGRPIFFSMCE--- 170
++Q G D +K D C D P Y DAL + RP+ F++C
Sbjct: 168 HYQQDVDQFAAWGFDAVKVDFCGGHELDLDPATAYGQFRDALLNNNSHRPMLFNICNPFP 227
Query: 171 ---WGDMRPAL---------WGSNIRNSWRTTDDI----SDSWARMLTIADMNEVYADHA 214
+G P L +G N NSWRT D+ S W +L D + + + A
Sbjct: 228 PNVFGQNNPPLQKSAYYSYSFGPNTGNSWRTDTDVGFPNSVLWPDVLRNLDHDAAHPEAA 287
Query: 215 KPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVI 274
PG WNDPD L G GM E F++W++ APL++G D+ N++A ++ N +VI
Sbjct: 288 GPGHWNDPDYLGPGL-GMTDAESQAQFTMWSMVAAPLIVGSDIRNMSATIQGMLTNRDVI 346
Query: 275 AVNQAINDQQ-----------IWAGPLS-GNRIVVLLESRKTFSSMMTAHWDDIGIK--- 319
AV+Q Q +W PL+ G+R V LL +R T +T D G+
Sbjct: 347 AVDQDTLGVQGTRIKQDGNGDVWVKPLANGDRAVALL-NRGTSPLTITTSAADTGLAYAS 405
Query: 320 --SNNTLFEHRTLETKFQGKLTAKVDGHSCKMYELT 353
+ N L+ H++ ET G+++A V S +Y ++
Sbjct: 406 DYTLNNLWAHQSTETA--GQISATVAPQSAILYRVS 439
>gi|330821491|ref|YP_004350353.1| Secreted alpha-galactosidase [Burkholderia gladioli BSR3]
gi|327373486|gb|AEA64841.1| Secreted alpha-galactosidase [Burkholderia gladioli BSR3]
Length = 728
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 160/383 (41%), Gaps = 121/383 (31%)
Query: 30 MGWNSWNHFHCQINEKIIRE-------------------TDDCWGESSRDSQISLSCSFH 70
MGW SWN C INEK+I + DDCW E +RDSQ
Sbjct: 71 MGWASWNGHGCNINEKVIHDAADYLVSSGLKSAGYVYVNVDDCWSELTRDSQ-------- 122
Query: 71 MFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH-------ILIS--------- 114
G L N +FPSG AL DY+H I +
Sbjct: 123 ------------------GKLNGNHVSFPSGMTALGDYIHSKGLKYGIYATPGLRTCAQL 164
Query: 115 NSLDNRI-------ESN-------------IAFEQ-GIDYLKYDNCFNDDTRPTIR---Y 150
N+L N + E+N F Q G+DYLKYD C T + +
Sbjct: 165 NNLQNELAPGAPGNEANGNTGSLGHEALDAATFVQWGVDYLKYDWCTKGSTAAPAQVPVF 224
Query: 151 RAMSDALKKA----GRP--IFFSMC--EWGDMRPAL---WGSNIRNSWRTTDDIS----- 194
M DAL++A GR +F+S+ + D + WG N+ + WRT +DIS
Sbjct: 225 AKMRDALRQAALGAGRKGYLFYSINPDSFNDAKTGRDYDWG-NVADMWRTEEDISYDTGD 283
Query: 195 ---DSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
W+R++ + + G +NDPDM+ G G+ + H ++WAIS AP+
Sbjct: 284 NFTPGWSRLVGQNFEGNAFPEAQHTGRYNDPDMMLAGM-GLSDEQDRSHIALWAISGAPM 342
Query: 252 LLGCDVGNL--TAETMPIIGNEEVIAVNQ------------AINDQQIWAGPLSGN-RIV 296
+LG D A T+ ++ N E+IA++Q ++WA PL GN R
Sbjct: 343 ILGNDFTGRPPKAATLALLKNPEMIAIDQDGLGLQGVLVANPAPGVEVWAKPLMGNGRRA 402
Query: 297 VLLESRKTFSSMMTAHWDDIGIK 319
VLL + T + M+ W ++G++
Sbjct: 403 VLLFNNSTKDASMSVSWRELGLE 425
>gi|298376374|ref|ZP_06986329.1| alpha-galactosidase [Bacteroides sp. 3_1_19]
gi|298266252|gb|EFI07910.1| alpha-galactosidase [Bacteroides sp. 3_1_19]
Length = 530
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 159/384 (41%), Gaps = 65/384 (16%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN + I++K+I + D SR+ + + D
Sbjct: 22 PTMGWSSWNTYRVNISDKLIMKQADA---MSRNGLKEVGYKYINIDD-----GYFGGRDA 73
Query: 88 QGNLVANKTTFPSGTKALADYVHIL------ISNSLDNRIES---------NIAF----- 127
G L+ + FP+G K D++H L S++ N S N+ F
Sbjct: 74 SGELITHPVRFPNGLKQTVDHIHALGFKAGIYSDAGRNTCGSFWDKDSLGINVGFYGHDR 133
Query: 128 --------EQGIDYLKYDNCFNDDTR--------PTIRYRAMSDALKKAGRP-IFFSMCE 170
E G D++K D C D + RY A+ +A+ R + ++C
Sbjct: 134 QDADYFFKEIGFDFIKIDFCGGDAKQNFDQLGLDEQERYTAIHNAILATSRKDVRMNVCR 193
Query: 171 WGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNG 230
W P W ++ SWR + DI+ SW + I N + +A G +ND DMLE+G G
Sbjct: 194 WD--FPGTWVHDVAFSWRISQDINPSWESVKNIIRQNLYLSAYASEGKYNDMDMLEIGRG 251
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQIWA--- 287
M E HF +W I +PLL+GCD+ ++ + + ++ NEE+IA+NQ + Q +
Sbjct: 252 -MSEEEDKTHFGMWCIMSSPLLIGCDLTTISEKALRLLKNEELIALNQDVLGLQAYVVKQ 310
Query: 288 -----------GPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN---NTLFEHRTLETK 333
++GN V + + + + D ++ + LFE R L
Sbjct: 311 MDGVYILTKDLEEVNGNTCAVAVYNSTDEERTIHLDFADFYLRGDVSVRDLFERRDLGKY 370
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
+ + H +++ L + +
Sbjct: 371 KDRDFSVTIPAHGTRIFRLDGLER 394
>gi|378718792|ref|YP_005283681.1| alpha-galactosidase AgaA [Gordonia polyisoprenivorans VH2]
gi|375753495|gb|AFA74315.1| alpha-galactosidase AgaA [Gordonia polyisoprenivorans VH2]
Length = 425
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 148/360 (41%), Gaps = 64/360 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
GL TPPMGWNSWN F C I E+I+R D S L+ + + +
Sbjct: 40 GGLPATPPMGWNSWNSFGCTITEQIVRAQADALVSSG------LAAKGYRYVVVDDCWMA 93
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------------ISNSL 117
P N G LVA+ FPSG ALADYVH + S
Sbjct: 94 PDRN-AGGQLVADPVRFPSGMAALADYVHARGLKFGIYAGAREKTCAQFGGTYPGSAGSG 152
Query: 118 DNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
+ + AF G+D++KYD C D + AM DA++ GRP+ +S+ +
Sbjct: 153 GHELTDARAFASWGVDFVKYDWCSGDSAHDDQVASFTAMRDAIRSTGRPMVYSINPNSGI 212
Query: 175 RPALWGS-----NIRNSWRTTDDISDSWARMLTIADMNEVYA--------DHAKP----- 216
++ G+ + RTT+DIS W AD D P
Sbjct: 213 AGSVPGAQFDWGGVATMTRTTNDISPVWTTRPAGADATPASGYQGIRDIIDAVAPIGARV 272
Query: 217 --GGWNDPDMLEVGNG-GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G + D DML VG G + S+WA+ APL+ G D+ ++ T I+ NE +
Sbjct: 273 ADGSFVDMDMLVVGVGNALTPAMERTQMSMWAMMAAPLMAGNDLTAMSTRTRDILANEAI 332
Query: 274 IAVNQ---------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
I ++Q ++ +IW+ L +VV L +R + +G+ ++++
Sbjct: 333 IGIDQDERVRAGAMVGDNPEIWSRALGSKGLVVSLTNRADHPRTLKVSLSSLGLTGDSSV 392
>gi|54400708|ref|NP_001006103.1| alpha-galactosidase A [Danio rerio]
gi|53734040|gb|AAH83209.1| Galactosidase, alpha [Danio rerio]
gi|182890574|gb|AAI64755.1| Gla protein [Danio rerio]
Length = 338
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 131/297 (44%), Gaps = 57/297 (19%)
Query: 98 FPSGTKALADYVHILISNSLDNRIESNIAFEQ---------------------GIDYLKY 136
FP +K LADYVH S L I +++ + G+D LK+
Sbjct: 31 FPVASKKLADYVH---SKGLKLGIYADVGTKTCAGYPGSLGYYDIDAKTFADWGVDLLKF 87
Query: 137 DNCFNDDTRPTIR-YRAMSDALKKAGRPIFFS----MCEWGDMRPALWGSNIR---NSWR 188
D CF D Y MS AL + GR I +S + EW +P IR N WR
Sbjct: 88 DGCFMPDWHQLGEGYINMSSALNQTGRSIVYSCEWPLYEWQHQQPDY--EAIRKTCNHWR 145
Query: 189 TTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIW 244
D+ D W + +I A+ ++ A PGGWNDPDML +GN G+ ++ ++W
Sbjct: 146 NYGDVYDQWTSVKSILDWTAEKQKIVVPVAGPGGWNDPDMLIIGNFGLSRDQQQTQMALW 205
Query: 245 AISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ-----------IWAGPLSGN 293
AI APLL+ D+ ++ + ++ N+++IA+NQ +Q +W PLSGN
Sbjct: 206 AIMAAPLLMSNDLRDICPKAKELLQNKQIIAINQDPLGKQGYRILKADSFELWERPLSGN 265
Query: 294 RIVVLLESRK--------TFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQGKLTAKV 342
R+ V + +R+ T S + W K N T E Q L+ V
Sbjct: 266 RLAVAVVNRQEIGGPRRFTISVAIMPSWKLCNPKCNVTQILPTYKEMGVQNLLSEVV 322
>gi|292495629|sp|Q5AVQ6.2|AGALB_EMENI RecName: Full=Probable alpha-galactosidase B; AltName:
Full=Melibiase B; Flags: Precursor
gi|95025949|gb|ABF50888.1| putative alpha-galactosidase [Emericella nidulans]
Length = 455
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/402 (26%), Positives = 161/402 (40%), Gaps = 108/402 (26%)
Query: 16 RRNLL--SNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFD 73
R N L +G+G P +GW+SWN C IN +I + + L + D
Sbjct: 13 RANALMRPDGVGRLPALGWSSWNAHECDINATVILTAA---AQVVKLGLKDLGYEYINID 69
Query: 74 SLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNR-------- 120
I P ++ + FP G ++A +H L I +S
Sbjct: 70 DCWSIKTHRDPTT--NRMIPDADRFPDGIASVASQIHELGLKVGIYSSAGETTCAGYPAS 127
Query: 121 -----IESNIAFEQGIDYLKYDNC-----------------------FNDDTRPTI---- 148
I++ E IDYLKYD+C F + T P++
Sbjct: 128 LGYEDIDAETFAEWEIDYLKYDDCGVPDNWKDPYTFCVPDTANNAGPFPNGTCPSLPNPA 187
Query: 149 -------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISD 195
R+R M DAL R I +S+C WG+ WG+ I NSWR + DIS
Sbjct: 188 PANYNWSTSPSAERFRRMLDALNTQDRTILYSLCNWGNAAVNTWGAEIGNSWRMSGDISP 247
Query: 196 S-------------WARMLTIA-DMNEVYADHAKPGGWNDPDMLEVGN--GGMKYNEYVV 239
W R+ I +M+ + ++A+ GW D DMLEVGN GGM E
Sbjct: 248 GRGEVGPDRTRIAVWERIAEITNEMSFLVREYAEFWGWPDADMLEVGNGEGGMTVAENRA 307
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------------------- 278
HF++WA ++PLL+G + + E + I+ N ++ +Q
Sbjct: 308 HFALWAAMRSPLLIGTKLDTIRQEHLKILKNPTLLTFHQDPIINRPAYPYKWGYNADYTF 367
Query: 279 -AINDQQIWAGP---LSGNRIVVLLESRKTFSSMMTAHWDDI 316
A + + W+GP L G +VV+L S S+ M A W ++
Sbjct: 368 DAAHPAEYWSGPSPALEGT-LVVMLNSENVTSTRM-AVWSEV 407
>gi|380693886|ref|ZP_09858745.1| alpha-galactosidase [Bacteroides faecis MAJ27]
Length = 527
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 132/288 (45%), Gaps = 50/288 (17%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQ----ISLSCSFHMFDSLTQIAAKPL 83
P MGW+SWN +H I+E++I++ D + I++ F + T
Sbjct: 28 PIMGWSSWNTYHVNISEELIKQQADALIKHGLKEAGYNYINIDDGFFGYRDET------- 80
Query: 84 PNQCQGNLVANKTTFPSGTKALADYVHIL------ISNSLDN---RIESNIAFEQG---- 130
G + + FP+G K ++DY+H L S++ DN I N A G
Sbjct: 81 -----GKMHPHPDRFPNGMKVVSDYIHSLGLRAGIYSDAGDNTCGSIYDNDANGVGSGLY 135
Query: 131 ---------------IDYLKYDNCFNDDT--RPTIRYRAMSDALKKAGRP-IFFSMCEWG 172
D++K D C + RY A+ A+ GR + ++C W
Sbjct: 136 GHEQQDMDLYIKEWNYDFIKIDYCGGRELGLDEEKRYNAICQAIANTGRTDVSINICRWA 195
Query: 173 DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGM 232
P W + SWR + DI W + I + N + +A G +ND DMLE+G G +
Sbjct: 196 --FPGTWAKKLARSWRISPDIRPKWNSVKGIIEKNLYLSAYATGGHYNDMDMLEIGRG-L 252
Query: 233 KYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQAI 280
K NE VHF +W I +PLL+GCD+ + ++ ++ N+E+IA+NQ +
Sbjct: 253 KPNEEEVHFGMWCIMSSPLLIGCDMNTIPDFSLKLLKNKELIALNQDV 300
>gi|255531737|ref|YP_003092109.1| alpha-galactosidase [Pedobacter heparinus DSM 2366]
gi|255344721|gb|ACU04047.1| Alpha-galactosidase [Pedobacter heparinus DSM 2366]
Length = 535
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 110/428 (25%), Positives = 174/428 (40%), Gaps = 80/428 (18%)
Query: 2 FIVSSSAADSNGSDRRN---LLSNGLGLTPPMGWNSWNHFHCQIN-EKIIRET------- 50
+ V+ +A +S G +R ++ + + LTPPMGWNSWN + ++ EK++
Sbjct: 102 YEVTLTAKNSLGESKRTFKIVVGDQIALTPPMGWNSWNCWGDAVSQEKVLSSAKAMVEKG 161
Query: 51 -----------DDCWGESSRDSQISLSCS------------FHMFDSLTQIAAKPLPNQC 87
DD W ++ C+ H I + P
Sbjct: 162 LLNYGWQYINIDDGWQGLRGGKYNAIQCNSKFPDMKGLADEVHRMGLKIGIYSGPWVGTY 221
Query: 88 QGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIAFEQGIDYL--KYDNCFND--- 142
G+L A + D+V N + + GI+Y KY ND
Sbjct: 222 AGHLGA----YSDNADGTYDWVKQGKHNEFYRFADPEKKEKHGINYHHGKYSFVKNDVQQ 277
Query: 143 -----------DTRPTIRY--RAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRT 189
D P Y + M DAL+ R + +S+ A + NSWRT
Sbjct: 278 WMDWGMDYLKYDWNPNDVYHVKEMKDALRSYKRDVVYSLSNSAPYGDATQWEKMANSWRT 337
Query: 190 TDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGG---------MKYNEYVVH 240
T DI D+W RM + +A A PG W DPDML VG G + +E H
Sbjct: 338 TGDIRDTWERMCQLGFNQTKWAPFAGPGHWIDPDMLVVGMVGWGPKLHYTKLTADEQYTH 397
Query: 241 FSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ----------AINDQQ-IWAGP 289
S+W + +PLL+GCD+ L T+ ++ N EVI VNQ A N + ++A P
Sbjct: 398 ISLWCLLASPLLIGCDMAQLDDFTISLLTNNEVIDVNQDPMGKFGMLVAENGETVVYAKP 457
Query: 290 LSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT---LFEHRTLETKFQGKLTAKVDGHS 346
L + V L +R S +T +W +G++ T L+ + + Q + +++V+ H
Sbjct: 458 LEDGSMAVGLFNRGQKSEKITVNWKTLGLRGEQTVRDLWRQQDVAKSDQ-EFSSEVNPHG 516
Query: 347 CKMYELTP 354
+ ++ P
Sbjct: 517 VRFIKVYP 524
>gi|29349000|ref|NP_812503.1| alpha-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124406|ref|ZP_09945070.1| hypothetical protein BSIG_3567 [Bacteroides sp. 1_1_6]
gi|29340907|gb|AAO78697.1| alpha-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839098|gb|EES67182.1| hypothetical protein BSIG_3567 [Bacteroides sp. 1_1_6]
Length = 660
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 147/319 (46%), Gaps = 55/319 (17%)
Query: 2 FIVSSSAADSNGSDRRNL---LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS 58
F V A + G+D + + + + + LTPPMGWNSWN + +++ +R+ E
Sbjct: 250 FKVRLQATNEKGTDEKEITLKIGSEIMLTPPMGWNSWNCWRFAADDQKVRDAARIMHEKL 309
Query: 59 RDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------ 112
+ + ++ D + A + P +G L AN+ FP K L DY+H L
Sbjct: 310 Q----AYGWTYVNIDDGWE-ADERTP---EGELPANEK-FPD-FKTLTDYIHSLGLKFGI 359
Query: 113 ------------ISNSLDNRIESNIAFEQGIDYLKYDNCF-----NDDTRPTIR--YRAM 153
I + ++ + G+DYLKYD C + TI+ + M
Sbjct: 360 YSSPGWTTCGRHIGSCQHELTDAKTWEKWGVDYLKYDYCGYAAIEKNSEEKTIQEPFIVM 419
Query: 154 SDALKKAGRPIFFSMCEWGDMRPALWGSNIR---NSWRTTDDISDSWARMLTIADMNEVY 210
+AL + R I + + G P +W N WRTT DI+D W ++ I +V
Sbjct: 420 RNALDQIKRDIVYCV---GYGAPNVWNWGAEAGGNLWRTTRDINDQWNIVMAIGCFQDVC 476
Query: 211 ADHAKPGGWNDPDMLEVGNGG-----------MKYNEYVVHFSIWAISKAPLLLGCDVGN 259
A + PG +NDPDML VG G + +E H S+W+I APLLLGCD+
Sbjct: 477 AYVSAPGKYNDPDMLVVGKLGPGWGAKSHDSDLTADEQYAHISLWSILSAPLLLGCDMTA 536
Query: 260 LTAETMPIIGNEEVIAVNQ 278
+ T+ ++ N EVIAVNQ
Sbjct: 537 IDDFTLGLLTNPEVIAVNQ 555
>gi|329847137|ref|ZP_08262165.1| melibiase family protein [Asticcacaulis biprosthecum C19]
gi|328842200|gb|EGF91769.1| melibiase family protein [Asticcacaulis biprosthecum C19]
Length = 632
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 130/330 (39%), Gaps = 74/330 (22%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIR-------------------ETDDCWGESSRDSQI 63
G TPPMGWNSWN F +++E + DD W R S
Sbjct: 37 GNAATPPMGWNSWNAFRTEVDEAKVMGAAQKLVDTGLAKLGYRYVNIDDGWWLKRRKSDG 96
Query: 64 SLSCSFHMFDSL------TQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISN-- 115
+ +F S +P ++ + +G A + + N
Sbjct: 97 RMVIRTQIFPSAQPKNKKADTTFRPFTDKLHSMGLKAGIYSDAGYNACSQAYDLHSPNLP 156
Query: 116 ---------SLDNRIESNIAF---EQGIDYLKYDNC----FNDDTRPTIRY--------- 150
L ++ +IA E G DY+K D C + D +Y
Sbjct: 157 EGDTAERSVGLYGHVDQDIALYFKEWGFDYIKVDACGLNVYGPDREIVAKYQYQPFSPLI 216
Query: 151 ------RAMSDALKKAGRPI-------------FFSMCEWGDMRPALWGSNIRNSWRTTD 191
R D +K I FS+C WG WG+++ N WRT+
Sbjct: 217 DSLSIGRTKVDEVKAIYADIGAAIARHNPDGDYVFSICNWGTSDVRSWGNDVGNLWRTSG 276
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---EYVVHFSIWAISK 248
DI+ +W+RML D A +A+PG WNDPDML VG+G N E HFS+WA+
Sbjct: 277 DITPTWSRMLHTFDSAATRALYARPGAWNDPDMLFVGHGDFDANHLTEAKSHFSLWAMIN 336
Query: 249 APLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
APLL+G D+ + + I GN ++ VNQ
Sbjct: 337 APLLIGYDLRDAPQSLIDIWGNAGLVRVNQ 366
>gi|427401951|ref|ZP_18893023.1| hypothetical protein HMPREF9710_02619 [Massilia timonae CCUG 45783]
gi|425719142|gb|EKU82079.1| hypothetical protein HMPREF9710_02619 [Massilia timonae CCUG 45783]
Length = 661
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 134/339 (39%), Gaps = 89/339 (26%)
Query: 21 SNGLGLTPPMGWNSWNHFHCQINE-KIIRET------------------DDCWGESSRDS 61
++G TPPMGW+SWN F +++E K++ DD W R
Sbjct: 51 TSGSASTPPMGWSSWNAFRTEVDEGKVLGAAQTLVDSGLARLGYQHVNIDDGWWLKRRTG 110
Query: 62 QISLSCSFHMFDSLTQ-----IAAKPLP------------------NQCQGNLVANKTTF 98
L ++F S + KP N C +
Sbjct: 111 DGRLQIRTNIFPSAATGGPAGTSFKPFTDKLHAMGLKAGIYTDIGRNACSQAYDLHSPNL 170
Query: 99 PSGTKALADYVHILISNSLDNRIESNIAF---EQGIDYLKYDNCFNDDTRP--------- 146
P GT A + L+ + +I E G DY+K D C D P
Sbjct: 171 PQGTTAEREV-------GLEGHVTQDINLYFKEWGFDYIKIDACGLADFLPDSDLVKKQD 223
Query: 147 ---------------------TIRYRAMSDALKKAGRP---IFFSMCEWGDMRPALWGSN 182
RY ++ ALK+A RP S+C WG WG +
Sbjct: 224 YRAAPPLIERGSINRTDVKAVRARYEDVAAALKEA-RPNNDYVLSICAWGMANVRTWGKD 282
Query: 183 IRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYN---EYVV 239
+ N WRT+ DI+ SW ML D A +A PG WNDPD+L +G+G E
Sbjct: 283 VGNLWRTSADITPSWTSMLHNFDSAAKRALYAGPGHWNDPDILHIGHGAFDAANPVEVRS 342
Query: 240 HFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
HFS+WA+ APLL+ D+ N A + ++GN +V+A+NQ
Sbjct: 343 HFSLWAMINAPLLISYDLRNGPASFLGVLGNADVVALNQ 381
>gi|148905920|gb|ABR16121.1| unknown [Picea sitchensis]
Length = 228
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 73/191 (38%), Positives = 83/191 (43%), Gaps = 70/191 (36%)
Query: 17 RNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDCWGES 57
R+LL NGL TPPMGWNSWNHF C I+EKI+R+T DDCW E
Sbjct: 34 RSLLENGLARTPPMGWNSWNHFGCNIDEKIVRKTADALISTGFKNLGYEYVNIDDCWAEH 93
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSL 117
R+ G LV +TFPSG KALADYVH S L
Sbjct: 94 KRNK--------------------------DGRLVPKASTFPSGIKALADYVH---SKGL 124
Query: 118 DNRIESNIAF----------------------EQGIDYLKYDNCFNDDTRPTIRYRAMSD 155
I S+ F E G+DYLKYDNC D +RP +RY M D
Sbjct: 125 KLGIYSDAGFRTCSGQQPGSLGYEKIDADTFAEWGVDYLKYDNCNTDRSRPELRYPKMRD 184
Query: 156 ALKKAGRPIFF 166
AL G F
Sbjct: 185 ALLGTGVQYFI 195
>gi|353232943|emb|CCD80298.1| putative alpha-galactosidase/alpha-n-acetylgalactosamini da se
[Schistosoma mansoni]
Length = 408
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 140/347 (40%), Gaps = 105/347 (30%)
Query: 24 LGLTPPMGWNSWNHFHCQ----------INEKIIRET-------------------DDCW 54
+ PPMGW +W F CQ I+EK+I+ T DDCW
Sbjct: 1 MATVPPMGWMTWQRFRCQTDCQNYPDGCISEKLIKRTADRLVLDGWRDLGYRYVIVDDCW 60
Query: 55 GESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---I 111
E RDS+ + +VA+ FP+G K++ Y+H +
Sbjct: 61 PEWKRDSRTN-------------------------EIVADNKRFPNGIKSVGQYLHSKNL 95
Query: 112 LISNSLDN---------------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDA 156
LD +++ + DY+K D C + + Y S
Sbjct: 96 RFGIYLDYGTRTCEGYPGSMNYLELDAKSVAKWEADYVKMDGCNSPENIMPDGYGKFSKL 155
Query: 157 LKKAGRPIFFSMCEWGDMRPAL--WGSNIR-----------NSWRTTDDISDSWARMLTI 203
L GRPI FS C + PA W N N WR D+ DSW+ +++I
Sbjct: 156 LNATGRPIVFS-CSY----PAYISWMKNTSLIDWKKLQNNCNLWRMLGDVQDSWSSVISI 210
Query: 204 AD----MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN 259
+ N V A PG WNDPDML +GN G+ ++ V +W + APLL+ D+
Sbjct: 211 INAYKLQNAVLPKLAGPGHWNDPDMLLMGNFGLSDDQKRVQMGMWCMFSAPLLISADMDK 270
Query: 260 LTAETMPIIGNEEVIAVNQ-----------AINDQQIWAGPLSGNRI 295
L ++ ++ N ++A++Q + ND Q+W PL G+ I
Sbjct: 271 LDNFSVSLLRNARLLAIDQDKGGHQAEFVKSQNDVQMWVRPLDGDPI 317
>gi|452836475|gb|EME38419.1| glycoside hydrolase family 27 protein [Dothistroma septosporum
NZE10]
Length = 433
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/428 (25%), Positives = 171/428 (39%), Gaps = 126/428 (29%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINE-------KIIRET------------DDCWGESSRDSQ 62
NG+G P +G++++N F C N +I+R+ DDC+ R++
Sbjct: 20 NGVGKLPALGYDTFNAFGCDYNASSVLAQARIMRDVGLVEAGYNILILDDCYALKQRNAS 79
Query: 63 ISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH---ILISNSLDN 119
G++VA+ FP G A + V I ++ +N
Sbjct: 80 --------------------------GHMVADPEKFPDGLPAFSAQVERLGISLAAYGNN 113
Query: 120 RIESNIAFEQ---------------GIDYLKYDNCF---NDDTRPTI--RYRAMSDALKK 159
E+ + G+ YLKYDNC+ ++ T+ + RY MSDA+ K
Sbjct: 114 GYETCAGYPGSYGRELQDLQTWHSWGMKYLKYDNCYIPADNITQQNMLGRYTRMSDAIAK 173
Query: 160 ----AGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAK 215
G FS+CEWG +P +WG + +WR DI W+ + I D
Sbjct: 174 FATQTGCTFEFSLCEWGWEQPWIWGKRLAQAWRIDGDIKPFWSAISAIIDQVSFQYWATD 233
Query: 216 PGGWNDPDMLEVGN-------GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
G ND D+LEVGN G + Y E HF+ WA+ K+PL++G D+ + +T+ I+
Sbjct: 234 FYGHNDMDILEVGNTGVGTPPGNLTYEESKSHFTAWALMKSPLIIGTDLTEASTQTLNIL 293
Query: 269 GNEEVIAVNQAIN----------------------DQQIWAGPLSGNRIVVLLESR---K 303
N+ +I +NQ N + W+G S ++++L S+
Sbjct: 294 RNQNLIKINQDPNVGEGISPFSWGVNPDYVSNPAHPAEYWSGNSSYGVVIMVLNSQDQPA 353
Query: 304 TFSSMMTAHW----------------DDIGIKSNNTLFEH-----RTLETKFQGKLTAKV 342
T S +T W + G+ N FE R L G A +
Sbjct: 354 TMSFNLTQSWALRAGRQYSVYDMWTQEQTGVAVRNMTFELPPHGVRALLLNDAGPEPASL 413
Query: 343 DGHSCKMY 350
DG SC +Y
Sbjct: 414 DG-SCGLY 420
>gi|198276282|ref|ZP_03208813.1| hypothetical protein BACPLE_02474 [Bacteroides plebeius DSM 17135]
gi|198270724|gb|EDY94994.1| alpha-galactosidase [Bacteroides plebeius DSM 17135]
Length = 542
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 132/282 (46%), Gaps = 42/282 (14%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN + INE +I + D E Q + ++ D K
Sbjct: 34 PLMGWSSWNTYRVNINEALICKQADAMVEQG--LQAAGYSYVNIDDGFFGYRDK------ 85
Query: 88 QGNLVANKTTFPSGTKALADYVHI------LISNSLDN---------------------R 120
+G L + FP+G K +ADY+H + S++ N R
Sbjct: 86 EGLLHTHDQRFPNGIKPVADYIHSKGLKAGIYSDAGSNTCGSIWDNDRNGVGVGLYGFER 145
Query: 121 IESNIAFEQ-GIDYLKYDNC--FNDDTRPTIRYRAMSDALKKA-GRPIFFSMCEWGDMRP 176
++++ F + G D++K D C D RY + +A+++ R I ++C W P
Sbjct: 146 QDADLYFNKWGFDFIKIDYCGALQLDLDERKRYTEIVEAIREVCPRNISVNICRWA--YP 203
Query: 177 ALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNE 236
W ++ SWR + DIS W + +I N + +A G +ND DMLE+G G + E
Sbjct: 204 GTWVKDLARSWRISPDISPKWESVRSIIYKNLYLSAYAGEGHYNDMDMLEIGRG-LNPEE 262
Query: 237 YVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
VHF +W I +PLL+GCD+ + ++ ++ N E+IA+NQ
Sbjct: 263 EEVHFGMWCIMSSPLLIGCDLTRIPEASLELLKNPELIALNQ 304
>gi|427387430|ref|ZP_18883486.1| hypothetical protein HMPREF9447_04519 [Bacteroides oleiciplenus YIT
12058]
gi|425725384|gb|EKU88256.1| hypothetical protein HMPREF9447_04519 [Bacteroides oleiciplenus YIT
12058]
Length = 537
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 139/315 (44%), Gaps = 59/315 (18%)
Query: 1 LFIVSSSAADSNGSDRRNLLSNGLGLTPP--MGWNSWNHFHCQINEKIIRETDDCWGESS 58
L +V+SS +N L PP MGW+SWN +H INE++I++ D
Sbjct: 13 LILVASSCTPTNEK-------TALEAFPPPLMGWSSWNTYHVNINEELIKKQADALVTHG 65
Query: 59 RDS----QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-- 112
I++ F + T G + A+ FP G + ++DY+H L
Sbjct: 66 LKDVGYLYINVDDGFFGWRDET------------GKMHAHPERFPKGMRPISDYIHSLGL 113
Query: 113 ----ISNSLDNRIESNI------------AFEQ----------GIDYLKYDNCFNDDT-- 144
S++ DN S EQ D++K D C +
Sbjct: 114 KAGIYSDAGDNTCGSIYDDDANGVGSGLYGHEQQDMDLYLKEWNYDFIKIDYCGGRELGL 173
Query: 145 RPTIRYRAMSDALKKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTI 203
RY + +A+K GR + ++C W P W ++ SWR + DI W + I
Sbjct: 174 DEEKRYTTICEAIKNTGRTDVSINICRWA--FPGTWAKSMARSWRISSDIRPRWESVKYI 231
Query: 204 ADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAE 263
N + +A G +ND DMLEVG G ++ E VHF +W I +PLL+GCD+ +
Sbjct: 232 IRKNLYLSAYAGEGHYNDMDMLEVGRG-LQQEEEEVHFGMWCIMSSPLLIGCDMTTIPET 290
Query: 264 TMPIIGNEEVIAVNQ 278
++ ++ N+E+IA+NQ
Sbjct: 291 SLALLKNKELIALNQ 305
>gi|359769537|ref|ZP_09273295.1| putative glycoside hydrolase [Gordonia polyisoprenivorans NBRC
16320]
gi|359313053|dbj|GAB26128.1| putative glycoside hydrolase [Gordonia polyisoprenivorans NBRC
16320]
Length = 413
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 148/360 (41%), Gaps = 64/360 (17%)
Query: 22 NGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAK 81
GL TPPMGWNSWN F C I E+I+R D S L+ + + +
Sbjct: 28 GGLPATPPMGWNSWNSFGCTITEQIVRAQADALVSSG------LAAKGYRYVVVDDCWMA 81
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVHIL------------------------ISNSL 117
P N G LVA+ FPSG ALADYVH + S
Sbjct: 82 PDRN-AGGQLVADPVRFPSGMAALADYVHARGLKFGIYAGAREKTCAQFGGTYPGSAGSG 140
Query: 118 DNRIESNIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
+ + AF G+D++KYD C D + AM DA++ GRP+ +S+ +
Sbjct: 141 GHELTDARAFASWGVDFVKYDWCSGDSAHDDQVASFTAMRDAIRSTGRPMVYSINPNSGI 200
Query: 175 RPALWGS-----NIRNSWRTTDDISDSWARMLTIADMNEVYA--------DHAKP----- 216
++ G+ + RTT+DIS W AD D P
Sbjct: 201 AGSVPGAQFDWGGVATMTRTTNDISPVWTTRPAGADATPASGYQGIRDIIDAVAPIGARV 260
Query: 217 --GGWNDPDMLEVGNG-GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEV 273
G + D D+L VG G + S+WA+ APL+ G D+ ++ T I+ NE +
Sbjct: 261 ADGSFVDMDVLVVGVGNALTPAMERTQMSMWAMMAAPLMAGNDLTAMSTRTRDILANEAI 320
Query: 274 IAVNQ---------AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTL 324
I ++Q ++ +IW+ L +VV L +R + +G+ ++++
Sbjct: 321 IGIDQDERVRAGAMVGDNPEIWSRALGSKGLVVSLTNRADHPRTLKVSLSSLGLTGDSSV 380
>gi|414589481|tpg|DAA40052.1| TPA: hypothetical protein ZEAMMB73_445343 [Zea mays]
Length = 745
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 90/168 (53%), Gaps = 22/168 (13%)
Query: 88 QGNLVANKTTFPSG--TKALADYVH-----ILISNSLDNRIESNIAFEQGIDYLKYDNCF 140
+GNL NK S +AL + H IL+ N+ + ++ N+A + K
Sbjct: 386 RGNLDDNKENVSSALSEEALDNSSHYGSENILLENNENVPLKENVALKVAKLQKKV---- 441
Query: 141 NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARM 200
++ + LKK P + + G PALW S + NSWRTTDDI+D+W M
Sbjct: 442 ---------HQEQAGKLKKTTNPRV--LWDGGQYDPALWASKVGNSWRTTDDITDTWKSM 490
Query: 201 LTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISK 248
IAD N +A +A PGGWNDPDMLEVGNGGM EY HFSIWA+ K
Sbjct: 491 TDIADKNNKWASYAGPGGWNDPDMLEVGNGGMTLAEYRSHFSIWALMK 538
>gi|393912471|gb|EJD76753.1| hypothetical protein, variant [Loa loa]
Length = 394
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/345 (27%), Positives = 135/345 (39%), Gaps = 90/345 (26%)
Query: 27 TPPMGWNSWNHFHCQ----------INEKIIRE-------------------TDDCWGES 57
TPPMGW SW FHCQ I+EK+ + DDCW E
Sbjct: 6 TPPMGWMSWTKFHCQMDCVHHPFTCISEKLYMDMADRMVDDGYLQAGYQYVHVDDCWMER 65
Query: 58 SRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHI------ 111
RD L+ ++ FP G ALADY+H
Sbjct: 66 KRDQ--------------------------HNRLIPDRQRFPGGMAALADYMHQRGLKFG 99
Query: 112 ------------LISNSLDNRIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKK 159
+ +I+++ E +DYLK D C D Y M L
Sbjct: 100 IYEDYGTATCAGFPGSYQYTKIDADTFAEWQVDYLKLDGCNIDVNLMPSGYAEMGRMLNL 159
Query: 160 AGRPIFFSMCEWG--------DMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYA 211
GRPI +S C W + L G + N WR DDI SWA + TI D + +
Sbjct: 160 TGRPIVYS-CSWPAYLINQPEKVDYQLIGHHC-NLWRNFDDIKRSWASVRTIIDYYDYHQ 217
Query: 212 DH----AKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPI 267
D PG W+DPDM+ VGN + ++ V SIW+I +PL++ D+ + I
Sbjct: 218 DKHIPAQGPGRWHDPDMIIVGNTELTVDQAKVQMSIWSIWSSPLIMSNDLRLIAPVFRDI 277
Query: 268 IGNEEVIAVNQAINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAH 312
+ N VIA++Q D G L N + + + ++ +AH
Sbjct: 278 LLNRRVIAIDQ---DSLGIMGRLVANTTDIGIYVKPVIPAVPSAH 319
>gi|255617788|ref|XP_002539881.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
gi|223501520|gb|EEF22502.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Ricinus communis]
Length = 237
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 88/138 (63%), Gaps = 9/138 (6%)
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR-NSWRTTDDISDSWARMLTIADMNE 208
Y+ M D L+K R I +S+C++G WG + +SWRTT+DI+D+WA + IA +
Sbjct: 4 YQIMGDYLRKQHRDIVYSLCQYGMSDVWKWGDKVGGSSWRTTNDITDTWASVKNIALAQD 63
Query: 209 VYADHAKPGGWNDPDMLEVGNGG--------MKYNEYVVHFSIWAISKAPLLLGCDVGNL 260
A+ +KPG WNDPDML VG G ++ +E +H S+W++ APLL+GCD+ L
Sbjct: 64 KAAEWSKPGNWNDPDMLVVGVVGWGNPHATKLRPDEQYLHLSLWSLFSAPLLIGCDMEKL 123
Query: 261 TAETMPIIGNEEVIAVNQ 278
A T+ ++ N+EVIA++Q
Sbjct: 124 DAFTLNLLTNDEVIAIDQ 141
>gi|359772380|ref|ZP_09275809.1| putative glycoside hydrolase [Gordonia effusa NBRC 100432]
gi|359310480|dbj|GAB18587.1| putative glycoside hydrolase [Gordonia effusa NBRC 100432]
Length = 407
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 162/377 (42%), Gaps = 58/377 (15%)
Query: 23 GLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESS-RDSQISLSCSFHMFDSLTQIAAK 81
G+ TPPMGWNSW F C + E IR D S RD+ + + D +A+
Sbjct: 41 GVAPTPPMGWNSWQPFGCAVTEAQIRAQADALVSSGLRDA----GYRYVVVDDCWNASAR 96
Query: 82 PLPNQCQGNLVANKTTFPSGTKALADYVH-------ILI-------------------SN 115
G L A+ T FPSG AL +Y+H + + S
Sbjct: 97 ----ANDGALQADSTRFPSGMAALGEYLHERGLKFGVYVGASDKTCTQYQGHYPGATGSR 152
Query: 116 SLDNRIESNIAFEQGIDYLKYDNCFNDDTR--PTIRYRAMSDALKKAGRPIFFSM---CE 170
++ R + +A G D++K D C ++ + A +AL+ GRP+ S+
Sbjct: 153 GVETRDAATLA-SWGADFVKADWCSSNGRHDDQVQAFTAWRNALRAVGRPMVLSINPNSG 211
Query: 171 WGDMRPAL---WGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEV 227
P WG + R T+DI+ ++ +L IAD + A + +NDPDML V
Sbjct: 212 VSGTPPGQTYDWG-GVATMTRVTNDIAPTFDSVLGIADAVGLVAPRTRIDAFNDPDMLVV 270
Query: 228 GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ--------- 278
G G + H S+WA+ APL+LG D+ L+ + ++ N+ ++A++Q
Sbjct: 271 GQG-LSTPHARTHMSLWAMMAAPLMLGTDLTRLSTSDLSLVANKAMVALDQDARVVSGAP 329
Query: 279 AINDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNTLFEHRTLETKFQ--- 335
D Q+W+ + + V + +R + ++ +T +G+ + ++Q
Sbjct: 330 VAGDAQVWSRAIGHKGLAVSMTNRTSKATTITVSLASLGLTGDTVAGVDVWTSKRYQARD 389
Query: 336 GKLTAKVDGHSCKMYEL 352
G L+ +V + E+
Sbjct: 390 GALSVRVAPGDTALLEI 406
>gi|325298542|ref|YP_004258459.1| Alpha-galactosidase [Bacteroides salanitronis DSM 18170]
gi|324318095|gb|ADY35986.1| Alpha-galactosidase [Bacteroides salanitronis DSM 18170]
Length = 535
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 140/310 (45%), Gaps = 56/310 (18%)
Query: 4 VSSSAADSNGSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDS- 61
+S+ NG ++ L P MGW+SWN +H INE++I++ D + +D+
Sbjct: 16 ATSACTQENGQEKAGFLP------PLMGWSSWNTYHVNINEELIKKQADALVSQGLKDAG 69
Query: 62 --QISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL------I 113
I++ F T G + A+ FP+G + ++DY+H L
Sbjct: 70 YLYINVDDGFFGHRDET------------GKMHAHPERFPNGMRVVSDYIHSLGLKAGIY 117
Query: 114 SNSLDNRIESNI------------AFEQ----------GIDYLKYDNCFNDDT--RPTIR 149
S++ DN S EQ D++K D C + R
Sbjct: 118 SDAGDNTCGSIYDDDANGVGAGLYGHEQQDMDLYLKEWNYDFIKIDYCGARELGLDEEKR 177
Query: 150 YRAMSDALKKAGRP-IFFSMCEWGDMRPALWGSNIRNSWRTTDDISDSWARMLTIADMNE 208
Y + +A++ GR + ++C W P W + SWR + DI +W + I + N
Sbjct: 178 YTTICEAIRNTGRTDVSINICRWA--FPGTWARQLARSWRISPDIRPNWNSVKHIIEKNL 235
Query: 209 VYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPII 268
+ +A G +ND DMLE+G G+ E HF +W I +PLL+GCD+ + ++ ++
Sbjct: 236 YLSAYAGEGHYNDMDMLEIGR-GLSPAEEETHFGMWCIMSSPLLIGCDLTTIPEASLALL 294
Query: 269 GNEEVIAVNQ 278
N+E+IA+NQ
Sbjct: 295 KNKELIALNQ 304
>gi|429725686|ref|ZP_19260508.1| alpha-galactosidase [Prevotella sp. oral taxon 473 str. F0040]
gi|429149256|gb|EKX92238.1| alpha-galactosidase [Prevotella sp. oral taxon 473 str. F0040]
Length = 729
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 135/301 (44%), Gaps = 53/301 (17%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCW-GESSRDSQISLSCSFHMFDSLTQIAAKPLPNQ 86
P MGW SWN +I+ ++++ D + +D+ +F + D L +
Sbjct: 304 PMMGWISWNVVQDKISTDVVKKVSDAMVKQGLKDA----GYNFLIIDDLWHAKTR----H 355
Query: 87 CQGNLVANKTTFPSGTKALADYVHI------LISNSLD------------NRIESNIAFE 128
G + FP G KA DY H + S++ D +I++ E
Sbjct: 356 ADGRPQEDPAKFPVGMKATVDYAHSKGLKFGIYSDAADRTCAGAFGSFGKEKIDAKQYAE 415
Query: 129 QGIDYLKYDNCF--NDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS 186
G+D LKYD C D +RY+AM DALK +GR I MCEWG P WGS +
Sbjct: 416 WGVDLLKYDYCHAPGDAATAQVRYKAMGDALKASGRNILLYMCEWGAREPWKWGSTTGSP 475
Query: 187 -WRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDMLEV----------- 227
WR T D D W + +IA M +++ ++ +ND DM+ V
Sbjct: 476 VWRATYDTRDGWNGVQGGIGIIQSIAAMKDLWP-YSGVNRFNDADMMCVGIHGTGKSSND 534
Query: 228 ---GNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN-LTAETMPIIGNEEVIAVNQAINDQ 283
G GM +EY F++W + +PLLL D+ +TA+ ++ N ++IA++Q Q
Sbjct: 535 LVAGKPGMTQDEYRTQFALWCMWSSPLLLSFDLTKPITADDKKLMTNADLIALDQDDLGQ 594
Query: 284 Q 284
Q
Sbjct: 595 Q 595
>gi|148672568|gb|EDL04515.1| N-acetyl galactosaminidase, alpha, isoform CRA_b [Mus musculus]
Length = 360
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 103/220 (46%), Gaps = 31/220 (14%)
Query: 89 GNLVANKTTFPSGTKALADYVHIL------------------ISNSLDN-RIESNIAFEQ 129
G L+ + FP G LADY H L +LD +++ E
Sbjct: 34 GRLIPDPKRFPHGIAFLADYAHSLGLKLGIYEDMGKMTCMGYPGTTLDKVELDAETFAEW 93
Query: 130 GIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW----GDMRPAL---WGSN 182
+D LK D CF+ Y M+ AL GRPI FS C W G + P + S
Sbjct: 94 KVDMLKLDGCFSSSRERAEGYPKMAAALNATGRPIAFS-CSWPAYEGGLPPKVNYTEVSR 152
Query: 183 IRNSWRTTDDISDSWARMLTIAD----MNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
+ N WR DI DSW +L+I D +V A PG WNDPDML +GN G+ ++E
Sbjct: 153 VCNLWRNYKDIQDSWKSVLSILDWFVRHQDVLQPVAGPGHWNDPDMLLIGNFGLSFDESR 212
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
++W + APLL+ D+ ++ + M I+ N +I +NQ
Sbjct: 213 AQMALWTVLAAPLLMSTDLRTISPQNMDILQNPLMIKINQ 252
>gi|429853811|gb|ELA28859.1| alpha-galactosidase 5 precursor [Colletotrichum gloeosporioides
Nara gc5]
Length = 553
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 162/391 (41%), Gaps = 68/391 (17%)
Query: 27 TPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKP-LPN 85
TP MGWNS+N C N I + S S+ + + F A++ N
Sbjct: 25 TPVMGWNSYNQVSCSPNHDKIEAAIN-----SLASRGFVDAGYKFFQIDCGWASRDGQRN 79
Query: 86 QCQGNLVANKTTFPSGTKALADY----------------------VHILISNSLDNR-IE 122
G L N FP G + L+D V ++ SL + ++
Sbjct: 80 DNSGALKINLDAFPQGLQPLSDLARSKGMKWTMYSDAGEKMCDPEVPSPVAGSLGHEAVD 139
Query: 123 SNIAFEQGIDYLKYDNCFN-----DDTRP-------TIRYRAMSDALKKAGRPIFFSMCE 170
++ G +Y+KYDNC+ DD P R+ M L++ G P +C+
Sbjct: 140 ADFFKSLGTEYVKYDNCYASGSSADDNAPKSARDDFVTRFTTMWKELQRVGIPGML-ICQ 198
Query: 171 WGD--------MRPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHA--KPGGWN 220
WG PA W + S+R +DDI+ W + I + A PG
Sbjct: 199 WGVPFSSSTGLQGPAQWTKGLSTSFRLSDDIATGWGNVYRIYNQAIHIAKSGIIGPGHIA 258
Query: 221 DPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQA- 279
D D+LEVGN GM ++E HF+ WA+ K+ L++ D+ L+ ET+ ++ N+++IA+NQ
Sbjct: 259 DADLLEVGNTGMTFDEQATHFASWAMLKSALMISTDLAALSDETVALLQNKDLIAINQDP 318
Query: 280 -----------INDQQIWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIGIKSN--NTLFE 326
D+ +WAG L + VL+ + + + I S L++
Sbjct: 319 AVKPIQLVQRWTGDRDLWAGDLDNGDVAVLVVDLTNTARTLDVQLSQLNITSAAIKDLWD 378
Query: 327 HRTLETKFQGKLTAKVDGHSCKMYELTPVTK 357
+T++ + +V+ H L+ V +
Sbjct: 379 SKTVDGAT--SFSKQVNAHGSLALRLSNVKR 407
>gi|322708363|gb|EFY99940.1| alpha-galactosidase [Metarhizium anisopliae ARSEF 23]
Length = 446
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 153/387 (39%), Gaps = 84/387 (21%)
Query: 13 GSDRRNLLSNGLGLTPPMGWNSWNHFHCQINEKIIRET-------------------DDC 53
G+ RN+ G P +GWNSWN + C INE+ + DDC
Sbjct: 18 GTLARNVPHEPSGKLPTLGWNSWNAYRCDINEQHFLDAAQALVDTGLRDAGYNYVNIDDC 77
Query: 54 WGESSRDSQISLSCSFHMF-DSLTQIAAKPLPNQCQGNLVANKTTFP-SGTKALADYVHI 111
W E + ++ + F D + +A K + + + + T +G A Y +
Sbjct: 78 WSERTGRVNGHIAVNKTRFPDGIDGLANKIHDMKLKLGIYSTAGTLTCAGYPASLGYEDV 137
Query: 112 LISNSLDNRIESNIAFEQGIDYLKY-DNCF-----NDD---------------------- 143
++ ++S A G++ + DNC+ D+
Sbjct: 138 DAADFAKWGVDSRTAVRHGLNRNSHSDNCYIPKQWQDEYIYCEEDGAQIGPNGTCSRSQN 197
Query: 144 ------------TRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNSWRTTD 191
++ R+ M DAL + R I +++C WG WG SWR +
Sbjct: 198 PRLAPDGYDWSKSKSAQRFNRMRDALARQDREILYNLCIWGTADVTSWGRGTATSWRMSG 257
Query: 192 DISDSWARMLTIADMNEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAPL 251
DIS W + I +MN ND DMLEVGNG + E HF++WA K+PL
Sbjct: 258 DISPRWRSVTHILNMNSFKMGAVGFHAHNDADMLEVGNGDLSPAETRSHFALWAAMKSPL 317
Query: 252 LLGCDVGNLTAETMPIIGNEEVIAVNQ----------------------AINDQQIWAGP 289
L+G D+ L+ + ++ N ++A +Q + + + WAGP
Sbjct: 318 LIGTDLRRLSRRNLDLLANRRLLAFHQDAGHGGPAAPYKWGVNPDWTYNSTHPAEYWAGP 377
Query: 290 LSGNRIVVLLESRKTFSSMMTAHWDDI 316
+G +V++L + + TA W +I
Sbjct: 378 STGGHLVLMLNTLGV-TVRKTAAWGEI 403
>gi|153806029|ref|ZP_01958697.1| hypothetical protein BACCAC_00278 [Bacteroides caccae ATCC 43185]
gi|423221090|ref|ZP_17207583.1| hypothetical protein HMPREF1061_04356 [Bacteroides caccae
CL03T12C61]
gi|149130706|gb|EDM21912.1| alpha-galactosidase [Bacteroides caccae ATCC 43185]
gi|392621960|gb|EIY16098.1| hypothetical protein HMPREF1061_04356 [Bacteroides caccae
CL03T12C61]
Length = 545
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 132/292 (45%), Gaps = 49/292 (16%)
Query: 26 LTPP-MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLP 84
+TPP MGW+SWN F I+E II+ D E +H + K
Sbjct: 34 ITPPIMGWSSWNAFRVDISEDIIKHQADLMVEKGLKD-----AGYHYINVDDGFFGKRDD 88
Query: 85 NQCQGNLVANKTTFPSGTKALADYVHIL--------------ISNSLDNRI--------- 121
N G + N+ FP+G K +AD++H L + DN +
Sbjct: 89 N---GIMFTNEKRFPNGMKPVADHIHSLGMKAGIYTDAGNNTCGSIWDNDLAGVGAGIYG 145
Query: 122 ----ESNIAF-EQGIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDM 174
++ + F + G D++K D C D RY ++ +++ K + + ++C W
Sbjct: 146 HEPQDAQLYFGDWGFDFIKIDYCGGDVLGLDEEERYTSIRNSIDKVNKNVSVNICRWA-- 203
Query: 175 RPALWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML--------E 226
P W ++ SWR + DI+ W + + N + +A G +ND DM+ +
Sbjct: 204 FPGTWAKDVATSWRISGDINAHWGSLKYVVRKNLYLSAYAGNGHYNDMDMMVIGFRDNSK 263
Query: 227 VGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
VG G+ E HF +W I +PLL+GC++ NL ++ ++ N+E+IA+NQ
Sbjct: 264 VGGKGLTPTEEEAHFGLWCIMSSPLLIGCNLENLPDSSLQLLTNKELIALNQ 315
>gi|288928562|ref|ZP_06422409.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331396|gb|EFC69980.1| alpha-galactosidase (Melibiase) (Alpha-D-galactoside
galactohydrolase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 486
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/401 (26%), Positives = 166/401 (41%), Gaps = 79/401 (19%)
Query: 19 LLSNGLGLTPP-MGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQ 77
++S+ L + P MGW SWN ++E I++ D + + + M D L Q
Sbjct: 98 IVSDKLAMPLPFMGWLSWNSVEGDVSEAIVKRVADMF---RANGLYQAGWNTVMMDDLWQ 154
Query: 78 IA-----AKPLPNQCQGNLVANKTTFPSGTKALADYVHI------LISNSLDN------- 119
KPLP+ FP+G + LADYVH L +++ D
Sbjct: 155 ARKRADDGKPLPD---------PKRFPNGLRNLADYVHGKGMKFGLYTDAADKTCAGAFG 205
Query: 120 -----RIESNIAFEQGIDYLKYDNCFNDDTRPT--IRYRAMSDALKKAGRPIFFSMCEWG 172
RI++ + +D +K D C + T +RYR + DA K RPI +CEWG
Sbjct: 206 SYGYERIDAEQYAQWNVDIVKCDYCNAPPEQDTAMVRYRRLGDAFKAVARPITLYICEWG 265
Query: 173 DMRPALWGSNIRNS-WRTTDDISDSW-------ARMLTIADMNEVYADHAKPGGWNDPDM 224
D +P LWG+ S WR + D+ D W + +I M + A +A +ND DM
Sbjct: 266 DRKPWLWGAESGGSCWRVSADVRDRWTCEPGGTGVVESIKAMKNI-AQYAGVNRFNDADM 324
Query: 225 LEV--------------GNGGMKYNEYVVHFSIWAISKAPLLLGCD--VGNLTAETMPII 268
L G GM +EY F++W + +P+ L D +T E + I+
Sbjct: 325 LCTGLHGKGKSSNDLCFGTPGMTQDEYATQFALWCMWSSPMALSFDPRANTVTKEDLKIL 384
Query: 269 GNEEVIAVNQAINDQQ-----------IWAGPLSGNRIVVLLESRKTFSSMMTAHWDDIG 317
N +IA+NQ QQ ++A L + + + + S T H+ I
Sbjct: 385 TNRHLIALNQDRMGQQADLISDADSLVMFAKDLENGDVALSVTNMSGKSLEATFHFTQIP 444
Query: 318 IKSNNTLFEHRTLET-----KFQGKLTAKVDGHSCKMYELT 353
N ++ L T + T KV H+ +++ L+
Sbjct: 445 ALDANKRYQCHDLWTGERLSAVKRSFTTKVRPHATRVFRLS 485
>gi|294867920|ref|XP_002765294.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Perkinsus marinus ATCC 50983]
gi|239865307|gb|EEQ98011.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
[Perkinsus marinus ATCC 50983]
Length = 894
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 155/384 (40%), Gaps = 77/384 (20%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L N L TPPMGW +W F C+ DC E+ D+ IS M D L +
Sbjct: 25 LDNDLARTPPMGWLAWERFGCET---------DC--ETHPDTCISEKLYTDMADELIRGG 73
Query: 80 AKPLPNQCQGNLVANKTTFPSGTKALADYVHILISNSLDNRIESNIA--FEQGIDYLKYD 137
+ + A+ GTK Y L+ E ++ E GID LK D
Sbjct: 74 YN---RGLKLGIYAD-----IGTKTCGGY------PGLEGHFEQDVKTFAEWGIDSLKVD 119
Query: 138 NCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIR------------- 184
C+ + + Y + L GRPI +S C W PA +
Sbjct: 120 GCYANTSTFGETYPGLGRLLNATGRPILYS-CSW----PAYLADHAEDKDVLVKEIAPAC 174
Query: 185 NSWRTTDDISDSWARMLTIADM------NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYV 238
N WR DDI DSWA + I + ++ A PG WNDPDM+ VGN G+ E
Sbjct: 175 NLWRNFDDIYDSWASIQGITNFWARRNSTDILIRAAGPGHWNDPDMIVVGNNGLSEVEQQ 234
Query: 239 VHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ-----------AINDQQIWA 287
F++WA+ APL L D+ + + I+ N+E+IA+NQ + N +IW
Sbjct: 235 SQFALWAMFAAPLYLTADLRTMPSWAREIVQNKEIIAINQDPLGKQGYVVWSENGARIWI 294
Query: 288 GPLSG-----NRIVVLLESRKTFSSMM------TAH---WDDIGIKSNNTLFEHRTLETK 333
L+G + VLLE+ + + T H W + S LF R +
Sbjct: 295 RELAGTDKSVDTWAVLLENSNSIFGLQRIALQPTRHIPRWAEGTQFSVRDLFRGRNIGL- 353
Query: 334 FQGKLTAKVDGHSCKMYELTPVTK 357
F + +A VD S + V K
Sbjct: 354 FVDEYSANVDTSSVHFVIIRKVRK 377
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 112/274 (40%), Gaps = 73/274 (26%)
Query: 51 DDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVH 110
DDCW RD++ G ++ + FP G K LADY+H
Sbjct: 552 DDCWSTLERDAKT-------------------------GEILPDPERFPHGIKWLADYMH 586
Query: 111 ILISNSLDNRIESNIA-------------FEQ--------GIDYLKYDNCFNDDTRPTIR 149
S L I ++I FEQ GID LK D C+ D +
Sbjct: 587 ---SRGLKLGIYADIGTKTCGGYPGLEGHFEQDVKTFAEWGIDSLKVDGCYADTSTFGET 643
Query: 150 YRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS-------------WRTTDDISDS 196
Y + L GRPI +S C W PA + N WR DI DS
Sbjct: 644 YPRLGRLLNATGRPILYS-CSW----PAYLADHAENQDVLVKEIAPACNLWRNFHDIRDS 698
Query: 197 WARMLTIADM------NEVYADHAKPGGWNDPDMLEVGNGGMKYNEYVVHFSIWAISKAP 250
WA + +I + + A PG WNDPDM+ VGN G+ E F++WA+ AP
Sbjct: 699 WASVQSITNFWARTSPTNILIRAAGPGHWNDPDMIVVGNNGLSEVEQQSQFALWAMFAAP 758
Query: 251 LLLGCDVGNLTAETMPIIGNEEVIAVNQAINDQQ 284
L + D+ + I+ N+E+IAVNQ + +Q
Sbjct: 759 LYMTADLRTFPSWARKIVQNKEIIAVNQDLLGKQ 792
>gi|238025407|ref|YP_002909639.1| Secreted alpha-galactosidase [Burkholderia glumae BGR1]
gi|237880072|gb|ACR32404.1| Secreted alpha-galactosidase [Burkholderia glumae BGR1]
Length = 707
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 152/358 (42%), Gaps = 68/358 (18%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
PPMGW+SWN +N I+ D G ++ ++QI + ++ +
Sbjct: 55 PPMGWSSWNSLEENVNYNTIKAEAD--GLAALNAQIKSGAKYEYVNTDEGWWTSGV-RDA 111
Query: 88 QGNLVANKTTFPSGTKALADYVHI------------------------LISNSLDNRIES 123
GN V + T +P G +A+ Y+H + + +
Sbjct: 112 DGNFVIDNTQWPGGMQAIVQYIHSKGLKAGIYIDAGPQGCGTRTDGSHFVGSDFAHYDHD 171
Query: 124 NIAFEQ-GIDYLKYDNCFNDDTR--PTIRYRAMSDALKKA----GRPIFFSMCEWGDMR- 175
+ F Q G D++K D C P Y A++ A++KA GR + S+C+WG +
Sbjct: 172 FLQFAQWGYDFVKVDFCGGSAAGYDPQQAYTAVASAIEKAYVQTGRRLTLSICDWGTIAG 231
Query: 176 ----------PALWGSNIRNSWRTTDDI------SDSWARMLTIADMNEVYADHAKPGGW 219
P W + + WRTT DI + S+ R+L + + G +
Sbjct: 232 NPAYPDYKQGPWAWAAGVGRMWRTTGDIYVPNTGAPSYDRVLG-NFFGNYHPESQHTGFY 290
Query: 220 NDPDMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGN-LTAETMPIIGNEEVIAVNQ 278
NDPDM+ G G M + H S+WAI+ APL+LG D+ LTA+ ++ N VIAV+Q
Sbjct: 291 NDPDMMVAGMG-MTAVQDQAHMSLWAIAGAPLILGNDLSKPLTADATNLMTNSGVIAVDQ 349
Query: 279 -----------AINDQQIWAGPL--SGNRIVVLLESRKTFSSMMTAHWDDIGIKSNNT 323
QQ+WA L SG R VVL T + MT W +G N++
Sbjct: 350 DPLGIQGVLVAQSGAQQVWAKLLAGSGQRAVVLFN-NGTSDAPMTVTWQQLGFAPNSS 406
>gi|383121664|ref|ZP_09942371.1| hypothetical protein BSIG_4991 [Bacteroides sp. 1_1_6]
gi|251837989|gb|EES66078.1| hypothetical protein BSIG_4991 [Bacteroides sp. 1_1_6]
Length = 545
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 127/289 (43%), Gaps = 48/289 (16%)
Query: 28 PPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIAAKPLPNQC 87
P MGW+SWN F I+E II+ D + +H + + N
Sbjct: 37 PIMGWSSWNAFRVDISEDIIKNQADLMVKKGLKD-----AGYHYINIDDGFFGERDGN-- 89
Query: 88 QGNLVANKTTFPSGTKALADYVHIL-----ISNSLDNRIESNI-------------AFEQ 129
G + NK FP+G K +AD++H L I N +I EQ
Sbjct: 90 -GKMQTNKNRFPNGMKPVADHIHSLGMKAGIYTDAGNNTCGSIWDNDHAGVGAGIYGHEQ 148
Query: 130 ----------GIDYLKYDNCFND--DTRPTIRYRAMSDALKKAGRPIFFSMCEWGDMRPA 177
G D++K D C D RY ++ +++ K + + ++C W P
Sbjct: 149 QDAQLYFGDWGFDFIKIDYCGGDVLGLNEQERYTSIRNSIDKVNKDVSVNICRWA--FPG 206
Query: 178 LWGSNIRNSWRTTDDISDSWARMLTIADMNEVYADHAKPGGWNDPDML--------EVGN 229
W ++ SWR + DI+ W + + N + +AK G +ND DM+ +VG
Sbjct: 207 TWAKDVATSWRISGDINAHWGSLRYVVGKNLYLSAYAKDGHYNDMDMMVIGFRDNSKVGG 266
Query: 230 GGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ 278
G+ E HF +W I +PLL+GC++ +L ++ ++ N+E+IA+NQ
Sbjct: 267 KGLTPTEEEAHFGLWCIMSSPLLIGCNLESLPESSLELLTNKELIALNQ 315
>gi|118787136|ref|XP_315871.3| AGAP005846-PA [Anopheles gambiae str. PEST]
gi|116126656|gb|EAA11949.3| AGAP005846-PA [Anopheles gambiae str. PEST]
Length = 428
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 131/319 (41%), Gaps = 54/319 (16%)
Query: 20 LSNGLGLTPPMGWNSWNHFHCQINEKIIRETDDCWGESSRDSQISLSCSFHMFDSLTQIA 79
L NGL PPMGW SW + C + + D+C E+ L S D+ +
Sbjct: 25 LDNGLAHRPPMGWMSWERYRCITD--CSKYPDECISEALFKRMADLMVSEGYRDAGYEYV 82
Query: 80 AKPLPNQC--------QGNLVANKTTFPSGTKALADYVH-----ILISNSLDN------- 119
+ C G L NK FP+G K LADY+H I +
Sbjct: 83 NI---DDCWMADERDEDGVLQPNKERFPNGIKHLADYIHERGLKFGIYQDIGTKTCAGYP 139
Query: 120 ------RIESNIAFEQGIDYLKYDNCFNDDTRPTIRYRAMSDALKKAGRPIFFSMCEW-- 171
++++ + G+D++K D C+ D Y + + + GRPI +S C W
Sbjct: 140 GMVGYFKLDAETFADWGVDFIKIDGCYADVELMVNDYIKFGEYMNRTGRPILYS-CSWPA 198
Query: 172 -----GDMRPALWGSNIRNSWRTTDDISDSWARMLTI----ADMNEVYADHAKPGGWNDP 222
G + N WR DI DS ++ I +D H PG WNDP
Sbjct: 199 YQEYDGIIPDYDQLKETCNMWRNWGDIEDSHQSVVEITKYFSDHQNRIVPHHGPGHWNDP 258
Query: 223 DMLEVGNGGMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ---- 278
D L +GN G+ Y++ ++W + APLL+ D+ + E ++ N+ +I VNQ
Sbjct: 259 DTLILGNYGLSYDQSKSQLAVWTVMGAPLLISNDLAKVRPEIKELLLNKAIIRVNQDPLG 318
Query: 279 -------AINDQQIWAGPL 290
+N +IW P+
Sbjct: 319 IQGRLVKTVNKIEIWKRPI 337
>gi|67901274|ref|XP_680893.1| hypothetical protein AN7624.2 [Aspergillus nidulans FGSC A4]
gi|40742620|gb|EAA61810.1| hypothetical protein AN7624.2 [Aspergillus nidulans FGSC A4]
gi|259483955|tpe|CBF79768.1| TPA: Putative alpha-galactosidase
[Source:UniProtKB/TrEMBL;Acc:Q1HFR2] [Aspergillus
nidulans FGSC A4]
Length = 469
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/411 (26%), Positives = 166/411 (40%), Gaps = 120/411 (29%)
Query: 19 LLSNGLGLTPPMGWNSWNHFHCQINEKIIRET---------DDCWGE-----SSRDSQIS 64
+ +G+G P +GW+SWN C IN +I D E + R+ S
Sbjct: 18 MRPDGVGRLPALGWSSWNAHECDINATVILTAAAQVVKLGLKDLGYEYINNHAFREDNTS 77
Query: 65 LSCSFHMFDSLTQIAAKPLPNQCQGNLVANKTTFPSGTKALADYVHIL-----ISNSLDN 119
SC S+ + P N+ ++ + FP G ++A +H L I +S
Sbjct: 78 NSCPVDDCWSI-KTHRDPTTNR----MIPDADRFPDGIASVASQIHELGLKVGIYSSAGE 132
Query: 120 R-------------IESNIAFEQGIDYLKYDNC-----------------------FNDD 143
I++ E IDYLKYD+C F +
Sbjct: 133 TTCAGYPASLGYEDIDAETFAEWEIDYLKYDDCGVPDNWKDPYTFCVPDTANNAGPFPNG 192
Query: 144 TRPTI-----------------RYRAMSDALKKAGRPIFFSMCEWGDMRPALWGSNIRNS 186
T P++ R+R M DAL R I +S+C WG+ WG+ I NS
Sbjct: 193 TCPSLPNPAPANYNWSTSPSAERFRRMLDALNTQDRTILYSLCNWGNAAVNTWGAEIGNS 252
Query: 187 WRTTDDISDS-------------WARMLTIA-DMNEVYADHAKPGGWNDPDMLEVGN--G 230
WR + DIS W R+ I +M+ + ++A+ GW D DMLEVGN G
Sbjct: 253 WRMSGDISPGRGEVGPDRTRIAVWERIAEITNEMSFLVREYAEFWGWPDADMLEVGNGEG 312
Query: 231 GMKYNEYVVHFSIWAISKAPLLLGCDVGNLTAETMPIIGNEEVIAVNQ------------ 278
GM E HF++WA ++PLL+G + + E + I+ N ++ +Q
Sbjct: 313 GMTVAENRAHFALWAAMRSPLLIGTKLDTIRQEHLKILKNPTLLTFHQDPIINRPAYPYK 372
Query: 279 ----------AINDQQIWAGP---LSGNRIVVLLESRKTFSSMMTAHWDDI 316
A + + W+GP L G +VV+L S S+ M A W ++
Sbjct: 373 WGYNADYTFDAAHPAEYWSGPSPALEGT-LVVMLNSENVTSTRM-AVWSEV 421
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.418
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,879,086,376
Number of Sequences: 23463169
Number of extensions: 244742044
Number of successful extensions: 486029
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1308
Number of HSP's successfully gapped in prelim test: 138
Number of HSP's that attempted gapping in prelim test: 479167
Number of HSP's gapped (non-prelim): 2286
length of query: 358
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 215
effective length of database: 9,003,962,200
effective search space: 1935851873000
effective search space used: 1935851873000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)