BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10434
(593 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
Length = 534
Score = 359 bits (921), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 40 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 99 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP LKQ S EL HI+DWLPTL AG N T LDG N W +++G + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389
Query: 379 SEILHNIDN 387
E+LHNID
Sbjct: 390 VELLHNIDQ 398
Score = 41.6 bits (96), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533
>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
Length = 528
Score = 347 bits (891), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 179/366 (48%), Positives = 241/366 (65%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS L+
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH+G+QH +I+ +P +PL EKLLPQ LK+AGYATH +GKWHLG +R+ PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+IA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AL+ G+ N++++F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 325
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ S EL HI+DWLPTL AG + T LDG + W+ +++G+ + R E
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 385
Query: 381 ILHNID 386
+L NID
Sbjct: 386 LLLNID 391
Score = 40.8 bits (94), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 469 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 527
>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
Length = 535
Score = 342 bits (878), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 178/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDVSFHGS+ I TP++D LA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ADRPPHLVFVLADDLGWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A + LD R QV +YST+++TE A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 TALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A G LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTKGTKPLDGFDVWKTISEGSPSPRKE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
Score = 35.0 bits (79), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 1/59 (1%)
Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
F+I DP E+++L+ +++QL +L Y VP D R DP + W PW
Sbjct: 477 FDIDQDPEERHDLSRDYPHIVEQLLSRLQFYHKHSVPVHFPAQDPRCDP-KGTGAWGPW 534
>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
Length = 533
Score = 335 bits (860), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
Score = 35.4 bits (80), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++L+ ++ +L +L Y VP D R DP + +W PW
Sbjct: 474 LFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCDP-KATGVWGPW 532
>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
Length = 573
Score = 331 bits (849), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 202/554 (36%), Positives = 289/554 (52%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 44 PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 220
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A HN P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 221 ASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHN
Sbjct: 332 PLLKKKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 392 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L S G +N E++ +R+
Sbjct: 438 PPQTLASFPGSWWNL-----------------------------ERMASIRQAV------ 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +LA Y T +P +
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANP 509
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 510 RAHPDFNGGAWGPW 523
>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
Length = 599
Score = 328 bits (842), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
Length = 573
Score = 327 bits (837), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 200/550 (36%), Positives = 288/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
KP+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 KKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L S G +N E++ +R+
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASIRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA+ + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
Length = 569
Score = 325 bits (834), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
Length = 573
Score = 321 bits (822), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 49 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+ +
Sbjct: 226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 281 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 337 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 397 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +L Y T +P + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 514
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 515 DFNGGAWGPW 524
>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
Length = 598
Score = 319 bits (818), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2 SV=2
Length = 520
Score = 179 bits (455), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 25 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 85 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ K Y+ +M ++ KT T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG++ +D +DG++ + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372
Query: 373 GAKTKR 378
G R
Sbjct: 373 GQMMDR 378
>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1 SV=1
Length = 522
Score = 177 bits (450), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A GL+ Y LC+PSR+
Sbjct: 25 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 85 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ +D ++DG+N L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390
>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns PE=1 SV=1
Length = 524
Score = 177 bits (448), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
L AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+
Sbjct: 22 GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81
Query: 79 PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
PSR+AL+TG+ PI G ++ + G+P +E LLP+ LK+AGY +GK
Sbjct: 82 PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
WHLG R + P GFD +G YD+ K Y+ +M ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200
Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
T T LY + A++ I +H + P FLY A A HA P +FL S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
R Y V +D+SVG +++ L+ G+ +N+ V F +DNGA GSN P
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
K T ++GGMR A W P +VS +L I D T + AG++ +D +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369
Query: 366 QWDVLTKGAKTKR 378
+ +G R
Sbjct: 370 LLPTMLQGHIIDR 382
>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
Length = 522
Score = 175 bits (444), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 184/379 (48%), Gaps = 27/379 (7%)
Query: 15 LLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
L+ + A L T AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y
Sbjct: 14 LVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYA 73
Query: 74 -QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
LC+PSR+AL+TG+ PI G ++ + G+P E LLP+ LK AGYA
Sbjct: 74 ANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYA 133
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHN 181
+ +GKWHLG R + P GFD +G YD+ + Y+ +M
Sbjct: 134 SKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEE 192
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
++ KT T +Y + A++ I + P FLY A A HA P
Sbjct: 193 FPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA---------PVYASR 243
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
FL S +R Y V +D+SVG ++ LR + N+ V F +DNGA G
Sbjct: 244 AFLGTS--QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQGG 301
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDT 359
SN P K T ++GGMR A W P +VS +L + D T + AG+E +D
Sbjct: 302 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPSDR 361
Query: 360 SLDGVNQWDVLTKGAKTKR 378
++DG++ + +G T+R
Sbjct: 362 AIDGLDLLPAMLQGRLTER 380
>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2 SV=1
Length = 522
Score = 171 bits (433), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 26/376 (6%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 27 PQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 86
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P E +LP+ LKEAGY + +GKWHLG R
Sbjct: 87 TGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-R 145
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ + Y+ +M ++ KT T
Sbjct: 146 PQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLT 205
Query: 195 DLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
+Y + A++ I + +P FLY A A HA P FL S +R Y
Sbjct: 206 QVYLQEALDFIKRQQAAQRPFFLYWAIDATHA---------PVYASRPFLGTS--QRGRY 254
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D SVG +++ L+ + EN+ V F +DNGA + GSN P K T +
Sbjct: 255 GDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQGGSNGPFLCGKQTTF 314
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG+ +D +DG++ +
Sbjct: 315 EGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAMLG 374
Query: 373 GAKTKRSEILHNIDNV 388
G T R + D +
Sbjct: 375 GQLTDRPIFYYRGDTL 390
>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
Length = 506
Score = 160 bits (405), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 132/390 (33%), Positives = 189/390 (48%), Gaps = 33/390 (8%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
AL L A L+T +P P+I++I ADDLG+ D+ +G TPN+D LA GL
Sbjct: 1 MALGTLFLALAAGLSTASP--PNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRF 58
Query: 69 NQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
YV +LCTPSR+AL+TG+ P+ GM GV+ GLPL E L + L GY T
Sbjct: 59 TDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTG 118
Query: 128 AIGKWHLGFFRE-VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL- 176
GKWHLG E + P +GF SH G Q L + D CK + QGL
Sbjct: 119 MAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCD--QGLV 176
Query: 177 --DMRHNMQVDNKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPF 232
+ N+ V+ + + + Y + +++A+ + +P FLY A H Y F
Sbjct: 177 PIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH----YPQF 232
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
F S R + + LD +VG ++ + G+LE ++V+F ADNG P
Sbjct: 233 SG-----QSFTKRSG--RGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNG-PE 284
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
SN G + LR K T ++GG+R A ++ P T V+ EL D LPTL A
Sbjct: 285 LMRMSNGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALT 343
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
G + + +LDGV+ +L K+ R +
Sbjct: 344 GAPLPNVTLDGVDISPLLLGTGKSPRKSVF 373
>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
Length = 507
Score = 158 bits (400), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E + + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 199 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+ G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
K+ R + D + + A+R K T + SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 407
>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
Length = 551
Score = 152 bits (383), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 130/425 (30%), Positives = 200/425 (47%), Gaps = 66/425 (15%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGLILNQHYVQAL 76
A L KKP++++ L DD+GW DV F+G PTP+IDA+A GLIL Y Q
Sbjct: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILE---GEPWGLP-LTEKLLPQYLKEAGYATHAIGKW 132
+P+R+ ++TG+Y IH HG+++ G+P GL LT LPQ L + GY T AIGKW
Sbjct: 136 SSPTRATILTGQYSIH----HGILMPPMYGQPGGLQGLTT--LPQLLHDQGYVTQAIGKW 189
Query: 133 HLGFFRE-----VYTPTFRGFDSHYGYWQGLQDYY---------DHSCKATFEPYQGLDM 178
H+G +E V FRGF+S + +D + D S P+ D+
Sbjct: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249
Query: 179 RHNMQVDNKTIG----IYSTDL---YTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYE 230
+ + I Y DL + + + + + KS KP FLY H N
Sbjct: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY-- 307
Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
P+ + A S P R +Y + +++ N+ L K+G L+N++++F +DNG
Sbjct: 308 ----PNAKYAG----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG- 358
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
P + + P RG K + W+GG+R ++ + Q +K S + ++D PT
Sbjct: 359 PEAEVPPH--GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALD 415
Query: 351 AAG--------IEINDTSLDGVNQWDVL--TKGAKTKRSEILHNIDNVDNPQKYYAALRV 400
AG + T +DGV+Q T G +++E H N AA+R+
Sbjct: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE--HYFLN-----GKLAAVRM 468
Query: 401 DDLKY 405
D+ KY
Sbjct: 469 DEFKY 473
>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
Length = 507
Score = 150 bits (380), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 177/377 (46%), Gaps = 43/377 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 19 SPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +G+ GV+ GLPL E L + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGI---- 191
GF G Y + +C P +G + + N+ V+ + +
Sbjct: 139 HGFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLANLSVEAQPPWLPGLE 198
Query: 192 -----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
++ DL T+A ++ +P FLY A H Y F F S
Sbjct: 199 ARYVAFARDLMTDA-------QHQGRPFFLYYASHHTH----YPQFSG-----QSFPGHS 242
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
R + + LD +VG ++ A+ G+L ++V F ADNG + + S+ G + LR
Sbjct: 243 G--RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRM-SHGGCSGLLR 299
Query: 307 GMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
K T ++GG+R A A W + V+ EL D LPTL A AG ++ + +LDGV+
Sbjct: 300 CGKGTTFEGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAQLPNITLDGVD 357
Query: 366 QWDVLTKGAKTKRSEIL 382
+L K+ R +
Sbjct: 358 LSPLLLGTGKSPRHTLF 374
>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
Length = 536
Score = 147 bits (370), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 183/425 (43%), Gaps = 105/425 (24%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+ ++I+ADDLG++D+ G +I TPN+DALA GL L + + C+P+RS L+TG
Sbjct: 3 KRPNFLVIVADDLGFSDIGAFGG-EIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTG 61
Query: 88 K--YPIHIGMQHGVI---LEGEP-WGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFRE 139
+ IG + LEG+P + L E++ LP+ L+EAGY T GKWHLG E
Sbjct: 62 TDHHIAGIGTMAEALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGLKPE 121
Query: 140 VYTPTFRGFDSHYGYWQGLQDY------YDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
TP RGF+ + G ++ YD S + L + +D G YS
Sbjct: 122 -QTPHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPEGFYS 180
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD--------- 244
+D + + + + E ++S+P F YL A H P QAP E V K+
Sbjct: 181 SDAFGDKLLQYLKERDQSRPFFAYLPFSAPHW-----PLQAPREIVEKYRGRYDAGPEAL 235
Query: 245 --------------------------------ISDPER-------RTYAGMVSRLDESVG 265
+ D ER YA MV R+D ++G
Sbjct: 236 RQERLARLKELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIG 295
Query: 266 NVIAALRKHGMLENSIVLFMADNGA--------PSFGI------------------HSN- 298
V+ LR+ G L+N+ VLFM+DNGA P FG +N
Sbjct: 296 RVVDYLRRQGELDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANS 355
Query: 299 ---------KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
+ + P R K+ GG+R A + P L + +S + D PTL
Sbjct: 356 YVWYGPRWAQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLL 415
Query: 350 AAAGI 354
AG+
Sbjct: 416 DLAGV 420
>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
Length = 535
Score = 136 bits (343), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 188/422 (44%), Gaps = 55/422 (13%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSA 83
T +KP+ +IILADD+GW D+ + + T N+D +A G+ ++ H + C+PSR++
Sbjct: 31 TRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRAS 90
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+ + G+ H + GLPL E L + L++AGY T IGKWHLG Y P
Sbjct: 91 LLTGRLGLRNGVTHNFAVT-SVGGLPLNETTLAEVLQQAGYVTGMIGKWHLG-HHGPYHP 148
Query: 144 TFRGFDSHYG----YWQGLQDY--YDH----SCKATFEPYQGLD----------MRHNMQ 183
FRGFD ++G + G D Y+H +C P + L+ + N+
Sbjct: 149 NFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLN 208
Query: 184 VDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLY--LAHLAVHAGNTYEPFQAPDEEV 239
+ + + + S Y E AI I + S +P LY LAH+ V T ++
Sbjct: 209 IVEQPVNLSSLAHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRT---------QL 259
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
+ L RR Y + +D VG + + + EN+ + F DNG P
Sbjct: 260 SAVLR----GRRPYGAGLREMDSLVGQIKDKVDRTAK-ENTFLWFTGDNG-PWAQKCELA 313
Query: 300 GSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
GS P G+ K T W+GG R A + P S+ L + D PT+
Sbjct: 314 GSVGPFTGLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVV 373
Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK--YV 406
A AG + D DG++ +VL ++T + H +R+ K YV
Sbjct: 374 ALAGASLPQDRHFDGLDASEVLFGWSQTGHRVLFHPNSGAAGEFGALQTVRLGSYKAFYV 433
Query: 407 AG 408
+G
Sbjct: 434 SG 435
>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
Length = 526
Score = 134 bits (337), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 119/422 (28%), Positives = 189/422 (44%), Gaps = 54/422 (12%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRS 82
T AP+ P+I+IILADD+GW D+ + + T N+D +A G+ ++ H + C+PSR+
Sbjct: 31 TRAPR-PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRA 89
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+L+TG+ + G+ H + GLPL E L + L++AGY T IGKWHLG Y
Sbjct: 90 SLLTGRLGLRNGVTHNFAVTSV-GGLPLNETTLAEVLQQAGYVTAMIGKWHLGHHGS-YH 147
Query: 143 PTFRGFDSHYGYW----QGLQD---YYDHSCKATFE-------PYQ------GLDMRHNM 182
P+FRGFD ++G G D Y C A + P + L + N+
Sbjct: 148 PSFRGFDYYFGIPYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENL 207
Query: 183 QVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ + + + Y E A+ I + + S +P LY+ +H + P
Sbjct: 208 NIVEQPVNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--------- 258
Query: 241 KFLDISDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
+++P+ +R Y + +D VG + + H EN+++ F DNG P
Sbjct: 259 ---PLANPQSQRLYRASLQEMDSLVGQIKDKV-DHVAKENTLLWFAGDNG-PWAQKCELA 313
Query: 300 GSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
GS P G+ K T W+GG R A + P S+ L + D PT+
Sbjct: 314 GSMGPFSGLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVI 373
Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK--YV 406
A AG + + DGV+ +VL ++T + H +R+D K Y+
Sbjct: 374 ALAGASLPPNRKFDGVDVSEVLFGKSQTGHRVLFHPNSGAAGEYGALQTVRLDRYKAFYI 433
Query: 407 AG 408
G
Sbjct: 434 TG 435
>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
Length = 464
Score = 131 bits (330), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 109/418 (26%), Positives = 184/418 (44%), Gaps = 94/418 (22%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P++I+I+ADD+G++D+S G +IPTPN+ A+A G+ ++Q+Y + P+RS L+TG
Sbjct: 24 ERPNVIVIIADDMGYSDISPFGG-EIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTG 82
Query: 88 KYPIHIGMQ----HGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVY 141
GM + + E + L LT+++ + + K+AGY T GKWHLGF
Sbjct: 83 NSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA- 141
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
TP RGF+ + + G +++ + T E + R +V YS++ Y
Sbjct: 142 TPKDRGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGERVSLPD-DFYSSEAYARQ 200
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------------------ 242
+ I K +P+F +LA A H +P QAPDE + +F
Sbjct: 201 MNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIAR 255
Query: 243 ---------------LDISD------PER--------RTYAGMVSRLDESVGNVIAALRK 273
L++ PE+ + YA M++ +D +G ++ L++
Sbjct: 256 LKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQ 315
Query: 274 HGMLENSIVLFMADNGA-------------------------------PSFGIHSNKGSN 302
G +N++++F+ DNGA S+G H SN
Sbjct: 316 TGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSN 375
Query: 303 HPLRGM-KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
P K+T GG+ I P + + K+ + + D PTL AGI+ N +
Sbjct: 376 APYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNKS 433
>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
Length = 577
Score = 130 bits (328), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/418 (26%), Positives = 185/418 (44%), Gaps = 94/418 (22%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P++I+I+ADD+G++D+S G +IPTPN+ A+A G+ ++Q+Y + P+RS L+TG
Sbjct: 24 ERPNVIVIIADDMGYSDISPFGG-EIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTG 82
Query: 88 KYPIHIGMQ----HGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVY 141
GM + + E + L LT+++ + + K+AGY T GKWHLGF
Sbjct: 83 NSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA- 141
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
TP RGF+ + + G +++ + T E + R +V + YS++ Y
Sbjct: 142 TPKERGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGERV-SLPDDFYSSEAYARQ 200
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------------------ 242
+ I K +P+F +LA A H +P QAPDE + +F
Sbjct: 201 MNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIAR 255
Query: 243 ---------------LDISD------PER--------RTYAGMVSRLDESVGNVIAALRK 273
L++ PE+ + YA M++ +D +G ++ L++
Sbjct: 256 LKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQ 315
Query: 274 HGMLENSIVLFMADNGA-------------------------------PSFGIHSNKGSN 302
G +N++++F+ DNGA S+G H SN
Sbjct: 316 TGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSN 375
Query: 303 HPLRGM-KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
P K+T GG+ I P + + K+ + + D PTL AGI+ N +
Sbjct: 376 APYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNKS 433
>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
Length = 554
Score = 126 bits (317), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 119/434 (27%), Positives = 178/434 (41%), Gaps = 107/434 (24%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
AF KKP+ ++I+ADDLGW+DVS G S+I TPNI+ LA G+ L + + C+P
Sbjct: 2 AFNKQAESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSP 60
Query: 80 SRSALMTGKYPIHIG----MQHGVILEGEPWGLP------LTEKL--LPQYLKEAGYATH 127
+RS L++G HI M V + WG L +++ LP+ L+EAGY T
Sbjct: 61 TRSMLLSGT-DNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTT 119
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQ 183
GKWHLG + Y P+ RGF + G +++ + P L ++
Sbjct: 120 MSGKWHLGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDP 178
Query: 184 VDNKTI-GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
VD+K++ YS++ + E I+ + KS+ F YL A H P Q+P E + K+
Sbjct: 179 VDHKSLKNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTAPHW-----PLQSPKEYINKY 233
Query: 243 L-------------------------------------------------DISDPERRTY 253
+ S Y
Sbjct: 234 RGRYSEGPDVLRKNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVY 293
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS--------------------- 292
A MV LD ++G VI L+ G L+N+ V+FM+DNGA
Sbjct: 294 AAMVELLDLNIGRVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNS 353
Query: 293 ------------FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+G + + P R K +GG+R A I P L + +S E
Sbjct: 354 LENLGNYNSFIWYGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVT 413
Query: 341 ISDWLPTLCAAAGI 354
+ D LPT+ A +
Sbjct: 414 VMDILPTILELAEV 427
>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
Length = 525
Score = 126 bits (316), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 176/397 (44%), Gaps = 56/397 (14%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRS 82
T AP+ P+I+IILADD+GW D+ + + T N+D +A G+ ++ H + C+PSR+
Sbjct: 31 TRAPQ-PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRA 89
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+L+TG+ + G+ H + GLP+ E L + L++ GY T IGKWHLG Y
Sbjct: 90 SLLTGRLGLRNGVTHNFAVTSV-GGLPVNETTLAEVLRQEGYVTAMIGKWHLGHHGS-YH 147
Query: 143 PTFRGFDSHYGYW----QGLQD---YYDHSCKATFE-------PYQ------GLDMRHNM 182
P FRGFD ++G G D Y C A + P + L + N+
Sbjct: 148 PNFRGFDYYFGIPYSNDMGCTDAPGYNYPPCPACPQRDGLWRNPGRDCYTDVALPLYENL 207
Query: 183 QVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ + + + Y E A+ I + + S +P LY+ +H + P
Sbjct: 208 NIVEQPVNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--------- 258
Query: 241 KFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG---------- 289
++ P+R++ Y + +D VG + + H EN+++ F DNG
Sbjct: 259 ---PLAHPQRQSLYRASLREMDSLVGQIKDKV-DHVARENTLLWFTGDNGPWAQKCELAG 314
Query: 290 --APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
P FG+ P K T W+GG R A + P S+ L + D PT
Sbjct: 315 SVGPFFGLWQTHQGGSP---TKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPT 371
Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
+ A AG + + DG + +VL ++ + H
Sbjct: 372 VIALAGASLPPNRKFDGRDVSEVLFGKSQMGHRVLFH 408
>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
Length = 551
Score = 124 bits (311), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 171/372 (45%), Gaps = 36/372 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
KP++++++AD +G D++ +G ID +A GL YV A+CTPSRSA+MTG
Sbjct: 51 KPNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSAIMTG 110
Query: 88 KYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT--- 142
+ P+ IG + V L GLP +E + + +KEAGYAT +GKWHLG T
Sbjct: 111 RLPVRIGTFGETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLGINENSSTDGA 170
Query: 143 --PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--------GIY 192
P GFD G+ + + SC T D + N T+ G+
Sbjct: 171 HLPFNHGFD-FVGHNLPFTNSW--SCDDTGLHKDFPDSQRCYLYVNATLVSQPYQHKGL- 226
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T L+T+ A+ I E N + P FLY+A +H F + D R
Sbjct: 227 -TQLFTDDALGFI-EDNHADPFFLYVAFAHMHT----SLFSSDDFSCTS-------RRGR 273
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y + + ++V ++ L ++ + EN+I+ F++D+G P G RG KS
Sbjct: 274 YGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDHG-PHREYCEEGGDASIFRGGKSHS 332
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT 371
W+GG R ++ P + +S+E+ D + T G + D DG + DVL
Sbjct: 333 WEGGHRIPYIVYWPG-TISPGISNEIVTSMDIIATAADLGGTTLPTDRIYDGKSIKDVLL 391
Query: 372 KGAKTKRSEILH 383
+G+ + S +
Sbjct: 392 EGSASPHSSFFY 403
>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
Length = 525
Score = 124 bits (311), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/398 (29%), Positives = 175/398 (43%), Gaps = 59/398 (14%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSA 83
T +KP+ +IILADD+GW D+ + + T N+D +A G+ ++ H + C+PSR++
Sbjct: 31 TRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRAS 90
Query: 84 LMTGKYPIHIGMQHGVILE---GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
L+TG+ +G+++GV GLPL E L + L++AGY T IGKWHLG
Sbjct: 91 LLTGR----LGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGS- 145
Query: 141 YTPTFRGFDSHYG----YWQGLQDY--YDH----SCKATFEPYQGLD----------MRH 180
Y P FRGFD ++G + G D Y+H +C P + L +
Sbjct: 146 YHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYE 205
Query: 181 NMQVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLY--LAHLAVHAGNTYEPFQAPD 236
N+ + + + + S Y E A I + S +P LY LAH+ V T P AP
Sbjct: 206 NLNIVEQPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLP-AAPR 264
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
R Y + +D VG + + H + EN+ + F DNG P
Sbjct: 265 ------------GRSLYGAGLWEMDSLVGQIKDKV-DHTVKENTFLWFTGDNG-PWAQKC 310
Query: 297 SNKGSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
GS P G K T W+GG R A + P S+ L + D P
Sbjct: 311 ELAGSVGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFP 370
Query: 347 TLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
T+ A A + DGV+ +VL ++ + H
Sbjct: 371 TVVALAQASLPQGRRFDGVDVSEVLFGRSQPGHRVLFH 408
>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
Length = 567
Score = 124 bits (310), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 114/402 (28%), Positives = 179/402 (44%), Gaps = 42/402 (10%)
Query: 2 TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
T R+Y L + TA KP++I++LADD+G D+S +G ID +
Sbjct: 39 TATRRYGDGEDLLHLLGQTGQHRTAMTKPNVILLLADDMGVGDLSVYGHPTQEPGFIDQM 98
Query: 62 AYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGM--QHGVILEGEPWGLPLTEKLLPQY 118
A GL Q Y ++CTPSRSA++TG+ PI G+ + + L GLPL E + +
Sbjct: 99 ANQGLRFTQGYSGDSVCTPSRSAIVTGRQPIRTGVYGEERIFLPWTTTGLPLYEVTIAEA 158
Query: 119 LKEAGYATHAIGKWHLGFFRE-----VYTPTFRGFD--SH---YG-YWQ----GL-QDYY 162
+K AGY T +GKWHLG + P RGFD H +G W+ GL QD+
Sbjct: 159 MKGAGYTTGMVGKWHLGINENSSSDGAHLPANRGFDFVGHNLPFGNSWRCDDTGLHQDFP 218
Query: 163 DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLA 222
D A F Y + Q +K + T L + + I E N +KP F+Y++
Sbjct: 219 D--TNACFLYYNSTSVAQPFQ--HKGL----TQLLRDDTVGFI-EDNVNKPFFMYVSFAH 269
Query: 223 VHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
+H F + D R Y + +D+++ ++ L + + +N+++
Sbjct: 270 MHT----SLFSSDDFSCTS-------RRGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVI 318
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
F +D+G P G + RG K W+GG R ++ P + VS E+
Sbjct: 319 FFTSDHG-PHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPG-TISPGVSHEIVTSM 376
Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D + T G ++ D DG VL +GA + + +
Sbjct: 377 DIIATAVNLGGSQLPTDRIYDGKCLKSVLLEGASSPHDDFFY 418
>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
Length = 583
Score = 123 bits (309), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 119/432 (27%), Positives = 177/432 (40%), Gaps = 77/432 (17%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A +P+II+++ADDLG D +G+ I TPNID LA G+ L QH + LCTPSR+A
Sbjct: 23 AASRPNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAF 82
Query: 85 MTGKYPIHIGM----QHGVIL-EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
MTG+YP+ GM + GV L GLP E + LK+ GY+T IGKWHLG
Sbjct: 83 MTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCH 142
Query: 140 VYT-----PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
T P GF+ YG L + D CK + + + + +G+
Sbjct: 143 SKTDFCHHPLHHGFNYFYGI--SLTNLRD--CKPGEGSVFTTGFKRLVFLPLQIVGV--- 195
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLA------------------VHAGNTYEPFQAPD 236
L T AA+N + + +F L LA YE Q P
Sbjct: 196 TLLTLAALNCLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPM 255
Query: 237 E----------EVAKFLD--------------------------ISDPERRTYAGMVSRL 260
E A+F+ + Y V +
Sbjct: 256 SYDNLTQRLTVEAAQFIQRNTETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEM 315
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS----NKGSNHPLRGMKSTPWDGG 316
D SVG ++ L + + ++++ F +D GA + S + GSN +G K+ W+GG
Sbjct: 316 DWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGG 375
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAK 375
+R + P + Q + E D PT+ AG + D +DG + +L ++
Sbjct: 376 IRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAGAPLPEDRIIDGRDLMPLLEGKSQ 435
Query: 376 TKRSEILHNIDN 387
E L + N
Sbjct: 436 RSDHEFLFHYCN 447
>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
GN=ydeN PE=3 SV=2
Length = 560
Score = 121 bits (303), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 163/389 (41%), Gaps = 77/389 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSS--------------------------QIPTPNIDALA 62
KP+II++ DDLG+ + F S Q TP + +L
Sbjct: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116
Query: 63 YNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
G+ YV + PSR+A+MTG+ P G+ + G+PLTE LP+ +
Sbjct: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQN 173
Query: 122 AGYATHAIGKWHLG----------------------FFREVYTPTFRGFDSHYGYWQGLQ 159
GY T A+GKWHL F E + P RGFD G+
Sbjct: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233
Query: 160 DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYL 218
YY+ ++ +V K Y +D T+ AI V+ +P LYL
Sbjct: 234 AYYNSPSL----------FKNRERVPAKG---YISDQLTDEAIGVVDRAKTLDQPFMLYL 280
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
A+ A H N APD+ +F S YA + S +D+ V ++ L+K+G +
Sbjct: 281 AYNAPHLPNDNP---APDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYD 336
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-SE 337
N+I+LF +DNGA G G+ +G KS + GG +W W + Q + +
Sbjct: 337 NTIILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMW--WKGKLQPGNYDK 391
Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
L D+ PT AA I I D LDGV+
Sbjct: 392 LISAMDFYPTALDAADISIPKDLKLDGVS 420
>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
Length = 589
Score = 109 bits (272), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 61/137 (44%), Positives = 83/137 (60%), Gaps = 11/137 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+P+I++++ADDLG D+ +G++ + TPNID LA +G+ L QH A LCTPSR+A +T
Sbjct: 36 SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 95
Query: 87 GKYPIHIGMQHGV---ILE--GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-- 139
G+YP+ GM + +L+ G GLP E + LKE GYAT IGKWHLG E
Sbjct: 96 GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 155
Query: 140 ---VYTPTFRGFDSHYG 153
+ P GFD YG
Sbjct: 156 SDHCHHPLHHGFDHFYG 172
>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
Length = 588
Score = 108 bits (270), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/137 (44%), Positives = 83/137 (60%), Gaps = 11/137 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+P+I++++ADDLG D+ +G++ + TPNID LA +G+ L QH A LCTPSR+A +T
Sbjct: 36 SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 95
Query: 87 GKYPIHIGMQHGV---ILE--GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-- 139
G+YP+ GM + +L+ G GLP E + LKE GYAT IGKWHLG E
Sbjct: 96 GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 155
Query: 140 ---VYTPTFRGFDSHYG 153
+ P GFD YG
Sbjct: 156 SDHCHHPLHHGFDHFYG 172
>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
Length = 590
Score = 107 bits (267), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/157 (42%), Positives = 87/157 (55%), Gaps = 12/157 (7%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F +L C LL N + KP+I++I+ DDLG D+ +G+ + TP+ID LA G+
Sbjct: 9 FMSLVCALL-NTCQAHRVHDDKPNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVR 67
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGM----QHGVILE-GEPWGLPLTEKLLPQYLKE 121
L QH A LC+PSRSA +TG+YPI GM VI P GLPL E L LK+
Sbjct: 68 LTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGNRRVIQNLAVPAGLPLNETTLAALLKK 127
Query: 122 AGYATHAIGKWHLGF-----FREVYTPTFRGFDSHYG 153
GY+T IGKWH G + + P GFD +YG
Sbjct: 128 QGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGFDYYYG 164
Score = 32.7 bits (73), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 12/95 (12%)
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+ + AI+ + H+K L+ + L VH P D+ F S + Y
Sbjct: 266 IMVKEAISFLERHSKET-FLLFFSFLHVHT-----PLPTTDD----FTGTS--KHGLYGD 313
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
V +D VG ++ A+ G+ N++V F +D+G
Sbjct: 314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGG 348
>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
Length = 562
Score = 105 bits (263), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 120/432 (27%), Positives = 174/432 (40%), Gaps = 72/432 (16%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
T +P+I++++ADDLG D+ +G++ + TPNID LA G+ L QH A +CTPSR+A
Sbjct: 2 TRNARPNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAA 61
Query: 84 LMTGKYPIHIGMQHGVILE------GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+YPI GM L G GLP E + L+ GY T IGKWHLG
Sbjct: 62 FLTGRYPIRSGMVSAYNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLS 121
Query: 138 -----REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
Y P GF YG GL C+A+ P +R + + + +
Sbjct: 122 CASRNDHCYHPLNHGFHYFYGVPFGLLS----DCQASKTPELHRWLRIKLWISTVALALV 177
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNT----------YEPFQAP-- 235
L + K +F LA L + G T +E Q P
Sbjct: 178 PFLLLIPKFARWFSVPWKVIFVFALLAFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMK 237
Query: 236 DEEVA-----------------------KFLDISDP--ERRTYAGM---------VSRLD 261
+E+VA FL + P ++ + G V +D
Sbjct: 238 EEKVASLMLKEALAFIERYKREPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMD 297
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
VG ++ AL + + +++V F +DNG P G G N +G K G
Sbjct: 298 WMVGKILDALDQERLANHTLVYFTSDNGGHLEPLDGAVQLGGWNGIYKGGKGMGGWEGGI 357
Query: 319 GVAAIWS-PWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKT 376
V I+ P + + +V +E + D PTL GI D +DG N +L A
Sbjct: 358 RVPGIFRWPSVLEAGRVINEPTSLMDIYPTLSYIGGGILSQDRVIDGQNLMPLLEGRASH 417
Query: 377 KRSEILHNIDNV 388
E L + V
Sbjct: 418 SDHEFLFHYCGV 429
>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
Length = 562
Score = 103 bits (258), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 84/160 (52%), Gaps = 16/160 (10%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
T +P+I++++ADDLG D+ +G++ + TPNID LA G+ L QH A +CTPSR+A
Sbjct: 2 TRNSRPNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAA 61
Query: 84 LMTGKYPIHIGM------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+YPI GM G+ G GLP E + L+ GY T IGKWH G
Sbjct: 62 FLTGRYPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLS 121
Query: 138 -----REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
Y P GFD YG GL C+A+ P
Sbjct: 122 CASRNDHCYHPLNHGFDYFYGLPFGLLS----DCQASRTP 157
>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
Length = 577
Score = 96.7 bits (239), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 77/145 (53%), Gaps = 12/145 (8%)
Query: 21 FLNTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
FL P P+ ++I+ADDLG D+ +G+ + TP+ID LA G+ L QH A LCT
Sbjct: 16 FLCAARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCT 75
Query: 79 PSRSALMTGKYPIHIGM-QHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
PSR+A +TG+YP+ GM HG + GLP E + LK GY T +GKWH
Sbjct: 76 PSRAAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWH 135
Query: 134 LGFFRE-----VYTPTFRGFDSHYG 153
LG + + P GFD G
Sbjct: 136 LGLSCQAASDFCHHPGRHGFDRFLG 160
Score = 57.0 bits (136), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 81/186 (43%), Gaps = 18/186 (9%)
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
N+ P L+L+ + VH + P E + L Y V +D +VG V+
Sbjct: 274 NRDTPFLLFLSFMHVHTAHFANP-----EFAGQSL------HGAYGDAVEEMDWAVGQVL 322
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSN----KGSNHPLRGMKSTPWDGGMRGVAAI- 323
A L K G+ N++V +D+GA + N GSN RG K+ W+GG+R +
Sbjct: 323 ATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGKANTWEGGIRVPGLVR 382
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
W + Q+V ++ D PT+ AG E+ D +DG + +L + E L
Sbjct: 383 WPGVIVPGQEVEEPTSNM-DVFPTVARLAGAELPTDRVIDGRDLMPLLLGHVQHSEHEFL 441
Query: 383 HNIDNV 388
+ N
Sbjct: 442 FHYCNA 447
>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
Length = 593
Score = 94.0 bits (232), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/136 (41%), Positives = 74/136 (54%), Gaps = 11/136 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I++I+ADDLG D+ +G++ + TPNID LA G+ L QH A LCTPSR+A +TG
Sbjct: 40 KPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTG 99
Query: 88 KYPIHIGMQHGVILEGEPW-----GLPLTEKLLPQYLKEAGYATHAIGKWHLGF-----F 137
++ GM W GLP E + L++ GYAT IGKWH G
Sbjct: 100 RHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRG 159
Query: 138 REVYTPTFRGFDSHYG 153
+ P GFD YG
Sbjct: 160 DHCHHPLNHGFDYFYG 175
Score = 33.9 bits (76), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 12/113 (10%)
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
MR++ + + + L + A++ I H K P L+L+ L VH P
Sbjct: 259 MRNHDVTEQPMVLEKTASLMLKEAVSYIERH-KHGPFLLFLSLLHVH---------IPLV 308
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
+ FL S + Y V +D +G V+ A+ +G+ ++ F +D+G
Sbjct: 309 TTSAFLGKS--QHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGG 359
>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
Length = 624
Score = 94.0 bits (232), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 73/133 (54%), Gaps = 11/133 (8%)
Query: 32 IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGKYP 90
++I+ADDLG D+ +G+ + TP++D LA G+ L QH A LCTPSR+A +TG+YP
Sbjct: 37 FLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYP 96
Query: 91 IHIGM-QHGVI----LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT--- 142
GM HG + GLP +E + + LK GYAT IGKWHLG T
Sbjct: 97 PRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFC 156
Query: 143 --PTFRGFDSHYG 153
P GFD G
Sbjct: 157 HHPLRHGFDRFLG 169
Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G + L EAA+ + N+++P L+L+ L VH + +P A + D
Sbjct: 266 GGLTRRLADEAALFL--RRNRARPFLLFLSFLHVHTAHFADPGFAGRSLHGAYGD----- 318
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA--PSFGIHSNK--GSNHPL 305
V +D VG V+AAL + G+ ++V F +D+GA G + GSN
Sbjct: 319 ------SVEEMDWGVGRVLAALDELGLARETLVYFTSDHGAHVEELGPRGERMGGSNGVF 372
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGV 364
RG K W+GG+R + P +V +E + D PT+ AG E+ D +DG
Sbjct: 373 RGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEPTSLMDVFPTVARLAGAELPGDRVIDGR 432
Query: 365 NQWDVLTKGAKTKRSEILHNIDNV 388
+ +L A+ E L + N
Sbjct: 433 DLMPLLRGDAQRSEHEFLFHYCNA 456
>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
GN=CPE0231 PE=3 SV=1
Length = 481
Score = 86.3 bits (212), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/411 (24%), Positives = 170/411 (41%), Gaps = 81/411 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I++I+ D + + + +G+ I TPN+D +A G Y C SR++++TG
Sbjct: 2 KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61
Query: 88 ---KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
K +G + GV W E + +AGY T IGK H+ + E
Sbjct: 62 MSQKSHGRVGYEDGV-----SWNY---ENTIASEFSKAGYHTQCIGKMHV--YPERNLCG 111
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM------RHNMQVDNKTIGI------- 191
F H GY L + KA+ + Q D + VD IG+
Sbjct: 112 FHNIMLHDGY---LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVS 168
Query: 192 ---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTYEPFQA 234
+ T+ +I+ + + SKP FL ++ + H+ + Y+
Sbjct: 169 RPWGYEENLHPTNWVVNESIDFLRRRDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDL 228
Query: 235 PDEEVAKFLDISDPERR---------------------TYAGMVSRLDESVGNVIAALRK 273
P+ + + + D E R Y G ++ +D +G + AL +
Sbjct: 229 PEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSE 288
Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP--WLK-Q 330
+G L N+I LF++D+G G ++ R K P++G R I+ P LK +
Sbjct: 289 YGKLNNTIFLFVSDHG-------DMMGDHNWFR--KGIPYEGSARVPFFIYDPGNLLKGK 339
Query: 331 TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
KV E+ + D +PTL A I I D S++G++ D++ + T R I
Sbjct: 340 KGKVFDEVLELRDIMPTLLDFAHISIPD-SVEGLSLKDLIEERNSTWRDYI 389
>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
8237 / Type A) GN=CPF_0221 PE=1 SV=1
Length = 481
Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 109/464 (23%), Positives = 191/464 (41%), Gaps = 92/464 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I++I+ D + + + +G+ I TPN+D +A G Y C SR++++TG
Sbjct: 2 KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61
Query: 88 ---KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
K +G + GV W E + +AGY T IGK H+ + E
Sbjct: 62 MSQKSHGRVGYEDGV-----SWNY---ENTIASEFSKAGYHTQCIGKMHV--YPERNLCG 111
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM------RHNMQVDNKTIGI------- 191
F H GY L + KA+ + Q D + VD IG+
Sbjct: 112 FHNIMLHDGY---LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVS 168
Query: 192 ---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTYEPFQA 234
+ T+ +I+ + + SKP FL ++ + H+ + Y+
Sbjct: 169 RPWGYEENLHPTNWVVNESIDFLRRKDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDL 228
Query: 235 PDEEVAKFLDISDPERR---------------------TYAGMVSRLDESVGNVIAALRK 273
P+ + + + D E R Y G ++ +D +G + AL +
Sbjct: 229 PEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSE 288
Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP--WLK-Q 330
+G L N+I LF++D+G G ++ R K P++G R I+ P LK +
Sbjct: 289 YGELNNTIFLFVSDHG-------DMMGDHNWFR--KGIPYEGSSRVPFFIYDPGNLLKGK 339
Query: 331 TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
KV E+ + D +PTL A I I D S++G++ +++ + T R + +H +
Sbjct: 340 KGKVFDEVLELRDIMPTLLDFAHISIPD-SVEGLSLKNLIEERNSTWR-DYIHGEHSFGE 397
Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
+Y + D + + + +E Y D +N+ PKE+
Sbjct: 398 DSNHYIVTKRDKFLWFS-----QRGEEQYFDLEND-----PKEL 431
>sp|Q8BFR4|GNS_MOUSE N-acetylglucosamine-6-sulfatase OS=Mus musculus GN=Gns PE=2 SV=1
Length = 544
Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 173/408 (42%), Gaps = 88/408 (21%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSA 83
A ++P+++++L DD D G + P AL G+ + YV ALC PSR++
Sbjct: 35 AARRPNVLLLLTDD---QDAELGGMT--PLKKTKALIGEKGMTFSSAYVPSALCCPSRAS 89
Query: 84 LMTGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLG 135
++TGKYP H V+ LEG + W P LK GY T GK
Sbjct: 90 ILTGKYP----HNHHVVNNTLEGNCSSKAWQKIQEPYTFPAILKSVCGYQTFFAGK---- 141
Query: 136 FFREVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNK 187
+ E P G + + YW L+ YY+++ G +H N VD
Sbjct: 142 YLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD-- 194
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE- 238
Y TD+ +++ + + S+P F+ ++ A H+ T P FQ AP +
Sbjct: 195 ----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKN 250
Query: 239 ----------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
+FLD D RR + ++S +D+ V ++ L G
Sbjct: 251 FNIHGTNKHWLIRQAKTPMTNSSIRFLD--DAFRRRWQTLLS-VDDLVEKLVKRLDSTGE 307
Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
L+N+ + + +DN G H+ + S L K ++ ++ + P +K Q S
Sbjct: 308 LDNTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSK 358
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
L D PT+ AG ++N T +DG++ +L KG + T RS++L
Sbjct: 359 MLVSNIDLGPTILDLAGYDLNKTQMDGMSLLPIL-KGDRNLTWRSDVL 405
>sp|P31447|YIDJ_ECOLI Uncharacterized sulfatase YidJ OS=Escherichia coli (strain K12)
GN=yidJ PE=3 SV=1
Length = 497
Score = 71.6 bits (174), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 141/366 (38%), Gaps = 63/366 (17%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
K+P+ + ++ D N V + + T NID+LA G+ N Y +CTP+R+ L T
Sbjct: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL---GFFREVYTP 143
G Y G + G+ + +Y K+AGY T IGKWHL +F P
Sbjct: 62 GIYANQSGPWTNNVAPGK------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
D YW +Y + ++ GL+ ++Q ++ + A+
Sbjct: 116 PEWDAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-------------SDPE 249
+ + + ++ FL + V + PF P E + K+ D + PE
Sbjct: 172 DFLQQPARADEPFL----MVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227
Query: 250 RRT--------------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
Y +D+ +G VI AL EN+ V++ +D
Sbjct: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSD-- 284
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
H H L + +D R I SP ++ Q V + + HI D LPT+
Sbjct: 285 ------HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHI-DLLPTMM 336
Query: 350 AAAGIE 355
A A IE
Sbjct: 337 ALADIE 342
>sp|Q1LZH9|GNS_BOVIN N-acetylglucosamine-6-sulfatase OS=Bos taurus GN=GNS PE=2 SV=1
Length = 560
Score = 67.0 bits (162), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 102/403 (25%), Positives = 172/403 (42%), Gaps = 82/403 (20%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
++P+++++LADD D G + P AL G+ + YV ALC PSR++++
Sbjct: 53 RRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 107
Query: 86 TGKYPIHIGMQHGVILEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFFREV 140
TGKYP ++ + + LEG + W P L+ GY T GK + E
Sbjct: 108 TGKYPHNLHVVNNT-LEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YLNEY 162
Query: 141 YTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTIGIY 192
P G + YW L+ YY+++ G +H N VD Y
Sbjct: 163 GAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD------Y 211
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE------ 238
TD+ +++ + + S+P F+ ++ A H+ T P FQ AP +
Sbjct: 212 LTDVLANVSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIHG 271
Query: 239 -----------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
+FLD + R+ + ++S +D+ V ++ L +G L N+
Sbjct: 272 TNKHWLIRQAKTPMTNSSIQFLD--NAFRKRWQTLLS-VDDLVEKLVKRLEFNGELNNTY 328
Query: 282 VLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
+ + +DN G H+ + S L K ++ ++ + P +K Q S L
Sbjct: 329 IFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKMLVAN 379
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
D PT+ AG +N T +DG++ +L KGA T RS++L
Sbjct: 380 IDLGPTILDIAGYSLNKTQMDGMSFLPIL-KGASNLTWRSDVL 421
>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
Length = 871
Score = 66.2 bits (160), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 147/367 (40%), Gaps = 73/367 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSRSALMTG 87
+P+II++L DD DV GS Q+ + + G +N +C PSRS+++TG
Sbjct: 42 RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTG 97
Query: 88 KYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
KY +H H V E W + YL GY T GK+ L + Y
Sbjct: 98 KY-VH---NHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P G+ G + + Y C+ + G D + Y TDL T +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200
Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
N + +P+ + ++H A H P F AP+ +
Sbjct: 201 NYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260
Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
+ + P +R+ ++S +D+SV + L + G LEN+ +++ AD+G
Sbjct: 261 MQYTGPMLPIHMEFTNILQRKRLQTLMS-VDDSVERLYNMLVETGELENTYIIYTADHGY 319
Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
FG+ ++G KS P+D +R I P ++ V + +I D PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTI 368
Query: 349 CAAAGIE 355
AG++
Sbjct: 369 LDIAGLD 375
>sp|P51688|SPHM_HUMAN N-sulphoglucosamine sulphohydrolase OS=Homo sapiens GN=SGSH PE=1
SV=1
Length = 502
Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 123/309 (39%), Gaps = 70/309 (22%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
C LL L + + +++LADD G+ +++ S+ I TP++DALA L+ +
Sbjct: 9 CALLL---VLGLCRARPRNALLLLADDGGFESGAYNNSA-IATPHLDALARRSLLFRNAF 64
Query: 73 VQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
+ C+PSR++L+TG P H +G+ + + + LP L +AG T IGK
Sbjct: 65 TSVSSCSPSRASLLTG-LPQHQNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVRTGIIGK 123
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
H+G T FD Y G +QV I
Sbjct: 124 KHVG------PETVYPFDFAYTEENG----------------------SVLQVGRNITRI 155
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLA----HLAVHAGNTYEPF------------QAP 235
+ + + +P FLY+A H H+ Y F + P
Sbjct: 156 -------KLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIP 208
Query: 236 DEE----------VAKFLDISDPERRTYAGM---VSRLDESVGNVIAALRKHGMLENSIV 282
D V F+ + R A V R+D+ VG V+ LR G+L +++V
Sbjct: 209 DWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLV 268
Query: 283 LFMADNGAP 291
+F +DNG P
Sbjct: 269 IFTSDNGIP 277
>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
Length = 870
Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 151/375 (40%), Gaps = 74/375 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P+II++L DD DV GS Q+ + G +V +C PSRS+++TG
Sbjct: 42 RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTG 97
Query: 88 KYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
KY +H H V E W + YL GY T GK+ L + Y
Sbjct: 98 KY-VH---NHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P G+ G + + Y C+ + G D + Y TDL T +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200
Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
N + +P+ + ++H A H P F AP+ +
Sbjct: 201 NYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260
Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
+ + P +R+ ++S +D+SV + L + G L+N+ +++ AD+G
Sbjct: 261 MQYTGPMLPIHMEFTNVLQRKRLQTLMS-VDDSVERLYNMLVESGELDNTYIIYTADHGY 319
Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
FG+ ++G KS P+D +R I P ++ V + +I D PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTI 368
Query: 349 CAAAGIEINDTSLDG 363
AG++ + + +DG
Sbjct: 369 LDIAGLD-SPSDVDG 382
>sp|P50426|GNS_CAPHI N-acetylglucosamine-6-sulfatase OS=Capra hircus GN=GNS PE=2 SV=1
Length = 559
Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 171/406 (42%), Gaps = 88/406 (21%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
++P+++++LADD D G + P AL G+ + YV ALC PSR++++
Sbjct: 52 RRPNVVLVLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 106
Query: 86 TGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFF 137
TGKYP H V+ LEG + W P L+ GY T GK +
Sbjct: 107 TGKYP----HNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YL 158
Query: 138 REVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTI 189
E P G + YW L+ YY+++ G +H N VD
Sbjct: 159 NEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD---- 209
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE--- 238
Y TD+ +++ + + S+P F+ ++ A H+ T P FQ AP +
Sbjct: 210 --YLTDVLANVSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFN 267
Query: 239 --------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
+FLD + ER + ++S +D+ V ++ L +G L
Sbjct: 268 IHGTNKHWLIRQAKTPMTNSSIQFLDNAFRER--WQTLLS-VDDLVEKLVKRLEFNGELN 324
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
N+ + + +DN G H+ + S L K ++ ++ + P +K Q S L
Sbjct: 325 NTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKML 375
Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
D PT+ AG +N T +DG++ +L +GA T RS++L
Sbjct: 376 VANIDLGPTILDIAGYGLNKTQMDGMSFLPIL-RGASNLTWRSDVL 420
>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
SV=1
Length = 870
Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 74/375 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P+II++L DD DV GS Q+ + + G +V +C PSRS+++TG
Sbjct: 42 RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTG 97
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKL-----LPQYLKEAGYATHAIGKWHLGFFREVYT 142
KY +H H V E P + L YL GY T GK+ L + Y
Sbjct: 98 KY-VH---NHNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P G+ G + + Y C+ + G D + Y TDL T +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200
Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
N + +P+ + ++H A H P F AP+ +
Sbjct: 201 NYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260
Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
+ + P +R+ ++S +D+SV + L + G L N+ +++ AD+G
Sbjct: 261 MQYTGPMLPIHMEFTNVLQRKRLQTLMS-VDDSVERLYNMLVETGELGNTYIIYTADHGY 319
Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
FG+ ++G KS P+D +R I P ++ V + +I D PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTI 368
Query: 349 CAAAGIEINDTSLDG 363
AG++ + +DG
Sbjct: 369 LDIAGLDT-PSDVDG 382
>sp|P15586|GNS_HUMAN N-acetylglucosamine-6-sulfatase OS=Homo sapiens GN=GNS PE=1 SV=3
Length = 552
Score = 63.9 bits (154), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 102/406 (25%), Positives = 170/406 (41%), Gaps = 88/406 (21%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
++P+++++L DD D G + P AL G+ + YV ALC PSR++++
Sbjct: 45 RRPNVVLLLTDD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 99
Query: 86 TGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFF 137
TGKYP H V+ LEG + W P L+ GY T GK +
Sbjct: 100 TGKYP----HNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YL 151
Query: 138 REVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTI 189
E P G + + YW L+ YY+++ G +H N VD
Sbjct: 152 NEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD---- 202
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE--- 238
Y TD+ +++ + + +P F+ +A A H+ T P FQ AP +
Sbjct: 203 --YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFN 260
Query: 239 --------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
+FLD + R+ + ++S +D+ V ++ L G L
Sbjct: 261 IHGTNKHWLIRQAKTPMTNSSIQFLD--NAFRKRWQTLLS-VDDLVEKLVKRLEFTGELN 317
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
N+ + + +DN G H+ + S L K ++ ++ + P +K Q S L
Sbjct: 318 NTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKML 368
Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
D PT+ AG ++N T +DG++ +L +GA T RS++L
Sbjct: 369 VANIDLGPTILDIAGYDLNKTQMDGMSLLPIL-RGASNLTWRSDVL 413
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.135 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 236,755,866
Number of Sequences: 539616
Number of extensions: 10538256
Number of successful extensions: 23453
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 53
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 23161
Number of HSP's gapped (non-prelim): 161
length of query: 593
length of database: 191,569,459
effective HSP length: 123
effective length of query: 470
effective length of database: 125,196,691
effective search space: 58842444770
effective search space used: 58842444770
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)