BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1088
(905 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
Length = 528
Score = 327 bits (838), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 224/334 (67%), Gaps = 16/334 (4%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS ++
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IH G+QH ++ C+ +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 176 RGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A LD+R EPA + YST++FT A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
E+VG V +AL+ R + +N++++F +DNGG + +NWPLRG K TLWEGG+RGAG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 325
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ SPLL+ +G+ + + +H++DWLPTL++ A S
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359
Score = 38.5 bits (88), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LK+ GY T ++
Sbjct: 104 LQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVG 139
>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
Length = 533
Score = 327 bits (838), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
+S PPH++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
T RGF+++ GY G +DY+ H A + LD R E A YST++FT
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
A+ +I NH ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
+DE+VG V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 42.7 bits (99), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144
Score = 36.2 bits (82), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNID 396
Score = 36.2 bits (82), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNID 396
>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
Length = 534
Score = 325 bits (833), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L NYY LCTPSRS +
Sbjct: 41 GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IH G+QH ++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159
Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A E + LD+R EPA + + YST++FT
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I+D R +A ++ +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
DE+VG V +AL+ + +N++ +F +DNGG + +NWPLRG K TLWEGG+RG
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365
Score = 40.0 bits (92), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145
Score = 33.9 bits (76), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGK 612
+DG ++W +S PS R +LHNID ++ GK
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDFFDGLPCPGK 409
Score = 33.9 bits (76), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
+DG ++W +S PS R +LHNID ++
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400
>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
Length = 535
Score = 318 bits (815), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 154/331 (46%), Positives = 215/331 (64%), Gaps = 16/331 (4%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPH++F+LADDLGWNDV FHG I TP++D LA G++L NYYT LCTPSRS ++TG+
Sbjct: 46 PPHLVFVLADDLGWNDVSFHG-SNIRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF
Sbjct: 105 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
+++ GY G +DY+ H + + LD R + A YST++FT A +
Sbjct: 165 DTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATAL 224
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
I +H ++PLFLYLA + H EPLQ P+ YL + I+D R +A ++ +DE+V
Sbjct: 225 ITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAV 279
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
G V AL+ + +N++ +F +DNGG L +NWPLRG K +LWEGG+RG G +
Sbjct: 280 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVA 335
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 336 SPLLKQKGVKNRELIHISDWLPTLVKLARGS 366
Score = 42.7 bits (99), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146
Score = 37.0 bits (84), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 373 LDGFDVWKTISEGSPSPRKELLHNID 398
Score = 37.0 bits (84), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 373 LDGFDVWKTISEGSPSPRKELLHNID 398
>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
Length = 599
Score = 304 bits (779), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 70 TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388
Score = 38.1 bits (87), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 77/185 (41%), Gaps = 24/185 (12%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
++DG D+W +S S R ILHNID + + A G W + ++V +++
Sbjct: 397 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 456
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
LTG P Y + + + L + + I K V LF+I DP E
Sbjct: 457 -LTGNPG--YSDWVPPQSFSNLGPNRWHN-ERITLSTGKSV--------WLFNITADPYE 504
Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIA----YPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
+ +L++R + + +FN+ A YP K+ + K++ KKKK
Sbjct: 505 RVDLSNRYPG-IVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPWYKEETKKKK 563
Query: 743 KKKKK 747
K +
Sbjct: 564 PSKNQ 568
>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
Length = 598
Score = 300 bits (767), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 68 TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + + G D+ + AWD +G YST ++T
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL++ +NSII++ SDNGG SNWPLRG K T WEGG+R G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386
Score = 35.8 bits (81), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 28/166 (16%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
++DG D+W +S S R ILHNID + + A G W + ++V +++
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 454
Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
LTG P G SD W+ A GP + + + LF+I
Sbjct: 455 -LTGNP------GYSD--WVP-------PQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498
Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
DP E+ +L+ R + + +FN+ A P + ++ + +
Sbjct: 499 DPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 543
>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
Length = 573
Score = 295 bits (754), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 41 VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
T RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H+ PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274
Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
+ AL++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G +
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
SPLL+ + + VH++DW PTL+ A +
Sbjct: 331 SPLLKKKRRTSRALVHITDWYPTLVGLAGGT 361
Score = 37.7 bits (86), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 60/171 (35%), Gaps = 55/171 (32%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTK 610
S + +DG DVW +S S R ILHNID W +A+
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRV 421
Query: 611 GKWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPV 664
G+WKL LTG P D + L+ W M +R A
Sbjct: 422 GEWKL------------LTGDPGYGDWIPPQTLASFPGSWWNLERMASIRQAV------- 462
Query: 665 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 463 -----------WLFNISADPYEREDLAGQRPDV-VRTLLARLADYNRTAIP 501
Score = 35.0 bits (79), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
S + +DG DVW +S S R ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393
>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
Length = 573
Score = 289 bits (740), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 KPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361
Score = 40.0 bits (92), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 61/171 (35%), Gaps = 55/171 (32%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTK 610
S + +DG DVW +S S R ILHNID W +A+
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRV 421
Query: 611 GKWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPV 664
G+WKL LTG P D + L+ W M +R A
Sbjct: 422 GEWKL------------LTGDPGYGDWIPPQTLASFPGSWWNLERMASIRQAV------- 462
Query: 665 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LAD+ D + + +N+ A P
Sbjct: 463 -----------WLFNISADPYEREDLADQRPDV-VRTLLARLADYNRTAIP 501
Score = 34.7 bits (78), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
S + +DG DVW +S S R ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393
>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
Length = 569
Score = 289 bits (739), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ +H++DW PTL+ A +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361
Score = 39.3 bits (90), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 67/169 (39%), Gaps = 33/169 (19%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW---QISALTK--GKW--KLVKVVKV 621
S + +DG DVW +S S R ILHNID + Q +L G W + ++V
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRV 421
Query: 622 MRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP 675
+++ LTG P D + L+ W M +R A
Sbjct: 422 GEWKL-LTGDPGYGDWIPPQTLATFPGSWWNLERMASVRQAV------------------ 462
Query: 676 CLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
LF+I DP E+ +LA + D + + +N+ A P + E +
Sbjct: 463 WLFNISADPYEREDLAGQRPDV-VRTLLARLAEYNRTAIPVRYPAENPR 510
Score = 35.0 bits (79), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
S + +DG DVW +S S R ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393
>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
Length = 573
Score = 288 bits (738), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
HIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG++
Sbjct: 49 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167
Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 227
Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V + AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 282
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
++ +NS+I+F SDNGG + SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 283 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 338
Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
+ VH++DW PTL+ A +
Sbjct: 339 RRTSRALVHITDWYPTLVGLAGGT 362
Score = 36.6 bits (83), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 62/179 (34%), Gaps = 55/179 (30%)
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTKG 611
S + +DG DVW +S S R ILHNID W +A+ G
Sbjct: 364 SAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEAGFGIWNTAVQAAIRVG 423
Query: 612 KWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPVK 665
+WKL LTG P D + L+ W M R A
Sbjct: 424 EWKL------------LTGDPGYGDWIPPQTLAAFPGSWWNLERMASARQAV-------- 463
Query: 666 EVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
LF+I DP E+ +LA + D + + +N+ A P + E +
Sbjct: 464 ----------WLFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIPVRYPAENPR 511
Score = 34.7 bits (78), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
S + +DG DVW +S S R ILHNID
Sbjct: 364 SAADGLDGYDVWPAISEGRASPRTEILHNID 394
>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1
SV=1
Length = 522
Score = 179 bits (453), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G + GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183
Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
W++ G+Y T ++ EA+D I + P FLY A ATH+ P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+ A +L +R ++ + ++D+S+GK++E L+ + N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292
Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
SN P K T +EGG+R L W P + G V+ Q + D T L+
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352
Query: 380 AN 381
A
Sbjct: 353 AG 354
>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
Length = 522
Score = 173 bits (439), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 177/363 (48%), Gaps = 43/363 (11%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
Y LC+PSR+A++TG+ PI TG N E GG+P E +LP+ LK G
Sbjct: 72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 182
Query: 214 WDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
W++ G++ T ++ EA+D I +T P FLY A ATH+
Sbjct: 183 WEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA----- 237
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
P+ A +L +R ++ + ++D+SVG++V L ++ N+ + F SDNG
Sbjct: 238 PVYASRAFLGTS------QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGA 291
Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
A SN P K T +EGG+R + W P G V+ Q V D T LS
Sbjct: 292 ALVSAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLS 351
Query: 379 AAN 381
A
Sbjct: 352 LAG 354
>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2
SV=2
Length = 520
Score = 173 bits (438), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+ LC+PSR+A++TG
Sbjct: 27 PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86
Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+ PI G N E GG+P SE +LP+ LK+ GY +IVGKWHLG ++ +
Sbjct: 87 RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145
Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
+ P GF+ G H +D+ A+ + + RD W++ G++
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197
Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
T ++T EA+D I H+ P FLY A ATH+ P+ A +L
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248
Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
R ++ + ++D+SVGK++ L+ + N+ + F SDNG A SN P
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306
Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352
>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2
SV=1
Length = 522
Score = 172 bits (437), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 53/368 (14%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER------------GGLPLSEKILPQY 148
Y+ LC+PSR+A++TG+ PI G Y R GG+P E +LP+
Sbjct: 72 YSANPLCSPSRAALLTGRLPIRNG-----FYTTNRHARNAYTPQEIVGGIPDQEHVLPEL 126
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
LKE GY ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + R
Sbjct: 127 LKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYR 181
Query: 209 DLEPAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHS 253
D W++ G+Y T V+ EA+D I + P FLY A ATH+
Sbjct: 182 D----WEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA 237
Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
P+ A +L +R ++ + ++D SVGK++ L+ R+ N+ + F
Sbjct: 238 -----PVYASRPFLGTS------QRGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFT 286
Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
SDNG A SN P K T +EGG+R + W P G V+ Q + D
Sbjct: 287 SDNGAALISAPNQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLF 346
Query: 374 PTLLSAAN 381
T LS A
Sbjct: 347 TTSLSLAG 354
>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns
PE=1 SV=1
Length = 524
Score = 169 bits (429), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++Y+
Sbjct: 17 VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76
Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
LC+PSR+A++TG+ PI G N E GG+P SE +LP+ LK+ GY
Sbjct: 77 NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+IVGKWHLG ++ ++ P GF+ G H +D+ + + + RD W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187
Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
G++ T ++ EA+D I H+ P FLY A ATH+ P+
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242
Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
A +L R ++ + ++D+SVGK++ L+ + N+ + F SDNG A
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296
Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
SN P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356
>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
Length = 507
Score = 150 bits (379), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/350 (32%), Positives = 161/350 (46%), Gaps = 40/350 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y V LCTPSR+A++TG
Sbjct: 20 PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
+ P+ GM VL RGGLPL E + + L GY T + GKWHLG + + P +
Sbjct: 80 RLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQ 139
Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
GF LG H Q+ G D + + +P W L
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 199
Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
+Y A A D++ + D P FLY A TH + E
Sbjct: 200 RY-----MAFAHDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQSFAERSG 243
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
R F L +LD +VG ++ A+ +L ++++F +DNG + S LR K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
T +EGGVR L + P + G+ E + D LPTL + A + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350
>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
Length = 506
Score = 146 bits (368), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 172/371 (46%), Gaps = 40/371 (10%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
TL + ++++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 5 TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYVP 64
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ +GM VL +GGLPL E L + L GY T + GKWH
Sbjct: 65 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 124
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
LG + + P +GF LG H Q+ + G D +
Sbjct: 125 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLANL 184
Query: 207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAP 263
+ +P W L +Y + + D++ + P FLY A TH
Sbjct: 185 TVEAQPPWLPGLEARY-----VSFSRDLMADAQRQGRPFFLYYASHHTHYPQ-------- 231
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
+ + R F L +LD +VG ++ + +L ++++F +DNG
Sbjct: 232 ---FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRM 288
Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ N + LR K T +EGGVR L++ P + G+ E + D LPT L+A +
Sbjct: 289 S-NGGCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPT-LAALTGA 345
Query: 384 DIPNYVNSTVE 394
+PN V+
Sbjct: 346 PLPNVTLDGVD 356
>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
Length = 507
Score = 145 bits (367), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 162/358 (45%), Gaps = 42/358 (11%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y V LCTPSR+A++TG
Sbjct: 20 PPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 79
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
+ P+ G+ VL RGGLPL E L + L GY T I GKWHLG + + P
Sbjct: 80 RLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPHH 139
Query: 177 GFESHLGYWTGH-----------------QDYFDHSAEEMKMWGLDMRRDLEPAW--DLH 217
GF LG H + D + + ++ + +P W L
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLA-NLSVEAQPPWLPGLE 198
Query: 218 GKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
+Y A A D++ + P FLY A TH + P H
Sbjct: 199 ARY-----VAFARDLMTDAQHQGRPFFLYYASHHTHYPQ-FSGQSFPGHS---------- 242
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
R F L +LD +VG ++ A+ +L +++ F +DNG + S LR
Sbjct: 243 GRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRMSHGGCSGL-LRCG 301
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
K T +EGGVR L + P + G+ E + D LPTL + A + +PN V+
Sbjct: 302 KGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-AQLPNITLDGVD 357
>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
Length = 551
Score = 137 bits (345), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 174/371 (46%), Gaps = 57/371 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI---PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
P+++ L DD+GW DVGF+G PTP+IDA+A G+IL + Y+ +P+R+ I+T
Sbjct: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT-- 174
G++ IH G+ +YG + GGL LPQ L + GY T+ +GKWH+G KE P
Sbjct: 146 GQYSIHHGILMPPMYG-QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202
Query: 175 ----FRGFESHLGYWTGHQDYFDHSAEEMKMW--------GLDMRRD---------LEPA 213
FRGF S +T +D H E+ + L +D +
Sbjct: 203 GFDDFRGFNSVSDMYTEWRDV--HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260
Query: 214 WDLHGKYSTDV---FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
D+ KY D+ + V + + +D+P FLY H D+Y N
Sbjct: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----------DNYPNA 310
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
R+ + + ++++ + + LE+ L N++IVF SDNG A +
Sbjct: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVPPHG 367
Query: 330 NWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
P RG K + WEGGVR + W +++ R ++ V ++D PT L D+ +
Sbjct: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTAL------DLAGH 419
Query: 389 VNSTVENIIPR 399
+ V N++P+
Sbjct: 420 PGAKVANLVPK 430
>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
Length = 536
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/423 (27%), Positives = 185/423 (43%), Gaps = 109/423 (25%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK- 118
P+ + I+ADDLG++D+G G +I TPN+DALA +G+ L +++T C+P+RS ++TG
Sbjct: 5 PNFLVIVADDLGFSDIGAFG-GEIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTGTD 63
Query: 119 -HPIHTGMQHNVLYGCERGGLP-----LSEKI--LPQYLKELGYRTRIVGKWHLGFYKKE 170
H G L E G P L+E++ LP+ L+E GY+T + GKWHLG K E
Sbjct: 64 HHIAGIGTMAEALT-PELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGL-KPE 121
Query: 171 YTPTFRGFESHLGYWTGHQDY------FDHSAEEMKMWGLDMRRDLEPAWDL--HGKYST 222
TP RGFE G ++ +D S + + + E D G YS+
Sbjct: 122 QTPHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPEGFYSS 181
Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----------- 271
D F + + + P F YL +A P+ PLQAP + +R
Sbjct: 182 DAFGDKLLQYLKERDQSRPFFAYLPFSA-----PHWPLQAPREIVEKYRGRYDAGPEALR 236
Query: 272 ------------------------------HIEDFKRSK-------FAAILHKLDESVGK 294
+ED +R+K +AA++ ++D ++G+
Sbjct: 237 QERLARLKELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGR 296
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAA-------------GF----------NLNAASNW 331
VV+ L ++ L N+ ++F+SDNG A GF N+ A+++
Sbjct: 297 VVDYLRRQGELDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSY 356
Query: 332 -------------PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
P R K +GG+R L+ P L +G ++ + V D PTLL
Sbjct: 357 VWYGPRWAQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLD 416
Query: 379 AAN 381
A
Sbjct: 417 LAG 419
>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
GN=ydeN PE=3 SV=2
Length = 560
Score = 123 bits (309), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 121/454 (26%), Positives = 192/454 (42%), Gaps = 98/454 (21%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFH--------------------GLD------QIPTPNI 88
++ G P+II + DDLG+ + F G+D Q TP +
Sbjct: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112
Query: 89 DALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
+L G+ N Y + PSR+AIMTG+ P G+ N + G+PL+E LP+
Sbjct: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPE 169
Query: 148 YLKELGYRTRIVGKWHLG----------------------FYKKEYTPTFRGFESHLGYW 185
+ GY T VGKWHL F +E+ P RGF+ +G+
Sbjct: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229
Query: 186 TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPLFL 244
Y++ + L R+ PA Y +D T EA+ ++ T D+P L
Sbjct: 230 AAGTAYYNSPS-------LFKNRERVPA----KGYISDQLTDEAIGVVDRAKTLDQPFML 278
Query: 245 YLAHAATH--SANPYEPLQAPDHY---LNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
YLA+ A H + NP APD Y N D + A ++ +D+ V +++E L
Sbjct: 279 YLAYNAPHLPNDNP-----APDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRILEQL 329
Query: 300 EQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
++ N+II+F SDNG G LN A +G K+ + GG +W
Sbjct: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ----KGYKSQTYPGGTHTPMFMWWKGKLQ 385
Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-----------ILRY 407
G ++ + D+ PT L AA+ S IP + +++P ++ I Y
Sbjct: 386 PGNY-DKLISAMDFYPTALDAADIS-IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443
Query: 408 ENGTHEYNSPRIENSN--TRYENGTHEYNPKYEN 439
+ E N P +N + R+++ + +NP E+
Sbjct: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477
>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
Length = 525
Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 112/396 (28%), Positives = 174/396 (43%), Gaps = 67/396 (16%)
Query: 39 VLPLAFTLSMVFVDLVASS------GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
VL + S F LV S P P+I+ ILADD+GW D+G + + T N+D +
Sbjct: 8 VLLVGMAFSGFFYPLVDFSISGKTRAPQPNIVIILADDMGWGDLGANWAETKDTTNLDKM 67
Query: 92 AYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
A G+ +++ C+PSR++++TG+ + G+ HN GGLP++E L + L+
Sbjct: 68 ASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHNFAV-TSVGGLPVNETTLAEVLR 126
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY-WTGHQDYFDHSA----------EEM 199
+ GY T ++GKWHLG + Y P FRGF+ + G ++ D +
Sbjct: 127 QEGYVTAMIGKWHLG-HHGSYHPNFRGFDYYFGIPYSNDMGCTDAPGYNYPPCPACPQRD 185
Query: 200 KMW---GLDMRRD-----------LEPAWDLHGKYSTDVFTAEAVDIIHNHSTD-EPLFL 244
+W G D D +E +L G + AV+ I ST P L
Sbjct: 186 GLWRNPGRDCYTDVALPLYENLNIVEQPVNLSGL--AQKYAERAVEFIEQASTSGRPFLL 243
Query: 245 YLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
Y+ A H S P PL P ++S + A L ++D VG++ + ++
Sbjct: 244 YVGLAHMHVPLSVTP--PLAHPQ------------RQSLYRASLREMDSLVGQIKDKVDH 289
Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV----------KNTLWEGGVRGAGLI 351
N+++ F DNG A L A S P G+ K T WEGG R L
Sbjct: 290 VAR-ENTLLWFTGDNGPWAQKCEL-AGSVGPFFGLWQTHQGGSPTKQTTWEGGHRVPALA 347
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
+ P + + + + D PT+++ A S PN
Sbjct: 348 YWPGRVPANVTSTALLSLLDIFPTVIALAGASLPPN 383
>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
Length = 583
Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 158/388 (40%), Gaps = 68/388 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II ++ADDLG D G +G I TPNID LA G+ L + LCTPSR+A MTG+
Sbjct: 27 PNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGR 86
Query: 119 HPIHTGMQH-----NVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT- 172
+P+ +GM L+ GGLP E + LK+ GY T ++GKWHLG T
Sbjct: 87 YPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTD 146
Query: 173 ----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
P GF G + D E ++ +R + + G +
Sbjct: 147 FCHHPLHHGFNYFYG--ISLTNLRDCKPGEGSVFTTGFKRLVFLPLQIVGVTLLTLAALN 204
Query: 229 AVDIIH-------NHSTDEPLFLYLAHAATHSANP--------YEPLQAPDHYLN----- 268
+ ++H + L L L H P YE +Q P Y N
Sbjct: 205 CLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPMSYDNLTQRL 264
Query: 269 ----------------------IHRHIEDFKRSKFAA---------ILHKLDESVGKVVE 297
+H H F FA + ++D SVG+++
Sbjct: 265 TVEAAQFIQRNTETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEMDWSVGQILN 324
Query: 298 ALEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
L++ R+ ++++I F SD G ++ ++ SN +G K WEGG+R G++
Sbjct: 325 LLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGGIRVPGILRW 384
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
P + G ++ D PT+ A
Sbjct: 385 PRVIQAGQKIDEPTSNMDIFPTVAKLAG 412
>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
Length = 526
Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 158/363 (43%), Gaps = 50/363 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLPL+E L + L++ GY T ++GKWHLG + Y P+FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTAMIGKWHLG-HHGSYHPSFRGF 153
Query: 179 ESHLGYW----TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV---------- 224
+ + G G D ++ R P D + + +
Sbjct: 154 DYYFGIPYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENLNIVEQP 213
Query: 225 ---------FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
+ AV+ I ST P LY+ A H P A ++R
Sbjct: 214 VNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTPPLANPQSQRLYR--- 270
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
A L ++D VG++ + ++ N+++ F DNG A L A S P
Sbjct: 271 --------ASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNGPWAQKCEL-AGSMGPFS 320
Query: 335 GV----------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
G+ K T WEGG R L + P + + + + D PT+++ A S
Sbjct: 321 GLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAGASL 380
Query: 385 IPN 387
PN
Sbjct: 381 PPN 383
>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
Length = 554
Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 182/454 (40%), Gaps = 104/454 (22%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S P+ + I+ADDLGW+DV G +I TPNI+ LA G+ L N++T C+P+RS +
Sbjct: 7 AESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSPTRSML 65
Query: 115 MTGK--HPIHTGMQHNVLYGCER--GGLP-----LSEKI--LPQYLKELGYRTRIVGKWH 163
++G H G + + GG P L++++ LP+ L+E GY T + GKWH
Sbjct: 66 LSGTDNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTTMSGKWH 125
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW--------- 214
LG Y P+ RGF+ G ++F + + + L P +
Sbjct: 126 LGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPF---LPPLYTHNHDPVDH 181
Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH--------------------- 252
L YS++ F + +D + N + F YL A H
Sbjct: 182 KSLKNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTAPHWPLQSPKEYINKYRGRYSEGP 241
Query: 253 --------------SANPYEPLQAP---------DHYLNIHRHIEDFKRSKFAAILHKLD 289
P + AP D + +AA++ LD
Sbjct: 242 DVLRKNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVYAAMVELLD 301
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA---------------------------- 321
++G+V++ L+ L N+ ++F+SDNG +
Sbjct: 302 LNIGRVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNSLENLGNYN 361
Query: 322 -----GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
G A+ P R K + EGG+R +I P L I+++++V V D LPT+
Sbjct: 362 SFIWYGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTI 421
Query: 377 LSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
L A P + + +IPR + I + +G
Sbjct: 422 LELAEVPH-PGHKFQGRDVVIPRGKPWIDHFVHG 454
>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
Length = 525
Score = 117 bits (294), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 157/359 (43%), Gaps = 50/359 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ N GGLPL+E L + L++ GY T I+GKWHLG + Y P FRGF
Sbjct: 96 LGLRNGVTRNFAV-TSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLG-HHGSYHPNFRGF 153
Query: 179 ESHLGYWTGH------QDYFDHSAEEMKMWGLDMRRDLE---------PAWD-------- 215
+ + G H ++H G R+L+ P ++
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQP 213
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
++ + +A I ST P LY+A A H P L A
Sbjct: 214 VNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPR--------- 264
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
RS + A L ++D VG++ + ++ + N+ + F DNG A L A S P
Sbjct: 265 --GRSLYGAGLWEMDSLVGQIKDKVDH-TVKENTFLWFTGDNGPWAQKCEL-AGSVGPFT 320
Query: 335 G----------VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G K T WEGG R L + P + + + V D PT+++ A S
Sbjct: 321 GFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQAS 379
>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
Length = 535
Score = 116 bits (291), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 155/359 (43%), Gaps = 50/359 (13%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLPL+E L + L++ GY T ++GKWHLG + Y P FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLG-HHGPYHPNFRGF 153
Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV-------------- 224
+ + G H + R P+ L TDV
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLNIVEQP 213
Query: 225 ---------FTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
+ +A+ I H ++ P LY+ A H L A + R
Sbjct: 214 VNLSSLAHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRTQLSA------VLR--- 264
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
R + A L ++D VG++ + ++ R N+ + F DNG A L A S P
Sbjct: 265 --GRRPYGAGLREMDSLVGQIKDKVD-RTAKENTFLWFTGDNGPWAQKCEL-AGSVGPFT 320
Query: 335 GV----------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
G+ K T WEGG R L + P + + + V D PT+++ A S
Sbjct: 321 GLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAGAS 379
>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
Length = 551
Score = 116 bits (290), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 152/324 (46%), Gaps = 38/324 (11%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ ++AD +G D+ +G ID +A G+ N Y +CTPSRSAIMTG+
Sbjct: 52 PNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSAIMTGR 111
Query: 119 HPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
P+ G + V + GLP SE + + +KE GY T +VGKWHLG + T
Sbjct: 112 LPVRIGTFGETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLGINENSSTDG-- 169
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD---MRRDLEPAWDLH-------------GKY 220
+HL + G D+ H+ W D + +D + + K
Sbjct: 170 ---AHLPFNHGF-DFVGHNLPFTNSWSCDDTGLHKDFPDSQRCYLYVNATLVSQPYQHKG 225
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
T +FT +A+ I ++ D P FLY+A A H++ L + D + R R +
Sbjct: 226 LTQLFTDDALGFIEDNHAD-PFFLYVAFAHMHTS-----LFSSDDFSCTSR------RGR 273
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
+ L ++ ++V K+V+ LE+ + N+II F+SD+ G + RG K+
Sbjct: 274 YGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDH-GPHREYCEEGGDASIFRGGKSHS 332
Query: 341 WEGGVRGAGLIWSPLLESRGIVAE 364
WEGG R +++ P S GI E
Sbjct: 333 WEGGHRIPYIVYWPGTISPGISNE 356
>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
Length = 464
Score = 114 bits (285), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 154/340 (45%), Gaps = 65/340 (19%)
Query: 42 LAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+A +SM+ A P++I I+ADD+G++D+ G +IPTPN+ A+A G+ +
Sbjct: 6 MAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMS 64
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----HNVLYGCERGGLPLSEKI--LPQYLKELG 153
YYT + P+RS ++TG GM ++ G E L L++++ + + K+ G
Sbjct: 65 QYYTSPMSAPARSMLLTGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAG 124
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLE 211
Y T + GKWHLGF TP RGF + G +F+ + ++ + RD E
Sbjct: 125 YNTLMAGKWHLGFVPGA-TPKDRGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGE 183
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH------ 265
YS++ + + I ++P+F +LA A P++PLQAPD
Sbjct: 184 RVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTA-----PHDPLQAPDEWIKRFK 238
Query: 266 ------YLNIHR-------------------HIEDFKR----------------SKFAAI 284
Y ++R H+E K +AA+
Sbjct: 239 GQYEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAM 298
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGF 323
+ +D +G ++E L+Q N+++VF++DNG A GF
Sbjct: 299 IANMDAQIGTLMETLKQTGRDKNTLLVFLTDNGANPAQGF 338
>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
Length = 577
Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 154/340 (45%), Gaps = 65/340 (19%)
Query: 42 LAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
+A +SM+ A P++I I+ADD+G++D+ G +IPTPN+ A+A G+ +
Sbjct: 6 MAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMS 64
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----HNVLYGCERGGLPLSEKI--LPQYLKELG 153
YYT + P+RS ++TG GM ++ G E L L++++ + + K+ G
Sbjct: 65 QYYTSPMSAPARSMLLTGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAG 124
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLE 211
Y T + GKWHLGF TP RGF + G +F+ + ++ + RD E
Sbjct: 125 YNTLMAGKWHLGFVPGA-TPKERGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGE 183
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH------ 265
YS++ + + I ++P+F +LA A P++PLQAPD
Sbjct: 184 RVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTA-----PHDPLQAPDEWIKRFK 238
Query: 266 ------YLNIHR-------------------HIEDFKR----------------SKFAAI 284
Y ++R H+E K +AA+
Sbjct: 239 GQYEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAM 298
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGF 323
+ +D +G ++E L+Q N+++VF++DNG A GF
Sbjct: 299 IANMDAQIGTLMETLKQTGRDKNTLLVFLTDNGANPAQGF 338
>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
Length = 567
Score = 111 bits (278), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 50/330 (15%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
P++I +LADD+G D+ +G ID +A G+ + Y +CTPSRSAI+TG+
Sbjct: 67 PNVILLLADDMGVGDLSVYGHPTQEPGFIDQMANQGLRFTQGYSGDSVCTPSRSAIVTGR 126
Query: 119 HPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE- 170
PI TG +YG ER GLPL E + + +K GY T +VGKWHLG +
Sbjct: 127 QPIRTG-----VYGEERIFLPWTTTGLPLYEVTIAEAMKGAGYTTGMVGKWHLGINENSS 181
Query: 171 ----YTPTFRGFESHLGY-------W----TG-HQDYFDHSAEEMKMWGLDMRRDLEPAW 214
+ P RGF+ +G+ W TG HQD+ D +A + A
Sbjct: 182 SDGAHLPANRGFD-FVGHNLPFGNSWRCDDTGLHQDFPDTNACFL------YYNSTSVAQ 234
Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
K T + + V I + + ++P F+Y++ A H++ L + D + R
Sbjct: 235 PFQHKGLTQLLRDDTVGFIED-NVNKPFFMYVSFAHMHTS-----LFSSDDFSCTSR--- 285
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
R ++ L ++D+++ ++V L + N++I F SD+G +N R
Sbjct: 286 ---RGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVIFFTSDHGPHREYCGEGGDANV-FR 341
Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
G K WEGG R +++ P S G+ E
Sbjct: 342 GGKGQSWEGGHRIPYIVYWPGTISPGVSHE 371
>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
Length = 562
Score = 103 bits (258), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 138/322 (42%), Gaps = 65/322 (20%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 7 PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 66
Query: 119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
+PI +GM Y R GGLP +E + L+ GYRT ++GKWHLG
Sbjct: 67 YPIRSGMVSA--YNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCAS 124
Query: 171 -----YTPTFRGFESHLGYWTGHQDYFD-------HSAEEMKMWGLDMRRDLEPAWDLHG 218
Y P GF G G H +K+W + L P L
Sbjct: 125 RNDHCYHPLNHGFHYFYGVPFGLLSDCQASKTPELHRWLRIKLWISTVALALVPFLLLIP 184
Query: 219 KYS---------TDVFTAEAV------------------------DIIHNHSTDEPLFLY 245
K++ VF A +II +E +
Sbjct: 185 KFARWFSVPWKVIFVFALLAFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMKEEKVASL 244
Query: 246 LAHAATHSANPY--EPLQAPDHYLNIH-------RHIEDFKRSKFAAILHKLDESVGKVV 296
+ A Y EP +L++H + + K ++ + ++D VGK++
Sbjct: 245 MLKEALAFIERYKREPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKIL 304
Query: 297 EALEQRRMLSNSIIVFVSDNGG 318
+AL+Q R+ +++++ F SDNGG
Sbjct: 305 DALDQERLANHTLVYFTSDNGG 326
>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
Length = 588
Score = 100 bits (250), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 146/325 (44%), Gaps = 62/325 (19%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
S+ P+I+ ++ADDLG D+G +G + + TPNID LA G+ L + + LCTPSR+A
Sbjct: 34 STSRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93
Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+TG++P+ +GM ++ Y + GGLP +E + LKE GY T ++GKWHLG +
Sbjct: 94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153
Query: 170 EYT-----PTFRGFESHLGY---WTGHQDYFDHSAEEMKM-------------------- 201
+ P GF+ G G +++ S + + +
Sbjct: 154 SASDHCHHPLHHGFDHFYGMPFSLMGDCAHWELSEKRVNLEQKLNFLFQVLALVALTLVA 213
Query: 202 ----------WGLDMRRDLEPAWDLHGKYSTDVFTAEA-VDIIHNHSTDE---------P 241
W + L L G Y A ++ NH+ E P
Sbjct: 214 GKLTHLIPVSWTPVIWSALWAVLLLTGSYFVGALIVHAGCLLMRNHTITEQPMRFQKTTP 273
Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIH---RHIEDFKRSKFAAI----LHKLDESVGK 294
L L A+ N + P +L++H +E+F + + ++D VG+
Sbjct: 274 LILQEV-ASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGQ 332
Query: 295 VVEALEQRRMLSNSIIVFVSDNGGA 319
+++ L+ + ++++I F SD+GG+
Sbjct: 333 ILDTLDMEGLTNSTLIYFTSDHGGS 357
>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
Length = 589
Score = 97.8 bits (242), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 82/139 (58%), Gaps = 11/139 (7%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
S+ P+I+ ++ADDLG D+G +G + + TPNID LA G+ L + + LCTPSR+A
Sbjct: 34 SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93
Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+TG++P+ +GM ++ Y + GGLP +E + LKE GY T ++GKWHLG +
Sbjct: 94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153
Query: 170 EYT-----PTFRGFESHLG 183
+ P GF+ G
Sbjct: 154 SASDHCHHPLHHGFDHFYG 172
>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
Length = 577
Score = 97.1 bits (240), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 79/142 (55%), Gaps = 12/142 (8%)
Query: 54 VASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
A GP P+ + I+ADDLG D+G +G + TP+ID LA G+ L + LCTPSR
Sbjct: 19 AARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCTPSR 78
Query: 112 SAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+A +TG++P+ +GM + L+ GGLP +E + LK GY T +VGKWHLG
Sbjct: 79 AAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWHLGL 138
Query: 167 YKKEYT-----PTFRGFESHLG 183
+ + P GF+ LG
Sbjct: 139 SCQAASDFCHHPGRHGFDRFLG 160
Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 75/170 (44%), Gaps = 17/170 (10%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +EA D + + D P L+L+ H+A+ P A ++H +
Sbjct: 260 TQRLASEAGDFLRR-NRDTPFLLFLSFMHVHTAHFANPEFAGQ---SLH--------GAY 307
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA----ASNWPLRGVK 337
+ ++D +VG+V+ L++ + +N+++ SD+G N SN RG K
Sbjct: 308 GDAVEEMDWAVGQVLATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGK 367
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
WEGG+R GL+ P + G E+ D PT+ A +++P
Sbjct: 368 ANTWEGGIRVPGLVRWPGVIVPGQEVEEPTSNMDVFPTVARLAG-AELPT 416
>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
Length = 624
Score = 93.6 bits (231), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 53/136 (38%), Positives = 75/136 (55%), Gaps = 11/136 (8%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP 120
+ I+ADDLG D+G +G + TP++D LA G+ L + LCTPSR+A +TG++P
Sbjct: 37 FLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYP 96
Query: 121 IHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT--- 172
+GM + L+ GGLP SE + + LK GY T ++GKWHLG + T
Sbjct: 97 PRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFC 156
Query: 173 --PTFRGFESHLGYWT 186
P GF+ LG T
Sbjct: 157 HHPLRHGFDRFLGVPT 172
Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 66/159 (41%), Gaps = 17/159 (10%)
Query: 269 IHRHIEDFKRSKFAA-ILH--------KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
+H H F FA LH ++D VG+V+ AL++ + +++ F SD+G
Sbjct: 295 LHVHTAHFADPGFAGRSLHGAYGDSVEEMDWGVGRVLAALDELGLARETLVYFTSDHGAH 354
Query: 320 AAGFNLNA----ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
SN RG K WEGGVR L+ P S G V + + D PT
Sbjct: 355 VEELGPRGERMGGSNGVFRGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEPTSLMDVFPT 414
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
+ A +++P +++P R E HE+
Sbjct: 415 VARLAG-AELPGDRVIDGRDLMPLLRGDAQRSE---HEF 449
>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
Length = 590
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 70/116 (60%), Gaps = 12/116 (10%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+ I+ DDLG D+G +G D + TP+ID LA G+ L + + LC+PSRSA +TG+
Sbjct: 30 PNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGR 89
Query: 119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
+PI +GM V G R GLPL+E L LK+ GY T ++GKWH G
Sbjct: 90 YPIRSGM---VSSGNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGL 142
Score = 38.9 bits (89), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 12/95 (12%)
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ EA+ + HS E L+ + H+ PL D + +H +
Sbjct: 266 IMVKEAISFLERHS-KETFLLFFSFLHVHT-----PLPTTDDFTGTSKH------GLYGD 313
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
+ ++D VGK+++A++ + +N+++ F SD+GG
Sbjct: 314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGG 348
>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
Length = 562
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 76/136 (55%), Gaps = 12/136 (8%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 7 PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66
Query: 119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE-- 170
+PI +GM +N+ G GGLP +E + L+ GYRT ++GKWH G
Sbjct: 67 YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126
Query: 171 ---YTPTFRGFESHLG 183
Y P GF+ G
Sbjct: 127 DHCYHPLNHGFDYFYG 142
Score = 38.9 bits (89), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
P L+++ H+ PL D + + K + + ++D VGK++E L+
Sbjct: 260 PFLLFVSFLHVHT-----PLITKDKF------VGHSKYGLYGDNVEEMDWMVGKILETLD 308
Query: 301 QRRMLSNSIIVFVSDNGG 318
Q R+ +++++ F SDNGG
Sbjct: 309 QERLTNHTLVYFTSDNGG 326
>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
Length = 593
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 68/112 (60%), Gaps = 6/112 (5%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ I+ADDLG D+G +G + + TPNID LA G+ L + LCTPSR+A +TG+
Sbjct: 41 PNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGR 100
Query: 119 HPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
H +GM + + + GGLP +E + L++ GY T ++GKWH G
Sbjct: 101 HSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQG 152
>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
8237 / Type A) GN=CPF_0221 PE=1 SV=1
Length = 481
Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/421 (24%), Positives = 175/421 (41%), Gaps = 94/421 (22%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+ I+ D + + +G +G + I TPN+D +A G +N YT V C SR++I+TG
Sbjct: 3 PNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGM 62
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
G G E G E + + GY T+ +GK H+ Y + F
Sbjct: 63 SQKSHGR-----VGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNI 115
Query: 179 ESHLGYWTGHQ--------------DYF-------DHSAEEMKMWGLDMRRDLEPAW--- 214
H GY + DY H+ + + + GLD + W
Sbjct: 116 MLHDGYLHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDI-GLDCNSWVSRPWGYE 174
Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+LH T+ E++D + +P FL ++ HS PL P Y ++++
Sbjct: 175 ENLH---PTNWVVNESIDFLRRKDPSKPFFLKMSFVRPHS-----PLDPPKFYFDMYKD- 225
Query: 274 EDF---------------KRSK---------------------FAAILHKLDESVGKVVE 297
ED R K + +I H +D +G+ +
Sbjct: 226 EDLPEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITH-IDHQIGRFLI 284
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--- 354
AL + L+N+I +FVSD+G ++ NW +G+ +EG R I+ P
Sbjct: 285 ALSEYGELNNTIFLFVSDHG------DMMGDHNWFRKGIP---YEGSSRVPFFIYDPGNL 335
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIPRYENSILRYENGTHE 413
L +G V ++ + + D +PTLL A+ S IP+ V +++N+I ++ Y +G H
Sbjct: 336 LKGKKGKVFDEVLELRDIMPTLLDFAHIS-IPDSVEGLSLKNLIEERNSTWRDYIHGEHS 394
Query: 414 Y 414
+
Sbjct: 395 F 395
>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
GN=CPE0231 PE=3 SV=1
Length = 481
Score = 88.2 bits (217), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/421 (24%), Positives = 175/421 (41%), Gaps = 94/421 (22%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
P+I+ I+ D + + +G +G + I TPN+D +A G +N YT V C SR++I+TG
Sbjct: 3 PNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGM 62
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
G G E G E + + GY T+ +GK H+ Y + F
Sbjct: 63 SQKSHGR-----VGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNI 115
Query: 179 ESHLGYWTGHQ--------------DYF-------DHSAEEMKMWGLDMRRDLEPAW--- 214
H GY + DY H+ + + + GLD + W
Sbjct: 116 MLHDGYLHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDI-GLDCNSWVSRPWGYE 174
Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+LH T+ E++D + +P FL ++ HS PL P Y ++++
Sbjct: 175 ENLH---PTNWVVNESIDFLRRRDPSKPFFLKMSFVRPHS-----PLDPPKFYFDMYKD- 225
Query: 274 EDF---------------KRSK---------------------FAAILHKLDESVGKVVE 297
ED R K + +I H +D +G+ +
Sbjct: 226 EDLPEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITH-IDHQIGRFLI 284
Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--- 354
AL + L+N+I +FVSD+G ++ NW +G+ +EG R I+ P
Sbjct: 285 ALSEYGKLNNTIFLFVSDHG------DMMGDHNWFRKGIP---YEGSARVPFFIYDPGNL 335
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIPRYENSILRYENGTHE 413
L +G V ++ + + D +PTLL A+ S IP+ V +++++I ++ Y +G H
Sbjct: 336 LKGKKGKVFDEVLELRDIMPTLLDFAHIS-IPDSVEGLSLKDLIEERNSTWRDYIHGEHS 394
Query: 414 Y 414
+
Sbjct: 395 F 395
>sp|P31447|YIDJ_ECOLI Uncharacterized sulfatase YidJ OS=Escherichia coli (strain K12)
GN=yidJ PE=3 SV=1
Length = 497
Score = 76.6 bits (187), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 148/366 (40%), Gaps = 73/366 (19%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ +F++ D N VG + + T NID+LA GI + YT +CTP+R+ + TG
Sbjct: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63
Query: 119 HPIHTG-MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG----FYKKEYTP 173
+ +G +NV G + + +Y K+ GY T +GKWHL F E P
Sbjct: 64 YANQSGPWTNNVAPG-------KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116
Query: 174 TFRGFESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDLHGKYSTDVFT------ 226
E YW +Y E E+ +W R L DL + + FT
Sbjct: 117 -----EWDADYWFDGANYLSELTEKEISLW----RNGLNSVEDLQANHIDETFTWAHRIS 167
Query: 227 AEAVDIIHNHS-TDEPLFLYLAHAATHS--ANPYEPLQA-PDHYLNIHRHIED------- 275
AVD + + DEP + +++ H P E L+ D Y + +D
Sbjct: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227
Query: 276 ------------------FKRSKFAAILHKLDESVGKVVEAL--EQRRMLSNSIIVFVSD 315
+ + A +D+ +G+V+ AL EQR N+ +++ SD
Sbjct: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR---ENTWVIYTSD 284
Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
+G L + +++ R +I SP E R + + V D LPT
Sbjct: 285 HGEMMGAHKLISKG--------AAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPT 334
Query: 376 LLSAAN 381
+++ A+
Sbjct: 335 MMALAD 340
>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
Length = 871
Score = 73.2 bits (178), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 147/375 (39%), Gaps = 65/375 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
P+ + ++HAA H P +Q
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
L IH + + K L +D+SV ++ L + L N+ I++ +D+G
Sbjct: 264 TGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQ 323
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
F L + P ++ +R I P +E IV + +++ D PT+L A
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGL 374
Query: 383 SDIPNYVNSTVENII 397
P+ +V ++
Sbjct: 375 DTPPDVDGKSVLKLL 389
>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
SV=1
Length = 870
Score = 73.2 bits (178), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 147/368 (39%), Gaps = 66/368 (17%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P L E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
P+ + ++HAA H P +Q
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
L IH + + K L +D+SV ++ L + L N+ I++ +D+G
Sbjct: 264 TGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQ 323
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
F L + P ++ +R I P +E IV + +++ D PT+L A
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAG- 373
Query: 383 SDIPNYVN 390
D P+ V+
Sbjct: 374 LDTPSDVD 381
>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
Length = 870
Score = 71.2 bits (173), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/358 (23%), Positives = 140/358 (39%), Gaps = 65/358 (18%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
P+ + ++HAA H P +Q
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
L IH + + K L +D+SV ++ L + L N+ I++ +D+G
Sbjct: 264 TGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQ 323
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
F L + P ++ +R I P +E IV + +++ D PT+L A
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIA 372
>sp|P51688|SPHM_HUMAN N-sulphoglucosamine sulphohydrolase OS=Homo sapiens GN=SGSH PE=1
SV=1
Length = 502
Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 152/372 (40%), Gaps = 87/372 (23%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
P+ +++ V + + P + + +LADD G+ + G + I TP++DALA ++ +N
Sbjct: 4 PVPACCALLLVLGLCRARPRNALLLLADDGGF-ESGAYNNSAIATPHLDALARRSLLFRN 62
Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYLKELGYR 155
+T V C+PSR++++TG P H N +YG + + + LP L + G R
Sbjct: 63 AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T I+GK H+G + P FD + E L + R++
Sbjct: 118 TGIIGKKHVG--PETVYP------------------FDFAYTEENGSVLQVGRNITRIKL 157
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA----HAATHSANPYE------------- 258
L K+ D P FLY+A H HS Y
Sbjct: 158 LVRKFL-------------QTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGM 204
Query: 259 ---PLQAPDHYLNIHRHIEDF------KRSKFAA---ILHKLDESVGKVVEALEQRRMLS 306
P P Y + + F R+ AA + ++D+ VG V++ L +L+
Sbjct: 205 GRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLN 264
Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQ 365
+++++F SDNG P + L+ G L+ SP R G V+E
Sbjct: 265 DTLVIFTSDNG-------------IPFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEA 311
Query: 366 YVHVSDWLPTLL 377
YV + D PT+L
Sbjct: 312 YVSLLDLTPTIL 323
>sp|Q90XB6|SULF1_COTCO Extracellular sulfatase Sulf-1 OS=Coturnix coturnix GN=SULF1 PE=1
SV=1
Length = 867
Score = 70.1 bits (170), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 147/381 (38%), Gaps = 77/381 (20%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN+ E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEE---MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
G+ +G + +++++ + G D +D Y TD+ T E++
Sbjct: 154 P--GWREWVGL-VKNSRFYNYTISRNGNKEKHGFDYAKD----------YFTDLITNESI 200
Query: 231 DI------IHNHSTDEPLFLYLAHAATHSANPYEP------------------------- 259
+ I+ H P+ + ++HAA H P
Sbjct: 201 NYFRMSKRIYPH---RPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257
Query: 260 ---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
+Q L IH + + K L +D+S+ ++ + L + L N+ I++ +D+
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317
Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
G F L + P ++ +R I P +E G V Q V D PT+
Sbjct: 318 GYHIGQFGLVKGKSMP--------YDFDIRVPFFIRGPSVEP-GSVVPQIVLNIDLAPTI 368
Query: 377 LSAANKSDIPNYVNSTVENII 397
L A P+ +V ++
Sbjct: 369 LDIAGLDTPPDMDGKSVLKLL 389
>sp|Q21376|SULF1_CAEEL Putative extracellular sulfatase Sulf-1 homolog OS=Caenorhabditis
elegans GN=sul-1 PE=3 SV=1
Length = 709
Score = 68.2 bits (165), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 146/381 (38%), Gaps = 67/381 (17%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ F ++P+ T S+ FVD ++I IL DD D+ +D +P +
Sbjct: 16 VLFLIIPIKVT-SIHFVD-----SQHNVILILTDD---QDIELGSMDFMPKTSQIMKERG 66
Query: 95 GIILKNYYTVQLCTPSRSAIMTG----KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
Y T +C PSRS I+TG H +HT Q+ G E + +K + YL+
Sbjct: 67 TEFTSGYVTTPICCPSRSTILTGLYVHNHHVHTNNQNCT--GVEWRKVH-EKKSIGVYLQ 123
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
E GYRT +GK +L Y Y P + + +Y +S E + +G + +D
Sbjct: 124 EAGYRTAYLGK-YLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKD- 181
Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEP-------- 259
Y TD+ T ++ I H +P L +++ A H P
Sbjct: 182 ---------YFTDLVTNRSLKFIDKHIKIRAWQPFALIISYPAPHGPEDPAPQFAHMFEN 232
Query: 260 --------------------LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
LQ ++H D + L +DE + ++ L
Sbjct: 233 EISHRTGSWNFAPNPDKQWLLQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLL 292
Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
+ L N+ ++ SD+G F L L+G KN +E +R + P + R
Sbjct: 293 RELNQLWNTYAIYTSDHGYHLGQFGL-------LKG-KNMPYEFDIRVPFFMRGPGI-PR 343
Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
+ + V D PT+L A
Sbjct: 344 NVTFNEIVTNVDIAPTMLHIA 364
>sp|Q9VEX0|SULF1_DROME Extracellular sulfatase SULF-1 homolog OS=Drosophila melanogaster
GN=Sulf1 PE=1 SV=1
Length = 1114
Score = 63.9 bits (154), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 146/355 (41%), Gaps = 65/355 (18%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II IL DD DV L+ +P + L G ++ YT +C P+RS+++TG
Sbjct: 54 PNIILILTDD---QDVELGSLNFMPR-TLRLLRDGGAEFRHAYTTTPMCCPARSSLLTGM 109
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ +H M C + + YL GYRT GK+ L Y Y P
Sbjct: 110 Y-VHNHMVFTNNDNCSSPQWQATHETRSYATYLSNAGYRTGYFGKY-LNKYNGSYIPP-- 165
Query: 177 GFESHLGYWTG---HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ W G + Y+++S + + G ++ + A D Y D+ +++ +
Sbjct: 166 GWRE----WGGLIMNSKYYNYS---INLNGQKIKHGFDYAKD----YYPDLIANDSIAFL 214
Query: 234 HN---HSTDEPLFLYLAHAATH----SANPYEPL-------QAP--DHYLN--------- 268
+ + +P+ L ++ A H SA Y L P DH N
Sbjct: 215 RSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDKQWILRV 274
Query: 269 ------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
+H+ + +K L +D +V +V L++ L N+ IV+ SD+G
Sbjct: 275 TEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDHGYHLGQ 334
Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
F L ++P +E VR LI P +++ +V E ++V D PT L
Sbjct: 335 FGLIKGKSFP--------FEFDVRVPFLIRGPGIQASKVVNEIVLNV-DLAPTFL 380
>sp|Q8BFR4|GNS_MOUSE N-acetylglucosamine-6-sulfatase OS=Mus musculus GN=Gns PE=2 SV=1
Length = 544
Score = 63.5 bits (153), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 152/366 (41%), Gaps = 63/366 (17%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYT-VQLCTPSR 111
V ++ P+++ +L DD D G+ P AL G+ + Y LC PSR
Sbjct: 33 VGAARRPNVLLLLTDD---QDAELGGM--TPLKKTKALIGEKGMTFSSAYVPSALCCPSR 87
Query: 112 SAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEK--ILPQYLKEL-GYRTRIVGKWHLGFY 167
++I+TGK+P + + +N L G C ++ P LK + GY+T GK Y
Sbjct: 88 ASILTGKYPHNHHVVNNTLEGNCSSKAWQKIQEPYTFPAILKSVCGYQTFFAGK-----Y 142
Query: 168 KKEYTPTFRGFESHL----GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
EY G H+ YW + + + + G + + D Y TD
Sbjct: 143 LNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTD 198
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHS---ANP-----YEPLQAP-DHYLNIH---- 270
V ++D + S EP F+ ++ A HS A P ++ + AP + NIH
Sbjct: 199 VLANLSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIHGTNK 258
Query: 271 ----------------RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
R ++D R ++ +L +D+ V K+V+ L+ L N+ I + S
Sbjct: 259 HWLIRQAKTPMTNSSIRFLDDAFRRRWQTLL-SVDDLVEKLVKRLDSTGELDNTYIFYTS 317
Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLP 374
DNG F+L P+ K L+E ++ L+ P ++ ++ V D P
Sbjct: 318 DNGYHTGQFSL------PID--KRQLYEFDIKVPLLVRGPGIKPNQ-TSKMLVSNIDLGP 368
Query: 375 TLLSAA 380
T+L A
Sbjct: 369 TILDLA 374
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.133 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 383,720,120
Number of Sequences: 539616
Number of extensions: 18899222
Number of successful extensions: 360465
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1474
Number of HSP's successfully gapped in prelim test: 1060
Number of HSP's that attempted gapping in prelim test: 202327
Number of HSP's gapped (non-prelim): 66834
length of query: 905
length of database: 191,569,459
effective HSP length: 127
effective length of query: 778
effective length of database: 123,038,227
effective search space: 95723740606
effective search space used: 95723740606
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (30.0 bits)
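The length and search-space figures in the statistics block above are related by simple arithmetic, which the following minimal sketch reproduces. All inputs are copied from this report. The variable names and the helper `expect_from_bits` are illustrative, not part of BLAST. The E-value relation E = m' * n' * 2^(-S') is the standard Karlin-Altschul form for bit scores; because the printed bit scores are rounded and this search uses compositional matrix adjustment, it should only be expected to land near the reported Expect values, not to match them digit for digit.

```python
# Values copied from the report footer.
query_len = 905            # "length of query"
db_letters = 191_569_459   # "length of database"
db_seqs = 539_616          # "Number of Sequences"
eff_hsp_len = 127          # "effective HSP length"

# The effective lengths subtract the HSP length correction once from the
# query and once per database sequence.
eff_query_len = query_len - eff_hsp_len
eff_db_len = db_letters - db_seqs * eff_hsp_len
search_space = eff_query_len * eff_db_len


def expect_from_bits(bit_score, space=search_space):
    """Approximate E-value for a bit score: E = space * 2**(-bit_score)."""
    return space * 2.0 ** (-bit_score)


print("effective length of query:   ", eff_query_len)
print("effective length of database:", eff_db_len)
print("effective search space:      ", search_space)

# Bit scores taken from hit summary lines in this report.
for bits in (76.6, 73.2):
    print(f"{bits} bits -> E ~ {expect_from_bits(bits):.1e}")
```

The three printed lengths agree with the footer (778, 123,038,227 and 95723740606). The approximate E-values for 76.6 and 73.2 bits fall close to the reported 9e-13 and 9e-12.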