BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy12301
(632 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
Length = 528
Score = 304 bits (779), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 198/572 (34%), Positives = 286/572 (50%), Gaps = 85/572 (14%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + AA L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 30 PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L++ GY+T +GKWHLG +R+
Sbjct: 89 SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 206 STNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRI 260
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV LD++VG V AL+ +G+ N+++IF +DNG T R+ G+N+P RG
Sbjct: 261 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 311
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T +DG D
Sbjct: 312 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTK-PLDGFD 370
Query: 427 QWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVRL 478
W ++ +PS R N + D D NT +N T A +R
Sbjct: 371 VWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRY 430
Query: 479 DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
+WKL+ G G Q+ ++VP S+ ++ ++L
Sbjct: 431 KNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SVDSPTKTLWL------- 469
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
F++ DP E+++++ P I L L+
Sbjct: 470 -------------------------------FDINRDPEERHDVSREHPHIVQNLLSRLQ 498
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP S+ P + DPK WSPW+
Sbjct: 499 YYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 528
Score = 57.0 bits (136), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 205 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 259
Query: 75 TYAALT 80
YA +
Sbjct: 260 IYAGMV 265
>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
Length = 534
Score = 299 bits (765), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T+
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+DG + W ++ PS R +L NID+ D + +
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406
Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
+N T + + PL + FN + + Y++ + L+ + F P S
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
+N + + P +LF++ DP E+++++ P I L L+
Sbjct: 458 -------------SNVSEIPPVGPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP SH P + DPK WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
Length = 533
Score = 291 bits (744), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438
Query: 482 KLVLG 486
KL+ G
Sbjct: 439 KLLTG 443
Score = 57.4 bits (137), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
Score = 35.0 bits (79), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LF++ DP E+++++ P I ++L L+++ + VP D + DPK W
Sbjct: 473 WLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCDPKA-TGVWG 530
Query: 628 PWI 630
PW+
Sbjct: 531 PWM 533
>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
Length = 535
Score = 288 bits (736), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 164/414 (39%), Positives = 234/414 (56%), Gaps = 33/414 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN I TP++D LA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D S V D R A Y+T++FT+ A LI P +KPL
Sbjct: 177 YYSHERCALID--SLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATALITSHPPEKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGG++ + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL A G T +DG D W ++ +PS R + +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGSTKGTK-PLDGFDVWKTISEGSPSPRKELLHNID- 398
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
+ + +P S+ D+ AA+R +WKL+ G G
Sbjct: 399 -PNFVDISPCPGKSLAPAKDDSSHPAYLAFNTSLHAAIRHGNWKLLTGYPGCGC 451
Score = 53.9 bits (128), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTERATALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 YYAGMV 272
>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
Length = 573
Score = 284 bits (727), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 185/540 (34%), Positives = 272/540 (50%), Gaps = 76/540 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ KPLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++A RPD+ L L + RT +P + + +A P W PW
Sbjct: 465 FNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523
Score = 50.1 bits (118), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ KPLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
Length = 569
Score = 283 bits (723), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.8 bits (112), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
Length = 573
Score = 281 bits (718), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 184/540 (34%), Positives = 271/540 (50%), Gaps = 76/540 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++A RPD+ L L + RT +P + + +A P W PW
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523
Score = 47.4 bits (111), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ PLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
Length = 573
Score = 281 bits (718), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + SAL+R G
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522
Query: 628 PW 629
PW
Sbjct: 523 PW 524
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 199 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 254 RTMGNVARRKYAAMV 268
>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
Length = 598
Score = 278 bits (710), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
Length = 599
Score = 276 bits (706), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns
PE=1 SV=1
Length = 524
Score = 165 bits (418), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 95 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
PL GF+ FG N YD+ + R E+ G + NL T +
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206
Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T L+ +EA+ I Q + P FLY A A HA Q++ R
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S + GSN P+
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
K T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367
Query: 425 LDQWSSLL 432
LD ++L
Sbjct: 368 LDLLPTML 375
>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2
SV=2
Length = 520
Score = 165 bits (417), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 91 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDGLD
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366
Query: 428 WSSLL 432
++L
Sbjct: 367 LPTML 371
>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1
SV=1
Length = 522
Score = 162 bits (409), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 95 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369
Query: 429 SSLL 432
+LL
Sbjct: 370 PTLL 373
>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2
SV=1
Length = 522
Score = 158 bits (400), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 175/368 (47%), Gaps = 48/368 (13%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM-----------QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G I G G+P E LPE L+E GY +K +GKWHLG R ++
Sbjct: 94 NGFYTTNRHARNAYTPQEIVG----GIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQF 148
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
PL GF+ FG N YD+ R E+ G + NL T +
Sbjct: 149 HPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL---- 204
Query: 248 TDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T ++ +EA+ I+ Q +P FLY A A HA ++ R
Sbjct: 205 TQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----------PVYASRPFLGTSQRGR 253
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y V+++D+SVG ++S LQ + EN+ + F SDNGA + S GSN P+
Sbjct: 254 YGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLC 308
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
K T +EGG++ PAI W P RVS Q+ I D T + AG + R+ IDG
Sbjct: 309 GKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDG 365
Query: 425 LDQWSSLL 432
LD ++L
Sbjct: 366 LDLLPAML 373
>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
Length = 522
Score = 157 bits (397), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 36/362 (9%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ + YA P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG G P+ G+P E LPE L+ GY++K +GKWHLG R ++ PL
Sbjct: 94 TGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 152
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIY 208
Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 209 LQEALDFIKRQQATHHPFFLYWAIDATHA-----------PVYASRAFLGTSQRGRYGDA 257
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG ++ L+ + N+ + F SDNGA V S + GSN P+ K T
Sbjct: 258 VREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALV-----SAPKQGGSNGPFLCGKQT 312
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P +VS Q+ + D T + AG + IDGLD +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPS-DRAIDGLDLLPA 371
Query: 431 LL 432
+L
Sbjct: 372 ML 373
>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
Length = 551
Score = 157 bits (397), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 138/440 (31%), Positives = 210/440 (47%), Gaps = 69/440 (15%)
Query: 87 LLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LL GW D+ F+G PTP+IDA+A G+IL + Y+QP +P+RA+++TG+Y IH
Sbjct: 92 LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIH 151
Query: 144 TGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
G+ PP++G +P G+ LT LP+ L + GY T+AIGKWH+G +E P GF+
Sbjct: 152 HGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDF 207
Query: 203 FGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-WDTVGE 245
G+ N V Y H+ LS S ++ + H +R A D +
Sbjct: 208 RGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266
Query: 246 YATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
Y DL + V+ ++ DKP FLY H N N +
Sbjct: 267 YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSS 316
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
P R +Y + +++D + L++ G L+N++I+F SDNG P E
Sbjct: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GR 368
Query: 362 YPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
P+RG K + WEGGV+VP + W IQ PR S ++ ++D PT AG +++
Sbjct: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKV-- 424
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
++L+ T + IDG+DQ +S L T + N + + AAVR+D
Sbjct: 425 --------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470
Query: 481 WKLVLGTQE--NGTMDGYYG 498
+K + Q+ T GY G
Sbjct: 471 FKYHVLIQQPYAYTQSGYQG 490
>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
Length = 536
Score = 122 bits (306), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/413 (27%), Positives = 174/413 (42%), Gaps = 101/413 (24%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ---- 147
G++D+ G EI TPN+DALA G+ L + + C+P+R+ L+TG G+
Sbjct: 16 GFSDIGAFG-GEIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTGTDHHIAGIGTMAE 74
Query: 148 --GPPIWGAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
P + G L ER LPE LRE GY T GKWHLG + E TP RGFE F
Sbjct: 75 ALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLG-LKPEQTPHARGFERSF 133
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGH-----DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
L G ++Y S L G + R L T + G Y++D F + +Q
Sbjct: 134 SLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPE--GFYSSDAFGDKLLQY 191
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF---------------------- 296
++++ +P F YL A H L+AP+E + ++
Sbjct: 192 LKERDQSRPFFAYLPFSAPH-----WPLQAPREIVEKYRGRYDAGPEALRQERLARLKEL 246
Query: 297 -------------------QYITDPNR-------RTYAAMVKKLDDSVGTVISALQRKGM 330
+ + D R YAAMV+++D ++G V+ L+R+G
Sbjct: 247 GLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGRVVDYLRRQGE 306
Query: 331 LENSIIIFMSDNGAP------------------------TVEYRETSNYRNW-------G 359
L+N+ ++FMSDNGA ++E +N W
Sbjct: 307 LDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSYVWYGPRWAQA 366
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+ P R K +GG++VPA++ P++ + +S + D PTL AG
Sbjct: 367 ATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLDLAG 419
>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
Length = 507
Score = 122 bits (306), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 124/421 (29%), Positives = 174/421 (41%), Gaps = 63/421 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
G+ DL +G TPN+D LA G+ + Y PV CTPSRA+L+TG+ P+ GM
Sbjct: 32 GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGMYPG 90
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFGYLNG 208
+ + G+PL E + E L GY T GKWHLG + P ++GF G
Sbjct: 91 VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYS 150
Query: 209 VISYYDHILSDQYSRTVELNGHD-------MRRNLST----AWDTVGEYATDLFTKEAVQ 257
L+ T G D + NLS W E F + +
Sbjct: 151 HDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMA 210
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
+ Q D+P FLY A H PQ + F + R + + +LD +
Sbjct: 211 DAQRQ--DRPFFLYYAS---------HHTHYPQFSGQSFAERS--GRGPFGDSLMELDAA 257
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
VGT+++A+ G+LE +++IF +DNG ET G + R K T +EGGV+
Sbjct: 258 VGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSGLLRCGKGTTYEGGVR 311
Query: 378 VPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
PA+ W I P V+ ++ D LPTL AG + LP
Sbjct: 312 EPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP----------------- 349
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRLDSWKLVLGTQENGTM 493
N +DG D LL S R S+ DE + AVR +K TQ +
Sbjct: 350 ---NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHS 406
Query: 494 D 494
D
Sbjct: 407 D 407
>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
Length = 577
Score = 122 bits (305), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 116/461 (25%), Positives = 197/461 (42%), Gaps = 103/461 (22%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
+AA + + ++I G++D+S G EIPTPN+ A+A G+ ++ Y P+ P+R
Sbjct: 18 AHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMSQYYTSPMSAPAR 76
Query: 133 ASLMTGKYPIHTGMQGPPIW------GAEPRGVPLTERF--LPEYLRELGYSTKAIGKWH 184
+ L+TG GM G +W G E + LT+R + E ++ GY+T GKWH
Sbjct: 77 SMLLTGNSNQQAGMGG--MWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWH 134
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
LGF TP RGF F ++ G S+++ + TVE R+
Sbjct: 135 LGFVPGA-TPKERGFNHAFAFMGGGTSHFNDAIP---LGTVEAFHTYYTRDGERVSLPDD 190
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-------- 296
Y+++ + ++ I+ P ++P+F +LA A H L+AP E I +F
Sbjct: 191 FYSSEAYARQMNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGY 245
Query: 297 ---------------------------------------QYITDPNRRTYAAMVKKLDDS 317
Q T + YAAM+ +D
Sbjct: 246 AEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQ 305
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVE--YRETS---------NYRNWG------- 359
+GT++ L++ G +N++++F++DNGA + Y E++ +Y N G
Sbjct: 306 IGTLMETLKQTGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVS 365
Query: 360 --------SNYPYRGV-KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
SN PY K T +GG+ ++ P I ++ ++ M + D PTLY
Sbjct: 366 YGPHWANVSNAPYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEF 425
Query: 411 AGGDTSR-------LPLNIDGLDQWSSLLLNTPSRRNSNID 444
AG D ++ LP+ ++ + + P R N ++
Sbjct: 426 AGIDPNKSLAKKPVLPMIGVSFKRYLTGEVQEPPRGNYGVE 466
>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
Length = 464
Score = 121 bits (303), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 177/408 (43%), Gaps = 94/408 (23%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+S G EIPTPN+ A+A G+ ++ Y P+ P+R+ L+TG GM G +
Sbjct: 37 GYSDISPFG-GEIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTGNSNQQAGMGG--M 93
Query: 152 W------GAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
W G E + LT+R + E ++ GY+T GKWHLGF TP RGF F
Sbjct: 94 WWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA-TPKDRGFNHAF 152
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
++ G S+++ + TVE R+ Y+++ + ++ I+ P
Sbjct: 153 AFMGGGTSHFNDAIP---LGTVEAFHTYYTRDGERVSLPDDFYSSEAYARQMNSWIKATP 209
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--------------------------- 296
++P+F +LA A H L+AP E I +F
Sbjct: 210 KEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIARLKALGIIHD 264
Query: 297 --------------------QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSII 336
Q T + YAAM+ +D +GT++ L++ G +N+++
Sbjct: 265 DTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQTGRDKNTLL 324
Query: 337 IFMSDNGAPTVE--YRETS---------NYRNWG---------------SNYPYRGV-KN 369
+F++DNGA + Y E++ +Y N G SN PY K
Sbjct: 325 VFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSNAPYANYHKT 384
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
T +GG+ ++ P I ++ ++ M + D PTLY AG D ++
Sbjct: 385 TSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNK 432
>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
Length = 506
Score = 120 bits (301), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 174/380 (45%), Gaps = 48/380 (12%)
Query: 77 AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTP 130
A L+ ++ +L+++ G+ DL +G TPN+D LA G+ + Y PV CTP
Sbjct: 12 AGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYV-PVSLCTP 70
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFF- 188
SRA+L+TG+ P+ +GM P + G + G+PL E L E L GY T GKWHLG
Sbjct: 71 SRAALLTGRLPVRSGMY-PGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGP 129
Query: 189 RREYTPLYRGFESHFGY--------LNGVISYYDHIL----SDQYSRTVELNGHDMRRNL 236
+ P ++GF G + + I DQ + L ++
Sbjct: 130 EGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLA-NLTVEA 188
Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
W E F+++ + + Q +P FLY A H PQ + F
Sbjct: 189 QPPWLPGLEARYVSFSRDLMADAQRQ--GRPFFLYYAS---------HHTHYPQFSGQSF 237
Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
+ R + + +LD +VG +++ + G+LE +++IF +DNG E
Sbjct: 238 TKRS--GRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGP------ELMRMS 289
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
N G + R K T +EGGV+ PA+++ P P V+ ++ D LPTL G +
Sbjct: 290 NGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALTG---A 345
Query: 417 RLP-LNIDGLDQWSSLLLNT 435
LP + +DG+D S LLL T
Sbjct: 346 PLPNVTLDGVD-ISPLLLGT 364
>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
Length = 554
Score = 117 bits (292), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 179/420 (42%), Gaps = 117/420 (27%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM----- 146
GW+D+S GS EI TPNI+ LA G+ L N + C+P+R+ L++G G+
Sbjct: 23 GWSDVSPFGS-EIHTPNIERLAKEGVRLTNFHTASACSPTRSMLLSGTDNHIAGLGQMAE 81
Query: 147 ---QGPPIWGAEP--RGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
+ +WG +P G L +R LPE L+E GY T GKWHLG Y P RGF
Sbjct: 82 TVRRFSKVWGGKPGYEGY-LNDRVAALPEILQEAGYYTTMSGKWHLGLTPDRY-PSKRGF 139
Query: 200 ESHFGYLNGVISYYDH-----------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
+ F L G +++ + L Y+ + H +N Y++
Sbjct: 140 KESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDPVDHKSLKNF---------YSS 190
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDP---- 302
+ F ++ + ++++ + F YL A H L++P+E IN++ +Y P
Sbjct: 191 NYFAEKLIDQLKNREKSQSFFAYLPFTAPHW-----PLQSPKEYINKYRGRYSEGPDVLR 245
Query: 303 -NR------------------------------------------RTYAAMVKKLDDSVG 319
NR YAAMV+ LD ++G
Sbjct: 246 KNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVYAAMVELLDLNIG 305
Query: 320 TVISALQRKGMLENSIIIFMSDNGA--------------PTVEYRETS-----NYRNW-- 358
VI L+ G L+N+ +IFMSDNGA P V+Y + S NY ++
Sbjct: 306 RVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNSLENLGNYNSFIW 365
Query: 359 -------GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
+ P R K + EGG++ PAI+ P + + +S + + + D LPT+ A
Sbjct: 366 YGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTILELA 425
>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
Length = 507
Score = 113 bits (283), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 160/361 (44%), Gaps = 44/361 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
G+ DL +G TPN+D LA G+ + Y PV CTPSRA+L+TG+ P+ G+
Sbjct: 32 GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGLYPG 90
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG---- 204
+ + G+PL E L E L GY T GKWHLG + P + GF G
Sbjct: 91 VLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPHHGFHRFLGIPYS 150
Query: 205 YLNGVISYYDHI--------LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+ G + DQ + L ++ W E F ++ +
Sbjct: 151 HDQGPCQNLTCFPPATPCEGICDQGLVPIPLLA-NLSVEAQPPWLPGLEARYVAFARDLM 209
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+ Q +P FLY A H PQ + F + R + + +LD
Sbjct: 210 TDAQHQ--GRPFFLYY---------ASHHTHYPQFSGQSFPGHS--GRGPFGDSLMELDA 256
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG +++A+ G+L +++ F +DNG ET + G + R K T +EGGV
Sbjct: 257 AVGALMTAVGDLGLLGETLVFFTADNGP------ETMRMSHGGCSGLLRCGKGTTFEGGV 310
Query: 377 KVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLN 434
+ PA+ W I P V+ ++ D LPTL AG ++LP + +DG+D S LLL
Sbjct: 311 REPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---AQLPNITLDGVD-LSPLLLG 364
Query: 435 T 435
T
Sbjct: 365 T 365
>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
Length = 551
Score = 109 bits (273), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 156/353 (44%), Gaps = 44/353 (12%)
Query: 77 AALTKSTTLTLLIVY-GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
A L K + L+ + G DL+ +G ID +A G+ N Y VCTPSR++
Sbjct: 47 APLVKPNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSA 106
Query: 135 LMTGKYPIHTGMQGPPIWGAEPR--------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+MTG+ P+ G G E R G+P +E + E ++E GY+T +GKWHLG
Sbjct: 107 IMTGRLPVRIGTFG------ETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLG 160
Query: 187 FFRREYT-----PLYRGFE--SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
T P GF+ H S D L + + + +S
Sbjct: 161 INENSSTDGAHLPFNHGFDFVGHNLPFTNSWSCDDTGLHKDFPDSQRCYLYVNATLVSQP 220
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+ G T LFT +A+ IED D P FLY+A H+ + + F
Sbjct: 221 YQHKG--LTQLFTDDALGFIEDNHAD-PFFLYVAF---------AHMHTSLFSSDDFSCT 268
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ R Y + ++ D+V ++ L+ + EN+II F+SD+G P EY E G
Sbjct: 269 S--RRGRYGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDHG-PHREYCEEG-----G 320
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+RG K+ WEGG ++P I++ P +P +S +++ D + T G
Sbjct: 321 DASIFRGGKSHSWEGGHRIPYIVYWPGT-ISPGISNEIVTSMDIIATAADLGG 372
>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
Length = 535
Score = 107 bits (268), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 135/519 (26%), Positives = 212/519 (40%), Gaps = 106/519 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
GW DL + + T N+D +A G+ + +A C+PSRASL+TG+ + G+
Sbjct: 47 GWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVT-HN 105
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+PL E L E L++ GY T IGKWHLG Y P +RGF+ +FG I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLGH-HGPYHPNFRGFDYYFG-----I 159
Query: 211 SY-----------YDHI------LSDQYSRTVELNGHD-----MRRNLSTAWDTVG-EYA 247
Y Y+H D+ SR++E + + + NL+ V
Sbjct: 160 PYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLNIVEQPVNLSSL 219
Query: 248 TDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+ ++A+Q I+ +P LY+ H + L A RR
Sbjct: 220 AHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRTQLSAVLR-----------GRRP 268
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y A ++++D VG + + R EN+ + F DNG P + E + GS P+ G
Sbjct: 269 YGAGLREMDSLVGQIKDKVDRTAK-ENTFLWFTGDNG-PWAQKCELA-----GSVGPFTG 321
Query: 367 V----------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
+ K T WEGG +VPA+ + P S ++ + D PT+ AG +
Sbjct: 322 LWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAG---A 378
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-----IDEKK 471
LP ++ + DGLD S +L + VL + E
Sbjct: 379 SLP-------------------QDRHFDGLDA-SEVLFGWSQTGHRVLFHPNSGAAGEFG 418
Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNA---IVE-------SKTYQS 521
VRL S+K + DG G+ + + PL+ FN + E S YQ
Sbjct: 419 ALQTVRLGSYKAFYVSGGAKACDGDVGREQHHDPPLI-FNLEDDVAEAVPLDRGSAEYQG 477
Query: 522 ----LQQLSQNIFLPISNIDKMRS--TRQQATIHCGANP 554
++++ ++ L I+ + R+ TR + C NP
Sbjct: 478 VLPKVREILADVLLDIAGDNTSRADYTRHPSVTPC-CNP 515
>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
GN=ydeN PE=3 SV=2
Length = 560
Score = 107 bits (266), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 146/344 (42%), Gaps = 57/344 (16%)
Query: 106 TPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
TP + +L G+ N Y A V PSRA++MTG+ P G+ G+PLTE
Sbjct: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTET 165
Query: 165 FLPEYLRELGYSTKAIGKWHLG----------------------FFRREYTPLYRGFESH 202
FLPE + GY T A+GKWHL F E+ P RGF+
Sbjct: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE-D 261
G+ +YY+ + V G Y +D T EA+ +++
Sbjct: 226 MGFHAAGTAYYNSPSLFKNRERVPAKG----------------YISDQLTDEAIGVVDRA 269
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ +D+P LYLA+ A H N + Q+ N D Y A V +D V +
Sbjct: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRI 325
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ L++ G +N+II+F SDNGA ++ N G+ +G K+ + GG P
Sbjct: 326 LEQLKKNGQYDNTIILFTSDNGA-VIDGPLPLN----GAQ---KGYKSQTYPGGTHTPMF 377
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+W Q P +++ D+ PT AA + L +DG+
Sbjct: 378 MWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPK-DLKLDGV 419
>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
Length = 567
Score = 102 bits (255), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 160/360 (44%), Gaps = 45/360 (12%)
Query: 78 ALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
A+TK + LL G DLS +G ID +A G+ Y+ VCTPSR+++
Sbjct: 63 AMTKPNVILLLADDMGVGDLSVYGHPTQEPGFIDQMANQGLRFTQGYSGDSVCTPSRSAI 122
Query: 136 MTGKYPIHTGMQGPPIWGAE-------PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
+TG+ PI TG ++G E G+PL E + E ++ GY+T +GKWHLG
Sbjct: 123 VTGRQPIRTG-----VYGEERIFLPWTTTGLPLYEVTIAEAMKGAGYTTGMVGKWHLGIN 177
Query: 189 RRE-----YTPLYRGFE--SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
+ P RGF+ H D L + T + +++ +
Sbjct: 178 ENSSSDGAHLPANRGFDFVGHNLPFGNSWRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQ 237
Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
G T L + V IED V+KP F+Y++ H+ + + F +
Sbjct: 238 HKG--LTQLLRDDTVGFIEDN-VNKPFFMYVSF---------AHMHTSLFSSDDFSCTS- 284
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
R Y ++++D ++ +++ L + +N++I F SD+G P EY N
Sbjct: 285 -RRGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVIFFTSDHG-PHREYCGEGGDANV--- 339
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
+RG K WEGG ++P I++ P +P VS +++ D + T G S+LP +
Sbjct: 340 --FRGGKGQSWEGGHRIPYIVYWPGT-ISPGVSHEIVTSMDIIATAVNLGG---SQLPTD 393
>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
Length = 526
Score = 99.8 bits (247), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 125/475 (26%), Positives = 194/475 (40%), Gaps = 89/475 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
GW DL + + T N+D +A G+ + +A C+PSRASL+TG+ + G+
Sbjct: 47 GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+PL E L E L++ GY T IGKWHLG Y P +RGF+ +FG I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTAMIGKWHLG-HHGSYHPSFRGFDYYFG-----I 159
Query: 211 SYYDHI-----------------LSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYA- 247
Y + + SD R + + + + NL+ V
Sbjct: 160 PYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENLNIVEQPVNLSGL 219
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP-NRRT 306
+ + AV+ IE FL LA H+ P ++ + +P ++R
Sbjct: 220 AQKYAERAVEFIEQASTSGRPFLLYVGLA--------HMHVP---LSVTPPLANPQSQRL 268
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y A ++++D VG + + EN+++ F DNG P + E + GS P+ G
Sbjct: 269 YRASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNG-PWAQKCELA-----GSMGPFSG 321
Query: 367 V----------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
+ K T WEGG +VPA+ + P S ++ + D PT+ AG +
Sbjct: 322 LWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAG---A 378
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-----IDEKK 471
LP P+R+ DG+D S +L + VL + E
Sbjct: 379 SLP----------------PNRK---FDGVDV-SEVLFGKSQTGHRVLFHPNSGAAGEYG 418
Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS 526
VRLD +K T DG G + + PL+ FN ++ LQ+ S
Sbjct: 419 ALQTVRLDRYKAFYITGGAKACDGGVGPEQHHVSPLI-FNLEDDAAESSPLQKGS 472
>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
Length = 525
Score = 96.7 bits (239), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 122/452 (26%), Positives = 180/452 (39%), Gaps = 73/452 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
GW DL + + T N+D +A G+ + +A C+PSRASL+TG+ + G+
Sbjct: 47 GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
G+P+ E L E LR+ GY T IGKWHLG Y P +RGF+ +FG Y N
Sbjct: 106 FAVTSVGGLPVNETTLAEVLRQEGYVTAMIGKWHLG-HHGSYHPNFRGFDYYFGIPYSND 164
Query: 209 V-------ISYYDHILSDQYSRTVELNGHD--------MRRNLSTAWDTVGEYA-TDLFT 252
+ +Y Q G D + NL+ V +
Sbjct: 165 MGCTDAPGYNYPPCPACPQRDGLWRNPGRDCYTDVALPLYENLNIVEQPVNLSGLAQKYA 224
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT-YAAMV 311
+ AV+ IE FL LA H+ P + P R++ Y A +
Sbjct: 225 ERAVEFIEQASTSGRPFLLYVGLA--------HMHVPLSVTPPLAH---PQRQSLYRASL 273
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV---- 367
+++D VG + + EN+++ F DNG P + E + GS P+ G+
Sbjct: 274 REMDSLVGQIKDKVDHVAR-ENTLLWFTGDNG-PWAQKCELA-----GSVGPFFGLWQTH 326
Query: 368 ------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
K T WEGG +VPA+ + P S ++ + D PT+ AG + LP N
Sbjct: 327 QGGSPTKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPTVIALAG---ASLPPN 383
Query: 422 --IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
DG D L G Q +L P+ + E VRL+
Sbjct: 384 RKFDGRDVSEVLF------------GKSQMGHRVLFHPNSGAA-----GEYGALQTVRLN 426
Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+K T DG G + + PL+ FN
Sbjct: 427 HYKAFYITGGAKACDGSVGPEQHHVAPLI-FN 457
>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
Length = 525
Score = 94.7 bits (234), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 124/468 (26%), Positives = 181/468 (38%), Gaps = 105/468 (22%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
GW DL + + T N+D +A G+ + +A C+PSRASL+TG+ + G+
Sbjct: 47 GWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVT-RN 105
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+PL E L E L++ GY T IGKWHLG Y P +RGF+ +FG I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLG-HHGSYHPNFRGFDYYFG-----I 159
Query: 211 SY-----------YDH---------------ILSDQYSRTV-----ELNGHDMRRNLSTA 239
Y Y+H + D Y+ LN + NLS+
Sbjct: 160 PYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSL 219
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
E AT + + +P LY+A LA H+ P Q
Sbjct: 220 AQKYAEKATQFIQRASTS-------GRPFLLYVA-LA--------HMHVPLPVT---QLP 260
Query: 300 TDPNRRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
P R+ Y A + ++D VG + + + EN+ + F DNG P + E +
Sbjct: 261 AAPRGRSLYGAGLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNG-PWAQKCELA----- 313
Query: 359 GSNYPYRG----------VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
GS P+ G K T WEGG +VPA+ + P S ++ + D PT+
Sbjct: 314 GSVGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVV 373
Query: 409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-- 466
A + LP + DG+D S +L + VL +
Sbjct: 374 ALA---QASLP-------------------QGRRFDGVDV-SEVLFGRSQPGHRVLFHPN 410
Query: 467 ---IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
E VRL+ +K T DG G +K PL+ FN
Sbjct: 411 SGAAGEFGALQTVRLERYKAFYITGGARACDGSTGPELQHKFPLI-FN 457
>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
Length = 588
Score = 90.1 bits (222), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 103/409 (25%), Positives = 165/409 (40%), Gaps = 83/409 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G D+ +G+N + TPNID LA +G+ L ++ A +CTPSRA+ +TG+YP+ +GM
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
W G+P E + L+E GY+T IGKWHLG + PL+ GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 201 SHFGY---LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
+G L G ++++ LS++ ++ + L+ + + A L +
Sbjct: 169 HFYGMPFSLMGDCAHWE--LSEKRV--------NLEQKLNFLFQVLALVALTLVAGKLTH 218
Query: 258 LIEDQPVD----------KPLFL----YLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
LI PV L L ++ L HAG E +FQ T
Sbjct: 219 LI---PVSWTPVIWSALWAVLLLTGSYFVGALIVHAGCLLMRNHTITEQPMRFQKTTPLI 275
Query: 304 RRTYAAMVKK----------------------------------------LDDSVGTVIS 323
+ A+ +K+ +D VG ++
Sbjct: 276 LQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGQILD 335
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L +G+ +++I F SD+G + Y W Y EGG++VP I
Sbjct: 336 TLDMEGLTNSTLIYFTSDHGGSLENQLGRTQYGGWNGIYKGGKGMGGW-EGGIRVPGIFR 394
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
P + +V + + D PT+ AGG+ + + IDG D LL
Sbjct: 395 WPGVLPAGQVIGEPTSLMDVFPTVVQLAGGEVPQDRV-IDGQDLLPLLL 442
>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
Length = 562
Score = 89.7 bits (221), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 76/149 (51%), Gaps = 18/149 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGP- 149
G DL +G+N + TPNID LA G+ L ++ A VCTPSRA+ +TG+YPI +GM P
Sbjct: 18 GVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGRYPIRSGMASPY 77
Query: 150 -----PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGF 199
W G+P E + L+ GY T IGKWH G Y PL GF
Sbjct: 78 NLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRNDHCYHPLNHGF 137
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELN 228
+ +G G++S Q SRT EL+
Sbjct: 138 DYFYGLPFGLLS------DCQASRTPELH 160
Score = 44.7 bits (104), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 15/170 (8%)
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
E L KEA+ I D+ P L+++ L H+ P T ++F +
Sbjct: 239 ERVASLMLKEALAFI-DRYKRGPFLLFVSFL---------HVHTPLITKDKF--VGHSKY 286
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
Y V+++D VG ++ L ++ + ++++ F SDNG +E +E + GSN Y
Sbjct: 287 GLYGDNVEEMDWMVGKILETLDQERLTNHTLVYFTSDNGG-RLEVQE-GEVQLGGSNGIY 344
Query: 365 RGVKNTLWEGG-VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
+G + G ++VP I P + Q +V + + D PTL GG
Sbjct: 345 KGGQGMGGWEGGIRVPGIFRWPTVLQAGKVINEPTSLMDIYPTLSYIGGG 394
>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
Length = 583
Score = 89.4 bits (220), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 76/142 (53%), Gaps = 11/142 (7%)
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSR 132
++AA + L + G D +G+ I TPNID LA G+ L ++ A P+CTPSR
Sbjct: 20 ESHAASRPNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSR 79
Query: 133 ASLMTGKYPIHTGMQ-----GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
A+ MTG+YP+ +GM G ++ A G+P E + L++ GYST IGKWHLG
Sbjct: 80 AAFMTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGM 139
Query: 188 FRREYT-----PLYRGFESHFG 204
T PL+ GF +G
Sbjct: 140 SCHSKTDFCHHPLHHGFNYFYG 161
Score = 63.2 bits (152), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 87/181 (48%), Gaps = 18/181 (9%)
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
T T EA Q I+ + + P L L++L H L + ++ + Q+ Y
Sbjct: 261 TQRLTVEAAQFIQ-RNTETPFLLVLSYLHVHTA-----LFSSKDFAGKSQH------GVY 308
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
V+++D SVG +++ L + +++I F SD GA VE + + GSN Y+G
Sbjct: 309 GDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGA-HVEEVSSKGEIHGGSNGIYKGG 367
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
K WEGG++VP IL P++ Q + + D PT+ AG + LP + IDG
Sbjct: 368 KANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAG---APLPEDRIIDGR 424
Query: 426 D 426
D
Sbjct: 425 D 425
>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
Length = 562
Score = 85.1 bits (209), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 75/149 (50%), Gaps = 18/149 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQG-- 148
G DL +G+N + TPNID LA G+ L ++ A +CTPSRA+ +TG+YPI +GM
Sbjct: 18 GVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPIRSGMVSAY 77
Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGF 199
W G+P E + L+ GY T IGKWHLG Y PL GF
Sbjct: 78 NLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCASRNDHCYHPLNHGF 137
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELN 228
+G G++S Q S+T EL+
Sbjct: 138 HYFYGVPFGLLS------DCQASKTPELH 160
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 70/173 (40%), Gaps = 13/173 (7%)
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
E L KEA+ IE +P L+ + L H I++ +++
Sbjct: 239 EKVASLMLKEALAFIERYK-REPFLLFFSFLHVHT-----------PLISKKKFVGRSKY 286
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
Y V+++D VG ++ AL ++ + ++++ F SDNG W Y
Sbjct: 287 GRYGDNVEEMDWMVGKILDALDQERLANHTLVYFTSDNGGHLEPLDGAVQLGGWNGIYKG 346
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
EGG++VP I P + + RV + + D PTL GG S+
Sbjct: 347 GKGMGGW-EGGIRVPGIFRWPSVLEAGRVINEPTSLMDIYPTLSYIGGGILSQ 398
>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
Length = 589
Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 11/124 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G D+ +G+N + TPNID LA +G+ L ++ A +CTPSRA+ +TG+YP+ +GM
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
W G+P E + L+E GY+T IGKWHLG + PL+ GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 201 SHFG 204
+G
Sbjct: 169 HFYG 172
Score = 41.6 bits (96), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 81/201 (40%), Gaps = 14/201 (6%)
Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
MR + T + T L +E ++ P L+++ L H+ P
Sbjct: 256 MRNHTITEQPMCFQRTTPLILQEVASFLKRNK-HGPFLLFVSFL---------HVHIPLI 305
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
T+ F + Y V+++D VG ++ L +G+ +++I F SD+G
Sbjct: 306 TMENF--LGKSLHGLYGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLG 363
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
+ Y W Y EGG++VP I P + RV + + D PT+ A
Sbjct: 364 NTQYGGWNGIYKGGKGMGGW-EGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLA 422
Query: 412 GGDTSRLPLNIDGLDQWSSLL 432
GG+ + + IDG D LL
Sbjct: 423 GGEVPQDRV-IDGQDLLPLLL 442
>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
Length = 624
Score = 81.6 bits (200), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 69/124 (55%), Gaps = 11/124 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
G DL +G+ + TP++D LA G+ L ++ A P+CTPSRA+ +TG+YP +GM
Sbjct: 46 GIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYPPRSGMAAHG 105
Query: 148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
G ++ A G+P +E + L+ GY+T IGKWHLG R T PL GF+
Sbjct: 106 RVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFCHHPLRHGFD 165
Query: 201 SHFG 204
G
Sbjct: 166 RFLG 169
Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 77/169 (45%), Gaps = 29/169 (17%)
Query: 266 KPLFLYLAHLAAHA------GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
+P L+L+ L H G AG+ L Y V+++D VG
Sbjct: 286 RPFLLFLSFLHVHTAHFADPGFAGRSLHG-----------------AYGDSVEEMDWGVG 328
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
V++AL G+ +++ F SD+GA VE R GSN +RG K WEGGV+VP
Sbjct: 329 RVLAALDELGLARETLVYFTSDHGA-HVEELGPRGERMGGSNGVFRGGKGNNWEGGVRVP 387
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
++ P+ RV + + D PT+ AG + LP + IDG D
Sbjct: 388 CLVRWPRELSPGRVVAEPTSLMDVFPTVARLAGAE---LPGDRVIDGRD 433
>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
Length = 577
Score = 78.6 bits (192), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/102 (40%), Positives = 61/102 (59%), Gaps = 6/102 (5%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
G DL +G+ + TP+ID LA G+ L ++ A P+CTPSRA+ +TG+YP+ +GM
Sbjct: 37 GIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCTPSRAAFLTGRYPVRSGMASHG 96
Query: 148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
G ++ A G+P E + L+ GY+T +GKWHLG
Sbjct: 97 RLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWHLGL 138
Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 79/170 (46%), Gaps = 17/170 (10%)
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
D P L+L+ + H H P+ + Y V+++D +VG V++
Sbjct: 276 DTPFLLFLSFMHVHT----AHFANPE-------FAGQSLHGAYGDAVEEMDWAVGQVLAT 324
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L + G+ N+++ SD+GA VE + R+ GSN YRG K WEGG++VP ++
Sbjct: 325 LDKLGLANNTLVYLTSDHGA-HVEELGPNGERHGGSNGIYRGGKANTWEGGIRVPGLVRW 383
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLDQWSSLL 432
P + + + D PT+ AG + T R+ IDG D LL
Sbjct: 384 PGVIVPGQEVEEPTSNMDVFPTVARLAGAELPTDRV---IDGRDLMPLLL 430
>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
Length = 593
Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 11/142 (7%)
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSR 132
+T A + L + G DL +G+N + TPNID LA G+ L ++ A P+CTPSR
Sbjct: 34 KTANAFKPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSR 93
Query: 133 ASLMTGKYPIHTGMQGPP-----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
A+ +TG++ +GM W A G+P E L++ GY+T IGKWH G
Sbjct: 94 AAFLTGRHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGV 153
Query: 188 ---FRREYT--PLYRGFESHFG 204
R ++ PL GF+ +G
Sbjct: 154 NCASRGDHCHHPLNHGFDYFYG 175
Score = 46.2 bits (108), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 78/184 (42%), Gaps = 13/184 (7%)
Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
MR + T V E L KEAV IE P L+L+ L H+ P
Sbjct: 259 MRNHDVTEQPMVLEKTASLMLKEAVSYIERHK-HGPFLLFLSLL---------HVHIPLV 308
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
T + F + Y V+++D +G V++A++ G+ ++ F SD+G +E R+
Sbjct: 309 TTSAF--LGKSQHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGG-HLEARD 365
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
+ + G WEGG++VP I P + RV + + D PT+
Sbjct: 366 GHSQLGGWNGIYKGGKGMGGWEGGIRVPGIFHWPGVLPAGRVIGEPTSLMDVFPTVVQLV 425
Query: 412 GGDT 415
GG+
Sbjct: 426 GGEV 429
>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
Length = 590
Score = 75.1 bits (183), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 17/127 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G DL +G++ + TP+ID LA G+ L ++ A +C+PSR++ +TG+YPI +GM
Sbjct: 41 GIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSS- 99
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF-----FRREYTPLYR 197
G A P G+PL E L L++ GYST IGKWH G + + P
Sbjct: 100 --GNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNY 157
Query: 198 GFESHFG 204
GF+ ++G
Sbjct: 158 GFDYYYG 164
Score = 47.8 bits (112), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 93/226 (41%), Gaps = 24/226 (10%)
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
F +L G + H S Y + + GH++ A E A + KEA+ +E
Sbjct: 225 FIFLLGYAWFSSHT-SPLYWDCLLMRGHEITEQPMKA-----ERAGSIMVKEAISFLERH 278
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+ L+ + L H+ P T + F + Y V+++D VG ++
Sbjct: 279 SKET-FLLFFSFL---------HVHTPLPTTDDFTGTS--KHGLYGDNVEEMDSMVGKIL 326
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
A+ G+ N+++ F SD+G R + W Y EGG++VP I+
Sbjct: 327 DAIDDFGLRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGW-EGGIRVPGIV 385
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
P R+ + + D LPT+ + +GG LP + IDG D
Sbjct: 386 RWPGKVPAGRLIKEPTSLMDILPTVASVSGGS---LPQDRVIDGRD 428
>sp|Q10723|ARS_VOLCA Arylsulfatase OS=Volvox carteri PE=1 SV=1
Length = 649
Score = 72.0 bits (175), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 153/371 (41%), Gaps = 86/371 (23%)
Query: 112 LAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQ---GPPIWGAEPRGVPLTERFLP 167
+ Y GI L N + PVC PSR +L G++ +T GP A+ + + + + +LP
Sbjct: 55 IRYPGIELKNYFVTTPVCCPSRTNLWRGQFSHNTNFTDVLGPHGGYAKWKSLGIDKSYLP 114
Query: 168 EYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL 227
+L+ LGY+T +GK+ + + Y + G+ ++ +++ Y T +
Sbjct: 115 VWLQNLGYNTYYVGKFLVDYSVSNYQNVPAGWTD----IDALVTPY----------TFDY 160
Query: 228 NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGN----- 281
N RN +T G Y+TD+ +AV I+ KP + ++ +A H
Sbjct: 161 NNPGFSRNGATPNIYPGFYSTDVIADKAVAQIKTAVAAGKPFYAQISPIAPHTSTQIYFD 220
Query: 282 ---------------AGKHLE------APQETINQFQYITD---------------PNRR 305
A +H E P+ T ++ Y D N R
Sbjct: 221 PVANATKTFFYPPIPAPRHWELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQNNR 280
Query: 306 TYAAMVKKL--------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
TY V +L D+ + V++ LQ G+L+N+ +I+ +DNG +R
Sbjct: 281 TYLEEVYRLRLRSLASVDELIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHR------- 333
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN----PRVSLQMMHISDWLPTLYTAAGG 413
+ K T ++ ++VP ++ P I+ + P S +H+ D+ PT+ T AG
Sbjct: 334 ------FGAGKVTAYDEDLRVPFLIRGPGIRASHSDKPANSKVGLHV-DFAPTILTLAGA 386
Query: 414 DTSRLPLNIDG 424
+DG
Sbjct: 387 GDQVGDKALDG 397
>sp|Q90XB6|SULF1_COTCO Extracellular sulfatase Sulf-1 OS=Coturnix coturnix GN=SULF1 PE=1
SV=1
Length = 867
Score = 69.3 bits (168), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 141/361 (39%), Gaps = 79/361 (21%)
Query: 118 ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQ--GPPIWGA--EPRGVPLTERFLPEY 169
+N P+C PSR+S++TGKY I+T + P W A EPR + Y
Sbjct: 77 FINAFVTTPMCCPSRSSMLTGKYVHNHNIYTNNENCSSPSWQATHEPRTFAV-------Y 129
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
L GY T GK+ L + Y P G+ G + S Y+ T+ NG
Sbjct: 130 LNNTGYRTAFFGKY-LNEYNGSYIPP--GWREWVGLVKN---------SRFYNYTISRNG 177
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA------- 279
+ + +D +Y TDL T E++ +P+ + ++H A H
Sbjct: 178 NKEKH----GFDYAKDYFTDLITNESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQ 233
Query: 280 -----GNAGKHLE-----APQETINQFQYITDPN-----------RRTYAAMVKKLDDSV 318
NA +H+ AP + T P +R + +DDS+
Sbjct: 234 FSELYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSM 293
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ L G LEN+ II+ +D+G ++ G + PY + ++V
Sbjct: 294 ERLYQMLAEMGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRV 340
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
P + P ++ V +++I D PT+ AG DT P ++DG L L P
Sbjct: 341 PFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDT---PPDMDGKSVLKLLDLERPGN 396
Query: 439 R 439
R
Sbjct: 397 R 397
>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
SV=1
Length = 870
Score = 68.6 bits (166), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 150/380 (39%), Gaps = 80/380 (21%)
Query: 100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQ--GPPIW 152
GS ++ + + G N + P+C PSR+S++TGKY ++T + P W
Sbjct: 58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117
Query: 153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
A EPR + YL GY T GK+ L + Y P G+ G +
Sbjct: 118 QALHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN-- 165
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKP 267
S Y+ TV NG + +D +Y TDL T E++ ++ + +P
Sbjct: 166 -------SRFYNYTVCRNGIKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRP 214
Query: 268 LFLYLAHLAAHA------------GNAGKHLE-----APQETINQFQYITDPN------- 303
+ + ++H A H NA +H+ AP + T P
Sbjct: 215 VMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEF 274
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+R + +DDSV + + L G L N+ II+ +D+G ++ G
Sbjct: 275 TNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLVK-----G 329
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+ PY + ++VP + P I+ V +++I D PT+ AG DT P
Sbjct: 330 KSMPY--------DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDT---P 377
Query: 420 LNIDGLDQWSSLLLNTPSRR 439
++DG L L P R
Sbjct: 378 SDVDGKSVLKLLDLEKPGNR 397
>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
Length = 870
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 150/380 (39%), Gaps = 80/380 (21%)
Query: 100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQ--GPPIW 152
GS ++ + G N + P+C PSR+S++TGKY ++T + P W
Sbjct: 58 GSLQVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117
Query: 153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
A EPR + YL GY T GK+ L + Y P G+ G +
Sbjct: 118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN-- 165
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKP 267
S Y+ TV NG + +D +Y TDL T E++ ++ + +P
Sbjct: 166 -------SRFYNYTVCRNGIKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRP 214
Query: 268 LFLYLAHLAAHA------------GNAGKHLE-----APQETINQFQYITDPN------- 303
+ + ++H A H NA +H+ AP + T P
Sbjct: 215 IMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEF 274
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+R + +DDSV + + L G L+N+ II+ +D+G ++ G
Sbjct: 275 TNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQFGLVK-----G 329
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+ PY + ++VP + P I+ V +++I D PT+ AG D+ P
Sbjct: 330 KSMPY--------DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDS---P 377
Query: 420 LNIDGLDQWSSLLLNTPSRR 439
++DG L L P R
Sbjct: 378 SDVDGKSVLKLLDLEKPGNR 397
>sp|P14217|ARS_CHLRE Arylsulfatase OS=Chlamydomonas reinhardtii GN=AS PE=1 SV=2
Length = 647
Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 151/375 (40%), Gaps = 95/375 (25%)
Query: 112 LAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG--PPIWG-AEPRGVPLTERFLP 167
+ Y G+ L+ + PVC PSR +L G++ +T PP G A+ +G+ + + +LP
Sbjct: 56 IRYPGVELSQYFVTTPVCCPSRTNLXRGQFAHNTNFTSVLPPYGGWAKWKGLGIDQSYLP 115
Query: 168 EYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL 227
+L++ GY+T +GK+ + + Y + R G IS + T +
Sbjct: 116 LWLKDQGYNTYYVGKFLVDYSVSNYQQVPRA---------GTIS-----MPXVTPYTFDY 161
Query: 228 NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAH-------- 278
N ++RN +T GEY+TD+ + V I+ KP + ++ +A H
Sbjct: 162 NTR-LQRNGATPNIYPGEYSTDVIRDKGVAQIKSAVAAGKPFYAQISPIAPHTSTQISTN 220
Query: 279 --AGNAGKHLEAPQETINQFQYITDP-------------------------------NRR 305
G + P +Q +D N R
Sbjct: 221 PATGVTRSYFFPPIPAPPHWQLFSDANLPGGSXNKNLYEVDVSDKPAWIRALPLAQQNNR 280
Query: 306 TYAAMVKKL-------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
TY + +L D+ + V+ L G+L+N+ II+ +DNG +R
Sbjct: 281 TYQEEIYRLRLRSLGPDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHR-------- 332
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ-------QNPRVSLQMMHISDWLPTLYTAA 411
+ K T +E ++VP ++ P I+ QN +V L + D+ PT+ + A
Sbjct: 333 -----FGAGKTTGYEEDLRVPFLIRGPGIKASKSDKPQNSKVGLHV----DFAPTILSLA 383
Query: 412 GGDTSRLPLNIDGLD 426
G S L L GLD
Sbjct: 384 GA--SHL-LGDKGLD 395
>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
Length = 871
Score = 67.4 bits (163), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 140/346 (40%), Gaps = 79/346 (22%)
Query: 118 ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQ--GPPIWGA--EPRGVPLTERFLPEY 169
+N P+C PSR+S++TGKY ++T + P W A EPR + Y
Sbjct: 77 FINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAV-------Y 129
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
L GY T GK+ L + Y P G+ G + S Y+ TV NG
Sbjct: 130 LNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN---------SRFYNYTVCRNG 177
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKPLFLYLAHLAAHA------- 279
+ +D +Y TDL T E++ ++ + +P+ + ++H A H
Sbjct: 178 IKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQ 233
Query: 280 -----GNAGKHLE-----APQETINQFQYITDPN-----------RRTYAAMVKKLDDSV 318
NA +H+ AP + T P +R + +DDSV
Sbjct: 234 FSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSV 293
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ + L G LEN+ II+ +D+G ++ G + PY + ++V
Sbjct: 294 ERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRV 340
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
P + P ++ V +++I D PT+ AG DT P ++DG
Sbjct: 341 PFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382
>sp|Q8IWU5|SULF2_HUMAN Extracellular sulfatase Sulf-2 OS=Homo sapiens GN=SULF2 PE=1 SV=1
Length = 870
Score = 64.7 bits (156), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/348 (23%), Positives = 136/348 (39%), Gaps = 83/348 (23%)
Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-------GPPIWGAEPRGVPLTERFLPEYL 170
+N P+C PSR+S++TGKY +H P W A+ R YL
Sbjct: 78 FINAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHE-----SRTFAVYL 131
Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
GY T GK+ L + Y P G++ G L +Y++ L + E +G
Sbjct: 132 NSTGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGS 185
Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA-------- 279
D + +Y TDL T ++V +P+ + ++H A H
Sbjct: 186 DYSK----------DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQY 235
Query: 280 ----GNAGKHLE-----APQETINQFQYITDPNR-----------RTYAAMVKKLDDSVG 319
NA +H+ AP + T P + R + +DDS+
Sbjct: 236 SRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSME 295
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
T+ + L G L+N+ I++ +D+G ++ G + PY E ++VP
Sbjct: 296 TIYNMLVETGELDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVP 342
Query: 380 AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
+ P ++ NP + L + D PT+ AG D +P ++DG
Sbjct: 343 FYVRGPNVEAGCLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383
>sp|Q8CFG0|SULF2_MOUSE Extracellular sulfatase Sulf-2 OS=Mus musculus GN=Sulf2 PE=2 SV=2
Length = 875
Score = 63.5 bits (153), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 136/348 (39%), Gaps = 83/348 (23%)
Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-------GPPIWGAEPRGVPLTERFLPEYL 170
+N P+C PSR+S++TGKY +H P W A+ R YL
Sbjct: 78 FINAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHE-----SRTFAVYL 131
Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
GY T GK+ L + Y P G++ G L S Y+ T+ NG
Sbjct: 132 NSTGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKN---------SRFYNYTLCRNG- 178
Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA-------- 279
++ + + T +Y TDL T ++V +P+ + ++H A H
Sbjct: 179 -VKEKHGSDYST--DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQY 235
Query: 280 ----GNAGKHLE-----APQETINQFQYITDPNR-----------RTYAAMVKKLDDSVG 319
NA +H+ AP + T P + R + +DDS+
Sbjct: 236 SRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSME 295
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
T+ L G L+N+ I++ +D+G ++ G + PY E ++VP
Sbjct: 296 TIYDMLVETGELDNTYILYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVP 342
Query: 380 AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
+ P ++ NP + L + D PT+ AG D +P ++DG
Sbjct: 343 FYVRGPNVEAGSLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383
>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
8237 / Type A) GN=CPF_0221 PE=1 SV=1
Length = 481
Score = 63.2 bits (152), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 93/398 (23%), Positives = 147/398 (36%), Gaps = 84/398 (21%)
Query: 96 LSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGA 154
L +G+ I TPN+D +A G N Y A P C SRAS++TG G G
Sbjct: 18 LGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGMSQKSHGRVG------ 71
Query: 155 EPRGVPLT-ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN------ 207
GV E + + GY T+ IGK H+ + + H GYL+
Sbjct: 72 YEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNIMLHDGYLHFARNKE 129
Query: 208 GVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
G S D E GH D +S W + T+ E++
Sbjct: 130 GKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTNWVVNESID 189
Query: 258 LIEDQPVDKPLFLYLAHLAAHA-------------------------------GNAGKHL 286
+ + KP FL ++ + H+ N GK +
Sbjct: 190 FLRRKDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKEDEENRGKDI 249
Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
+ IN+ + Y + +D +G + AL G L N+I +F+SD+G
Sbjct: 250 NCVKGIINKKALKR--AKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLFVSDHGDMM 307
Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ---IQQNPRVSLQMMHISDW 403
++ NW +R K +EG +VP ++ P + +V +++ + D
Sbjct: 308 GDH-------NW-----FR--KGIPYEGSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDI 353
Query: 404 LPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
+PTL A +P +++GL L N RNS
Sbjct: 354 MPTLLDFA---HISIPDSVEGLS-----LKNLIEERNS 383
>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
GN=CPE0231 PE=3 SV=1
Length = 481
Score = 63.2 bits (152), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 142/382 (37%), Gaps = 79/382 (20%)
Query: 96 LSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGA 154
L +G+ I TPN+D +A G N Y A P C SRAS++TG G G
Sbjct: 18 LGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGMSQKSHGRVG------ 71
Query: 155 EPRGVPLT-ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN------ 207
GV E + + GY T+ IGK H+ + + H GYL+
Sbjct: 72 YEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNIMLHDGYLHFARNKE 129
Query: 208 GVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
G S D E GH D +S W + T+ E++
Sbjct: 130 GKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTNWVVNESID 189
Query: 258 LIEDQPVDKPLFLYLAHLAAHA-------------------------------GNAGKHL 286
+ + KP FL ++ + H+ N GK +
Sbjct: 190 FLRRRDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKEDEENRGKDI 249
Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
+ IN+ + Y + +D +G + AL G L N+I +F+SD+G
Sbjct: 250 NCVKGIINKKALKR--AKAAYYGSITHIDHQIGRFLIALSEYGKLNNTIFLFVSDHGDMM 307
Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ---IQQNPRVSLQMMHISDW 403
++ NW +R K +EG +VP ++ P + +V +++ + D
Sbjct: 308 GDH-------NW-----FR--KGIPYEGSARVPFFIYDPGNLLKGKKGKVFDEVLELRDI 353
Query: 404 LPTLYTAAGGDTSRLPLNIDGL 425
+PTL A +P +++GL
Sbjct: 354 MPTLLDFA---HISIPDSVEGL 372
>sp|Q5ZK90|ARSK_CHICK Arylsulfatase K OS=Gallus gallus GN=ARSK PE=2 SV=1
Length = 535
Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 152/374 (40%), Gaps = 86/374 (22%)
Query: 96 LSFH-GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG 153
L+F+ G+ + P I+ + +G + N Y P+C PSRA++ +G + H G
Sbjct: 47 LTFYPGNQTVDLPFINFMKRHGSVFLNAYTNSPICCPSRAAMWSGLF-THLTESWNNFKG 105
Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
+P V + +++ GY T+ GK +YT G S +
Sbjct: 106 LDPDYVTWMD-----LMQKHGYYTQKYGK-------LDYT---SGHHSVSNRVEAWTRDV 150
Query: 214 DHILSDQYSRTVELNGHDMR--RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+ +L + V L G D R R + T W V + A KEAV L QP L L
Sbjct: 151 EFLLRQEGRPKVNLTG-DRRHVRVMKTDWQ-VTDKAVTWIKKEAVNLT--QPFALYLGLN 206
Query: 272 LAH-----LAAHAGNAGKHLEAPQ--ETINQFQYITDPN--------------------- 303
L H A + L +P E + +++ I P
Sbjct: 207 LPHPYPSPYAGENFGSSTFLTSPYWLEKV-KYEAIKIPTWTALSEMHPVDYYSSYTKNCT 265
Query: 304 -----------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET 352
R Y AM + D +G +ISALQ +L+ +II+F SD+G +E+R+
Sbjct: 266 GEFTKQEVRRIRAFYYAMCAETDAMLGEIISALQDTDLLKKTIIMFTSDHGELAMEHRQF 325
Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
K +++EG VP ++ P I++ +VS ++ + D PT+
Sbjct: 326 --------------YKMSMYEGSSHVPLLVMGPGIRKQQQVS-AVVSLVDIYPTML---- 366
Query: 413 GDTSRLPL--NIDG 424
D +R+P+ N+ G
Sbjct: 367 -DLARIPVLQNLSG 379
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.134 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 248,668,337
Number of Sequences: 539616
Number of extensions: 10948249
Number of successful extensions: 24027
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 57
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 23689
Number of HSP's gapped (non-prelim): 194
length of query: 632
length of database: 191,569,459
effective HSP length: 124
effective length of query: 508
effective length of database: 124,657,075
effective search space: 63325794100
effective search space used: 63325794100
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)