BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12301
         (632 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
          Length = 528

 Score =  304 bits (779), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 198/572 (34%), Positives = 286/572 (50%), Gaps = 85/572 (14%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + AA        L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 30  PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L++ GY+T  +GKWHLG +R+
Sbjct: 89  SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 206 STNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRI 260

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  LD++VG V  AL+ +G+  N+++IF +DNG  T         R+ G+N+P RG
Sbjct: 261 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 311

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T      +DG D
Sbjct: 312 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTK-PLDGFD 370

Query: 427 QWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVRL 478
            W ++   +PS R     N + D  D       NT   +N           T   A +R 
Sbjct: 371 VWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRY 430

Query: 479 DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
            +WKL+ G    G       Q+  ++VP              S+   ++ ++L       
Sbjct: 431 KNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SVDSPTKTLWL------- 469

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                                          F++  DP E+++++   P I   L   L+
Sbjct: 470 -------------------------------FDINRDPEERHDVSREHPHIVQNLLSRLQ 498

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP S+  P   + DPK     WSPW+
Sbjct: 499 YYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 528



 Score = 57.0 bits (136), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 205 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 259

Query: 75  TYAALT 80
            YA + 
Sbjct: 260 IYAGMV 265


>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
          Length = 534

 Score =  299 bits (765), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T+          
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                           +DG + W ++    PS R  +L NID+         D +  +  
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406

Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
             +N T        + +  PL +  FN  + +   Y++ + L+ +      F P S    
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                        +N + + P        +LF++  DP E+++++   P I   L   L+
Sbjct: 458 -------------SNVSEIPPVGPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP SH  P   + DPK     WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534



 Score = 55.5 bits (132), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
          Length = 533

 Score =  291 bits (744), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438

Query: 482 KLVLG 486
           KL+ G
Sbjct: 439 KLLTG 443



 Score = 57.4 bits (137), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270



 Score = 35.0 bits (79), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LF++  DP E+++++   P I ++L   L+++ +  VP      D  + DPK     W 
Sbjct: 473 WLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCDPKA-TGVWG 530

Query: 628 PWI 630
           PW+
Sbjct: 531 PWM 533


>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
          Length = 535

 Score =  288 bits (736), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 164/414 (39%), Positives = 234/414 (56%), Gaps = 33/414 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN I TP++D LA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  S  V     D R     A      Y+T++FT+ A  LI   P +KPL
Sbjct: 177 YYSHERCALID--SLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATALITSHPPEKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGG++    + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T      +DG D W ++   +PS R   +  +D 
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGSTKGTK-PLDGFDVWKTISEGSPSPRKELLHNID- 398

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
             + +  +P    S+    D+              AA+R  +WKL+ G    G 
Sbjct: 399 -PNFVDISPCPGKSLAPAKDDSSHPAYLAFNTSLHAAIRHGNWKLLTGYPGCGC 451



 Score = 53.9 bits (128), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI   P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTERATALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 YYAGMV 272


>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  284 bits (727), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 185/540 (34%), Positives = 272/540 (50%), Gaps = 76/540 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      KPLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W PW
Sbjct: 465 FNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523



 Score = 50.1 bits (118), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      KPLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
          Length = 569

 Score =  283 bits (723), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.8 bits (112), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  281 bits (718), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 184/540 (34%), Positives = 271/540 (50%), Gaps = 76/540 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++       PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W PW
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523



 Score = 47.4 bits (111), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++       PLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
          Length = 573

 Score =  281 bits (718), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  + SAL+R G  
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522

Query: 628 PW 629
           PW
Sbjct: 523 PW 524



 Score = 45.8 bits (107), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 199 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 254 RTMGNVARRKYAAMV 268


>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
          Length = 598

 Score =  278 bits (710), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
          Length = 599

 Score =  276 bits (706), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns
           PE=1 SV=1
          Length = 524

 Score =  165 bits (418), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35  VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 95  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
            PL  GF+  FG  N     YD+ +       R  E+ G    +   NL T    +    
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206

Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           T L+ +EA+  I  Q   + P FLY A  A HA                 Q++    R  
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+  
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
            K T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367

Query: 425 LDQWSSLL 432
           LD   ++L
Sbjct: 368 LDLLPTML 375


>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2
           SV=2
          Length = 520

 Score =  165 bits (417), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 91  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
           T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD 
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366

Query: 428 WSSLL 432
             ++L
Sbjct: 367 LPTML 371


>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1
           SV=1
          Length = 522

 Score =  162 bits (409), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 35  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 95  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369

Query: 429 SSLL 432
            +LL
Sbjct: 370 PTLL 373


>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2
           SV=1
          Length = 522

 Score =  158 bits (400), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 175/368 (47%), Gaps = 48/368 (13%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM-----------QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            G                I G    G+P  E  LPE L+E GY +K +GKWHLG  R ++
Sbjct: 94  NGFYTTNRHARNAYTPQEIVG----GIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQF 148

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
            PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    
Sbjct: 149 HPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL---- 204

Query: 248 TDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           T ++ +EA+  I+  Q   +P FLY A  A HA                  ++    R  
Sbjct: 205 TQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----------PVYASRPFLGTSQRGR 253

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   V+++D+SVG ++S LQ   + EN+ + F SDNGA  +     S     GSN P+  
Sbjct: 254 YGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLC 308

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
            K T +EGG++ PAI W P      RVS Q+  I D   T  + AG    + R+   IDG
Sbjct: 309 GKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDG 365

Query: 425 LDQWSSLL 432
           LD   ++L
Sbjct: 366 LDLLPAML 373


>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
          Length = 522

 Score =  157 bits (397), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 36/362 (9%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG     G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 94  TGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 152

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIY 208

Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA+  I+  Q    P FLY A  A HA                  ++    R  Y   
Sbjct: 209 LQEALDFIKRQQATHHPFFLYWAIDATHA-----------PVYASRAFLGTSQRGRYGDA 257

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG ++  L+   +  N+ + F SDNGA  V     S  +  GSN P+   K T
Sbjct: 258 VREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALV-----SAPKQGGSNGPFLCGKQT 312

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P      +VS Q+  + D   T  + AG +       IDGLD   +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPS-DRAIDGLDLLPA 371

Query: 431 LL 432
           +L
Sbjct: 372 ML 373


>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
          Length = 551

 Score =  157 bits (397), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 138/440 (31%), Positives = 210/440 (47%), Gaps = 69/440 (15%)

Query: 87  LLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LL   GW D+ F+G       PTP+IDA+A  G+IL + Y+QP  +P+RA+++TG+Y IH
Sbjct: 92  LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIH 151

Query: 144 TGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
            G+  PP++G +P G+  LT   LP+ L + GY T+AIGKWH+G   +E  P   GF+  
Sbjct: 152 HGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDF 207

Query: 203 FGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-WDTVGE 245
            G+ N V   Y      H+     LS   S  ++       + H +R     A  D   +
Sbjct: 208 RGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266

Query: 246 YATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
           Y  DL   +    V+ ++     DKP FLY      H  N            N     + 
Sbjct: 267 YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSS 316

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
           P R +Y   + +++D    +   L++ G L+N++I+F SDNG P  E             
Sbjct: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GR 368

Query: 362 YPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
            P+RG K + WEGGV+VP  + W   IQ  PR S  ++ ++D  PT    AG   +++  
Sbjct: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKV-- 424

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
                   ++L+  T     + IDG+DQ +S  L T  + N    +     + AAVR+D 
Sbjct: 425 --------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470

Query: 481 WKLVLGTQE--NGTMDGYYG 498
           +K  +  Q+    T  GY G
Sbjct: 471 FKYHVLIQQPYAYTQSGYQG 490


>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
           1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
          Length = 536

 Score =  122 bits (306), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 114/413 (27%), Positives = 174/413 (42%), Gaps = 101/413 (24%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ---- 147
           G++D+   G  EI TPN+DALA  G+ L + +    C+P+R+ L+TG      G+     
Sbjct: 16  GFSDIGAFG-GEIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTGTDHHIAGIGTMAE 74

Query: 148 --GPPIWGAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
              P + G       L ER   LPE LRE GY T   GKWHLG  + E TP  RGFE  F
Sbjct: 75  ALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLG-LKPEQTPHARGFERSF 133

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGH-----DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
             L G  ++Y        S    L G      +  R L T  +  G Y++D F  + +Q 
Sbjct: 134 SLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPE--GFYSSDAFGDKLLQY 191

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF---------------------- 296
           ++++   +P F YL   A H       L+AP+E + ++                      
Sbjct: 192 LKERDQSRPFFAYLPFSAPH-----WPLQAPREIVEKYRGRYDAGPEALRQERLARLKEL 246

Query: 297 -------------------QYITDPNR-------RTYAAMVKKLDDSVGTVISALQRKGM 330
                              + + D  R         YAAMV+++D ++G V+  L+R+G 
Sbjct: 247 GLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGRVVDYLRRQGE 306

Query: 331 LENSIIIFMSDNGAP------------------------TVEYRETSNYRNW-------G 359
           L+N+ ++FMSDNGA                         ++E    +N   W        
Sbjct: 307 LDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSYVWYGPRWAQA 366

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +  P R  K    +GG++VPA++  P++ +   +S     + D  PTL   AG
Sbjct: 367 ATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLDLAG 419


>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
          Length = 507

 Score =  122 bits (306), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 124/421 (29%), Positives = 174/421 (41%), Gaps = 63/421 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+  GM   
Sbjct: 32  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGMYPG 90

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFGYLNG 208
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G    
Sbjct: 91  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYS 150

Query: 209 VISYYDHILSDQYSRTVELNGHD-------MRRNLST----AWDTVGEYATDLFTKEAVQ 257
                   L+     T    G D       +  NLS      W    E     F  + + 
Sbjct: 151 HDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMA 210

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
             + Q  D+P FLY A           H   PQ +   F   +   R  +   + +LD +
Sbjct: 211 DAQRQ--DRPFFLYYAS---------HHTHYPQFSGQSFAERS--GRGPFGDSLMELDAA 257

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           VGT+++A+   G+LE +++IF +DNG       ET      G +   R  K T +EGGV+
Sbjct: 258 VGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSGLLRCGKGTTYEGGVR 311

Query: 378 VPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
            PA+  W   I   P V+ ++    D LPTL   AG   + LP                 
Sbjct: 312 EPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP----------------- 349

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRLDSWKLVLGTQENGTM 493
              N  +DG D    LL    S R S+       DE +   AVR   +K    TQ +   
Sbjct: 350 ---NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHS 406

Query: 494 D 494
           D
Sbjct: 407 D 407


>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
          Length = 577

 Score =  122 bits (305), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 116/461 (25%), Positives = 197/461 (42%), Gaps = 103/461 (22%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
            +AA  +   + ++I    G++D+S  G  EIPTPN+ A+A  G+ ++  Y  P+  P+R
Sbjct: 18  AHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMSQYYTSPMSAPAR 76

Query: 133 ASLMTGKYPIHTGMQGPPIW------GAEPRGVPLTERF--LPEYLRELGYSTKAIGKWH 184
           + L+TG      GM G  +W      G E   + LT+R   + E  ++ GY+T   GKWH
Sbjct: 77  SMLLTGNSNQQAGMGG--MWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWH 134

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           LGF     TP  RGF   F ++ G  S+++  +      TVE       R+         
Sbjct: 135 LGFVPGA-TPKERGFNHAFAFMGGGTSHFNDAIP---LGTVEAFHTYYTRDGERVSLPDD 190

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-------- 296
            Y+++ + ++    I+  P ++P+F +LA  A H       L+AP E I +F        
Sbjct: 191 FYSSEAYARQMNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGY 245

Query: 297 ---------------------------------------QYITDPNRRTYAAMVKKLDDS 317
                                                  Q  T    + YAAM+  +D  
Sbjct: 246 AEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQ 305

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVE--YRETS---------NYRNWG------- 359
           +GT++  L++ G  +N++++F++DNGA   +  Y E++         +Y N G       
Sbjct: 306 IGTLMETLKQTGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVS 365

Query: 360 --------SNYPYRGV-KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
                   SN PY    K T  +GG+    ++  P I ++ ++    M + D  PTLY  
Sbjct: 366 YGPHWANVSNAPYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEF 425

Query: 411 AGGDTSR-------LPLNIDGLDQWSSLLLNTPSRRNSNID 444
           AG D ++       LP+      ++ +  +  P R N  ++
Sbjct: 426 AGIDPNKSLAKKPVLPMIGVSFKRYLTGEVQEPPRGNYGVE 466


>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
          Length = 464

 Score =  121 bits (303), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/408 (26%), Positives = 177/408 (43%), Gaps = 94/408 (23%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+S  G  EIPTPN+ A+A  G+ ++  Y  P+  P+R+ L+TG      GM G  +
Sbjct: 37  GYSDISPFG-GEIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTGNSNQQAGMGG--M 93

Query: 152 W------GAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           W      G E   + LT+R   + E  ++ GY+T   GKWHLGF     TP  RGF   F
Sbjct: 94  WWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA-TPKDRGFNHAF 152

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
            ++ G  S+++  +      TVE       R+          Y+++ + ++    I+  P
Sbjct: 153 AFMGGGTSHFNDAIP---LGTVEAFHTYYTRDGERVSLPDDFYSSEAYARQMNSWIKATP 209

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--------------------------- 296
            ++P+F +LA  A H       L+AP E I +F                           
Sbjct: 210 KEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIARLKALGIIHD 264

Query: 297 --------------------QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSII 336
                               Q  T    + YAAM+  +D  +GT++  L++ G  +N+++
Sbjct: 265 DTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQTGRDKNTLL 324

Query: 337 IFMSDNGAPTVE--YRETS---------NYRNWG---------------SNYPYRGV-KN 369
           +F++DNGA   +  Y E++         +Y N G               SN PY    K 
Sbjct: 325 VFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSNAPYANYHKT 384

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           T  +GG+    ++  P I ++ ++    M + D  PTLY  AG D ++
Sbjct: 385 TSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNK 432


>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
          Length = 506

 Score =  120 bits (301), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 174/380 (45%), Gaps = 48/380 (12%)

Query: 77  AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTP 130
           A L+ ++   +L+++    G+ DL  +G     TPN+D LA  G+   + Y  PV  CTP
Sbjct: 12  AGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYV-PVSLCTP 70

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFF- 188
           SRA+L+TG+ P+ +GM  P + G   + G+PL E  L E L   GY T   GKWHLG   
Sbjct: 71  SRAALLTGRLPVRSGMY-PGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGP 129

Query: 189 RREYTPLYRGFESHFGY--------LNGVISYYDHIL----SDQYSRTVELNGHDMRRNL 236
              + P ++GF    G            +  +   I      DQ    + L   ++    
Sbjct: 130 EGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLA-NLTVEA 188

Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
              W    E     F+++ +   + Q   +P FLY A           H   PQ +   F
Sbjct: 189 QPPWLPGLEARYVSFSRDLMADAQRQ--GRPFFLYYAS---------HHTHYPQFSGQSF 237

Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
              +   R  +   + +LD +VG +++ +   G+LE +++IF +DNG       E     
Sbjct: 238 TKRS--GRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGP------ELMRMS 289

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           N G +   R  K T +EGGV+ PA+++ P     P V+ ++    D LPTL    G   +
Sbjct: 290 NGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALTG---A 345

Query: 417 RLP-LNIDGLDQWSSLLLNT 435
            LP + +DG+D  S LLL T
Sbjct: 346 PLPNVTLDGVD-ISPLLLGT 364


>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
          Length = 554

 Score =  117 bits (292), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 115/420 (27%), Positives = 179/420 (42%), Gaps = 117/420 (27%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM----- 146
           GW+D+S  GS EI TPNI+ LA  G+ L N +    C+P+R+ L++G      G+     
Sbjct: 23  GWSDVSPFGS-EIHTPNIERLAKEGVRLTNFHTASACSPTRSMLLSGTDNHIAGLGQMAE 81

Query: 147 ---QGPPIWGAEP--RGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
              +   +WG +P   G  L +R   LPE L+E GY T   GKWHLG     Y P  RGF
Sbjct: 82  TVRRFSKVWGGKPGYEGY-LNDRVAALPEILQEAGYYTTMSGKWHLGLTPDRY-PSKRGF 139

Query: 200 ESHFGYLNGVISYYDH-----------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
           +  F  L G  +++ +            L   Y+   +   H   +N          Y++
Sbjct: 140 KESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDPVDHKSLKNF---------YSS 190

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDP---- 302
           + F ++ +  ++++   +  F YL   A H       L++P+E IN++  +Y   P    
Sbjct: 191 NYFAEKLIDQLKNREKSQSFFAYLPFTAPHW-----PLQSPKEYINKYRGRYSEGPDVLR 245

Query: 303 -NR------------------------------------------RTYAAMVKKLDDSVG 319
            NR                                            YAAMV+ LD ++G
Sbjct: 246 KNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVYAAMVELLDLNIG 305

Query: 320 TVISALQRKGMLENSIIIFMSDNGA--------------PTVEYRETS-----NYRNW-- 358
            VI  L+  G L+N+ +IFMSDNGA              P V+Y + S     NY ++  
Sbjct: 306 RVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNSLENLGNYNSFIW 365

Query: 359 -------GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                   +  P R  K  + EGG++ PAI+  P + +   +S + + + D LPT+   A
Sbjct: 366 YGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTILELA 425


>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
          Length = 507

 Score =  113 bits (283), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 160/361 (44%), Gaps = 44/361 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+  G+   
Sbjct: 32  GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGLYPG 90

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG---- 204
            +  +   G+PL E  L E L   GY T   GKWHLG      + P + GF    G    
Sbjct: 91  VLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPHHGFHRFLGIPYS 150

Query: 205 YLNGVISYYDHI--------LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           +  G                + DQ    + L   ++       W    E     F ++ +
Sbjct: 151 HDQGPCQNLTCFPPATPCEGICDQGLVPIPLLA-NLSVEAQPPWLPGLEARYVAFARDLM 209

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
              + Q   +P FLY          A  H   PQ +   F   +   R  +   + +LD 
Sbjct: 210 TDAQHQ--GRPFFLYY---------ASHHTHYPQFSGQSFPGHS--GRGPFGDSLMELDA 256

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG +++A+   G+L  +++ F +DNG       ET    + G +   R  K T +EGGV
Sbjct: 257 AVGALMTAVGDLGLLGETLVFFTADNGP------ETMRMSHGGCSGLLRCGKGTTFEGGV 310

Query: 377 KVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLN 434
           + PA+  W   I   P V+ ++    D LPTL   AG   ++LP + +DG+D  S LLL 
Sbjct: 311 REPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---AQLPNITLDGVD-LSPLLLG 364

Query: 435 T 435
           T
Sbjct: 365 T 365


>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
          Length = 551

 Score =  109 bits (273), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 101/353 (28%), Positives = 156/353 (44%), Gaps = 44/353 (12%)

Query: 77  AALTKSTTLTLLIVY-GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
           A L K   + L+  + G  DL+ +G        ID +A  G+   N Y    VCTPSR++
Sbjct: 47  APLVKPNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSA 106

Query: 135 LMTGKYPIHTGMQGPPIWGAEPR--------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +MTG+ P+  G  G      E R        G+P +E  + E ++E GY+T  +GKWHLG
Sbjct: 107 IMTGRLPVRIGTFG------ETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLG 160

Query: 187 FFRREYT-----PLYRGFE--SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
                 T     P   GF+   H        S  D  L   +  +     +     +S  
Sbjct: 161 INENSSTDGAHLPFNHGFDFVGHNLPFTNSWSCDDTGLHKDFPDSQRCYLYVNATLVSQP 220

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           +   G   T LFT +A+  IED   D P FLY+A           H+     + + F   
Sbjct: 221 YQHKG--LTQLFTDDALGFIEDNHAD-PFFLYVAF---------AHMHTSLFSSDDFSCT 268

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
           +   R  Y   + ++ D+V  ++  L+   + EN+II F+SD+G P  EY E       G
Sbjct: 269 S--RRGRYGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDHG-PHREYCEEG-----G 320

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
               +RG K+  WEGG ++P I++ P    +P +S +++   D + T     G
Sbjct: 321 DASIFRGGKSHSWEGGHRIPYIVYWPGT-ISPGISNEIVTSMDIIATAADLGG 372


>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
          Length = 535

 Score =  107 bits (268), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 135/519 (26%), Positives = 212/519 (40%), Gaps = 106/519 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct: 47  GWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVT-HN 105

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG     I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLGH-HGPYHPNFRGFDYYFG-----I 159

Query: 211 SY-----------YDHI------LSDQYSRTVELNGHD-----MRRNLSTAWDTVG-EYA 247
            Y           Y+H         D+ SR++E + +      +  NL+     V     
Sbjct: 160 PYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLNIVEQPVNLSSL 219

Query: 248 TDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
              + ++A+Q I+      +P  LY+     H   +   L A               RR 
Sbjct: 220 AHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRTQLSAVLR-----------GRRP 268

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y A ++++D  VG +   + R    EN+ + F  DNG P  +  E +     GS  P+ G
Sbjct: 269 YGAGLREMDSLVGQIKDKVDRTAK-ENTFLWFTGDNG-PWAQKCELA-----GSVGPFTG 321

Query: 367 V----------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           +          K T WEGG +VPA+ + P        S  ++ + D  PT+   AG   +
Sbjct: 322 LWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAG---A 378

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-----IDEKK 471
            LP                   ++ + DGLD  S +L       + VL +       E  
Sbjct: 379 SLP-------------------QDRHFDGLDA-SEVLFGWSQTGHRVLFHPNSGAAGEFG 418

Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNA---IVE-------SKTYQS 521
               VRL S+K    +      DG  G+ + +  PL+ FN    + E       S  YQ 
Sbjct: 419 ALQTVRLGSYKAFYVSGGAKACDGDVGREQHHDPPLI-FNLEDDVAEAVPLDRGSAEYQG 477

Query: 522 ----LQQLSQNIFLPISNIDKMRS--TRQQATIHCGANP 554
               ++++  ++ L I+  +  R+  TR  +   C  NP
Sbjct: 478 VLPKVREILADVLLDIAGDNTSRADYTRHPSVTPC-CNP 515


>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
           GN=ydeN PE=3 SV=2
          Length = 560

 Score =  107 bits (266), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 97/344 (28%), Positives = 146/344 (42%), Gaps = 57/344 (16%)

Query: 106 TPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
           TP + +L   G+   N Y A  V  PSRA++MTG+ P   G+           G+PLTE 
Sbjct: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTET 165

Query: 165 FLPEYLRELGYSTKAIGKWHLG----------------------FFRREYTPLYRGFESH 202
           FLPE  +  GY T A+GKWHL                       F   E+ P  RGF+  
Sbjct: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE-D 261
            G+     +YY+     +    V   G                Y +D  T EA+ +++  
Sbjct: 226 MGFHAAGTAYYNSPSLFKNRERVPAKG----------------YISDQLTDEAIGVVDRA 269

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           + +D+P  LYLA+ A H  N     +  Q+  N      D     Y A V  +D  V  +
Sbjct: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRI 325

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +  L++ G  +N+II+F SDNGA  ++     N    G+    +G K+  + GG   P  
Sbjct: 326 LEQLKKNGQYDNTIILFTSDNGA-VIDGPLPLN----GAQ---KGYKSQTYPGGTHTPMF 377

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           +W     Q P    +++   D+ PT   AA     +  L +DG+
Sbjct: 378 MWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPK-DLKLDGV 419


>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
          Length = 567

 Score =  102 bits (255), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 160/360 (44%), Gaps = 45/360 (12%)

Query: 78  ALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
           A+TK   + LL    G  DLS +G        ID +A  G+     Y+   VCTPSR+++
Sbjct: 63  AMTKPNVILLLADDMGVGDLSVYGHPTQEPGFIDQMANQGLRFTQGYSGDSVCTPSRSAI 122

Query: 136 MTGKYPIHTGMQGPPIWGAE-------PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
           +TG+ PI TG     ++G E         G+PL E  + E ++  GY+T  +GKWHLG  
Sbjct: 123 VTGRQPIRTG-----VYGEERIFLPWTTTGLPLYEVTIAEAMKGAGYTTGMVGKWHLGIN 177

Query: 189 RRE-----YTPLYRGFE--SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
                   + P  RGF+   H           D  L   +  T     +    +++  + 
Sbjct: 178 ENSSSDGAHLPANRGFDFVGHNLPFGNSWRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQ 237

Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
             G   T L   + V  IED  V+KP F+Y++           H+     + + F   + 
Sbjct: 238 HKG--LTQLLRDDTVGFIEDN-VNKPFFMYVSF---------AHMHTSLFSSDDFSCTS- 284

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
             R  Y   ++++D ++  +++ L    + +N++I F SD+G P  EY       N    
Sbjct: 285 -RRGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVIFFTSDHG-PHREYCGEGGDANV--- 339

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             +RG K   WEGG ++P I++ P    +P VS +++   D + T     G   S+LP +
Sbjct: 340 --FRGGKGQSWEGGHRIPYIVYWPGT-ISPGVSHEIVTSMDIIATAVNLGG---SQLPTD 393


>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
          Length = 526

 Score = 99.8 bits (247), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 125/475 (26%), Positives = 194/475 (40%), Gaps = 89/475 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct: 47  GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG     I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTAMIGKWHLG-HHGSYHPSFRGFDYYFG-----I 159

Query: 211 SYYDHI-----------------LSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYA- 247
            Y + +                  SD   R  + + +      +  NL+     V     
Sbjct: 160 PYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENLNIVEQPVNLSGL 219

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP-NRRT 306
              + + AV+ IE        FL    LA        H+  P   ++    + +P ++R 
Sbjct: 220 AQKYAERAVEFIEQASTSGRPFLLYVGLA--------HMHVP---LSVTPPLANPQSQRL 268

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y A ++++D  VG +   +      EN+++ F  DNG P  +  E +     GS  P+ G
Sbjct: 269 YRASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNG-PWAQKCELA-----GSMGPFSG 321

Query: 367 V----------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           +          K T WEGG +VPA+ + P        S  ++ + D  PT+   AG   +
Sbjct: 322 LWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAG---A 378

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-----IDEKK 471
            LP                P+R+    DG+D  S +L       + VL +       E  
Sbjct: 379 SLP----------------PNRK---FDGVDV-SEVLFGKSQTGHRVLFHPNSGAAGEYG 418

Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS 526
               VRLD +K    T      DG  G  + +  PL+ FN   ++     LQ+ S
Sbjct: 419 ALQTVRLDRYKAFYITGGAKACDGGVGPEQHHVSPLI-FNLEDDAAESSPLQKGS 472


>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
          Length = 525

 Score = 96.7 bits (239), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 122/452 (26%), Positives = 180/452 (39%), Gaps = 73/452 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct: 47  GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
                  G+P+ E  L E LR+ GY T  IGKWHLG     Y P +RGF+ +FG  Y N 
Sbjct: 106 FAVTSVGGLPVNETTLAEVLRQEGYVTAMIGKWHLG-HHGSYHPNFRGFDYYFGIPYSND 164

Query: 209 V-------ISYYDHILSDQYSRTVELNGHD--------MRRNLSTAWDTVGEYA-TDLFT 252
           +        +Y       Q        G D        +  NL+     V        + 
Sbjct: 165 MGCTDAPGYNYPPCPACPQRDGLWRNPGRDCYTDVALPLYENLNIVEQPVNLSGLAQKYA 224

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT-YAAMV 311
           + AV+ IE        FL    LA        H+  P        +   P R++ Y A +
Sbjct: 225 ERAVEFIEQASTSGRPFLLYVGLA--------HMHVPLSVTPPLAH---PQRQSLYRASL 273

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV---- 367
           +++D  VG +   +      EN+++ F  DNG P  +  E +     GS  P+ G+    
Sbjct: 274 REMDSLVGQIKDKVDHVAR-ENTLLWFTGDNG-PWAQKCELA-----GSVGPFFGLWQTH 326

Query: 368 ------KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
                 K T WEGG +VPA+ + P        S  ++ + D  PT+   AG   + LP N
Sbjct: 327 QGGSPTKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPTVIALAG---ASLPPN 383

Query: 422 --IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
              DG D    L             G  Q    +L  P+   +      E      VRL+
Sbjct: 384 RKFDGRDVSEVLF------------GKSQMGHRVLFHPNSGAA-----GEYGALQTVRLN 426

Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
            +K    T      DG  G  + +  PL+ FN
Sbjct: 427 HYKAFYITGGAKACDGSVGPEQHHVAPLI-FN 457


>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
          Length = 525

 Score = 94.7 bits (234), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 124/468 (26%), Positives = 181/468 (38%), Gaps = 105/468 (22%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct: 47  GWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVT-RN 105

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG     I
Sbjct: 106 FAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLG-HHGSYHPNFRGFDYYFG-----I 159

Query: 211 SY-----------YDH---------------ILSDQYSRTV-----ELNGHDMRRNLSTA 239
            Y           Y+H               +  D Y+         LN  +   NLS+ 
Sbjct: 160 PYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSSL 219

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
                E AT    + +          +P  LY+A LA        H+  P       Q  
Sbjct: 220 AQKYAEKATQFIQRASTS-------GRPFLLYVA-LA--------HMHVPLPVT---QLP 260

Query: 300 TDPNRRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             P  R+ Y A + ++D  VG +   +    + EN+ + F  DNG P  +  E +     
Sbjct: 261 AAPRGRSLYGAGLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNG-PWAQKCELA----- 313

Query: 359 GSNYPYRG----------VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
           GS  P+ G           K T WEGG +VPA+ + P        S  ++ + D  PT+ 
Sbjct: 314 GSVGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVV 373

Query: 409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-- 466
             A    + LP                   +    DG+D  S +L       + VL +  
Sbjct: 374 ALA---QASLP-------------------QGRRFDGVDV-SEVLFGRSQPGHRVLFHPN 410

Query: 467 ---IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
                E      VRL+ +K    T      DG  G    +K PL+ FN
Sbjct: 411 SGAAGEFGALQTVRLERYKAFYITGGARACDGSTGPELQHKFPLI-FN 457


>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
          Length = 588

 Score = 90.1 bits (222), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 103/409 (25%), Positives = 165/409 (40%), Gaps = 83/409 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  D+  +G+N + TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
                 W     G+P  E    + L+E GY+T  IGKWHLG          + PL+ GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 201 SHFGY---LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
             +G    L G  ++++  LS++          ++ + L+  +  +   A  L   +   
Sbjct: 169 HFYGMPFSLMGDCAHWE--LSEKRV--------NLEQKLNFLFQVLALVALTLVAGKLTH 218

Query: 258 LIEDQPVD----------KPLFL----YLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
           LI   PV             L L    ++  L  HAG          E   +FQ  T   
Sbjct: 219 LI---PVSWTPVIWSALWAVLLLTGSYFVGALIVHAGCLLMRNHTITEQPMRFQKTTPLI 275

Query: 304 RRTYAAMVKK----------------------------------------LDDSVGTVIS 323
            +  A+ +K+                                        +D  VG ++ 
Sbjct: 276 LQEVASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGQILD 335

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L  +G+  +++I F SD+G         + Y  W   Y          EGG++VP I  
Sbjct: 336 TLDMEGLTNSTLIYFTSDHGGSLENQLGRTQYGGWNGIYKGGKGMGGW-EGGIRVPGIFR 394

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
            P +    +V  +   + D  PT+   AGG+  +  + IDG D    LL
Sbjct: 395 WPGVLPAGQVIGEPTSLMDVFPTVVQLAGGEVPQDRV-IDGQDLLPLLL 442


>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
          Length = 562

 Score = 89.7 bits (221), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 76/149 (51%), Gaps = 18/149 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGP- 149
           G  DL  +G+N + TPNID LA  G+ L  ++ A  VCTPSRA+ +TG+YPI +GM  P 
Sbjct: 18  GVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGRYPIRSGMASPY 77

Query: 150 -----PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGF 199
                  W     G+P  E    + L+  GY T  IGKWH G          Y PL  GF
Sbjct: 78  NLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRNDHCYHPLNHGF 137

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELN 228
           +  +G   G++S        Q SRT EL+
Sbjct: 138 DYFYGLPFGLLS------DCQASRTPELH 160



 Score = 44.7 bits (104), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 15/170 (8%)

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
           E    L  KEA+  I D+    P  L+++ L         H+  P  T ++F  +     
Sbjct: 239 ERVASLMLKEALAFI-DRYKRGPFLLFVSFL---------HVHTPLITKDKF--VGHSKY 286

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
             Y   V+++D  VG ++  L ++ +  ++++ F SDNG   +E +E    +  GSN  Y
Sbjct: 287 GLYGDNVEEMDWMVGKILETLDQERLTNHTLVYFTSDNGG-RLEVQE-GEVQLGGSNGIY 344

Query: 365 RGVKNTLWEGG-VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
           +G +      G ++VP I   P + Q  +V  +   + D  PTL    GG
Sbjct: 345 KGGQGMGGWEGGIRVPGIFRWPTVLQAGKVINEPTSLMDIYPTLSYIGGG 394


>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
          Length = 583

 Score = 89.4 bits (220), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 54/142 (38%), Positives = 76/142 (53%), Gaps = 11/142 (7%)

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSR 132
            ++AA   +  L +    G  D   +G+  I TPNID LA  G+ L  ++ A P+CTPSR
Sbjct: 20  ESHAASRPNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSR 79

Query: 133 ASLMTGKYPIHTGMQ-----GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           A+ MTG+YP+ +GM      G  ++ A   G+P  E    + L++ GYST  IGKWHLG 
Sbjct: 80  AAFMTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGM 139

Query: 188 FRREYT-----PLYRGFESHFG 204
                T     PL+ GF   +G
Sbjct: 140 SCHSKTDFCHHPLHHGFNYFYG 161



 Score = 63.2 bits (152), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 87/181 (48%), Gaps = 18/181 (9%)

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           T   T EA Q I+ +  + P  L L++L  H       L + ++   + Q+        Y
Sbjct: 261 TQRLTVEAAQFIQ-RNTETPFLLVLSYLHVHTA-----LFSSKDFAGKSQH------GVY 308

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
              V+++D SVG +++ L    +  +++I F SD GA  VE   +    + GSN  Y+G 
Sbjct: 309 GDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGA-HVEEVSSKGEIHGGSNGIYKGG 367

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
           K   WEGG++VP IL  P++ Q  +   +     D  PT+   AG   + LP +  IDG 
Sbjct: 368 KANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAG---APLPEDRIIDGR 424

Query: 426 D 426
           D
Sbjct: 425 D 425


>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
          Length = 562

 Score = 85.1 bits (209), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 75/149 (50%), Gaps = 18/149 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQG-- 148
           G  DL  +G+N + TPNID LA  G+ L  ++ A  +CTPSRA+ +TG+YPI +GM    
Sbjct: 18  GVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPIRSGMVSAY 77

Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGF 199
                  W     G+P  E    + L+  GY T  IGKWHLG          Y PL  GF
Sbjct: 78  NLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCASRNDHCYHPLNHGF 137

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELN 228
              +G   G++S        Q S+T EL+
Sbjct: 138 HYFYGVPFGLLS------DCQASKTPELH 160



 Score = 44.3 bits (103), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 70/173 (40%), Gaps = 13/173 (7%)

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
           E    L  KEA+  IE     +P  L+ + L  H              I++ +++     
Sbjct: 239 EKVASLMLKEALAFIERYK-REPFLLFFSFLHVHT-----------PLISKKKFVGRSKY 286

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
             Y   V+++D  VG ++ AL ++ +  ++++ F SDNG              W   Y  
Sbjct: 287 GRYGDNVEEMDWMVGKILDALDQERLANHTLVYFTSDNGGHLEPLDGAVQLGGWNGIYKG 346

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
                   EGG++VP I   P + +  RV  +   + D  PTL    GG  S+
Sbjct: 347 GKGMGGW-EGGIRVPGIFRWPSVLEAGRVINEPTSLMDIYPTLSYIGGGILSQ 398


>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
          Length = 589

 Score = 85.1 bits (209), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 69/124 (55%), Gaps = 11/124 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  D+  +G+N + TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct: 49  GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query: 151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
                 W     G+P  E    + L+E GY+T  IGKWHLG          + PL+ GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query: 201 SHFG 204
             +G
Sbjct: 169 HFYG 172



 Score = 41.6 bits (96), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 49/201 (24%), Positives = 81/201 (40%), Gaps = 14/201 (6%)

Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           MR +  T      +  T L  +E    ++      P  L+++ L         H+  P  
Sbjct: 256 MRNHTITEQPMCFQRTTPLILQEVASFLKRNK-HGPFLLFVSFL---------HVHIPLI 305

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
           T+  F  +       Y   V+++D  VG ++  L  +G+  +++I F SD+G        
Sbjct: 306 TMENF--LGKSLHGLYGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLG 363

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
            + Y  W   Y          EGG++VP I   P +    RV  +   + D  PT+   A
Sbjct: 364 NTQYGGWNGIYKGGKGMGGW-EGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLA 422

Query: 412 GGDTSRLPLNIDGLDQWSSLL 432
           GG+  +  + IDG D    LL
Sbjct: 423 GGEVPQDRV-IDGQDLLPLLL 442


>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
          Length = 624

 Score = 81.6 bits (200), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 69/124 (55%), Gaps = 11/124 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
           G  DL  +G+  + TP++D LA  G+ L  ++ A P+CTPSRA+ +TG+YP  +GM    
Sbjct: 46  GIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYPPRSGMAAHG 105

Query: 148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
             G  ++ A   G+P +E  +   L+  GY+T  IGKWHLG   R  T     PL  GF+
Sbjct: 106 RVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFCHHPLRHGFD 165

Query: 201 SHFG 204
              G
Sbjct: 166 RFLG 169



 Score = 67.4 bits (163), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 53/169 (31%), Positives = 77/169 (45%), Gaps = 29/169 (17%)

Query: 266 KPLFLYLAHLAAHA------GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
           +P  L+L+ L  H       G AG+ L                    Y   V+++D  VG
Sbjct: 286 RPFLLFLSFLHVHTAHFADPGFAGRSLHG-----------------AYGDSVEEMDWGVG 328

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            V++AL   G+   +++ F SD+GA  VE       R  GSN  +RG K   WEGGV+VP
Sbjct: 329 RVLAALDELGLARETLVYFTSDHGA-HVEELGPRGERMGGSNGVFRGGKGNNWEGGVRVP 387

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
            ++  P+     RV  +   + D  PT+   AG +   LP +  IDG D
Sbjct: 388 CLVRWPRELSPGRVVAEPTSLMDVFPTVARLAGAE---LPGDRVIDGRD 433


>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
          Length = 577

 Score = 78.6 bits (192), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 61/102 (59%), Gaps = 6/102 (5%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
           G  DL  +G+  + TP+ID LA  G+ L  ++ A P+CTPSRA+ +TG+YP+ +GM    
Sbjct: 37  GIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCTPSRAAFLTGRYPVRSGMASHG 96

Query: 148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
             G  ++ A   G+P  E    + L+  GY+T  +GKWHLG 
Sbjct: 97  RLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWHLGL 138



 Score = 65.5 bits (158), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 79/170 (46%), Gaps = 17/170 (10%)

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           D P  L+L+ +  H      H   P+       +        Y   V+++D +VG V++ 
Sbjct: 276 DTPFLLFLSFMHVHT----AHFANPE-------FAGQSLHGAYGDAVEEMDWAVGQVLAT 324

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L + G+  N+++   SD+GA  VE    +  R+ GSN  YRG K   WEGG++VP ++  
Sbjct: 325 LDKLGLANNTLVYLTSDHGA-HVEELGPNGERHGGSNGIYRGGKANTWEGGIRVPGLVRW 383

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLDQWSSLL 432
           P +    +   +     D  PT+   AG +  T R+   IDG D    LL
Sbjct: 384 PGVIVPGQEVEEPTSNMDVFPTVARLAGAELPTDRV---IDGRDLMPLLL 430


>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
          Length = 593

 Score = 77.8 bits (190), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 73/142 (51%), Gaps = 11/142 (7%)

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSR 132
           +T  A   +  L +    G  DL  +G+N + TPNID LA  G+ L  ++ A P+CTPSR
Sbjct: 34  KTANAFKPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSR 93

Query: 133 ASLMTGKYPIHTGMQGPP-----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           A+ +TG++   +GM          W A   G+P  E      L++ GY+T  IGKWH G 
Sbjct: 94  AAFLTGRHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGV 153

Query: 188 ---FRREYT--PLYRGFESHFG 204
               R ++   PL  GF+  +G
Sbjct: 154 NCASRGDHCHHPLNHGFDYFYG 175



 Score = 46.2 bits (108), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 49/184 (26%), Positives = 78/184 (42%), Gaps = 13/184 (7%)

Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           MR +  T    V E    L  KEAV  IE      P  L+L+ L         H+  P  
Sbjct: 259 MRNHDVTEQPMVLEKTASLMLKEAVSYIERHK-HGPFLLFLSLL---------HVHIPLV 308

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
           T + F  +       Y   V+++D  +G V++A++  G+  ++   F SD+G   +E R+
Sbjct: 309 TTSAF--LGKSQHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGG-HLEARD 365

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
             +     +     G     WEGG++VP I   P +    RV  +   + D  PT+    
Sbjct: 366 GHSQLGGWNGIYKGGKGMGGWEGGIRVPGIFHWPGVLPAGRVIGEPTSLMDVFPTVVQLV 425

Query: 412 GGDT 415
           GG+ 
Sbjct: 426 GGEV 429


>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
          Length = 590

 Score = 75.1 bits (183), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 70/127 (55%), Gaps = 17/127 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  DL  +G++ + TP+ID LA  G+ L  ++ A  +C+PSR++ +TG+YPI +GM    
Sbjct: 41  GIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSS- 99

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF-----FRREYTPLYR 197
             G        A P G+PL E  L   L++ GYST  IGKWH G        + + P   
Sbjct: 100 --GNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNY 157

Query: 198 GFESHFG 204
           GF+ ++G
Sbjct: 158 GFDYYYG 164



 Score = 47.8 bits (112), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 93/226 (41%), Gaps = 24/226 (10%)

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           F +L G   +  H  S  Y   + + GH++      A     E A  +  KEA+  +E  
Sbjct: 225 FIFLLGYAWFSSHT-SPLYWDCLLMRGHEITEQPMKA-----ERAGSIMVKEAISFLERH 278

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             +    L+ + L         H+  P  T + F   +      Y   V+++D  VG ++
Sbjct: 279 SKET-FLLFFSFL---------HVHTPLPTTDDFTGTS--KHGLYGDNVEEMDSMVGKIL 326

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            A+   G+  N+++ F SD+G      R  +    W   Y          EGG++VP I+
Sbjct: 327 DAIDDFGLRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGW-EGGIRVPGIV 385

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
             P      R+  +   + D LPT+ + +GG    LP +  IDG D
Sbjct: 386 RWPGKVPAGRLIKEPTSLMDILPTVASVSGGS---LPQDRVIDGRD 428


>sp|Q10723|ARS_VOLCA Arylsulfatase OS=Volvox carteri PE=1 SV=1
          Length = 649

 Score = 72.0 bits (175), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 153/371 (41%), Gaps = 86/371 (23%)

Query: 112 LAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQ---GPPIWGAEPRGVPLTERFLP 167
           + Y GI L N +   PVC PSR +L  G++  +T      GP    A+ + + + + +LP
Sbjct: 55  IRYPGIELKNYFVTTPVCCPSRTNLWRGQFSHNTNFTDVLGPHGGYAKWKSLGIDKSYLP 114

Query: 168 EYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL 227
            +L+ LGY+T  +GK+ + +    Y  +  G+      ++ +++ Y          T + 
Sbjct: 115 VWLQNLGYNTYYVGKFLVDYSVSNYQNVPAGWTD----IDALVTPY----------TFDY 160

Query: 228 NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGN----- 281
           N     RN +T     G Y+TD+   +AV  I+      KP +  ++ +A H        
Sbjct: 161 NNPGFSRNGATPNIYPGFYSTDVIADKAVAQIKTAVAAGKPFYAQISPIAPHTSTQIYFD 220

Query: 282 ---------------AGKHLE------APQETINQFQYITD---------------PNRR 305
                          A +H E       P+ T ++  Y  D                N R
Sbjct: 221 PVANATKTFFYPPIPAPRHWELFSDATLPEGTSHKNLYEADVSDKPAWIRALPLAQQNNR 280

Query: 306 TYAAMVKKL--------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
           TY   V +L        D+ +  V++ LQ  G+L+N+ +I+ +DNG     +R       
Sbjct: 281 TYLEEVYRLRLRSLASVDELIDRVVATLQEAGVLDNTYLIYSADNGYHVGTHR------- 333

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN----PRVSLQMMHISDWLPTLYTAAGG 413
                 +   K T ++  ++VP ++  P I+ +    P  S   +H+ D+ PT+ T AG 
Sbjct: 334 ------FGAGKVTAYDEDLRVPFLIRGPGIRASHSDKPANSKVGLHV-DFAPTILTLAGA 386

Query: 414 DTSRLPLNIDG 424
                   +DG
Sbjct: 387 GDQVGDKALDG 397


>sp|Q90XB6|SULF1_COTCO Extracellular sulfatase Sulf-1 OS=Coturnix coturnix GN=SULF1 PE=1
           SV=1
          Length = 867

 Score = 69.3 bits (168), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 141/361 (39%), Gaps = 79/361 (21%)

Query: 118 ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQ--GPPIWGA--EPRGVPLTERFLPEY 169
            +N     P+C PSR+S++TGKY     I+T  +    P W A  EPR   +       Y
Sbjct: 77  FINAFVTTPMCCPSRSSMLTGKYVHNHNIYTNNENCSSPSWQATHEPRTFAV-------Y 129

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           L   GY T   GK+ L  +   Y P   G+    G +           S  Y+ T+  NG
Sbjct: 130 LNNTGYRTAFFGKY-LNEYNGSYIPP--GWREWVGLVKN---------SRFYNYTISRNG 177

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA------- 279
           +  +      +D   +Y TDL T E++            +P+ + ++H A H        
Sbjct: 178 NKEKH----GFDYAKDYFTDLITNESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQ 233

Query: 280 -----GNAGKHLE-----APQETINQFQYITDPN-----------RRTYAAMVKKLDDSV 318
                 NA +H+      AP    +     T P            +R     +  +DDS+
Sbjct: 234 FSELYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSM 293

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             +   L   G LEN+ II+ +D+G    ++         G + PY        +  ++V
Sbjct: 294 ERLYQMLAEMGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRV 340

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           P  +  P ++    V   +++I D  PT+   AG DT   P ++DG      L L  P  
Sbjct: 341 PFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDT---PPDMDGKSVLKLLDLERPGN 396

Query: 439 R 439
           R
Sbjct: 397 R 397


>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
           SV=1
          Length = 870

 Score = 68.6 bits (166), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 150/380 (39%), Gaps = 80/380 (21%)

Query: 100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQ--GPPIW 152
           GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct: 58  GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query: 153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct: 118 QALHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN-- 165

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKP 267
                  S  Y+ TV  NG   +      +D   +Y TDL T E++   ++ +     +P
Sbjct: 166 -------SRFYNYTVCRNGIKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRP 214

Query: 268 LFLYLAHLAAHA------------GNAGKHLE-----APQETINQFQYITDPN------- 303
           + + ++H A H              NA +H+      AP    +     T P        
Sbjct: 215 VMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEF 274

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               +R     +  +DDSV  + + L   G L N+ II+ +D+G    ++         G
Sbjct: 275 TNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLVK-----G 329

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
            + PY        +  ++VP  +  P I+    V   +++I D  PT+   AG DT   P
Sbjct: 330 KSMPY--------DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDT---P 377

Query: 420 LNIDGLDQWSSLLLNTPSRR 439
            ++DG      L L  P  R
Sbjct: 378 SDVDGKSVLKLLDLEKPGNR 397


>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
          Length = 870

 Score = 68.2 bits (165), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 92/380 (24%), Positives = 150/380 (39%), Gaps = 80/380 (21%)

Query: 100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQ--GPPIW 152
           GS ++       +   G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct: 58  GSLQVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query: 153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct: 118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN-- 165

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKP 267
                  S  Y+ TV  NG   +      +D   +Y TDL T E++   ++ +     +P
Sbjct: 166 -------SRFYNYTVCRNGIKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRP 214

Query: 268 LFLYLAHLAAHA------------GNAGKHLE-----APQETINQFQYITDPN------- 303
           + + ++H A H              NA +H+      AP    +     T P        
Sbjct: 215 IMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEF 274

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               +R     +  +DDSV  + + L   G L+N+ II+ +D+G    ++         G
Sbjct: 275 TNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQFGLVK-----G 329

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
            + PY        +  ++VP  +  P I+    V   +++I D  PT+   AG D+   P
Sbjct: 330 KSMPY--------DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDS---P 377

Query: 420 LNIDGLDQWSSLLLNTPSRR 439
            ++DG      L L  P  R
Sbjct: 378 SDVDGKSVLKLLDLEKPGNR 397


>sp|P14217|ARS_CHLRE Arylsulfatase OS=Chlamydomonas reinhardtii GN=AS PE=1 SV=2
          Length = 647

 Score = 67.4 bits (163), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 151/375 (40%), Gaps = 95/375 (25%)

Query: 112 LAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG--PPIWG-AEPRGVPLTERFLP 167
           + Y G+ L+  +   PVC PSR +L  G++  +T      PP  G A+ +G+ + + +LP
Sbjct: 56  IRYPGVELSQYFVTTPVCCPSRTNLXRGQFAHNTNFTSVLPPYGGWAKWKGLGIDQSYLP 115

Query: 168 EYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL 227
            +L++ GY+T  +GK+ + +    Y  + R          G IS     +      T + 
Sbjct: 116 LWLKDQGYNTYYVGKFLVDYSVSNYQQVPRA---------GTIS-----MPXVTPYTFDY 161

Query: 228 NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAH-------- 278
           N   ++RN +T     GEY+TD+   + V  I+      KP +  ++ +A H        
Sbjct: 162 NTR-LQRNGATPNIYPGEYSTDVIRDKGVAQIKSAVAAGKPFYAQISPIAPHTSTQISTN 220

Query: 279 --AGNAGKHLEAPQETINQFQYITDP-------------------------------NRR 305
              G    +   P      +Q  +D                                N R
Sbjct: 221 PATGVTRSYFFPPIPAPPHWQLFSDANLPGGSXNKNLYEVDVSDKPAWIRALPLAQQNNR 280

Query: 306 TYAAMVKKL-------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           TY   + +L       D+ +  V+  L   G+L+N+ II+ +DNG     +R        
Sbjct: 281 TYQEEIYRLRLRSLGPDELIEQVVKTLDEAGVLDNTYIIYSADNGYHVGAHR-------- 332

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ-------QNPRVSLQMMHISDWLPTLYTAA 411
                +   K T +E  ++VP ++  P I+       QN +V L +    D+ PT+ + A
Sbjct: 333 -----FGAGKTTGYEEDLRVPFLIRGPGIKASKSDKPQNSKVGLHV----DFAPTILSLA 383

Query: 412 GGDTSRLPLNIDGLD 426
           G   S L L   GLD
Sbjct: 384 GA--SHL-LGDKGLD 395


>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
          Length = 871

 Score = 67.4 bits (163), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 86/346 (24%), Positives = 140/346 (40%), Gaps = 79/346 (22%)

Query: 118 ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQ--GPPIWGA--EPRGVPLTERFLPEY 169
            +N     P+C PSR+S++TGKY     ++T  +    P W A  EPR   +       Y
Sbjct: 77  FINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSWQAMHEPRTFAV-------Y 129

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           L   GY T   GK+ L  +   Y P   G+    G +           S  Y+ TV  NG
Sbjct: 130 LNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKN---------SRFYNYTVCRNG 177

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAV---QLIEDQPVDKPLFLYLAHLAAHA------- 279
              +      +D   +Y TDL T E++   ++ +     +P+ + ++H A H        
Sbjct: 178 IKEKH----GFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQ 233

Query: 280 -----GNAGKHLE-----APQETINQFQYITDPN-----------RRTYAAMVKKLDDSV 318
                 NA +H+      AP    +     T P            +R     +  +DDSV
Sbjct: 234 FSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQTLMSVDDSV 293

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             + + L   G LEN+ II+ +D+G    ++         G + PY        +  ++V
Sbjct: 294 ERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRV 340

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           P  +  P ++    V   +++I D  PT+   AG DT   P ++DG
Sbjct: 341 PFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382


>sp|Q8IWU5|SULF2_HUMAN Extracellular sulfatase Sulf-2 OS=Homo sapiens GN=SULF2 PE=1 SV=1
          Length = 870

 Score = 64.7 bits (156), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/348 (23%), Positives = 136/348 (39%), Gaps = 83/348 (23%)

Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-------GPPIWGAEPRGVPLTERFLPEYL 170
            +N     P+C PSR+S++TGKY +H             P W A+        R    YL
Sbjct: 78  FINAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHE-----SRTFAVYL 131

Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
              GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G 
Sbjct: 132 NSTGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGS 185

Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA-------- 279
           D  +          +Y TDL T ++V            +P+ + ++H A H         
Sbjct: 186 DYSK----------DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQY 235

Query: 280 ----GNAGKHLE-----APQETINQFQYITDPNR-----------RTYAAMVKKLDDSVG 319
                NA +H+      AP    +     T P +           R     +  +DDS+ 
Sbjct: 236 SRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSME 295

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
           T+ + L   G L+N+ I++ +D+G    ++         G + PY        E  ++VP
Sbjct: 296 TIYNMLVETGELDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVP 342

Query: 380 AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
             +  P ++    NP + L +    D  PT+   AG D   +P ++DG
Sbjct: 343 FYVRGPNVEAGCLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>sp|Q8CFG0|SULF2_MOUSE Extracellular sulfatase Sulf-2 OS=Mus musculus GN=Sulf2 PE=2 SV=2
          Length = 875

 Score = 63.5 bits (153), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 136/348 (39%), Gaps = 83/348 (23%)

Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-------GPPIWGAEPRGVPLTERFLPEYL 170
            +N     P+C PSR+S++TGKY +H             P W A+        R    YL
Sbjct: 78  FINAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHE-----SRTFAVYL 131

Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
              GY T   GK+ L  +   Y P   G++   G L           S  Y+ T+  NG 
Sbjct: 132 NSTGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKN---------SRFYNYTLCRNG- 178

Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHA-------- 279
            ++    + + T  +Y TDL T ++V            +P+ + ++H A H         
Sbjct: 179 -VKEKHGSDYST--DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQY 235

Query: 280 ----GNAGKHLE-----APQETINQFQYITDPNR-----------RTYAAMVKKLDDSVG 319
                NA +H+      AP    +     T P +           R     +  +DDS+ 
Sbjct: 236 SRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSME 295

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
           T+   L   G L+N+ I++ +D+G    ++         G + PY        E  ++VP
Sbjct: 296 TIYDMLVETGELDNTYILYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVP 342

Query: 380 AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
             +  P ++    NP + L +    D  PT+   AG D   +P ++DG
Sbjct: 343 FYVRGPNVEAGSLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
           8237 / Type A) GN=CPF_0221 PE=1 SV=1
          Length = 481

 Score = 63.2 bits (152), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 93/398 (23%), Positives = 147/398 (36%), Gaps = 84/398 (21%)

Query: 96  LSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGA 154
           L  +G+  I TPN+D +A  G    N Y A P C  SRAS++TG      G  G      
Sbjct: 18  LGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGMSQKSHGRVG------ 71

Query: 155 EPRGVPLT-ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN------ 207
              GV    E  +     + GY T+ IGK H+  +       +     H GYL+      
Sbjct: 72  YEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNIMLHDGYLHFARNKE 129

Query: 208 GVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
           G  S       D      E  GH         D    +S  W      + T+    E++ 
Sbjct: 130 GKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTNWVVNESID 189

Query: 258 LIEDQPVDKPLFLYLAHLAAHA-------------------------------GNAGKHL 286
            +  +   KP FL ++ +  H+                                N GK +
Sbjct: 190 FLRRKDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKEDEENRGKDI 249

Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
              +  IN+        +  Y   +  +D  +G  + AL   G L N+I +F+SD+G   
Sbjct: 250 NCVKGIINKKALKR--AKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLFVSDHGDMM 307

Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ---IQQNPRVSLQMMHISDW 403
            ++       NW     +R  K   +EG  +VP  ++ P      +  +V  +++ + D 
Sbjct: 308 GDH-------NW-----FR--KGIPYEGSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDI 353

Query: 404 LPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           +PTL   A      +P +++GL      L N    RNS
Sbjct: 354 MPTLLDFA---HISIPDSVEGLS-----LKNLIEERNS 383


>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
           GN=CPE0231 PE=3 SV=1
          Length = 481

 Score = 63.2 bits (152), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 88/382 (23%), Positives = 142/382 (37%), Gaps = 79/382 (20%)

Query: 96  LSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGA 154
           L  +G+  I TPN+D +A  G    N Y A P C  SRAS++TG      G  G      
Sbjct: 18  LGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGMSQKSHGRVG------ 71

Query: 155 EPRGVPLT-ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN------ 207
              GV    E  +     + GY T+ IGK H+  +       +     H GYL+      
Sbjct: 72  YEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNIMLHDGYLHFARNKE 129

Query: 208 GVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
           G  S       D      E  GH         D    +S  W      + T+    E++ 
Sbjct: 130 GKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTNWVVNESID 189

Query: 258 LIEDQPVDKPLFLYLAHLAAHA-------------------------------GNAGKHL 286
            +  +   KP FL ++ +  H+                                N GK +
Sbjct: 190 FLRRRDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKEDEENRGKDI 249

Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
              +  IN+        +  Y   +  +D  +G  + AL   G L N+I +F+SD+G   
Sbjct: 250 NCVKGIINKKALKR--AKAAYYGSITHIDHQIGRFLIALSEYGKLNNTIFLFVSDHGDMM 307

Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ---IQQNPRVSLQMMHISDW 403
            ++       NW     +R  K   +EG  +VP  ++ P      +  +V  +++ + D 
Sbjct: 308 GDH-------NW-----FR--KGIPYEGSARVPFFIYDPGNLLKGKKGKVFDEVLELRDI 353

Query: 404 LPTLYTAAGGDTSRLPLNIDGL 425
           +PTL   A      +P +++GL
Sbjct: 354 MPTLLDFA---HISIPDSVEGL 372


>sp|Q5ZK90|ARSK_CHICK Arylsulfatase K OS=Gallus gallus GN=ARSK PE=2 SV=1
          Length = 535

 Score = 61.6 bits (148), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 152/374 (40%), Gaps = 86/374 (22%)

Query: 96  LSFH-GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG 153
           L+F+ G+  +  P I+ +  +G +  N Y   P+C PSRA++ +G +  H         G
Sbjct: 47  LTFYPGNQTVDLPFINFMKRHGSVFLNAYTNSPICCPSRAAMWSGLF-THLTESWNNFKG 105

Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
            +P  V   +      +++ GY T+  GK        +YT    G  S    +       
Sbjct: 106 LDPDYVTWMD-----LMQKHGYYTQKYGK-------LDYT---SGHHSVSNRVEAWTRDV 150

Query: 214 DHILSDQYSRTVELNGHDMR--RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           + +L  +    V L G D R  R + T W  V + A     KEAV L   QP    L L 
Sbjct: 151 EFLLRQEGRPKVNLTG-DRRHVRVMKTDWQ-VTDKAVTWIKKEAVNLT--QPFALYLGLN 206

Query: 272 LAH-----LAAHAGNAGKHLEAPQ--ETINQFQYITDPN--------------------- 303
           L H      A     +   L +P   E + +++ I  P                      
Sbjct: 207 LPHPYPSPYAGENFGSSTFLTSPYWLEKV-KYEAIKIPTWTALSEMHPVDYYSSYTKNCT 265

Query: 304 -----------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET 352
                      R  Y AM  + D  +G +ISALQ   +L+ +II+F SD+G   +E+R+ 
Sbjct: 266 GEFTKQEVRRIRAFYYAMCAETDAMLGEIISALQDTDLLKKTIIMFTSDHGELAMEHRQF 325

Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                          K +++EG   VP ++  P I++  +VS  ++ + D  PT+     
Sbjct: 326 --------------YKMSMYEGSSHVPLLVMGPGIRKQQQVS-AVVSLVDIYPTML---- 366

Query: 413 GDTSRLPL--NIDG 424
            D +R+P+  N+ G
Sbjct: 367 -DLARIPVLQNLSG 379


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.134    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 248,668,337
Number of Sequences: 539616
Number of extensions: 10948249
Number of successful extensions: 24027
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 57
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 23689
Number of HSP's gapped (non-prelim): 194
length of query: 632
length of database: 191,569,459
effective HSP length: 124
effective length of query: 508
effective length of database: 124,657,075
effective search space: 63325794100
effective search space used: 63325794100
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)