BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10434
         (593 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
          Length = 534

 Score =  359 bits (921), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 40  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 99  LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
              + SP LKQ    S EL HI+DWLPTL   AG   N T  LDG N W  +++G  + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389

Query: 379 SEILHNIDN 387
            E+LHNID 
Sbjct: 390 VELLHNIDQ 398



 Score = 41.6 bits (96), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533


>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
          Length = 528

 Score =  347 bits (891), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 179/366 (48%), Positives = 241/366 (65%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A   PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS L+
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LK+AGYATH +GKWHLG +R+   PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +IA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV  AL+  G+  N++++F  DNG    G   + G+N PLRG K T W+GG+RG  
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 325

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    S EL HI+DWLPTL   AG   + T  LDG + W+ +++G+ + R E
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 385

Query: 381 ILHNID 386
           +L NID
Sbjct: 386 LLLNID 391



 Score = 40.8 bits (94), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 469 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 527


>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
          Length = 535

 Score =  342 bits (878), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 178/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDVSFHGS+ I TP++D LA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ADRPPHLVFVLADDLGWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  +        LD R   QV      +YST+++TE A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 TALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A G       LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTKGTKPLDGFDVWKTISEGSPSPRKE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398



 Score = 35.0 bits (79), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           F+I  DP E+++L+     +++QL  +L  Y    VP      D R DP +    W PW
Sbjct: 477 FDIDQDPEERHDLSRDYPHIVEQLLSRLQFYHKHSVPVHFPAQDPRCDP-KGTGAWGPW 534


>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
          Length = 533

 Score =  335 bits (860), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396



 Score = 35.4 bits (80), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++L+     ++ +L  +L  Y    VP      D R DP +   +W PW
Sbjct: 474 LFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCDP-KATGVWGPW 532


>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  331 bits (849), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 202/554 (36%), Positives = 289/554 (52%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 44  PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 220

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A HN   P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 221 ASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 332 PLLKKKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 392 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L S  G  +N                              E++  +R+        
Sbjct: 438 PPQTLASFPGSWWNL-----------------------------ERMASIRQAV------ 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + 
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANP 509

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 510 RAHPDFNGGAWGPW 523


>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
          Length = 599

 Score =  328 bits (842), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  327 bits (837), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 200/550 (36%), Positives = 288/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             KP+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 KKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L S  G  +N                              E++  +R+            
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASIRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA+ + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
          Length = 569

 Score =  325 bits (834), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
          Length = 573

 Score =  321 bits (822), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 49  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY +   +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+ +
Sbjct: 226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 281 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 337 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 397 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 514

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 515 DFNGGAWGPW 524


>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
          Length = 598

 Score =  319 bits (818), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2
           SV=2
          Length = 520

 Score =  179 bits (455), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 25  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 85  TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++    + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372

Query: 373 GAKTKR 378
           G    R
Sbjct: 373 GQMMDR 378


>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1
           SV=1
          Length = 522

 Score =  177 bits (450), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  GL+    Y    LC+PSR+
Sbjct: 25  SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 85  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390


>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns
           PE=1 SV=1
          Length = 524

 Score =  177 bits (448), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
             L   AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+
Sbjct: 22  GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81

Query: 79  PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           PSR+AL+TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GK
Sbjct: 82  PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
           WHLG  R  + P   GFD  +G        YD+  K     Y+  +M         ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200

Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
           T     T LY + A++ I  +H +  P FLY A  A HA         P     +FL  S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
              R  Y   V  +D+SVG +++ L+  G+ +N+ V F +DNGA         GSN P  
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
             K T ++GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369

Query: 366 QWDVLTKGAKTKR 378
               + +G    R
Sbjct: 370 LLPTMLQGHIIDR 382


>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
          Length = 522

 Score =  175 bits (444), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 123/379 (32%), Positives = 184/379 (48%), Gaps = 27/379 (7%)

Query: 15  LLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
           L+ + A L  T AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y 
Sbjct: 14  LVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYA 73

Query: 74  -QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
              LC+PSR+AL+TG+ PI  G        ++    +    G+P  E LLP+ LK AGYA
Sbjct: 74  ANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYA 133

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHN 181
           +  +GKWHLG  R  + P   GFD  +G        YD+  +     Y+  +M       
Sbjct: 134 SKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEE 192

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
             ++ KT     T +Y + A++ I     +  P FLY A  A HA         P     
Sbjct: 193 FPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA---------PVYASR 243

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
            FL  S  +R  Y   V  +D+SVG ++  LR   +  N+ V F +DNGA         G
Sbjct: 244 AFLGTS--QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQGG 301

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDT 359
           SN P    K T ++GGMR  A  W P      +VS +L  + D   T  + AG+E  +D 
Sbjct: 302 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPSDR 361

Query: 360 SLDGVNQWDVLTKGAKTKR 378
           ++DG++    + +G  T+R
Sbjct: 362 AIDGLDLLPAMLQGRLTER 380


>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2
           SV=1
          Length = 522

 Score =  171 bits (433), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 26/376 (6%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 27  PQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 86

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P  E +LP+ LKEAGY +  +GKWHLG  R
Sbjct: 87  TGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-R 145

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  +     Y+  +M         ++ KT     T
Sbjct: 146 PQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLT 205

Query: 195 DLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            +Y + A++ I     + +P FLY A  A HA         P      FL  S  +R  Y
Sbjct: 206 QVYLQEALDFIKRQQAAQRPFFLYWAIDATHA---------PVYASRPFLGTS--QRGRY 254

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D SVG +++ L+   + EN+ V F +DNGA      +  GSN P    K T +
Sbjct: 255 GDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQGGSNGPFLCGKQTTF 314

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG+   +D  +DG++    +  
Sbjct: 315 EGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAMLG 374

Query: 373 GAKTKRSEILHNIDNV 388
           G  T R    +  D +
Sbjct: 375 GQLTDRPIFYYRGDTL 390


>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
          Length = 506

 Score =  160 bits (405), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 132/390 (33%), Positives = 189/390 (48%), Gaps = 33/390 (8%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            AL    L   A L+T +P  P+I++I ADDLG+ D+  +G     TPN+D LA  GL  
Sbjct: 1   MALGTLFLALAAGLSTASP--PNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRF 58

Query: 69  NQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
              YV  +LCTPSR+AL+TG+ P+  GM  GV+      GLPL E  L + L   GY T 
Sbjct: 59  TDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTG 118

Query: 128 AIGKWHLGFFRE-VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL- 176
             GKWHLG   E  + P  +GF        SH  G  Q L  +  D  CK   +  QGL 
Sbjct: 119 MAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCD--QGLV 176

Query: 177 --DMRHNMQVDNKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPF 232
              +  N+ V+ +   +   +  Y   + +++A+   + +P FLY A    H    Y  F
Sbjct: 177 PIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH----YPQF 232

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
                    F   S   R  +   +  LD +VG ++  +   G+LE ++V+F ADNG P 
Sbjct: 233 SG-----QSFTKRSG--RGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNG-PE 284

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
               SN G +  LR  K T ++GG+R  A ++ P    T  V+ EL    D LPTL A  
Sbjct: 285 LMRMSNGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALT 343

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           G  + + +LDGV+   +L    K+ R  + 
Sbjct: 344 GAPLPNVTLDGVDISPLLLGTGKSPRKSVF 373


>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
          Length = 507

 Score =  158 bits (400), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  + + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 199 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+   G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             K+ R  +       D  +  + A+R    K    T  +  SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 407


>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
          Length = 551

 Score =  152 bits (383), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 130/425 (30%), Positives = 200/425 (47%), Gaps = 66/425 (15%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGLILNQHYVQAL 76
           A L     KKP++++ L DD+GW DV F+G       PTP+IDA+A  GLIL   Y Q  
Sbjct: 76  AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILE---GEPWGLP-LTEKLLPQYLKEAGYATHAIGKW 132
            +P+R+ ++TG+Y IH    HG+++    G+P GL  LT   LPQ L + GY T AIGKW
Sbjct: 136 SSPTRATILTGQYSIH----HGILMPPMYGQPGGLQGLTT--LPQLLHDQGYVTQAIGKW 189

Query: 133 HLGFFRE-----VYTPTFRGFDSHYGYWQGLQDYY---------DHSCKATFEPYQGLDM 178
           H+G  +E     V    FRGF+S    +   +D +         D S      P+   D+
Sbjct: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249

Query: 179 RHNMQVDNKTIG----IYSTDL---YTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYE 230
                 + + I      Y  DL   + +  +  + +  KS KP FLY      H  N   
Sbjct: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY-- 307

Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
               P+ + A     S P R +Y   +  +++   N+   L K+G L+N++++F +DNG 
Sbjct: 308 ----PNAKYAG----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG- 358

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
           P   +  +     P RG K + W+GG+R    ++   + Q +K S  +  ++D  PT   
Sbjct: 359 PEAEVPPH--GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALD 415

Query: 351 AAG--------IEINDTSLDGVNQWDVL--TKGAKTKRSEILHNIDNVDNPQKYYAALRV 400
            AG        +    T +DGV+Q      T G   +++E  H   N        AA+R+
Sbjct: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE--HYFLN-----GKLAAVRM 468

Query: 401 DDLKY 405
           D+ KY
Sbjct: 469 DEFKY 473


>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
          Length = 507

 Score =  150 bits (380), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 177/377 (46%), Gaps = 43/377 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
             P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 19  SPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +G+  GV+      GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGI---- 191
            GF    G  Y        + +C     P +G        + +  N+ V+ +   +    
Sbjct: 139 HGFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLANLSVEAQPPWLPGLE 198

Query: 192 -----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
                ++ DL T+A        ++ +P FLY A    H    Y  F         F   S
Sbjct: 199 ARYVAFARDLMTDA-------QHQGRPFFLYYASHHTH----YPQFSG-----QSFPGHS 242

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
              R  +   +  LD +VG ++ A+   G+L  ++V F ADNG  +  + S+ G +  LR
Sbjct: 243 G--RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRM-SHGGCSGLLR 299

Query: 307 GMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
             K T ++GG+R  A A W   +     V+ EL    D LPTL A AG ++ + +LDGV+
Sbjct: 300 CGKGTTFEGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAQLPNITLDGVD 357

Query: 366 QWDVLTKGAKTKRSEIL 382
              +L    K+ R  + 
Sbjct: 358 LSPLLLGTGKSPRHTLF 374


>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
           1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
          Length = 536

 Score =  147 bits (370), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/425 (29%), Positives = 183/425 (43%), Gaps = 105/425 (24%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+ ++I+ADDLG++D+   G  +I TPN+DALA  GL L   +  + C+P+RS L+TG
Sbjct: 3   KRPNFLVIVADDLGFSDIGAFGG-EIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTG 61

Query: 88  K--YPIHIGMQHGVI---LEGEP-WGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFRE 139
              +   IG     +   LEG+P +   L E++  LP+ L+EAGY T   GKWHLG   E
Sbjct: 62  TDHHIAGIGTMAEALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGLKPE 121

Query: 140 VYTPTFRGFDSHYGYWQGLQDY------YDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
             TP  RGF+  +    G  ++      YD S     +    L +     +D    G YS
Sbjct: 122 -QTPHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPEGFYS 180

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD--------- 244
           +D + +  +  + E ++S+P F YL   A H      P QAP E V K+           
Sbjct: 181 SDAFGDKLLQYLKERDQSRPFFAYLPFSAPHW-----PLQAPREIVEKYRGRYDAGPEAL 235

Query: 245 --------------------------------ISDPER-------RTYAGMVSRLDESVG 265
                                           + D ER         YA MV R+D ++G
Sbjct: 236 RQERLARLKELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIG 295

Query: 266 NVIAALRKHGMLENSIVLFMADNGA--------PSFGI------------------HSN- 298
            V+  LR+ G L+N+ VLFM+DNGA        P FG                    +N 
Sbjct: 296 RVVDYLRRQGELDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANS 355

Query: 299 ---------KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
                    + +  P R  K+    GG+R  A +  P L +   +S     + D  PTL 
Sbjct: 356 YVWYGPRWAQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLL 415

Query: 350 AAAGI 354
             AG+
Sbjct: 416 DLAGV 420


>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
          Length = 535

 Score =  136 bits (343), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 188/422 (44%), Gaps = 55/422 (13%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSA 83
           T  +KP+ +IILADD+GW D+  + +    T N+D +A  G+  ++ H   + C+PSR++
Sbjct: 31  TRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRAS 90

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+  +  G+ H   +     GLPL E  L + L++AGY T  IGKWHLG     Y P
Sbjct: 91  LLTGRLGLRNGVTHNFAVT-SVGGLPLNETTLAEVLQQAGYVTGMIGKWHLG-HHGPYHP 148

Query: 144 TFRGFDSHYG----YWQGLQDY--YDH----SCKATFEPYQGLD----------MRHNMQ 183
            FRGFD ++G    +  G  D   Y+H    +C     P + L+          +  N+ 
Sbjct: 149 NFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLN 208

Query: 184 VDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLY--LAHLAVHAGNTYEPFQAPDEEV 239
           +  + + + S    Y E AI  I   + S +P  LY  LAH+ V    T         ++
Sbjct: 209 IVEQPVNLSSLAHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRT---------QL 259

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
           +  L      RR Y   +  +D  VG +   + +    EN+ + F  DNG P        
Sbjct: 260 SAVLR----GRRPYGAGLREMDSLVGQIKDKVDRTAK-ENTFLWFTGDNG-PWAQKCELA 313

Query: 300 GSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
           GS  P  G+          K T W+GG R  A  + P        S+ L  + D  PT+ 
Sbjct: 314 GSVGPFTGLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVV 373

Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK--YV 406
           A AG  +  D   DG++  +VL   ++T    + H              +R+   K  YV
Sbjct: 374 ALAGASLPQDRHFDGLDASEVLFGWSQTGHRVLFHPNSGAAGEFGALQTVRLGSYKAFYV 433

Query: 407 AG 408
           +G
Sbjct: 434 SG 435


>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
          Length = 526

 Score =  134 bits (337), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 119/422 (28%), Positives = 189/422 (44%), Gaps = 54/422 (12%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRS 82
           T AP+ P+I+IILADD+GW D+  + +    T N+D +A  G+  ++ H   + C+PSR+
Sbjct: 31  TRAPR-PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRA 89

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +L+TG+  +  G+ H   +     GLPL E  L + L++AGY T  IGKWHLG     Y 
Sbjct: 90  SLLTGRLGLRNGVTHNFAVTSV-GGLPLNETTLAEVLQQAGYVTAMIGKWHLGHHGS-YH 147

Query: 143 PTFRGFDSHYGYW----QGLQD---YYDHSCKATFE-------PYQ------GLDMRHNM 182
           P+FRGFD ++G       G  D   Y    C A  +       P +       L +  N+
Sbjct: 148 PSFRGFDYYFGIPYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENL 207

Query: 183 QVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
            +  + + +      Y E A+  I + + S +P  LY+    +H   +  P         
Sbjct: 208 NIVEQPVNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--------- 258

Query: 241 KFLDISDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
               +++P+ +R Y   +  +D  VG +   +  H   EN+++ F  DNG P        
Sbjct: 259 ---PLANPQSQRLYRASLQEMDSLVGQIKDKV-DHVAKENTLLWFAGDNG-PWAQKCELA 313

Query: 300 GSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
           GS  P  G+          K T W+GG R  A  + P        S+ L  + D  PT+ 
Sbjct: 314 GSMGPFSGLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVI 373

Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK--YV 406
           A AG  +  +   DGV+  +VL   ++T    + H              +R+D  K  Y+
Sbjct: 374 ALAGASLPPNRKFDGVDVSEVLFGKSQTGHRVLFHPNSGAAGEYGALQTVRLDRYKAFYI 433

Query: 407 AG 408
            G
Sbjct: 434 TG 435


>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
          Length = 464

 Score =  131 bits (330), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 109/418 (26%), Positives = 184/418 (44%), Gaps = 94/418 (22%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P++I+I+ADD+G++D+S  G  +IPTPN+ A+A  G+ ++Q+Y   +  P+RS L+TG
Sbjct: 24  ERPNVIVIIADDMGYSDISPFGG-EIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTG 82

Query: 88  KYPIHIGMQ----HGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVY 141
                 GM     +   +  E + L LT+++  + +  K+AGY T   GKWHLGF     
Sbjct: 83  NSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA- 141

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           TP  RGF+  + +  G   +++ +    T E +     R   +V       YS++ Y   
Sbjct: 142 TPKDRGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGERVSLPD-DFYSSEAYARQ 200

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------------------ 242
             + I    K +P+F +LA  A H     +P QAPDE + +F                  
Sbjct: 201 MNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIAR 255

Query: 243 ---------------LDISD------PER--------RTYAGMVSRLDESVGNVIAALRK 273
                          L++        PE+        + YA M++ +D  +G ++  L++
Sbjct: 256 LKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQ 315

Query: 274 HGMLENSIVLFMADNGA-------------------------------PSFGIHSNKGSN 302
            G  +N++++F+ DNGA                                S+G H    SN
Sbjct: 316 TGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSN 375

Query: 303 HPLRGM-KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
            P     K+T   GG+     I  P + +  K+ +    + D  PTL   AGI+ N +
Sbjct: 376 APYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNKS 433


>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
          Length = 577

 Score =  130 bits (328), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/418 (26%), Positives = 185/418 (44%), Gaps = 94/418 (22%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P++I+I+ADD+G++D+S  G  +IPTPN+ A+A  G+ ++Q+Y   +  P+RS L+TG
Sbjct: 24  ERPNVIVIIADDMGYSDISPFGG-EIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLLTG 82

Query: 88  KYPIHIGMQ----HGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVY 141
                 GM     +   +  E + L LT+++  + +  K+AGY T   GKWHLGF     
Sbjct: 83  NSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGA- 141

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           TP  RGF+  + +  G   +++ +    T E +     R   +V +     YS++ Y   
Sbjct: 142 TPKERGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGERV-SLPDDFYSSEAYARQ 200

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------------------ 242
             + I    K +P+F +LA  A H     +P QAPDE + +F                  
Sbjct: 201 MNSWIKATPKEQPVFAWLAFTAPH-----DPLQAPDEWIKRFKGQYEQGYAEVYRQRIAR 255

Query: 243 ---------------LDISD------PER--------RTYAGMVSRLDESVGNVIAALRK 273
                          L++        PE+        + YA M++ +D  +G ++  L++
Sbjct: 256 LKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQ 315

Query: 274 HGMLENSIVLFMADNGA-------------------------------PSFGIHSNKGSN 302
            G  +N++++F+ DNGA                                S+G H    SN
Sbjct: 316 TGRDKNTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSN 375

Query: 303 HPLRGM-KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
            P     K+T   GG+     I  P + +  K+ +    + D  PTL   AGI+ N +
Sbjct: 376 APYANYHKTTSAQGGINTDFMISGPGITRHGKIDASTMAVYDVAPTLYEFAGIDPNKS 433


>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
          Length = 554

 Score =  126 bits (317), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 119/434 (27%), Positives = 178/434 (41%), Gaps = 107/434 (24%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           AF      KKP+ ++I+ADDLGW+DVS  G S+I TPNI+ LA  G+ L   +  + C+P
Sbjct: 2   AFNKQAESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSP 60

Query: 80  SRSALMTGKYPIHIG----MQHGVILEGEPWGLP------LTEKL--LPQYLKEAGYATH 127
           +RS L++G    HI     M   V    + WG        L +++  LP+ L+EAGY T 
Sbjct: 61  TRSMLLSGT-DNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTT 119

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQ 183
             GKWHLG   + Y P+ RGF   +    G  +++ +       P       L   ++  
Sbjct: 120 MSGKWHLGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDP 178

Query: 184 VDNKTI-GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           VD+K++   YS++ + E  I+ +    KS+  F YL   A H      P Q+P E + K+
Sbjct: 179 VDHKSLKNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTAPHW-----PLQSPKEYINKY 233

Query: 243 L-------------------------------------------------DISDPERRTY 253
                                                             + S      Y
Sbjct: 234 RGRYSEGPDVLRKNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVY 293

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS--------------------- 292
           A MV  LD ++G VI  L+  G L+N+ V+FM+DNGA                       
Sbjct: 294 AAMVELLDLNIGRVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNS 353

Query: 293 ------------FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
                       +G    + +  P R  K    +GG+R  A I  P L +   +S E   
Sbjct: 354 LENLGNYNSFIWYGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVT 413

Query: 341 ISDWLPTLCAAAGI 354
           + D LPT+   A +
Sbjct: 414 VMDILPTILELAEV 427


>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
          Length = 525

 Score =  126 bits (316), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 176/397 (44%), Gaps = 56/397 (14%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRS 82
           T AP+ P+I+IILADD+GW D+  + +    T N+D +A  G+  ++ H   + C+PSR+
Sbjct: 31  TRAPQ-PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRA 89

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +L+TG+  +  G+ H   +     GLP+ E  L + L++ GY T  IGKWHLG     Y 
Sbjct: 90  SLLTGRLGLRNGVTHNFAVTSV-GGLPVNETTLAEVLRQEGYVTAMIGKWHLGHHGS-YH 147

Query: 143 PTFRGFDSHYGYW----QGLQD---YYDHSCKATFE-------PYQ------GLDMRHNM 182
           P FRGFD ++G       G  D   Y    C A  +       P +       L +  N+
Sbjct: 148 PNFRGFDYYFGIPYSNDMGCTDAPGYNYPPCPACPQRDGLWRNPGRDCYTDVALPLYENL 207

Query: 183 QVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
            +  + + +      Y E A+  I + + S +P  LY+    +H   +  P         
Sbjct: 208 NIVEQPVNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--------- 258

Query: 241 KFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG---------- 289
               ++ P+R++ Y   +  +D  VG +   +  H   EN+++ F  DNG          
Sbjct: 259 ---PLAHPQRQSLYRASLREMDSLVGQIKDKV-DHVARENTLLWFTGDNGPWAQKCELAG 314

Query: 290 --APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
              P FG+        P    K T W+GG R  A  + P        S+ L  + D  PT
Sbjct: 315 SVGPFFGLWQTHQGGSP---TKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPT 371

Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           + A AG  +  +   DG +  +VL   ++     + H
Sbjct: 372 VIALAGASLPPNRKFDGRDVSEVLFGKSQMGHRVLFH 408


>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
          Length = 551

 Score =  124 bits (311), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/372 (29%), Positives = 171/372 (45%), Gaps = 36/372 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           KP++++++AD +G  D++ +G        ID +A  GL     YV  A+CTPSRSA+MTG
Sbjct: 51  KPNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSAIMTG 110

Query: 88  KYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT--- 142
           + P+ IG   +  V L     GLP +E  + + +KEAGYAT  +GKWHLG      T   
Sbjct: 111 RLPVRIGTFGETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLGINENSSTDGA 170

Query: 143 --PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--------GIY 192
             P   GFD   G+     + +  SC  T       D +      N T+        G+ 
Sbjct: 171 HLPFNHGFD-FVGHNLPFTNSW--SCDDTGLHKDFPDSQRCYLYVNATLVSQPYQHKGL- 226

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
            T L+T+ A+  I E N + P FLY+A   +H       F + D             R  
Sbjct: 227 -TQLFTDDALGFI-EDNHADPFFLYVAFAHMHT----SLFSSDDFSCTS-------RRGR 273

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   +  + ++V  ++  L ++ + EN+I+ F++D+G P        G     RG KS  
Sbjct: 274 YGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDHG-PHREYCEEGGDASIFRGGKSHS 332

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT 371
           W+GG R    ++ P    +  +S+E+    D + T     G  +  D   DG +  DVL 
Sbjct: 333 WEGGHRIPYIVYWPG-TISPGISNEIVTSMDIIATAADLGGTTLPTDRIYDGKSIKDVLL 391

Query: 372 KGAKTKRSEILH 383
           +G+ +  S   +
Sbjct: 392 EGSASPHSSFFY 403


>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
          Length = 525

 Score =  124 bits (311), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/398 (29%), Positives = 175/398 (43%), Gaps = 59/398 (14%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSA 83
           T  +KP+ +IILADD+GW D+  + +    T N+D +A  G+  ++ H   + C+PSR++
Sbjct: 31  TRGQKPNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRAS 90

Query: 84  LMTGKYPIHIGMQHGVILE---GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           L+TG+    +G+++GV          GLPL E  L + L++AGY T  IGKWHLG     
Sbjct: 91  LLTGR----LGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGS- 145

Query: 141 YTPTFRGFDSHYG----YWQGLQDY--YDH----SCKATFEPYQGLD----------MRH 180
           Y P FRGFD ++G    +  G  D   Y+H    +C     P + L           +  
Sbjct: 146 YHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYE 205

Query: 181 NMQVDNKTIGIYS-TDLYTEAAINVIAEHNKS-KPMFLY--LAHLAVHAGNTYEPFQAPD 236
           N+ +  + + + S    Y E A   I   + S +P  LY  LAH+ V    T  P  AP 
Sbjct: 206 NLNIVEQPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLP-AAPR 264

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
                        R  Y   +  +D  VG +   +  H + EN+ + F  DNG P     
Sbjct: 265 ------------GRSLYGAGLWEMDSLVGQIKDKV-DHTVKENTFLWFTGDNG-PWAQKC 310

Query: 297 SNKGSNHPLRGM----------KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
              GS  P  G           K T W+GG R  A  + P        S+ L  + D  P
Sbjct: 311 ELAGSVGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFP 370

Query: 347 TLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           T+ A A   +      DGV+  +VL   ++     + H
Sbjct: 371 TVVALAQASLPQGRRFDGVDVSEVLFGRSQPGHRVLFH 408


>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
          Length = 567

 Score =  124 bits (310), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/402 (28%), Positives = 179/402 (44%), Gaps = 42/402 (10%)

Query: 2   TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
           T  R+Y        L      + TA  KP++I++LADD+G  D+S +G        ID +
Sbjct: 39  TATRRYGDGEDLLHLLGQTGQHRTAMTKPNVILLLADDMGVGDLSVYGHPTQEPGFIDQM 98

Query: 62  AYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGM--QHGVILEGEPWGLPLTEKLLPQY 118
           A  GL   Q Y   ++CTPSRSA++TG+ PI  G+  +  + L     GLPL E  + + 
Sbjct: 99  ANQGLRFTQGYSGDSVCTPSRSAIVTGRQPIRTGVYGEERIFLPWTTTGLPLYEVTIAEA 158

Query: 119 LKEAGYATHAIGKWHLGFFRE-----VYTPTFRGFD--SH---YG-YWQ----GL-QDYY 162
           +K AGY T  +GKWHLG          + P  RGFD   H   +G  W+    GL QD+ 
Sbjct: 159 MKGAGYTTGMVGKWHLGINENSSSDGAHLPANRGFDFVGHNLPFGNSWRCDDTGLHQDFP 218

Query: 163 DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLA 222
           D    A F  Y    +    Q  +K +    T L  +  +  I E N +KP F+Y++   
Sbjct: 219 D--TNACFLYYNSTSVAQPFQ--HKGL----TQLLRDDTVGFI-EDNVNKPFFMYVSFAH 269

Query: 223 VHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
           +H       F + D             R  Y   +  +D+++  ++  L  + + +N+++
Sbjct: 270 MHT----SLFSSDDFSCTS-------RRGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVI 318

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
            F +D+G P        G  +  RG K   W+GG R    ++ P    +  VS E+    
Sbjct: 319 FFTSDHG-PHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPG-TISPGVSHEIVTSM 376

Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           D + T     G ++  D   DG     VL +GA +   +  +
Sbjct: 377 DIIATAVNLGGSQLPTDRIYDGKCLKSVLLEGASSPHDDFFY 418


>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
          Length = 583

 Score =  123 bits (309), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 119/432 (27%), Positives = 177/432 (40%), Gaps = 77/432 (17%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A  +P+II+++ADDLG  D   +G+  I TPNID LA  G+ L QH   + LCTPSR+A 
Sbjct: 23  AASRPNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAF 82

Query: 85  MTGKYPIHIGM----QHGVIL-EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           MTG+YP+  GM    + GV L      GLP  E    + LK+ GY+T  IGKWHLG    
Sbjct: 83  MTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCH 142

Query: 140 VYT-----PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
             T     P   GF+  YG    L +  D  CK           +  + +  + +G+   
Sbjct: 143 SKTDFCHHPLHHGFNYFYGI--SLTNLRD--CKPGEGSVFTTGFKRLVFLPLQIVGV--- 195

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLA------------------VHAGNTYEPFQAPD 236
            L T AA+N +   +    +F  L  LA                        YE  Q P 
Sbjct: 196 TLLTLAALNCLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPM 255

Query: 237 E----------EVAKFLD--------------------------ISDPERRTYAGMVSRL 260
                      E A+F+                               +   Y   V  +
Sbjct: 256 SYDNLTQRLTVEAAQFIQRNTETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEM 315

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS----NKGSNHPLRGMKSTPWDGG 316
           D SVG ++  L +  +  ++++ F +D GA    + S    + GSN   +G K+  W+GG
Sbjct: 316 DWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGG 375

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAK 375
           +R    +  P + Q  +   E     D  PT+   AG  +  D  +DG +   +L   ++
Sbjct: 376 IRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAGAPLPEDRIIDGRDLMPLLEGKSQ 435

Query: 376 TKRSEILHNIDN 387
               E L +  N
Sbjct: 436 RSDHEFLFHYCN 447


>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
           GN=ydeN PE=3 SV=2
          Length = 560

 Score =  121 bits (303), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 163/389 (41%), Gaps = 77/389 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSS--------------------------QIPTPNIDALA 62
           KP+II++  DDLG+  + F   S                          Q  TP + +L 
Sbjct: 57  KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116

Query: 63  YNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
             G+     YV   +  PSR+A+MTG+ P   G+      +    G+PLTE  LP+  + 
Sbjct: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQN 173

Query: 122 AGYATHAIGKWHLG----------------------FFREVYTPTFRGFDSHYGYWQGLQ 159
            GY T A+GKWHL                       F  E + P  RGFD   G+     
Sbjct: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233

Query: 160 DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYL 218
            YY+               ++  +V  K    Y +D  T+ AI V+       +P  LYL
Sbjct: 234 AYYNSPSL----------FKNRERVPAKG---YISDQLTDEAIGVVDRAKTLDQPFMLYL 280

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           A+ A H  N      APD+   +F   S      YA + S +D+ V  ++  L+K+G  +
Sbjct: 281 AYNAPHLPNDNP---APDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYD 336

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-SE 337
           N+I+LF +DNGA   G     G+    +G KS  + GG      +W  W  + Q  +  +
Sbjct: 337 NTIILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMW--WKGKLQPGNYDK 391

Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
           L    D+ PT   AA I I  D  LDGV+
Sbjct: 392 LISAMDFYPTALDAADISIPKDLKLDGVS 420


>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
          Length = 589

 Score =  109 bits (272), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 61/137 (44%), Positives = 83/137 (60%), Gaps = 11/137 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
            +P+I++++ADDLG  D+  +G++ + TPNID LA +G+ L QH   A LCTPSR+A +T
Sbjct: 36  SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 95

Query: 87  GKYPIHIGMQHGV---ILE--GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-- 139
           G+YP+  GM   +   +L+  G   GLP  E    + LKE GYAT  IGKWHLG   E  
Sbjct: 96  GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 155

Query: 140 ---VYTPTFRGFDSHYG 153
               + P   GFD  YG
Sbjct: 156 SDHCHHPLHHGFDHFYG 172


>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
          Length = 588

 Score =  108 bits (270), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/137 (44%), Positives = 83/137 (60%), Gaps = 11/137 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
            +P+I++++ADDLG  D+  +G++ + TPNID LA +G+ L QH   A LCTPSR+A +T
Sbjct: 36  SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 95

Query: 87  GKYPIHIGMQHGV---ILE--GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-- 139
           G+YP+  GM   +   +L+  G   GLP  E    + LKE GYAT  IGKWHLG   E  
Sbjct: 96  GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 155

Query: 140 ---VYTPTFRGFDSHYG 153
               + P   GFD  YG
Sbjct: 156 SDHCHHPLHHGFDHFYG 172


>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
          Length = 590

 Score =  107 bits (267), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/157 (42%), Positives = 87/157 (55%), Gaps = 12/157 (7%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F +L C LL N    +     KP+I++I+ DDLG  D+  +G+  + TP+ID LA  G+ 
Sbjct: 9   FMSLVCALL-NTCQAHRVHDDKPNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVR 67

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGM----QHGVILE-GEPWGLPLTEKLLPQYLKE 121
           L QH   A LC+PSRSA +TG+YPI  GM       VI     P GLPL E  L   LK+
Sbjct: 68  LTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGNRRVIQNLAVPAGLPLNETTLAALLKK 127

Query: 122 AGYATHAIGKWHLGF-----FREVYTPTFRGFDSHYG 153
            GY+T  IGKWH G        + + P   GFD +YG
Sbjct: 128 QGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGFDYYYG 164



 Score = 32.7 bits (73), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 12/95 (12%)

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           +  + AI+ +  H+K     L+ + L VH      P    D+    F   S  +   Y  
Sbjct: 266 IMVKEAISFLERHSKET-FLLFFSFLHVHT-----PLPTTDD----FTGTS--KHGLYGD 313

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
            V  +D  VG ++ A+   G+  N++V F +D+G 
Sbjct: 314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGG 348


>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
          Length = 562

 Score =  105 bits (263), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 120/432 (27%), Positives = 174/432 (40%), Gaps = 72/432 (16%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           T   +P+I++++ADDLG  D+  +G++ + TPNID LA  G+ L QH   A +CTPSR+A
Sbjct: 2   TRNARPNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAA 61

Query: 84  LMTGKYPIHIGMQHGVILE------GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
            +TG+YPI  GM     L       G   GLP  E    + L+  GY T  IGKWHLG  
Sbjct: 62  FLTGRYPIRSGMVSAYNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLS 121

Query: 138 -----REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
                   Y P   GF   YG   GL       C+A+  P     +R  + +    + + 
Sbjct: 122 CASRNDHCYHPLNHGFHYFYGVPFGLLS----DCQASKTPELHRWLRIKLWISTVALALV 177

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNT----------YEPFQAP-- 235
              L         +   K   +F  LA L   +     G T          +E  Q P  
Sbjct: 178 PFLLLIPKFARWFSVPWKVIFVFALLAFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMK 237

Query: 236 DEEVA-----------------------KFLDISDP--ERRTYAGM---------VSRLD 261
           +E+VA                        FL +  P   ++ + G          V  +D
Sbjct: 238 EEKVASLMLKEALAFIERYKREPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMD 297

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
             VG ++ AL +  +  +++V F +DNG    P  G     G N   +G K      G  
Sbjct: 298 WMVGKILDALDQERLANHTLVYFTSDNGGHLEPLDGAVQLGGWNGIYKGGKGMGGWEGGI 357

Query: 319 GVAAIWS-PWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKT 376
            V  I+  P + +  +V +E   + D  PTL     GI   D  +DG N   +L   A  
Sbjct: 358 RVPGIFRWPSVLEAGRVINEPTSLMDIYPTLSYIGGGILSQDRVIDGQNLMPLLEGRASH 417

Query: 377 KRSEILHNIDNV 388
              E L +   V
Sbjct: 418 SDHEFLFHYCGV 429


>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
          Length = 562

 Score =  103 bits (258), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 84/160 (52%), Gaps = 16/160 (10%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           T   +P+I++++ADDLG  D+  +G++ + TPNID LA  G+ L QH   A +CTPSR+A
Sbjct: 2   TRNSRPNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAA 61

Query: 84  LMTGKYPIHIGM------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
            +TG+YPI  GM        G+   G   GLP  E    + L+  GY T  IGKWH G  
Sbjct: 62  FLTGRYPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLS 121

Query: 138 -----REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
                   Y P   GFD  YG   GL       C+A+  P
Sbjct: 122 CASRNDHCYHPLNHGFDYFYGLPFGLLS----DCQASRTP 157


>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
          Length = 577

 Score = 96.7 bits (239), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 77/145 (53%), Gaps = 12/145 (8%)

Query: 21  FLNTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
           FL    P   P+ ++I+ADDLG  D+  +G+  + TP+ID LA  G+ L QH   A LCT
Sbjct: 16  FLCAARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCT 75

Query: 79  PSRSALMTGKYPIHIGM-QHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
           PSR+A +TG+YP+  GM  HG     +      GLP  E    + LK  GY T  +GKWH
Sbjct: 76  PSRAAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWH 135

Query: 134 LGFFRE-----VYTPTFRGFDSHYG 153
           LG   +      + P   GFD   G
Sbjct: 136 LGLSCQAASDFCHHPGRHGFDRFLG 160



 Score = 57.0 bits (136), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 81/186 (43%), Gaps = 18/186 (9%)

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           N+  P  L+L+ + VH  +   P     E   + L         Y   V  +D +VG V+
Sbjct: 274 NRDTPFLLFLSFMHVHTAHFANP-----EFAGQSL------HGAYGDAVEEMDWAVGQVL 322

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSN----KGSNHPLRGMKSTPWDGGMRGVAAI- 323
           A L K G+  N++V   +D+GA    +  N     GSN   RG K+  W+GG+R    + 
Sbjct: 323 ATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGKANTWEGGIRVPGLVR 382

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
           W   +   Q+V     ++ D  PT+   AG E+  D  +DG +   +L    +    E L
Sbjct: 383 WPGVIVPGQEVEEPTSNM-DVFPTVARLAGAELPTDRVIDGRDLMPLLLGHVQHSEHEFL 441

Query: 383 HNIDNV 388
            +  N 
Sbjct: 442 FHYCNA 447


>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
          Length = 593

 Score = 94.0 bits (232), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/136 (41%), Positives = 74/136 (54%), Gaps = 11/136 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I++I+ADDLG  D+  +G++ + TPNID LA  G+ L QH   A LCTPSR+A +TG
Sbjct: 40  KPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTG 99

Query: 88  KYPIHIGMQHGVILEGEPW-----GLPLTEKLLPQYLKEAGYATHAIGKWHLGF-----F 137
           ++    GM          W     GLP  E    + L++ GYAT  IGKWH G       
Sbjct: 100 RHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRG 159

Query: 138 REVYTPTFRGFDSHYG 153
              + P   GFD  YG
Sbjct: 160 DHCHHPLNHGFDYFYG 175



 Score = 33.9 bits (76), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 49/113 (43%), Gaps = 12/113 (10%)

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           MR++   +   +   +  L  + A++ I  H K  P  L+L+ L VH          P  
Sbjct: 259 MRNHDVTEQPMVLEKTASLMLKEAVSYIERH-KHGPFLLFLSLLHVH---------IPLV 308

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
             + FL  S  +   Y   V  +D  +G V+ A+  +G+  ++   F +D+G 
Sbjct: 309 TTSAFLGKS--QHGLYGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGG 359


>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
          Length = 624

 Score = 94.0 bits (232), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/133 (42%), Positives = 73/133 (54%), Gaps = 11/133 (8%)

Query: 32  IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGKYP 90
            ++I+ADDLG  D+  +G+  + TP++D LA  G+ L QH   A LCTPSR+A +TG+YP
Sbjct: 37  FLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYP 96

Query: 91  IHIGM-QHGVI----LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT--- 142
              GM  HG +          GLP +E  + + LK  GYAT  IGKWHLG      T   
Sbjct: 97  PRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFC 156

Query: 143 --PTFRGFDSHYG 153
             P   GFD   G
Sbjct: 157 HHPLRHGFDRFLG 169



 Score = 67.4 bits (163), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 18/204 (8%)

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G  +  L  EAA+ +    N+++P  L+L+ L VH  +  +P  A       + D     
Sbjct: 266 GGLTRRLADEAALFL--RRNRARPFLLFLSFLHVHTAHFADPGFAGRSLHGAYGD----- 318

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA--PSFGIHSNK--GSNHPL 305
                  V  +D  VG V+AAL + G+   ++V F +D+GA     G    +  GSN   
Sbjct: 319 ------SVEEMDWGVGRVLAALDELGLARETLVYFTSDHGAHVEELGPRGERMGGSNGVF 372

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGV 364
           RG K   W+GG+R    +  P      +V +E   + D  PT+   AG E+  D  +DG 
Sbjct: 373 RGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEPTSLMDVFPTVARLAGAELPGDRVIDGR 432

Query: 365 NQWDVLTKGAKTKRSEILHNIDNV 388
           +   +L   A+    E L +  N 
Sbjct: 433 DLMPLLRGDAQRSEHEFLFHYCNA 456


>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
           GN=CPE0231 PE=3 SV=1
          Length = 481

 Score = 86.3 bits (212), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 101/411 (24%), Positives = 170/411 (41%), Gaps = 81/411 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I++I+ D +  + +  +G+  I TPN+D +A  G      Y     C  SR++++TG
Sbjct: 2   KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61

Query: 88  ---KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
              K    +G + GV      W     E  +     +AGY T  IGK H+  + E     
Sbjct: 62  MSQKSHGRVGYEDGV-----SWNY---ENTIASEFSKAGYHTQCIGKMHV--YPERNLCG 111

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM------RHNMQVDNKTIGI------- 191
           F     H GY   L    +   KA+ +  Q  D       +    VD   IG+       
Sbjct: 112 FHNIMLHDGY---LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVS 168

Query: 192 ---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTYEPFQA 234
                    + T+     +I+ +   + SKP FL ++ +  H+         + Y+    
Sbjct: 169 RPWGYEENLHPTNWVVNESIDFLRRRDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDL 228

Query: 235 PDEEVAKFLDISDPERR---------------------TYAGMVSRLDESVGNVIAALRK 273
           P+  +  + +  D E R                      Y G ++ +D  +G  + AL +
Sbjct: 229 PEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSE 288

Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP--WLK-Q 330
           +G L N+I LF++D+G          G ++  R  K  P++G  R    I+ P   LK +
Sbjct: 289 YGKLNNTIFLFVSDHG-------DMMGDHNWFR--KGIPYEGSARVPFFIYDPGNLLKGK 339

Query: 331 TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
             KV  E+  + D +PTL   A I I D S++G++  D++ +   T R  I
Sbjct: 340 KGKVFDEVLELRDIMPTLLDFAHISIPD-SVEGLSLKDLIEERNSTWRDYI 389


>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
           8237 / Type A) GN=CPF_0221 PE=1 SV=1
          Length = 481

 Score = 85.1 bits (209), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 109/464 (23%), Positives = 191/464 (41%), Gaps = 92/464 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I++I+ D +  + +  +G+  I TPN+D +A  G      Y     C  SR++++TG
Sbjct: 2   KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61

Query: 88  ---KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
              K    +G + GV      W     E  +     +AGY T  IGK H+  + E     
Sbjct: 62  MSQKSHGRVGYEDGV-----SWNY---ENTIASEFSKAGYHTQCIGKMHV--YPERNLCG 111

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM------RHNMQVDNKTIGI------- 191
           F     H GY   L    +   KA+ +  Q  D       +    VD   IG+       
Sbjct: 112 FHNIMLHDGY---LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVS 168

Query: 192 ---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTYEPFQA 234
                    + T+     +I+ +   + SKP FL ++ +  H+         + Y+    
Sbjct: 169 RPWGYEENLHPTNWVVNESIDFLRRKDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDL 228

Query: 235 PDEEVAKFLDISDPERR---------------------TYAGMVSRLDESVGNVIAALRK 273
           P+  +  + +  D E R                      Y G ++ +D  +G  + AL +
Sbjct: 229 PEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSE 288

Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP--WLK-Q 330
           +G L N+I LF++D+G          G ++  R  K  P++G  R    I+ P   LK +
Sbjct: 289 YGELNNTIFLFVSDHG-------DMMGDHNWFR--KGIPYEGSSRVPFFIYDPGNLLKGK 339

Query: 331 TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
             KV  E+  + D +PTL   A I I D S++G++  +++ +   T R + +H   +   
Sbjct: 340 KGKVFDEVLELRDIMPTLLDFAHISIPD-SVEGLSLKNLIEERNSTWR-DYIHGEHSFGE 397

Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
              +Y   + D   + +      + +E Y D +N+     PKE+
Sbjct: 398 DSNHYIVTKRDKFLWFS-----QRGEEQYFDLEND-----PKEL 431


>sp|Q8BFR4|GNS_MOUSE N-acetylglucosamine-6-sulfatase OS=Mus musculus GN=Gns PE=2 SV=1
          Length = 544

 Score = 72.0 bits (175), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 106/408 (25%), Positives = 173/408 (42%), Gaps = 88/408 (21%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSA 83
           A ++P+++++L DD    D    G +  P     AL    G+  +  YV  ALC PSR++
Sbjct: 35  AARRPNVLLLLTDD---QDAELGGMT--PLKKTKALIGEKGMTFSSAYVPSALCCPSRAS 89

Query: 84  LMTGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLG 135
           ++TGKYP      H V+   LEG    + W         P  LK   GY T   GK    
Sbjct: 90  ILTGKYP----HNHHVVNNTLEGNCSSKAWQKIQEPYTFPAILKSVCGYQTFFAGK---- 141

Query: 136 FFREVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNK 187
           +  E   P   G +     + YW  L+    YY+++         G   +H  N  VD  
Sbjct: 142 YLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD-- 194

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE- 238
               Y TD+    +++ +   + S+P F+ ++  A H+  T  P     FQ   AP  + 
Sbjct: 195 ----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKN 250

Query: 239 ----------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
                                   +FLD  D  RR +  ++S +D+ V  ++  L   G 
Sbjct: 251 FNIHGTNKHWLIRQAKTPMTNSSIRFLD--DAFRRRWQTLLS-VDDLVEKLVKRLDSTGE 307

Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           L+N+ + + +DN     G H+ + S   L   K   ++  ++    +  P +K  Q  S 
Sbjct: 308 LDNTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSK 358

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
            L    D  PT+   AG ++N T +DG++   +L KG +  T RS++L
Sbjct: 359 MLVSNIDLGPTILDLAGYDLNKTQMDGMSLLPIL-KGDRNLTWRSDVL 405


>sp|P31447|YIDJ_ECOLI Uncharacterized sulfatase YidJ OS=Escherichia coli (strain K12)
           GN=yidJ PE=3 SV=1
          Length = 497

 Score = 71.6 bits (174), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 141/366 (38%), Gaps = 63/366 (17%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           K+P+ + ++ D    N V  +    + T NID+LA  G+  N  Y    +CTP+R+ L T
Sbjct: 2   KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL---GFFREVYTP 143
           G Y    G     +  G+          + +Y K+AGY T  IGKWHL    +F     P
Sbjct: 62  GIYANQSGPWTNNVAPGK------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
                D    YW    +Y     +     ++ GL+   ++Q ++           +  A+
Sbjct: 116 PEWDAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-------------SDPE 249
           + + +  ++   FL    + V     + PF  P E + K+ D              + PE
Sbjct: 172 DFLQQPARADEPFL----MVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227

Query: 250 RRT--------------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
                                  Y      +D+ +G VI AL      EN+ V++ +D  
Sbjct: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSD-- 284

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
                 H      H L    +  +D   R    I SP  ++ Q V + + HI D LPT+ 
Sbjct: 285 ------HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHI-DLLPTMM 336

Query: 350 AAAGIE 355
           A A IE
Sbjct: 337 ALADIE 342


>sp|Q1LZH9|GNS_BOVIN N-acetylglucosamine-6-sulfatase OS=Bos taurus GN=GNS PE=2 SV=1
          Length = 560

 Score = 67.0 bits (162), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 102/403 (25%), Positives = 172/403 (42%), Gaps = 82/403 (20%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
           ++P+++++LADD    D    G +  P     AL    G+  +  YV  ALC PSR++++
Sbjct: 53  RRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 107

Query: 86  TGKYPIHIGMQHGVILEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFFREV 140
           TGKYP ++ + +   LEG    + W         P  L+   GY T   GK    +  E 
Sbjct: 108 TGKYPHNLHVVNNT-LEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YLNEY 162

Query: 141 YTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTIGIY 192
             P   G       + YW  L+    YY+++         G   +H  N  VD      Y
Sbjct: 163 GAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD------Y 211

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE------ 238
            TD+    +++ +   + S+P F+ ++  A H+  T  P     FQ   AP  +      
Sbjct: 212 LTDVLANVSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFNIHG 271

Query: 239 -----------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
                              +FLD  +  R+ +  ++S +D+ V  ++  L  +G L N+ 
Sbjct: 272 TNKHWLIRQAKTPMTNSSIQFLD--NAFRKRWQTLLS-VDDLVEKLVKRLEFNGELNNTY 328

Query: 282 VLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           + + +DN     G H+ + S   L   K   ++  ++    +  P +K  Q  S  L   
Sbjct: 329 IFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKMLVAN 379

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
            D  PT+   AG  +N T +DG++   +L KGA   T RS++L
Sbjct: 380 IDLGPTILDIAGYSLNKTQMDGMSFLPIL-KGASNLTWRSDVL 421


>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
          Length = 871

 Score = 66.2 bits (160), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 147/367 (40%), Gaps = 73/367 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSRSALMTG 87
           +P+II++L DD    DV   GS Q+       + + G   +N      +C PSRS+++TG
Sbjct: 42  RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTG 97

Query: 88  KYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           KY +H    H V    E      W      +    YL   GY T   GK+ L  +   Y 
Sbjct: 98  KY-VH---NHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P   G+    G  +  + Y    C+   +   G D   +          Y TDL T  +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200

Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
           N      +    +P+ + ++H A H      P F                 AP+ +    
Sbjct: 201 NYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260

Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
           +  + P            +R+    ++S +D+SV  +   L + G LEN+ +++ AD+G 
Sbjct: 261 MQYTGPMLPIHMEFTNILQRKRLQTLMS-VDDSVERLYNMLVETGELENTYIIYTADHGY 319

Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
               FG+         ++G KS P+D  +R    I  P ++    V   + +I D  PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTI 368

Query: 349 CAAAGIE 355
              AG++
Sbjct: 369 LDIAGLD 375


>sp|P51688|SPHM_HUMAN N-sulphoglucosamine sulphohydrolase OS=Homo sapiens GN=SGSH PE=1
           SV=1
          Length = 502

 Score = 65.5 bits (158), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 78/309 (25%), Positives = 123/309 (39%), Gaps = 70/309 (22%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
           C LL     L     +  + +++LADD G+   +++ S+ I TP++DALA   L+    +
Sbjct: 9   CALLL---VLGLCRARPRNALLLLADDGGFESGAYNNSA-IATPHLDALARRSLLFRNAF 64

Query: 73  VQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
              + C+PSR++L+TG  P H    +G+  +   +      + LP  L +AG  T  IGK
Sbjct: 65  TSVSSCSPSRASLLTG-LPQHQNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVRTGIIGK 123

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
            H+G        T   FD  Y    G                        +QV      I
Sbjct: 124 KHVG------PETVYPFDFAYTEENG----------------------SVLQVGRNITRI 155

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLA----HLAVHAGNTYEPF------------QAP 235
                  +  +    +    +P FLY+A    H   H+   Y  F            + P
Sbjct: 156 -------KLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIP 208

Query: 236 DEE----------VAKFLDISDPERRTYAGM---VSRLDESVGNVIAALRKHGMLENSIV 282
           D            V  F+  +   R   A     V R+D+ VG V+  LR  G+L +++V
Sbjct: 209 DWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLV 268

Query: 283 LFMADNGAP 291
           +F +DNG P
Sbjct: 269 IFTSDNGIP 277


>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
          Length = 870

 Score = 65.5 bits (158), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 151/375 (40%), Gaps = 74/375 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P+II++L DD    DV   GS Q+       +   G      +V   +C PSRS+++TG
Sbjct: 42  RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTG 97

Query: 88  KYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           KY +H    H V    E      W      +    YL   GY T   GK+ L  +   Y 
Sbjct: 98  KY-VH---NHNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P   G+    G  +  + Y    C+   +   G D   +          Y TDL T  +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200

Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
           N      +    +P+ + ++H A H      P F                 AP+ +    
Sbjct: 201 NYFKMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260

Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
           +  + P            +R+    ++S +D+SV  +   L + G L+N+ +++ AD+G 
Sbjct: 261 MQYTGPMLPIHMEFTNVLQRKRLQTLMS-VDDSVERLYNMLVESGELDNTYIIYTADHGY 319

Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
               FG+         ++G KS P+D  +R    I  P ++    V   + +I D  PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTI 368

Query: 349 CAAAGIEINDTSLDG 363
              AG++ + + +DG
Sbjct: 369 LDIAGLD-SPSDVDG 382


>sp|P50426|GNS_CAPHI N-acetylglucosamine-6-sulfatase OS=Capra hircus GN=GNS PE=2 SV=1
          Length = 559

 Score = 65.1 bits (157), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 104/406 (25%), Positives = 171/406 (42%), Gaps = 88/406 (21%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
           ++P+++++LADD    D    G +  P     AL    G+  +  YV  ALC PSR++++
Sbjct: 52  RRPNVVLVLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 106

Query: 86  TGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFF 137
           TGKYP      H V+   LEG    + W         P  L+   GY T   GK    + 
Sbjct: 107 TGKYP----HNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YL 158

Query: 138 REVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTI 189
            E   P   G       + YW  L+    YY+++         G   +H  N  VD    
Sbjct: 159 NEYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD---- 209

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE--- 238
             Y TD+    +++ +   + S+P F+ ++  A H+  T  P     FQ   AP  +   
Sbjct: 210 --YLTDVLANVSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQNAFQNVFAPRNKNFN 267

Query: 239 --------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
                                 +FLD +  ER  +  ++S +D+ V  ++  L  +G L 
Sbjct: 268 IHGTNKHWLIRQAKTPMTNSSIQFLDNAFRER--WQTLLS-VDDLVEKLVKRLEFNGELN 324

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           N+ + + +DN     G H+ + S   L   K   ++  ++    +  P +K  Q  S  L
Sbjct: 325 NTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKML 375

Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
               D  PT+   AG  +N T +DG++   +L +GA   T RS++L
Sbjct: 376 VANIDLGPTILDIAGYGLNKTQMDGMSFLPIL-RGASNLTWRSDVL 420


>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
           SV=1
          Length = 870

 Score = 64.7 bits (156), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 74/375 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P+II++L DD    DV   GS Q+       + + G      +V   +C PSRS+++TG
Sbjct: 42  RPNIILVLTDD---QDVEL-GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTG 97

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKL-----LPQYLKEAGYATHAIGKWHLGFFREVYT 142
           KY +H    H V    E    P  + L        YL   GY T   GK+ L  +   Y 
Sbjct: 98  KY-VH---NHNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYI 152

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P   G+    G  +  + Y    C+   +   G D   +          Y TDL T  +I
Sbjct: 153 PP--GWREWLGLIKNSRFYNYTVCRNGIKEKHGFDYAKD----------YFTDLITNESI 200

Query: 203 NVIAEHNK---SKPMFLYLAHLAVHAGNTYEP-FQ----------------APDEEVAKF 242
           N      +    +P+ + ++H A H      P F                 AP+ +    
Sbjct: 201 NYFKMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWI 260

Query: 243 LDISDP------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG- 289
           +  + P            +R+    ++S +D+SV  +   L + G L N+ +++ AD+G 
Sbjct: 261 MQYTGPMLPIHMEFTNVLQRKRLQTLMS-VDDSVERLYNMLVETGELGNTYIIYTADHGY 319

Query: 290 -APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
               FG+         ++G KS P+D  +R    I  P ++    V   + +I D  PT+
Sbjct: 320 HIGQFGL---------VKG-KSMPYDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTI 368

Query: 349 CAAAGIEINDTSLDG 363
              AG++   + +DG
Sbjct: 369 LDIAGLDT-PSDVDG 382


>sp|P15586|GNS_HUMAN N-acetylglucosamine-6-sulfatase OS=Homo sapiens GN=GNS PE=1 SV=3
          Length = 552

 Score = 63.9 bits (154), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 102/406 (25%), Positives = 170/406 (41%), Gaps = 88/406 (21%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL-AYNGLILNQHYV-QALCTPSRSALM 85
           ++P+++++L DD    D    G +  P     AL    G+  +  YV  ALC PSR++++
Sbjct: 45  RRPNVVLLLTDD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRASIL 99

Query: 86  TGKYPIHIGMQHGVI---LEG----EPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFF 137
           TGKYP      H V+   LEG    + W         P  L+   GY T   GK    + 
Sbjct: 100 TGKYP----HNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK----YL 151

Query: 138 REVYTPTFRGFDS---HYGYWQGLQ---DYYDHSCKATFEPYQGLDMRH--NMQVDNKTI 189
            E   P   G +     + YW  L+    YY+++         G   +H  N  VD    
Sbjct: 152 NEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSI-----NGKARKHGENYSVD---- 202

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQ---APDEE--- 238
             Y TD+    +++ +   +  +P F+ +A  A H+  T  P     FQ   AP  +   
Sbjct: 203 --YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFN 260

Query: 239 --------------------VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
                                 +FLD  +  R+ +  ++S +D+ V  ++  L   G L 
Sbjct: 261 IHGTNKHWLIRQAKTPMTNSSIQFLD--NAFRKRWQTLLS-VDDLVEKLVKRLEFTGELN 317

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           N+ + + +DN     G H+ + S   L   K   ++  ++    +  P +K  Q  S  L
Sbjct: 318 NTYIFYTSDN-----GYHTGQFS---LPIDKRQLYEFDIKVPLLVRGPGIKPNQ-TSKML 368

Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK--TKRSEIL 382
               D  PT+   AG ++N T +DG++   +L +GA   T RS++L
Sbjct: 369 VANIDLGPTILDIAGYDLNKTQMDGMSLLPIL-RGASNLTWRSDVL 413


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.135    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 236,755,866
Number of Sequences: 539616
Number of extensions: 10538256
Number of successful extensions: 23453
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 53
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 23161
Number of HSP's gapped (non-prelim): 161
length of query: 593
length of database: 191,569,459
effective HSP length: 123
effective length of query: 470
effective length of database: 125,196,691
effective search space: 58842444770
effective search space used: 58842444770
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (29.3 bits)