BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy1088
         (905 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P50430|ARSB_RAT Arylsulfatase B OS=Rattus norvegicus GN=Arsb PE=2 SV=2
          Length = 528

 Score =  327 bits (838), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 156/334 (46%), Positives = 224/334 (67%), Gaps = 16/334 (4%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
           ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS ++
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
           TG++ IH G+QH ++  C+   +PL EK+LPQ LK+ GY T +VGKWHLG Y+KE  PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 176 RGFESHLGYWTGHQDYFDHSAEEM------KMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
           RGF+++ GY  G +DY+ H A             LD+R   EPA +    YST++FT  A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
             +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAG 349
           E+VG V +AL+ R + +N++++F +DNGG       +  +NWPLRG K TLWEGG+RGAG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGAG 325

Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
            + SPLL+ +G+ + + +H++DWLPTL++ A  S
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359



 Score = 38.5 bits (88), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LK+ GY T ++ 
Sbjct: 104 LQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVG 139


>sp|P15848|ARSB_HUMAN Arylsulfatase B OS=Homo sapiens GN=ARSB PE=1 SV=1
          Length = 533

 Score =  327 bits (838), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 159/333 (47%), Positives = 219/333 (65%), Gaps = 16/333 (4%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             +S PPH++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 174 TFRGFESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA 227
           T RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT 
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTK 217

Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
            A+ +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  
Sbjct: 218 RAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSL 272

Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRG 347
           +DE+VG V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGGVRG
Sbjct: 273 MDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGVRG 328

Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct: 329 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361



 Score = 42.7 bits (99), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 144



 Score = 36.2 bits (82), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
           +DG DVW  +S   PS R  +LHNID
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNID 396



 Score = 36.2 bits (82), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
           +DG DVW  +S   PS R  +LHNID
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNID 396


>sp|P50429|ARSB_MOUSE Arylsulfatase B OS=Mus musculus GN=Arsb PE=2 SV=3
          Length = 534

 Score =  325 bits (833), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 223/335 (66%), Gaps = 16/335 (4%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
            ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L NYY   LCTPSRS +
Sbjct: 41  GATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQL 99

Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
           +TG++ IH G+QH ++  C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT
Sbjct: 100 LTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPT 159

Query: 175 FRGFESHLGYWTGHQDYFDHSA----EEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
            RGF+++ GY  G +DY+ H A    E +      LD+R   EPA + +  YST++FT  
Sbjct: 160 RRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKR 219

Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
           A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I+D  R  +A ++  +
Sbjct: 220 ATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLM 274

Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGA 348
           DE+VG V +AL+   + +N++ +F +DNGG       +  +NWPLRG K TLWEGG+RG 
Sbjct: 275 DEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTR----SGGNNWPLRGRKGTLWEGGIRGT 330

Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct: 331 GFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365



 Score = 40.0 bits (92), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH ++  C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVG 145



 Score = 33.9 bits (76), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTKGK 612
           +DG ++W  +S   PS R  +LHNID ++       GK
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDFFDGLPCPGK 409



 Score = 33.9 bits (76), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
           +DG ++W  +S   PS R  +LHNID ++
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400


>sp|P33727|ARSB_FELCA Arylsulfatase B OS=Felis catus GN=ARSB PE=2 SV=1
          Length = 535

 Score =  318 bits (815), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 154/331 (46%), Positives = 215/331 (64%), Gaps = 16/331 (4%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
           PPH++F+LADDLGWNDV FHG   I TP++D LA  G++L NYYT  LCTPSRS ++TG+
Sbjct: 46  PPHLVFVLADDLGWNDVSFHG-SNIRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGR 104

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
           + IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF
Sbjct: 105 YQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 164

Query: 179 ESHLGYWTGHQDYFDH------SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           +++ GY  G +DY+ H       +  +    LD R   + A      YST++FT  A  +
Sbjct: 165 DTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATAL 224

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           I +H  ++PLFLYLA  + H     EPLQ P+ YL  +  I+D  R  +A ++  +DE+V
Sbjct: 225 ITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAV 279

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
           G V  AL+   + +N++ +F +DNGG      L   +NWPLRG K +LWEGG+RG G + 
Sbjct: 280 GNVTAALKSHGLWNNTVFIFSTDNGGQ----TLAGGNNWPLRGRKWSLWEGGIRGVGFVA 335

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct: 336 SPLLKQKGVKNRELIHISDWLPTLVKLARGS 366



 Score = 42.7 bits (99), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%)

Query: 1   MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMA 36
           +QH +++ C+   +PL EK+LPQ LKE GY T ++ 
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVG 146



 Score = 37.0 bits (84), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
           +DG DVW  +S   PS R  +LHNID
Sbjct: 373 LDGFDVWKTISEGSPSPRKELLHNID 398



 Score = 37.0 bits (84), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
           +DG DVW  +S   PS R  +LHNID
Sbjct: 373 LDGFDVWKTISEGSPSPRKELLHNID 398


>sp|Q5FYB0|ARSJ_HUMAN Arylsulfatase J OS=Homo sapiens GN=ARSJ PE=2 SV=1
          Length = 599

 Score =  304 bits (779), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 70  TTSTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 128

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct: 129 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 188

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct: 189 TRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 248

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+ +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 249 ILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 303

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL+     +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 304 INNVTLALKTYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 359

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 HSPLLKNKGTVCKELVHITDWYPTLISLA 388



 Score = 38.1 bits (87), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 77/185 (41%), Gaps = 24/185 (12%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
           ++DG D+W  +S    S R  ILHNID  +  +     A   G W   +   ++V  +++
Sbjct: 397 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 456

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
            LTG P   Y   +  + +  L   +  +   I     K V         LF+I  DP E
Sbjct: 457 -LTGNPG--YSDWVPPQSFSNLGPNRWHN-ERITLSTGKSV--------WLFNITADPYE 504

Query: 687 KNNLADRSEDQRINHYTTEVGRFNQIA----YPDKEEEEEKKKKKKKKKKKKKKKKKKKK 742
           + +L++R     +      + +FN+ A    YP K+     +          K++ KKKK
Sbjct: 505 RVDLSNRYPG-IVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPWYKEETKKKK 563

Query: 743 KKKKK 747
             K +
Sbjct: 564 PSKNQ 568


>sp|Q8BM89|ARSJ_MOUSE Arylsulfatase J OS=Mus musculus GN=Arsj PE=2 SV=1
          Length = 598

 Score =  300 bits (767), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 208/329 (63%), Gaps = 12/329 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
            A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 68  TAGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 126

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
            +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  P
Sbjct: 127 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMP 186

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
           T RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     
Sbjct: 187 TKRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 246

Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
           I+  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct: 247 ILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 301

Query: 292 VGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLI 351
           +  V  AL++    +NSII++ SDNGG          SNWPLRG K T WEGG+R  G +
Sbjct: 302 IHNVTLALKRYGFYNNSIIIYSSDNGGQPTA----GGSNWPLRGSKGTYWEGGIRAVGFV 357

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
            SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 358 HSPLLKNKGTVCKELVHITDWYPTLISLA 386



 Score = 35.8 bits (81), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 28/166 (16%)

Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQIS-----ALTKGKW--KLVKVVKVMRYQV 626
           ++DG D+W  +S    S R  ILHNID  +  +     A   G W   +   ++V  +++
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKL 454

Query: 627 DLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVK----EVPCEPQIAPCLFDIKN 682
            LTG P      G SD  W+          A    GP +     +      +  LF+I  
Sbjct: 455 -LTGNP------GYSD--WVP-------PQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498

Query: 683 DPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKKKKKK 728
           DP E+ +L+ R     +      + +FN+ A P +   ++ +   +
Sbjct: 499 DPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 543


>sp|Q32KI9|ARSI_MOUSE Arylsulfatase I OS=Mus musculus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  295 bits (754), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 151/331 (45%), Positives = 212/331 (64%), Gaps = 11/331 (3%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
           VA   PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct: 41  VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99

Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159

Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
           T RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219

Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
           + +H+   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct: 220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274

Query: 293 GKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIW 352
             +  AL++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + 
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVH 330

Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           SPLL+ +   +   VH++DW PTL+  A  +
Sbjct: 331 SPLLKKKRRTSRALVHITDWYPTLVGLAGGT 361



 Score = 37.7 bits (86), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 43/171 (25%), Positives = 60/171 (35%), Gaps = 55/171 (32%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTK 610
            S  + +DG DVW  +S    S R  ILHNID                 W     +A+  
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRV 421

Query: 611 GKWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPV 664
           G+WKL            LTG P   D +    L+      W    M  +R A        
Sbjct: 422 GEWKL------------LTGDPGYGDWIPPQTLASFPGSWWNLERMASIRQAV------- 462

Query: 665 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
                       LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct: 463 -----------WLFNISADPYEREDLAGQRPDV-VRTLLARLADYNRTAIP 501



 Score = 35.0 bits (79), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
            S  + +DG DVW  +S    S R  ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393


>sp|Q32KJ8|ARSI_RAT Arylsulfatase I OS=Rattus norvegicus GN=Arsi PE=2 SV=1
          Length = 573

 Score =  289 bits (740), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 KPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 338 RRTSRALVHITDWYPTLVGLAGGT 361



 Score = 40.0 bits (92), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 61/171 (35%), Gaps = 55/171 (32%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTK 610
            S  + +DG DVW  +S    S R  ILHNID                 W     +A+  
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRV 421

Query: 611 GKWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPV 664
           G+WKL            LTG P   D +    L+      W    M  +R A        
Sbjct: 422 GEWKL------------LTGDPGYGDWIPPQTLASFPGSWWNLERMASIRQAV------- 462

Query: 665 KEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
                       LF+I  DP E+ +LAD+  D  +      +  +N+ A P
Sbjct: 463 -----------WLFNISADPYEREDLADQRPDV-VRTLLARLADYNRTAIP 501



 Score = 34.7 bits (78), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
            S  + +DG DVW  +S    S R  ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393


>sp|Q5FYB1|ARSI_HUMAN Arylsulfatase I OS=Homo sapiens GN=ARSI PE=1 SV=1
          Length = 569

 Score =  289 bits (739), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 208/324 (64%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS  
Sbjct: 167 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQ 226

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 227 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWAL 281

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 282 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 337

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   +H++DW PTL+  A  +
Sbjct: 338 QRTSRALMHITDWYPTLVGLAGGT 361



 Score = 39.3 bits (90), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 67/169 (39%), Gaps = 33/169 (19%)

Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW---QISALTK--GKW--KLVKVVKV 621
            S  + +DG DVW  +S    S R  ILHNID  +   Q  +L    G W   +   ++V
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRV 421

Query: 622 MRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAP 675
             +++ LTG P   D +    L+      W    M  +R A                   
Sbjct: 422 GEWKL-LTGDPGYGDWIPPQTLATFPGSWWNLERMASVRQAV------------------ 462

Query: 676 CLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
            LF+I  DP E+ +LA +  D  +      +  +N+ A P +   E  +
Sbjct: 463 WLFNISADPYEREDLAGQRPDV-VRTLLARLAEYNRTAIPVRYPAENPR 510



 Score = 35.0 bits (79), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
            S  + +DG DVW  +S    S R  ILHNID
Sbjct: 362 TSAADGLDGYDVWPAISEGRASPRTEILHNID 393


>sp|Q32KH7|ARSI_CANFA Arylsulfatase I OS=Canis familiaris GN=ARSI PE=2 SV=2
          Length = 573

 Score =  288 bits (738), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 207/324 (63%), Gaps = 11/324 (3%)

Query: 61  HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP 120
           HIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG++ 
Sbjct: 49  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 121 IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES 180
           IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF++
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167

Query: 181 HLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD 239
            LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS  
Sbjct: 168 FLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPR 227

Query: 240 EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
            PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  AL
Sbjct: 228 RPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSAL 282

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
           ++    +NS+I+F SDNGG       +  SNWPLRG K T WEGGVRG G + SPLL+ +
Sbjct: 283 KRYGFYNNSVIIFSSDNGGQ----TFSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRK 338

Query: 360 GIVAEQYVHVSDWLPTLLSAANKS 383
              +   VH++DW PTL+  A  +
Sbjct: 339 RRTSRALVHITDWYPTLVGLAGGT 362



 Score = 36.6 bits (83), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 62/179 (34%), Gaps = 55/179 (30%)

Query: 570 SYQNEIDGIDVWSVLSRNEPSKRNTILHNID---------------DEWQI---SALTKG 611
           S  + +DG DVW  +S    S R  ILHNID                 W     +A+  G
Sbjct: 364 SAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHARHGSLEAGFGIWNTAVQAAIRVG 423

Query: 612 KWKLVKVVKVMRYQVDLTGGP---DQV---YLSGLSDREWLALAMRKLRDAASIQCGPVK 665
           +WKL            LTG P   D +    L+      W    M   R A         
Sbjct: 424 EWKL------------LTGDPGYGDWIPPQTLAAFPGSWWNLERMASARQAV-------- 463

Query: 666 EVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEEEEKK 724
                      LF+I  DP E+ +LA +  D  +      +  +N+ A P +   E  +
Sbjct: 464 ----------WLFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIPVRYPAENPR 511



 Score = 34.7 bits (78), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNID 535
           S  + +DG DVW  +S    S R  ILHNID
Sbjct: 364 SAADGLDGYDVWPAISEGRASPRTEILHNID 394


>sp|P34059|GALNS_HUMAN N-acetylgalactosamine-6-sulfatase OS=Homo sapiens GN=GALNS PE=1
           SV=1
          Length = 522

 Score =  179 bits (453), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 179/362 (49%), Gaps = 42/362 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct: 13  LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELG 153
           Y+   LC+PSR+A++TG+ PI  G      +           GG+P SE++LP+ LK+ G
Sbjct: 73  YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRD---- 183

Query: 214 WDLHGKYS--------------TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEP 259
           W++ G+Y               T ++  EA+D I   +   P FLY A  ATH+     P
Sbjct: 184 WEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----P 238

Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           + A   +L         +R ++   + ++D+S+GK++E L+   +  N+ + F SDNG A
Sbjct: 239 VYASKPFLGTS------QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAA 292

Query: 320 AAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                    SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ 
Sbjct: 293 LISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLAL 352

Query: 380 AN 381
           A 
Sbjct: 353 AG 354


>sp|Q8WNQ7|GALNS_PIG N-acetylgalactosamine-6-sulfatase OS=Sus scrofa GN=GALNS PE=2 SV=1
          Length = 522

 Score =  173 bits (439), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 177/363 (48%), Gaps = 43/363 (11%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct: 12  LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELG 153
           Y    LC+PSR+A++TG+ PI TG         N     E  GG+P  E +LP+ LK  G
Sbjct: 72  YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
           Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD    
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYRD---- 182

Query: 214 WDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYE 258
           W++ G++               T ++  EA+D I    +T  P FLY A  ATH+     
Sbjct: 183 WEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA----- 237

Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
           P+ A   +L         +R ++   + ++D+SVG++V  L   ++  N+ + F SDNG 
Sbjct: 238 PVYASRAFLGTS------QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGA 291

Query: 319 AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
           A         SN P    K T +EGG+R   + W P     G V+ Q   V D   T LS
Sbjct: 292 ALVSAPKQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLS 351

Query: 379 AAN 381
            A 
Sbjct: 352 LAG 354


>sp|Q571E4|GALNS_MOUSE N-acetylgalactosamine-6-sulfatase OS=Mus musculus GN=Galns PE=2
           SV=2
          Length = 520

 Score =  173 bits (438), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 174/346 (50%), Gaps = 43/346 (12%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
           PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+   LC+PSR+A++TG
Sbjct: 27  PPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTG 86

Query: 118 KHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           + PI  G         N     E  GG+P SE +LP+ LK+ GY  +IVGKWHLG ++ +
Sbjct: 87  RLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQ 145

Query: 171 YTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYS--------- 221
           + P   GF+   G    H   +D+ A+      + + RD    W++ G++          
Sbjct: 146 FHPLKHGFDEWFGSPNCHFGPYDNKAKP----NIPVYRD----WEMVGRFYEEFPINRKT 197

Query: 222 -----TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIED 275
                T ++T EA+D I   H+   P FLY A  ATH+     P+ A   +L        
Sbjct: 198 GEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYASRQFLGTSL---- 248

Query: 276 FKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRG 335
             R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A         SN P   
Sbjct: 249 --RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLC 306

Query: 336 VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
            K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 307 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 352


>sp|Q32KH5|GALNS_CANFA N-acetylgalactosamine-6-sulfatase OS=Canis familiaris GN=GALNS PE=2
           SV=1
          Length = 522

 Score =  172 bits (437), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 176/368 (47%), Gaps = 53/368 (14%)

Query: 42  LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
           L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct: 12  LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71

Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCER------------GGLPLSEKILPQY 148
           Y+   LC+PSR+A++TG+ PI  G      Y   R            GG+P  E +LP+ 
Sbjct: 72  YSANPLCSPSRAALLTGRLPIRNG-----FYTTNRHARNAYTPQEIVGGIPDQEHVLPEL 126

Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRR 208
           LKE GY ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + R
Sbjct: 127 LKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARP----NIPVYR 181

Query: 209 DLEPAWDLHGKYS--------------TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHS 253
           D    W++ G+Y               T V+  EA+D I    +   P FLY A  ATH+
Sbjct: 182 D----WEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA 237

Query: 254 ANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
                P+ A   +L         +R ++   + ++D SVGK++  L+  R+  N+ + F 
Sbjct: 238 -----PVYASRPFLGTS------QRGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFT 286

Query: 314 SDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
           SDNG A         SN P    K T +EGG+R   + W P     G V+ Q   + D  
Sbjct: 287 SDNGAALISAPNQGGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLF 346

Query: 374 PTLLSAAN 381
            T LS A 
Sbjct: 347 TTSLSLAG 354


>sp|Q32KJ6|GALNS_RAT N-acetylgalactosamine-6-sulfatase OS=Rattus norvegicus GN=Galns
           PE=1 SV=1
          Length = 524

 Score =  169 bits (429), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 177/360 (49%), Gaps = 43/360 (11%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV 104
            LS + +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++Y+ 
Sbjct: 17  VLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSA 76

Query: 105 Q-LCTPSRSAIMTGKHPIHTGM------QHNVLYGCE-RGGLPLSEKILPQYLKELGYRT 156
             LC+PSR+A++TG+ PI  G         N     E  GG+P SE +LP+ LK+ GY  
Sbjct: 77  NPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTN 136

Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
           +IVGKWHLG ++ ++ P   GF+   G    H   +D+  +      + + RD    W++
Sbjct: 137 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKP----NIPVYRD----WEM 187

Query: 217 HGKYS--------------TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQ 261
            G++               T ++  EA+D I   H+   P FLY A  ATH+     P+ 
Sbjct: 188 VGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVY 242

Query: 262 APDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA 321
           A   +L          R ++   + ++D+SVGK++  L+   +  N+ + F SDNG A  
Sbjct: 243 ASKQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI 296

Query: 322 GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                  SN P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A 
Sbjct: 297 SAPKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAG 356


>sp|P15289|ARSA_HUMAN Arylsulfatase A OS=Homo sapiens GN=ARSA PE=1 SV=3
          Length = 507

 Score =  150 bits (379), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/350 (32%), Positives = 161/350 (46%), Gaps = 40/350 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  V LCTPSR+A++TG
Sbjct: 20  PPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
           + P+  GM   VL    RGGLPL E  + + L   GY T + GKWHLG   +  + P  +
Sbjct: 80  RLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQ 139

Query: 177 GFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------MRRDLEPAW--DLHG 218
           GF   LG    H     Q+            G D           +  + +P W   L  
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA 199

Query: 219 KYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFK 277
           +Y      A A D++ +    D P FLY A   TH               +     E   
Sbjct: 200 RY-----MAFAHDLMADAQRQDRPFFLYYASHHTHYPQ-----------FSGQSFAERSG 243

Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVK 337
           R  F   L +LD +VG ++ A+    +L  ++++F +DNG      +    S   LR  K
Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LRCGK 302

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
            T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN
Sbjct: 303 GTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350


>sp|P50428|ARSA_MOUSE Arylsulfatase A OS=Mus musculus GN=Arsa PE=2 SV=2
          Length = 506

 Score =  146 bits (368), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 172/371 (46%), Gaps = 40/371 (10%)

Query: 45  TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
           TL +     ++++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct: 5   TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYVP 64

Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
           V LCTPSR+A++TG+ P+ +GM   VL    +GGLPL E  L + L   GY T + GKWH
Sbjct: 65  VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 124

Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
           LG   +  + P  +GF   LG    H     Q+      +     G D           +
Sbjct: 125 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLANL 184

Query: 207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAP 263
             + +P W   L  +Y      + + D++ +      P FLY A   TH           
Sbjct: 185 TVEAQPPWLPGLEARY-----VSFSRDLMADAQRQGRPFFLYYASHHTHYPQ-------- 231

Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGF 323
               +     +   R  F   L +LD +VG ++  +    +L  ++++F +DNG      
Sbjct: 232 ---FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRM 288

Query: 324 NLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           + N   +  LR  K T +EGGVR   L++ P   + G+  E    + D LPT L+A   +
Sbjct: 289 S-NGGCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPT-LAALTGA 345

Query: 384 DIPNYVNSTVE 394
            +PN     V+
Sbjct: 346 PLPNVTLDGVD 356


>sp|Q08DD1|ARSA_BOVIN Arylsulfatase A OS=Bos taurus GN=ARSA PE=2 SV=1
          Length = 507

 Score =  145 bits (367), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 112/358 (31%), Positives = 162/358 (45%), Gaps = 42/358 (11%)

Query: 59  PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
           PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  V LCTPSR+A++TG
Sbjct: 20  PPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTG 79

Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYTPTFR 176
           + P+  G+   VL    RGGLPL E  L + L   GY T I GKWHLG   +  + P   
Sbjct: 80  RLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPHH 139

Query: 177 GFESHLGYWTGH-----------------QDYFDHSAEEMKMWGLDMRRDLEPAW--DLH 217
           GF   LG    H                 +   D     + +   ++  + +P W   L 
Sbjct: 140 GFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLA-NLSVEAQPPWLPGLE 198

Query: 218 GKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
            +Y      A A D++ +      P FLY A   TH    +     P H           
Sbjct: 199 ARY-----VAFARDLMTDAQHQGRPFFLYYASHHTHYPQ-FSGQSFPGHS---------- 242

Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV 336
            R  F   L +LD +VG ++ A+    +L  +++ F +DNG      +    S   LR  
Sbjct: 243 GRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRMSHGGCSGL-LRCG 301

Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVE 394
           K T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN     V+
Sbjct: 302 KGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-AQLPNITLDGVD 357


>sp|P25549|ASLA_ECOLI Arylsulfatase OS=Escherichia coli (strain K12) GN=aslA PE=3 SV=2
          Length = 551

 Score =  137 bits (345), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 174/371 (46%), Gaps = 57/371 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQI---PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
           P+++  L DD+GW DVGF+G       PTP+IDA+A  G+IL + Y+    +P+R+ I+T
Sbjct: 86  PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145

Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT-- 174
           G++ IH G+    +YG + GGL      LPQ L + GY T+ +GKWH+G   KE  P   
Sbjct: 146 GQYSIHHGILMPPMYG-QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202

Query: 175 ----FRGFESHLGYWTGHQDYFDHSAEEMKMW--------GLDMRRD---------LEPA 213
               FRGF S    +T  +D   H   E+ +          L   +D          +  
Sbjct: 203 GFDDFRGFNSVSDMYTEWRDV--HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260

Query: 214 WDLHGKYSTDV---FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
            D+  KY  D+   +    V  +   + +D+P FLY      H           D+Y N 
Sbjct: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----------DNYPNA 310

Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAAS 329
                   R+ +   + ++++    + + LE+   L N++IVF SDNG  A    +    
Sbjct: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVPPHG 367

Query: 330 NWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             P RG K + WEGGVR    + W  +++ R   ++  V ++D  PT L      D+  +
Sbjct: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTAL------DLAGH 419

Query: 389 VNSTVENIIPR 399
             + V N++P+
Sbjct: 420 PGAKVANLVPK 430


>sp|P51691|ARS_PSEAE Arylsulfatase OS=Pseudomonas aeruginosa (strain ATCC 15692 / PAO1 /
           1C / PRS 101 / LMG 12228) GN=atsA PE=1 SV=3
          Length = 536

 Score =  132 bits (333), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 118/423 (27%), Positives = 185/423 (43%), Gaps = 109/423 (25%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK- 118
           P+ + I+ADDLG++D+G  G  +I TPN+DALA +G+ L +++T   C+P+RS ++TG  
Sbjct: 5   PNFLVIVADDLGFSDIGAFG-GEIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTGTD 63

Query: 119 -HPIHTGMQHNVLYGCERGGLP-----LSEKI--LPQYLKELGYRTRIVGKWHLGFYKKE 170
            H    G     L   E  G P     L+E++  LP+ L+E GY+T + GKWHLG  K E
Sbjct: 64  HHIAGIGTMAEALT-PELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGL-KPE 121

Query: 171 YTPTFRGFESHLGYWTGHQDY------FDHSAEEMKMWGLDMRRDLEPAWDL--HGKYST 222
            TP  RGFE       G  ++      +D S   +      +  + E   D    G YS+
Sbjct: 122 QTPHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPEGFYSS 181

Query: 223 DVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR----------- 271
           D F  + +  +       P F YL  +A     P+ PLQAP   +  +R           
Sbjct: 182 DAFGDKLLQYLKERDQSRPFFAYLPFSA-----PHWPLQAPREIVEKYRGRYDAGPEALR 236

Query: 272 ------------------------------HIEDFKRSK-------FAAILHKLDESVGK 294
                                          +ED +R+K       +AA++ ++D ++G+
Sbjct: 237 QERLARLKELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGR 296

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGAAA-------------GF----------NLNAASNW 331
           VV+ L ++  L N+ ++F+SDNG   A             GF          N+  A+++
Sbjct: 297 VVDYLRRQGELDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSY 356

Query: 332 -------------PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
                        P R  K    +GG+R   L+  P L  +G ++  +  V D  PTLL 
Sbjct: 357 VWYGPRWAQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLD 416

Query: 379 AAN 381
            A 
Sbjct: 417 LAG 419


>sp|P77318|YDEN_ECOLI Uncharacterized sulfatase YdeN OS=Escherichia coli (strain K12)
           GN=ydeN PE=3 SV=2
          Length = 560

 Score =  123 bits (309), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 121/454 (26%), Positives = 192/454 (42%), Gaps = 98/454 (21%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFH--------------------GLD------QIPTPNI 88
           ++ G P+II +  DDLG+  + F                     G+D      Q  TP +
Sbjct: 53  STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112

Query: 89  DALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQ 147
            +L   G+   N Y    +  PSR+AIMTG+ P   G+  N      + G+PL+E  LP+
Sbjct: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPE 169

Query: 148 YLKELGYRTRIVGKWHLG----------------------FYKKEYTPTFRGFESHLGYW 185
             +  GY T  VGKWHL                       F  +E+ P  RGF+  +G+ 
Sbjct: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229

Query: 186 TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPLFL 244
                Y++  +       L   R+  PA      Y +D  T EA+ ++    T D+P  L
Sbjct: 230 AAGTAYYNSPS-------LFKNRERVPA----KGYISDQLTDEAIGVVDRAKTLDQPFML 278

Query: 245 YLAHAATH--SANPYEPLQAPDHY---LNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
           YLA+ A H  + NP     APD Y    N      D     + A ++ +D+ V +++E L
Sbjct: 279 YLAYNAPHLPNDNP-----APDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRILEQL 329

Query: 300 EQRRMLSNSIIVFVSDNGGAAAG-FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLES 358
           ++     N+II+F SDNG    G   LN A     +G K+  + GG      +W      
Sbjct: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ----KGYKSQTYPGGTHTPMFMWWKGKLQ 385

Query: 359 RGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-----------ILRY 407
            G   ++ +   D+ PT L AA+ S IP  +     +++P  ++            I  Y
Sbjct: 386 PGNY-DKLISAMDFYPTALDAADIS-IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443

Query: 408 ENGTHEYNSPRIENSN--TRYENGTHEYNPKYEN 439
            +   E N P  +N +   R+++  + +NP  E+
Sbjct: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477


>sp|Q3TYD4|ARSG_MOUSE Arylsulfatase G OS=Mus musculus GN=Arsg PE=2 SV=1
          Length = 525

 Score =  119 bits (298), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 112/396 (28%), Positives = 174/396 (43%), Gaps = 67/396 (16%)

Query: 39  VLPLAFTLSMVFVDLVASS------GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
           VL +    S  F  LV  S       P P+I+ ILADD+GW D+G +  +   T N+D +
Sbjct: 8   VLLVGMAFSGFFYPLVDFSISGKTRAPQPNIVIILADDMGWGDLGANWAETKDTTNLDKM 67

Query: 92  AYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
           A  G+   +++     C+PSR++++TG+  +  G+ HN       GGLP++E  L + L+
Sbjct: 68  ASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHNFAV-TSVGGLPVNETTLAEVLR 126

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGY-WTGHQDYFDHSA----------EEM 199
           + GY T ++GKWHLG +   Y P FRGF+ + G  ++      D             +  
Sbjct: 127 QEGYVTAMIGKWHLG-HHGSYHPNFRGFDYYFGIPYSNDMGCTDAPGYNYPPCPACPQRD 185

Query: 200 KMW---GLDMRRD-----------LEPAWDLHGKYSTDVFTAEAVDIIHNHSTD-EPLFL 244
            +W   G D   D           +E   +L G      +   AV+ I   ST   P  L
Sbjct: 186 GLWRNPGRDCYTDVALPLYENLNIVEQPVNLSGL--AQKYAERAVEFIEQASTSGRPFLL 243

Query: 245 YLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQ 301
           Y+  A  H   S  P  PL  P             ++S + A L ++D  VG++ + ++ 
Sbjct: 244 YVGLAHMHVPLSVTP--PLAHPQ------------RQSLYRASLREMDSLVGQIKDKVDH 289

Query: 302 RRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGV----------KNTLWEGGVRGAGLI 351
                N+++ F  DNG  A    L A S  P  G+          K T WEGG R   L 
Sbjct: 290 VAR-ENTLLWFTGDNGPWAQKCEL-AGSVGPFFGLWQTHQGGSPTKQTTWEGGHRVPALA 347

Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
           + P      + +   + + D  PT+++ A  S  PN
Sbjct: 348 YWPGRVPANVTSTALLSLLDIFPTVIALAGASLPPN 383


>sp|P08842|STS_HUMAN Steryl-sulfatase OS=Homo sapiens GN=STS PE=1 SV=2
          Length = 583

 Score =  119 bits (298), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 158/388 (40%), Gaps = 68/388 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II ++ADDLG  D G +G   I TPNID LA  G+ L  +     LCTPSR+A MTG+
Sbjct: 27  PNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGR 86

Query: 119 HPIHTGMQH-----NVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT- 172
           +P+ +GM         L+    GGLP  E    + LK+ GY T ++GKWHLG      T 
Sbjct: 87  YPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTD 146

Query: 173 ----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
               P   GF    G      +  D    E  ++    +R +     + G     +    
Sbjct: 147 FCHHPLHHGFNYFYG--ISLTNLRDCKPGEGSVFTTGFKRLVFLPLQIVGVTLLTLAALN 204

Query: 229 AVDIIH-------NHSTDEPLFLYLAHAATHSANP--------YEPLQAPDHYLN----- 268
            + ++H       +      L L L     H   P        YE +Q P  Y N     
Sbjct: 205 CLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQQPMSYDNLTQRL 264

Query: 269 ----------------------IHRHIEDFKRSKFAA---------ILHKLDESVGKVVE 297
                                 +H H   F    FA           + ++D SVG+++ 
Sbjct: 265 TVEAAQFIQRNTETPFLLVLSYLHVHTALFSSKDFAGKSQHGVYGDAVEEMDWSVGQILN 324

Query: 298 ALEQRRMLSNSIIVFVSDNGG----AAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWS 353
            L++ R+ ++++I F SD G      ++   ++  SN   +G K   WEGG+R  G++  
Sbjct: 325 LLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKANNWEGGIRVPGILRW 384

Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAAN 381
           P +   G   ++     D  PT+   A 
Sbjct: 385 PRVIQAGQKIDEPTSNMDIFPTVAKLAG 412


>sp|Q32KJ9|ARSG_RAT Arylsulfatase G OS=Rattus norvegicus GN=Arsg PE=2 SV=1
          Length = 526

 Score =  118 bits (296), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 158/363 (43%), Gaps = 50/363 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+I+ ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct: 36  PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             +  G+ HN       GGLPL+E  L + L++ GY T ++GKWHLG +   Y P+FRGF
Sbjct: 96  LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTAMIGKWHLG-HHGSYHPSFRGF 153

Query: 179 ESHLGYW----TGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV---------- 224
           + + G       G  D   ++            R   P  D +   +  +          
Sbjct: 154 DYYFGIPYSNDMGCTDNPGYNYPPCPACPQSDGRWRNPDRDCYTDVALPLYENLNIVEQP 213

Query: 225 ---------FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
                    +   AV+ I   ST   P  LY+  A  H      P  A      ++R   
Sbjct: 214 VNLSGLAQKYAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTPPLANPQSQRLYR--- 270

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
                   A L ++D  VG++ + ++      N+++ F  DNG  A    L A S  P  
Sbjct: 271 --------ASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNGPWAQKCEL-AGSMGPFS 320

Query: 335 GV----------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
           G+          K T WEGG R   L + P      + +   + + D  PT+++ A  S 
Sbjct: 321 GLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAGASL 380

Query: 385 IPN 387
            PN
Sbjct: 381 PPN 383


>sp|Q9C0V7|YHJ2_SCHPO Uncharacterized sulfatase PB10D8.02c OS=Schizosaccharomyces pombe
           (strain 972 / ATCC 24843) GN=SPBPB10D8.02c PE=3 SV=1
          Length = 554

 Score =  118 bits (296), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/454 (24%), Positives = 182/454 (40%), Gaps = 104/454 (22%)

Query: 55  ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
           A S  P+ + I+ADDLGW+DV   G  +I TPNI+ LA  G+ L N++T   C+P+RS +
Sbjct: 7   AESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSPTRSML 65

Query: 115 MTGK--HPIHTGMQHNVLYGCER--GGLP-----LSEKI--LPQYLKELGYRTRIVGKWH 163
           ++G   H    G     +    +  GG P     L++++  LP+ L+E GY T + GKWH
Sbjct: 66  LSGTDNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTTMSGKWH 125

Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW--------- 214
           LG     Y P+ RGF+       G  ++F +     +   +     L P +         
Sbjct: 126 LGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPF---LPPLYTHNHDPVDH 181

Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATH--------------------- 252
             L   YS++ F  + +D + N    +  F YL   A H                     
Sbjct: 182 KSLKNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTAPHWPLQSPKEYINKYRGRYSEGP 241

Query: 253 --------------SANPYEPLQAP---------DHYLNIHRHIEDFKRSKFAAILHKLD 289
                            P   + AP         D      +         +AA++  LD
Sbjct: 242 DVLRKNRLQAQKDLGLIPENVIPAPVDGMGTKSWDELTTEEKEFSARTMEVYAAMVELLD 301

Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDNGGAAA---------------------------- 321
            ++G+V++ L+    L N+ ++F+SDNG   +                            
Sbjct: 302 LNIGRVIDYLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTKPPVKYFDNSLENLGNYN 361

Query: 322 -----GFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
                G     A+  P R  K  + EGG+R   +I  P L    I+++++V V D LPT+
Sbjct: 362 SFIWYGPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTI 421

Query: 377 LSAANKSDIPNYVNSTVENIIPRYENSILRYENG 410
           L  A     P +     + +IPR +  I  + +G
Sbjct: 422 LELAEVPH-PGHKFQGRDVVIPRGKPWIDHFVHG 454


>sp|Q96EG1|ARSG_HUMAN Arylsulfatase G OS=Homo sapiens GN=ARSG PE=1 SV=1
          Length = 525

 Score =  117 bits (294), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 157/359 (43%), Gaps = 50/359 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+ + ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct: 36  PNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             +  G+  N       GGLPL+E  L + L++ GY T I+GKWHLG +   Y P FRGF
Sbjct: 96  LGLRNGVTRNFAV-TSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLG-HHGSYHPNFRGF 153

Query: 179 ESHLGYWTGH------QDYFDHSAEEMKMWGLDMRRDLE---------PAWD-------- 215
           + + G    H         ++H        G    R+L+         P ++        
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQP 213

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
           ++       +  +A   I   ST   P  LY+A A  H   P   L A            
Sbjct: 214 VNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPR--------- 264

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
              RS + A L ++D  VG++ + ++   +  N+ + F  DNG  A    L A S  P  
Sbjct: 265 --GRSLYGAGLWEMDSLVGQIKDKVDH-TVKENTFLWFTGDNGPWAQKCEL-AGSVGPFT 320

Query: 335 G----------VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G           K T WEGG R   L + P      + +   + V D  PT+++ A  S
Sbjct: 321 GFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQAS 379


>sp|Q32KH9|ARSG_CANFA Arylsulfatase G OS=Canis familiaris GN=ARSG PE=2 SV=1
          Length = 535

 Score =  116 bits (291), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 155/359 (43%), Gaps = 50/359 (13%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
           P+ + ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct: 36  PNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGR 95

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             +  G+ HN       GGLPL+E  L + L++ GY T ++GKWHLG +   Y P FRGF
Sbjct: 96  LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLG-HHGPYHPNFRGF 153

Query: 179 ESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV-------------- 224
           + + G    H      +            R   P+  L     TDV              
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPRGDRPSRSLERDCYTDVALPLYENLNIVEQP 213

Query: 225 ---------FTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
                    +  +A+  I H  ++  P  LY+  A  H       L A      + R   
Sbjct: 214 VNLSSLAHKYAEKAIQFIQHASASGRPFLLYMGLAHMHVPISRTQLSA------VLR--- 264

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
              R  + A L ++D  VG++ + ++ R    N+ + F  DNG  A    L A S  P  
Sbjct: 265 --GRRPYGAGLREMDSLVGQIKDKVD-RTAKENTFLWFTGDNGPWAQKCEL-AGSVGPFT 320

Query: 335 GV----------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
           G+          K T WEGG R   L + P      + +   + V D  PT+++ A  S
Sbjct: 321 GLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAGAS 379


>sp|P14000|ARS_HEMPU Arylsulfatase OS=Hemicentrotus pulcherrimus PE=1 SV=1
          Length = 551

 Score =  116 bits (290), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 152/324 (46%), Gaps = 38/324 (11%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+++ ++AD +G  D+  +G        ID +A  G+   N Y    +CTPSRSAIMTG+
Sbjct: 52  PNVVLLVADHMGSGDLTSYGHPTQEAGFIDKMAAEGLRFTNGYVGDAVCTPSRSAIMTGR 111

Query: 119 HPIHTGM--QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
            P+  G   +  V     + GLP SE  + + +KE GY T +VGKWHLG  +   T    
Sbjct: 112 LPVRIGTFGETRVFLPWTKTGLPKSELTIAEAMKEAGYATGMVGKWHLGINENSSTDG-- 169

Query: 177 GFESHLGYWTGHQDYFDHSAEEMKMWGLD---MRRDLEPAWDLH-------------GKY 220
              +HL +  G  D+  H+      W  D   + +D   +   +              K 
Sbjct: 170 ---AHLPFNHGF-DFVGHNLPFTNSWSCDDTGLHKDFPDSQRCYLYVNATLVSQPYQHKG 225

Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
            T +FT +A+  I ++  D P FLY+A A  H++     L + D +    R      R +
Sbjct: 226 LTQLFTDDALGFIEDNHAD-PFFLYVAFAHMHTS-----LFSSDDFSCTSR------RGR 273

Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTL 340
           +   L ++ ++V K+V+ LE+  +  N+II F+SD+ G    +          RG K+  
Sbjct: 274 YGDNLLEMHDAVQKIVDKLEENNISENTIIFFISDH-GPHREYCEEGGDASIFRGGKSHS 332

Query: 341 WEGGVRGAGLIWSPLLESRGIVAE 364
           WEGG R   +++ P   S GI  E
Sbjct: 333 WEGGHRIPYIVYWPGTISPGISNE 356


>sp|P20713|ATSA_ENTAE Arylsulfatase OS=Enterobacter aerogenes GN=atsA PE=1 SV=1
          Length = 464

 Score =  114 bits (285), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 154/340 (45%), Gaps = 65/340 (19%)

Query: 42  LAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           +A  +SM+       A    P++I I+ADD+G++D+   G  +IPTPN+ A+A  G+ + 
Sbjct: 6   MAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMS 64

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----HNVLYGCERGGLPLSEKI--LPQYLKELG 153
            YYT  +  P+RS ++TG      GM     ++   G E   L L++++  + +  K+ G
Sbjct: 65  QYYTSPMSAPARSMLLTGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAG 124

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLE 211
           Y T + GKWHLGF     TP  RGF     +  G   +F+ +     ++ +     RD E
Sbjct: 125 YNTLMAGKWHLGFVPGA-TPKDRGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGE 183

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH------ 265
                   YS++ +  +    I     ++P+F +LA  A     P++PLQAPD       
Sbjct: 184 RVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTA-----PHDPLQAPDEWIKRFK 238

Query: 266 ------YLNIHR-------------------HIEDFKR----------------SKFAAI 284
                 Y  ++R                   H+E  K                   +AA+
Sbjct: 239 GQYEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAM 298

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGF 323
           +  +D  +G ++E L+Q     N+++VF++DNG   A GF
Sbjct: 299 IANMDAQIGTLMETLKQTGRDKNTLLVFLTDNGANPAQGF 338


>sp|Q9X759|ATSA_KLEPN Arylsulfatase OS=Klebsiella pneumoniae GN=atsA PE=1 SV=1
          Length = 577

 Score =  114 bits (285), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 154/340 (45%), Gaps = 65/340 (19%)

Query: 42  LAFTLSMVFVD--LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
           +A  +SM+       A    P++I I+ADD+G++D+   G  +IPTPN+ A+A  G+ + 
Sbjct: 6   MAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFG-GEIPTPNLQAMAEQGMRMS 64

Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQ----HNVLYGCERGGLPLSEKI--LPQYLKELG 153
            YYT  +  P+RS ++TG      GM     ++   G E   L L++++  + +  K+ G
Sbjct: 65  QYYTSPMSAPARSMLLTGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAG 124

Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE--EMKMWGLDMRRDLE 211
           Y T + GKWHLGF     TP  RGF     +  G   +F+ +     ++ +     RD E
Sbjct: 125 YNTLMAGKWHLGFVPGA-TPKERGFNHAFAFMGGGTSHFNDAIPLGTVEAFHTYYTRDGE 183

Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH------ 265
                   YS++ +  +    I     ++P+F +LA  A     P++PLQAPD       
Sbjct: 184 RVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTA-----PHDPLQAPDEWIKRFK 238

Query: 266 ------YLNIHR-------------------HIEDFKR----------------SKFAAI 284
                 Y  ++R                   H+E  K                   +AA+
Sbjct: 239 GQYEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAM 298

Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG-AAAGF 323
           +  +D  +G ++E L+Q     N+++VF++DNG   A GF
Sbjct: 299 IANMDAQIGTLMETLKQTGRDKNTLLVFLTDNGANPAQGF 338


>sp|P50473|ARS_STRPU Arylsulfatase OS=Strongylocentrotus purpuratus PE=2 SV=1
          Length = 567

 Score =  111 bits (278), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 154/330 (46%), Gaps = 50/330 (15%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI-ILKNYYTVQLCTPSRSAIMTGK 118
           P++I +LADD+G  D+  +G        ID +A  G+   + Y    +CTPSRSAI+TG+
Sbjct: 67  PNVILLLADDMGVGDLSVYGHPTQEPGFIDQMANQGLRFTQGYSGDSVCTPSRSAIVTGR 126

Query: 119 HPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE- 170
            PI TG     +YG ER        GLPL E  + + +K  GY T +VGKWHLG  +   
Sbjct: 127 QPIRTG-----VYGEERIFLPWTTTGLPLYEVTIAEAMKGAGYTTGMVGKWHLGINENSS 181

Query: 171 ----YTPTFRGFESHLGY-------W----TG-HQDYFDHSAEEMKMWGLDMRRDLEPAW 214
               + P  RGF+  +G+       W    TG HQD+ D +A  +             A 
Sbjct: 182 SDGAHLPANRGFD-FVGHNLPFGNSWRCDDTGLHQDFPDTNACFL------YYNSTSVAQ 234

Query: 215 DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
               K  T +   + V  I + + ++P F+Y++ A  H++     L + D +    R   
Sbjct: 235 PFQHKGLTQLLRDDTVGFIED-NVNKPFFMYVSFAHMHTS-----LFSSDDFSCTSR--- 285

Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLR 334
              R ++   L ++D+++ ++V  L    +  N++I F SD+G           +N   R
Sbjct: 286 ---RGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVIFFTSDHGPHREYCGEGGDANV-FR 341

Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAE 364
           G K   WEGG R   +++ P   S G+  E
Sbjct: 342 GGKGQSWEGGHRIPYIVYWPGTISPGVSHE 371


>sp|Q5FYA8|ARSH_HUMAN Arylsulfatase H OS=Homo sapiens GN=ARSH PE=2 SV=1
          Length = 562

 Score =  103 bits (258), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 88/322 (27%), Positives = 138/322 (42%), Gaps = 65/322 (20%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
           P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct: 7   PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 66

Query: 119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE 170
           +PI +GM     Y   R        GGLP +E    + L+  GYRT ++GKWHLG     
Sbjct: 67  YPIRSGMVSA--YNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCAS 124

Query: 171 -----YTPTFRGFESHLGYWTGHQDYFD-------HSAEEMKMWGLDMRRDLEPAWDLHG 218
                Y P   GF    G   G             H    +K+W   +   L P   L  
Sbjct: 125 RNDHCYHPLNHGFHYFYGVPFGLLSDCQASKTPELHRWLRIKLWISTVALALVPFLLLIP 184

Query: 219 KYS---------TDVFTAEAV------------------------DIIHNHSTDEPLFLY 245
           K++           VF   A                         +II     +E +   
Sbjct: 185 KFARWFSVPWKVIFVFALLAFLFFTSWYSSYGFTRRWNCILMRNHEIIQQPMKEEKVASL 244

Query: 246 LAHAATHSANPY--EPLQAPDHYLNIH-------RHIEDFKRSKFAAILHKLDESVGKVV 296
           +   A      Y  EP      +L++H       + +   K  ++   + ++D  VGK++
Sbjct: 245 MLKEALAFIERYKREPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKIL 304

Query: 297 EALEQRRMLSNSIIVFVSDNGG 318
           +AL+Q R+ +++++ F SDNGG
Sbjct: 305 DALDQERLANHTLVYFTSDNGG 326


>sp|Q60HH5|ARSE_MACFA Arylsulfatase E OS=Macaca fascicularis GN=ARSE PE=2 SV=1
          Length = 588

 Score =  100 bits (250), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 146/325 (44%), Gaps = 62/325 (19%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
           S+  P+I+ ++ADDLG  D+G +G + + TPNID LA  G+ L  + +   LCTPSR+A 
Sbjct: 34  STSRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93

Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           +TG++P+ +GM  ++ Y   +     GGLP +E    + LKE GY T ++GKWHLG   +
Sbjct: 94  LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153

Query: 170 EYT-----PTFRGFESHLGY---WTGHQDYFDHSAEEMKM-------------------- 201
             +     P   GF+   G      G   +++ S + + +                    
Sbjct: 154 SASDHCHHPLHHGFDHFYGMPFSLMGDCAHWELSEKRVNLEQKLNFLFQVLALVALTLVA 213

Query: 202 ----------WGLDMRRDLEPAWDLHGKYSTDVFTAEA-VDIIHNHSTDE---------P 241
                     W   +   L     L G Y        A   ++ NH+  E         P
Sbjct: 214 GKLTHLIPVSWTPVIWSALWAVLLLTGSYFVGALIVHAGCLLMRNHTITEQPMRFQKTTP 273

Query: 242 LFLYLAHAATHSANPYEPLQAPDHYLNIH---RHIEDFKRSKFAAI----LHKLDESVGK 294
           L L    A+    N + P      +L++H     +E+F       +    + ++D  VG+
Sbjct: 274 LILQEV-ASFLKRNKHGPFLLFVSFLHVHIPLITMENFLGKSLHGLYGDNVEEMDWMVGQ 332

Query: 295 VVEALEQRRMLSNSIIVFVSDNGGA 319
           +++ L+   + ++++I F SD+GG+
Sbjct: 333 ILDTLDMEGLTNSTLIYFTSDHGGS 357


>sp|P51690|ARSE_HUMAN Arylsulfatase E OS=Homo sapiens GN=ARSE PE=1 SV=2
          Length = 589

 Score = 97.8 bits (242), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/139 (38%), Positives = 82/139 (58%), Gaps = 11/139 (7%)

Query: 56  SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
           S+  P+I+ ++ADDLG  D+G +G + + TPNID LA  G+ L  + +   LCTPSR+A 
Sbjct: 34  SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93

Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
           +TG++P+ +GM  ++ Y   +     GGLP +E    + LKE GY T ++GKWHLG   +
Sbjct: 94  LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153

Query: 170 EYT-----PTFRGFESHLG 183
             +     P   GF+   G
Sbjct: 154 SASDHCHHPLHHGFDHFYG 172


>sp|P15589|STS_RAT Steryl-sulfatase OS=Rattus norvegicus GN=Sts PE=1 SV=2
          Length = 577

 Score = 97.1 bits (240), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 56/142 (39%), Positives = 79/142 (55%), Gaps = 12/142 (8%)

Query: 54  VASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSR 111
            A  GP P+ + I+ADDLG  D+G +G   + TP+ID LA  G+ L  +     LCTPSR
Sbjct: 19  AARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCTPSR 78

Query: 112 SAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           +A +TG++P+ +GM  +      L+    GGLP +E    + LK  GY T +VGKWHLG 
Sbjct: 79  AAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWHLGL 138

Query: 167 YKKEYT-----PTFRGFESHLG 183
             +  +     P   GF+  LG
Sbjct: 139 SCQAASDFCHHPGRHGFDRFLG 160



 Score = 47.4 bits (111), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 75/170 (44%), Gaps = 17/170 (10%)

Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
           T    +EA D +   + D P  L+L+    H+A+   P  A     ++H          +
Sbjct: 260 TQRLASEAGDFLRR-NRDTPFLLFLSFMHVHTAHFANPEFAGQ---SLH--------GAY 307

Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAGFNLNA----ASNWPLRGVK 337
              + ++D +VG+V+  L++  + +N+++   SD+G        N      SN   RG K
Sbjct: 308 GDAVEEMDWAVGQVLATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGK 367

Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
              WEGG+R  GL+  P +   G   E+     D  PT+   A  +++P 
Sbjct: 368 ANTWEGGIRVPGLVRWPGVIVPGQEVEEPTSNMDVFPTVARLAG-AELPT 416


>sp|P50427|STS_MOUSE Steryl-sulfatase OS=Mus musculus GN=Sts PE=2 SV=1
          Length = 624

 Score = 93.6 bits (231), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 75/136 (55%), Gaps = 11/136 (8%)

Query: 62  IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHP 120
            + I+ADDLG  D+G +G   + TP++D LA  G+ L  +     LCTPSR+A +TG++P
Sbjct: 37  FLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYP 96

Query: 121 IHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT--- 172
             +GM  +      L+    GGLP SE  + + LK  GY T ++GKWHLG   +  T   
Sbjct: 97  PRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFC 156

Query: 173 --PTFRGFESHLGYWT 186
             P   GF+  LG  T
Sbjct: 157 HHPLRHGFDRFLGVPT 172



 Score = 47.0 bits (110), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 66/159 (41%), Gaps = 17/159 (10%)

Query: 269 IHRHIEDFKRSKFAA-ILH--------KLDESVGKVVEALEQRRMLSNSIIVFVSDNGGA 319
           +H H   F    FA   LH        ++D  VG+V+ AL++  +   +++ F SD+G  
Sbjct: 295 LHVHTAHFADPGFAGRSLHGAYGDSVEEMDWGVGRVLAALDELGLARETLVYFTSDHGAH 354

Query: 320 AAGFNLNA----ASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
                        SN   RG K   WEGGVR   L+  P   S G V  +   + D  PT
Sbjct: 355 VEELGPRGERMGGSNGVFRGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEPTSLMDVFPT 414

Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
           +   A  +++P        +++P       R E   HE+
Sbjct: 415 VARLAG-AELPGDRVIDGRDLMPLLRGDAQRSE---HEF 449


>sp|P54793|ARSF_HUMAN Arylsulfatase F OS=Homo sapiens GN=ARSF PE=1 SV=4
          Length = 590

 Score = 91.7 bits (226), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/116 (43%), Positives = 70/116 (60%), Gaps = 12/116 (10%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+ I+ DDLG  D+G +G D + TP+ID LA  G+ L  + +   LC+PSRSA +TG+
Sbjct: 30  PNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGR 89

Query: 119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLGF 166
           +PI +GM   V  G  R         GLPL+E  L   LK+ GY T ++GKWH G 
Sbjct: 90  YPIRSGM---VSSGNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGL 142



 Score = 38.9 bits (89), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 12/95 (12%)

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
           +   EA+  +  HS  E   L+ +    H+     PL   D +    +H        +  
Sbjct: 266 IMVKEAISFLERHS-KETFLLFFSFLHVHT-----PLPTTDDFTGTSKH------GLYGD 313

Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGG 318
            + ++D  VGK+++A++   + +N+++ F SD+GG
Sbjct: 314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGG 348


>sp|Q32KH8|ARSH_CANFA Arylsulfatase H OS=Canis familiaris GN=ARSH PE=2 SV=1
          Length = 562

 Score = 91.7 bits (226), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 76/136 (55%), Gaps = 12/136 (8%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGK 118
           P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct: 7   PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66

Query: 119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE-- 170
           +PI +GM   +N+  G       GGLP +E    + L+  GYRT ++GKWH G       
Sbjct: 67  YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126

Query: 171 ---YTPTFRGFESHLG 183
              Y P   GF+   G
Sbjct: 127 DHCYHPLNHGFDYFYG 142



 Score = 38.9 bits (89), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 241 PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE 300
           P  L+++    H+     PL   D +      +   K   +   + ++D  VGK++E L+
Sbjct: 260 PFLLFVSFLHVHT-----PLITKDKF------VGHSKYGLYGDNVEEMDWMVGKILETLD 308

Query: 301 QRRMLSNSIIVFVSDNGG 318
           Q R+ +++++ F SDNGG
Sbjct: 309 QERLTNHTLVYFTSDNGG 326


>sp|P51689|ARSD_HUMAN Arylsulfatase D OS=Homo sapiens GN=ARSD PE=1 SV=2
          Length = 593

 Score = 89.0 bits (219), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 68/112 (60%), Gaps = 6/112 (5%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+I+ I+ADDLG  D+G +G + + TPNID LA  G+ L  +     LCTPSR+A +TG+
Sbjct: 41  PNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGR 100

Query: 119 HPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
           H   +GM  +     + +    GGLP +E    + L++ GY T ++GKWH G
Sbjct: 101 HSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQG 152


>sp|Q0TUK6|SULF_CLOP1 Arylsulfatase OS=Clostridium perfringens (strain ATCC 13124 / NCTC
           8237 / Type A) GN=CPF_0221 PE=1 SV=1
          Length = 481

 Score = 88.6 bits (218), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 104/421 (24%), Positives = 175/421 (41%), Gaps = 94/421 (22%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+ I+ D +  + +G +G + I TPN+D +A  G   +N YT V  C  SR++I+TG 
Sbjct: 3   PNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGM 62

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
                G       G E G     E  +     + GY T+ +GK H+  Y +     F   
Sbjct: 63  SQKSHGR-----VGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNI 115

Query: 179 ESHLGYWTGHQ--------------DYF-------DHSAEEMKMWGLDMRRDLEPAW--- 214
             H GY    +              DY         H+ + + + GLD    +   W   
Sbjct: 116 MLHDGYLHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDI-GLDCNSWVSRPWGYE 174

Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            +LH    T+    E++D +      +P FL ++    HS     PL  P  Y ++++  
Sbjct: 175 ENLH---PTNWVVNESIDFLRRKDPSKPFFLKMSFVRPHS-----PLDPPKFYFDMYKD- 225

Query: 274 EDF---------------KRSK---------------------FAAILHKLDESVGKVVE 297
           ED                 R K                     + +I H +D  +G+ + 
Sbjct: 226 EDLPEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITH-IDHQIGRFLI 284

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--- 354
           AL +   L+N+I +FVSD+G      ++    NW  +G+    +EG  R    I+ P   
Sbjct: 285 ALSEYGELNNTIFLFVSDHG------DMMGDHNWFRKGIP---YEGSSRVPFFIYDPGNL 335

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIPRYENSILRYENGTHE 413
           L   +G V ++ + + D +PTLL  A+ S IP+ V   +++N+I    ++   Y +G H 
Sbjct: 336 LKGKKGKVFDEVLELRDIMPTLLDFAHIS-IPDSVEGLSLKNLIEERNSTWRDYIHGEHS 394

Query: 414 Y 414
           +
Sbjct: 395 F 395


>sp|Q8XNV1|SULF_CLOPE Arylsulfatase OS=Clostridium perfringens (strain 13 / Type A)
           GN=CPE0231 PE=3 SV=1
          Length = 481

 Score = 88.2 bits (217), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 103/421 (24%), Positives = 175/421 (41%), Gaps = 94/421 (22%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGK 118
           P+I+ I+ D +  + +G +G + I TPN+D +A  G   +N YT V  C  SR++I+TG 
Sbjct: 3   PNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGM 62

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
                G       G E G     E  +     + GY T+ +GK H+  Y +     F   
Sbjct: 63  SQKSHGR-----VGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHV--YPERNLCGFHNI 115

Query: 179 ESHLGYWTGHQ--------------DYF-------DHSAEEMKMWGLDMRRDLEPAW--- 214
             H GY    +              DY         H+ + + + GLD    +   W   
Sbjct: 116 MLHDGYLHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDI-GLDCNSWVSRPWGYE 174

Query: 215 -DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
            +LH    T+    E++D +      +P FL ++    HS     PL  P  Y ++++  
Sbjct: 175 ENLH---PTNWVVNESIDFLRRRDPSKPFFLKMSFVRPHS-----PLDPPKFYFDMYKD- 225

Query: 274 EDF---------------KRSK---------------------FAAILHKLDESVGKVVE 297
           ED                 R K                     + +I H +D  +G+ + 
Sbjct: 226 EDLPEPLMGDWANKEDEENRGKDINCVKGIINKKALKRAKAAYYGSITH-IDHQIGRFLI 284

Query: 298 ALEQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSP--- 354
           AL +   L+N+I +FVSD+G      ++    NW  +G+    +EG  R    I+ P   
Sbjct: 285 ALSEYGKLNNTIFLFVSDHG------DMMGDHNWFRKGIP---YEGSARVPFFIYDPGNL 335

Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIPRYENSILRYENGTHE 413
           L   +G V ++ + + D +PTLL  A+ S IP+ V   +++++I    ++   Y +G H 
Sbjct: 336 LKGKKGKVFDEVLELRDIMPTLLDFAHIS-IPDSVEGLSLKDLIEERNSTWRDYIHGEHS 394

Query: 414 Y 414
           +
Sbjct: 395 F 395


>sp|P31447|YIDJ_ECOLI Uncharacterized sulfatase YidJ OS=Escherichia coli (strain K12)
           GN=yidJ PE=3 SV=1
          Length = 497

 Score = 76.6 bits (187), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 91/366 (24%), Positives = 148/366 (40%), Gaps = 73/366 (19%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+ +F++ D    N VG +    + T NID+LA  GI   + YT   +CTP+R+ + TG 
Sbjct: 4   PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63

Query: 119 HPIHTG-MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG----FYKKEYTP 173
           +   +G   +NV  G        +   + +Y K+ GY T  +GKWHL     F   E  P
Sbjct: 64  YANQSGPWTNNVAPG-------KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116

Query: 174 TFRGFESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDLHGKYSTDVFT------ 226
                E    YW    +Y     E E+ +W    R  L    DL   +  + FT      
Sbjct: 117 -----EWDADYWFDGANYLSELTEKEISLW----RNGLNSVEDLQANHIDETFTWAHRIS 167

Query: 227 AEAVDIIHNHS-TDEPLFLYLAHAATHS--ANPYEPLQA-PDHYLNIHRHIED------- 275
             AVD +   +  DEP  + +++   H     P E L+   D Y  +    +D       
Sbjct: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227

Query: 276 ------------------FKRSKFAAILHKLDESVGKVVEAL--EQRRMLSNSIIVFVSD 315
                             +    + A    +D+ +G+V+ AL  EQR    N+ +++ SD
Sbjct: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR---ENTWVIYTSD 284

Query: 316 NGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT 375
           +G       L +            +++   R   +I SP  E R +  +  V   D LPT
Sbjct: 285 HGEMMGAHKLISKG--------AAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPT 334

Query: 376 LLSAAN 381
           +++ A+
Sbjct: 335 MMALAD 340


>sp|Q8IWU6|SULF1_HUMAN Extracellular sulfatase Sulf-1 OS=Homo sapiens GN=SULF1 PE=1 SV=1
          Length = 871

 Score = 73.2 bits (178), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 87/375 (23%), Positives = 147/375 (39%), Gaps = 65/375 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct: 43  PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           + +H    HNV    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct: 99  Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
              G+   LG    +  +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
                     P+ + ++HAA H      P                            +Q 
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
               L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D+G     
Sbjct: 264 TGPMLPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQ 323

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
           F L    + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A  
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGL 374

Query: 383 SDIPNYVNSTVENII 397
              P+    +V  ++
Sbjct: 375 DTPPDVDGKSVLKLL 389


>sp|Q8VI60|SULF1_RAT Extracellular sulfatase Sulf-1 OS=Rattus norvegicus GN=Sulf1 PE=1
           SV=1
          Length = 870

 Score = 73.2 bits (178), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 147/368 (39%), Gaps = 66/368 (17%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct: 43  PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           + +H    HNV    E    P    L E +    YL   GYRT   GK+ L  Y   Y P
Sbjct: 99  Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
              G+   LG    +  +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
                     P+ + ++HAA H      P                            +Q 
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
               L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D+G     
Sbjct: 264 TGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQ 323

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
           F L    + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A  
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAG- 373

Query: 383 SDIPNYVN 390
            D P+ V+
Sbjct: 374 LDTPSDVD 381


>sp|Q8K007|SULF1_MOUSE Extracellular sulfatase Sulf-1 OS=Mus musculus GN=Sulf1 PE=2 SV=1
          Length = 870

 Score = 71.2 bits (173), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 140/358 (39%), Gaps = 65/358 (18%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct: 43  PNIILVLTDD---QDVELGSL-QVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           + +H    HNV    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct: 99  Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
              G+   LG    +  +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct: 154 P--GWREWLGL-IKNSRFYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query: 234 HNHST---DEPLFLYLAHAATHSANPYEP----------------------------LQA 262
                     P+ + ++HAA H      P                            +Q 
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQY 263

Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
               L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D+G     
Sbjct: 264 TGPMLPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQ 323

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
           F L    + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A
Sbjct: 324 FGLVKGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIA 372


>sp|P51688|SPHM_HUMAN N-sulphoglucosamine sulphohydrolase OS=Homo sapiens GN=SGSH PE=1
           SV=1
          Length = 502

 Score = 71.2 bits (173), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 93/372 (25%), Positives = 152/372 (40%), Gaps = 87/372 (23%)

Query: 41  PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
           P+    +++ V  +  + P + + +LADD G+ + G +    I TP++DALA   ++ +N
Sbjct: 4   PVPACCALLLVLGLCRARPRNALLLLADDGGF-ESGAYNNSAIATPHLDALARRSLLFRN 62

Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYLKELGYR 155
            +T V  C+PSR++++TG  P H     N +YG  +     +     + LP  L + G R
Sbjct: 63  AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117

Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
           T I+GK H+G   +   P                  FD +  E     L + R++     
Sbjct: 118 TGIIGKKHVG--PETVYP------------------FDFAYTEENGSVLQVGRNITRIKL 157

Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLA----HAATHSANPYE------------- 258
           L  K+                  D P FLY+A    H   HS   Y              
Sbjct: 158 LVRKFL-------------QTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGM 204

Query: 259 ---PLQAPDHYLNIHRHIEDF------KRSKFAA---ILHKLDESVGKVVEALEQRRMLS 306
              P   P  Y  +   +  F       R+  AA    + ++D+ VG V++ L    +L+
Sbjct: 205 GRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLN 264

Query: 307 NSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQ 365
           +++++F SDNG              P    +  L+  G     L+ SP    R G V+E 
Sbjct: 265 DTLVIFTSDNG-------------IPFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEA 311

Query: 366 YVHVSDWLPTLL 377
           YV + D  PT+L
Sbjct: 312 YVSLLDLTPTIL 323


>sp|Q90XB6|SULF1_COTCO Extracellular sulfatase Sulf-1 OS=Coturnix coturnix GN=SULF1 PE=1
           SV=1
          Length = 867

 Score = 70.1 bits (170), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 147/381 (38%), Gaps = 77/381 (20%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
           P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct: 43  PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98

Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
           + +H    HN+    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct: 99  Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query: 174 TFRGFESHLGYWTGHQDYFDHSAEE---MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
              G+   +G    +  +++++       +  G D  +D          Y TD+ T E++
Sbjct: 154 P--GWREWVGL-VKNSRFYNYTISRNGNKEKHGFDYAKD----------YFTDLITNESI 200

Query: 231 DI------IHNHSTDEPLFLYLAHAATHSANPYEP------------------------- 259
           +       I+ H    P+ + ++HAA H      P                         
Sbjct: 201 NYFRMSKRIYPH---RPIMMVISHAAPHGPEDSAPQFSELYPNASQHITPSYNYAPNMDK 257

Query: 260 ---LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDN 316
              +Q     L IH    +  + K    L  +D+S+ ++ + L +   L N+ I++ +D+
Sbjct: 258 HWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADH 317

Query: 317 GGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL 376
           G     F L    + P        ++  +R    I  P +E  G V  Q V   D  PT+
Sbjct: 318 GYHIGQFGLVKGKSMP--------YDFDIRVPFFIRGPSVEP-GSVVPQIVLNIDLAPTI 368

Query: 377 LSAANKSDIPNYVNSTVENII 397
           L  A     P+    +V  ++
Sbjct: 369 LDIAGLDTPPDMDGKSVLKLL 389


>sp|Q21376|SULF1_CAEEL Putative extracellular sulfatase Sulf-1 homolog OS=Caenorhabditis
           elegans GN=sul-1 PE=3 SV=1
          Length = 709

 Score = 68.2 bits (165), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/381 (23%), Positives = 146/381 (38%), Gaps = 67/381 (17%)

Query: 35  MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
           + F ++P+  T S+ FVD        ++I IL DD    D+    +D +P  +       
Sbjct: 16  VLFLIIPIKVT-SIHFVD-----SQHNVILILTDD---QDIELGSMDFMPKTSQIMKERG 66

Query: 95  GIILKNYYTVQLCTPSRSAIMTG----KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLK 150
                 Y T  +C PSRS I+TG     H +HT  Q+    G E   +   +K +  YL+
Sbjct: 67  TEFTSGYVTTPICCPSRSTILTGLYVHNHHVHTNNQNCT--GVEWRKVH-EKKSIGVYLQ 123

Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDL 210
           E GYRT  +GK +L  Y   Y P        +   +   +Y  +S  E + +G +  +D 
Sbjct: 124 EAGYRTAYLGK-YLNEYDGSYIPPGWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKD- 181

Query: 211 EPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEP-------- 259
                    Y TD+ T  ++  I  H      +P  L +++ A H      P        
Sbjct: 182 ---------YFTDLVTNRSLKFIDKHIKIRAWQPFALIISYPAPHGPEDPAPQFAHMFEN 232

Query: 260 --------------------LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEAL 299
                               LQ      ++H    D    +    L  +DE + ++   L
Sbjct: 233 EISHRTGSWNFAPNPDKQWLLQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLL 292

Query: 300 EQRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESR 359
            +   L N+  ++ SD+G     F L       L+G KN  +E  +R    +  P +  R
Sbjct: 293 RELNQLWNTYAIYTSDHGYHLGQFGL-------LKG-KNMPYEFDIRVPFFMRGPGI-PR 343

Query: 360 GIVAEQYVHVSDWLPTLLSAA 380
            +   + V   D  PT+L  A
Sbjct: 344 NVTFNEIVTNVDIAPTMLHIA 364


>sp|Q9VEX0|SULF1_DROME Extracellular sulfatase SULF-1 homolog OS=Drosophila melanogaster
           GN=Sulf1 PE=1 SV=1
          Length = 1114

 Score = 63.9 bits (154), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 146/355 (41%), Gaps = 65/355 (18%)

Query: 60  PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
           P+II IL DD    DV    L+ +P   +  L   G   ++ YT   +C P+RS+++TG 
Sbjct: 54  PNIILILTDD---QDVELGSLNFMPR-TLRLLRDGGAEFRHAYTTTPMCCPARSSLLTGM 109

Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
           + +H  M       C       + +      YL   GYRT   GK+ L  Y   Y P   
Sbjct: 110 Y-VHNHMVFTNNDNCSSPQWQATHETRSYATYLSNAGYRTGYFGKY-LNKYNGSYIPP-- 165

Query: 177 GFESHLGYWTG---HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
           G+      W G   +  Y+++S   + + G  ++   + A D    Y  D+   +++  +
Sbjct: 166 GWRE----WGGLIMNSKYYNYS---INLNGQKIKHGFDYAKD----YYPDLIANDSIAFL 214

Query: 234 HN---HSTDEPLFLYLAHAATH----SANPYEPL-------QAP--DHYLN--------- 268
            +    +  +P+ L ++  A H    SA  Y  L         P  DH  N         
Sbjct: 215 RSSKQQNQRKPVLLTMSFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDKQWILRV 274

Query: 269 ------IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDNGGAAAG 322
                 +H+   +   +K    L  +D +V +V   L++   L N+ IV+ SD+G     
Sbjct: 275 TEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDHGYHLGQ 334

Query: 323 FNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
           F L    ++P        +E  VR   LI  P +++  +V E  ++V D  PT L
Sbjct: 335 FGLIKGKSFP--------FEFDVRVPFLIRGPGIQASKVVNEIVLNV-DLAPTFL 380


>sp|Q8BFR4|GNS_MOUSE N-acetylglucosamine-6-sulfatase OS=Mus musculus GN=Gns PE=2 SV=1
          Length = 544

 Score = 63.5 bits (153), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 152/366 (41%), Gaps = 63/366 (17%)

Query: 54  VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYT-VQLCTPSR 111
           V ++  P+++ +L DD    D    G+   P     AL    G+   + Y    LC PSR
Sbjct: 33  VGAARRPNVLLLLTDD---QDAELGGM--TPLKKTKALIGEKGMTFSSAYVPSALCCPSR 87

Query: 112 SAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEK--ILPQYLKEL-GYRTRIVGKWHLGFY 167
           ++I+TGK+P +  + +N L G C        ++    P  LK + GY+T   GK     Y
Sbjct: 88  ASILTGKYPHNHHVVNNTLEGNCSSKAWQKIQEPYTFPAILKSVCGYQTFFAGK-----Y 142

Query: 168 KKEYTPTFRGFESHL----GYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD 223
             EY     G   H+     YW   +    +    + + G   +     + D    Y TD
Sbjct: 143 LNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTD 198

Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHS---ANP-----YEPLQAP-DHYLNIH---- 270
           V    ++D +   S  EP F+ ++  A HS   A P     ++ + AP +   NIH    
Sbjct: 199 VLANLSLDFLDYKSNSEPFFMMISTPAPHSPWTAAPQYQKAFQNVIAPRNKNFNIHGTNK 258

Query: 271 ----------------RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
                           R ++D  R ++  +L  +D+ V K+V+ L+    L N+ I + S
Sbjct: 259 HWLIRQAKTPMTNSSIRFLDDAFRRRWQTLL-SVDDLVEKLVKRLDSTGELDNTYIFYTS 317

Query: 315 DNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLP 374
           DNG     F+L      P+   K  L+E  ++   L+  P ++     ++  V   D  P
Sbjct: 318 DNGYHTGQFSL------PID--KRQLYEFDIKVPLLVRGPGIKPNQ-TSKMLVSNIDLGP 368

Query: 375 TLLSAA 380
           T+L  A
Sbjct: 369 TILDLA 374


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.314    0.133    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 383,720,120
Number of Sequences: 539616
Number of extensions: 18899222
Number of successful extensions: 360465
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1474
Number of HSP's successfully gapped in prelim test: 1060
Number of HSP's that attempted gapping in prelim test: 202327
Number of HSP's gapped (non-prelim): 66834
length of query: 905
length of database: 191,569,459
effective HSP length: 127
effective length of query: 778
effective length of database: 123,038,227
effective search space: 95723740606
effective search space used: 95723740606
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (30.0 bits)