BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy12301
MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE
TINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN
NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAI
GKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW
DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT
DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS
NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL
NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS
WKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR
STRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH
RRTLVPQSHEQPDLVQADPKRFNDTWSPWIYR

High Scoring Gene Products

Symbol | Full name | Information | P value
CG8646 | | protein from Drosophila melanogaster | 2.1e-111
CG7408 | | protein from Drosophila melanogaster | 4.3e-92
CG32191 | | protein from Drosophila melanogaster | 3.9e-87
CG7402 | | protein from Drosophila melanogaster | 3.0e-85
Arsb | arylsulfatase B | gene from Rattus norvegicus | 1.5e-79
Arsb | arylsulfatase B | protein from Mus musculus | 1.4e-78
ARSB | ARSB protein | protein from Bos taurus | 1.1e-76
ARSB | Arylsulfatase B | protein from Homo sapiens | 1.4e-76
arsb | Arylsulfatase B | protein from Canis lupus familiaris | 7.5e-76
ARSB | Uncharacterized protein | protein from Gallus gallus | 2.3e-68
ARSJ | Uncharacterized protein | protein from Bos taurus | 1.4e-54
Arsj | arylsulfatase family, member J | gene from Rattus norvegicus | 2.3e-54
Arsj | arylsulfatase J | protein from Mus musculus | 2.9e-54
arsj | Uncharacterized protein | protein from Canis lupus familiaris | 3.7e-54
ARSJ | Arylsulfatase J | protein from Homo sapiens | 6.1e-54
Arsi | arylsulfatase family, member I | gene from Rattus norvegicus | 9.9e-54
ARSI | Arylsulfatase I | protein from Homo sapiens | 2.6e-53
ARSI | Arylsulfatase I | protein from Canis lupus familiaris | 3.3e-53
Arsi | arylsulfatase i | protein from Mus musculus | 4.2e-53
ARSI | Uncharacterized protein | protein from Bos taurus | 6.9e-53
LOC100517463 | Uncharacterized protein | protein from Sus scrofa | 8.7e-53
ARSI | Uncharacterized protein | protein from Gallus gallus | 1.8e-52
sul-3 | | gene from Caenorhabditis elegans | 9.8e-50
Galns | galactosamine (N-acetyl)-6-sulfate sulfatase | gene from Rattus norvegicus | 3.1e-41
Galns | galactosamine (N-acetyl)-6-sulfate sulfatase | protein from Mus musculus | 8.4e-41
ARSJ | Uncharacterized protein | protein from Gallus gallus | 6.9e-40
GALNS | Uncharacterized protein | protein from Bos taurus | 8.7e-40
ARSJ | Uncharacterized protein | protein from Sus scrofa | 2.9e-38
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Canis lupus familiaris | 4.6e-38
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Canis lupus familiaris | 7.2e-38
ARSJ | Uncharacterized protein | protein from Canis lupus familiaris | 6.6e-37
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Sus scrofa | 4.5e-36
F1RL71 | Uncharacterized protein | protein from Sus scrofa | 1.4e-35
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Homo sapiens | 1.5e-35
GALNS | Uncharacterized protein | protein from Gallus gallus | 2.4e-35
F1S2F1 | Uncharacterized protein | protein from Sus scrofa | 8.1e-35
aslA | arylsulfatase | protein from Escherichia coli K-12 | 2.1e-32
galns | galactosamine (N-acetyl)-6-sulfate sulfatase | gene_product from Danio rerio | 6.5e-31
CPS_2364 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 3.9e-29
STS | Steryl-sulfatase | protein from Homo sapiens | 8.0e-29
ARSH | Uncharacterized protein | protein from Gallus gallus | 1.2e-28
ARSH | Arylsulfatase H | protein from Canis lupus familiaris | 5.7e-28
ARSH | Arylsulfatase H | protein from Canis lupus familiaris | 5.7e-28
arse | Arylsulfatase E | protein from Canis lupus familiaris | 8.2e-28
ARSE | Uncharacterized protein | protein from Bos taurus | 1.2e-27
ARSE | Arylsulfatase E | protein from Homo sapiens | 1.3e-27
ARSE | Arylsulfatase E | protein from Homo sapiens | 1.7e-27
STS | Uncharacterized protein | protein from Bos taurus | 1.1e-26
ARSH | Arylsulfatase H | protein from Homo sapiens | 4.5e-26
ARSH | Uncharacterized protein | protein from Bos taurus | 6.5e-26
arsa | arylsulfatase A | gene_product from Danio rerio | 9.4e-26
SPO_3286 | arylsulfatase | protein from Ruegeria pomeroyi DSS-3 | 9.5e-26
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Sus scrofa | 1.1e-25
STS | Uncharacterized protein | protein from Canis lupus familiaris | 1.4e-25
STS | Uncharacterized protein | protein from Gallus gallus | 1.5e-25
Sts | steroid sulfatase (microsomal), isozyme S | gene from Rattus norvegicus | 1.8e-25
STS | Uncharacterized protein | protein from Canis lupus familiaris | 1.9e-25
Arsa | arylsulfatase A | gene from Rattus norvegicus | 2.4e-25
ARSE | Arylsulfatase E | protein from Homo sapiens | 2.8e-25
ARSA | Arylsulfatase A | protein from Bos taurus | 4.7e-25
Arsa | arylsulfatase A | protein from Mus musculus | 5.6e-25
ARSD | Uncharacterized protein | protein from Gallus gallus | 6.1e-25
ARSA | Arylsulfatase A | protein from Homo sapiens | 9.5e-25
Sts | steroid sulfatase | protein from Mus musculus | 9.8e-25
STS | Uncharacterized protein | protein from Sus scrofa | 1.1e-24
STS | Uncharacterized protein | protein from Sus scrofa | 1.2e-24
Arse | arylsulfatase E (chondrodysplasia punctata 1) | gene from Rattus norvegicus | 2.1e-24
ARSF | Arylsulfatase F | protein from Homo sapiens | 2.4e-24
CPS_2985 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 5.2e-24
ARSF | Uncharacterized protein | protein from Canis lupus familiaris | 6.1e-24
ARSA | Uncharacterized protein | protein from Canis lupus familiaris | 9.3e-24
ARSD | Uncharacterized protein | protein from Canis lupus familiaris | 1.7e-23
CPS_0660 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 1.3e-22
ARSD | Arylsulfatase D | protein from Homo sapiens | 1.4e-22
CPS_3032 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 2.8e-22
ARSG | Uncharacterized protein | protein from Gallus gallus | 5.4e-22
arsh | arylsulfatase H | gene_product from Danio rerio | 1.0e-21
GALNS | N-acetylgalactosamine-6-sulfatase | protein from Homo sapiens | 9.7e-21
LOC100521576 | Uncharacterized protein | protein from Sus scrofa | 1.1e-20
ARSG | Arylsulfatase G | protein from Canis lupus familiaris | 1.2e-20
CPS_2983 | putative arylsulfatase | protein from Colwellia psychrerythraea 34H | 1.4e-20
sts | steroid sulfatase (microsomal), arylsulfatase C, isozyme S | gene_product from Danio rerio | 2.3e-20
CPS_2984 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 3.4e-20
orf19.1608 | | gene_product from Candida albicans | 3.8e-20
Arsg | arylsulfatase G | protein from Mus musculus | 4.8e-20
Arsg | arylsulfatase G | gene from Rattus norvegicus | 5.0e-20
arsg | arylsulfatase G | gene_product from Danio rerio | 6.4e-20
ARSA | Uncharacterized protein | protein from Gallus gallus | 1.1e-19
ARSD | Uncharacterized protein | protein from Sus scrofa | 1.1e-18
ARSG | Arylsulfatase G | protein from Homo sapiens | 1.2e-17
ydeN | putative sulfatase | protein from Escherichia coli K-12 | 4.9e-17
ARSE | Uncharacterized protein | protein from Canis lupus familiaris | 5.0e-17
CPS_2381 | sulfatase family protein | protein from Colwellia psychrerythraea 34H | 6.9e-17

The BLAST search returned 5 gene products that did not match your query constraints. See the full BLAST report below for details.

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy12301
        (632 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

FB|FBgn0033763 - symbol:CG8646 species:7227 "Drosophila m...   863  2.1e-111  2
FB|FBgn0036765 - symbol:CG7408 species:7227 "Drosophila m...   766  4.3e-92   2
FB|FBgn0052191 - symbol:CG32191 species:7227 "Drosophila ...   764  3.9e-87   2
FB|FBgn0036768 - symbol:CG7402 species:7227 "Drosophila m...   853  3.0e-85   1
RGD|2158 - symbol:Arsb "arylsulfatase B" species:10116 "R...   720  1.5e-79   2
MGI|MGI:88075 - symbol:Arsb "arylsulfatase B" species:100...   705  1.4e-78   2
UNIPROTKB|A6QLZ3 - symbol:ARSB "Uncharacterized protein" ...   705  1.1e-76   2
UNIPROTKB|P15848 - symbol:ARSB "Arylsulfatase B" species:...   704  1.4e-76   2
UNIPROTKB|Q32KI4 - symbol:arsb "Arylsulfatase B" species:...   691  7.5e-76   2
UNIPROTKB|F1P099 - symbol:ARSB "Uncharacterized protein" ...   628  2.3e-68   2
UNIPROTKB|F1P095 - symbol:ARSB "Uncharacterized protein" ...   622  1.1e-64   2
UNIPROTKB|F1NT29 - symbol:ARSB "Uncharacterized protein" ...   619  9.9e-64   2
UNIPROTKB|F1P098 - symbol:ARSB "Uncharacterized protein" ...   613  8.1e-60   1
UNIPROTKB|E1BKH3 - symbol:ARSJ "Uncharacterized protein" ...   489  1.4e-54   2
RGD|1307640 - symbol:Arsj "arylsulfatase family, member J...   488  2.3e-54   2
MGI|MGI:2443513 - symbol:Arsj "arylsulfatase J" species:1...   487  2.9e-54   2
UNIPROTKB|Q32KH6 - symbol:arsj "Uncharacterized protein" ...   487  3.7e-54   2
UNIPROTKB|Q5FYB0 - symbol:ARSJ "Arylsulfatase J" species:...   487  6.1e-54   2
RGD|1310242 - symbol:Arsi "arylsulfatase family, member I...   475  9.9e-54   2
UNIPROTKB|Q5FYB1 - symbol:ARSI "Arylsulfatase I" species:...   469  2.6e-53   2
UNIPROTKB|Q32KH7 - symbol:ARSI "Arylsulfatase I" species:...   470  3.3e-53   2
MGI|MGI:2670959 - symbol:Arsi "arylsulfatase i" species:1...   469  4.2e-53   2
UNIPROTKB|E1BIN3 - symbol:ARSI "Uncharacterized protein" ...   467  6.9e-53   2
UNIPROTKB|F1RL69 - symbol:LOC100517463 "Uncharacterized p...   466  8.7e-53   2
UNIPROTKB|F1NQP9 - symbol:ARSI "Uncharacterized protein" ...   465  1.8e-52   2
WB|WBGene00006310 - symbol:sul-3 species:6239 "Caenorhabd...   460  9.8e-50   2
RGD|1565391 - symbol:Galns "galactosamine (N-acetyl)-6-su...   396  3.1e-41   2
MGI|MGI:1355303 - symbol:Galns "galactosamine (N-acetyl)-...   395  8.4e-41   2
UNIPROTKB|F1NH07 - symbol:ARSJ "Uncharacterized protein" ...   350  6.9e-40   2
UNIPROTKB|F1MU84 - symbol:GALNS "Uncharacterized protein"...   392  8.7e-40   2
UNIPROTKB|F1S147 - symbol:ARSJ "Uncharacterized protein" ...   345  2.9e-38   2
UNIPROTKB|Q32KH5 - symbol:GALNS "N-acetylgalactosamine-6-...   379  4.6e-38   2
UNIPROTKB|F1PHF0 - symbol:GALNS "N-acetylgalactosamine-6-...   379  7.2e-38   2
UNIPROTKB|F6PKT4 - symbol:ARSJ "Uncharacterized protein" ...   352  6.6e-37   2
UNIPROTKB|Q8WNQ7 - symbol:GALNS "N-acetylgalactosamine-6-...   374  4.5e-36   2
UNIPROTKB|F1RL71 - symbol:F1RL71 "Uncharacterized protein...   252  1.4e-35   3
UNIPROTKB|P34059 - symbol:GALNS "N-acetylgalactosamine-6-...   374  1.5e-35   2
UNIPROTKB|F1NW57 - symbol:GALNS "Uncharacterized protein"...   371  2.4e-35   2
UNIPROTKB|F1S2F1 - symbol:F1S2F1 "Uncharacterized protein...   380  8.1e-35   1
UNIPROTKB|P25549 - symbol:aslA "arylsulfatase" species:83...   252  2.1e-32   2
ZFIN|ZDB-GENE-070112-1152 - symbol:galns "galactosamine (...   340  6.5e-31   2
TIGR_CMR|CPS_2364 - symbol:CPS_2364 "sulfatase family pro...   328  3.9e-29   2
UNIPROTKB|P08842 - symbol:STS "Steryl-sulfatase" species:...   220  8.0e-29   3
UNIPROTKB|F1NFQ0 - symbol:ARSH "Uncharacterized protein" ...   227  1.2e-28   3
UNIPROTKB|F1NFQ1 - symbol:ARSH "Uncharacterized protein" ...   227  1.7e-28   3
UNIPROTKB|F1PY85 - symbol:ARSH "Arylsulfatase H" species:...   227  5.7e-28   3
UNIPROTKB|Q32KH8 - symbol:ARSH "Arylsulfatase H" species:...   227  5.7e-28   3
UNIPROTKB|Q32KI1 - symbol:arse "Uncharacterized protein" ...   223  8.2e-28   3
UNIPROTKB|G5E629 - symbol:ARSE "Uncharacterized protein" ...   217  1.2e-27   3
UNIPROTKB|P51690 - symbol:ARSE "Arylsulfatase E" species:...   216  1.3e-27   3
UNIPROTKB|F5GYY5 - symbol:ARSE "Arylsulfatase E" species:...   216  1.7e-27   3
UNIPROTKB|F1MFZ8 - symbol:STS "Uncharacterized protein" s...   213  1.1e-26   3
UNIPROTKB|Q5FYA8 - symbol:ARSH "Arylsulfatase H" species:...   216  4.5e-26   3
UNIPROTKB|G3N2T7 - symbol:ARSH "Uncharacterized protein" ...   209  6.5e-26   3
ZFIN|ZDB-GENE-050320-118 - symbol:arsa "arylsulfatase A" ...   229  9.4e-26   3
TIGR_CMR|SPO_3286 - symbol:SPO_3286 "arylsulfatase" speci...   204  9.5e-26   3
UNIPROTKB|F1S6M1 - symbol:GALNS "N-acetylgalactosamine-6-...   298  1.1e-25   1
UNIPROTKB|Q32KK2 - symbol:Arsa "Arylsulfatase A" species:...   205  1.1e-25   3
UNIPROTKB|F1Q1V3 - symbol:STS "Uncharacterized protein" s...   219  1.4e-25   2
UNIPROTKB|F1NGC8 - symbol:STS "Uncharacterized protein" s...   212  1.5e-25   2
RGD|3783 - symbol:Sts "steroid sulfatase (microsomal), is...   197  1.8e-25   3
UNIPROTKB|F1Q1V2 - symbol:STS "Uncharacterized protein" s...   219  1.9e-25   2
RGD|1310381 - symbol:Arsa "arylsulfatase A" species:10116...   205  2.4e-25   3
UNIPROTKB|F5H324 - symbol:ARSE "Arylsulfatase E" species:...   194  2.8e-25   3
UNIPROTKB|Q08DD1 - symbol:ARSA "Arylsulfatase A" species:...   195  4.7e-25   3
MGI|MGI:88077 - symbol:Arsa "arylsulfatase A" species:100...   205  5.6e-25   3
UNIPROTKB|E1BYN0 - symbol:ARSD "Uncharacterized protein" ...   220  6.1e-25   3
UNIPROTKB|P15289 - symbol:ARSA "Arylsulfatase A" species:...   195  9.5e-25   3
MGI|MGI:98438 - symbol:Sts "steroid sulfatase" species:10...   202  9.8e-25   2
UNIPROTKB|I3LBW8 - symbol:STS "Uncharacterized protein" s...   205  1.1e-24   2
UNIPROTKB|K7GLQ3 - symbol:STS "Uncharacterized protein" s...   205  1.2e-24   2
RGD|1304917 - symbol:Arse "arylsulfatase E (chondrodyspla...   210  2.1e-24   2
UNIPROTKB|P54793 - symbol:ARSF "Arylsulfatase F" species:...   193  2.4e-24   3
TIGR_CMR|CPS_2985 - symbol:CPS_2985 "sulfatase family pro...   184  5.2e-24   2
UNIPROTKB|F6PN86 - symbol:ARSF "Uncharacterized protein" ...   210  6.1e-24   3
UNIPROTKB|F6PKZ1 - symbol:ARSA "Uncharacterized protein" ...   191  9.3e-24   3
UNIPROTKB|F1PYB4 - symbol:ARSD "Uncharacterized protein" ...   194  1.7e-23   3
TIGR_CMR|CPS_0660 - symbol:CPS_0660 "sulfatase family pro...   174  1.3e-22   2
UNIPROTKB|P51689 - symbol:ARSD "Arylsulfatase D" species:...   199  1.4e-22   4
TIGR_CMR|CPS_3032 - symbol:CPS_3032 "sulfatase family pro...   183  2.8e-22   2
UNIPROTKB|E1BU03 - symbol:ARSG "Uncharacterized protein" ...   212  5.4e-22   3
ZFIN|ZDB-GENE-081104-120 - symbol:arsh "arylsulfatase H" ...   195  1.0e-21   3
UNIPROTKB|F5H325 - symbol:GALNS "N-acetylgalactosamine-6-...   257  9.7e-21   2
UNIPROTKB|F1RV22 - symbol:ARSG "Uncharacterized protein" ...   272  1.1e-20   1
UNIPROTKB|Q32KH9 - symbol:ARSG "Arylsulfatase G" species:...   193  1.2e-20   2
TIGR_CMR|CPS_2983 - symbol:CPS_2983 "putative arylsulfata...   162  1.4e-20   3
ZFIN|ZDB-GENE-030717-5 - symbol:sts "steroid sulfatase (m...   189  2.3e-20   2
TIGR_CMR|CPS_2984 - symbol:CPS_2984 "sulfatase family pro...   167  3.4e-20   2
CGD|CAL0006319 - symbol:orf19.1608 species:5476 "Candida ...   194  3.8e-20   3
POMBASE|SPBPB10D8.02c - symbol:SPBPB10D8.02c "arylsulfata...   195  4.3e-20   3
MGI|MGI:1921258 - symbol:Arsg "arylsulfatase G" species:1...   194  4.8e-20   2
RGD|1306571 - symbol:Arsg "arylsulfatase G" species:10116...   196  5.0e-20   3
ZFIN|ZDB-GENE-060503-154 - symbol:arsg "arylsulfatase G" ...   191  6.4e-20   2
UNIPROTKB|F1NWF7 - symbol:ARSA "Uncharacterized protein" ...   163  1.1e-19   3
UNIPROTKB|I3LM95 - symbol:ARSD "Uncharacterized protein" ...   146  1.1e-18   3
ASPGD|ASPL0000001694 - symbol:AN6847 species:162425 "Emer...   113  3.2e-18   5
UNIPROTKB|Q96EG1 - symbol:ARSG "Arylsulfatase G" species:...   194  1.2e-17   2
UNIPROTKB|P77318 - symbol:ydeN "putative sulfatase" speci...   240  4.9e-17   1
UNIPROTKB|F1PYB3 - symbol:ARSE "Uncharacterized protein" ...   218  5.0e-17   1
TIGR_CMR|CPS_2381 - symbol:CPS_2381 "sulfatase family pro...   155  6.9e-17   2

WARNING:  Descriptions of 105 database sequences were not reported due to the
          limiting value of parameter V = 100.
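
The one-line hit summaries above follow a regular layout: identifier, " - ", a truncated description, then the High Score, P(N) and N columns. The following is a minimal Python sketch for reading such lines. The names HSP_LINE, parse_summary_line and hits_below are hypothetical helpers introduced here, not part of any BLAST tool, and the filter assumes the "Threshold" parameter is compared against the P(N) column, which this report does not state explicitly.

```python
import re

# One summary line: "<id> - <description>   <score>  <P(N)>  <N>".
# The description is truncated by BLAST ("...") and is kept as-is.
HSP_LINE = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+(?P<score>\d+)\s+"
    r"(?P<p>[\d.]+(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_summary_line(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HSP_LINE.match(line)
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "score": int(m.group("score")),
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }


def hits_below(lines, threshold=0.001):
    """Keep parsed hits whose P(N) is at or below the threshold (assumed meaning)."""
    parsed = (parse_summary_line(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["p_value"] <= threshold]
```

Lines that are not hit summaries (headers, the "Searching...." progress line, blank lines) simply return None and are skipped by hits_below.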


>FB|FBgn0033763
            symbol:CG8646 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE013599 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            KO:K01135 GO:GO:0003943 GeneTree:ENSGT00560000077076 EMBL:AY071072
            RefSeq:NP_610807.3 UniGene:Dm.6132 HSSP:P15848 SMR:Q8SZ72
            STRING:Q8SZ72 EnsemblMetazoa:FBtr0301237 GeneID:36394
            KEGG:dme:Dmel_CG8646 FlyBase:FBgn0033763 InParanoid:Q8SZ72
            OMA:FRGSAQI OrthoDB:EOG4W6MBG GenomeRNAi:36394 NextBio:798315
            Uniprot:Q8SZ72
        Length = 562

 Score = 863 (308.9 bits), Expect = 2.1e-111, Sum P(2) = 2.1e-111
 Identities = 176/370 (47%), Positives = 228/370 (61%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ND+ FHGS EIPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct:    37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPLYRGF SH G+ +G   
Sbjct:    97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQD 156

Query:   212 YYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PX 268
             Y DH         VE N  G DMR     A+D  G Y TD+ T  +V++I +    K P 
Sbjct:   157 YNDHT-------AVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPL 209

Query:   269 XXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
                                 P   + +  +I +  RR +AAMV K+D+SVG ++  L++ 
Sbjct:   210 FLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKS 269

Query:   329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
              MLENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++
Sbjct:   270 NMLENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLK 324

Query:   389 QNPRVSLQMMHISDWLPTLYTAAGGDT--SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             ++ RVS Q MHI DWLPTL  AAGG    S L   IDG   W +L+ +  S R + +  +
Sbjct:   325 KSQRVSNQTMHIIDWLPTLLEAAGGQPALSNLSKQIDGQSIWRALVQDKASPRLNVLHNI 384

Query:   447 DQ-WSSLLLN 455
             D  W S  L+
Sbjct:   385 DDIWGSAALS 394

 Score = 257 (95.5 bits), Expect = 2.1e-111, Sum P(2) = 2.1e-111
 Identities = 66/193 (34%), Positives = 102/193 (52%)

Query:   437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
             S  +  IDG   W +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+
Sbjct:   354 SNLSKQIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGW 413

Query:   497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
             YG        L ++  +  S+  ++L+ L     LP S  D+ R  R  AT+ C    + 
Sbjct:   414 YGPAGERDPRLYDWQLVGRSRAGKALEALKM---LP-SRADQQR-IRAAATVSCPGQSSQ 468

Query:   557 MTPSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
              T    T  + PC LF++ +DPCEQ N+A   P++ + L   L+    T VP S++  D 
Sbjct:   469 GTSCVATAFSAPC-LFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAVPPSNKPAD- 526

Query:   615 VQADPKRFNDTWS 627
              +ADP+ +N TW+
Sbjct:   527 PRADPRFWNYTWT 539

 Score = 69 (29.3 bits), Expect = 1.6e-21, Sum P(2) = 1.6e-21
 Identities = 19/79 (24%), Positives = 30/79 (37%)

Query:     1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PXXXXXXXXXXXXXXXXXXXEAPQ 59
             MR     A+D  G Y TD+ T  +V++I +    K P                     P 
Sbjct:   172 MRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 231

Query:    60 ETINQFQYITDPNRRTYAA 78
               + +  +I +  RR +AA
Sbjct:   232 NDVIKMSHIPNYKRRKFAA 250
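
Each alignment block above opens with a "Score = ..." line and an "Identities = ..." line. The Python sketch below pulls the numbers out of those two lines and recomputes the identity and positive fractions. The function name parse_hsp_header is a hypothetical helper. The conversion P = 1 - exp(-E) is the usual Poisson relation between an Expect value and a P-value; it is included as an assumption about how the two columns relate, since the report itself does not say. For very small values the two are numerically the same, which matches the identical Expect and P figures printed here.

```python
import math
import re

SCORE_LINE = re.compile(
    r"Score = (?P<score>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+)"
)
IDENT_LINE = re.compile(
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
)


def parse_hsp_header(score_line, ident_line):
    """Parse the two header lines of one high-scoring segment pair."""
    s = SCORE_LINE.search(score_line)
    i = IDENT_LINE.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    expect = float(s.group("expect"))
    length = int(i.group("length"))
    return {
        "score": int(s.group("score")),
        "bits": float(s.group("bits")),
        "expect": expect,
        # Assumed Poisson relation; expm1 keeps precision for tiny E.
        "p_from_expect": -math.expm1(-expect),
        "identity_fraction": int(i.group("ident")) / length,
        "positive_fraction": int(i.group("pos")) / length,
    }
```

For the first segment pair of CG8646, 176/370 is about 0.476 while the report prints 47%, so the printed percentages appear to be truncated rather than rounded.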


>FB|FBgn0036765
            symbol:CG7408 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0042742 "defense response to bacterium" evidence=IMP]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 GO:GO:0042742 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0003943
            GeneTree:ENSGT00560000077076 HSSP:P15289 FlyBase:FBgn0036765
            RefSeq:NP_001163462.1 RefSeq:NP_001163463.1 RefSeq:NP_649020.1
            UniGene:Dm.13634 EnsemblMetazoa:FBtr0075142
            EnsemblMetazoa:FBtr0300281 EnsemblMetazoa:FBtr0300282 GeneID:39991
            KEGG:dme:Dmel_CG7408 UCSC:CG7408-RB InParanoid:Q9VVM1 OMA:TRENERD
            GenomeRNAi:39991 NextBio:816442 Uniprot:Q9VVM1
        Length = 585

 Score = 766 (274.7 bits), Expect = 4.3e-92, Sum P(2) = 4.3e-92
 Identities = 163/361 (45%), Positives = 216/361 (59%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+SF GSN   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct:    46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P G+PL E  + E  RE GY T  +GKWHLG  +R +TP  RGF+ H GYL   + 
Sbjct:   106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGYLGAYVD 165

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPX 268
             YY      Q       NGHD R +L +  D VG Y TDL T  AV+ IED   +   +P 
Sbjct:   166 YYTQSYEQQNKG---YNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPL 222

Query:   269 XXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
                               +AP E +++F+YI++   R YAAMV +LD SVG+VI AL R+
Sbjct:   223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQ 282

Query:   329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
              ML+NSII+F+SDNG PT     T+      SNYP RG KN+ WEG ++  A +WS + +
Sbjct:   283 EMLQNSIILFLSDNGGPTQGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337

Query:   389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             +   V  Q ++I D LPTL  AAG   S  P L++DGL+ WS+L     S     +  +D
Sbjct:   338 RLGSVWKQQIYIGDLLPTLAAAAG--ISPDPALHLDGLNLWSALKYGYESVEREIVHVID 395

Query:   448 Q 448
             +
Sbjct:   396 E 396

 Score = 171 (65.3 bits), Expect = 4.3e-92, Sum P(2) = 4.3e-92
 Identities = 54/198 (27%), Positives = 97/198 (48%)

Query:   442 NIDGLDQWSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTMDGYYGQ 499
             ++DGL+ WS+L     S    ++  IDE   +   +     WK++ GT   G  DG+ G 
Sbjct:   369 HLDGLNLWSALKYGYESVEREIVHVIDEDVAEPHLSYTRGKWKVISGTTNQGLYDGWLGH 428

Query:   500 TRSNKV-P-LLNFNAIVESKT-YQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
               +++V P  + +  +V + + +  LQQ+S        NI ++R    Q+ I C   P P
Sbjct:   429 RETSEVDPRAVEYEELVRNTSVWLQLQQVS----FGERNISELRD---QSRIEC---PDP 478

Query:   557 MTP-SPCT--NGPCYLFNLGNDPCEQNNIASSRPD--ISSQLYELLKYHRRTLVPQSHEQ 611
              T   PC    GPC LF++  DPCE++N+ +   +  I   L+  ++   +   P +++ 
Sbjct:   479 ATGVKPCLPLEGPC-LFDIEADPCERSNLYAEYQNSTIFLDLWSRIQQFAKQAHPPNNKP 537

Query:   612 PDLVQADPKRFNDTWSPW 629
              D    DP+ +++ W+ W
Sbjct:   538 GD-PNCDPRFYHNEWTWW 554

 Score = 100 (40.3 bits), Expect = 4.9e-15, Sum P(2) = 4.9e-15
 Identities = 26/80 (32%), Positives = 38/80 (47%)

Query:     2 RRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPXXXXXXXXXXXXXXXXXXXEAP 58
             R +L +  D VG Y TDL T  AV+ IED   +   +P                   +AP
Sbjct:   184 RDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAP 243

Query:    59 QETINQFQYITDPNRRTYAA 78
              E +++F+YI++   R YAA
Sbjct:   244 AEEVSRFEYISNKTHRYYAA 263

 Score = 42 (19.8 bits), Expect = 5.0e-09, Sum P(2) = 5.0e-09
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query:   333 NSIIIFMSDNGAPTVEYRETSNYRNWGSN-YPYRGV-KNTLWEGGVKVPA 380
             N III   D G   V +R ++N+     +   Y GV  N L+   +  P+
Sbjct:    36 NIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPS 85


>FB|FBgn0052191
            symbol:CG32191 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0003943 HSSP:P08842
            RefSeq:NP_730304.2 UniGene:Dm.15184 ProteinModelPortal:Q8IQS4
            SMR:Q8IQS4 MINT:MINT-943884 PRIDE:Q8IQS4 GeneID:317903
            KEGG:dme:Dmel_CG32191 UCSC:CG32191-RA FlyBase:FBgn0052191
            InParanoid:Q8IQS4 OrthoDB:EOG43FFBZ PhylomeDB:Q8IQS4
            GenomeRNAi:317903 NextBio:844132 Bgee:Q8IQS4 Uniprot:Q8IQS4
        Length = 554

 Score = 764 (274.0 bits), Expect = 3.9e-87, Sum P(2) = 3.9e-87
 Identities = 157/341 (46%), Positives = 202/341 (59%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct:    38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                EP  + L    +PE  +E GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct:    98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPXXX 270
             Y+    S        L G+D RRN+       G Y TDL T EA +LI+D    ++P   
Sbjct:   158 YFQR-RSKMPVANYSL-GYDFRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQPLFL 215

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP+E I +F YI DPNRR YAAM+ KLD SVG +I+AL     
Sbjct:   216 MLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIITALSSTDQ 275

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS  +Q  
Sbjct:   276 LENSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSGLQAR 330

Query:   391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
               +  Q ++++DWLPTL  AA  +     L +DG+D W  L
Sbjct:   331 GSIFRQPLYVADWLPTLSRAADIELDS-SLKLDGIDLWPEL 370

 Score = 127 (49.8 bits), Expect = 5.7e-13, Sum P(2) = 5.7e-13
 Identities = 30/78 (38%), Positives = 38/78 (48%)

Query:     2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPXXXXXXXXXXXXXXXXXXXEAPQE 60
             RRN+       G Y TDL T EA +LI+D    ++P                   +AP+E
Sbjct:   177 RRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEE 236

Query:    61 TINQFQYITDPNRRTYAA 78
              I +F YI DPNRR YAA
Sbjct:   237 EIQKFSYIKDPNRRKYAA 254

 Score = 126 (49.4 bits), Expect = 3.9e-87, Sum P(2) = 3.9e-87
 Identities = 43/188 (22%), Positives = 82/188 (43%)

Query:   443 IDGLDQWSSLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
             +DG+D W  L    + P     +L  +D+  R +A+++  WK V GT  +G  D      
Sbjct:   361 LDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTASGRYDSVLTYR 420

Query:   501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
               + +   +    V  +   + + LS+     ++   ++  TR+ A + CG      + +
Sbjct:   421 ELDDLDPRDSRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCG--DLQRSCN 477

Query:   561 PCTNGPCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
             P     C L+++ +DPCEQNN+  S    D+ + L   ++   R    +   +  + +AD
Sbjct:   478 PLLE-EC-LYDILSDPCEQNNLVYSERHSDVLTALRRRVQ-ELRASASRPGNRASMPEAD 534

Query:   619 PKRFNDTW 626
             P      W
Sbjct:   535 PTLHTCAW 542


>FB|FBgn0036768
            symbol:CG7402 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 KO:K01135 GO:GO:0003943
            GeneTree:ENSGT00560000077076 HSSP:P15848 RefSeq:NP_649023.1
            UniGene:Dm.13635 ProteinModelPortal:Q9VVM4 STRING:Q9VVM4
            PRIDE:Q9VVM4 EnsemblMetazoa:FBtr0075143 GeneID:39994
            KEGG:dme:Dmel_CG7402 UCSC:CG7402-RA FlyBase:FBgn0036768
            InParanoid:Q9VVM4 OMA:LYWAGPG PhylomeDB:Q9VVM4 GenomeRNAi:39994
            NextBio:816457 ArrayExpress:Q9VVM4 Bgee:Q9VVM4 Uniprot:Q9VVM4
        Length = 579

 Score = 853 (305.3 bits), Expect = 3.0e-85, P = 3.0e-85
 Identities = 197/459 (42%), Positives = 255/459 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TGKYPIHTGMQ   I
Sbjct:    39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                EP G+P  ER +PE  R+ GYST  +GKWHLGF+R++ TP  RGF+ HFGY NG I 
Sbjct:    99 ITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGYYNGYID 158

Query:   212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPX 268
             YYDH   +L   YS      G D RR+L    +  G YAT+ FT EA ++IE     KP 
Sbjct:   159 YYDHQVRMLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPL 213

Query:   269 XXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
                               +AP+E + +F +I DP RRTYA M+  LD SV   I AL+  
Sbjct:   214 FMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKDN 273

Query:   329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
             GML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG++    LWSP ++
Sbjct:   274 GMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLK 328

Query:   389 QNPRVSLQMMHISDWLPTLYTAAGGDTSR-LPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             +   VS Q +H  DWLPTL  AAG    + LPL  DG++ W  L  N   +  + I  LD
Sbjct:   329 ERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPL--DGINLWPMLSGNEEPKPRTMIHVLD 386

Query:   448 Q---WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQ--TRS 502
             +   +SS + +T        +N    K     R D W   L T E+  +   Y Q    S
Sbjct:   387 EVFGYSSYMRDTLK-----YVNGSSFKG----RYDQWLGELETNEDDPLGESYEQHVLAS 437

Query:   503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
             +   LL    + + +  Q   + ++    PI   + + S
Sbjct:   438 DVQSLLGNRGLTKDRIRQMRSEATETC-PPIEGQNPLES 475

 Score = 160 (61.4 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 52/193 (26%), Positives = 86/193 (44%)

Query:   443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
             +DG++ W  L  N   +  +++  +DE    ++   D+ K V G+   G  D + G+  +
Sbjct:   361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELET 420

Query:   503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----GANPAPMT 558
             N+   L   +  +      +Q L  N  L     D++R  R +AT  C    G NP   +
Sbjct:   421 NEDDPLG-ESYEQHVLASDVQSLLGNRGL---TKDRIRQMRSEATETCPPIEGQNPLE-S 475

Query:   559 PSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
                C     PC+ F+L  DPCE+ N+A   P    QL + L+  R+T +P +       +
Sbjct:   476 HFKCEPLKAPCF-FDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSR 534

Query:   617 ADPKRFNDTWSPW 629
             A+P   N  W  W
Sbjct:   535 ANPTFHNGNWEWW 547

 Score = 124 (48.7 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 27/76 (35%), Positives = 37/76 (48%)

Query:     2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQET 61
             RR+L    +  G YAT+ FT EA ++IE     KP                   +AP+E 
Sbjct:   178 RRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEE 237

Query:    62 INQFQYITDPNRRTYA 77
             + +F +I DP RRTYA
Sbjct:   238 VAKFPHIRDPKRRTYA 253


>RGD|2158
            symbol:Arsb "arylsulfatase B" species:10116 "Rattus norvegicus"
          [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
          evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO;TAS]
          [GO:0005739 "mitochondrion" evidence=IDA] [GO:0005764 "lysosome"
          evidence=IDA] [GO:0005791 "rough endoplasmic reticulum" evidence=IDA]
          [GO:0005794 "Golgi apparatus" evidence=IDA] [GO:0006914 "autophagy"
          evidence=IDA] [GO:0007417 "central nervous system development"
          evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
          [GO:0008152 "metabolic process" evidence=ISO] [GO:0008484 "sulfuric
          ester hydrolase activity" evidence=IDA] [GO:0009268 "response to pH"
          evidence=IDA] [GO:0043627 "response to estrogen stimulus"
          evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
          [GO:0051597 "response to methylmercury" evidence=IDA]
          InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
          RGD:2158 GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
          GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764 GO:GO:0009268
          GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
          eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
          GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
          KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
          GeneTree:ENSGT00560000077076 EMBL:AABR03012149 EMBL:AABR03015281
          EMBL:AABR03016930 EMBL:AABR03021723 EMBL:D49434 EMBL:BN000736
          IPI:IPI00198405 PIR:I54210 RefSeq:NP_254278.1 UniGene:Rn.94004
          ProteinModelPortal:P50430 SMR:P50430 IntAct:P50430 STRING:P50430
          PRIDE:P50430 Ensembl:ENSRNOT00000014860 GeneID:25227 KEGG:rno:25227
          UCSC:RGD:2158 InParanoid:P50430 OMA:ALMTARY NextBio:605779
          ArrayExpress:P50430 Genevestigator:P50430
          GermOnline:ENSRNOG00000011150 Uniprot:P50430
        Length = 528

 Score = 720 (258.5 bits), Expect = 1.5e-79, Sum P(2) = 1.5e-79
 Identities = 167/430 (38%), Positives = 238/430 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IH G+Q   I
Sbjct:    51 GWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLI 109

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  VPL E+ LP+ L++ GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   110 MTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 169

Query:   212 YYDHILSDQYSRTVE-LNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
             YY H    +    +E LNG     D+R     A +    Y+T++FTK A  LI + P +K
Sbjct:   170 YYTH----EACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEK 225

Query:   267 PXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
             P                   + P+E +  + +I D +RR YA MV  LD++VG V  AL+
Sbjct:   226 PLFLYLAFQSVHDPL-----QVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALK 280

Query:   327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
              +G+  N+++IF +DNG  T         R+ G+N+P RG K TLWEGG++    + SP 
Sbjct:   281 SRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRGRKGTLWEGGIRGAGFVASPL 331

Query:   387 IQQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNS---N 442
             ++Q    S ++MHI+DWLPTL   AGG T    PL  DG D W ++   +PS R     N
Sbjct:   332 LKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPL--DGFDVWETISEGSPSPRVELLLN 389

Query:   443 IDGLDQWSSLLL----NTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGY 496
             ID  D +  L       TP + +S  +       +  A +R  +WKL+ G    G     
Sbjct:   390 IDP-DFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRYKNWKLLTGYPGCGYWFPP 448

Query:   497 YGQTRSNKVP 506
               Q+  ++VP
Sbjct:   449 PSQSNISEVP 458

 Score = 98 (39.6 bits), Expect = 1.5e-79, Sum P(2) = 1.5e-79
 Identities = 24/75 (32%), Positives = 37/75 (49%)

Query:   556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
             P   SP      +LF++  DP E+++++   P I   L   L+Y+    VP S+  P   
Sbjct:   458 PSVDSPTKT--LWLFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVP-SYFPPLDP 514

Query:   616 QADPKRFNDTWSPWI 630
             + DPK     WSPW+
Sbjct:   515 RCDPKG-TGVWSPWM 528

 Score = 81 (33.6 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 19/63 (30%), Positives = 30/63 (47%)

Query:    15 YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
             Y+T++FTK A  LI + P +KP                   + P+E +  + +I D +RR
Sbjct:   205 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVHDPL-----QVPEEYMEPYDFIQDKHRR 259

Query:    75 TYA 77
              YA
Sbjct:   260 IYA 262


>MGI|MGI:88075
            symbol:Arsb "arylsulfatase B" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0003943
            "N-acetylgalactosamine-4-sulfatase activity" evidence=IEA]
            [GO:0004065 "arylsulfatase activity" evidence=IDA] [GO:0005739
            "mitochondrion" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0005791 "rough endoplasmic reticulum" evidence=ISO] [GO:0005794
            "Golgi apparatus" evidence=ISO] [GO:0006914 "autophagy"
            evidence=ISO] [GO:0007417 "central nervous system development"
            evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IDA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
            pH" evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0043627 "response to estrogen stimulus" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0051597 "response
            to methylmercury" evidence=ISO] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88075
            GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
            GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764
            GO:GO:0009268 GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0051597 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 CTD:411 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
            EMBL:AK083309 EMBL:AK154098 EMBL:AK158312 EMBL:AC131739
            EMBL:AC136976 EMBL:M82877 EMBL:X92096 EMBL:BN000746 IPI:IPI00406459
            IPI:IPI00652358 RefSeq:NP_033842.3 UniGene:Mm.300178
            UniGene:Mm.472255 ProteinModelPortal:P50429 SMR:P50429
            STRING:P50429 PhosphoSite:P50429 PaxDb:P50429 PRIDE:P50429
            DNASU:11881 Ensembl:ENSMUST00000091403 GeneID:11881 KEGG:mmu:11881
            UCSC:uc007rlo.1 UCSC:uc011zcv.1 GeneTree:ENSGT00560000077076
            InParanoid:P50429 SABIO-RK:P50429 NextBio:279911 Bgee:P50429
            CleanEx:MM_ARSB Genevestigator:P50429 GermOnline:ENSMUSG00000042093
            Uniprot:P50429
        Length = 534

 Score = 705 (253.2 bits), Expect = 1.4e-78, Sum P(2) = 1.4e-78
 Identities = 159/408 (38%), Positives = 230/408 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IH G+Q   I
Sbjct:    57 GWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query:   212 YYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             YY H   +  +    LNG     D+R     A +    Y+T++FTK A  +I + P +KP
Sbjct:   176 YYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232

Query:   268 XXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
                                + P+E +  + +I D +RR YA MV  +D++VG V  AL+ 
Sbjct:   233 LFLYLAFQSVHDPL-----QVPEEYMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKS 287

Query:   328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              G+  N++ IF +DNG  T         R+ G+N+P RG K TLWEGG++    + SP +
Sbjct:   288 HGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRGRKGTLWEGGIRGTGFVASPLL 338

Query:   388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             +Q    S ++MHI+DWLPTL   AGG T+   PL  DG + W ++    PS R   +  +
Sbjct:   339 KQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPL--DGFNMWKTISEGHPSPRVELLHNI 396

Query:   447 DQ--WSSLLL---N-TPSRRNSVLINIDEKKRT--AAVRLDSWKLVLG 486
             DQ  +  L     N TP++ +S  +       +  A +R  +WKL+ G
Sbjct:   397 DQDFFDGLPCPGKNMTPAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTG 444

 Score = 104 (41.7 bits), Expect = 1.4e-78, Sum P(2) = 1.4e-78
 Identities = 26/80 (32%), Positives = 41/80 (51%)

Query:   552 ANPAPMTP-SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
             +N + + P  P T    +LF++  DP E+++++   P I   L   L+Y+    VP SH 
Sbjct:   458 SNVSEIPPVGPPTK-TLWLFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVP-SHF 515

Query:   611 QPDLVQADPKRFNDTWSPWI 630
              P   + DPK     WSPW+
Sbjct:   516 PPLDPRCDPKS-TGVWSPWM 534

 Score = 76 (31.8 bits), Expect = 2.3e-05, Sum P(2) = 2.3e-05
 Identities = 18/63 (28%), Positives = 30/63 (47%)

Query:    15 YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
             Y+T++FTK A  +I + P +KP                   + P+E +  + +I D +RR
Sbjct:   211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVHDPL-----QVPEEYMEPYGFIQDKHRR 265

Query:    75 TYA 77
              YA
Sbjct:   266 IYA 268


>UNIPROTKB|A6QLZ3
            symbol:ARSB "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282 KO:K01135
            OMA:WLFDIDR OrthoDB:EOG4DV5M0 GeneTree:ENSGT00560000077076
            EMBL:DAAA02027809 EMBL:DAAA02027810 EMBL:DAAA02027811
            EMBL:DAAA02027812 EMBL:DAAA02027813 EMBL:DAAA02027814
            EMBL:DAAA02027815 EMBL:DAAA02027816 EMBL:DAAA02027817 EMBL:BC148139
            IPI:IPI00710068 RefSeq:NP_001094645.1 UniGene:Bt.35850 SMR:A6QLZ3
            STRING:A6QLZ3 Ensembl:ENSBTAT00000010988 GeneID:538401
            KEGG:bta:538401 InParanoid:A6QLZ3 NextBio:20877344 Uniprot:A6QLZ3
        Length = 533

 Score = 705 (253.2 bits), Expect = 1.1e-76, Sum P(2) = 1.1e-76
 Identities = 159/410 (38%), Positives = 234/410 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GWND+ FHGS  I TP +DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    56 GWNDVGFHGS-AIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query:   212 YYDH---ILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             YY H    L D  + T   L+  D    ++T +  +  Y+T++FT+ A  LI + P +KP
Sbjct:   175 YYSHERCTLIDALNVTRCALDFRD-GEEVATGYKNM--YSTNVFTERATTLITNHPPEKP 231

Query:   268 XXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
                                + P+E +  + +I D NRR YA M   +D++VG V +AL+R
Sbjct:   232 LFLYLALQSVHEPL-----QVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGNVTAALER 286

Query:   328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             +G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP +
Sbjct:   287 RGLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLL 337

Query:   388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             ++    + +++HISDWLPTL   AGG T+   PL  DG D W+++   +PS R   +  +
Sbjct:   338 KRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPL--DGFDVWNTISEGSPSPRMELLHNI 395

Query:   447 DQWSSLLLNTPSRRNSVLINIDEKK-------RT---AAVRLDSWKLVLG 486
             D   + +   P   NS+ +  DE          T   AAVR  +WKL+ G
Sbjct:   396 DP--NFVDTAPCPGNSMALAKDESSLLEYSAFNTSIHAAVRHQNWKLLTG 443

 Score = 86 (35.3 bits), Expect = 1.1e-76, Sum P(2) = 1.1e-76
 Identities = 18/63 (28%), Positives = 34/63 (53%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LF++  DP E+++++   P I  +L   L+++++  VP      D  + DPK     W 
Sbjct:   473 WLFDIDQDPEERHDLSREYPHIVKKLLSRLQFYQKHSVPVYFPAQD-PRCDPKA-TGAWG 530

Query:   628 PWI 630
             PW+
Sbjct:   531 PWM 533

 Score = 80 (33.2 bits), Expect = 0.00077, Sum P(2) = 0.00077
 Identities = 19/63 (30%), Positives = 30/63 (47%)

Query:    15 YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
             Y+T++FT+ A  LI + P +KP                   + P+E +  + +I D NRR
Sbjct:   210 YSTNVFTERATTLITNHPPEKPLFLYLALQSVHEPL-----QVPEEYLKPYDFIQDRNRR 264

Query:    75 TYA 77
              YA
Sbjct:   265 YYA 267


>UNIPROTKB|P15848
            symbol:ARSB "Arylsulfatase B" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0005791 "rough endoplasmic reticulum"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0006914 "autophagy" evidence=IEA] [GO:0007417 "central nervous
            system development" evidence=IEA] [GO:0007584 "response to
            nutrient" evidence=IEA] [GO:0009268 "response to pH" evidence=IEA]
            [GO:0043627 "response to estrogen stimulus" evidence=IEA]
            [GO:0051597 "response to methylmercury" evidence=IEA] [GO:0005764
            "lysosome" evidence=TAS] [GO:0007041 "lysosomal transport"
            evidence=TAS] [GO:0007040 "lysosome organization" evidence=TAS]
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0030204 "chondroitin sulfate metabolic process"
            evidence=TAS] [GO:0030207 "chondroitin sulfate catabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0043687 "post-translational protein modification" evidence=TAS]
            [GO:0044267 "cellular protein metabolic process" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005739
            GO:GO:0005794 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            GO:GO:0005791 GO:GO:0006914 GO:GO:0006644 GO:GO:0007584
            GO:GO:0007417 GO:GO:0007040 GO:GO:0009268 GO:GO:0005788
            EMBL:CH471084 GO:GO:0043627 GO:GO:0043687 GO:GO:0043202
            GO:GO:0007041 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            MIM:272200 GO:GO:0004065 GO:GO:0006687 EMBL:J05225 EMBL:M32373
            EMBL:X72735 EMBL:X72736 EMBL:X72737 EMBL:X72738 EMBL:X72739
            EMBL:X72740 EMBL:X72741 EMBL:X72742 EMBL:AK314903 EMBL:AC020937
            EMBL:AC025755 EMBL:AC099485 EMBL:AC114963 EMBL:BC029051 EMBL:S57777
            IPI:IPI00306576 IPI:IPI00413690 PIR:S35990 RefSeq:NP_000037.2
            RefSeq:NP_942002.1 UniGene:Hs.149103 UniGene:Hs.604199 PDB:1FSU
            PDBsum:1FSU ProteinModelPortal:P15848 SMR:P15848 IntAct:P15848
            STRING:P15848 PhosphoSite:P15848 DMDM:114223 PaxDb:P15848
            PRIDE:P15848 Ensembl:ENST00000264914 Ensembl:ENST00000396151
            Ensembl:ENST00000565165 GeneID:411 KEGG:hsa:411 UCSC:uc003kfq.3
            CTD:411 GeneCards:GC05M078108 HGNC:HGNC:714 HPA:HPA037770
            HPA:HPA037771 MIM:253200 MIM:611542 neXtProt:NX_P15848
            Orphanet:276212 Orphanet:276223 PharmGKB:PA25006
            HOGENOM:HOG000135354 HOVERGEN:HBG004282 InParanoid:P15848 KO:K01135
            OMA:WLFDIDR OrthoDB:EOG4DV5M0 PhylomeDB:P15848
            BioCyc:MetaCyc:HS03665-MONOMER BRENDA:3.1.6.12 ChEMBL:CHEMBL2399
            EvolutionaryTrace:P15848 GenomeRNAi:411 NextBio:1737
            ArrayExpress:P15848 Bgee:P15848 CleanEx:HS_ARSB
            Genevestigator:P15848 GermOnline:ENSG00000113273 GO:GO:0003943
            GO:GO:0030207 Uniprot:P15848
        Length = 533

 Score = 704 (252.9 bits), Expect = 1.4e-76, Sum P(2) = 1.4e-76
 Identities = 159/410 (38%), Positives = 234/410 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct:    56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query:   212 YYDH---ILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             YY H    L D  + T   L+  D    ++T +  +  Y+T++FTK A+ LI + P +KP
Sbjct:   175 YYSHERCTLIDALNVTRCALDFRD-GEEVATGYKNM--YSTNIFTKRAIALITNHPPEKP 231

Query:   268 XXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
                                + P+E +  + +I D NR  YA MV  +D++VG V +AL+ 
Sbjct:   232 LFLYLALQSVHEPL-----QVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKS 286

Query:   328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP +
Sbjct:   287 SGLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLL 337

Query:   388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             +Q    + +++HISDWLPTL   A G T+   PL  DG D W ++   +PS R   +  +
Sbjct:   338 KQKGVKNRELIHISDWLPTLVKLARGHTNGTKPL--DGFDVWKTISEGSPSPRIELLHNI 395

Query:   447 DQWSSLLLNTPSRRNSVLINIDEKK-------RT---AAVRLDSWKLVLG 486
             D   + + ++P  RNS+    D+          T   AA+R  +WKL+ G
Sbjct:   396 DP--NFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 443

 Score = 86 (35.3 bits), Expect = 1.4e-76, Sum P(2) = 1.4e-76
 Identities = 18/63 (28%), Positives = 34/63 (53%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LF++  DP E+++++   P I ++L   L+++ +  VP      D  + DPK     W 
Sbjct:   473 WLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCDPKA-TGVWG 530

Query:   628 PWI 630
             PW+
Sbjct:   531 PWM 533

 Score = 84 (34.6 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 19/63 (30%), Positives = 30/63 (47%)

Query:    15 YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
             Y+T++FTK A+ LI + P +KP                   + P+E +  + +I D NR 
Sbjct:   210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEPL-----QVPEEYLKPYDFIQDKNRH 264

Query:    75 TYA 77
              YA
Sbjct:   265 HYA 267


>UNIPROTKB|Q32KI4
            symbol:arsb "Arylsulfatase B" species:9615 "Canis lupus
            familiaris" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
            KO:K01135 OMA:WLFDIDR OrthoDB:EOG4DV5M0 GO:GO:0003943
            GeneTree:ENSGT00560000077076 EMBL:AAEX03002118 EMBL:AAEX03002119
            EMBL:BN000753 RefSeq:NP_001041598.1 UniGene:Cfa.39080 SMR:Q32KI4
            STRING:Q32KI4 Ensembl:ENSCAFT00000014585 GeneID:610364
            KEGG:cfa:610364 InParanoid:Q32KI4 NextBio:20895924 Uniprot:Q32KI4
        Length = 535

 Score = 691 (248.3 bits), Expect = 7.5e-76, Sum P(2) = 7.5e-76
 Identities = 156/410 (38%), Positives = 231/410 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GW+D+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    58 GWHDVGFHGSR-IRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query:   212 YYDH---ILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             YY H    L D  + T   L+  D    ++T +  +  Y+T++FT+ A  LI + P +KP
Sbjct:   177 YYSHERCTLIDALNVTRCALDFRD-GEEVATGYKNM--YSTNIFTERATALISNHPPEKP 233

Query:   268 XXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
                                + P+E +  + +I D NRR YA MV  +D++VG V +AL+ 
Sbjct:   234 LFLYLALQSVHEPL-----QVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGNVTAALKS 288

Query:   328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              G+  N++ +F +DNG  T+           G+N+P RG K +LWEGGV+    + SP +
Sbjct:   289 HGLWNNTVFVFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLL 339

Query:   388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             ++    S +++HISDWLPTL   AGG T    PL  DG D W ++   +PS R   +  +
Sbjct:   340 KRKGVKSRELVHISDWLPTLVGLAGGSTKGTKPL--DGFDVWRTISEGSPSPRMELLHNI 397

Query:   447 DQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             D   + +  +P    S+    D+              AA+R  +WKL+ G
Sbjct:   398 DP--NFVDISPCPGQSLAPAKDDSSHPGYFSFNTSLHAAIRHGNWKLLTG 445

 Score = 92 (37.4 bits), Expect = 7.5e-76, Sum P(2) = 7.5e-76
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LF++  DP E+++++   P +  QL   L+++ +  VP      D  + DPK     W 
Sbjct:   475 WLFDIDQDPEERHDLSRDHPHVVKQLLSRLQFYHKHSVPVYFPAQD-PRCDPKG-TGAWG 532

Query:   628 PWI 630
             PWI
Sbjct:   533 PWI 535

 Score = 82 (33.9 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 19/63 (30%), Positives = 30/63 (47%)

Query:    15 YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
             Y+T++FT+ A  LI + P +KP                   + P+E +  + +I D NRR
Sbjct:   212 YSTNIFTERATALISNHPPEKPLFLYLALQSVHEPL-----QVPEEYLKPYDFIHDKNRR 266

Query:    75 TYA 77
              YA
Sbjct:   267 YYA 269


>UNIPROTKB|F1P099
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0004065 OMA:WLFDIDR
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00822500 Ensembl:ENSGALT00000038612
            ArrayExpress:F1P099 Uniprot:F1P099
        Length = 527

 Score = 628 (226.1 bits), Expect = 2.3e-68, Sum P(2) = 2.3e-68
 Identities = 154/409 (37%), Positives = 215/409 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR  L+ G Y IHTG+Q   I
Sbjct:    50 GWGDVGWHGS-AIRTPRLDALGAGGVRLKG-YTQPLCTPSRPFLLFGGYYIHTGLQHQII 107

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  +PL E+ LPE L++ GY T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   108 WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 167

Query:   212 YY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXX 269
             YY  DH +  + ++ V     D R     A      Y+T+LFT+ A+ LI +   +KP  
Sbjct:   168 YYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEKPLF 226

Query:   270 XXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
                              E   E +  +  I D  RR YA MV  +D++VG +  AL+  G
Sbjct:   227 LYLAFQSVHEPL-----EVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNLTDALKEYG 281

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             +  N++++F +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++Q
Sbjct:   282 LWNNTVLVFSTDNGGQTMA---------GGNNWPLRGRKWTLWEGGVRGVGFVASPLLKQ 332

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNS---NIDG 445
                 S +++HISDWLPTL   AGG T+   PL  DG D W ++    PS R     NID 
Sbjct:   333 KGVESHELIHISDWLPTLVHLAGGHTNGTKPL--DGFDVWKTISEGRPSPRVELLHNIDP 390

Query:   446 L---D-----QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
             +   D      +S+     P +  + L  I      AA+R   WKL+ G
Sbjct:   391 MFVDDYPCEHSYSNFPSKNPQQHPAYLYFIISVH--AAIRHGKWKLLTG 437

 Score = 84 (34.6 bits), Expect = 2.3e-68, Sum P(2) = 2.3e-68
 Identities = 18/62 (29%), Positives = 31/62 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LF++ +DP E+  ++   P +  +L   L+Y+ +  VP  +   D  Q DP      W 
Sbjct:   467 WLFDIVHDPEEKYELSEKYPHVVKKLLSRLQYYYKRSVPVFYPDED-PQCDPAA-TGVWG 524

Query:   628 PW 629
             PW
Sbjct:   525 PW 526


>UNIPROTKB|F1P095
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00820595 Ensembl:ENSGALT00000038618
            ArrayExpress:F1P095 Uniprot:F1P095
        Length = 407

 Score = 622 (224.0 bits), Expect = 1.1e-64, Sum P(2) = 1.1e-64
 Identities = 140/359 (38%), Positives = 196/359 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR  L+ G Y IHTG+Q   I
Sbjct:    51 GWGDVGWHGS-AIRTPRLDALGAGGVRLKG-YTQPLCTPSRPFLLFGGYYIHTGLQHQII 108

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  +PL E+ LPE L++ GY T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   109 WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 168

Query:   212 YY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXX 269
             YY  DH +  + ++ V     D R     A      Y+T+LFT+ A+ LI +   +KP  
Sbjct:   169 YYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEKPLF 227

Query:   270 XXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
                              E   E +  +  I D  RR YA MV  +D++VG +  AL+  G
Sbjct:   228 LYLAFQSVHEPL-----EVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNLTDALKEYG 282

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             +  N++++F +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++Q
Sbjct:   283 LWNNTVLVFSTDNGGQTMA---------GGNNWPLRGRKWTLWEGGVRGVGFVASPLLKQ 333

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
                 S +++HISDWLPTL   AGG T+   PL  DG D W ++    PS R   +  +D
Sbjct:   334 KGVESHELIHISDWLPTLVHLAGGHTNGTKPL--DGFDVWKTISEGRPSPRVELLHNID 390

 Score = 55 (24.4 bits), Expect = 1.1e-64, Sum P(2) = 1.1e-64
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query:   443 IDGLDQWSSLLLNTPSRRNSVLINID 468
             +DG D W ++    PS R  +L NID
Sbjct:   365 LDGFDVWKTISEGRPSPRVELLHNID 390


>UNIPROTKB|F1NT29
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00582830 Ensembl:ENSGALT00000007062
            ArrayExpress:F1NT29 Uniprot:F1NT29
        Length = 395

 Score = 619 (223.0 bits), Expect = 9.9e-64, Sum P(2) = 9.9e-64
 Identities = 139/351 (39%), Positives = 193/351 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR  L+ G Y IHTG+Q   I
Sbjct:    57 GWGDVGWHGS-AIRTPRLDALGAGGVRLKG-YTQPLCTPSRPFLLFGGYYIHTGLQHQII 114

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  +PL E+ LPE L++ GY T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   115 WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query:   212 YY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXX 269
             YY  DH +  + ++ V     D R     A      Y+T+LFT+ A+ LI +   +KP  
Sbjct:   175 YYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEKPLF 233

Query:   270 XXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
                              E   E +  +  I D  RR YA MV  +D++VG +  AL+  G
Sbjct:   234 LYLAFQSVHEPL-----EVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNLTDALKEYG 288

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             +  N++++F +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++Q
Sbjct:   289 LWNNTVLVFSTDNGGQTMA---------GGNNWPLRGRKWTLWEGGVRGVGFVASPLLKQ 339

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRR 439
                 S +++HISDWLPTL   AGG T+   PL  DG D W ++    PS R
Sbjct:   340 KGVESHELIHISDWLPTLVHLAGGHTNGTKPL--DGFDVWKTISEGRPSPR 388

 Score = 49 (22.3 bits), Expect = 9.9e-64, Sum P(2) = 9.9e-64
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query:   443 IDGLDQWSSLLLNTPSRRNSVLINI 467
             +DG D W ++    PS R  +L NI
Sbjct:   371 LDGFDVWKTISEGRPSPRVELLHNI 395


>UNIPROTKB|F1P098
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00820025 Ensembl:ENSGALT00000038614
            ArrayExpress:F1P098 Uniprot:F1P098
        Length = 388

 Score = 613 (220.8 bits), Expect = 8.1e-60, P = 8.1e-60
 Identities = 137/345 (39%), Positives = 191/345 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR  L+ G Y IHTG+Q   I
Sbjct:    57 GWGDVGWHGS-AIRTPRLDALGAGGVRLKG-YTQPLCTPSRPFLLFGGYYIHTGLQHQII 114

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  +PL E+ LPE L++ GY T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct:   115 WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query:   212 YY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXX 269
             YY  DH +  + ++ V     D R     A      Y+T+LFT+ A+ LI +   +KP  
Sbjct:   175 YYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEKPLF 233

Query:   270 XXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
                              E   E +  +  I D  RR YA MV  +D++VG +  AL+  G
Sbjct:   234 LYLAFQSVHEPL-----EVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNLTDALKEYG 288

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             +  N++++F +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++Q
Sbjct:   289 LWNNTVLVFSTDNGGQTMA---------GGNNWPLRGRKWTLWEGGVRGVGFVASPLLKQ 339

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLL 433
                 S +++HISDWLPTL   AGG T+   PL  DG D W ++ L
Sbjct:   340 KGVESHELIHISDWLPTLVHLAGGHTNGTKPL--DGFDVWKTIRL 382


>UNIPROTKB|E1BKH3
            symbol:ARSJ "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
            EMBL:DAAA02016458 EMBL:DAAA02016459 EMBL:DAAA02016460
            IPI:IPI00825946 RefSeq:XP_002688145.1 RefSeq:XP_611819.3
            UniGene:Bt.87496 ProteinModelPortal:E1BKH3
            Ensembl:ENSBTAT00000023672 GeneID:540514 KEGG:bta:540514
            NextBio:20878676 Uniprot:E1BKH3
        Length = 599

 Score = 489 (177.2 bits), Expect = 1.4e-54, Sum P(2) = 1.4e-54
 Identities = 105/257 (40%), Positives = 144/257 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct:    88 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 146

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct:   147 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 206

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXX 270
             YY H   D       + G+D+  N + AWD   G Y+T ++T+   Q++      KP   
Sbjct:   207 YYTHYKCDSPG----MCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFL 262

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct:   263 YIAYQAVHSPL-----QAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGF 317

Query:   331 LENSIIIFMSDNGA-PT 346
               NSIII+ SDNG  PT
Sbjct:   318 YNNSIIIYSSDNGGQPT 334

 Score = 309 (113.8 bits), Expect = 8.0e-30, Sum P(2) = 8.0e-30
 Identities = 83/279 (29%), Positives = 130/279 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:   220 GYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHSPL-----Q 274

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG    
Sbjct:   275 APGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGG--- 331

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   332 --QPTAG----GSNWPLRGSKGTYWEGGIRAIGFVHSPLLKNKGTVCKELVHITDWYPTL 385

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   386 ISLAEGQIDE-NIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 441

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R+  WKL+ G    G  D    Q+ SN  P
Sbjct:   442 WNTAIQSAIRVKHWKLLTGNP--GYSDWVPPQSFSNLGP 478

 Score = 92 (37.4 bits), Expect = 1.4e-54, Sum P(2) = 1.4e-54
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ ++++  P I  QL   L    +T VP  +   D  +++P+  
Sbjct:   490 TGKSVWLFNITADPYERVDLSNRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 548

Query:   623 NDTWSPW 629
                W PW
Sbjct:   549 GGVWGPW 555


>RGD|1307640
            symbol:Arsj "arylsulfatase family, member J" species:10116
            "Rattus norvegicus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 RGD:1307640 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
            OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:CH473952 EMBL:BN000740
            IPI:IPI00777558 RefSeq:NP_001041352.1 UniGene:Rn.202364 SMR:Q32KJ7
            STRING:Q32KJ7 Ensembl:ENSRNOT00000055633 GeneID:311013
            KEGG:rno:311013 UCSC:RGD:1307640 InParanoid:Q32KJ7 NextBio:662880
            Genevestigator:Q32KJ7 Uniprot:Q32KJ7
        Length = 597

 Score = 488 (176.8 bits), Expect = 2.3e-54, Sum P(2) = 2.3e-54
 Identities = 105/257 (40%), Positives = 145/257 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct:    85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct:   144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXX 270
             YY H   D       + G+D+  N + AWD   G Y+T ++T+   Q++      KP   
Sbjct:   204 YYTHYKCDSPG----VCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFL 259

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct:   260 YVAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query:   331 LENSIIIFMSDNGA-PT 346
               NSIII+ SDNG  PT
Sbjct:   315 YNNSIIIYSSDNGGQPT 331

 Score = 314 (115.6 bits), Expect = 2.5e-30, Sum P(2) = 2.5e-30
 Identities = 87/295 (29%), Positives = 136/295 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:   217 GYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFLYVAYQAVHSPL-----Q 271

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+R G   NSIII+ SDNG    
Sbjct:   272 APGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGG--- 328

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   329 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 382

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   383 ISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 438

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
                   +A+R+  WKL+ G    G  D    Q  SN  P    N  +   T +S+
Sbjct:   439 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQAFSNLGPNRWHNERITLSTGKSI 491

 Score = 91 (37.1 bits), Expect = 2.3e-54, Sum P(2) = 2.3e-54
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+  
Sbjct:   487 TGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 545

Query:   623 NDTWSPW 629
                W PW
Sbjct:   546 GGVWGPW 552


>MGI|MGI:2443513
            symbol:Arsj "arylsulfatase J" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2443513 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
            OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:AK034454 EMBL:AK046410
            EMBL:AK052931 IPI:IPI00986759 RefSeq:NP_775627.1 UniGene:Mm.317021
            ProteinModelPortal:Q8BM89 SMR:Q8BM89 STRING:Q8BM89
            PhosphoSite:Q8BM89 PRIDE:Q8BM89 Ensembl:ENSMUST00000093976
            GeneID:271970 KEGG:mmu:271970 InParanoid:Q8BM89 NextBio:393532
            Bgee:Q8BM89 CleanEx:MM_ARSJ Genevestigator:Q8BM89
            GermOnline:ENSMUSG00000046561 Uniprot:Q8BM89
        Length = 598

 Score = 487 (176.5 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 105/257 (40%), Positives = 145/257 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct:    85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct:   144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXX 270
             YY H   D       + G+D+  N + AWD   G Y+T ++T+   Q++      KP   
Sbjct:   204 YYTHYKCDSPG----VCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct:   260 YVAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query:   331 LENSIIIFMSDNGA-PT 346
               NSIII+ SDNG  PT
Sbjct:   315 YNNSIIIYSSDNGGQPT 331

 Score = 313 (115.2 bits), Expect = 3.3e-30, Sum P(2) = 3.3e-30
 Identities = 87/295 (29%), Positives = 136/295 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:   217 GYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHSPL-----Q 271

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+R G   NSIII+ SDNG    
Sbjct:   272 APGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGFYNNSIIIYSSDNGG--- 328

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   329 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 382

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   383 ISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 438

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
                   +A+R+  WKL+ G    G  D    Q  SN  P    N  +   T +S+
Sbjct:   439 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQAFSNLGPNRWHNERITLSTGKSI 491

 Score = 91 (37.1 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+  
Sbjct:   487 TGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 545

Query:   623 NDTWSPW 629
                W PW
Sbjct:   546 GGVWGPW 552


>UNIPROTKB|Q32KH6
            symbol:arsj "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
            OrthoDB:EOG45HRX5 EMBL:AAEX03016834 EMBL:BN000761
            RefSeq:NP_001041581.1 UniGene:Cfa.28600 SMR:Q32KH6
            Ensembl:ENSCAFT00000048607 GeneID:487909 KEGG:cfa:487909
            InParanoid:Q32KH6 NextBio:20861390 Uniprot:Q32KH6
        Length = 598

 Score = 487 (176.5 bits), Expect = 3.7e-54, Sum P(2) = 3.7e-54
 Identities = 105/257 (40%), Positives = 144/257 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct:    85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct:   144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 203

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXX 270
             YY H   D       + G+D+  N + AWD   G Y+T ++T+   Q++      KP   
Sbjct:   204 YYTHYKCDSPG----MCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFL 259

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct:   260 YIAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 314

Query:   331 LENSIIIFMSDNGA-PT 346
               NSIII+ SDNG  PT
Sbjct:   315 YNNSIIIYSSDNGGQPT 331

 Score = 310 (114.2 bits), Expect = 9.6e-30, Sum P(2) = 9.6e-30
 Identities = 83/279 (29%), Positives = 130/279 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:   217 GYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHSPL-----Q 271

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG    
Sbjct:   272 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGG--- 328

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   329 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 382

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   383 ISLAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 438

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R+  WKL+ G    G  D    Q+ SN  P
Sbjct:   439 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNLGP 475

 Score = 90 (36.7 bits), Expect = 3.7e-54, Sum P(2) = 3.7e-54
 Identities = 20/67 (29%), Positives = 32/67 (47%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ +++   P I  QL   L    +T VP  +   D  +++P+  
Sbjct:   487 TGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 545

Query:   623 NDTWSPW 629
                W PW
Sbjct:   546 GGVWGPW 552


>UNIPROTKB|Q5FYB0
            symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 KO:K12375 EMBL:AY875938 EMBL:AM049401
            EMBL:AY358647 EMBL:AC104779 EMBL:BC089445 EMBL:BC132879
            EMBL:BC132881 EMBL:BC144265 IPI:IPI00413865 RefSeq:NP_078866.3
            UniGene:Hs.22895 UniGene:Hs.700496 UniGene:Hs.712042
            ProteinModelPortal:Q5FYB0 SMR:Q5FYB0 STRING:Q5FYB0
            PhosphoSite:Q5FYB0 DMDM:74722580 PRIDE:Q5FYB0
            Ensembl:ENST00000315366 Ensembl:ENST00000541197 GeneID:79642
            KEGG:hsa:79642 UCSC:uc003ibq.1 CTD:79642 GeneCards:GC04M114821
            HGNC:HGNC:26286 HPA:HPA036482 MIM:610010 neXtProt:NX_Q5FYB0
            PharmGKB:PA143485310 InParanoid:Q5FYB0 OMA:AAGYGIW
            OrthoDB:EOG45HRX5 ChiTaRS:ARSJ GenomeRNAi:79642 NextBio:68769
            ArrayExpress:Q5FYB0 Bgee:Q5FYB0 CleanEx:HS_ARSJ
            Genevestigator:Q5FYB0 Uniprot:Q5FYB0
        Length = 599

 Score = 487 (176.5 bits), Expect = 6.1e-54, Sum P(2) = 6.1e-54
 Identities = 105/257 (40%), Positives = 144/257 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct:    87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct:   146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXX 270
             YY H   D       + G+D+  N + AWD   G Y+T ++T+   Q++      KP   
Sbjct:   206 YYTHYKCDSPG----MCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query:   271 XXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
                             +AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct:   262 YIAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query:   331 LENSIIIFMSDNGA-PT 346
               NSIII+ SDNG  PT
Sbjct:   317 YNNSIIIYSSDNGGQPT 333

 Score = 309 (113.8 bits), Expect = 2.1e-29, Sum P(2) = 2.1e-29
 Identities = 83/279 (29%), Positives = 130/279 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:   219 GYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHSPL-----Q 273

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG    
Sbjct:   274 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGG--- 330

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   331 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 384

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   385 ISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 440

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R+  WKL+ G    G  D    Q+ SN  P
Sbjct:   441 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNLGP 477

 Score = 88 (36.0 bits), Expect = 6.1e-54, Sum P(2) = 6.1e-54
 Identities = 19/67 (28%), Positives = 33/67 (49%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+  
Sbjct:   489 TGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 547

Query:   623 NDTWSPW 629
                W PW
Sbjct:   548 GGVWGPW 554


>RGD|1310242
            symbol:Arsi "arylsulfatase family, member I" species:10116
            "Rattus norvegicus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1310242
            GO:GO:0005783 GO:GO:0005576 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
            CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE
            EMBL:AABR03109797 EMBL:BN000739 IPI:IPI00367540
            RefSeq:NP_001041346.1 UniGene:Rn.202490 ProteinModelPortal:Q32KJ8
            SMR:Q32KJ8 STRING:Q32KJ8 PhosphoSite:Q32KJ8
            Ensembl:ENSRNOT00000030966 GeneID:307404 KEGG:rno:307404
            UCSC:RGD:1310242 InParanoid:Q32KJ8 NextBio:657343
            Genevestigator:Q32KJ8 Uniprot:Q32KJ8
        Length = 573

 Score = 475 (172.3 bits), Expect = 9.9e-54, Sum P(2) = 9.9e-54
 Identities = 99/255 (38%), Positives = 146/255 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      KP    
Sbjct:   177 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct:   233 VAFQAVHTPL-----QSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   288 NNSVIIFSSDNGGQT 302

 Score = 324 (119.1 bits), Expect = 2.1e-32, Sum P(2) = 2.1e-32
 Identities = 84/260 (32%), Positives = 126/260 (48%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    S AW   G+Y+T L+ + A  ++      KP                   ++
Sbjct:   190 GFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLYVAFQAVHTPL-----QS 244

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T  
Sbjct:   245 PREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF- 303

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  ++HI+DW PTL 
Sbjct:   304 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLV 355

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG TS     +DG D W ++     S R   +  +D      L   +R  S+     
Sbjct:   356 GLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEGGFG 409

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   410 IWNTAVQAAIRVGEWKLLTG 429

 Score = 98 (39.6 bits), Expect = 9.9e-54, Sum P(2) = 9.9e-54
 Identities = 20/62 (32%), Positives = 31/62 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W 
Sbjct:   463 WLFNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 521

Query:   628 PW 629
             PW
Sbjct:   522 PW 523


>UNIPROTKB|Q5FYB1
            symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6
            EMBL:AY875937 EMBL:AB448735 EMBL:AK122641 EMBL:BC129995
            EMBL:BC129996 IPI:IPI00257076 IPI:IPI00915442 RefSeq:NP_001012301.1
            UniGene:Hs.591252 ProteinModelPortal:Q5FYB1 SMR:Q5FYB1
            STRING:Q5FYB1 PhosphoSite:Q5FYB1 DMDM:74722581 PRIDE:Q5FYB1
            Ensembl:ENST00000328668 Ensembl:ENST00000515301 GeneID:340075
            KEGG:hsa:340075 UCSC:uc003lrv.2 GeneCards:GC05M149657
            HGNC:HGNC:32521 HPA:HPA038386 MIM:610009 neXtProt:NX_Q5FYB1
            PharmGKB:PA143485309 InParanoid:Q5FYB1 OMA:YHGSDIE
            GenomeRNAi:340075 NextBio:97681 ArrayExpress:Q5FYB1 Bgee:Q5FYB1
            CleanEx:HS_ARSI Genevestigator:Q5FYB1 GermOnline:ENSG00000183876
            Uniprot:Q5FYB1
        Length = 569

 Score = 469 (170.2 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 97/255 (38%), Positives = 146/255 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +P    
Sbjct:   177 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct:   233 VAFQAVHTPL-----QSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   288 NNSVIIFSSDNGGQT 302

 Score = 319 (117.4 bits), Expect = 5.0e-32, Sum P(2) = 5.0e-32
 Identities = 82/260 (31%), Positives = 126/260 (48%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    + AW   G+Y+T L+ + A  ++      +P                   ++
Sbjct:   190 GFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHTPL-----QS 244

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T  
Sbjct:   245 PREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF- 303

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  +MHI+DW PTL 
Sbjct:   304 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLV 355

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG TS     +DG D W ++     S R   +  +D      L   ++  S+     
Sbjct:   356 GLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFG 409

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   410 IWNTAVQAAIRVGEWKLLTG 429

 Score = 100 (40.3 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 22/64 (34%), Positives = 32/64 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      
Sbjct:   463 WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGA 519

Query:   626 WSPW 629
             W PW
Sbjct:   520 WGPW 523


>UNIPROTKB|Q32KH7
            symbol:ARSI "Arylsulfatase I" species:9615 "Canis lupus
            familiaris" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 HSSP:P15289 EMBL:AAEX02012119
            EMBL:BN000760 RefSeq:NP_001041583.1 UniGene:Cfa.39081
            ProteinModelPortal:Q32KH7 Ensembl:ENSCAFT00000028793 GeneID:489186
            KEGG:cfa:489186 CTD:340075 InParanoid:Q32KH7 KO:K12375
            OrthoDB:EOG4DFPN6 NextBio:20862393 Uniprot:Q32KH7
        Length = 573

 Score = 470 (170.5 bits), Expect = 3.3e-53, Sum P(2) = 3.3e-53
 Identities = 97/255 (38%), Positives = 146/255 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +P    
Sbjct:   178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  + SAL+R G  
Sbjct:   234 VAFQAVHTPL-----QSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 288

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   289 NNSVIIFSSDNGGQT 303

 Score = 314 (115.6 bits), Expect = 3.5e-31, Sum P(2) = 3.5e-31
 Identities = 81/260 (31%), Positives = 125/260 (48%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    + AW   G+Y+T L+ +    ++      +P                   ++
Sbjct:   191 GFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHTPL-----QS 245

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  + SAL+R G   NS+IIF SDNG  T  
Sbjct:   246 PREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFYNNSVIIFSSDNGGQTF- 304

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  ++HI+DW PTL 
Sbjct:   305 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLV 356

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG  S     +DG D W ++     S R   +  +D      L   +R  S+     
Sbjct:   357 GLAGGTASAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEAGFG 410

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   411 IWNTAVQAAIRVGEWKLLTG 430

 Score = 98 (39.6 bits), Expect = 3.3e-53, Sum P(2) = 3.3e-53
 Identities = 22/64 (34%), Positives = 32/64 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      
Sbjct:   464 WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGA 520

Query:   626 WSPW 629
             W PW
Sbjct:   521 WGPW 524


>MGI|MGI:2670959
            symbol:Arsi "arylsulfatase i" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2670959 GO:GO:0005783
            GO:GO:0005576 EMBL:CH466528 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
            CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE EMBL:BC138970
            EMBL:BC141169 EMBL:BN000748 IPI:IPI00462991 RefSeq:NP_001033588.1
            UniGene:Mm.20147 ProteinModelPortal:Q32KI9 SMR:Q32KI9 STRING:Q32KI9
            PRIDE:Q32KI9 Ensembl:ENSMUST00000040359 GeneID:545260
            KEGG:mmu:545260 UCSC:uc008fbe.1 InParanoid:Q32KI9 NextBio:412424
            Bgee:Q32KI9 Genevestigator:Q32KI9 Uniprot:Q32KI9
        Length = 573

 Score = 469 (170.2 bits), Expect = 4.2e-53, Sum P(2) = 4.2e-53
 Identities = 98/255 (38%), Positives = 145/255 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    S AW   G+Y+T L+ + A  ++       P    
Sbjct:   177 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct:   233 VAFQAVHTPL-----QSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   288 NNSVIIFSSDNGGQT 302

 Score = 318 (117.0 bits), Expect = 1.1e-31, Sum P(2) = 1.1e-31
 Identities = 83/260 (31%), Positives = 125/260 (48%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    S AW   G+Y+T L+ + A  ++       P                   ++
Sbjct:   190 GFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLYVAFQAVHTPL-----QS 244

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T  
Sbjct:   245 PREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF- 303

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  ++HI+DW PTL 
Sbjct:   304 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKRRTSRALVHITDWYPTLV 355

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG TS     +DG D W ++     S R   +  +D      L   +R  S+     
Sbjct:   356 GLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEGGFG 409

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   410 IWNTAVQAAIRVGEWKLLTG 429

 Score = 98 (39.6 bits), Expect = 4.2e-53, Sum P(2) = 4.2e-53
 Identities = 20/62 (32%), Positives = 31/62 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W 
Sbjct:   463 WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 521

Query:   628 PW 629
             PW
Sbjct:   522 PW 523


>UNIPROTKB|E1BIN3
            symbol:ARSI "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:DAAA02020627
            IPI:IPI00695273 Ensembl:ENSBTAT00000017050 Uniprot:E1BIN3
        Length = 572

 Score = 467 (169.5 bits), Expect = 6.9e-53, Sum P(2) = 6.9e-53
 Identities = 97/255 (38%), Positives = 146/255 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+ELGYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   118 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +P    
Sbjct:   178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 233

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct:   234 VAFQAVHTPL-----QSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 288

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   289 NNSVIIFSSDNGGQT 303

 Score = 308 (113.5 bits), Expect = 1.8e-30, Sum P(2) = 1.8e-30
 Identities = 80/260 (30%), Positives = 124/260 (47%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    + AW   G+Y+T L+ +    ++      +P                   ++
Sbjct:   191 GFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHTPL-----QS 245

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T  
Sbjct:   246 PREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFYNNSVIIFSSDNGGQTF- 304

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  ++HI+DW PTL 
Sbjct:   305 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKRRTSRALVHITDWYPTLV 356

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG  S     +DG D W ++     S R   +  +D      L   +R  S+     
Sbjct:   357 ALAGGTASAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEGGFG 410

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   411 IWNTAVQAAIRVGEWKLLTG 430

 Score = 98 (39.6 bits), Expect = 6.9e-53, Sum P(2) = 6.9e-53
 Identities = 22/64 (34%), Positives = 32/64 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      
Sbjct:   464 WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGA 520

Query:   626 WSPW 629
             W PW
Sbjct:   521 WGPW 524


>UNIPROTKB|F1RL69
            symbol:LOC100517463 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:FP102406
            Ensembl:ENSSSCT00000015795 Uniprot:F1RL69
        Length = 596

 Score = 466 (169.1 bits), Expect = 8.7e-53, Sum P(2) = 8.7e-53
 Identities = 96/255 (37%), Positives = 147/255 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    83 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 141

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L++LGY+T  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct:   142 RPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 201

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G D+    S AW   G+Y+T L+ +   +++      +P    
Sbjct:   202 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLY 257

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct:   258 VAFQAVHTPL-----QSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGALKRYGFY 312

Query:   332 ENSIIIFMSDNGAPT 346
              NS+IIF SDNG  T
Sbjct:   313 NNSVIIFSSDNGGQT 327

 Score = 312 (114.9 bits), Expect = 8.0e-31, Sum P(2) = 8.0e-31
 Identities = 81/260 (31%), Positives = 125/260 (48%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G D+    S AW   G+Y+T L+ +   +++      +P                   ++
Sbjct:   215 GFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLYVAFQAVHTPL-----QS 269

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T  
Sbjct:   270 PREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGALKRYGFYNNSVIIFSSDNGGQTF- 328

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP +++  R S  ++HI+DW PTL 
Sbjct:   329 --------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLV 380

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
               AGG  S     +DG D W ++     S R   +  +D      L   +R  S+     
Sbjct:   381 GLAGGTASAAD-GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEGGFG 434

Query:   467 IDEKKRTAAVRLDSWKLVLG 486
             I      AA+R+  WKL+ G
Sbjct:   435 IWNTAVQAAIRVGEWKLLTG 454

 Score = 98 (39.6 bits), Expect = 8.7e-53, Sum P(2) = 8.7e-53
 Identities = 22/64 (34%), Positives = 32/64 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      
Sbjct:   488 WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGA 544

Query:   626 WSPW 629
             W PW
Sbjct:   545 WGPW 548


>UNIPROTKB|F1NQP9
            symbol:ARSI "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:AADN02028629
            IPI:IPI00587142 Ensembl:ENSGALT00000009011 Uniprot:F1NQP9
        Length = 572

 Score = 465 (168.7 bits), Expect = 1.8e-52, Sum P(2) = 1.8e-52
 Identities = 94/255 (36%), Positives = 149/255 (58%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    59 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 117

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct:   118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 177

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXX 271
             YY +   D       + G+D+    + AWD  G+Y+T L+ +   +++      +P    
Sbjct:   178 YYTYDNCDGPG----VCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 233

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                            ++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct:   234 VAFQAVHTPL-----QSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 288

Query:   332 ENSIIIFMSDNGAPT 346
             +NS+I+F +DNG  T
Sbjct:   289 DNSVIVFSTDNGGQT 303

 Score = 314 (115.6 bits), Expect = 5.5e-31, Sum P(2) = 5.5e-31
 Identities = 82/277 (29%), Positives = 138/277 (49%)

Query:   229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEA 288
             G+D+    + AWD  G+Y+T L+ +   +++      +P                   ++
Sbjct:   191 GYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHTPL-----QS 245

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  +NS+I+F +DNG  T  
Sbjct:   246 PKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYYDNSVIVFSTDNGGQTF- 304

Query:   349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
                     + GSN+P RG K T WEGGV+    + SP I++  R S  ++HI+DW PTL 
Sbjct:   305 --------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKRRTSWALVHITDWYPTLV 356

Query:   409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--IN 466
             + A G+ S +P  +DG + W ++     S R   +  +D      L   ++  S+     
Sbjct:   357 SLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP-----LYNHAKYGSLEDGFG 410

Query:   467 IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
             I      A++R+  WKL+ G  + G  D    QT +N
Sbjct:   411 IWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQTLTN 445

 Score = 96 (38.9 bits), Expect = 1.8e-52, Sum P(2) = 1.8e-52
 Identities = 21/64 (32%), Positives = 33/64 (51%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ +++  RPD+   L   L ++ RT +P  +  E P   +A P      
Sbjct:   465 WLFNITADPYERYDLSEQRPDVVRALLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGA 521

Query:   626 WSPW 629
             W PW
Sbjct:   522 WGPW 525


>WB|WBGene00006310
            symbol:sul-3 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:FO080947
            UniGene:Cel.8880 GeneID:183778 KEGG:cel:CELE_C54D2.4 CTD:183778
            RefSeq:NP_001041231.1 ProteinModelPortal:H2KZF6 SMR:H2KZF6
            EnsemblMetazoa:C54D2.4a WormBase:C54D2.4a OMA:RGMMVSD
            Uniprot:H2KZF6
        Length = 488

 Score = 460 (167.0 bits), Expect = 9.8e-50, Sum P(2) = 9.8e-50
 Identities = 118/391 (30%), Positives = 192/391 (49%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
             G++D+ +  S  + TPN+  LA+  N  +L+N Y   +CTP+R++ MTG YP   G Q  
Sbjct:    42 GFSDVDWKDST-LHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100

Query:   150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
                  EP GVP    FL E +R+L YST  +GKWHLG+ ++E+ P  RGF+  +G+    
Sbjct:   101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160

Query:   210 ISYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQ 262
               Y++H  +DQY R ++  + G D+   + +      +   G Y+TDLFT  A+ ++++ 
Sbjct:   161 TGYFNHS-ADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNH 219

Query:   263 PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNR-RTYAA-MVKKLDDSVGT 320
                KP                       +TI Q +  T   R   ++  M+  +D ++G 
Sbjct:   220 NNSKPFFMFLSYQAVHPPLQVSQQS---KTIGQGKEATFILRSHAHSTRMLTAMDFAIGR 276

Query:   321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
             ++  L+   + EN++I+F SDNG        T+N+    SN P RG K+T+WEGG K   
Sbjct:   277 LVEYLKASNLYENTVIVFTSDNGG-------TANFG--ASNAPLRGEKDTIWEGGTKTTT 327

Query:   381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS-RR 439
              + SP   +       M H+ DW  T+ +  G +        DG++QW  L    P  RR
Sbjct:   328 FVHSPMYIEEGGTRDMMFHVVDWHATILSITGLEIDSYG---DGINQWEYLKTGRPKFRR 384

Query:   440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEK 470
                +  +D   S + +   +   ++ N+D K
Sbjct:   385 FQFVYNIDNHGSAIRDGDYKL--IVGNVDRK 413

 Score = 75 (31.5 bits), Expect = 9.8e-50, Sum P(2) = 9.8e-50
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
             LF +  DP E  +IA S P I  +L   L   ++ L  ++  +P  +   P+RFN ++S 
Sbjct:   423 LFRITTDPTESKDIARSNPKIVRRLLAKLDQLKKFL-HKNVRKPLSLNGSPERFNGSYSS 481

Query:   629 W 629
             +
Sbjct:   482 Y 482


>RGD|1565391
            symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
            species:10116 "Rattus norvegicus" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
            "metabolic process" evidence=RCA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA;ISO;RCA] [GO:0043890
            "N-acetylgalactosamine-6-sulfatase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1565391
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
            OrthoDB:EOG480HWH GO:GO:0043890 EMBL:AC134009 EMBL:BN000741
            IPI:IPI00359847 RefSeq:NP_001041316.1 UniGene:Rn.101398
            ProteinModelPortal:Q32KJ6 STRING:Q32KJ6 PRIDE:Q32KJ6
            Ensembl:ENSRNOT00000019528 GeneID:292073 KEGG:rno:292073
            UCSC:RGD:1565391 InParanoid:Q32KJ6 NextBio:633705
            Genevestigator:Q32KJ6 Uniprot:Q32KJ6
        Length = 524

 Score = 396 (144.5 bits), Expect = 3.1e-41, Sum P(2) = 3.1e-41
 Identities = 119/357 (33%), Positives = 171/357 (47%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI  G     
Sbjct:    43 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+P +E  LPE L++ GY+ K +GKWHLG  R ++ PL  GF+  F
Sbjct:   103 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDEWF 161

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQ 257
             G  N     YD+ +       R  E+ G    +   NL T     GE   T L+ +EA+ 
Sbjct:   162 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKT-----GEANLTQLYLQEALD 216

Query:   258 LIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
              I  Q   +                     AP     QF  +    R  Y   V+++DDS
Sbjct:   217 FIRTQHARQ--------SPFFLYWAIDATHAPVYASKQF--LGTSLRGRYGDAVREIDDS 266

Query:   318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
             VG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+   K T +EGG++
Sbjct:   267 VGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLCGKQTTFEGGMR 321

Query:   378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
              PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct:   322 EPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDLLPTML 375

 Score = 61 (26.5 bits), Expect = 3.1e-41, Sum P(2) = 3.1e-41
 Identities = 18/63 (28%), Positives = 34/63 (53%)

Query:   569 LFNLGNDPCEQNNI---ASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+  +   ++   D  S+  ++++ H+++LVP    QP L   +    N  
Sbjct:   443 IFHLGRDPGERFPLRFTSNEYQDALSRTTQVIQQHQKSLVPG---QPQLNVCNQAVMN-- 497

Query:   626 WSP 628
             W+P
Sbjct:   498 WAP 500


>MGI|MGI:1355303
            symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
            species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
            "metabolic process" evidence=ISO] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:1355303 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
            OrthoDB:EOG480HWH GO:GO:0043890 BRENDA:3.1.6.4 EMBL:AF111346
            EMBL:AF112242 EMBL:AF112230 EMBL:AF112231 EMBL:AF112233
            EMBL:AF112232 EMBL:AF112234 EMBL:AF112235 EMBL:AF112236
            EMBL:AF112237 EMBL:AF112238 EMBL:AF112239 EMBL:AF112240
            EMBL:AF112241 EMBL:AK220245 EMBL:AK159592 EMBL:BC004002
            IPI:IPI00310090 RefSeq:NP_001180574.1 RefSeq:NP_057931.3
            UniGene:Mm.34702 ProteinModelPortal:Q571E4 SMR:Q571E4 STRING:Q571E4
            PhosphoSite:Q571E4 PaxDb:Q571E4 PRIDE:Q571E4
            Ensembl:ENSMUST00000015171 GeneID:50917 KEGG:mmu:50917
            UCSC:uc012gmh.1 InParanoid:Q571E4 OMA:RKTGEAN NextBio:307919
            Bgee:Q571E4 CleanEx:MM_GALNS Genevestigator:Q571E4 Uniprot:Q571E4
        Length = 520

 Score = 395 (144.1 bits), Expect = 8.4e-41, Sum P(2) = 8.4e-41
 Identities = 116/353 (32%), Positives = 166/353 (47%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI  G     
Sbjct:    39 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+P +E  LPE L++ GY+ K +GKWHLG  R ++ PL  GF+  F
Sbjct:    99 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDEWF 157

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
             G  N     YD+         R  E+ G            T     T L+T+EA+  I+ 
Sbjct:   158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQLYTQEALDFIQT 216

Query:   262 QPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
             Q   +                     AP     QF  +    R  Y   V+++DDSVG +
Sbjct:   217 QHARQ--------SPFFLYWAIDATHAPVYASRQF--LGTSLRGRYGDAVREIDDSVGKI 266

Query:   322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
             +S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K T +EGG++ PAI
Sbjct:   267 LSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQTTFEGGMREPAI 321

Query:   382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
              W P      +VS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct:   322 AWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDLLPTML 371

 Score = 58 (25.5 bits), Expect = 8.4e-41, Sum P(2) = 8.4e-41
 Identities = 18/63 (28%), Positives = 33/63 (52%)

Query:   569 LFNLGNDPCEQNNIA---SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+  ++       D  S+  ++++ H+++LVP    QP L   +    N  
Sbjct:   439 IFHLGRDPGERFPLSFHSDEYQDALSRTTQVVQEHQKSLVPG---QPQLNVCNQAVMN-- 493

Query:   626 WSP 628
             W+P
Sbjct:   494 WAP 496


>UNIPROTKB|F1NH07
            symbol:ARSJ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:AADN02009321
            IPI:IPI00574604 Ensembl:ENSGALT00000019613 Uniprot:F1NH07
        Length = 472

 Score = 350 (128.3 bits), Expect = 6.9e-40, Sum P(2) = 6.9e-40
 Identities = 78/206 (37%), Positives = 113/206 (54%)

Query:   139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
             +Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+RRE  P  RG
Sbjct:     1 RYQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60

Query:   199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
             F++ FG L G   YY H   D       + G+D+  N + AWD   G Y+T ++T++  Q
Sbjct:    61 FDTFFGSLLGSGDYYTHFKCDSPG----ICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQ 116

Query:   258 LIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
             ++      KP                   +AP +    ++ I + NRR YAAM+  LD++
Sbjct:   117 ILASHNPRKPIFLYIAYQAVHSPL-----QAPGKYFEHYRSINNINRRRYAAMLACLDEA 171

Query:   318 VGTVISALQRKGMLENSIIIFMSDNG 343
             +  V  AL++ G  +NSIII+ SDNG
Sbjct:   172 INNVTLALKKYGYYDNSIIIYSSDNG 197

 Score = 323 (118.8 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 84/279 (30%), Positives = 133/279 (47%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T++  Q++      KP                   +
Sbjct:    87 GYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHSPL-----Q 141

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP +    ++ I + NRR YAAM+  LD+++  V  AL++ G  +NSIII+ SDNG   +
Sbjct:   142 APGKYFEHYRSINNINRRRYAAMLACLDEAINNVTLALKKYGYYDNSIIIYSSDNGGQPM 201

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
                        GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   202 A---------GGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGSVCKELVHITDWFPTL 252

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              T A G      + +DG D W ++   +  RR+  +D L     +     +   +    I
Sbjct:   253 ITLAEGQIDE-DIQLDGYDIWETI---SEGRRSPRVDILHNIDPIYTKAKNGSWAAGYGI 308

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R++ WKL+ G    G  D    Q  SN  P
Sbjct:   309 WNTAIQSAIRVNHWKLLTGNP--GYSDWVPPQAFSNVGP 345

 Score = 101 (40.6 bits), Expect = 6.9e-40, Sum P(2) = 6.9e-40
 Identities = 20/62 (32%), Positives = 33/62 (53%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
             +LFN+  DP E+ ++++  PD+  QL   L    +T VP  +   D  +++PK     W 
Sbjct:   362 WLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPKLNGGVWG 420

Query:   628 PW 629
             PW
Sbjct:   421 PW 422

 Score = 66 (28.3 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 20/76 (26%), Positives = 33/76 (43%)

Query:     4 NLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETI 62
             N + AWD   G Y+T ++T++  Q++      KP                   +AP +  
Sbjct:    93 NDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHSPL-----QAPGKYF 147

Query:    63 NQFQYITDPNRRTYAA 78
               ++ I + NRR YAA
Sbjct:   148 EHYRSINNINRRRYAA 163


>UNIPROTKB|F1MU84
            symbol:GALNS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:DAAA02046255 IPI:IPI00703141
            Ensembl:ENSBTAT00000006001 OMA:DDQVGIL Uniprot:F1MU84
        Length = 527

 Score = 392 (143.0 bits), Expect = 8.7e-40, Sum P(2) = 8.7e-40
 Identities = 118/356 (33%), Positives = 170/356 (47%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---Q 147
             GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI +G     
Sbjct:    46 GWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAALLTGRLPIRSGFYTTN 105

Query:   148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             G       P+    G+P +E  LP  L+  GY++K +GKWHLG  R ++ PL  GF+  F
Sbjct:   106 GHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLKHGFDEWF 164

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQ 257
             G  N     YD+         R  E+ G    +   NL T     GE   T ++ +EA++
Sbjct:   165 GSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKT-----GEANLTQIYLQEALE 219

Query:   258 LIE-DQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
              I+  Q   +P                    AP      F  +    R  Y   +++LDD
Sbjct:   220 FIQRQQAAHRPFFLYWAVDAT---------HAPIYASKPF--LGTSQRGRYGDAIRELDD 268

Query:   317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
             SVG ++  L+   + EN+ + F SDNGA  +     S  R  GSN P+   K T +EGG+
Sbjct:   269 SVGRILRLLRDLSIAENTFVFFTSDNGAALI-----SAPRQGGSNGPFLCGKQTTFEGGM 323

Query:   377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
             + PAI W P      +VS Q+  I D   T  + AG +  R    IDGLD   ++L
Sbjct:   324 REPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPAML 378

 Score = 52 (23.4 bits), Expect = 8.7e-40, Sum P(2) = 8.7e-40
 Identities = 19/63 (30%), Positives = 32/63 (50%)

Query:   569 LFNLGNDPCEQN--NIASSRP-DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+   ++AS    D   ++  +++ H+  LVP    QP L   +    N  
Sbjct:   446 IFHLGRDPGERFPLSVASIEYLDALRRITPVVQQHQEALVPG---QPQLNVCNRAVMN-- 500

Query:   626 WSP 628
             W+P
Sbjct:   501 WAP 503


>UNIPROTKB|F1S147
            symbol:ARSJ "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:CU694917
            Ensembl:ENSSSCT00000009989 Uniprot:F1S147
        Length = 467

 Score = 345 (126.5 bits), Expect = 2.9e-38, Sum P(2) = 2.9e-38
 Identities = 78/213 (36%), Positives = 112/213 (52%)

Query:   136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
             +  +Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P 
Sbjct:     1 LLSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPT 60

Query:   196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKE 254
              RGF++ FG L G   YY H   D       + G+D+  N + AWD   G Y+T ++T+ 
Sbjct:    61 KRGFDTFFGSLLGSGDYYTHYKCDSPG----MCGYDLYENENAAWDYDNGIYSTQMYTQR 116

Query:   255 AVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKL 314
               Q++      +P                   +AP      ++ I + NRR YAAM+  L
Sbjct:   117 VQQILASHDPKRPIFLYIAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSCL 171

Query:   315 DDSVGTVISALQRKGMLENSIIIFMSDNGA-PT 346
             D+++  V  AL+  G   NSIII+ SDNG  PT
Sbjct:   172 DEAINNVTLALKMYGFYNNSIIIYSSDNGGQPT 204

 Score = 309 (113.8 bits), Expect = 7.5e-31, Sum P(2) = 7.5e-31
 Identities = 82/279 (29%), Positives = 130/279 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      +P                   +
Sbjct:    90 GYDLYENENAAWDYDNGIYSTQMYTQRVQQILASHDPKRPIFLYIAYQAVHSPL-----Q 144

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG    
Sbjct:   145 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKMYGFYNNSIIIYSSDNGG--- 201

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   202 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 255

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   256 ISLAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 311

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R+  WKL+ G    G  D    Q+ SN  P
Sbjct:   312 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNLGP 348

 Score = 91 (37.1 bits), Expect = 2.9e-38, Sum P(2) = 2.9e-38
 Identities = 19/67 (28%), Positives = 33/67 (49%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ ++++  P +  QL   L    +T VP  +   D  +++P+  
Sbjct:   360 TGKSVWLFNITADPYERVDLSNRYPGVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 418

Query:   623 NDTWSPW 629
                W PW
Sbjct:   419 GGVWGPW 425


>UNIPROTKB|Q32KH5
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9615 "Canis lupus familiaris" [GO:0005764 "lysosome"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0046872 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 HSSP:P15289 EMBL:BN000762
            RefSeq:NP_001041585.1 UniGene:Cfa.37704 ProteinModelPortal:Q32KH5
            STRING:Q32KH5 PRIDE:Q32KH5 GeneID:489661 KEGG:cfa:489661 CTD:2588
            InParanoid:Q32KH5 KO:K01132 OrthoDB:EOG480HWH NextBio:20862813
            GO:GO:0043890 Uniprot:Q32KH5
        Length = 522

 Score = 379 (138.5 bits), Expect = 4.6e-38, Sum P(2) = 4.6e-38
 Identities = 119/358 (33%), Positives = 168/358 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI  G     
Sbjct:    41 GWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 100

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+P  E  LPE L+E GY +K +GKWHLG  R ++ PL  GF+  F
Sbjct:   101 RHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-RPQFHPLKHGFDEWF 159

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQ 257
             G  N     YD+         R  E+ G    +   NL T     GE   T ++ +EA+ 
Sbjct:   160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKT-----GEANLTQVYLQEALD 214

Query:   258 LIE-DQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
              I+  Q   +P                    AP      F  +    R  Y   V+++D+
Sbjct:   215 FIKRQQAAQRPFFLYWAIDAT---------HAPVYASRPF--LGTSQRGRYGDAVREIDN 263

Query:   317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
             SVG ++S LQ   + EN+ + F SDNGA  +     S     GSN P+   K T +EGG+
Sbjct:   264 SVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLCGKQTTFEGGM 318

Query:   377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
             + PAI W P      RVS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct:   319 REPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDGLDLLPAML 373

 Score = 68 (29.0 bits), Expect = 4.6e-38, Sum P(2) = 4.6e-38
 Identities = 21/63 (33%), Positives = 35/63 (55%)

Query:   569 LFNLGNDPCEQN--NIASSRP-DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+   + AS+   D+  ++  +++ H++TLVP    QP L   D    N  
Sbjct:   441 IFHLGRDPGERFPLSFASTEYLDVLQRVTPVVQQHQKTLVPG---QPQLNVCDRAVMN-- 495

Query:   626 WSP 628
             W+P
Sbjct:   496 WAP 498


>UNIPROTKB|F1PHF0
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9615 "Canis lupus familiaris" [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:RKTGEAN EMBL:AAEX03003965
            Ensembl:ENSCAFT00000031604 Uniprot:F1PHF0
        Length = 524

 Score = 379 (138.5 bits), Expect = 7.2e-38, Sum P(2) = 7.2e-38
 Identities = 119/358 (33%), Positives = 168/358 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI  G     
Sbjct:    43 GWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+P  E  LPE L+E GY +K +GKWHLG  R ++ PL  GF+  F
Sbjct:   103 RHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-RPQFHPLKHGFDEWF 161

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQ 257
             G  N     YD+         R  E+ G    +   NL T     GE   T ++ +EA+ 
Sbjct:   162 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKT-----GEANLTQVYLQEALD 216

Query:   258 LIE-DQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
              I+  Q   +P                    AP      F  +    R  Y   V+++D+
Sbjct:   217 FIKRQQAAQRPFFLYWAIDAT---------HAPVYASRPF--LGTSQRGRYGDAVREIDN 265

Query:   317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
             SVG ++S LQ   + EN+ + F SDNGA  +     S     GSN P+   K T +EGG+
Sbjct:   266 SVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLCGKQTTFEGGM 320

Query:   377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
             + PAI W P      RVS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct:   321 REPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDGLDLLPAML 375

 Score = 68 (29.0 bits), Expect = 7.2e-38, Sum P(2) = 7.2e-38
 Identities = 21/63 (33%), Positives = 35/63 (55%)

Query:   569 LFNLGNDPCEQN--NIASSRP-DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+   + AS+   D+  ++  +++ H++TLVP    QP L   D    N  
Sbjct:   443 IFHLGRDPGERFPLSFASTEYLDVLQRVTPVVQQHQKTLVPG---QPQLNVCDRAVMN-- 497

Query:   626 WSP 628
             W+P
Sbjct:   498 WAP 500


>UNIPROTKB|F6PKT4
            symbol:ARSJ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 OrthoDB:EOG45HRX5 EMBL:AAEX03016834
            Ensembl:ENSCAFT00000019312 Uniprot:F6PKT4
        Length = 489

 Score = 352 (129.0 bits), Expect = 6.6e-37, Sum P(2) = 6.6e-37
 Identities = 80/214 (37%), Positives = 114/214 (53%)

Query:   135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
             L++ +Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P
Sbjct:     1 LLSSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 60

Query:   195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTK 253
               RGF++ FG L G   YY H   D       + G+D+  N + AWD   G Y+T ++T+
Sbjct:    61 TKRGFDTFFGSLLGSGDYYTHYKCDSPG----MCGYDLYENDNAAWDYDNGIYSTQMYTQ 116

Query:   254 EAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKK 313
                Q++      KP                   +AP      ++ I + NRR YAAM+  
Sbjct:   117 RVQQILASHDPRKPIFLYIAYQAVHSPL-----QAPGRYFEHYRSIININRRRYAAMLSC 171

Query:   314 LDDSVGTVISALQRKGMLENSIIIFMSDNGA-PT 346
             LD+++  V  AL+  G   NSIII+ SDNG  PT
Sbjct:   172 LDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPT 205

 Score = 310 (114.2 bits), Expect = 1.4e-30, Sum P(2) = 1.4e-30
 Identities = 83/279 (29%), Positives = 130/279 (46%)

Query:   229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXE 287
             G+D+  N + AWD   G Y+T ++T+   Q++      KP                   +
Sbjct:    91 GYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHSPL-----Q 145

Query:   288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
             AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG    
Sbjct:   146 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGG--- 202

Query:   348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
               + T+     GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL
Sbjct:   203 --QPTAG----GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTL 256

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              + A G      + +DG D W ++   +   R+  +D L     +     +   +    I
Sbjct:   257 ISLAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGI 312

Query:   468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
                   +A+R+  WKL+ G    G  D    Q+ SN  P
Sbjct:   313 WNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNLGP 349

 Score = 90 (36.7 bits), Expect = 6.6e-37, Sum P(2) = 6.6e-37
 Identities = 20/67 (29%), Positives = 32/67 (47%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
             T    +LFN+  DP E+ +++   P I  QL   L    +T VP  +   D  +++P+  
Sbjct:   361 TGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLN 419

Query:   623 NDTWSPW 629
                W PW
Sbjct:   420 GGVWGPW 426


>UNIPROTKB|Q8WNQ7
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9823 "Sus scrofa" [GO:0005764 "lysosome" evidence=IEA]
            [GO:0043890 "N-acetylgalactosamine-6-sulfatase activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 HSSP:P15289 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
            GO:GO:0043890 EMBL:AF322917 RefSeq:NP_999120.1 UniGene:Ssc.4371
            ProteinModelPortal:Q8WNQ7 STRING:Q8WNQ7 GeneID:397000
            KEGG:ssc:397000 ArrayExpress:Q8WNQ7 Uniprot:Q8WNQ7
        Length = 522

 Score = 374 (136.7 bits), Expect = 4.5e-36, Sum P(2) = 4.5e-36
 Identities = 113/354 (31%), Positives = 164/354 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM---Q 147
             GW DL  +G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ PI TG     
Sbjct:    41 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIRTGFYTTN 100

Query:   148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL  GF+  F
Sbjct:   101 GHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLGH-RPQFHPLKHGFDEWF 159

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
             G  N     YD+         R  E+ G    +   NL T    +    T ++ +EA+  
Sbjct:   160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIYLQEALDF 215

Query:   259 IEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
             I+ Q                         AP      F  +    R  Y   V+++DDSV
Sbjct:   216 IKRQQA--------THHPFFLYWAIDATHAPVYASRAF--LGTSQRGRYGDAVREIDDSV 265

Query:   319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             G ++  L+   +  N+ + F SDNGA  V     S  +  GSN P+   K T +EGG++ 
Sbjct:   266 GRIVGLLRDLKIAGNTFVFFTSDNGAALV-----SAPKQGGSNGPFLCGKQTTFEGGMRE 320

Query:   379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
             PAI W P      +VS Q+  + D   T  + AG +       IDGLD   ++L
Sbjct:   321 PAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPS-DRAIDGLDLLPAML 373

 Score = 60 (26.2 bits), Expect = 4.5e-36, Sum P(2) = 4.5e-36
 Identities = 18/63 (28%), Positives = 33/63 (52%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYEL---LKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+  ++ +  +    L ++   ++ H+ +LVP    QP L   +P   N  
Sbjct:   441 IFHLGRDPGERFPLSFASTEYLDALRKITLVVQQHQESLVPG---QPQLNVCNPAVMN-- 495

Query:   626 WSP 628
             W+P
Sbjct:   496 WAP 498


>UNIPROTKB|F1RL71
            symbol:F1RL71 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CU914366
            Ensembl:ENSSSCT00000015793 Uniprot:F1RL71
        Length = 561

 Score = 252 (93.8 bits), Expect = 1.4e-35, Sum P(3) = 1.4e-35
 Identities = 66/182 (36%), Positives = 92/182 (50%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             YAAMV  +D++V  +  AL + G   NS+IIF SDNG  T          + GSN+P RG
Sbjct:   254 YAAMVTCMDEAVRNITGAL-KYGFYNNSVIIFSSDNGGQTF---------SGGSNWPLRG 303

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
              K T WEGGV+    + SP +++  R S  ++HI+DW PTL   AGG  S     +DG D
Sbjct:   304 RKGTYWEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLVGLAGGTASAAD-GLDGYD 362

Query:   427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLV 484
              W ++     S R   +  +D      L   +R  S+     I      AA+R+  WKL+
Sbjct:   363 VWPAISEGRASPRTEILHNIDP-----LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLL 417

Query:   485 LG 486
              G
Sbjct:   418 TG 419

 Score = 151 (58.2 bits), Expect = 1.4e-35, Sum P(3) = 1.4e-35
 Identities = 26/55 (47%), Positives = 39/55 (70%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM 146
             G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG + +  G+
Sbjct:    58 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGSHSLDRGL 111

 Score = 98 (39.6 bits), Expect = 1.4e-35, Sum P(3) = 1.4e-35
 Identities = 22/64 (34%), Positives = 32/64 (50%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDT 625
             +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      
Sbjct:   453 WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGA 509

Query:   626 WSPW 629
             W PW
Sbjct:   510 WGPW 513


>UNIPROTKB|P34059
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0003943
            "N-acetylgalactosamine-4-sulfatase activity" evidence=TAS]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0005975 "carbohydrate metabolic process" evidence=TAS]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=TAS]
            [GO:0042339 "keratan sulfate metabolic process" evidence=TAS]
            [GO:0042340 "keratan sulfate catabolic process" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] [GO:0044281 "small
            molecule metabolic process" evidence=TAS] Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            GO:GO:0043202 DrugBank:DB00070 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0003943
            Orphanet:582 GO:GO:0042340 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
            GO:GO:0043890 EMBL:D17629 EMBL:U06088 EMBL:U06078 EMBL:U06079
            EMBL:U06080 EMBL:U06081 EMBL:U06082 EMBL:U06083 EMBL:U06084
            EMBL:U06085 EMBL:U06086 EMBL:U06087 EMBL:BC050684 EMBL:BC056151
            IPI:IPI00029605 PIR:JQ1299 RefSeq:NP_000503.1 UniGene:Hs.271383
            PDB:4FDI PDB:4FDJ PDBsum:4FDI PDBsum:4FDJ ProteinModelPortal:P34059
            SMR:P34059 STRING:P34059 PhosphoSite:P34059 DMDM:462148
            PaxDb:P34059 PRIDE:P34059 DNASU:2588 Ensembl:ENST00000268695
            GeneID:2588 KEGG:hsa:2588 UCSC:uc002fly.4 GeneCards:GC16M088880
            H-InvDB:HIX0134371 HGNC:HGNC:4122 HPA:CAB026404 MIM:253000
            MIM:612222 neXtProt:NX_P34059 PharmGKB:PA28535 InParanoid:P34059
            OMA:GAISHAF PhylomeDB:P34059 BioCyc:MetaCyc:HS06790-MONOMER
            BRENDA:3.1.6.4 ChiTaRS:Galns GenomeRNAi:2588 NextBio:10237
            ArrayExpress:P34059 Bgee:P34059 CleanEx:HS_GALNS
            Genevestigator:P34059 GermOnline:ENSG00000141012 Uniprot:P34059
        Length = 522

 Score = 374 (136.7 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 108/335 (32%), Positives = 156/335 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct:    42 GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+  F
Sbjct:   102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEWF 160

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQ 257
             G  N     YD+         R  E+ G    +   NL T     GE   T ++ +EA+ 
Sbjct:   161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKT-----GEANLTQIYLQEALD 215

Query:   258 LIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
              I+ Q    P                    AP      F  +    R  Y   V+++DDS
Sbjct:   216 FIKRQARHHPFFLYWAVDAT---------HAPVYASKPF--LGTSQRGRYGDAVREIDDS 264

Query:   318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
             +G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K T +EGG++
Sbjct:   265 IGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGGMR 319

Query:   378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
              PA+ W P      +VS Q+  I D   T    AG
Sbjct:   320 EPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAG 354

 Score = 55 (24.4 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query:   569 LFNLGNDPCEQN--NIASSR-PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+   + AS+   +  S++  +++ H+  LVP    QP L   +    N  
Sbjct:   441 IFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPA---QPQLNVCNWAVMN-- 495

Query:   626 WSP 628
             W+P
Sbjct:   496 WAP 498


>UNIPROTKB|F1NW57
            symbol:GALNS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:DDQVGIL EMBL:AADN02054103
            IPI:IPI00577734 Ensembl:ENSGALT00000010149 Uniprot:F1NW57
        Length = 521

 Score = 371 (135.7 bits), Expect = 2.4e-35, Sum P(2) = 2.4e-35
 Identities = 112/355 (31%), Positives = 162/355 (45%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL   G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ P+  G     
Sbjct:    40 GWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPSRAALLTGRLPVRNGFYTTN 99

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+  +E  LPE L++ GY+ K IGKWHLG  R ++ PL  GF+  F
Sbjct:   100 AHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLGH-RPQFHPLKHGFDEWF 158

Query:   204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY----ATDLFTKEA--VQ 257
             G  N     YD+       R   L    + R+    W+ +G Y      DL T EA   Q
Sbjct:   159 GSPNCHFGPYDN-------RA--LPNIPVYRD----WEMIGRYYEDFKIDLRTGEANLTQ 205

Query:   258 LIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
             +   + +D                      AP      F  +    R  Y   V+++DDS
Sbjct:   206 IYLQEALDFISKQQASQQPFFLYWAIDATHAPVYASKHF--LGTSQRGRYGDAVREIDDS 263

Query:   318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
             VG ++  LQ+ G+ EN+ + F SDNGA  +     S  +  GSN P+   K T +EGG++
Sbjct:   264 VGKILKHLQKLGISENTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQTTFEGGMR 318

Query:   378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
              PAI W P       VS Q+  + D   T  +  G         IDG+D   ++L
Sbjct:   319 EPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLSLVGLQPPS-DRQIDGIDLLPAIL 372

 Score = 58 (25.5 bits), Expect = 2.4e-35, Sum P(2) = 2.4e-35
 Identities = 18/63 (28%), Positives = 32/63 (50%)

Query:   569 LFNLGNDPCEQNNIASSRPD---ISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             LF+LG DP E+  ++ +  +   +  ++  +++ H+ T+VP     P L   D    N  
Sbjct:   440 LFHLGRDPGEKYPLSFASDEYQGVMRRISAVVQQHKDTMVPGV---PQLNVCDKAVMN-- 494

Query:   626 WSP 628
             WSP
Sbjct:   495 WSP 497


>UNIPROTKB|F1S2F1
            symbol:F1S2F1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CU468550
            Ensembl:ENSSSCT00000015408 Uniprot:F1S2F1
        Length = 151

 Score = 380 (138.8 bits), Expect = 8.1e-35, P = 8.1e-35
 Identities = 67/124 (54%), Positives = 90/124 (72%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
             GWND+ FHGS EI TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct:    24 GWNDVGFHGS-EIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 82

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W  +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FG  NG   
Sbjct:    83 WPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFG--NGNAH 140

Query:   212 YYDH 215
              Y H
Sbjct:   141 TYIH 144


>UNIPROTKB|P25549
            symbol:aslA "arylsulfatase" species:83333 "Escherichia coli
            K-12" [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0042597 "periplasmic space" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:U00096
            EMBL:AP009048 GenomeReviews:AP009048_GR GenomeReviews:U00096_GR
            GO:GO:0046872 GO:GO:0042597 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 OMA:FGPSQMA InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 EMBL:M90498 EMBL:M87049 PIR:S30691
            RefSeq:NP_418245.1 RefSeq:YP_491641.1 ProteinModelPortal:P25549
            SMR:P25549 IntAct:P25549 EnsemblBacteria:EBESCT00000000559
            EnsemblBacteria:EBESCT00000017339 GeneID:12933611 GeneID:949015
            KEGG:ecj:Y75_p3377 KEGG:eco:b3801 PATRIC:32123099 EchoBASE:EB0087
            EcoGene:EG10089 HOGENOM:HOG000126460 KO:K01130
            ProtClustDB:CLSK880785 BioCyc:EcoCyc:ARYLSULFAT-MONOMER
            BioCyc:ECOL316407:JW3773-MONOMER BioCyc:MetaCyc:ARYLSULFAT-MONOMER
            Genevestigator:P25549 Uniprot:P25549
        Length = 551

 Score = 252 (93.8 bits), Expect = 2.1e-32, Sum P(2) = 2.1e-32
 Identities = 55/126 (43%), Positives = 80/126 (63%)

Query:    92 GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
             GW D+ F+G       PTP+IDA+A  G+IL + Y+QP  +P+RA+++TG+Y IH G+  
Sbjct:    97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156

Query:   149 PPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
             PP++G +P G+  LT   LP+ L + GY T+AIGKWH+G   +E  P   GF+   G+ N
Sbjct:   157 PPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGF-N 211

Query:   208 GVISYY 213
              V   Y
Sbjct:   212 SVSDMY 217

 Score = 178 (67.7 bits), Expect = 2.1e-32, Sum P(2) = 2.1e-32
 Identities = 49/148 (33%), Positives = 77/148 (52%)

Query:   300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
             + P R +Y   + +++D    +   L++ G L+N++I+F SDNG P  E     + R   
Sbjct:   315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEV--PPHGRT-- 369

Query:   360 SNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
                P+RG K + WEGGV+VP  + W   IQ  PR S  ++ ++D  PT    AG   +++
Sbjct:   370 ---PFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKV 424

Query:   419 ----PLN--IDGLDQWSSLLLNTPSRRN 440
                 P    IDG+DQ +S  L T  + N
Sbjct:   425 ANLVPKTTFIDGVDQ-TSFFLGTNGQSN 451

 Score = 63 (27.2 bits), Expect = 2.2e-20, Sum P(2) = 2.2e-20
 Identities = 22/58 (37%), Positives = 29/58 (50%)

Query:   443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL-VLGTQENG-TMDGYYG 498
             IDG+DQ +S  L T  + N    +     + AAVR+D +K  VL  Q    T  GY G
Sbjct:   434 IDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490


>ZFIN|ZDB-GENE-070112-1152
            symbol:galns "galactosamine (N-acetyl)-6-sulfate
            sulfatase" species:7955 "Danio rerio" [GO:0008152 "metabolic
            process" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 ZFIN:ZDB-GENE-070112-1152 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CR376726 EMBL:BX248306
            EMBL:CR388041 IPI:IPI01023807 ProteinModelPortal:F8W261
            Ensembl:ENSDART00000149478 ArrayExpress:F8W261 Bgee:F8W261
            Uniprot:F8W261
        Length = 514

 Score = 340 (124.7 bits), Expect = 6.5e-31, Sum P(2) = 6.5e-31
 Identities = 111/357 (31%), Positives = 162/357 (45%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL   G     TP +D +A  G++  N Y A P+C+PSRA+L+TG+ P+  G     
Sbjct:    33 GWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRNGFYTTN 92

Query:   151 IWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P+    G+   E  LPE L+   Y +K +GKWHLG  R +Y PL  GF+  F
Sbjct:    93 AHARNAYTPQEIVGGISADEILLPELLKNKHYVSKIVGKWHLGH-RTQYLPLKHGFDEWF 151

Query:   204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT---VGEY-ATDLFTKEAVQLI 259
             G  N     Y+   S + +  V  N  +M+      ++     GE   T L+ KE +  I
Sbjct:   152 GAPNCHFGPYND--SSRPNIPV-YNNSEMKGRYYEEFEINVKTGESNLTQLYLKEGLDFI 208

Query:   260 EDQPV-DKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
               Q +  +P                    AP      F  +    R  Y   V +LDDS+
Sbjct:   209 SQQAMAQRPFFLYWAPDAT---------HAPVYASKPF--LGKSQRGRYGDAVMELDDSI 257

Query:   319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             G +++ L   G+  ++++ F SDNGA  +     S     GSN P+   K T +EGG++ 
Sbjct:   258 GQILAHLVSLGIQNDTLVFFTSDNGAALM-----SGPLQSGSNAPFLCGKETTFEGGMRE 312

Query:   379 PAILWSP-QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
             PA+ W P QI     VS Q+  + D   T  + AG       + IDG+D  S +L N
Sbjct:   313 PAMAWWPGQIPAGT-VSHQLASVMDLFSTSLSVAGVSPPDDRV-IDGVDL-SPVLFN 366

 Score = 59 (25.8 bits), Expect = 6.5e-31, Sum P(2) = 6.5e-31
 Identities = 17/63 (26%), Positives = 32/63 (50%)

Query:   569 LFNLGNDPCEQNNIA---SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+  ++       D+  ++  +++ H++ L+P    QP L   D    N  
Sbjct:   433 IFHLGRDPGERYPLSVQCKEYRDVFRRVTAVVEQHQKLLIPG---QPQLNMCDLAVMN-- 487

Query:   626 WSP 628
             W+P
Sbjct:   488 WTP 490


>TIGR_CMR|CPS_2364
            symbol:CPS_2364 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0008484 RefSeq:YP_269082.1
            ProteinModelPortal:Q482D6 STRING:Q482D6 GeneID:3521400
            KEGG:cps:CPS_2364 PATRIC:21467813 OMA:MEIAVIN
            BioCyc:CPSY167879:GI48-2427-MONOMER Uniprot:Q482D6
        Length = 492

 Score = 328 (120.5 bits), Expect = 3.9e-29, Sum P(2) = 3.9e-29
 Identities = 133/461 (28%), Positives = 209/461 (45%)

Query:    91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM-QG 148
             +G  DLS +GSN   TPNID LA +G+  +N YA  P C PSR ++ +G YP   G+ QG
Sbjct:    41 FGRQDLSTYGSNFYETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQG 100

Query:   149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF--GYL 206
               + G     +PL+     E+L+E GY T  IGKWHLG  +    P  +GF+S    G+ 
Sbjct:   101 ERV-GKHH--LPLSAVTFGEHLKEAGYQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHW 155

Query:   207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE---DQP 263
                 SYY       Y++ +  +G +  +  +    +  EY TD  T EA+  IE   DQP
Sbjct:   156 GAPPSYYF-----PYTK-MSKSGKN--KGFAKVEGSEEEYLTDRLTDEALTFIEQKKDQP 207

Query:   264 ---------VDKPXXXXXXXXXXXXXXXXXXXEA---PQETINQFQ-----YITDPNRRT 306
                      V  P                    A   P+   +  +     + T  N   
Sbjct:   208 FLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQNNPD 267

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             YAAMV+ +D SVG +   L+R G+ +N+III  SD+G  +    + SN     SN PYR 
Sbjct:   268 YAAMVESVDISVGRIEQQLKRLGLEDNTIIILTSDHGGLSSRGLK-SNRVLATSNNPYRH 326

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K  +++GG +VP I+ W  +++      +Q+   +D  PT+   AG   S    + DG+
Sbjct:   327 GKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVTG-TDHYPTILQMAGLSLSPKD-HQDGV 384

Query:   426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
                ++L  +   R+           ++  ++P+ R S      +   +A +    WKL L
Sbjct:   385 SYLAALNSDETPRK-----------AMFWHSPAARPS---KTGDTNSSAIIE-GEWKL-L 428

Query:   486 GTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS 526
                  G ++ Y    + +K    N   ++  KT + L +L+
Sbjct:   429 DFWSTGKVELY--NLKDDKSEANNLAKLMPEKTAEMLAKLT 467

 Score = 54 (24.1 bits), Expect = 3.9e-29, Sum P(2) = 3.9e-29
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
             + G   L+NL +D  E NN+A   P+ ++++
Sbjct:   432 STGKVELYNLKDDKSEANNLAKLMPEKTAEM 462


>UNIPROTKB|P08842
            symbol:STS "Steryl-sulfatase" species:9606 "Homo sapiens"
            [GO:0007565 "female pregnancy" evidence=IEA] [GO:0016021 "integral
            to membrane" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0016020 "membrane" evidence=TAS] [GO:0005764
            "lysosome" evidence=TAS] [GO:0005768 "endosome" evidence=TAS]
            [GO:0005783 "endoplasmic reticulum" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=TAS]
            [GO:0005794 "Golgi apparatus" evidence=TAS] [GO:0005886 "plasma
            membrane" evidence=TAS] [GO:0006706 "steroid catabolic process"
            evidence=TAS] [GO:0008544 "epidermis development" evidence=TAS]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0004773 "steryl-sulfatase activity" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0005789
            "endoplasmic reticulum membrane" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0016021
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005635 GO:GO:0043588
            GO:GO:0044281 GO:GO:0005789 GO:GO:0046872 GO:GO:0006706
            GO:GO:0008284 GO:GO:0005768 GO:GO:0043434 GO:GO:0006644
            GO:GO:0007565 GO:GO:0005764 GO:GO:0009268 GO:GO:0007611
            GO:GO:0005788 GO:GO:0043627 GO:GO:0043687 GO:GO:0008544
            DrugBank:DB00655 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0006687 OrthoDB:EOG4V4379
            EMBL:J04964 EMBL:M16505 EMBL:AK314034 EMBL:BC075030 EMBL:M23945
            EMBL:M23556 IPI:IPI00307433 PIR:A32641 RefSeq:NP_000342.2
            UniGene:Hs.522578 UniGene:Hs.700558 UniGene:Hs.700559
            UniGene:Hs.740067 PDB:1P49 PDBsum:1P49 ProteinModelPortal:P08842
            SMR:P08842 MINT:MINT-1177440 STRING:P08842 PhosphoSite:P08842
            DMDM:135006 PaxDb:P08842 PRIDE:P08842 Ensembl:ENST00000217961
            GeneID:412 KEGG:hsa:412 UCSC:uc004cry.4 CTD:412
            GeneCards:GC0XP007147 HGNC:HGNC:11425 HPA:HPA002904 MIM:300747
            MIM:308100 neXtProt:NX_P08842 Orphanet:461 PharmGKB:PA36225
            InParanoid:P08842 KO:K01131 OMA:GLSCQCD PhylomeDB:P08842
            BindingDB:P08842 ChEMBL:CHEMBL3559 EvolutionaryTrace:P08842
            GenomeRNAi:412 NextBio:1743 Bgee:P08842 CleanEx:HS_STS
            Genevestigator:P08842 GermOnline:ENSG00000101846 GO:GO:0004773
            Uniprot:P08842
        Length = 583

 Score = 220 (82.5 bits), Expect = 8.0e-29, Sum P(3) = 8.0e-29
 Identities = 51/124 (41%), Positives = 69/124 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  I TPNID LA  G+ L  ++ A P+CTPSRA+ MTG+YP+ +GM    
Sbjct:    38 GIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGRYPVRSGMASWS 97

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
               G  ++ A   G+P  E    + L++ GYST  IGKWHLG      T     PL+ GF 
Sbjct:    98 RTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTDFCHHPLHHGFN 157

Query:   201 SHFG 204
               +G
Sbjct:   158 YFYG 161

 Score = 152 (58.6 bits), Expect = 8.0e-29, Sum P(3) = 8.0e-29
 Identities = 47/150 (31%), Positives = 72/150 (48%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D SVG +++ L    +  +++I F SD GA   E          GSN  Y+G
Sbjct:   308 YGDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHG-GSNGIYKG 366

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K   WEGG++VP IL W   IQ   ++     ++ D  PT+   AG       + IDG 
Sbjct:   367 GKANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNM-DIFPTVAKLAGAPLPEDRI-IDGR 424

Query:   426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLN 455
             D     LL   S+R+ + + L  + +  LN
Sbjct:   425 DLMP--LLEGKSQRSDH-EFLFHYCNAYLN 451

 Score = 69 (29.3 bits), Expect = 8.0e-29, Sum P(3) = 8.0e-29
 Identities = 19/68 (27%), Positives = 31/68 (45%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF--- 622
             P  LF++  DP E+N +    P    + YE+LK  +      +   P++    P +F   
Sbjct:   499 PPLLFDISKDPRERNPLT---PASEPRFYEILKVMQEAADRHTQTLPEV----PDQFSWN 551

Query:   623 NDTWSPWI 630
             N  W PW+
Sbjct:   552 NFLWKPWL 559


>UNIPROTKB|F1NFQ0
            symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00587215
            Ensembl:ENSGALT00000026860 OMA:GHYKAVF ArrayExpress:F1NFQ0
            Uniprot:F1NFQ0
        Length = 590

 Score = 227 (85.0 bits), Expect = 1.2e-28, Sum P(3) = 1.2e-28
 Identities = 52/124 (41%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  D+  +G++ I TPNID LA  G+ L  ++ A P+CTPSRA+L+TG+YPI +GM    
Sbjct:    46 GIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGRYPIRSGMDAVN 105

Query:   151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
                   W     G+P  E    + L++ GYST  IGKWHLG     R ++   PL  GFE
Sbjct:   106 NYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRNDHCHHPLNHGFE 165

Query:   201 SHFG 204
               +G
Sbjct:   166 YFYG 169

 Score = 146 (56.5 bits), Expect = 1.2e-28, Sum P(3) = 1.2e-28
 Identities = 41/121 (33%), Positives = 65/121 (53%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+ A+ +KG+ +N+++ F SD+G   +E R+    +  G N  YRG
Sbjct:   316 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGG-WLE-RQEGKRQLGGWNGIYRG 373

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    +V  +   + D  PT+   AGG   +  + IDG 
Sbjct:   374 GKAMGGWEGGIRVPGIFRWPGVLPAGKVINEPTSLMDIYPTVVHLAGGVVPQDRV-IDGR 432

Query:   426 D 426
             D
Sbjct:   433 D 433

 Score = 66 (28.3 bits), Expect = 1.2e-28, Sum P(3) = 1.2e-28
 Identities = 24/71 (33%), Positives = 34/71 (47%)

Query:   566 PCYLFNLGNDPCEQNNI-ASSRP--D-ISSQLYELLKYHRRTL--VPQSHEQPDLVQADP 619
             P  L++L  DP E   + A + P  D +  Q+   ++ HRRTL  VPQ   Q  L     
Sbjct:   507 PPLLYDLSRDPSESQPLSADTEPLFDTVIEQIGRAIEEHRRTLAAVPQ---QLSL----- 558

Query:   620 KRFNDTWSPWI 630
               +N  W PW+
Sbjct:   559 --YNVIWKPWL 567


>UNIPROTKB|F1NFQ1
            symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00600266
            Ensembl:ENSGALT00000026858 ArrayExpress:F1NFQ1 Uniprot:F1NFQ1
        Length = 579

 Score = 227 (85.0 bits), Expect = 1.7e-28, Sum P(3) = 1.7e-28
 Identities = 52/124 (41%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  D+  +G++ I TPNID LA  G+ L  ++ A P+CTPSRA+L+TG+YPI +GM    
Sbjct:    33 GIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGRYPIRSGMDAVN 92

Query:   151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
                   W     G+P  E    + L++ GYST  IGKWHLG     R ++   PL  GFE
Sbjct:    93 NYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRNDHCHHPLNHGFE 152

Query:   201 SHFG 204
               +G
Sbjct:   153 YFYG 156

 Score = 144 (55.7 bits), Expect = 1.7e-28, Sum P(3) = 1.7e-28
 Identities = 37/121 (30%), Positives = 59/121 (48%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+ A+ +KG+ +N+++ F SD+G              W   Y  +G
Sbjct:   303 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGGWLERQEGKRQLGGWNGIYRVKG 362

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    +V  +   + D  PT+   AGG   +  + IDG 
Sbjct:   363 GKAMGGWEGGIRVPGIFRWPGVLPAGKVINEPTSLMDIYPTVVHLAGGVVPQDRV-IDGR 421

Query:   426 D 426
             D
Sbjct:   422 D 422

 Score = 66 (28.3 bits), Expect = 1.7e-28, Sum P(3) = 1.7e-28
 Identities = 24/71 (33%), Positives = 34/71 (47%)

Query:   566 PCYLFNLGNDPCEQNNI-ASSRP--D-ISSQLYELLKYHRRTL--VPQSHEQPDLVQADP 619
             P  L++L  DP E   + A + P  D +  Q+   ++ HRRTL  VPQ   Q  L     
Sbjct:   496 PPLLYDLSRDPSESQPLSADTEPLFDTVIEQIGRAIEEHRRTLAAVPQ---QLSL----- 547

Query:   620 KRFNDTWSPWI 630
               +N  W PW+
Sbjct:   548 --YNVIWKPWL 556


>UNIPROTKB|F1PY85
            symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
            familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:AAEX03026108
            Ensembl:ENSCAFT00000017754 Uniprot:F1PY85
        Length = 562

 Score = 227 (85.0 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 61/150 (40%), Positives = 79/150 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G+N + TPNID LA  G+ L  ++ A  VCTPSRA+ +TG+YPI +GM  P 
Sbjct:    18 GVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGRYPIRSGMASPY 77

Query:   151 ------IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF---RRE--YTPLYRGF 199
                    W     G+P  E    + L+  GY T  IGKWH G     R +  Y PL  GF
Sbjct:    78 NLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRNDHCYHPLNHGF 137

Query:   200 ESHFGYLNGVISYYDHILSD-QYSRTVELN 228
             +  +G   G       +LSD Q SRT EL+
Sbjct:   138 DYFYGLPFG-------LLSDCQASRTPELH 160

 Score = 136 (52.9 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 39/121 (32%), Positives = 63/121 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L ++ +  ++++ F SDNG   +E +E    +  GSN  Y+G
Sbjct:   289 YGDNVEEMDWMVGRILETLDQERLTNHTLVYFTSDNGG-RLEVQE-GEVQLGGSNGIYKG 346

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
              +    WEGG++VP I   P + Q  +V  +   + D  PTL    GG    LP +  ID
Sbjct:   347 GQGMGGWEGGIRVPGIFRWPTVLQAGKVINEPTSLMDIYPTLSYIGGG---MLPQDRVID 403

Query:   424 G 424
             G
Sbjct:   404 G 404

 Score = 68 (29.0 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 21/84 (25%), Positives = 32/84 (38%)

Query:   551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP----DISSQLYELLKYHRRTLVP 606
             G+   P +     + P  LF++  DP E   +          +  ++   +K HRRTL P
Sbjct:   464 GSGICPCSGDVTYHDPPLLFDVSRDPSETRPLNPDNEALFDSVVKKIEAAIKEHRRTLTP 523

Query:   607 QSHEQPDLVQADPKRFNDTWSPWI 630
                  P         FN  W PW+
Sbjct:   524 V----PQQFSV----FNTLWKPWL 539


>UNIPROTKB|Q32KH8
            symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
            familiaris" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0016021 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12374 OrthoDB:EOG4V4379
            EMBL:AAEX01047377 EMBL:BN000759 RefSeq:NP_001041588.1
            UniGene:Cfa.39079 HSSP:P15289 ProteinModelPortal:Q32KH8 SMR:Q32KH8
            GeneID:491720 KEGG:cfa:491720 CTD:347527 InParanoid:Q32KH8
            NextBio:20864464 Uniprot:Q32KH8
        Length = 562

 Score = 227 (85.0 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 61/150 (40%), Positives = 79/150 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G+N + TPNID LA  G+ L  ++ A  VCTPSRA+ +TG+YPI +GM  P 
Sbjct:    18 GVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGRYPIRSGMASPY 77

Query:   151 ------IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF---RRE--YTPLYRGF 199
                    W     G+P  E    + L+  GY T  IGKWH G     R +  Y PL  GF
Sbjct:    78 NLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRNDHCYHPLNHGF 137

Query:   200 ESHFGYLNGVISYYDHILSD-QYSRTVELN 228
             +  +G   G       +LSD Q SRT EL+
Sbjct:   138 DYFYGLPFG-------LLSDCQASRTPELH 160

 Score = 136 (52.9 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 39/121 (32%), Positives = 63/121 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L ++ +  ++++ F SDNG   +E +E    +  GSN  Y+G
Sbjct:   289 YGDNVEEMDWMVGKILETLDQERLTNHTLVYFTSDNGG-RLEVQE-GEVQLGGSNGIYKG 346

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
              +    WEGG++VP I   P + Q  +V  +   + D  PTL    GG    LP +  ID
Sbjct:   347 GQGMGGWEGGIRVPGIFRWPTVLQAGKVINEPTSLMDIYPTLSYIGGG---MLPQDRVID 403

Query:   424 G 424
             G
Sbjct:   404 G 404

 Score = 68 (29.0 bits), Expect = 5.7e-28, Sum P(3) = 5.7e-28
 Identities = 21/84 (25%), Positives = 32/84 (38%)

Query:   551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP----DISSQLYELLKYHRRTLVP 606
             G+   P +     + P  LF++  DP E   +          +  ++   +K HRRTL P
Sbjct:   464 GSGICPCSGDVTYHDPPLLFDVSRDPSETRPLNPDNEALFDSVVKKIEAAIKEHRRTLTP 523

Query:   607 QSHEQPDLVQADPKRFNDTWSPWI 630
                  P         FN  W PW+
Sbjct:   524 V----PQQFSV----FNTLWKPWL 539


>UNIPROTKB|Q32KI1
            symbol:arse "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0004065 "arylsulfatase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 KO:K12374 OrthoDB:EOG4V4379 CTD:415
            EMBL:AAEX03026107 OMA:VCFQIMA EMBL:BN000756 RefSeq:NP_001041587.1
            UniGene:Cfa.28960 SMR:Q32KI1 STRING:Q32KI1
            Ensembl:ENSCAFT00000045735 GeneID:491719 KEGG:cfa:491719
            InParanoid:Q32KI1 NextBio:20864462 Uniprot:Q32KI1
        Length = 585

 Score = 223 (83.6 bits), Expect = 8.2e-28, Sum P(3) = 8.2e-28
 Identities = 49/125 (39%), Positives = 72/125 (57%)

Query:    91 YGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGM--- 146
             +G  D+  +G+N I TPNID LA +G++L  ++ A  VCTPSRA+ +TG+YP+ +GM   
Sbjct:    44 FGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPLRSGMVSS 103

Query:   147 QGPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGF 199
              G  +  W     G+P  E    + L++ GY+T  IGKWHLG          + PL  GF
Sbjct:   104 NGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHGF 163

Query:   200 ESHFG 204
             +  +G
Sbjct:   164 DHFYG 168

 Score = 142 (55.0 bits), Expect = 8.2e-28, Sum P(3) = 8.2e-28
 Identities = 39/130 (30%), Positives = 61/130 (46%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y    +++D  VG ++  L  +G+  ++++ F SD+G           Y  W  N  Y+G
Sbjct:   315 YGDNTEEMDWMVGQILDTLDMEGLTNSTLVYFTSDHGGSLEAQLGKEQYGGW--NGIYKG 372

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P + Q  RV  +   + D  PT+    GG+  +  +  DG 
Sbjct:   373 GKGMGGWEGGIRVPGIFRWPGVLQAGRVIHEPTSLMDVFPTVVQLGGGEVPQDRVT-DGR 431

Query:   426 DQWSSLLLNT 435
             D    LLL T
Sbjct:   432 DLLP-LLLGT 440

 Score = 66 (28.3 bits), Expect = 8.2e-28, Sum P(3) = 8.2e-28
 Identities = 22/72 (30%), Positives = 32/72 (44%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELL-------KYHRRTLVPQSHEQPDLVQAD 618
             P  LF+L  DP E + +    PD     Y ++       + HRRTL+P     P  +Q D
Sbjct:   506 PPLLFDLSRDPSEAHALT---PDTEPSFYHVMDTVARAVEEHRRTLIPV----P--LQLD 556

Query:   619 PKRFNDTWSPWI 630
                  + W PW+
Sbjct:   557 T--LGNIWRPWL 566


>UNIPROTKB|G5E629
            symbol:ARSE "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:DAAA02075311 EMBL:DAAA02075312
            EMBL:DAAA02075313 UniGene:Bt.6471 Ensembl:ENSBTAT00000050377
            OMA:VCFQIMA Uniprot:G5E629
        Length = 583

 Score = 217 (81.4 bits), Expect = 1.2e-27, Sum P(3) = 1.2e-27
 Identities = 50/124 (40%), Positives = 72/124 (58%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGM---Q 147
             G  D+  +G+  I TPNID LA +G+ L  ++ A P+CTPSRA+ +TG+YP+ +GM   Q
Sbjct:    45 GIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFLTGRYPLRSGMVSSQ 104

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
             G  +  W A   G+P +E    + L+  GY+T  IGKWHLG          + PL  GF+
Sbjct:   105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLGLSCASPDDHCHHPLNHGFD 164

Query:   201 SHFG 204
               +G
Sbjct:   165 HFYG 168

 Score = 145 (56.1 bits), Expect = 1.2e-27, Sum P(3) = 1.2e-27
 Identities = 38/122 (31%), Positives = 63/122 (51%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN-YRNWGSNYPYR 365
             Y    +++D  VG ++  L  +G+  ++++ F SD+G  ++E R  +N Y  W  N  Y+
Sbjct:   316 YGDNTEEMDWMVGQILETLDTEGLTNSTLVYFTSDHGG-SLEARFGNNQYGGW--NGIYK 372

Query:   366 GVKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
             G K    WEGG++VP I   P +    RV  +   + D  PT+   AGG   +  + +DG
Sbjct:   373 GGKGMAGWEGGIRVPGIFRWPGVLPAGRVIHEPTSLMDIFPTVVHLAGGQVPQDRV-VDG 431

Query:   425 LD 426
              D
Sbjct:   432 RD 433

 Score = 68 (29.0 bits), Expect = 1.2e-27, Sum P(3) = 1.2e-27
 Identities = 19/70 (27%), Positives = 34/70 (48%)

Query:   564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ---PDLVQADPK 620
             + P  LF+L  DP E + +    PD     +++++   R +   +H Q   P  +Q D  
Sbjct:   502 HAPPLLFDLSRDPSEAHALT---PDTEPSFHQVVETVARAVA--AHRQTLIPVPLQLDAA 556

Query:   621 RFNDTWSPWI 630
               ++TW PW+
Sbjct:   557 --DNTWKPWL 564


>UNIPROTKB|P51690
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005795 "Golgi
            stack" evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=TAS] [GO:0004065 "arylsulfatase activity" evidence=TAS]
            [GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0044281
            GO:GO:0046872 GO:GO:0006644 GO:GO:0005795 GO:GO:0005788
            GO:GO:0001501 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687
            KO:K12374 OrthoDB:EOG4V4379 EMBL:X83573 EMBL:AK223183 EMBL:AK223199
            IPI:IPI01014058 PIR:I37187 RefSeq:NP_000038.2 UniGene:Hs.386975
            ProteinModelPortal:P51690 SMR:P51690 IntAct:P51690
            MINT:MINT-1382153 STRING:P51690 PhosphoSite:P51690 DMDM:77416850
            PaxDb:P51690 PRIDE:P51690 DNASU:415 Ensembl:ENST00000381134
            GeneID:415 KEGG:hsa:415 UCSC:uc004crc.4 CTD:415
            GeneCards:GC0XM002846 HGNC:HGNC:719 MIM:300180 MIM:302950
            neXtProt:NX_P51690 Orphanet:79345 PharmGKB:PA25010
            InParanoid:P51690 GenomeRNAi:415 NextBio:1755 ArrayExpress:P51690
            Bgee:P51690 CleanEx:HS_ARSE Genevestigator:P51690
            GermOnline:ENSG00000157399 Uniprot:P51690
        Length = 589

 Score = 216 (81.1 bits), Expect = 1.3e-27, Sum P(3) = 1.3e-27
 Identities = 48/124 (38%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D+  +G+N + TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct:    49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
             G  +  W     G+P  E    + L+E GY+T  IGKWHLG          + PL+ GF+
Sbjct:   109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query:   201 SHFG 204
               +G
Sbjct:   169 HFYG 172

 Score = 148 (57.2 bits), Expect = 1.3e-27, Sum P(3) = 1.3e-27
 Identities = 42/130 (32%), Positives = 64/130 (49%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L  +G+  +++I F SD+G         + Y  W  N  Y+G
Sbjct:   319 YGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGW--NGIYKG 376

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    RV  +   + D  PT+   AGG+  +  + IDG 
Sbjct:   377 GKGMGGWEGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRV-IDGQ 435

Query:   426 DQWSSLLLNT 435
             D    LLL T
Sbjct:   436 DLLP-LLLGT 444

 Score = 66 (28.3 bits), Expect = 1.3e-27, Sum P(3) = 1.3e-27
 Identities = 21/69 (30%), Positives = 33/69 (47%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY----HRRTLVPQSHEQPDLVQADPKR 621
             P  LF+L  DP E + +  +   +  Q+ E ++     H+RTL P     P  +Q D  R
Sbjct:   510 PPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPV----P--LQLD--R 561

Query:   622 FNDTWSPWI 630
               + W PW+
Sbjct:   562 LGNIWRPWL 570


>UNIPROTKB|F5GYY5
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 OMA:PVINRCA IPI:IPI00020005 ProteinModelPortal:F5GYY5
            SMR:F5GYY5 Ensembl:ENST00000545496 UCSC:uc011mhh.2
            ArrayExpress:F5GYY5 Bgee:F5GYY5 Uniprot:F5GYY5
        Length = 614

 Score = 216 (81.1 bits), Expect = 1.7e-27, Sum P(3) = 1.7e-27
 Identities = 48/124 (38%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D+  +G+N + TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct:    74 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
             G  +  W     G+P  E    + L+E GY+T  IGKWHLG          + PL+ GF+
Sbjct:   134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193

Query:   201 SHFG 204
               +G
Sbjct:   194 HFYG 197

 Score = 148 (57.2 bits), Expect = 1.7e-27, Sum P(3) = 1.7e-27
 Identities = 42/130 (32%), Positives = 64/130 (49%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L  +G+  +++I F SD+G         + Y  W  N  Y+G
Sbjct:   344 YGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGW--NGIYKG 401

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    RV  +   + D  PT+   AGG+  +  + IDG 
Sbjct:   402 GKGMGGWEGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRV-IDGQ 460

Query:   426 DQWSSLLLNT 435
             D    LLL T
Sbjct:   461 DLLP-LLLGT 469

 Score = 66 (28.3 bits), Expect = 1.7e-27, Sum P(3) = 1.7e-27
 Identities = 21/69 (30%), Positives = 33/69 (47%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY----HRRTLVPQSHEQPDLVQADPKR 621
             P  LF+L  DP E + +  +   +  Q+ E ++     H+RTL P     P  +Q D  R
Sbjct:   535 PPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPV----P--LQLD--R 586

Query:   622 FNDTWSPWI 630
               + W PW+
Sbjct:   587 LGNIWRPWL 595


>UNIPROTKB|F1MFZ8
            symbol:STS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:DAAA02075641
            EMBL:DAAA02075642 EMBL:DAAA02075643 EMBL:DAAA02075644
            EMBL:DAAA02075645 IPI:IPI00693675 UniGene:Bt.63535
            Ensembl:ENSBTAT00000027703 Uniprot:F1MFZ8
        Length = 578

 Score = 213 (80.0 bits), Expect = 1.1e-26, Sum P(3) = 1.1e-26
 Identities = 49/125 (39%), Positives = 70/125 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ MTG+YP+ +GM    
Sbjct:    33 GIGDPGCYGNKTLRTPNIDRLARGGVKLTQHLAASPLCTPSRAAFMTGRYPVRSGMASQS 92

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH-FG 204
               G  ++ A   G+P +E    + L++ GYST  IGKWHLG    +         SH F 
Sbjct:    93 QVGVFLFSASSGGLPPSEITFAKLLKDQGYSTALIGKWHLGISCHDPGDFCHHPTSHGFD 152

Query:   205 YLNGV 209
             Y +G+
Sbjct:   153 YFHGL 157

 Score = 149 (57.5 bits), Expect = 1.1e-26, Sum P(3) = 1.1e-26
 Identities = 43/133 (32%), Positives = 64/133 (48%)

Query:   306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
             +Y    +++D SVG ++  L    +  N+++ F SD GA   E       +  GSN  Y+
Sbjct:   302 SYGDAAEEMDWSVGQILDVLHELKLANNTLVYFSSDQGAHVEEVTVKGEVQG-GSNGIYK 360

Query:   366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--I 422
             G K   WEGG++VP I+ W   IQ    +     ++ D  PT+   AG   S LP +  I
Sbjct:   361 GGKANNWEGGIRVPGIVRWPGVIQAGLEIDEPTSNM-DIFPTVAKLAG---SPLPQDRVI 416

Query:   423 DGLDQWSSLLLNT 435
             DG D    L + T
Sbjct:   417 DGRDLMPLLQMRT 429

 Score = 59 (25.8 bits), Expect = 1.1e-26, Sum P(3) = 1.1e-26
 Identities = 16/65 (24%), Positives = 27/65 (41%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             P  LF +  DP E+N +    P +  + +E+L+  +      +    D V       N  
Sbjct:   494 PPLLFEISRDPRERNPLT---PTLEPRFWEILEAMQEAAARHARTLQD-VPNQLSLGNLM 549

Query:   626 WSPWI 630
             W PW+
Sbjct:   550 WKPWL 554


>UNIPROTKB|Q5FYA8
            symbol:ARSH "Arylsulfatase H" species:9606 "Homo sapiens"
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0016021 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
            OrthoDB:EOG4V4379 CTD:347527 EMBL:AY875940 IPI:IPI00233062
            RefSeq:NP_001011719.1 UniGene:Hs.351533 HSSP:P08842
            ProteinModelPortal:Q5FYA8 SMR:Q5FYA8 STRING:Q5FYA8 DMDM:74722579
            PRIDE:Q5FYA8 DNASU:347527 Ensembl:ENST00000381130 GeneID:347527
            KEGG:hsa:347527 UCSC:uc011mhj.2 GeneCards:GC0XP002919
            HGNC:HGNC:32488 HPA:HPA050011 MIM:300586 neXtProt:NX_Q5FYA8
            PharmGKB:PA143485308 InParanoid:Q5FYA8 OMA:ATVWKVH
            GenomeRNAi:347527 NextBio:99177 Bgee:Q5FYA8 CleanEx:HS_ARSH
            Genevestigator:Q5FYA8 Uniprot:Q5FYA8
        Length = 562

 Score = 216 (81.1 bits), Expect = 4.5e-26, Sum P(3) = 4.5e-26
 Identities = 59/150 (39%), Positives = 78/150 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G+N + TPNID LA  G+ L  ++ A  +CTPSRA+ +TG+YPI +GM    
Sbjct:    18 GVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPIRSGMVSAY 77

Query:   151 ------IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF---RRE--YTPLYRGF 199
                    W     G+P  E    + L+  GY T  IGKWHLG     R +  Y PL  GF
Sbjct:    78 NLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLGLSCASRNDHCYHPLNHGF 137

Query:   200 ESHFGYLNGVISYYDHILSD-QYSRTVELN 228
                +G   G       +LSD Q S+T EL+
Sbjct:   138 HYFYGVPFG-------LLSDCQASKTPELH 160

 Score = 134 (52.2 bits), Expect = 4.5e-26, Sum P(3) = 4.5e-26
 Identities = 37/119 (31%), Positives = 58/119 (48%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++ AL ++ +  ++++ F SDNG              W  N  Y+G
Sbjct:   289 YGDNVEEMDWMVGKILDALDQERLANHTLVYFTSDNGGHLEPLDGAVQLGGW--NGIYKG 346

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
              K    WEGG++VP I   P + +  RV  +   + D  PTL    GG  S+  + IDG
Sbjct:   347 GKGMGGWEGGIRVPGIFRWPSVLEAGRVINEPTSLMDIYPTLSYIGGGILSQDRV-IDG 404

 Score = 64 (27.6 bits), Expect = 4.5e-26, Sum P(3) = 4.5e-26
 Identities = 20/69 (28%), Positives = 30/69 (43%)

Query:   566 PCYLFNLGNDPCEQNNI-ASSRPDISSQLYEL---LKYHRRTLVPQSHEQPDLVQADPKR 621
             P  LF++  DP E   +   + P   S + ++   ++ HRRTL P     P         
Sbjct:   479 PPLLFDISRDPSEALPLNPDNEPLFDSVIKKMEAAIREHRRTLTPV----PQQFSV---- 530

Query:   622 FNDTWSPWI 630
             FN  W PW+
Sbjct:   531 FNTIWKPWL 539


>UNIPROTKB|G3N2T7
            symbol:ARSH "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:DAAA02075309
            EMBL:DAAA02075310 EMBL:DAAA02075311 Ensembl:ENSBTAT00000063647
            Uniprot:G3N2T7
        Length = 557

 Score = 209 (78.6 bits), Expect = 6.5e-26, Sum P(3) = 6.5e-26
 Identities = 56/150 (37%), Positives = 79/150 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G+N + TPNID LA  G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct:    13 GVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGRYPVRSGMASSS 72

Query:   151 ------IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF--RRE---YTPLYRGF 199
                   +W     G+P  E    + L+  GY T  IGKWH G     R+   Y PL  GF
Sbjct:    73 NLNRDVVWLGGSGGLPPNETTFAKLLQHRGYRTGLIGKWHQGLSCASRDDHCYHPLNHGF 132

Query:   200 ESHFGYLNGVISYYDHILSD-QYSRTVELN 228
             +  +G     + +   +LSD Q  RT EL+
Sbjct:   133 DYFYG-----MPF--ELLSDCQAFRTPELH 155

 Score = 145 (56.1 bits), Expect = 6.5e-26, Sum P(3) = 6.5e-26
 Identities = 40/119 (33%), Positives = 62/119 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+ AL R+ +  ++++ F SDNG   +E ++ S     G N  YRG
Sbjct:   284 YGDNVEEMDWMVGKVLEALDRERLANHTLVYFTSDNGG-RLEAQDRSGQLG-GWNGRYRG 341

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
              +    WEGG++VP I   P + +  +V  +   + D  PTL +  GG    L   IDG
Sbjct:   342 GRGMAGWEGGIRVPGIFRWPTVLEAGKVIDEPTSLMDIFPTL-SYIGGGIPPLGRVIDG 399

 Score = 59 (25.8 bits), Expect = 6.5e-26, Sum P(3) = 6.5e-26
 Identities = 20/84 (23%), Positives = 33/84 (39%)

Query:   551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP---D-ISSQLYELLKYHRRTLVP 606
             G+   P +     + P  LF++  DP E   +        D +  ++   ++ HR TL P
Sbjct:   459 GSGVCPCSGDVTYHDPPLLFDISRDPSESRPLNPDNEALFDAVVKKVEAAVRRHRGTLTP 518

Query:   607 QSHEQPDLVQADPKRFNDTWSPWI 630
                  P  +      FN  W PW+
Sbjct:   519 V----PQQLSV----FNALWKPWL 534


>ZFIN|ZDB-GENE-050320-118
            symbol:arsa "arylsulfatase A" species:7955 "Danio
            rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-050320-118
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 OrthoDB:EOG4MKNG4 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:CR936412
            IPI:IPI00488891 UniGene:Dr.91521 SMR:A5WV48
            Ensembl:ENSDART00000140193 Uniprot:A5WV48
        Length = 503

 Score = 229 (85.7 bits), Expect = 9.4e-26, Sum P(3) = 9.4e-26
 Identities = 47/115 (40%), Positives = 65/115 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
             G+ DL   G     TPN+D LA NG+   + Y   PVC+PSRA+L+TG+Y   +G+    
Sbjct:    35 GYGDLGCFGHPCSLTPNLDRLAANGLRFTDFYVTSPVCSPSRAALLTGRYQTRSGIYPGV 94

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF-FRREYTPLYRGFESHFG 204
             ++     G+PL E  + E L+  GYST  +GKWHLG      Y P   GF+S+ G
Sbjct:    95 LYPGSRGGLPLNETTIAEVLKTQGYSTAIVGKWHLGVGLNGTYLPTRHGFDSYLG 149

 Score = 130 (50.8 bits), Expect = 9.4e-26, Sum P(3) = 9.4e-26
 Identities = 44/141 (31%), Positives = 70/141 (49%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + + D +VG ++  L+  G++ N++I F  DNG P  E    S   N G    
Sbjct:   247 RGPFGDALMEFDGTVGKILQTLEETGVINNTLIFFTGDNG-P--ELMRKSRGGNAGL--- 300

Query:   364 YRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LN 421
              +  K T +EGG++ PAI  W   I+  P V+  +    D LPT    AG     LP + 
Sbjct:   301 MKCGKGTTYEGGMREPAIAHWPGFIK--PGVTRALASSLDILPTFAKLAGAP---LPEVQ 355

Query:   422 IDGLDQWSSLLLNT-PSRRNS 441
             +DG++  + +L N  PS+R +
Sbjct:   356 LDGVEM-TDILFNLGPSKRQT 375

 Score = 47 (21.6 bits), Expect = 9.4e-26, Sum P(3) = 9.4e-26
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPD-ISSQLYELLKYHRRTLV 605
             P  LFNL  DP E  N+   + D +  Q+  + +    ++V
Sbjct:   428 PPLLFNLETDPSENYNLDGDQWDAVRKQIQAVKQQFEASMV 468

 Score = 43 (20.2 bits), Expect = 2.4e-25, Sum P(3) = 2.4e-25
 Identities = 9/18 (50%), Positives = 10/18 (55%)

Query:   583 ASSRPDISSQLYELLKYH 600
             + S PD S  L   LKYH
Sbjct:   409 SESTPDNSCSLLAFLKYH 426


>TIGR_CMR|SPO_3286
            symbol:SPO_3286 "arylsulfatase" species:246200 "Ruegeria
            pomeroyi DSS-3" [GO:0004065 "arylsulfatase activity" evidence=ISS]
            [GO:0006790 "sulfur compound metabolic process" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000031 GenomeReviews:CP000031_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
            RefSeq:YP_168482.1 ProteinModelPortal:Q5LNC6 GeneID:3193868
            KEGG:sil:SPO3286 PATRIC:23380015 Uniprot:Q5LNC6
        Length = 535

 Score = 204 (76.9 bits), Expect = 9.5e-26, Sum P(3) = 9.5e-26
 Identities = 68/186 (36%), Positives = 91/186 (48%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM--QGP 149
             G+ DL   GS EI TPNID LA +G +L  MY    C P+RASL+TG YP + G+   G 
Sbjct:    16 GFADLGCTGS-EIRTPNIDGLARDGALLTAMYNCARCCPTRASLLTGLYPHNAGIGHMGA 74

Query:   150 PIWGAEPRGVPLTE-RFLPEYLRELGYSTKAIGKWHLG--FFRREY-----------TPL 195
              +     RG    +   + E+LR  GY T   GKWH+G  F  RE            TP 
Sbjct:    75 DLGTPAYRGFLRNDCATIAEHLRAAGYRTCMSGKWHVGGDFMAREVDSWRVGDVDHPTPR 134

Query:   196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
              RGF+  +G ++GV     H  S  Y   +E    D R  + T  D    Y TD  T +A
Sbjct:   135 QRGFDRFYGIVDGVT----HFFSPHYM--LE---DDTR--VETFPDDF--YFTDAITDKA 181

Query:   256 VQLIED 261
             + ++E+
Sbjct:   182 IGMVEE 187

 Score = 126 (49.4 bits), Expect = 9.5e-26, Sum P(3) = 9.5e-26
 Identities = 21/38 (55%), Positives = 33/38 (86%)

Query:   306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
             TYAAMV ++D S+GT+++AL+R G  +N++I+F+SDNG
Sbjct:   274 TYAAMVDRMDQSIGTLLAALKRMGQFDNTLILFLSDNG 311

 Score = 81 (33.6 bits), Expect = 9.5e-26, Sum P(3) = 9.5e-26
 Identities = 21/54 (38%), Positives = 28/54 (51%)

Query:   360 SNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             SN P+R  K+ + EGG+  P I  W  +I   P       H+ D LPT+  AAG
Sbjct:   363 SNAPFRKFKHYVHEGGISTPLIAHWPGRIAA-PVPLHAACHVVDILPTILEAAG 415


>UNIPROTKB|F1S6M1
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9823 "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:FP102571
            Ensembl:ENSSSCT00000002935 OMA:HISAGQX ArrayExpress:F1S6M1
            Uniprot:F1S6M1
        Length = 305

 Score = 298 (110.0 bits), Expect = 1.1e-25, P = 1.1e-25
 Identities = 85/269 (31%), Positives = 124/269 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM---Q 147
             GW DL  +G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ PI TG     
Sbjct:    41 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIRTGFYTTN 100

Query:   148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL  GF+  F
Sbjct:   101 GHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLGH-RPQFHPLKHGFDEWF 159

Query:   204 GYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
             G  N     YD+         R  E+ G    +   NL T    +    T ++ +EA+  
Sbjct:   160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIYLQEALDF 215

Query:   259 IEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
             I+ Q                         AP      F  +    R  Y   V+++DDSV
Sbjct:   216 IKRQQA--------THHPFFLYWAIDATHAPVYASRAF--LGTSQRGRYGDAVREIDDSV 265

Query:   319 GTVISALQRKGMLENSIIIFMSDNGAPTV 347
             G ++  L+   +  N+ + F SDNGA  V
Sbjct:   266 GRIVGLLRDLKIAGNTFVFFTSDNGAALV 294


>UNIPROTKB|Q32KK2
            symbol:Arsa "Arylsulfatase A" species:10116 "Rattus
            norvegicus" [GO:0004098 "cerebroside-sulfatase activity"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
            sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 RGD:1310381 GO:GO:0005886
            GO:GO:0005509 GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
            CTD:410 eggNOG:COG3119 GeneTree:ENSGT00560000076940
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA
            OrthoDB:EOG4MKNG4 GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 EMBL:CH474027 EMBL:BN000735 IPI:IPI00361483
            RefSeq:NP_001030105.2 UniGene:Rn.23323 SMR:Q32KK2 IntAct:Q32KK2
            STRING:Q32KK2 Ensembl:ENSRNOT00000017783 GeneID:315222
            KEGG:rno:315222 InParanoid:Q32KK2 NextBio:668936
            Genevestigator:Q32KK2 Uniprot:Q32KK2
        Length = 507

 Score = 205 (77.2 bits), Expect = 1.1e-25, Sum P(3) = 1.1e-25
 Identities = 48/117 (41%), Positives = 64/117 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+ +GM  P
Sbjct:    32 GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRSGMY-P 89

Query:   150 PIWGAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
              + G   +G +PL E  L E L   GY T   GKWHLG      + P ++GF    G
Sbjct:    90 GVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 146

 Score = 143 (55.4 bits), Expect = 1.1e-25, Sum P(3) = 1.1e-25
 Identities = 48/140 (34%), Positives = 74/140 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++A+   G+L  +++IF +DNG P     E     + G +  
Sbjct:   244 RGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNG-P-----ELMRMSDGGCSGL 297

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNI 422
              R  K T +EGGV+ PA+++ P     P V+ ++    D LPTL    G     LP + +
Sbjct:   298 LRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALTGAP---LPNITL 353

Query:   423 DGLDQWSSLLLNT-PSRRNS 441
             DG+D  S LLL T  S RNS
Sbjct:   354 DGVDI-SPLLLGTGKSPRNS 372

 Score = 61 (26.5 bits), Expect = 1.1e-25, Sum P(3) = 1.1e-25
 Identities = 15/58 (25%), Positives = 26/58 (44%)

Query:   545 QATIHCGANPAPMTPSP---CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             Q + H    P P   +      + P  L++L  DP E  N+  S  ++S +  + LK+
Sbjct:   401 QGSAHSDTTPDPACHAANRLTAHEPPLLYDLSKDPGENYNLLDSTEEVSPEALQALKH 458

 Score = 53 (23.7 bits), Expect = 1.9e-16, Sum P(3) = 1.9e-16
 Identities = 30/86 (34%), Positives = 38/86 (44%)

Query:   413 GDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT-PSRRNSVLINI---D 468
             G T  L  ++D L   ++L    P   N  +DG+D  S LLL T  S RNSV       D
Sbjct:   325 GVTHELASSLDLLPTLAALT-GAPLP-NITLDGVDI-SPLLLGTGKSPRNSVFFYPPFPD 381

Query:   469 EKKRTAAVRLDSWKLVLGTQENGTMD 494
             E     AVR   +K    TQ +   D
Sbjct:   382 EIHGVFAVRNGKYKAHFFTQGSAHSD 407


>UNIPROTKB|F1Q1V3
            symbol:STS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026120 EMBL:AAEX03026121
            Ensembl:ENSCAFT00000017942 Uniprot:F1Q1V3
        Length = 594

 Score = 219 (82.2 bits), Expect = 1.4e-25, Sum P(2) = 1.4e-25
 Identities = 52/125 (41%), Positives = 69/125 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ MTG+YPI +GM    
Sbjct:    52 GIGDPGCYGNTTLRTPNIDRLAAEGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASQS 111

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH-FG 204
               G  I+ A   G+P +E    + L+  GYST  IGKWHLG      T       SH F 
Sbjct:   112 FIGVFIFSASSGGLPTSEITFAKLLKNQGYSTALIGKWHLGTNCHNKTDFCHHPLSHGFD 171

Query:   205 YLNGV 209
             Y +G+
Sbjct:   172 YFHGI 176

 Score = 150 (57.9 bits), Expect = 1.4e-25, Sum P(2) = 1.4e-25
 Identities = 44/122 (36%), Positives = 62/122 (50%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y    ++LD SVG +++ L    +  N+++ F SD GA  VE   T    + GSN  Y+G
Sbjct:   322 YGDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAH-VEEVTTKGEVHGGSNGIYKG 380

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDG 424
              K   WEGG+++P IL  P + Q   V  +     D  PT+   AG   S LP +  IDG
Sbjct:   381 GKANNWEGGIRIPGILRWPGVIQAGLVIDEPTSNMDIFPTVAKLAG---SPLPEDRIIDG 437

Query:   425 LD 426
              D
Sbjct:   438 HD 439


>UNIPROTKB|F1NGC8
            symbol:STS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017431 EMBL:AADN02017432
            EMBL:AADN02017433 IPI:IPI00584657 Ensembl:ENSGALT00000026830
            OMA:HTAMFAS Uniprot:F1NGC8
        Length = 471

 Score = 212 (79.7 bits), Expect = 1.5e-25, Sum P(2) = 1.5e-25
 Identities = 48/124 (38%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  DL  +G+  + TP+ID LA  G+ L  ++ A P+CTPSRA+ +TG+YPI +GM    
Sbjct:    49 GIGDLGCYGNRTLRTPHIDRLAKEGVTLTQHIAASPLCTPSRAAFLTGRYPIRSGMAAFS 108

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
               G  ++ A   G+P  E    + L++ GY+T  IGKWHLG       ++   PL  GF+
Sbjct:   109 RVGVFLFSASSGGLPSEEITFSKLLKQRGYATALIGKWHLGMNCESNNDFCHHPLSHGFD 168

Query:   201 SHFG 204
               +G
Sbjct:   169 YFYG 172

 Score = 153 (58.9 bits), Expect = 1.5e-25, Sum P(2) = 1.5e-25
 Identities = 41/123 (33%), Positives = 65/123 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D SVG ++  L+   +   S++ F SD GA  +E   +S   + G N  Y+G
Sbjct:   319 YGDAVEEMDWSVGQILDVLENYNLSNRSLVYFSSDQGAH-IEEISSSGEVHGGCNGIYKG 377

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
              K+T WEGG++VP +L W   I     +     ++ D  PT+   AG   ++LP +  ID
Sbjct:   378 GKSTNWEGGIRVPGLLRWPGVIHAGTYIDDPTSNM-DIFPTIVKLAG---AQLPYDRIID 433

Query:   424 GLD 426
             G D
Sbjct:   434 GHD 436


>RGD|3783
            symbol:Sts "steroid sulfatase (microsomal), isozyme S"
          species:10116 "Rattus norvegicus" [GO:0004773 "steryl-sulfatase
          activity" evidence=IDA] [GO:0005635 "nuclear envelope" evidence=IDA]
          [GO:0005789 "endoplasmic reticulum membrane" evidence=IDA]
          [GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning or
          memory" evidence=IMP] [GO:0008202 "steroid metabolic process"
          evidence=IEA] [GO:0008284 "positive regulation of cell proliferation"
          evidence=IMP] [GO:0008484 "sulfuric ester hydrolase activity"
          evidence=IEA;ISO] [GO:0009268 "response to pH" evidence=IDA]
          [GO:0014070 "response to organic cyclic compound" evidence=IDA]
          [GO:0016021 "integral to membrane" evidence=IDA] [GO:0043231
          "intracellular membrane-bounded organelle" evidence=IDA] [GO:0043434
          "response to peptide hormone stimulus" evidence=IDA] [GO:0043588
          "skin development" evidence=IEP] [GO:0043627 "response to estrogen
          stimulus" evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
          InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
          RGD:3783 GO:GO:0016021 GO:GO:0005635 GO:GO:0043588 GO:GO:0005789
          GO:GO:0008202 GO:GO:0046872 GO:GO:0008284 GO:GO:0043434 GO:GO:0007565
          GO:GO:0009268 GO:GO:0007611 GO:GO:0043627 Gene3D:3.40.720.10
          SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
          HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
          OrthoDB:EOG4V4379 CTD:412 KO:K01131 GO:GO:0004773 EMBL:U37138
          IPI:IPI00210494 PIR:S05414 RefSeq:NP_036793.1 UniGene:Rn.6312
          ProteinModelPortal:P15589 SMR:P15589 STRING:P15589 PRIDE:P15589
          GeneID:24800 KEGG:rno:24800 InParanoid:P15589 BindingDB:P15589
          ChEMBL:CHEMBL3531 NextBio:604458 Genevestigator:P15589
          GermOnline:ENSRNOG00000032487 Uniprot:P15589
        Length = 577

 Score = 197 (74.4 bits), Expect = 1.8e-25, Sum P(3) = 1.8e-25
 Identities = 41/101 (40%), Positives = 61/101 (60%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  DL  +G+  + TP+ID LA  G+ L  ++ A P+CTPSRA+ +TG+YP+ +GM    
Sbjct:    37 GIGDLGCYGNRTLRTPHIDRLALEGVKLTQHLAAAPLCTPSRAAFLTGRYPVRSGMASHG 96

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
               G  ++ A   G+P  E    + L+  GY+T  +GKWHLG
Sbjct:    97 RLGVFLFSASSGGLPPNEVTFAKLLKGQGYTTGLVGKWHLG 137

 Score = 162 (62.1 bits), Expect = 1.8e-25, Sum P(3) = 1.8e-25
 Identities = 45/129 (34%), Positives = 68/129 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D +VG V++ L + G+  N+++   SD+GA  VE    +  R+ GSN  YRG
Sbjct:   307 YGDAVEEMDWAVGQVLATLDKLGLANNTLVYLTSDHGAH-VEELGPNGERHGGSNGIYRG 365

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
              K   WEGG++VP ++ W   I     V     ++ D  PT+   AG +   LP +  ID
Sbjct:   366 GKANTWEGGIRVPGLVRWPGVIVPGQEVEEPTSNM-DVFPTVARLAGAE---LPTDRVID 421

Query:   424 GLDQWSSLL 432
             G D    LL
Sbjct:   422 GRDLMPLLL 430

 Score = 52 (23.4 bits), Expect = 1.8e-25, Sum P(3) = 1.8e-25
 Identities = 14/65 (21%), Positives = 25/65 (38%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             P  LF++  DP E++ +         ++   +    R  V    E P+ +       N  
Sbjct:   498 PPLLFDIARDPRERHPLTPETEPRHGEILRNMDAAARAHVATLEEAPNQLSMS----NVA 553

Query:   626 WSPWI 630
             W PW+
Sbjct:   554 WKPWL 558


>UNIPROTKB|F1Q1V2
            symbol:STS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:AAEX03026120
            EMBL:AAEX03026121 Ensembl:ENSCAFT00000017943 Uniprot:F1Q1V2
        Length = 637

 Score = 219 (82.2 bits), Expect = 1.9e-25, Sum P(2) = 1.9e-25
 Identities = 52/125 (41%), Positives = 69/125 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ MTG+YPI +GM    
Sbjct:    33 GIGDPGCYGNTTLRTPNIDRLAAEGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASQS 92

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH-FG 204
               G  I+ A   G+P +E    + L+  GYST  IGKWHLG      T       SH F 
Sbjct:    93 FIGVFIFSASSGGLPTSEITFAKLLKNQGYSTALIGKWHLGTNCHNKTDFCHHPLSHGFD 152

Query:   205 YLNGV 209
             Y +G+
Sbjct:   153 YFHGI 157

 Score = 150 (57.9 bits), Expect = 1.9e-25, Sum P(2) = 1.9e-25
 Identities = 44/122 (36%), Positives = 62/122 (50%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y    ++LD SVG +++ L    +  N+++ F SD GA  VE   T    + GSN  Y+G
Sbjct:   303 YGDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAH-VEEVTTKGEVHGGSNGIYKG 361

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDG 424
              K   WEGG+++P IL  P + Q   V  +     D  PT+   AG   S LP +  IDG
Sbjct:   362 GKANNWEGGIRIPGILRWPGVIQAGLVIDEPTSNMDIFPTVAKLAG---SPLPEDRIIDG 418

Query:   425 LD 426
              D
Sbjct:   419 HD 420


>RGD|1310381
            symbol:Arsa "arylsulfatase A" species:10116 "Rattus norvegicus"
            [GO:0001669 "acrosomal vesicle" evidence=IDA] [GO:0004065
            "arylsulfatase activity" evidence=IDA] [GO:0005509 "calcium ion
            binding" evidence=ISO] [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0005768 "endosome" evidence=IDA]
            [GO:0005886 "plasma membrane" evidence=ISO] [GO:0006914 "autophagy"
            evidence=IDA] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=ISO] [GO:0007417 "central nervous system development"
            evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
            [GO:0009268 "response to pH" evidence=IDA] [GO:0016021 "integral to
            membrane" evidence=ISO] [GO:0031232 "extrinsic to external side of
            plasma membrane" evidence=IDA] [GO:0043627 "response to estrogen
            stimulus" evidence=IDA] [GO:0045471 "response to ethanol"
            evidence=IDA] [GO:0051597 "response to methylmercury" evidence=IDA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1310381 GO:GO:0005615 GO:GO:0045471 GO:GO:0005768
            GO:GO:0001669 GO:GO:0006914 GO:GO:0007584 GO:GO:0005509
            GO:GO:0007417 GO:GO:0005764 GO:GO:0009268 GO:GO:0007339
            GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0031232
            GO:GO:0051597 HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 IPI:IPI00361483 UniGene:Rn.23323
            EMBL:BC105852 ProteinModelPortal:Q3KR80 SMR:Q3KR80 IntAct:Q3KR80
            STRING:Q3KR80 ArrayExpress:Q3KR80 Genevestigator:Q3KR80
            Uniprot:Q3KR80
        Length = 497

 Score = 205 (77.2 bits), Expect = 2.4e-25, Sum P(3) = 2.4e-25
 Identities = 48/117 (41%), Positives = 64/117 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+ +GM  P
Sbjct:    32 GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRSGMY-P 89

Query:   150 PIWGAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
              + G   +G +PL E  L E L   GY T   GKWHLG      + P ++GF    G
Sbjct:    90 GVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 146

 Score = 143 (55.4 bits), Expect = 2.4e-25, Sum P(3) = 2.4e-25
 Identities = 48/140 (34%), Positives = 74/140 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++A+   G+L  +++IF +DNG P     E     + G +  
Sbjct:   244 RGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNG-P-----ELMRMSDGGCSGL 297

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNI 422
              R  K T +EGGV+ PA+++ P     P V+ ++    D LPTL    G     LP + +
Sbjct:   298 LRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALTGAP---LPNITL 353

Query:   423 DGLDQWSSLLLNT-PSRRNS 441
             DG+D  S LLL T  S RNS
Sbjct:   354 DGVDI-SPLLLGTGKSPRNS 372

 Score = 57 (25.1 bits), Expect = 2.4e-25, Sum P(3) = 2.4e-25
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             P  L++L  DP E  N+  S  ++S +  + LK+
Sbjct:   415 PPLLYDLSKDPGENYNLLDSTEEVSPEALQALKH 448

 Score = 47 (21.6 bits), Expect = 1.8e-15, Sum P(3) = 1.8e-15
 Identities = 21/52 (40%), Positives = 27/52 (51%)

Query:   413 GDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT-PSRRNSV 463
             G T  L  ++D L   ++L    P   N  +DG+D  S LLL T  S RNSV
Sbjct:   325 GVTHELASSLDLLPTLAALT-GAPLP-NITLDGVDI-SPLLLGTGKSPRNSV 373


>UNIPROTKB|F5H324
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 IPI:IPI01015579 ProteinModelPortal:F5H324 SMR:F5H324
            Ensembl:ENST00000540563 UCSC:uc011mhi.2 ArrayExpress:F5H324
            Bgee:F5H324 Uniprot:F5H324
        Length = 544

 Score = 194 (73.4 bits), Expect = 2.8e-25, Sum P(3) = 2.8e-25
 Identities = 44/110 (40%), Positives = 63/110 (57%)

Query:   106 TPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ---GPPI--WGAEPRGV 159
             TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    G  +  W     G+
Sbjct:    18 TPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGASGGL 77

Query:   160 PLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFESHFG 204
             P  E    + L+E GY+T  IGKWHLG          + PL+ GF+  +G
Sbjct:    78 PTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFDHFYG 127

 Score = 148 (57.2 bits), Expect = 2.8e-25, Sum P(3) = 2.8e-25
 Identities = 42/130 (32%), Positives = 64/130 (49%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L  +G+  +++I F SD+G         + Y  W  N  Y+G
Sbjct:   274 YGDNVEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGW--NGIYKG 331

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    RV  +   + D  PT+   AGG+  +  + IDG 
Sbjct:   332 GKGMGGWEGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTVVRLAGGEVPQDRV-IDGQ 390

Query:   426 DQWSSLLLNT 435
             D    LLL T
Sbjct:   391 DLLP-LLLGT 399

 Score = 66 (28.3 bits), Expect = 2.8e-25, Sum P(3) = 2.8e-25
 Identities = 21/69 (30%), Positives = 33/69 (47%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY----HRRTLVPQSHEQPDLVQADPKR 621
             P  LF+L  DP E + +  +   +  Q+ E ++     H+RTL P     P  +Q D  R
Sbjct:   465 PPLLFDLSRDPSETHILTPASEPVFYQVMERVQQAVWEHQRTLSPV----P--LQLD--R 516

Query:   622 FNDTWSPWI 630
               + W PW+
Sbjct:   517 LGNIWRPWL 525


>UNIPROTKB|Q08DD1
            symbol:ARSA "Arylsulfatase A" species:9913 "Bos taurus"
            [GO:0005509 "calcium ion binding" evidence=ISS] [GO:0005764
            "lysosome" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0004098 "cerebroside-sulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005886 GO:GO:0005509 GO:GO:0005764
            GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649 EMBL:BC123816
            IPI:IPI00713745 RefSeq:NP_001068673.1 UniGene:Bt.1076
            ProteinModelPortal:Q08DD1 SMR:Q08DD1 STRING:Q08DD1 PRIDE:Q08DD1
            Ensembl:ENSBTAT00000021364 GeneID:505514 KEGG:bta:505514 CTD:410
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InParanoid:Q08DD1 KO:K01134 OMA:FGPSQMA
            OrthoDB:EOG4MKNG4 NextBio:20867174 GO:GO:0004098 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 Uniprot:Q08DD1
        Length = 507

 Score = 195 (73.7 bits), Expect = 4.7e-25, Sum P(3) = 4.7e-25
 Identities = 47/117 (40%), Positives = 61/117 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+  G+  P
Sbjct:    32 GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGLY-P 89

Query:   150 PIWGAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
              +     RG +PL E  L E L   GY T   GKWHLG      + P + GF    G
Sbjct:    90 GVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKWHLGVGPEGAFLPPHHGFHRFLG 146

 Score = 143 (55.4 bits), Expect = 4.7e-25, Sum P(3) = 4.7e-25
 Identities = 46/134 (34%), Positives = 72/134 (53%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++A+   G+L  +++ F +DNG P     ET    + G +  
Sbjct:   244 RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNG-P-----ETMRMSHGGCSGL 297

Query:   364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LN 421
              R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   ++LP + 
Sbjct:   298 LRCGKGTTFEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---AQLPNIT 352

Query:   422 IDGLDQWSSLLLNT 435
             +DG+D  S LLL T
Sbjct:   353 LDGVDL-SPLLLGT 365

 Score = 66 (28.3 bits), Expect = 4.7e-25, Sum P(3) = 4.7e-25
 Identities = 17/57 (29%), Positives = 30/57 (52%)

Query:   545 QATIHCG--ANPAPMTPSPCT-NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             Q ++H    A+PA    +P T + P  LF+L  DP E  N+  S  +++ +  + +K
Sbjct:   401 QGSVHSDTTADPACHASNPLTAHEPPLLFDLSEDPGENYNLLDSVDEVAPEALQAVK 457


>MGI|MGI:88077
            symbol:Arsa "arylsulfatase A" species:10090 "Mus musculus"
            [GO:0001669 "acrosomal vesicle" evidence=ISO] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=ISO] [GO:0004098 "cerebroside-sulfatase
            activity" evidence=IEA] [GO:0005509 "calcium ion binding"
            evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764 "lysosome"
            evidence=ISO] [GO:0005768 "endosome" evidence=ISO] [GO:0005886
            "plasma membrane" evidence=IDA] [GO:0006914 "autophagy"
            evidence=ISO] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=IMP] [GO:0007417 "central nervous system development"
            evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
            pH" evidence=ISO] [GO:0016021 "integral to membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0031232
            "extrinsic to external side of plasma membrane" evidence=ISO]
            [GO:0043627 "response to estrogen stimulus" evidence=ISO]
            [GO:0045471 "response to ethanol" evidence=ISO] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0051597 "response to methylmercury"
            evidence=ISO] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88077 GO:GO:0016021
            GO:GO:0005886 GO:GO:0005509 GO:GO:0005764 GO:GO:0007339
            EMBL:CH466550 Gene3D:3.40.720.10 SUPFAM:SSF53649 CTD:410
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA OrthoDB:EOG4MKNG4
            GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            EMBL:X73230 EMBL:X73231 EMBL:AK004540 EMBL:AK132501 EMBL:BC011284
            EMBL:BC098075 EMBL:M82876 IPI:IPI00118039 PIR:A54190
            RefSeq:NP_033843.2 UniGene:Mm.620 ProteinModelPortal:P50428
            SMR:P50428 IntAct:P50428 STRING:P50428 PaxDb:P50428 PRIDE:P50428
            Ensembl:ENSMUST00000165199 GeneID:11883 KEGG:mmu:11883
            InParanoid:Q9DC66 SABIO-RK:P50428 NextBio:279915 Bgee:P50428
            CleanEx:MM_ARSA Genevestigator:P50428 GermOnline:ENSMUSG00000022620
            GO:GO:0008484 Uniprot:P50428
        Length = 506

 Score = 205 (77.2 bits), Expect = 5.6e-25, Sum P(3) = 5.6e-25
 Identities = 48/117 (41%), Positives = 64/117 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+ +GM  P
Sbjct:    31 GYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRSGMY-P 88

Query:   150 PIWGAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
              + G   +G +PL E  L E L   GY T   GKWHLG      + P ++GF    G
Sbjct:    89 GVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 145

 Score = 144 (55.7 bits), Expect = 5.6e-25, Sum P(3) = 5.6e-25
 Identities = 48/140 (34%), Positives = 73/140 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++ +   G+LE +++IF +DNG P     E     N G +  
Sbjct:   243 RGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNG-P-----ELMRMSNGGCSGL 296

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNI 422
              R  K T +EGGV+ PA+++ P     P V+ ++    D LPTL    G     LP + +
Sbjct:   297 LRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALTGAP---LPNVTL 352

Query:   423 DGLDQWSSLLLNT-PSRRNS 441
             DG+D  S LLL T  S R S
Sbjct:   353 DGVDI-SPLLLGTGKSPRKS 371

 Score = 53 (23.7 bits), Expect = 5.6e-25, Sum P(3) = 5.6e-25
 Identities = 11/34 (32%), Positives = 18/34 (52%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             P  L++L  DP E  N+  S   +S +  + LK+
Sbjct:   424 PPLLYDLSQDPGENYNVLESIEGVSPEALQALKH 457


>UNIPROTKB|E1BYN0
            symbol:ARSD "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 CTD:414 KO:K12374 OMA:RSWIPSG
            EMBL:AADN02017596 IPI:IPI00570897 RefSeq:XP_416855.2
            ProteinModelPortal:E1BYN0 Ensembl:ENSGALT00000026880 GeneID:418658
            KEGG:gga:418658 NextBio:20821812 Uniprot:E1BYN0
        Length = 596

 Score = 220 (82.5 bits), Expect = 6.1e-25, Sum P(3) = 6.1e-25
 Identities = 51/126 (40%), Positives = 69/126 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  D+  +G+N I TPNID LA  G+ L  ++ A P+CTPSRA+ +TG+YPI +GM    
Sbjct:    53 GIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSGMASSN 112

Query:   151 I-----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG-----FFRREYTPLYRGFE 200
                   W A   G+P  E      L++ GY+T  IGKWH G     F    + PL  GF+
Sbjct:   113 RYRALQWNAGSGGLPANETTFARLLQQQGYTTGLIGKWHQGVNCESFSDHCHHPLNHGFD 172

Query:   201 SHFGYL 206
               +G L
Sbjct:   173 YFYGML 178

 Score = 118 (46.6 bits), Expect = 6.1e-25, Sum P(3) = 6.1e-25
 Identities = 30/108 (27%), Positives = 51/108 (47%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L ++G+  ++   F SD+G        ++    W  N  Y+G
Sbjct:   323 YGDNVEEMDWMVGKILDLLDKEGLKNHTFTYFASDHGGHLEAQDGSAQMGGW--NGIYKG 380

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
              K    WEGG++VP +   P +     V  +   + D  PT+   AGG
Sbjct:   381 GKGMGGWEGGIRVPGVFRWPGVLPAGTVINEPTSLMDIFPTVVHLAGG 428

 Score = 66 (28.3 bits), Expect = 6.1e-25, Sum P(3) = 6.1e-25
 Identities = 24/71 (33%), Positives = 34/71 (47%)

Query:   566 PCYLFNLGNDPCEQNNI-ASSRP--D-ISSQLYELLKYHRRTL--VPQSHEQPDLVQADP 619
             P  L++L  DP E   + A + P  D +  Q+   ++ HRRTL  VPQ   Q  L     
Sbjct:   514 PPLLYDLSRDPSESQPLSADTEPLFDTVIEQIGRAIEEHRRTLTAVPQ---QLSL----- 565

Query:   620 KRFNDTWSPWI 630
               +N  W PW+
Sbjct:   566 --YNIIWKPWL 574


>UNIPROTKB|P15289
            symbol:ARSA "Arylsulfatase A" species:9606 "Homo sapiens"
            [GO:0005509 "calcium ion binding" evidence=IDA] [GO:0004065
            "arylsulfatase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IDA] [GO:0004098 "cerebroside-sulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0043687 "post-translational protein modification" evidence=TAS]
            [GO:0044267 "cellular protein metabolic process" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886
            GO:GO:0044281 GO:GO:0006644 GO:GO:0005509 GO:GO:0005788
            GO:GO:0007339 GO:GO:0043687 GO:GO:0043202 Gene3D:3.40.720.10
            SUPFAM:SSF53649 CTD:410 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 KO:K01134 OrthoDB:EOG4MKNG4 GO:GO:0004098
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 EMBL:X52151
            EMBL:X52150 EMBL:AB448736 EMBL:CR456383 EMBL:AK315011 EMBL:AY271820
            EMBL:U62317 EMBL:BC014210 IPI:IPI00744184 PIR:S11031
            RefSeq:NP_000478.3 RefSeq:NP_001078894.2 RefSeq:NP_001078895.2
            RefSeq:NP_001078896.2 RefSeq:NP_001078897.1 UniGene:Hs.731715
            UniGene:Hs.88251 PDB:1AUK PDB:1E1Z PDB:1E2S PDB:1E33 PDB:1E3C
            PDB:1N2K PDB:1N2L PDB:2AIJ PDB:2AIK PDBsum:1AUK PDBsum:1E1Z
            PDBsum:1E2S PDBsum:1E33 PDBsum:1E3C PDBsum:1N2K PDBsum:1N2L
            PDBsum:2AIJ PDBsum:2AIK ProteinModelPortal:P15289 SMR:P15289
            IntAct:P15289 STRING:P15289 GlycoSuiteDB:P15289 PaxDb:P15289
            PRIDE:P15289 DNASU:410 Ensembl:ENST00000547307
            Ensembl:ENST00000547805 GeneID:410 KEGG:hsa:410 UCSC:uc003bna.4
            GeneCards:GC22M051063 HGNC:HGNC:713 HPA:CAB025183 HPA:HPA005554
            MIM:250100 MIM:272200 MIM:607574 neXtProt:NX_P15289 Orphanet:512
            Orphanet:751 PharmGKB:PA25005 InParanoid:P15289 PhylomeDB:P15289
            BRENDA:3.1.6.8 ChEMBL:CHEMBL2193 DrugBank:DB01141
            EvolutionaryTrace:P15289 GenomeRNAi:410 NextBio:1725
            PMAP-CutDB:P15289 ArrayExpress:P15289 Bgee:P15289 CleanEx:HS_ARSA
            Genevestigator:P15289 GermOnline:ENSG00000100299 GO:GO:0004065
            GO:GO:0006687 Uniprot:P15289
        Length = 507

 Score = 195 (73.7 bits), Expect = 9.5e-25, Sum P(3) = 9.5e-25
 Identities = 45/116 (38%), Positives = 61/116 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+  GM   
Sbjct:    32 GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGMYPG 90

Query:   150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
              +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct:    91 VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 146

 Score = 156 (60.0 bits), Expect = 9.5e-25, Sum P(3) = 9.5e-25
 Identities = 52/141 (36%), Positives = 74/141 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VGT+++A+   G+LE +++IF +DNG P     ET      G +  
Sbjct:   244 RGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNG-P-----ETMRMSRGGCSGL 297

Query:   364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LN 421
              R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG     LP + 
Sbjct:   298 LRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAGAP---LPNVT 352

Query:   422 IDGLDQWSSLLLNT-PSRRNS 441
             +DG D  S LLL T  S R S
Sbjct:   353 LDGFDL-SPLLLGTGKSPRQS 372

 Score = 50 (22.7 bits), Expect = 9.5e-25, Sum P(3) = 9.5e-25
 Identities = 16/50 (32%), Positives = 25/50 (50%)

Query:   552 ANPAPMTPSPCT-NGPCYLFNLGNDPCEQNN----IASSRPDISSQLYEL 596
             A+PA    S  T + P  L++L  DP E  N    +A + P++   L +L
Sbjct:   410 ADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQL 459


>MGI|MGI:98438
            symbol:Sts "steroid sulfatase" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0004773
            "steryl-sulfatase activity" evidence=ISO] [GO:0005635 "nuclear
            envelope" evidence=ISO] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005789 "endoplasmic reticulum membrane"
            evidence=ISO] [GO:0006629 "lipid metabolic process" evidence=IEA]
            [GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning
            or memory" evidence=ISO] [GO:0008152 "metabolic process"
            evidence=ISO] [GO:0008202 "steroid metabolic process" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISO] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISO] [GO:0009268 "response to pH" evidence=ISO]
            [GO:0014070 "response to organic cyclic compound" evidence=ISO]
            [GO:0016020 "membrane" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=ISO] [GO:0043627 "response to estrogen stimulus"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 MGI:MGI:98438 GO:GO:0016021 GO:GO:0005789
            GO:GO:0008202 GO:GO:0046872 GO:GO:0007565 Gene3D:3.40.720.10
            SUPFAM:SSF53649 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 CTD:412 KO:K01131
            GO:GO:0004773 EMBL:U37545 IPI:IPI00118038 RefSeq:NP_033319.1
            UniGene:Mm.423011 ProteinModelPortal:P50427 SMR:P50427
            PhosphoSite:P50427 PRIDE:P50427 GeneID:20905 KEGG:mmu:20905
            NextBio:299773 CleanEx:MM_STS Genevestigator:P50427 Uniprot:P50427
        Length = 624

 Score = 202 (76.2 bits), Expect = 9.8e-25, Sum P(2) = 9.8e-25
 Identities = 48/124 (38%), Positives = 69/124 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  DL  +G+  + TP++D LA  G+ L  ++ A P+CTPSRA+ +TG+YP  +GM    
Sbjct:    46 GIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTGRYPPRSGMAAHG 105

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
               G  ++ A   G+P +E  +   L+  GY+T  IGKWHLG   R  T     PL  GF+
Sbjct:   106 RVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGATDFCHHPLRHGFD 165

Query:   201 SHFG 204
                G
Sbjct:   166 RFLG 169

 Score = 161 (61.7 bits), Expect = 9.8e-25, Sum P(2) = 9.8e-25
 Identities = 44/122 (36%), Positives = 64/122 (52%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V++AL   G+   +++ F SD+GA  VE       R  GSN  +RG
Sbjct:   316 YGDSVEEMDWGVGRVLAALDELGLARETLVYFTSDHGAH-VEELGPRGERMGGSNGVFRG 374

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDG 424
              K   WEGGV+VP ++  P+     RV  +   + D  PT+   AG +   LP +  IDG
Sbjct:   375 GKGNNWEGGVRVPCLVRWPRELSPGRVVAEPTSLMDVFPTVARLAGAE---LPGDRVIDG 431

Query:   425 LD 426
              D
Sbjct:   432 RD 433


>UNIPROTKB|I3LBW8
            symbol:STS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:FP102981
            EMBL:FP339595 Ensembl:ENSSSCT00000032160 Uniprot:I3LBW8
        Length = 579

 Score = 205 (77.2 bits), Expect = 1.1e-24, Sum P(2) = 1.1e-24
 Identities = 49/125 (39%), Positives = 69/125 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ +TG+YPI +GM    
Sbjct:    34 GIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGRYPIRSGMAAQN 93

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH-FG 204
               G  I+ A   G+P +E    + L+  GY+T  IGKWHLG      +       SH F 
Sbjct:    94 QVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLGTNCHNSSDFCHHPLSHGFD 153

Query:   205 YLNGV 209
             Y +G+
Sbjct:   154 YFHGI 158

 Score = 156 (60.0 bits), Expect = 1.1e-24, Sum P(2) = 1.1e-24
 Identities = 53/162 (32%), Positives = 73/162 (45%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y    +++D SVG ++  L    +  N++I F SD GA   E          GSN  Y+G
Sbjct:   304 YGDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHG-GSNGIYKG 362

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K T WEGG++VP IL W   IQ    +     ++ D  PT+   AG       + IDG 
Sbjct:   363 GKATNWEGGIRVPGILRWPGVIQAGLELDAPTSNM-DLFPTVANLAGAPLPEDRI-IDGR 420

Query:   426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS--RRNSVLI 465
             D    LLL    R  S+ + L  + +  LN      RNS  I
Sbjct:   421 DLMP-LLLGQSQR--SDHEFLFHYCNFYLNAVRWHPRNSTSI 459


>UNIPROTKB|K7GLQ3
            symbol:STS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 EMBL:FP102981 EMBL:FP339595
            Ensembl:ENSSSCT00000035627 Uniprot:K7GLQ3
        Length = 580

 Score = 205 (77.2 bits), Expect = 1.2e-24, Sum P(2) = 1.2e-24
 Identities = 49/125 (39%), Positives = 69/125 (55%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D   +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ +TG+YPI +GM    
Sbjct:    35 GIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGRYPIRSGMAAQN 94

Query:   148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH-FG 204
               G  I+ A   G+P +E    + L+  GY+T  IGKWHLG      +       SH F 
Sbjct:    95 QVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLGTNCHNSSDFCHHPLSHGFD 154

Query:   205 YLNGV 209
             Y +G+
Sbjct:   155 YFHGI 159

 Score = 156 (60.0 bits), Expect = 1.2e-24, Sum P(2) = 1.2e-24
 Identities = 53/162 (32%), Positives = 73/162 (45%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y    +++D SVG ++  L    +  N++I F SD GA   E          GSN  Y+G
Sbjct:   305 YGDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHG-GSNGIYKG 363

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K T WEGG++VP IL W   IQ    +     ++ D  PT+   AG       + IDG 
Sbjct:   364 GKATNWEGGIRVPGILRWPGVIQAGLELDAPTSNM-DLFPTVANLAGAPLPEDRI-IDGR 421

Query:   426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS--RRNSVLI 465
             D    LLL    R  S+ + L  + +  LN      RNS  I
Sbjct:   422 DLMP-LLLGQSQR--SDHEFLFHYCNFYLNAVRWHPRNSTSI 460


>RGD|1304917
            symbol:Arse "arylsulfatase E (chondrodysplasia punctata 1)"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1304917
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 KO:K12374 CTD:415
            OMA:CHIVALA EMBL:BN000737 IPI:IPI00367421 RefSeq:NP_001041350.1
            UniGene:Rn.79118 STRING:Q32KK0 Ensembl:ENSRNOT00000033080
            GeneID:310326 KEGG:rno:310326 UCSC:RGD:1304917 InParanoid:Q32KK0
            NextBio:661844 Genevestigator:Q32KK0 Uniprot:Q32KK0
        Length = 611

 Score = 210 (79.0 bits), Expect = 2.1e-24, Sum P(2) = 2.1e-24
 Identities = 52/123 (42%), Positives = 67/123 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLMTGKYPIHTGM---Q 147
             G  DL  +G+  I TPNID LA +G+ L   + A+ VCTPSRA+ +TG+YPI +GM    
Sbjct:    46 GIGDLGCYGNTSIRTPNIDRLAEDGVRLTQYLAAESVCTPSRAAFLTGRYPIRSGMTSGN 105

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
             G  +  W A   G+P  E      L+  GY T  +GKWHLG   R  +     PL  GF 
Sbjct:   106 GHRVLQWAAGAGGLPPKEITFARILQGQGYVTGLVGKWHLGLSCRTVSDLCHHPLNHGFH 165

Query:   201 SHF 203
              HF
Sbjct:   166 -HF 167

 Score = 149 (57.5 bits), Expect = 2.1e-24, Sum P(2) = 2.1e-24
 Identities = 41/123 (33%), Positives = 66/123 (53%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  L+ +G+ +++++ F SDNGA  +E  +    +  GSN  +RG
Sbjct:   321 YGDNVEEMDWVVGQILEVLEHEGLTDSTLVHFTSDNGA-WLE-AQAGGEQLGGSNGVFRG 378

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
              K    WEGG++VP +   P +    RV  Q + + D  PT+    GG    LP +  ID
Sbjct:   379 GKGMGGWEGGIRVPGVFRWPGVLPRGRVLDQPVSLMDVFPTVVRLGGGV---LPSDREID 435

Query:   424 GLD 426
             G D
Sbjct:   436 GRD 438


>UNIPROTKB|P54793
            symbol:ARSF "Arylsulfatase F" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
            OrthoDB:EOG4V4379 EMBL:X97868 EMBL:AC112653 EMBL:BC022389
            IPI:IPI00008405 PIR:A56217 RefSeq:NP_001188467.1
            RefSeq:NP_001188468.1 RefSeq:NP_004033.2 UniGene:Hs.101674
            ProteinModelPortal:P54793 SMR:P54793 IntAct:P54793 STRING:P54793
            PhosphoSite:P54793 DMDM:259016386 PaxDb:P54793 PRIDE:P54793
            Ensembl:ENST00000359361 Ensembl:ENST00000381127
            Ensembl:ENST00000537104 GeneID:416 KEGG:hsa:416 UCSC:uc004cre.2
            CTD:416 GeneCards:GC0XP002978 H-InvDB:HIX0016636 HGNC:HGNC:721
            HPA:HPA000549 MIM:300003 neXtProt:NX_P54793 PharmGKB:PA25012
            InParanoid:P54793 OMA:LKPCCGV PhylomeDB:P54793 GenomeRNAi:416
            NextBio:1759 Bgee:P54793 CleanEx:HS_ARSF Genevestigator:P54793
            GermOnline:ENSG00000062096 Uniprot:P54793
        Length = 590

 Score = 193 (73.0 bits), Expect = 2.4e-24, Sum P(3) = 2.4e-24
 Identities = 48/124 (38%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G++ + TP+ID LA  G+ L  ++ A  +C+PSR++ +TG+YPI +GM    
Sbjct:    41 GIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSSG 100

Query:   151 ----IWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRRE--YTPLYRGFE 200
                 I   A P G+PL E  L   L++ GYST  IGKWH G     R +  + P   GF+
Sbjct:   101 NRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGFD 160

Query:   201 SHFG 204
              ++G
Sbjct:   161 YYYG 164

 Score = 144 (55.7 bits), Expect = 2.4e-24, Sum P(3) = 2.4e-24
 Identities = 39/124 (31%), Positives = 64/124 (51%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++ A+   G+  N+++ F SD+G      R  +    W  N  Y+G
Sbjct:   311 YGDNVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGGHLEARRGHAQLGGW--NGIYKG 368

Query:   367 VKNTL-WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--I 422
              K    WEGG++VP I+ W  ++    R+  +   + D LPT+ + +GG    LP +  I
Sbjct:   369 GKGMGGWEGGIRVPGIVRWPGKVPAG-RLIKEPTSLMDILPTVASVSGGS---LPQDRVI 424

Query:   423 DGLD 426
             DG D
Sbjct:   425 DGRD 428

 Score = 64 (27.6 bits), Expect = 2.4e-24, Sum P(3) = 2.4e-24
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query:   564 NGPCYLFNLGNDPCEQNNIA-SSRP--D-ISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
             + P  LF+L  DP E   +  ++ P  D +  ++   LK H+ T+VP +++  +L Q
Sbjct:   501 HNPPLLFDLSRDPSESTPLTPATEPLHDFVIKKVANALKEHQETIVPVTYQLSELNQ 557


>TIGR_CMR|CPS_2985
            symbol:CPS_2985 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_269685.1 ProteinModelPortal:Q47ZT4 STRING:Q47ZT4
            GeneID:3523028 KEGG:cps:CPS_2985 PATRIC:21468987 OMA:RNEFLPT
            BioCyc:CPSY167879:GI48-3034-MONOMER Uniprot:Q47ZT4
        Length = 502

 Score = 184 (69.8 bits), Expect = 5.2e-24, Sum P(2) = 5.2e-24
 Identities = 41/111 (36%), Positives = 68/111 (61%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             YA  + + D  VG ++  + + G+ EN+II++ +DNGA    + +       G   P+RG
Sbjct:   259 YADGMVEHDGHVGQLLDKIDKLGIAENTIIMYTTDNGAELALWPD-------GGYTPFRG 311

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
              KNT WEGG +VP ++ W+ +I+ N +VS +M+ + DW+PT+   AG DT+
Sbjct:   312 EKNTNWEGGYRVPMMVKWAGKIKPN-QVSNEMISLIDWMPTILAVAG-DTN 360

 Score = 170 (64.9 bits), Expect = 5.2e-24, Sum P(2) = 5.2e-24
 Identities = 39/101 (38%), Positives = 55/101 (54%)

Query:   106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
             TPNID +A  GII  + Y    CT  RA  +TG++P+ TG+    + GA+  G+   +  
Sbjct:    59 TPNIDRIANEGIIFTDSYGDQSCTAGRAGFITGQHPMRTGLTKVGLPGAK-EGLNKKDPT 117

Query:   166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
             + E L+  GY T   GK HLG  + E+ P   GF+  FG L
Sbjct:   118 IAELLKPHGYMTGQFGKNHLGD-QDEHLPTNHGFDEFFGNL 157


>UNIPROTKB|F6PN86
            symbol:ARSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OrthoDB:EOG4V4379 OMA:LKPCCGV
            EMBL:AAEX03026108 Ensembl:ENSCAFT00000017756 Uniprot:F6PN86
        Length = 584

 Score = 210 (79.0 bits), Expect = 6.1e-24, Sum P(3) = 6.1e-24
 Identities = 56/145 (38%), Positives = 75/145 (51%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL   G++ I TPNID LA  G+ LN+ + A  +CTPSRA+ +TG+YPI +GM    
Sbjct:    41 GIGDLGCFGNDTIRTPNIDRLAREGVQLNHHIAAASMCTPSRAAFLTGRYPIRSGMVSNA 100

Query:   151 I------WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF-FRREYTPLYRGFESHF 203
             +       GA P G+P  E      L++ GYST  IGKWH G   +  Y   +  +   F
Sbjct:   101 VDRVIITLGA-PAGLPHNETTFAALLKKQGYSTALIGKWHQGLNCQSRYDQCHHPYHYGF 159

Query:   204 GYLNGV-ISYYDHILSDQYSRTVEL 227
              Y  G+  +  D    D  SR  EL
Sbjct:   160 DYYYGMPFTLIDPCWPDP-SRDTEL 183

 Score = 131 (51.2 bits), Expect = 6.1e-24, Sum P(3) = 6.1e-24
 Identities = 37/130 (28%), Positives = 64/130 (49%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             ++I       Y   V+++D  VG ++ A+    +   +++ F SD+G   +E R   + R
Sbjct:   301 EFIGTSKHGLYGDNVQEMDSMVGKILDAIDNFHLKNRTLVYFTSDHGGH-LESRVGHSQR 359

Query:   357 NWGSNYPYRGVKNTL-WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
               G N  YRG K    WEGG++VP ++ WS ++    +V  +   + D  PTL   +G  
Sbjct:   360 G-GWNGIYRGGKGMAGWEGGIRVPGLIRWSGRLPAG-KVIEEPTSLMDIFPTLAAVSGSS 417

Query:   415 TSRLPLNIDG 424
               +  + IDG
Sbjct:   418 VPQDRV-IDG 426

 Score = 54 (24.1 bits), Expect = 6.1e-24, Sum P(3) = 6.1e-24
 Identities = 16/58 (27%), Positives = 28/58 (48%)

Query:   566 PCYLFNLGNDPCEQNNIAS-SRP--DISSQ-LYELLKYHRRTLVPQSHEQPDLVQADP 619
             P  LF+L  DP E   +   + P  D+  Q +   +K HR++++P   +  +L    P
Sbjct:   502 PPLLFDLTRDPSESTPLTQDTEPLYDVVIQTVANAVKEHRKSILPVQQQLSELNYDSP 559


>UNIPROTKB|F6PKZ1
            symbol:ARSA "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 OMA:FGPSQMA OrthoDB:EOG4MKNG4 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03007117
            Ensembl:ENSCAFT00000000876 Uniprot:F6PKZ1
        Length = 508

 Score = 191 (72.3 bits), Expect = 9.3e-24, Sum P(3) = 9.3e-24
 Identities = 43/115 (37%), Positives = 59/115 (51%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             G+ DL  +G     TPN+D LA  G+   + Y    +CTPSRA+L+TG+ P+  G+    
Sbjct:    33 GYGDLGCYGHPSSATPNLDQLAAGGLRFTDFYVPTSLCTPSRAALLTGRLPVRMGLYPGV 92

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHFG 204
             +      G+PL E  L E L   GY T   GKWHLG      + P ++GF    G
Sbjct:    93 LEPGSRGGLPLEEVTLAEVLAARGYLTGIAGKWHLGVGPDGAFLPPHQGFHRFLG 147

 Score = 137 (53.3 bits), Expect = 9.3e-24, Sum P(3) = 9.3e-24
 Identities = 45/134 (33%), Positives = 71/134 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++A+   G+L  +++IF +DNG P     ET    + G +  
Sbjct:   245 RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNG-P-----ETMRMSHGGCSGL 298

Query:   364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LN 421
              R  K T ++GGV+ PA+  W   I   P V+ ++    D LPTL +  G     LP + 
Sbjct:   299 LRCGKGTTFDGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLASLTGAP---LPNVT 353

Query:   422 IDGLDQWSSLLLNT 435
             +DG+D  S LLL T
Sbjct:   354 LDGVDL-SPLLLGT 366

 Score = 64 (27.6 bits), Expect = 9.3e-24, Sum P(3) = 9.3e-24
 Identities = 17/57 (29%), Positives = 26/57 (45%)

Query:   545 QATIHCGANPAPM--TPSPCT-NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             Q + H    P P     SP T + P  LF+L  DP E  N+     +++ +  + LK
Sbjct:   402 QGSPHSDTTPDPACHASSPLTAHEPPLLFDLSEDPGENYNLLGGMAEVAPEALQALK 458


>UNIPROTKB|F1PYB4
            symbol:ARSD "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:RSWIPSG EMBL:AAEX03026107
            EMBL:AAEX03026106 Ensembl:ENSCAFT00000017716 Uniprot:F1PYB4
        Length = 597

 Score = 194 (73.4 bits), Expect = 1.7e-23, Sum P(3) = 1.7e-23
 Identities = 45/124 (36%), Positives = 67/124 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G++ + TPNID LA  G+ L  ++ A P+CTPSR+S +TG++   +GM+   
Sbjct:    55 GIGDLGCYGNSTLRTPNIDRLAEEGVRLTQHLAAAPLCTPSRSSFLTGRHSFRSGMEAHD 114

Query:   151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
                   W     G+P  E      L++ GY+T  IGKWH G     R ++   PL  GF+
Sbjct:   115 GYRALQWNGASGGLPENETTFARILQQQGYATGLIGKWHQGVNCESRTDHCHHPLNHGFD 174

Query:   201 SHFG 204
               +G
Sbjct:   175 YFYG 178

 Score = 133 (51.9 bits), Expect = 1.7e-23, Sum P(3) = 1.7e-23
 Identities = 37/129 (28%), Positives = 59/129 (45%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             Q++       Y   V+++D  VG V++A++  G+   +   F SD+G       E     
Sbjct:   315 QFLGKSQHGLYGDNVEEMDWLVGEVLNAIEENGLKNTTFTYFTSDHGGHLEARDERGQLG 374

Query:   357 NWGSNYPYRGVKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
              W  N  +RG K    WEGG++VP I   P +    RV  +   + D  PT+    GG+ 
Sbjct:   375 GW--NGIFRGGKGMGGWEGGIRVPGIFRWPGVLPAGRVIHEPTSLMDVFPTVVQLGGGEV 432

Query:   416 SRLPLNIDG 424
              +  + IDG
Sbjct:   433 PQDRV-IDG 440

 Score = 66 (28.3 bits), Expect = 1.7e-23, Sum P(3) = 1.7e-23
 Identities = 24/85 (28%), Positives = 35/85 (41%)

Query:   551 GANPAPMTPSPCT-NGPCYLFNLGNDPCEQNNIA-SSRP---DISSQLYELLKYHRRTLV 605
             G    P +    T + P  LF L  DP E   ++  S P    + +Q+ E ++ HR+TL 
Sbjct:   500 GRGVCPCSGDGVTQHSPPLLFELSRDPSEARPLSPDSEPLYNMVVAQVGEAVEQHRKTLS 559

Query:   606 PQSHEQPDLVQADPKRFNDTWSPWI 630
             P        V       N  W PW+
Sbjct:   560 P--------VPTQFSLSNIIWKPWL 576

 Score = 46 (21.3 bits), Expect = 1.9e-21, Sum P(3) = 1.9e-21
 Identities = 20/65 (30%), Positives = 27/65 (41%)

Query:   557 MTPS--PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK--YHRRTLVPQSHEQP 612
             MTP   P   G CY    G  PC  + +    P +   L+EL +     R L P S    
Sbjct:   486 MTPRFHPKGAGACY--GRGVCPCSGDGVTQHSPPL---LFELSRDPSEARPLSPDSEPLY 540

Query:   613 DLVQA 617
             ++V A
Sbjct:   541 NMVVA 545


>TIGR_CMR|CPS_0660
            symbol:CPS_0660 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_267410.1 ProteinModelPortal:Q488V4 STRING:Q488V4
            GeneID:3519819 KEGG:cps:CPS_0660 PATRIC:21464645 OMA:NISAYTH
            BioCyc:CPSY167879:GI48-747-MONOMER Uniprot:Q488V4
        Length = 525

 Score = 174 (66.3 bits), Expect = 1.3e-22, Sum P(2) = 1.3e-22
 Identities = 37/108 (34%), Positives = 59/108 (54%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
             HG     T NID +A  G++  + Y +  CT  RA+ +TG+YP+ TG+    + G++ +G
Sbjct:    59 HGMMGYKTTNIDRIAKEGVLFTDYYGENSCTAGRAAFITGQYPVRTGLTKVGLPGSD-KG 117

Query:   159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
             +   +  + E L++ GY T   GK HLG  + E+ P   GF+   G L
Sbjct:   118 LRAEDVTIAELLKDRGYVTGQFGKNHLGD-KDEFLPTNHGFDEFLGNL 164

 Score = 168 (64.2 bits), Expect = 1.3e-22, Sum P(2) = 1.3e-22
 Identities = 35/99 (35%), Positives = 55/99 (55%)

Query:   315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
             D  VG ++  L R  + +N+I+++ +DNGA    + +       G   P++G KNT WEG
Sbjct:   275 DYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPD-------GGTIPFKGEKNTTWEG 327

Query:   375 GVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             G +VPA++ W  +I       ++M+   DW PTL  AAG
Sbjct:   328 GFRVPAMVRWPGKITAGD-AKIEMVSHMDWAPTLLAAAG 365


>UNIPROTKB|P51689
            symbol:ARSD "Arylsulfatase D" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0044281 GO:GO:0046872 GO:GO:0006644
            GO:GO:0005764 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 EMBL:X83572
            EMBL:AF160499 EMBL:AC005295 EMBL:BC020229 IPI:IPI00019989
            IPI:IPI00028695 IPI:IPI00914575 PIR:I37186 RefSeq:NP_001660.2
            UniGene:Hs.528631 ProteinModelPortal:P51689 SMR:P51689
            STRING:P51689 DMDM:212276422 PaxDb:P51689 PRIDE:P51689 DNASU:414
            Ensembl:ENST00000381154 GeneID:414 KEGG:hsa:414 UCSC:uc004cqy.3
            CTD:414 GeneCards:GC0XM002818 HGNC:HGNC:717 HPA:HPA004694
            MIM:300002 neXtProt:NX_P51689 PharmGKB:PA25008 InParanoid:P51689
            KO:K12374 OMA:RSWIPSG OrthoDB:EOG4V4379 ChiTaRS:ARSD GenomeRNAi:414
            NextBio:1749 Bgee:P51689 CleanEx:HS_ARSD Genevestigator:P51689
            GermOnline:ENSG00000006756 Uniprot:P51689
        Length = 593

 Score = 199 (75.1 bits), Expect = 1.4e-22, Sum P(4) = 1.4e-22
 Identities = 47/124 (37%), Positives = 67/124 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  DL  +G+N + TPNID LA  G+ L  ++ A P+CTPSRA+ +TG++   +GM    
Sbjct:    52 GTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAAFLTGRHSFRSGMDASN 111

Query:   151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
                   W A   G+P  E      L++ GY+T  IGKWH G     R ++   PL  GF+
Sbjct:   112 GYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQGVNCASRGDHCHHPLNHGFD 171

Query:   201 SHFG 204
               +G
Sbjct:   172 YFYG 175

 Score = 128 (50.1 bits), Expect = 1.4e-22, Sum P(4) = 1.4e-22
 Identities = 37/120 (30%), Positives = 61/120 (50%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET-SNYRNWGSNYPYR 365
             Y   V+++D  +G V++A++  G+  ++   F SD+G   +E R+  S    W  N  Y+
Sbjct:   322 YGDNVEEMDWLIGKVLNAIEDNGLKNSTFTYFTSDHGGH-LEARDGHSQLGGW--NGIYK 378

Query:   366 GVKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
             G K    WEGG++VP I   P +    RV  +   + D  PT+    GG+  +  + IDG
Sbjct:   379 GGKGMGGWEGGIRVPGIFHWPGVLPAGRVIGEPTSLMDVFPTVVQLVGGEVPQDRV-IDG 437

 Score = 57 (25.1 bits), Expect = 1.4e-22, Sum P(4) = 1.4e-22
 Identities = 21/68 (30%), Positives = 29/68 (42%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF--- 622
             P  LF+L  DP E   +    PD S  LY  +       V + H Q   +   P++F   
Sbjct:   513 PPLLFDLSRDPSEARPLT---PD-SEPLYHAVIARVGAAVSE-HRQT--LSPVPQQFSMS 565

Query:   623 NDTWSPWI 630
             N  W PW+
Sbjct:   566 NILWKPWL 573

 Score = 37 (18.1 bits), Expect = 1.4e-22, Sum P(4) = 1.4e-22
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query:   232 MRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
             MR +  T    V E    L  KEAV  IE
Sbjct:   259 MRNHDVTEQPMVLEKTASLMLKEAVSYIE 287

 Score = 37 (18.1 bits), Expect = 1.5e-05, Sum P(3) = 1.5e-05
 Identities = 12/29 (41%), Positives = 13/29 (44%)

Query:     1 MRRNLSTAWDTVGEYATDLFTKEAVQLIE 29
             MR +  T    V E    L  KEAV  IE
Sbjct:   259 MRNHDVTEQPMVLEKTASLMLKEAVSYIE 287


>TIGR_CMR|CPS_3032
            symbol:CPS_3032 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 RefSeq:YP_269731.1
            ProteinModelPortal:Q47ZN9 STRING:Q47ZN9 GeneID:3518391
            KEGG:cps:CPS_3032 PATRIC:21469075 HOGENOM:HOG000135355 OMA:RWNDWKA
            BioCyc:CPSY167879:GI48-3081-MONOMER Uniprot:Q47ZN9
        Length = 522

 Score = 183 (69.5 bits), Expect = 2.8e-22, Sum P(2) = 2.8e-22
 Identities = 54/193 (27%), Positives = 90/193 (46%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             YA  + + D+ VG ++  L    + +N+I+I+ +DNGA T  + +       G N P+ G
Sbjct:   259 YADGMLEHDEHVGVLLDKLDDLKIADNTIVIYTTDNGAETFTWPD-------GGNTPFHG 311

Query:   367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K T +EGG++VP ++ W   I+   +++  M HI DW+PTL  A G DT    L   G 
Sbjct:   312 EKGTTYEGGMRVPQLVRWPGTIKPGSKMNSMMSHI-DWMPTLAAAMGNDTLVADLKKGGE 370

Query:   426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK--- 482
                    +N    R  ++DG +                ++   +  +  A+R + WK   
Sbjct:   371 -------INNKKWR-VHLDGFNFKPYFAGEVDKGPRETIMYFSQSGQLNAIRWNDWKASF 422

Query:   483 -LVLGTQENGTMD 494
              LV G   +GT +
Sbjct:   423 ALVKGDMASGTRE 435

 Score = 155 (59.6 bits), Expect = 2.8e-22, Sum P(2) = 2.8e-22
 Identities = 36/108 (33%), Positives = 55/108 (50%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
             HG     TPNID +A  G +  + YAQ  CT  R++ + G+ P  TG+    + G+   G
Sbjct:    52 HGMMGYQTPNIDRIANEGALFTDQYAQQSCTAGRSAFILGQEPFRTGLLTIGMPGST-HG 110

Query:   159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
             +P     + +  ++ GY T   GK HLG  + ++ P   GF+  FG L
Sbjct:   111 IPDWAPTIGDVAKDNGYMTAQFGKNHLGD-QDKHLPTKHGFDEFFGNL 157


>UNIPROTKB|E1BU03
            symbol:ARSG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0006790 "sulfur compound metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:AADN02030038
            EMBL:AADN02030039 EMBL:AADN02030040 IPI:IPI00574852
            ProteinModelPortal:E1BU03 Ensembl:ENSGALT00000006665 OMA:SDEYIYW
            Uniprot:E1BU03
        Length = 505

 Score = 212 (79.7 bits), Expect = 5.4e-22, Sum P(3) = 5.4e-22
 Identities = 47/114 (41%), Positives = 65/114 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    TP++D LA  G    + + A   C+PSRASL+TG+  +  G+    
Sbjct:    39 GWGDLGANWAETKETPHLDELAAEGTRFVDFHSAASTCSPSRASLLTGRLGVRNGVTHN- 97

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                +   G+PL E  L E LR  GYST AIGKWHLG     + P++RGF+ +FG
Sbjct:    98 FAISSVGGLPLNETTLAEVLRAAGYSTAAIGKWHLGHHGHHH-PIFRGFDYYFG 150

 Score = 116 (45.9 bits), Expect = 5.4e-22, Sum P(3) = 5.4e-22
 Identities = 43/136 (31%), Positives = 61/136 (44%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET---- 352
             Q    P+R  Y A ++++D  VG V       G   ++++ F  DNG P ++   T    
Sbjct:   239 QIAPPPDRGIYGAALREMDALVGHVKHLADSCGK-GSTLLWFTGDNG-PWMQKSPTQGTL 296

Query:   353 SNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             S   +     P    K T WEGG +VPA+ + P      R S  M+   D  PTL   AG
Sbjct:   297 SALLSLAGGSP---AKQTTWEGGHRVPALAYWPGHVPAKRSSHAMLSTLDVFPTLVALAG 353

Query:   413 GDTSRLPLN--IDGLD 426
                + LP N   DG+D
Sbjct:   354 ---ATLPPNRRFDGMD 366

 Score = 44 (20.5 bits), Expect = 5.4e-22, Sum P(3) = 5.4e-22
 Identities = 22/77 (28%), Positives = 31/77 (40%)

Query:   440 NSNIDGLDQWSSLLLNTPSRRNSVLI--NIDEKKRTAAV---RLDSWKLVLGTQENGTMD 494
             N   DG+D  S +L       + VL+  N     +  AV   RL  +K    T      D
Sbjct:   359 NRRFDGMDV-SPVLFGLSDVGHKVLLHPNSGAAGKDGAVEALRLAQYKAFYTTGGAKACD 417

Query:   495 GYYGQTRSNKVPLLNFN 511
             G  G    ++ PL+ FN
Sbjct:   418 GSIGPEEHHRPPLI-FN 433


>ZFIN|ZDB-GENE-081104-120
            symbol:arsh "arylsulfatase H" species:7955 "Danio
            rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-081104-120 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12374 EMBL:CR407703 EMBL:FP236869
            IPI:IPI00506361 RefSeq:XP_003199313.1 Ensembl:ENSDART00000032992
            GeneID:100332997 KEGG:dre:100332997 Uniprot:F8VNP0
        Length = 583

 Score = 195 (73.7 bits), Expect = 1.0e-21, Sum P(3) = 1.0e-21
 Identities = 46/124 (37%), Positives = 70/124 (56%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIIL-NNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G  D+  +G+  I TPNID LA +G+ L +++ A P+CTPSR + MTG+YP+  GM    
Sbjct:    45 GIGDIGCYGNTTIRTPNIDRLASDGVKLTHHLSAAPLCTPSRTAFMTGRYPLRAGMGSTG 104

Query:   151 -----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRRE--YTPLYRGFE 200
                  ++ A   G+P  E    + L++ GY+T  +GKWHLG     R +  + P   GF+
Sbjct:   105 RVQVILFLAGSGGLPPNETTFAKLLQKQGYTTGIVGKWHLGVNCESRSDLCHHPNNHGFD 164

Query:   201 SHFG 204
               +G
Sbjct:   165 FFYG 168

 Score = 130 (50.8 bits), Expect = 1.0e-21, Sum P(3) = 1.0e-21
 Identities = 42/134 (31%), Positives = 65/134 (48%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+  + R G+ E +++ F SD+G       E      W  N  YRG
Sbjct:   315 YGDNVEEVDWMVGRVVDTIDRLGLTEKTLLYFTSDHGGGI----EEGPRGGW--NGIYRG 368

Query:   367 VKNTL-WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
              K    W+GG++VP I  W  ++     V+ +   + D  PT+   AGG+  +  L +DG
Sbjct:   369 GKAMGGWDGGIRVPGIFRWPGRLAAGREVA-EPTSLMDVFPTVVKLAGGELPKDRL-LDG 426

Query:   425 LDQWSSLLLNTPSR 438
              D    LL  + SR
Sbjct:   427 HDLMP-LLEGSSSR 439

 Score = 50 (22.7 bits), Expect = 1.0e-21, Sum P(3) = 1.0e-21
 Identities = 12/52 (23%), Positives = 24/52 (46%)

Query:   559 PSPCTNGPCYLFNLGNDPCEQNNIASSR----PDISSQLYELLKYHRRTLVP 606
             P    + P  +F + +DP E   +        P++  ++   ++ HRR+L P
Sbjct:   495 PHVTYHSPPLVFLISSDPSESVPLTEQTDPRVPEVLQRVQRAVEEHRRSLTP 546


>UNIPROTKB|F5H325
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HGNC:HGNC:4122 ChiTaRS:Galns IPI:IPI00978346
            ProteinModelPortal:F5H325 SMR:F5H325 Ensembl:ENST00000542788
            ArrayExpress:F5H325 Bgee:F5H325 Uniprot:F5H325
        Length = 447

 Score = 257 (95.5 bits), Expect = 9.7e-21, Sum P(2) = 9.7e-21
 Identities = 81/261 (31%), Positives = 118/261 (45%)

Query:   158 GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHIL 217
             G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+  FG  N     YD+  
Sbjct:    41 GIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKA 99

Query:   218 SDQYS--RTVELNGH---DMRRNLSTAWDTVGEY-ATDLFTKEAVQLIEDQPVDKPXXXX 271
                    R  E+ G    +   NL T     GE   T ++ +EA+  I+ Q    P    
Sbjct:   100 RPNIPVYRDWEMVGRYYEEFPINLKT-----GEANLTQIYLQEALDFIKRQARHHPFFLY 154

Query:   272 XXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                             AP      F  +    R  Y   V+++DDS+G ++  LQ   + 
Sbjct:   155 WAVDAT---------HAPVYASKPF--LGTSQRGRYGDAVREIDDSIGKILELLQDLHVA 203

Query:   332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
             +N+ + F SDNGA  +   E       GSN P+   K T +EGG++ PA+ W P      
Sbjct:   204 DNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAG 258

Query:   392 RVSLQMMHISDWLPTLYTAAG 412
             +VS Q+  I D   T    AG
Sbjct:   259 QVSHQLGSIMDLFTTSLALAG 279

 Score = 55 (24.4 bits), Expect = 9.7e-21, Sum P(2) = 9.7e-21
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query:   569 LFNLGNDPCEQN--NIASSR-PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
             +F+LG DP E+   + AS+   +  S++  +++ H+  LVP    QP L   +    N  
Sbjct:   366 IFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPA---QPQLNVCNWAVMN-- 420

Query:   626 WSP 628
             W+P
Sbjct:   421 WAP 423


>UNIPROTKB|F1RV22
            symbol:ARSG "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12381 OMA:LPQDRHF GO:GO:0006790
            EMBL:FP085458 EMBL:FP085465 EMBL:FP067366 RefSeq:XP_003131311.1
            UniGene:Ssc.62110 Ensembl:ENSSSCT00000018790 GeneID:100521576
            KEGG:ssc:100521576 Uniprot:F1RV22
        Length = 525

 Score = 272 (100.8 bits), Expect = 1.1e-20, P = 1.1e-20
 Identities = 136/504 (26%), Positives = 201/504 (39%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D LA  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct:    47 GWGDLGANWAETKDTANLDKLAAEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                    G+PL E  L E L+  GY T  IGKWHLG     + P +RGF+ +FG     I
Sbjct:   106 FAVTSVGGLPLNETTLAEVLQRAGYITGMIGKWHLGH-HGSFHPNFRGFDYYFG-----I 159

Query:   211 SY-YDHILSD----QYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
              Y +D   +D     Y         H   RNL    D   + A  L+  E + ++E QPV
Sbjct:   160 PYSHDMGCTDTPGYNYPPCPACPRRHQPSRNLER--DCYSDVALPLY--ENLNIVE-QPV 214

Query:   265 DKPXXXXXXXXXXXXXXXXXXXEA-P----------QETINQFQYITDP-NRRTYAAMVK 312
             +                        P             +++ Q    P +RR YAA ++
Sbjct:   215 NLSGLARKYAEKATQFIQQARASGRPFLLYVGLAHMHVPLSRPQRSAGPWDRRPYAAGLR 274

Query:   313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN-----YRNWGSNYPYRGV 367
             ++D  VG +   + R     N+ + F  DNG P  +  E +        +W S       
Sbjct:   275 EMDRLVGQIKDKVDRTAK-NNTFLWFTGDNG-PWAQKCELAGSVGPFLGSWQSRQGGSPA 332

Query:   368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDG 424
             K T WEGG +VPA+  W  ++  N   S  ++ + D  PT+   AG     LP   + DG
Sbjct:   333 KQTTWEGGHRVPALAYWPGRVPTNV-TSTALLSVLDIFPTVVALAGAS---LPGDRHFDG 388

Query:   425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
             LD    L    P+               +L  P   NS     D    T  VRL+ +K  
Sbjct:   389 LDASEVLFGGAPAGHR------------VLFHP---NSGAAGEDGALET--VRLERYKAF 431

Query:   485 LGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNI-DKMRST- 542
               T      DG  G  + ++ PL+ FN   ++     L++ S      +  + + +R   
Sbjct:   432 YVTGGAKACDGSVGPEQHHEPPLI-FNLDDDAAEAAPLERGSAEYQRVLPKVREALRGVL 490

Query:   543 RQQATIHCGANPAPMTPS--PCTN 564
             R  A  H         PS  PC N
Sbjct:   491 RDIADDHISRADYTRDPSVTPCCN 514


>UNIPROTKB|Q32KH9
            symbol:ARSG "Arylsulfatase G" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=ISS] [GO:0004065
            "arylsulfatase activity" evidence=ISS] [GO:0006790 "sulfur compound
            metabolic process" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 EMBL:AAEX02034846 EMBL:BN000758
            RefSeq:NP_001041563.1 UniGene:Cfa.37363 ProteinModelPortal:Q32KH9
            STRING:Q32KH9 Ensembl:ENSCAFT00000017623 GeneID:480460
            KEGG:cfa:480460 CTD:22901 InParanoid:Q32KH9 KO:K12381 OMA:LPQDRHF
            OrthoDB:EOG4J9MZJ NextBio:20855470 GO:GO:0006790 Uniprot:Q32KH9
        Length = 535

 Score = 193 (73.0 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
 Identities = 44/114 (38%), Positives = 62/114 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct:    47 GWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                    G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG
Sbjct:   106 FAVTSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLGH-HGPYHPNFRGFDYYFG 158

 Score = 129 (50.5 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
 Identities = 38/129 (29%), Positives = 60/129 (46%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN-----W 358
             RR Y A ++++D  VG +   + R    EN+ + F  DNG P  +  E +         W
Sbjct:   266 RRPYGAGLREMDSLVGQIKDKVDRTAK-ENTFLWFTGDNG-PWAQKCELAGSVGPFTGLW 323

Query:   359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
              ++      K T WEGG +VPA+  W  ++  N   S  ++ + D  PT+   AG    +
Sbjct:   324 QTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNV-TSTALLSVLDIFPTVVALAGASLPQ 382

Query:   418 LPLNIDGLD 426
                + DGLD
Sbjct:   383 -DRHFDGLD 390


>TIGR_CMR|CPS_2983
            symbol:CPS_2983 "putative arylsulfatase" species:167879
            "Colwellia psychrerythraea 34H" [GO:0004065 "arylsulfatase
            activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 OMA:DDQVGIL
            HOGENOM:HOG000135355 RefSeq:YP_269683.1 ProteinModelPortal:Q47ZT6
            STRING:Q47ZT6 GeneID:3520535 KEGG:cps:CPS_2983 PATRIC:21468983
            BioCyc:CPSY167879:GI48-3032-MONOMER Uniprot:Q47ZT6
        Length = 522

 Score = 162 (62.1 bits), Expect = 1.4e-20, Sum P(3) = 1.4e-20
 Identities = 41/117 (35%), Positives = 61/117 (52%)

Query:    92 GWNDLSFH--GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
             G+ ++S +  G     TPNID +A  G +  + YAQ  CT  RAS + G+ P  TG+   
Sbjct:    43 GYYNISAYNQGMMGYQTPNIDRIADEGALFTHHYAQQSCTAGRASFILGQEPFRTGLLTI 102

Query:   150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
              + G+   G+P     + + L+E GY T   GK HLG  + ++ P   GF+  FG L
Sbjct:   103 GMPGST-HGIPDWTPTIADLLKEKGYMTAQFGKNHLGD-QDKHLPTNHGFDEFFGNL 157

 Score = 157 (60.3 bits), Expect = 1.4e-20, Sum P(3) = 1.4e-20
 Identities = 35/99 (35%), Positives = 54/99 (54%)

Query:   315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
             DD VG ++  L    + +N+I+I+ +DNGA    + +       G   P+RG K T  EG
Sbjct:   267 DDQVGILLDKLDDLKIADNTIVIYSTDNGAEKFTWPD-------GGTSPFRGEKGTTTEG 319

Query:   375 GVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             G++VP ++ W   I+   + +  M H  DW+PTL  AAG
Sbjct:   320 GMRVPQLVRWPGTIKAGSKFNNMMSH-EDWMPTLLAAAG 357

 Score = 46 (21.3 bits), Expect = 1.4e-20, Sum P(3) = 1.4e-20
 Identities = 8/22 (36%), Positives = 11/22 (50%)

Query:   475 AVRLDSWKLVLGTQENGTMDGY 496
             AVR + WK+    +E G    Y
Sbjct:   412 AVRWNEWKIAFAEEEGGISTAY 433


>ZFIN|ZDB-GENE-030717-5
            symbol:sts "steroid sulfatase (microsomal),
            arylsulfatase C, isozyme S" species:7955 "Danio rerio" [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030717-5
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:CT990606
            EMBL:BX901898 IPI:IPI00963580 Ensembl:ENSDART00000075252
            ArrayExpress:F1Q8F9 Bgee:F1Q8F9 Uniprot:F1Q8F9
        Length = 587

 Score = 189 (71.6 bits), Expect = 2.3e-20, Sum P(2) = 2.3e-20
 Identities = 50/130 (38%), Positives = 69/130 (53%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHT---GM- 146
             G  DL  +G+  + TPNID LA  G+ L  ++ A P+CTPSRA+ +TG+YPI +   GM 
Sbjct:    45 GIGDLGCYGNTTLRTPNIDRLALEGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSDAKGMA 104

Query:   147 ----QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFES 201
                  G  ++ A   G+P  E    + ++  GYST  I GKWHLG    + +       S
Sbjct:   105 AHGHMGVFLFSASSGGLPQEEITFAKAVKVQGYSTAVIVGKWHLGLNCEDSSDHCHHPNS 164

Query:   202 H-FGYLNGVI 210
             H F Y  G I
Sbjct:   165 HGFDYFYGTI 174

 Score = 132 (51.5 bits), Expect = 2.3e-20, Sum P(2) = 2.3e-20
 Identities = 35/120 (29%), Positives = 58/120 (48%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V ++D SVG ++  L+R  + +N+++   SD G P +E        + G +  Y+ 
Sbjct:   316 YGDAVMEVDWSVGQIMQTLERLNLKDNTLVYMTSDQG-PHLEEISVHGEMHGGYSGIYKA 374

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
              K+T WEGG+++P IL  P +     +  +     D  PT+   AG       + IDG D
Sbjct:   375 GKSTNWEGGIRIPGILSWPGVLPAGNIIDEPTSNMDIFPTVLNLAGASIPDDRV-IDGHD 433

 Score = 38 (18.4 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query:   405 PTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
             P LY  +   T   PL+ D   Q+ S+L
Sbjct:   508 PLLYDLSKDPTESTPLSPDTEPQFHSVL 535


>TIGR_CMR|CPS_2984
            symbol:CPS_2984 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_269684.1 ProteinModelPortal:Q47ZT5 STRING:Q47ZT5
            GeneID:3520029 KEGG:cps:CPS_2984 PATRIC:21468985 OMA:NGPHANT
            BioCyc:CPSY167879:GI48-3033-MONOMER Uniprot:Q47ZT5
        Length = 512

 Score = 167 (63.8 bits), Expect = 3.4e-20, Sum P(2) = 3.4e-20
 Identities = 39/108 (36%), Positives = 58/108 (53%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
             HG     TPNID +A  G++  + YA   CT  R++ +TG+  + TGM    + GA+  G
Sbjct:    47 HGIMGFKTPNIDRIAKEGMMFTDYYADQSCTAGRSTFITGQSGLRTGMTKVGLPGAK-EG 105

Query:   159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
             +   +  + E L+  GY+T   GK HLG  + E+ P   GF+  FG L
Sbjct:   106 IQDRDITIAEMLKAKGYTTGQFGKNHLGD-KDEHLPSNHGFDEFFGNL 152

 Score = 152 (58.6 bits), Expect = 3.4e-20, Sum P(2) = 3.4e-20
 Identities = 48/180 (26%), Positives = 89/180 (49%)

Query:   315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
             D+ VG ++  L +  + +++I+++ +DNG   V Y   + + + G   P+ G KN+  EG
Sbjct:   262 DNHVGMMLDQLDKLKVTDSTIVMYSTDNG---VHY---NTWPDAGIT-PFDGEKNSEKEG 314

Query:   375 GVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
               +VP ++ W  +I+    VS +MM   DW+PTL  AA GDT      + G  ++     
Sbjct:   315 AYRVPMMVRWPGKIKAG-EVSNEMMAHLDWMPTL-AAAAGDTKLKEDMLKGKRRFG---- 368

Query:   434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
             N  S+   ++DG +    L   T     ++   ++++    A+R+  WK+V       T+
Sbjct:   369 NKQSK--IHLDGYNMLPHLTGKTEKSPRNIYHYLNDEGFPVAIRIGDWKMVYAENRGKTL 426


>CGD|CAL0006319
            symbol:orf19.1608 species:5476 "Candida albicans" [GO:0005634
            "nucleus" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 CGD:CAL0006319 EMBL:AACQ01000014 EMBL:AACQ01000013
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 KO:K01130
            RefSeq:XP_721567.1 RefSeq:XP_721687.1 ProteinModelPortal:Q5AJI4
            GeneID:3636617 GeneID:3636713 KEGG:cal:CaO19.1608
            KEGG:cal:CaO19.9176 Uniprot:Q5AJI4
        Length = 588

 Score = 194 (73.4 bits), Expect = 3.8e-20, Sum P(3) = 3.8e-20
 Identities = 60/195 (30%), Positives = 96/195 (49%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGM--- 146
             G+ DLS  G  EI TPN++ LA   NG+ L + +    C+P+R+ L++G      G+   
Sbjct:    22 GFTDLSPFGG-EINTPNLNKLATGANGVRLTDFHTASACSPTRSMLLSGTDNHIAGLGQM 80

Query:   147 -----QGPPIWGAEP--RGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
                  + P  +  +P   G  L ++   LPE L++ GY T   GKWHLG  ++ Y P  R
Sbjct:    81 AEFAQRHPEKFNNQPGYEGY-LNDKVVALPEILQDNGYHTFISGKWHLGL-KKPYWPNKR 138

Query:   198 GFESHFGYLNGVISYYDHILSDQYSRTVE----LNGHDMRRNLSTAWDTVGE-YATDLFT 252
             GF   F  L G  ++Y +I  D     +     +   D +  L    +   + Y+T+ FT
Sbjct:   139 GFNKSFTLLPGAGNHYKYITRDSQGNQIPFLPAIYVEDDKELLQPEIELPDDFYSTNYFT 198

Query:   253 KEAVQLIEDQPVDKP 267
              +A++ I++ P  KP
Sbjct:   199 DKAIEFIKETPQGKP 213

 Score = 103 (41.3 bits), Expect = 3.8e-20, Sum P(3) = 3.8e-20
 Identities = 21/40 (52%), Positives = 29/40 (72%)

Query:   305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
             +TYAAMV+ LD+++G +I  L     L N+ I+FMSDNGA
Sbjct:   294 QTYAAMVEILDENIGRLIDHLNSIDELNNTFILFMSDNGA 333

 Score = 63 (27.2 bits), Expect = 3.8e-20, Sum P(3) = 3.8e-20
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query:   544 QQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             QQA I  G+  A   P P       LFN+  DP E N+++ S  +  + L ELL +
Sbjct:   495 QQA-IRKGSFKAIYIPKPFGPEKWQLFNIIEDPGEINDLSESSSEYQTILNELLDH 549

 Score = 55 (24.4 bits), Expect = 2.9e-05, Sum P(3) = 2.9e-05
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query:    15 YATDLFTKEAVQLIEDQPVDKP 36
             Y+T+ FT +A++ I++ P  KP
Sbjct:   192 YSTNYFTDKAIEFIKETPQGKP 213


>POMBASE|SPBPB10D8.02c
            symbol:SPBPB10D8.02c "arylsulfatase (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0004065 "arylsulfatase
            activity" evidence=ISS] [GO:0005634 "nucleus" evidence=IDA]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PomBase:SPBPB10D8.02c GO:GO:0005829
            GO:GO:0005634 GO:GO:0046872 EMBL:CU329671 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006790 KO:K01130
            RefSeq:NP_595046.1 HSSP:P51691 ProteinModelPortal:Q9C0V7 SMR:Q9C0V7
            STRING:Q9C0V7 EnsemblFungi:SPBPB10D8.02c.1 GeneID:2541396
            KEGG:spo:SPBPB10D8.02c HOGENOM:HOG000135353 OMA:IEWTNIS
            OrthoDB:EOG4DJP4T NextBio:20802503 Uniprot:Q9C0V7
        Length = 554

 Score = 195 (73.7 bits), Expect = 4.3e-20, Sum P(3) = 4.3e-20
 Identities = 51/129 (39%), Positives = 68/129 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM----- 146
             GW+D+S  GS EI TPNI+ LA  G+ L N +    C+P+R+ L++G      G+     
Sbjct:    23 GWSDVSPFGS-EIHTPNIERLAKEGVRLTNFHTASACSPTRSMLLSGTDNHIAGLGQMAE 81

Query:   147 ---QGPPIWGAEP--RGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
                +   +WG +P   G  L +R   LPE L+E GY T   GKWHLG     Y P  RGF
Sbjct:    82 TVRRFSKVWGGKPGYEGY-LNDRVAALPEILQEAGYYTTMSGKWHLGLTPDRY-PSKRGF 139

Query:   200 ESHFGYLNG 208
             +  F  L G
Sbjct:   140 KESFALLPG 148

 Score = 106 (42.4 bits), Expect = 4.3e-20, Sum P(3) = 4.3e-20
 Identities = 22/38 (57%), Positives = 29/38 (76%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
             YAAMV+ LD ++G VI  L+  G L+N+ +IFMSDNGA
Sbjct:   293 YAAMVELLDLNIGRVIDYLKTIGELDNTFVIFMSDNGA 330

 Score = 69 (29.3 bits), Expect = 2.7e-16, Sum P(3) = 2.7e-16
 Identities = 30/119 (25%), Positives = 55/119 (46%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS-----NYRN--W- 358
             Y   + +LD++    +S    +G +  +I +  +    P V+Y + S     NY +  W 
Sbjct:   310 YLKTIGELDNTFVIFMSDNGAEGSVLEAIPVLSTK---PPVKYFDNSLENLGNYNSFIWY 366

Query:   359 GSNY------PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
             G  +      P R  K  + EGG++ PAI+  P + +   +S + + + D LPT+   A
Sbjct:   367 GPRWAQAATAPSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTILELA 425

 Score = 57 (25.1 bits), Expect = 4.3e-20, Sum P(3) = 4.3e-20
 Identities = 19/60 (31%), Positives = 24/60 (40%)

Query:   545 QATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYEL-LKYHRRT 603
             Q  I  G   A   P         L++L  D  E  N+A   PDI ++L E  L Y   T
Sbjct:   471 QRAIRKGNYKAIYVPKEGIKTEWELYDLSQDKGELENLAKVHPDILNELIEYWLVYEAET 530

 Score = 38 (18.4 bits), Expect = 2.4e-11, Sum P(2) = 2.4e-11
 Identities = 9/25 (36%), Positives = 12/25 (48%)

Query:   537 DKMRSTRQQATIHCGANPAPMTPSP 561
             D +R  R QA    G  P  + P+P
Sbjct:   242 DVLRKNRLQAQKDLGLIPENVIPAP 266


>MGI|MGI:1921258 [details] [associations]
            symbol:Arsg "arylsulfatase G" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0006790
            "sulfur compound metabolic process" evidence=ISO] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 MGI:MGI:1921258 GO:GO:0005783 GO:GO:0005615
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ GO:GO:0006790 EMBL:AK018132
            EMBL:AK158726 EMBL:AL645791 EMBL:BC022158 EMBL:BC039629
            EMBL:BC084731 EMBL:AK173082 EMBL:BN000747 IPI:IPI00135805
            IPI:IPI00648999 RefSeq:NP_001159649.1 RefSeq:NP_082986.3
            UniGene:Mm.482224 ProteinModelPortal:Q3TYD4 SMR:Q3TYD4
            STRING:Q3TYD4 PaxDb:Q3TYD4 PRIDE:Q3TYD4 Ensembl:ENSMUST00000020928
            Ensembl:ENSMUST00000106696 Ensembl:ENSMUST00000106697 GeneID:74008
            KEGG:mmu:74008 UCSC:uc007mcn.1 UCSC:uc007mcp.2 InParanoid:B1AT67
            OMA:GNTFLWF NextBio:339520 Bgee:Q3TYD4 CleanEx:MM_ARSG
            Genevestigator:Q3TYD4 GermOnline:ENSMUSG00000020604 Uniprot:Q3TYD4
        Length = 525

 Score = 194 (73.4 bits), Expect = 4.8e-20, Sum P(2) = 4.8e-20
 Identities = 44/114 (38%), Positives = 62/114 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct:    47 GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                    G+P+ E  L E LR+ GY T  IGKWHLG     Y P +RGF+ +FG
Sbjct:   106 FAVTSVGGLPVNETTLAEVLRQEGYVTAMIGKWHLGH-HGSYHPNFRGFDYYFG 158

 Score = 122 (48.0 bits), Expect = 4.8e-20, Sum P(2) = 4.8e-20
 Identities = 39/137 (28%), Positives = 64/137 (46%)

Query:   299 ITDPNRRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN--- 354
             +  P R++ Y A ++++D  VG +   +      EN+++ F  DNG P  +  E +    
Sbjct:   260 LAHPQRQSLYRASLREMDSLVGQIKDKVDHVAR-ENTLLWFTGDNG-PWAQKCELAGSVG 317

Query:   355 --YRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
               +  W ++      K T WEGG +VPA+  W  ++  N   S  ++ + D  PT+   A
Sbjct:   318 PFFGLWQTHQGGSPTKQTTWEGGHRVPALAYWPGRVPANV-TSTALLSLLDIFPTVIALA 376

Query:   412 GGDTSRLPLN--IDGLD 426
             G     LP N   DG D
Sbjct:   377 GAS---LPPNRKFDGRD 390


>RGD|1306571 [details] [associations]
            symbol:Arsg "arylsulfatase G" species:10116 "Rattus norvegicus"
            [GO:0004065 "arylsulfatase activity" evidence=ISO;ISS] [GO:0005615
            "extracellular space" evidence=IEA;ISO] [GO:0005764 "lysosome"
            evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA;ISO] [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1306571 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ
            GO:GO:0006790 EMBL:AABR03073953 EMBL:AABR03074766 EMBL:AABR03075952
            EMBL:AABR03076519 EMBL:AABR03076696 EMBL:BN000738 IPI:IPI00361303
            RefSeq:NP_001041342.1 UniGene:Rn.221856 ProteinModelPortal:Q32KJ9
            PRIDE:Q32KJ9 Ensembl:ENSRNOT00000005257 GeneID:303631
            KEGG:rno:303631 InParanoid:Q32KJ9 OMA:WHYPHYS NextBio:651782
            Genevestigator:Q32KJ9 GermOnline:ENSRNOG00000003931 Uniprot:Q32KJ9
        Length = 526

 Score = 196 (74.1 bits), Expect = 5.0e-20, Sum P(3) = 5.0e-20
 Identities = 44/114 (38%), Positives = 62/114 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct:    47 GWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTHN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                    G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG
Sbjct:   106 FAVTSVGGLPLNETTLAEVLQQAGYVTAMIGKWHLGH-HGSYHPSFRGFDYYFG 158

 Score = 123 (48.4 bits), Expect = 5.0e-20, Sum P(3) = 5.0e-20
 Identities = 39/137 (28%), Positives = 65/137 (47%)

Query:   299 ITDP-NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
             + +P ++R Y A ++++D  VG +   +      EN+++ F  DNG P  +  E +    
Sbjct:   260 LANPQSQRLYRASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNG-PWAQKCELAGSMG 317

Query:   358 -----WGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                  W ++      K T WEGG +VPA+  W  ++  N   S  ++ + D  PT+   A
Sbjct:   318 PFSGLWQTHQGGSPAKQTTWEGGHRVPALAYWPGRVPVNV-TSTALLSLLDIFPTVIALA 376

Query:   412 GGDTSRLPLN--IDGLD 426
             G     LP N   DG+D
Sbjct:   377 GAS---LPPNRKFDGVD 390

 Score = 43 (20.2 bits), Expect = 0.00078, Sum P(3) = 0.00078
 Identities = 8/21 (38%), Positives = 11/21 (52%)

Query:    89 IVYGWNDLSFHGSNEIPTPNI 109
             ++Y + D S  G    P PNI
Sbjct:    18 LLYPFVDFSISGETRAPRPNI 38

 Score = 37 (18.1 bits), Expect = 5.0e-20, Sum P(3) = 5.0e-20
 Identities = 13/54 (24%), Positives = 22/54 (40%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL---VPQSHEQPDLVQADP 619
             +FNL +D  E + +    P+    L ++ +     L      +  Q D  Q DP
Sbjct:   455 IFNLEDDAAESSPLQKGSPEYQELLPKVTRVLADVLQDIADDNSSQADYTQ-DP 507


>ZFIN|ZDB-GENE-060503-154 [details] [associations]
            symbol:arsg "arylsulfatase G" species:7955 "Danio
            rerio" [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-060503-154 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:CR926135 EMBL:CABZ01038699
            EMBL:CABZ01038700 IPI:IPI00502628 Ensembl:ENSDART00000091423
            Bgee:F1QQI9 Uniprot:F1QQI9
        Length = 526

 Score = 191 (72.3 bits), Expect = 6.4e-20, Sum P(2) = 6.4e-20
 Identities = 42/115 (36%), Positives = 64/115 (55%)

Query:    92 GWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGP 149
             GW DL  +   N  PTP +D+L   G    + ++    C+PSRAS++TG++ +  G+   
Sbjct:    46 GWGDLWLNRPDNSTPTPWLDSLMLKGKRFTDFHSPASTCSPSRASILTGRHGLRNGVTHN 105

Query:   150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                G+   G+PL E    + L + GY T  IGKWHLG     Y+P++RGF+ + G
Sbjct:   106 FAVGSVG-GLPLNETTFAQLLHDEGYYTAMIGKWHLGH-NGSYSPVHRGFDYYLG 158

 Score = 124 (48.7 bits), Expect = 6.4e-20, Sum P(2) = 6.4e-20
 Identities = 43/147 (29%), Positives = 68/147 (46%)

Query:   294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
             N F  +T  +   Y A +  +D  VG ++ AL  +  LEN++I F  DNG P  +    +
Sbjct:   258 NTFLNVTTED--PYTASLSDMDSLVGNIMQALITE-QLENTLIWFTGDNG-PWEQKCLFA 313

Query:   354 NY-----RNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
              +       W +N      K T WEGG +VP ++ W  +I+ N      +  + D  PT+
Sbjct:   314 GHVGPFVGRWQTNKGGGSAKRTTWEGGHRVPTVVSWPKKIKHNSSSDALLSGL-DIFPTV 372

Query:   408 YTAAGGDTSRLPLNIDGLDQWSSLLLN 434
              + AG          DG+D  + +LLN
Sbjct:   373 LSLAGVKPPS-DRRYDGIDI-TDVLLN 397


>UNIPROTKB|F1NWF7 [details] [associations]
            symbol:ARSA "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
            sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:GFDENTI
            EMBL:AADN02075680 EMBL:AADN02075681 IPI:IPI00584710
            Ensembl:ENSGALT00000015860 Uniprot:F1NWF7
        Length = 493

 Score = 163 (62.4 bits), Expect = 1.1e-19, Sum P(3) = 1.1e-19
 Identities = 38/114 (33%), Positives = 60/114 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCT-PSRASLMTGKYPIHTGMQGPP 150
             G+ DL  +G     TPN+  LA              C  P RA+L+TG++ + +G+    
Sbjct:    31 GFGDLGSYGHPSSATPNLSCLA-------RAAPYECCPYPCRAALLTGRFQMRSGIYPGV 83

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR-EYTPLYRGFESHF 203
              +     G+PL+E  + E L+  GY+T  +GKWHLG   R  + P+++GF+ HF
Sbjct:    84 FYPGSRGGLPLSEVTIAEVLKAKGYATAIVGKWHLGLGARGSFLPIHQGFD-HF 136

 Score = 140 (54.3 bits), Expect = 1.1e-19, Sum P(3) = 1.1e-19
 Identities = 45/141 (31%), Positives = 70/141 (49%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             +Y     R  +   + + D SVG ++ ALQ  G+   + + F SDNG P+     T    
Sbjct:   229 EYAGRSQRGPFGDALSEFDGSVGQLLQALQENGLENTTFVFFTSDNG-PS-----TMRMA 282

Query:   357 NWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
               GS+   +  K T +EGG++ PA+  W  +I   P V+ ++    D LPTL   AG   
Sbjct:   283 RGGSSGLLKCGKGTTYEGGMREPAVAYWPGRIA--PGVTHELASTLDILPTLTALAG--- 337

Query:   416 SRLP-LNIDGLDQWSSLLLNT 435
             + LP +++DG D  S LL  +
Sbjct:   338 AALPNVSLDGYDL-SPLLFES 357

 Score = 52 (23.4 bits), Expect = 1.1e-19, Sum P(3) = 1.1e-19
 Identities = 21/71 (29%), Positives = 34/71 (47%)

Query:   545 QATIHCGANP--APMTPSPCTNG-PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
             Q + H    P  A    +P T+  P  LF+L +DP E  N+  S      +++++LK  +
Sbjct:   393 QGSFHSDTTPDQACHGLTPLTHHEPPLLFDLESDPAENYNLLQSTA--GPEVWQVLKDIK 450

Query:   602 --RTLVPQSHE 610
               +TL  Q  E
Sbjct:   451 LQKTLFEQRME 461


>UNIPROTKB|I3LM95 [details] [associations]
            symbol:ARSD "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000027390
            OMA:INIGGHE Uniprot:I3LM95
        Length = 580

 Score = 146 (56.5 bits), Expect = 1.1e-18, Sum P(3) = 1.1e-18
 Identities = 39/120 (32%), Positives = 59/120 (49%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGM---Q 147
             G  DL  +G++ +  P +      G  L+  + A PVCTPSRA+ +TG++ + +G     
Sbjct:    79 GIGDLGCYGNDTLRYPGLGLRVGAGTRLSAXLAAAPVCTPSRAAFLTGRHALRSGRWKGD 138

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFE 200
             G  +  W     G+P  E      L+  GY+T  IGKWH G     R ++   PL  GF+
Sbjct:   139 GYRVLRWNGGSGGLPQNETTFARILQRQGYATGLIGKWHQGVNCESRTDHCHHPLNHGFD 198

 Score = 129 (50.5 bits), Expect = 1.1e-18, Sum P(3) = 1.1e-18
 Identities = 39/137 (28%), Positives = 60/137 (43%)

Query:   289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             P  T  +FQ         Y   V+++D  VG +++A++  G+   ++  F SD+G     
Sbjct:   292 PLMTTKEFQ--GKSQHGLYGDNVEEMDGLVGDILNAIEEHGLKNTTLTYFTSDHGGHLEA 349

Query:   349 YRETSNYRNWGSNYPYRGVKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
                      W  N  YRG K    WEGG++VP I   P +    RV  +   + D  PT+
Sbjct:   350 IDGHVQLGGW--NGIYRGGKGMGGWEGGIRVPGIFRWPGVLPAGRVIQEPTSLMDVFPTV 407

Query:   408 YTAAGGDTSRLPLNIDG 424
                 GG   +  + IDG
Sbjct:   408 VQLGGGQVPQDRV-IDG 423

 Score = 75 (31.5 bits), Expect = 1.1e-18, Sum P(3) = 1.1e-18
 Identities = 24/76 (31%), Positives = 35/76 (46%)

Query:   559 PSPCTNGPCYLFNLGNDPCEQNNIA-SSRP---DISSQLYELLKYHRRTLVPQSHEQPDL 614
             P    + P  LF+L  DP E   +A  S P    + +++ + ++ HRRTL P     P L
Sbjct:   492 PGVTHHDPPLLFDLSGDPSEAQPLAPGSEPLYGAVLARVEQAVREHRRTLSPVP---PQL 548

Query:   615 VQADPKRFNDTWSPWI 630
                 P R    W PW+
Sbjct:   549 ---SPGRI--AWKPWL 559


>ASPGD|ASPL0000001694 [details] [associations]
            symbol:AN6847 species:162425 "Emericella nidulans"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:BN001301 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:IEWTNIS
            ProteinModelPortal:C8V2I8 EnsemblFungi:CADANIAT00007645
            Uniprot:C8V2I8
        Length = 616

 Score = 113 (44.8 bits), Expect = 3.2e-18, Sum P(5) = 3.2e-18
 Identities = 22/47 (46%), Positives = 32/47 (68%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
             G++D+  +GS EI TPNID LA  G+   + +A   C+P+RA +MTG
Sbjct:    18 GFSDIGCYGS-EIRTPNIDKLAQKGVRFTDFHAAAACSPTRAMIMTG 63

 Score = 109 (43.4 bits), Expect = 3.2e-18, Sum P(5) = 3.2e-18
 Identities = 30/76 (39%), Positives = 38/76 (50%)

Query:   145 GMQGPPIWGAEPRGVP-----LTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
             G +G     A  RG+P     L ER   LPE LR+ GY T   GKWHLG    E +P  R
Sbjct:    85 GPKGSSTDTAPQRGMPGYEGYLNERVVALPEILRDAGYHTLMSGKWHLGL-TPERSPYKR 143

Query:   198 GFESHFGYLNGVISYY 213
             GF+    +L    ++Y
Sbjct:   144 GFDRSLAHLPACSNHY 159

 Score = 93 (37.8 bits), Expect = 3.2e-18, Sum P(5) = 3.2e-18
 Identities = 18/43 (41%), Positives = 27/43 (62%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
             +A MV+ +D +VG ++  L   G L+N+ + FMSDNGA    Y
Sbjct:   311 FAGMVECIDANVGKIVDYLDSIGELDNTFVCFMSDNGAEGAAY 353

 Score = 65 (27.9 bits), Expect = 3.2e-18, Sum P(5) = 3.2e-18
 Identities = 21/76 (27%), Positives = 37/76 (48%)

Query:   561 PCTNGP--CYLFNLGNDPCEQNNIASSRPDISSQLYELL-KYHRRT-LVPQSHEQPDLVQ 616
             P   GP    L+NL  DP E N++A   P+   +L +L  +Y   T ++P + +  D ++
Sbjct:   513 PKPKGPERWQLYNLVEDPGEINDLAEKYPERLQKLLKLWDQYVLETGVIPLNPDLGDFLE 572

Query:   617 ADPKRFNDT-WSPWIY 631
             A   +  +  W  + Y
Sbjct:   573 ATEAQMTENAWMEYDY 588

 Score = 46 (21.3 bits), Expect = 3.2e-18, Sum P(5) = 3.2e-18
 Identities = 16/56 (28%), Positives = 25/56 (44%)

Query:   363 PYRGVKNTLWEGGVKVPAILWSPQIQQ------NPRVSLQMMHISDWLPTLYTAAG 412
             P R  K    EGGV+VP +   P   +      N  ++ Q   + D  P++ + AG
Sbjct:   398 PSRLYKAYTTEGGVRVPFLARFPSSTKTAPQASNGAITDQFATVMDLAPSILSMAG 453

 Score = 37 (18.1 bits), Expect = 3.9e-11, Sum P(5) = 3.9e-11
 Identities = 9/23 (39%), Positives = 11/23 (47%)

Query:   197 RGFESHFGYLNGVISYYDHILSD 219
             RG   + GYLN  +     IL D
Sbjct:    97 RGMPGYEGYLNERVVALPEILRD 119


>UNIPROTKB|Q96EG1 [details] [associations]
            symbol:ARSG "Arylsulfatase G" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IDA;TAS] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0006790 "sulfur compound metabolic process" evidence=IDA]
            [GO:0005783 "endoplasmic reticulum" evidence=IDA] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005615
            GO:GO:0044281 GO:GO:0046872 GO:GO:0006644 GO:GO:0005764
            GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0006687 CTD:22901 KO:K12381 OMA:LPQDRHF OrthoDB:EOG4J9MZJ
            GO:GO:0006790 EMBL:AB023218 EMBL:AY358380 EMBL:BC012375
            IPI:IPI00402293 RefSeq:NP_001254656.1 RefSeq:NP_055775.2
            UniGene:Hs.437249 ProteinModelPortal:Q96EG1 SMR:Q96EG1
            STRING:Q96EG1 DMDM:74731559 PaxDb:Q96EG1 PeptideAtlas:Q96EG1
            PRIDE:Q96EG1 Ensembl:ENST00000448504 Ensembl:ENST00000570630
            GeneID:22901 KEGG:hsa:22901 UCSC:uc002jhc.2 GeneCards:GC17P066255
            HGNC:HGNC:24102 HPA:HPA023245 HPA:HPA023285 MIM:610008
            neXtProt:NX_Q96EG1 PharmGKB:PA143485307 InParanoid:Q96EG1
            PhylomeDB:Q96EG1 SABIO-RK:Q96EG1 GenomeRNAi:22901 NextBio:43535
            ArrayExpress:Q96EG1 Bgee:Q96EG1 CleanEx:HS_ARSG
            Genevestigator:Q96EG1 GermOnline:ENSG00000141337 Uniprot:Q96EG1
        Length = 525

 Score = 194 (73.4 bits), Expect = 1.2e-17, Sum P(2) = 1.2e-17
 Identities = 44/114 (38%), Positives = 62/114 (54%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D +A  G+   + +A    C+PSRASL+TG+  +  G+    
Sbjct:    47 GWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTRN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                    G+PL E  L E L++ GY T  IGKWHLG     Y P +RGF+ +FG
Sbjct:   106 FAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGH-HGSYHPNFRGFDYYFG 158

 Score = 99 (39.9 bits), Expect = 1.2e-17, Sum P(2) = 1.2e-17
 Identities = 34/129 (26%), Positives = 55/129 (42%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN-----W 358
             R  Y A + ++D  VG +   +    + EN+ + F  DNG P  +  E +         W
Sbjct:   266 RSLYGAGLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNG-PWAQKCELAGSVGPFTGFW 323

Query:   359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
              +       K T WEGG +VPA+  W  ++  N   S  ++ + D  PT+   A     +
Sbjct:   324 QTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNV-TSTALLSVLDIFPTVVALAQASLPQ 382

Query:   418 LPLNIDGLD 426
                  DG+D
Sbjct:   383 -GRRFDGVD 390


>UNIPROTKB|P77318 [details] [associations]
            symbol:ydeN "putative sulfatase" species:83333 "Escherichia
            coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
            GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 PIR:E64903 RefSeq:NP_416015.2
            RefSeq:YP_489763.1 ProteinModelPortal:P77318 SMR:P77318
            DIP:DIP-11682N IntAct:P77318 PhosSite:P0810453 PRIDE:P77318
            EnsemblBacteria:EBESCT00000001979 EnsemblBacteria:EBESCT00000015602
            GeneID:12931856 GeneID:945957 KEGG:ecj:Y75_p1474 KEGG:eco:b1498
            PATRIC:32118290 EchoBASE:EB3557 EcoGene:EG13796 KO:K01138
            OMA:PVINRCA ProtClustDB:CLSK880035 BioCyc:EcoCyc:G6788-MONOMER
            BioCyc:ECOL316407:JW5243-MONOMER Genevestigator:P77318
            Uniprot:P77318
        Length = 560

 Score = 240 (89.5 bits), Expect = 4.9e-17, P = 4.9e-17
 Identities = 91/333 (27%), Positives = 145/333 (43%)

Query:   106 TPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
             TP + +L   G+   N Y A  V  PSRA++MTG+ P   G+       A+  G+PLTE 
Sbjct:   109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQD-GIPLTET 165

Query:   165 FLPEYLRELGYSTKAIGKWHLGFFRREYTP---LYRGFESHFGYLNGVISY-----YDHI 216
             FLPE  +  GY T A+GKWHL        P     R +  +F   +          +D+ 
Sbjct:   166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225

Query:   217 LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPXXXXXXXX 275
             +    + T   N   + +N        G Y +D  T EA+ +++  + +D+P        
Sbjct:   226 MGFHAAGTAYYNSPSLFKNRERV-PAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283

Query:   276 XXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSI 335
                        +  Q+  N      D     Y A V  +D  V  ++  L++ G  +N+I
Sbjct:   284 APHLPNDNPAPDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRILEQLKKNGQYDNTI 339

Query:   336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW-SPQIQQNPRVS 394
             I+F SDNGA  ++     N    G+    +G K+  + GG   P  +W   ++Q  P   
Sbjct:   340 ILFTSDNGA-VIDGPLPLN----GAQ---KGYKSQTYPGGTHTPMFMWWKGKLQ--PGNY 389

Query:   395 LQMMHISDWLPTLYTAAGGDTSRLP--LNIDGL 425
              +++   D+ PT   AA  D S +P  L +DG+
Sbjct:   390 DKLISAMDFYPTALDAA--DIS-IPKDLKLDGV 419


>UNIPROTKB|F1PYB3 [details] [associations]
            symbol:ARSE "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026107
            Ensembl:ENSCAFT00000017722 Uniprot:F1PYB3
        Length = 253

 Score = 218 (81.8 bits), Expect = 5.0e-17, P = 5.0e-17
 Identities = 48/126 (38%), Positives = 72/126 (57%)

Query:    91 YGWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ-- 147
             +G  D+  +G+N I TPNID LA +G++L  ++ A  VCTPSRA+ +TG+YP+ +G+   
Sbjct:    16 FGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPLRSGLSSL 75

Query:   148 --GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRG 198
               G  +  W     G+P  E    + L++ GY+T  IGKWHLG          + PL  G
Sbjct:    76 INGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHG 135

Query:   199 FESHFG 204
             F+  +G
Sbjct:   136 FDHFYG 141


>TIGR_CMR|CPS_2381 [details] [associations]
            symbol:CPS_2381 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000014304 RefSeq:YP_269099.1
            ProteinModelPortal:Q482B9 STRING:Q482B9 GeneID:3523329
            KEGG:cps:CPS_2381 PATRIC:21467845 OMA:VAPKKYF
            ProtClustDB:CLSK494238 BioCyc:CPSY167879:GI48-2444-MONOMER
            Uniprot:Q482B9
        Length = 511

 Score = 155 (59.6 bits), Expect = 6.9e-17, Sum P(2) = 6.9e-17
 Identities = 41/99 (41%), Positives = 56/99 (56%)

Query:    94 NDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM--QGPP 150
             NDL  +G + + +PNIDALA  GI  +  Y+Q P+CTPSR+S MTG YP  TG+   G  
Sbjct:    53 NDLGAYGHHLVKSPNIDALAKKGIRFDKAYSQSPMCTPSRSSFMTGLYPDQTGIIAHGSH 112

Query:   151 I-WGAEPRG-VPLTERFLPEYLRELGYSTKAIGK-WHLG 186
                 A  R  +P     LP+  +  GY +  +GK +H G
Sbjct:   113 TQMTAHFREHIPKVTT-LPQLFKNNGYFSGRVGKIYHQG 150

 Score = 133 (51.9 bits), Expect = 6.9e-17, Sum P(2) = 6.9e-17
 Identities = 37/121 (30%), Positives = 63/121 (52%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T+NQ + I     + Y A V  +D  VG V+ AL+++ + +N+I++F+SD+G     Y E
Sbjct:   293 TLNQRKQII----QGYYAAVSYVDAQVGRVLDALKQQDLSDNTIVVFLSDHG-----Y-E 342

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                +  W         K +L+EG  + P I+++P ++ N RV    + + D  PTL    
Sbjct:   343 LGQHGLWQ--------KGSLFEGSARAPLIIYAPNVKDNGRVVTSPVELVDIYPTLAKLT 394

Query:   412 G 412
             G
Sbjct:   395 G 395


>UNIPROTKB|C9J5G7 [details] [associations]
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 IPI:IPI00640709 ProteinModelPortal:C9J5G7 SMR:C9J5G7
            STRING:C9J5G7 Ensembl:ENST00000438544 HOGENOM:HOG000213821
            ArrayExpress:C9J5G7 Bgee:C9J5G7 Uniprot:C9J5G7
        Length = 178

 Score = 216 (81.1 bits), Expect = 8.3e-17, P = 8.3e-17
 Identities = 48/124 (38%), Positives = 71/124 (57%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
             G  D+  +G+N + TPNID LA +G+ L  ++ A  +CTPSRA+ +TG+YP+ +GM    
Sbjct:    49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108

Query:   148 GPPI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE-----YTPLYRGFE 200
             G  +  W     G+P  E    + L+E GY+T  IGKWHLG          + PL+ GF+
Sbjct:   109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168

Query:   201 SHFG 204
               +G
Sbjct:   169 HFYG 172


>UNIPROTKB|F1NFL4 [details] [associations]
            symbol:F1NFL4 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:RWNDWKA EMBL:AADN02017596
            IPI:IPI00586912 Ensembl:ENSGALT00000026882 Uniprot:F1NFL4
        Length = 374

 Score = 183 (69.5 bits), Expect = 1.4e-16, Sum P(2) = 1.4e-16
 Identities = 43/105 (40%), Positives = 58/105 (55%)

Query:   106 TPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
             TP+ID LA  G+ L  ++ A  VCTPSRA+ +TG+YPI +  +    W     G+P  E 
Sbjct:    31 TPHIDGLAKEGVRLTQHIAAAAVCTPSRAAFLTGRYPIRSERR-ILFWNGCSGGLPPNET 89

Query:   165 FLPEYLRELGYSTKAIGKWHLGF---FRREYT--PLYRGFESHFG 204
                  L + GYST  +GKWHLG      R++   PL  GFE  +G
Sbjct:    90 TFARVLHQQGYSTALVGKWHLGVNCKSHRDHCHHPLNHGFEYFYG 134

 Score = 94 (38.1 bits), Expect = 1.4e-16, Sum P(2) = 1.4e-16
 Identities = 30/110 (27%), Positives = 52/110 (47%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG ++  + ++G+   + I F SD       ++E  N  N  + Y  + 
Sbjct:   233 YGDNVEEMDWMVGRLLDVIDKEGLKNTTFIYFASD-------HKE--NLTNCPNVYTSKF 283

Query:   367 VKNTL--WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
                 +  WEGG++VP I+ W   +     +S +   I D  PT+   AGG
Sbjct:   284 SSEIMGGWEGGIRVPGIVRWPGALPAGIVIS-EPTSIMDIFPTVVHLAGG 332


>UNIPROTKB|I3LUP9 [details] [associations]
            symbol:ARSA "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016021 "integral to membrane" evidence=IEA]
            [GO:0007339 "binding of sperm to zona pellucida" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0005509 "calcium
            ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
            GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000031954
            OMA:GFDENTI Uniprot:I3LUP9
        Length = 486

 Score = 128 (50.1 bits), Expect = 3.4e-16, Sum P(3) = 3.4e-16
 Identities = 47/134 (35%), Positives = 69/134 (51%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  +   + +LD +VG +++A+   G+L  +++IF +DNG P     ET    + G    
Sbjct:   227 RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNG-P-----ETMRMSHGGCXC- 279

Query:   364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LN 421
               G K T +EGGV+ PA+  W   I   P  S  ++   D LPTL   AG     LP + 
Sbjct:   280 --G-KGTTFEGGVREPALAFWPGHIA--PGQSSGLLSSLDLLPTLAALAGAP---LPNVT 331

Query:   422 IDGLDQWSSLLLNT 435
             +DG+D  S LLL T
Sbjct:   332 LDGVDL-SPLLLGT 344

 Score = 123 (48.4 bits), Expect = 3.4e-16, Sum P(3) = 3.4e-16
 Identities = 25/57 (43%), Positives = 35/57 (61%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGM 146
             G+ DL  +G     TPN+D LA  G+   + Y  PV  CTPSRA+L+TG+ P+  G+
Sbjct:    32 GYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLCTPSRAALLTGRLPVRMGL 87

 Score = 73 (30.8 bits), Expect = 3.4e-16, Sum P(3) = 3.4e-16
 Identities = 19/57 (33%), Positives = 31/57 (54%)

Query:   545 QATIHCG--ANPAPMTPSPCT-NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             Q +IH    A+PA    SP T + P  LF+L  DP E  N+     +++ ++ ++LK
Sbjct:   380 QGSIHSDTTADPACHASSPLTAHEPPLLFDLSEDPGENYNLLGGVAEVAPEVLQVLK 436


>UNIPROTKB|P95059 [details] [associations]
            symbol:atsA "POSSIBLE ARYLSULFATASE ATSA (ARYL-SULFATE
            SULPHOHYDROLASE) (ARYLSULPHATASE)" species:1773 "Mycobacterium
            tuberculosis" [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0010033 "response to organic substance" evidence=IEP]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005886 GenomeReviews:AL123456_GR GO:GO:0010033
            EMBL:BX842574 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 HSSP:P15289 KO:K01130
            HOGENOM:HOG000042725 EMBL:AL123456 PIR:B70643 RefSeq:NP_215225.1
            RefSeq:YP_006514055.1 ProteinModelPortal:P95059 SMR:P95059
            PRIDE:P95059 EnsemblBacteria:EBMYCT00000001675 GeneID:13318600
            GeneID:888394 KEGG:mtu:Rv0711 KEGG:mtv:RVBD_0711 PATRIC:18150088
            TubercuList:Rv0711 OMA:FAGFLEH ProtClustDB:CLSK790691
            Uniprot:P95059
        Length = 787

 Score = 160 (61.4 bits), Expect = 5.3e-15, Sum P(4) = 5.3e-15
 Identities = 49/171 (28%), Positives = 77/171 (45%)

Query:    69 TDPNRRTYAAXXXXXXXXXXIVYGWNDLS------FHGSNEIPTPNIDALAYNGIILNNM 122
             ++P+   YAA          +   W+D+       F G  E+P   +  +A  G+ L+  
Sbjct:    20 SEPDWGPYAAPVAPEHSPNILYLVWDDVGIATWDCFGGLVEMPA--MTRVAERGVRLSQF 77

Query:   123 YAQPVCTPSRASLMTGKYPIHTGMQG-PPIWGAEPR--G-VPLTERFLPEYLRELGYSTK 178
             +   +C+P+RASL+TG+     GM          P   G +P     LPE L E GY+T 
Sbjct:    78 HTTALCSPTRASLLTGRNATTVGMATIEEFTDGFPNCNGRIPADTALLPEVLAEHGYNTY 137

Query:   179 AIGKWHLGFFR-------REYTPLYRGFESHFGYLNGVIS-YYDHILSDQY 221
              +GKWHL           + + P  RGFE  +G+L G    +Y  ++ D +
Sbjct:   138 CVGKWHLTPLEESNMASTKRHWPTSRGFERFYGFLGGETDQWYPDLVYDNH 188

 Score = 86 (35.3 bits), Expect = 5.3e-15, Sum P(4) = 5.3e-15
 Identities = 17/62 (27%), Positives = 34/62 (54%)

Query:   290 QETINQFQYITDPNRRTYAAMVKKL-------DDSVGTVISALQRKGMLENSIIIFMSDN 342
             Q+T+  +  ++D  ++ +  M +         D  +G ++  L+  G L+N+II+ +SDN
Sbjct:   300 QDTVRPWDSLSDEEKKLFCRMAEVFAGFLSYTDAQIGRILDYLEESGQLDNTIIVVISDN 359

Query:   343 GA 344
             GA
Sbjct:   360 GA 361

 Score = 69 (29.3 bits), Expect = 5.3e-15, Sum P(4) = 5.3e-15
 Identities = 18/54 (33%), Positives = 27/54 (50%)

Query:   361 NYPYRGVKN-TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             N PY+  K     EGG+  PAI+ W   I  +  +    +++SD  PT+Y   G
Sbjct:   412 NTPYKLFKRYASHEGGIADPAIISWPNGIAAHGEIRDNYVNVSDITPTVYDLLG 465

 Score = 43 (20.2 bits), Expect = 5.3e-15, Sum P(4) = 5.3e-15
 Identities = 15/43 (34%), Positives = 22/43 (51%)

Query:   568 YLFN-LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
             Y++N LG     Q  ++SS P  S +    ++Y R   VP SH
Sbjct:   662 YVYNFLGE---RQQLVSSSGPVPSGRHLLGVRYLRTGTVPNSH 701

 Score = 37 (18.1 bits), Expect = 2.1e-14, Sum P(4) = 2.1e-14
 Identities = 8/28 (28%), Positives = 16/28 (57%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYEL 596
             LF++  D  + +++A+  PD   +L  L
Sbjct:   539 LFHIAADRSQCHDLAAEHPDKLEELKAL 566


>TIGR_CMR|CPS_0841 [details] [associations]
            symbol:CPS_0841 "arylsulfatase" species:167879 "Colwellia
            psychrerythraea 34H" [GO:0004065 "arylsulfatase activity"
            evidence=ISS] [GO:0006790 "sulfur compound metabolic process"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
            RefSeq:YP_267590.1 ProteinModelPortal:Q488C5 STRING:Q488C5
            GeneID:3522242 KEGG:cps:CPS_0841 PATRIC:21464977 OMA:SSRIMEV
            BioCyc:CPSY167879:GI48-927-MONOMER Uniprot:Q488C5
        Length = 584

 Score = 162 (62.1 bits), Expect = 7.5e-15, Sum P(4) = 7.5e-15
 Identities = 57/194 (29%), Positives = 91/194 (46%)

Query:    93 WNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIW 152
             + D+  +GS E+ TPN++ +A  GI   N +  PVC+ +R+ L TG   I  G+ G   +
Sbjct:    49 FGDIGAYGS-EVHTPNMNEIANAGIRFTNFHVSPVCSVTRSMLFTGNDNIEVGL-GSFDY 106

Query:   153 GAEP--RGVPLTERFLP-------EYLRELGYSTKAIGKWHLGFFRREYT-PLYRGFESH 202
                P  RG    E +L        E L + GY     GKWHLG        PL  GF   
Sbjct:   107 SVYPATRGKKGYEGYLTKDAVTISELLNDDGYEVYKSGKWHLGGEESGGKGPLEWGFTKE 166

Query:   203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--------YATDLFTKE 254
             FG L+G  ++++ +      +  + NG +++R  +  W   GE        Y+ +++T +
Sbjct:   167 FGILSGGSNHWNDLAMTPNFK--DPNGLNVKRKEN--WTLNGEPYDRPEGVYSGEIYTNQ 222

Query:   255 AVQLI-EDQPVDKP 267
              ++ I E    DKP
Sbjct:   223 MLEFIKEGAKNDKP 236

 Score = 105 (42.0 bits), Expect = 7.5e-15, Sum P(4) = 7.5e-15
 Identities = 20/60 (33%), Positives = 37/60 (61%)

Query:   306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR--ETSN--YRNWGSN 361
             TYAAM++  D+ +G ++  L+  G L+N+++++M+DNG   +E    +T N  +  W  N
Sbjct:   319 TYAAMIEDQDNRIGQILDYLRESGQLDNTLVVYMTDNGPEGLEPTNPKTGNPEFAKWIEN 378

 Score = 43 (20.2 bits), Expect = 7.5e-15, Sum P(4) = 7.5e-15
 Identities = 7/25 (28%), Positives = 15/25 (60%)

Query:   564 NGPCYLFNLGNDPCEQNNIASSRPD 588
             +G  +L+N+ +DP E + +    P+
Sbjct:   527 DGQWHLYNVVSDPSESHPLEHKNPE 551

 Score = 39 (18.8 bits), Expect = 7.5e-15, Sum P(4) = 7.5e-15
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query:   357 NWGSNYPYRGVKNTLW---EGGVKVPAILWSP 385
             +W +N    G++   W   EGG++VP ++  P
Sbjct:   399 SW-ANSATGGLQWWKWFVGEGGIRVPLMIVPP 429


>UNIPROTKB|Q482D2 [details] [associations]
            symbol:CPS_2368 "Putative N-acetylglucosamine-6-sulfatase"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
            "metabolic process" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
            ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
            KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
            ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
            Uniprot:Q482D2
        Length = 537

 Score = 177 (67.4 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 56/167 (33%), Positives = 79/167 (47%)

Query:   104 IPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLT 162
             I TPN+D LA  G+   N +    +C+PSRA+++TG+Y +H         G      P  
Sbjct:    58 IDTPNMDKLAAGGVYFKNAFVTTALCSPSRATILTGQY-MHNH-------GVVDNNNPAK 109

Query:   163 ER--FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQ 220
             E   + P YL+E+GY T   GKWH+G       P   GF+ H+    G   YY     D+
Sbjct:   110 ESSVYFPSYLQEVGYETSFFGKWHMGGHGDSPQP---GFD-HWLSFAGQGHYYPK--KDK 163

Query:   221 YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
               RT ++N +  R       D  G Y TD  T  AV  ++ +  DKP
Sbjct:   164 KGRTNKININGERV------DQKG-YITDELTDYAVDWLDKRDSDKP 203

 Score = 72 (30.4 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 26/109 (23%), Positives = 48/109 (44%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             +R Y   +  +DDS+G V+  L+   +  ++I++ M DNG    E+              
Sbjct:   276 KRQYHRALSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID---------- 325

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K   +E  ++VP + ++P   +   V  +M+   D  PT+   AG
Sbjct:   326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAG 370

 Score = 49 (22.3 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 19/71 (26%), Positives = 36/71 (50%)

Query:   569 LFNLGNDPCEQNNIASS---RPDISSQLYEL--LKYHRR--TLVPQSHEQ-PDLVQADPK 620
             L++L NDP E NN+ ++   +P I+   ++L  L  +++   ++P + +  P  V  +  
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQPLIAQMRHDLFNLLVNKKGDNVIPYTEKYTPGAVYRERD 495

Query:   621 R-----FNDTW 626
             R     F D W
Sbjct:   496 RGETADFPDNW 506

 Score = 46 (21.3 bits), Expect = 1.6e-13, Sum P(3) = 1.6e-13
 Identities = 28/117 (23%), Positives = 47/117 (40%)

Query:   422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR--RNSVL------INIDEKKRT 473
             +  LD   ++L    +++ ++ DG D W  L  N      R + L       N      T
Sbjct:   356 VANLDIAPTILEIAGAKKPAHFDG-DSWLPLAKNKEVNQWRENFLYEYYWEFNYPSTPTT 414

Query:   474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIF 530
              A+R D++KL+   Q +G  D        N    +N N I   K    + Q+  ++F
Sbjct:   415 FALRTDNYKLI---QYHGVWDTEELYDLKNDPKEMN-NLINTPKHQPLIAQMRHDLF 467

 Score = 40 (19.1 bits), Expect = 6.5e-13, Sum P(3) = 6.5e-13
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query:   564 NGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
             N P  + NL N P  Q  IA  R D+ + L
Sbjct:   441 NDPKEMNNLINTPKHQPLIAQMRHDLFNLL 470


>TIGR_CMR|CPS_2368 [details] [associations]
            symbol:CPS_2368 "putative N-acetylglucosamine-6-sulfatase"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
            "metabolic process" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
            ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
            KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
            ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
            Uniprot:Q482D2
        Length = 537

 Score = 177 (67.4 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 56/167 (33%), Positives = 79/167 (47%)

Query:   104 IPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLT 162
             I TPN+D LA  G+   N +    +C+PSRA+++TG+Y +H         G      P  
Sbjct:    58 IDTPNMDKLAAGGVYFKNAFVTTALCSPSRATILTGQY-MHNH-------GVVDNNNPAK 109

Query:   163 ER--FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQ 220
             E   + P YL+E+GY T   GKWH+G       P   GF+ H+    G   YY     D+
Sbjct:   110 ESSVYFPSYLQEVGYETSFFGKWHMGGHGDSPQP---GFD-HWLSFAGQGHYYPK--KDK 163

Query:   221 YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
               RT ++N +  R       D  G Y TD  T  AV  ++ +  DKP
Sbjct:   164 KGRTNKININGERV------DQKG-YITDELTDYAVDWLDKRDSDKP 203

 Score = 72 (30.4 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 26/109 (23%), Positives = 48/109 (44%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             +R Y   +  +DDS+G V+  L+   +  ++I++ M DNG    E+              
Sbjct:   276 KRQYHRALSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID---------- 325

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K   +E  ++VP + ++P   +   V  +M+   D  PT+   AG
Sbjct:   326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAG 370

 Score = 49 (22.3 bits), Expect = 7.9e-14, Sum P(3) = 7.9e-14
 Identities = 19/71 (26%), Positives = 36/71 (50%)

Query:   569 LFNLGNDPCEQNNIASS---RPDISSQLYEL--LKYHRR--TLVPQSHEQ-PDLVQADPK 620
             L++L NDP E NN+ ++   +P I+   ++L  L  +++   ++P + +  P  V  +  
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQPLIAQMRHDLFNLLVNKKGDNVIPYTEKYTPGAVYRERD 495

Query:   621 R-----FNDTW 626
             R     F D W
Sbjct:   496 RGETADFPDNW 506

 Score = 46 (21.3 bits), Expect = 1.6e-13, Sum P(3) = 1.6e-13
 Identities = 28/117 (23%), Positives = 47/117 (40%)

Query:   422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR--RNSVL------INIDEKKRT 473
             +  LD   ++L    +++ ++ DG D W  L  N      R + L       N      T
Sbjct:   356 VANLDIAPTILEIAGAKKPAHFDG-DSWLPLAKNKEVNQWRENFLYEYYWEFNYPSTPTT 414

Query:   474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIF 530
              A+R D++KL+   Q +G  D        N    +N N I   K    + Q+  ++F
Sbjct:   415 FALRTDNYKLI---QYHGVWDTEELYDLKNDPKEMN-NLINTPKHQPLIAQMRHDLF 467

 Score = 40 (19.1 bits), Expect = 6.5e-13, Sum P(3) = 6.5e-13
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query:   564 NGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
             N P  + NL N P  Q  IA  R D+ + L
Sbjct:   441 NDPKEMNNLINTPKHQPLIAQMRHDLFNLL 470


>UNIPROTKB|O65931 [details] [associations]
            symbol:atsB "Arylsulfatase" species:83332 "Mycobacterium
            tuberculosis H37Rv" [GO:0005829 "cytosol" evidence=IDA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005829 GenomeReviews:AL123456_GR EMBL:BX842582
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 HSSP:P15289 KO:K01130
            EMBL:CP003248 PIR:E70533 RefSeq:NP_217816.1 RefSeq:YP_006516776.1
            ProteinModelPortal:O65931 PRIDE:O65931
            EnsemblBacteria:EBMYCT00000000058 GeneID:13318122 GeneID:887500
            KEGG:mtu:Rv3299c KEGG:mtv:RVBD_3299c PATRIC:18155953
            TubercuList:Rv3299c HOGENOM:HOG000042725 OMA:EIMGSRA
            ProtClustDB:CLSK792415 InterPro:IPR009200 Pfam:PF06897
            Uniprot:O65931
        Length = 970

 Score = 168 (64.2 bits), Expect = 1.1e-13, Sum P(5) = 1.1e-13
 Identities = 54/149 (36%), Positives = 73/149 (48%)

Query:    91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM---- 146
             +G  D +F G+  I TP +  LA NG+I N  +   VC+P+RA+L+TG+     G     
Sbjct:   224 FGGPD-TFGGA--IRTPTLSRLAQNGLIYNRFHVTAVCSPTRAALLTGRNHHRVGFGSVC 280

Query:   147 --QGP-PIWGA-EPRGVPLTERFLPEYLRELGYSTKAIGKWHL---------GFFRREYT 193
                GP P + A  PR        LP  LR+ GY T A GKWHL         G F  +  
Sbjct:   281 EFPGPYPGYSAVRPRSCAA----LPRILRDNGYVTGAFGKWHLTPDNVQGAAGPF--DNW 334

Query:   194 PLYRGFESHFGYLNGVISYYDHILSDQYS 222
             PL  GF+  +G+ +G    YD I+S   S
Sbjct:   335 PLGWGFDHFWGFPSGAAGQYDPIISQDNS 363

 Score = 75 (31.5 bits), Expect = 1.1e-13, Sum P(5) = 1.1e-13
 Identities = 22/70 (31%), Positives = 34/70 (48%)

Query:   360 SNYPYR-GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
             SN P + G +     GG + P ++ W  +I+ + RV  Q  H  D  PT+  A G     
Sbjct:   580 SNTPLQWGKQMASHLGGTRDPLVVAWPARIRPDGRVRSQFTHCIDIAPTVLAAIGLPE-- 637

Query:   418 LPLNIDGLDQ 427
              P ++DG +Q
Sbjct:   638 -PTHVDGFEQ 646

 Score = 60 (26.2 bits), Expect = 1.1e-13, Sum P(5) = 1.1e-13
 Identities = 13/50 (26%), Positives = 30/50 (60%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFM-SDNGAPTVEYRETSNY 355
             +A   +  D +VG ++ A++  G  +N+++ ++  DNGA ++E   T ++
Sbjct:   486 FAGFSENADWNVGRLLDAIEDLGESDNTLVFYIWGDNGA-SMEGTNTGSF 534

 Score = 42 (19.8 bits), Expect = 1.1e-13, Sum P(5) = 1.1e-13
 Identities = 9/28 (32%), Positives = 17/28 (60%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYEL 596
             L+ L +D  +  N+A+  PD  ++L +L
Sbjct:   720 LYYLPDDFSQAKNLAAEHPDKVAELTQL 747

 Score = 41 (19.5 bits), Expect = 1.1e-13, Sum P(5) = 1.1e-13
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query:   233 RRNLSTAWDTVGEYATDLFTKE 254
             R +L  AWD++ E    LF ++
Sbjct:   461 RPDLFPAWDSMSEAQKRLFARQ 482


>UNIPROTKB|Q2KEF7 [details] [associations]
            symbol:MGCH7_ch7g1079 "Putative uncharacterized protein"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484 EMBL:CM000230
            ProteinModelPortal:Q2KEF7 SMR:Q2KEF7 Uniprot:Q2KEF7
        Length = 480

 Score = 140 (54.3 bits), Expect = 1.7e-13, Sum P(3) = 1.7e-13
 Identities = 46/136 (33%), Positives = 65/136 (47%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGM-QG 148
             G++D S  G  EI TPN+  L    NG +L N +    C+P+R+ L +G      G+ Q 
Sbjct:    20 GYSDTSPFGG-EINTPNLARLVSDGNGRLLTNFHTASACSPTRSMLFSGTDNHIAGLGQM 78

Query:   149 PPIWGAEP---RGVPLTERFL-------PEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
                  A     R  P  E +L        E  ++ GY T   GKWHLG   RE +P  RG
Sbjct:    79 AENMRAHADLYRDKPGYEGYLNFRVAALSEVFQDAGYQTLMTGKWHLGL-TRETSPHARG 137

Query:   199 FESHFGYLNGVISYYD 214
             FE    +L+G  ++Y+
Sbjct:   138 FERSHVFLSGCHNHYN 153

 Score = 110 (43.8 bits), Expect = 1.7e-13, Sum P(3) = 1.7e-13
 Identities = 20/38 (52%), Positives = 29/38 (76%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
             YAAMV  +D ++GTV+  L+  G L+N+ ++FMSDNGA
Sbjct:   299 YAAMVDGMDAAIGTVLDQLEADGELDNTFVLFMSDNGA 336

 Score = 46 (21.3 bits), Expect = 1.7e-13, Sum P(3) = 1.7e-13
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:   363 PYRGVKNTLWEGGVKVPAILWSP 385
             P RG K  +  GG++ P I+  P
Sbjct:   388 PSRGFKTWITGGGIRCPCIVRYP 410


>TIGR_CMR|SPO_A0121 [details] [associations]
            symbol:SPO_A0121 "sulfatase family protein"
            species:246200 "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic
            process" evidence=ISS] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 EMBL:CP000032 GenomeReviews:CP000032_GR
            HOGENOM:HOG000230030 RefSeq:YP_164953.1 ProteinModelPortal:Q5LLA5
            GeneID:3196629 KEGG:sil:SPOA0121 PATRIC:23381566 OMA:FDYLSCY
            ProtClustDB:CLSK867183 Uniprot:Q5LLA5
        Length = 552

 Score = 135 (52.6 bits), Expect = 1.7e-12, Sum P(2) = 1.7e-12
 Identities = 36/93 (38%), Positives = 49/93 (52%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             LS +G   + TPNID LA  G+   N Y Q  VC PSR S  TG+Y + +        G+
Sbjct:    20 LSCYGHERLNTPNIDKLAKRGVRFTNAYVQATVCGPSRMSAYTGRY-VRSH-------GS 71

Query:   155 EPRGVPLT--ERFLPEYLRELGYSTKAIGKWHL 185
                G+PL   E  L ++LR++G     IGK H+
Sbjct:    72 TQNGIPLRVGEPTLGDHLRDVGMRNVLIGKTHM 104

 Score = 113 (44.8 bits), Expect = 1.7e-12, Sum P(2) = 1.7e-12
 Identities = 31/110 (28%), Positives = 56/110 (50%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y  ++K++DD +G + + +Q +G+ EN++I+F +D+G    +Y       +W       G
Sbjct:   294 YMGLIKQIDDQLGQLFAFMQERGLDENTMIVFTADHG----DYLGD----HW------MG 339

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPR---VSLQMMHISDWLPTLYTAAGG 413
              K   +E   KVP I++ P  + +     VS  ++ + D  PT    AGG
Sbjct:   340 EKYLFYEAAAKVPLIIYDPSDKADATRGTVSDALVEMIDLAPTFVDYAGG 389


>UNIPROTKB|F1N665 [details] [associations]
            symbol:ARSG "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:DAAA02049729
            EMBL:DAAA02049730 EMBL:DAAA02049731 EMBL:DAAA02049732
            EMBL:DAAA02049733 IPI:IPI00867152 UniGene:Bt.103824
            ProteinModelPortal:F1N665 Ensembl:ENSBTAT00000014061 OMA:GHARNAF
            Uniprot:F1N665
        Length = 328

 Score = 184 (69.8 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 43/114 (37%), Positives = 60/114 (52%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
             GW DL  + +    T N+D +A  G    + +A    C+PSRA+L+TG+  +  G+    
Sbjct:    47 GWGDLGANWAGTKDTANLDRMAAEGTRFVDFHAAASTCSPSRAALLTGRLGLRNGVTHN- 105

Query:   151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                    G+PL E  L E LR  GY T  IGKWHLG     + P +RGF+ +FG
Sbjct:   106 FAVTSVGGLPLNETTLAEVLRGAGYVTGMIGKWHLGHHGSHH-PNFRGFDYYFG 158

 Score = 48 (22.0 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
             +R Y+A ++++D  VG +   +       N+ + F  DNG
Sbjct:   266 QRLYSAGLREMDHLVGRIKDTVDLVAK-NNTFLWFTGDNG 304


>UNIPROTKB|I3LCI6 [details] [associations]
            symbol:I3LCI6 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:CU469102
            EMBL:AEMK01103856 EMBL:AEMK01167009 Ensembl:ENSSSCT00000031398
            Uniprot:I3LCI6
        Length = 121

 Score = 171 (65.3 bits), Expect = 5.9e-12, P = 5.9e-12
 Identities = 36/90 (40%), Positives = 53/90 (58%)

Query:   359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT-SR 417
             G+N+P RG K +LWEGGV+    +  P +++    + +++HISDWLPTL   AGG T   
Sbjct:    18 GNNWPLRGRKWSLWEGGVRGVGFVAGPLLKRKGVKNRELIHISDWLPTLVKLAGGSTHGT 77

Query:   418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
              PL  DG D W ++   +PS R   +  +D
Sbjct:    78 KPL--DGFDVWKTISEGSPSPRMELLHNID 105


>RGD|1560491 [details] [associations]
            symbol:Ids "iduronate 2-sulfatase" species:10116 "Rattus
            norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISO] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1560491
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            HOGENOM:HOG000014304 HOVERGEN:HBG006120 OrthoDB:EOG49078W
            GO:GO:0004423 EMBL:BN000743 IPI:IPI00764641
            ProteinModelPortal:Q32KJ4 STRING:Q32KJ4 PhosphoSite:Q32KJ4
            InParanoid:Q32KJ4 Genevestigator:Q32KJ4 Uniprot:Q32KJ4
        Length = 543

 Score = 141 (54.7 bits), Expect = 7.8e-12, Sum P(2) = 7.8e-12
 Identities = 51/173 (29%), Positives = 76/173 (43%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + I+  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    44 LGCYGDKLVRSPNIDQLASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 103

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   +P+Y +E GY T ++GK +H G        +       + Y      Y 
Sbjct:   104 VHSGNFST---IPQYFKENGYVTMSVGKVFHPG--------ISSNHSDDYPYSWSFPPY- 151

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLIE 260
              H  S++Y  T    G D  +  NL    D      G       T+EA++L+E
Sbjct:   152 -HPSSEKYENTKTCKGQDGKLHTNLLCPVDVADVPEGTLPDKQSTEEAIRLLE 203

 Score = 100 (40.3 bits), Expect = 7.8e-12, Sum P(2) = 7.8e-12
 Identities = 23/55 (41%), Positives = 33/55 (60%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             R++Y A V  LD  VG ++SAL    +  N+II FMSD+G    E+ E + Y N+
Sbjct:   290 RQSYFASVSYLDTQVGHLLSALDDLRLAHNTIIAFMSDHGWALGEHGEWAKYSNF 344


>UNIPROTKB|P22304 [details] [associations]
            symbol:IDS "Iduronate 2-sulfatase" species:9606 "Homo
            sapiens" [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004423
            "iduronate-2-sulfatase activity" evidence=TAS] [GO:0005975
            "carbohydrate metabolic process" evidence=TAS] [GO:0006027
            "glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=TAS] [GO:0030204
            "chondroitin sulfate metabolic process" evidence=TAS] [GO:0030207
            "chondroitin sulfate catabolic process" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0044281 "small molecule
            metabolic process" evidence=TAS] Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            EMBL:CH471171 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0030207 EMBL:AF011889 EMBL:M58342 EMBL:L13329 EMBL:L13321
            EMBL:L13322 EMBL:L13323 EMBL:L13324 EMBL:L13325 EMBL:L13326
            EMBL:L13327 EMBL:L13328 EMBL:L04586 EMBL:L04578 EMBL:L04579
            EMBL:L04580 EMBL:L04581 EMBL:L04583 EMBL:L04582 EMBL:L04584
            EMBL:L04585 EMBL:L40586 EMBL:AC233288 EMBL:BC006170 IPI:IPI00006121
            IPI:IPI00013771 IPI:IPI00026104 PIR:A47535 RefSeq:NP_000193.1
            RefSeq:NP_006114.1 UniGene:Hs.460960 ProteinModelPortal:P22304
            IntAct:P22304 STRING:P22304 PhosphoSite:P22304 DMDM:124174
            PRIDE:P22304 Ensembl:ENST00000340855 Ensembl:ENST00000370441
            Ensembl:ENST00000370443 Ensembl:ENST00000466323 GeneID:3423
            KEGG:hsa:3423 UCSC:uc004fcw.4 UCSC:uc011mxh.2 CTD:3423
            GeneCards:GC0XM148558 HGNC:HGNC:5389 MIM:300823 MIM:309900
            neXtProt:NX_P22304 Orphanet:217085 Orphanet:217093 PharmGKB:PA29636
            HOGENOM:HOG000014304 HOVERGEN:HBG006120 InParanoid:P22304 KO:K01136
            OMA:CREGKNL OrthoDB:EOG49078W PhylomeDB:P22304
            BioCyc:MetaCyc:HS00286-MONOMER ChiTaRS:IDS GenomeRNAi:3423
            NextBio:13500 PMAP-CutDB:P22304 ArrayExpress:P22304 Bgee:P22304
            CleanEx:HS_IDS Genevestigator:P22304 GermOnline:ENSG00000010404
            GO:GO:0004423 Uniprot:P22304
        Length = 550

 Score = 138 (53.6 bits), Expect = 1.1e-11, Sum P(2) = 1.1e-11
 Identities = 51/173 (29%), Positives = 80/173 (46%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + ++  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    51 LGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 110

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   +P+Y +E GY T ++GK +H G     +T      +S + +      Y 
Sbjct:   111 VHAGNFST---IPQYFKENGYVTMSVGKVFHPGI-SSNHTD-----DSPYSW--SFPPY- 158

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLIE 260
              H  S++Y  T    G D  +  NL    D +    G       T++A+QL+E
Sbjct:   159 -HPSSEKYENTKTCRGPDGELHANLLCPVDVLDVPEGTLPDKQSTEQAIQLLE 210

 Score = 102 (41.0 bits), Expect = 1.1e-11, Sum P(2) = 1.1e-11
 Identities = 37/119 (31%), Positives = 59/119 (49%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSN 361
             R++Y A V  LD  VG ++SAL    +  ++II F SD+G    E+ E + Y N+   ++
Sbjct:   297 RQSYFASVSYLDTQVGRLLSALDDLQLANSTIIAFTSDHGWALGEHGEWAKYSNFDVATH 356

Query:   362 YP---Y-RGVKNTLWEGGVKVPAIL----WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
              P   Y  G   +L E G K+   L     + Q+ +  R S+ ++ +    PTL   AG
Sbjct:   357 VPLIFYVPGRTASLPEAGEKLFPYLDPFDSASQLMEPGRQSMDLVELVSLFPTLAGLAG 415


>UNIPROTKB|F1NFI0 [details] [associations]
            symbol:IDS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 EMBL:AADN02013672 EMBL:AADN02013673
            EMBL:AADN02013674 EMBL:AADN02013675 IPI:IPI00579251
            Ensembl:ENSGALT00000014910 OMA:SELDYAY Uniprot:F1NFI0
        Length = 525

 Score = 144 (55.7 bits), Expect = 2.2e-11, Sum P(2) = 2.2e-11
 Identities = 36/93 (38%), Positives = 51/93 (54%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G N + +PNID LA   I+ +N YAQ  VC PSR S +TG+ P  T +     +  
Sbjct:    16 LGCYGDNLVKSPNIDQLASQSIVFSNAYAQQAVCAPSRVSFLTGRRPDTTRLYDFYSYWR 75

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLG 186
                G   T   +P+Y +E GY T ++GK +H G
Sbjct:    76 VHSGNYST---MPQYFKENGYVTMSVGKVFHPG 105

 Score = 92 (37.4 bits), Expect = 2.2e-11, Sum P(2) = 2.2e-11
 Identities = 19/55 (34%), Positives = 34/55 (61%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             R++Y A V  LD  VG +++AL   G+  ++I++F +D+G    E+ E + Y N+
Sbjct:   262 RQSYYAAVSYLDMQVGLLLNALDYVGLSNSTIVVFTADHGWSLGEHGEWAKYSNF 316


>MGI|MGI:96417 [details] [associations]
            symbol:Ids "iduronate 2-sulfatase" species:10090 "Mus
            musculus" [GO:0003824 "catalytic activity" evidence=IEA]
            [GO:0004423 "iduronate-2-sulfatase activity" evidence=IEA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:96417 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            CTD:3423 HOVERGEN:HBG006120 KO:K01136 OrthoDB:EOG49078W ChiTaRS:IDS
            GO:GO:0004423 EMBL:AK166178 EMBL:BX294168 EMBL:L07921 EMBL:BN000750
            IPI:IPI00125815 PIR:A47153 RefSeq:NP_034628.2 UniGene:Mm.233083
            ProteinModelPortal:Q08890 SMR:Q08890 STRING:Q08890
            PhosphoSite:Q08890 PRIDE:Q08890 DNASU:15931
            Ensembl:ENSMUST00000101509 GeneID:15931 KEGG:mmu:15931
            GeneTree:ENSGT00640000091539 InParanoid:Q32KI7 NextBio:288652
            Bgee:Q08890 CleanEx:MM_IDS Genevestigator:Q08890
            GermOnline:ENSMUSG00000035847 Uniprot:Q08890
        Length = 552

 Score = 139 (54.0 bits), Expect = 2.8e-11, Sum P(2) = 2.8e-11
 Identities = 50/173 (28%), Positives = 76/173 (43%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + ++  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    53 LGCYGDKLVRSPNIDQLASHSVLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 112

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   +P+Y +E GY T ++GK +H G        +       + Y      Y 
Sbjct:   113 VHSGNFST---IPQYFKENGYVTMSVGKVFHPG--------ISSNHSDDYPYSWSFPPY- 160

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLIE 260
              H  S++Y  T    G D  +  NL    D      G       T+EA++L+E
Sbjct:   161 -HPSSEKYENTKTCKGQDGKLHANLLCPVDVADVPEGTLPDKQSTEEAIRLLE 212

 Score = 97 (39.2 bits), Expect = 2.8e-11, Sum P(2) = 2.8e-11
 Identities = 23/55 (41%), Positives = 32/55 (58%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             R++Y A V  LD  VG V+SAL    +  N+II F SD+G    E+ E + Y N+
Sbjct:   299 RQSYFASVSYLDTQVGHVLSALDDLRLAHNTIIAFTSDHGWALGEHGEWAKYSNF 353


>UNIPROTKB|F6PNP7 [details] [associations]
            symbol:IDS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000014304 HOVERGEN:HBG006120 OMA:CREGKNL
            OrthoDB:EOG49078W GeneTree:ENSGT00640000091539 EMBL:AAEX03027034
            Ensembl:ENSCAFT00000030323 Uniprot:F6PNP7
        Length = 468

 Score = 139 (54.0 bits), Expect = 5.5e-11, Sum P(2) = 5.5e-11
 Identities = 51/173 (29%), Positives = 80/173 (46%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + ++  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    49 LGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 108

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   LP+Y +E GY T ++GK +H G     Y+      +S + +   +  Y 
Sbjct:   109 VHAGNFST---LPQYFKENGYVTMSVGKVFHPGI-SSNYSD-----DSPYSW--SIPPY- 156

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLIE 260
              H  S++Y  T    G D  +  NL    D      G       T++A++L+E
Sbjct:   157 -HPSSEKYENTKTCRGPDGELHANLLCPVDIADVPEGTLPDKQSTEQAIRLLE 208

 Score = 92 (37.4 bits), Expect = 5.5e-11, Sum P(2) = 5.5e-11
 Identities = 20/55 (36%), Positives = 33/55 (60%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             R++Y A +  LD  VG ++SAL    +  ++II+F SD+G    E+ E + Y N+
Sbjct:   295 RQSYFASISYLDTQVGHLLSALDDLQLANSTIIVFASDHGWALGEHGEWAKYSNF 349


>TIGR_CMR|SPO_3593 [details] [associations]
            symbol:SPO_3593 "sulfatase family protein" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 HOGENOM:HOG000230030 ProtClustDB:CLSK867183
            RefSeq:YP_168788.1 ProteinModelPortal:Q5LMH0 GeneID:3195684
            KEGG:sil:SPO3593 PATRIC:23380663 OMA:MNILFIM Uniprot:Q5LMH0
        Length = 552

 Score = 132 (51.5 bits), Expect = 1.1e-10, Sum P(3) = 1.1e-10
 Identities = 34/96 (35%), Positives = 50/96 (52%)

Query:    93 WNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPI 151
             W+ LS +G   + TP+ID LA  G+  +  Y Q P+C  SR S  TG+Y +H+       
Sbjct:    13 WDYLSCYGHKTLNTPHIDRLAAKGVRFDRAYIQSPICGSSRMSTYTGRY-VHSH------ 65

Query:   152 WGAEPRGVPLT--ERFLPEYLRELGYSTKAIGKWHL 185
              GA   G+PL   E  + ++LR  G     +GK H+
Sbjct:    66 -GASWNGIPLKVGEMTMGDHLRAAGMGCWLVGKTHM 100

 Score = 85 (35.0 bits), Expect = 1.1e-10, Sum P(3) = 1.1e-10
 Identities = 29/120 (24%), Positives = 55/120 (45%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             Q + D     Y  ++K+ DD +G +   L+  G +++++I+  SD+G    ++       
Sbjct:   282 QEVRDAVIPAYMGLIKQADDQMGRLFKWLEDTGRMQDTMIVLTSDHGDFLGDH------- 334

Query:   357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP-RVSL--QMMHISDWLPTLYTAAGG 413
              W       G K    +   +VP I++ P+ + +  R S+   ++   D  PT   AAGG
Sbjct:   335 -W------MGEKTFFHDASTRVPLIIYDPRPEADATRGSVCDALVESIDLAPTFVEAAGG 387

 Score = 55 (24.4 bits), Expect = 1.1e-10, Sum P(3) = 1.1e-10
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query:   548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
             IH  A+P PM      N P  L +LG DP   + IA
Sbjct:   450 IHFEADPRPML-FDLKNDPQELVDLGGDPAHADVIA 484

 Score = 37 (18.1 bits), Expect = 6.9e-09, Sum P(3) = 6.9e-09
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query:   437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
             +RR S      +   + + T SR+  +++ I ++  T
Sbjct:   494 TRRQSQRTTRSEEQLIAMRTKSRKRGIVLGIYDENET 530


>UNIPROTKB|F1N2D5
            symbol:IDS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 EMBL:DAAA02068060 EMBL:DAAA02068061
            IPI:IPI00709383 Ensembl:ENSBTAT00000014683 OMA:CREGRNL
            Uniprot:F1N2D5
        Length = 546

 Score = 139 (54.0 bits), Expect = 1.8e-10, Sum P(2) = 1.8e-10
 Identities = 52/172 (30%), Positives = 78/172 (45%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G+  I +PNID LA   ++  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    48 LGCYGNKLIRSPNIDQLASRSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 107

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   +P+Y +E GY T ++GK +H G             +S + +   V  Y 
Sbjct:   108 VHAGNFST---IPQYFKENGYVTMSVGKVFHPGISSNHSD------DSPYSW--SVPPY- 155

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLI 259
              H  S++Y  T    G D  +  NL    D V    G       T++A+QL+
Sbjct:   156 -HPSSEKYENTKTCRGPDGELHANLLCPVDVVDVPEGTLPDKQSTEQAIQLL 206

 Score = 89 (36.4 bits), Expect = 1.8e-10, Sum P(2) = 1.8e-10
 Identities = 20/55 (36%), Positives = 32/55 (58%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
             R++Y A V  LD  VG ++SAL    +  ++I+ F SD+G    E+ E + Y N+
Sbjct:   294 RQSYFACVSYLDTQVGRLLSALDDLQLASSTIVAFTSDHGWALGEHGEWAKYSNF 348


>UNIPROTKB|F1S048
            symbol:F1S048 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:FP104542
            Ensembl:ENSSSCT00000018625 OMA:MAPRDFA Uniprot:F1S048
        Length = 142

 Score = 156 (60.0 bits), Expect = 2.4e-10, P = 2.4e-10
 Identities = 29/55 (52%), Positives = 39/55 (70%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM 146
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY  H+G+
Sbjct:    91 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKY--HSGI 142


>UNIPROTKB|P31447
            symbol:yidJ "putative sulfatase" species:83333 "Escherichia
            coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
            GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
            EMBL:L10328 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            OMA:RKDWENT KO:K01138 PIR:G65169 RefSeq:NP_418134.1
            RefSeq:YP_491756.1 ProteinModelPortal:P31447 SMR:P31447
            PRIDE:P31447 EnsemblBacteria:EBESCT00000001975
            EnsemblBacteria:EBESCT00000016174 GeneID:12932459 GeneID:948188
            KEGG:ecj:Y75_p3496 KEGG:eco:b3678 PATRIC:32122847 EchoBASE:EB1656
            EcoGene:EG11705 HOGENOM:HOG000126316 ProtClustDB:CLSK880765
            BioCyc:EcoCyc:EG11705-MONOMER BioCyc:ECOL316407:JW3654-MONOMER
            Genevestigator:P31447 Uniprot:P31447
        Length = 497

 Score = 145 (56.1 bits), Expect = 3.2e-10, Sum P(3) = 3.2e-10
 Identities = 36/93 (38%), Positives = 48/93 (51%)

Query:    94 NDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIW 152
             N +  +    + T NID+LA  GI  N+ Y   PVCTP+RA L TG   I+    GP   
Sbjct:    17 NMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG---IYANQSGPWTN 73

Query:   153 GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
                P G  ++   +  Y ++ GY T  IGKWHL
Sbjct:    74 NVAP-GKNIST--MGRYFKDAGYHTCYIGKWHL 103

 Score = 76 (31.8 bits), Expect = 3.2e-10, Sum P(3) = 3.2e-10
 Identities = 30/105 (28%), Positives = 52/105 (49%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y A    +DD +G VI+AL  +   EN+ +I+ SD+G     ++  S           +G
Sbjct:   251 YFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAHKLIS-----------KG 298

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                 +++   ++P I+ SPQ ++  +V   + HI D LPT+   A
Sbjct:   299 A--AMYDDITRIPLIIRSPQGERR-QVDTPVSHI-DLLPTMMALA 339

 Score = 43 (20.2 bits), Expect = 3.2e-10, Sum P(3) = 3.2e-10
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query:   569 LFNLGNDPCEQNNIASS-R-PDISSQLYE-LLKY 599
             L++  NDP E +N+    R  D+ S++++ LL Y
Sbjct:   401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434


>UNIPROTKB|Q48QH2
            symbol:betC "Choline sulfatase" species:264730 "Pseudomonas
            syringae pv. phaseolicola 1448A" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0030104 "water homeostasis"
            evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000058
            GenomeReviews:CP000058_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030104 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149
            GO:GO:0006790 HOGENOM:HOG000217625 KO:K01133 InterPro:IPR017785
            InterPro:IPR025863 Pfam:PF12411 TIGRFAMs:TIGR03417
            RefSeq:YP_272344.1 ProteinModelPortal:Q48QH2 STRING:Q48QH2
            GeneID:3556452 KEGG:psp:PSPPH_0030 PATRIC:19969019 OMA:MIRRGAY
            ProtClustDB:CLSK864791 GO:GO:0047753 Uniprot:Q48QH2
        Length = 501

 Score = 123 (48.4 bits), Expect = 3.4e-10, Sum P(3) = 3.4e-10
 Identities = 31/91 (34%), Positives = 46/91 (50%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L F+  + I  PN+  LA +G++ ++ Y   P+C PSR +L++G+ P   G        A
Sbjct:    19 LPFYSRSPILMPNLSRLAADGVVFDSAYCNSPLCAPSRFTLVSGQLPSKIGAYDN---AA 75

Query:   155 E-PRGVPLTERFLPEYLRELGYSTKAIGKWH 184
             + P  +P        YLR LGY T   GK H
Sbjct:    76 DFPADIPT----YAHYLRALGYKTALAGKMH 102

 Score = 85 (35.0 bits), Expect = 3.4e-10, Sum P(3) = 3.4e-10
 Identities = 34/119 (28%), Positives = 53/119 (44%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             RR Y      +D +VG ++  L   G+ E++I++F  D+G    E          G  Y 
Sbjct:   252 RRAYFGACSYIDLNVGKLMQTLDEVGLAEDTIVVFSGDHGDMLGEK---------GLWY- 301

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR-LPLN 421
                 K   +E   +VP +++SP   +  RVS  +   +D LPT    A G     LPL+
Sbjct:   302 ----KMHWFEMAARVPLVVYSPGQFKPGRVSASVS-TADLLPTFVEMANGKLDAGLPLD 355

 Score = 58 (25.5 bits), Expect = 3.4e-10, Sum P(3) = 3.4e-10
 Identities = 18/58 (31%), Positives = 27/58 (46%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL-VPQSHEQPDLVQADPKRF 622
             PC LF++  DP EQ +++ S P       + L   R    +P  H+Q  L     +RF
Sbjct:   402 PCLLFDVKKDPKEQKDLSQS-PAHEKLFNDFLAEARAKWDIPAIHQQV-LASQRRRRF 457


>ZFIN|ZDB-GENE-060929-332
            symbol:arsk "arylsulfatase family, member K"
            species:7955 "Danio rerio" [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0003824 "catalytic activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-060929-332
            GO:GO:0005576 GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HOGENOM:HOG000034080 HOVERGEN:HBG054703 CTD:153642 KO:K12376
            EMBL:BC124212 IPI:IPI00806585 RefSeq:NP_001070625.1
            UniGene:Dr.90831 ProteinModelPortal:Q08CJ7 GeneID:562412
            KEGG:dre:562412 InParanoid:Q08CJ7 NextBio:20884395 Uniprot:Q08CJ7
        Length = 523

 Score = 106 (42.4 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
 Identities = 32/109 (29%), Positives = 55/109 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G V++AL+  G L  ++++F SD+G   +E+R+   Y        
Sbjct:   271 RAFYYAMCAETDGMLGEVMAALRDTGSLNKTVVLFTSDHGDLAMEHRQF--Y-------- 320

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++EG   VP ++  P ++    VSL +  + D  PT+   AG
Sbjct:   321 ----KMSMFEGSSHVPLLIMGPGVKSGFEVSLPVSLV-DIYPTVLDLAG 364

 Score = 86 (35.3 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
 Identities = 25/89 (28%), Positives = 44/89 (49%)

Query:    96 LSFHGSNEI-PTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F   N++   P I+ +   G +  N Y   P+C PSRA++ +G++ +H        W 
Sbjct:    41 LTFQPGNKVVQLPYINYMRELGSVFLNSYTNSPICCPSRAAMWSGQF-VHLTQS----WN 95

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
                   P    ++ + LR+ GY T ++GK
Sbjct:    96 NNKCLHPNATTWMDD-LRKSGYHTHSMGK 123

 Score = 75 (31.5 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query:   564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             N P  LFNL  D  E  NIAS  PD+   L +LL+
Sbjct:   426 NVPPQLFNLSKDESELRNIASQFPDVCQDLDKLLR 460

 Score = 43 (20.2 bits), Expect = 9.0e-06, Sum P(3) = 9.0e-06
 Identities = 12/44 (27%), Positives = 20/44 (45%)

Query:   140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
             +P  T   GP   G+       T R  P +L ++ Y+  ++ KW
Sbjct:   203 HPYRTDSLGPTAGGS-------TFRTSPYWLNKVSYNQVSVPKW 239


>UNIPROTKB|H3BP66
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HGNC:HGNC:4122 ChiTaRS:Galns Ensembl:ENST00000562831 Bgee:H3BP66
            Uniprot:H3BP66
        Length = 170

 Score = 152 (58.6 bits), Expect = 6.5e-10, P = 6.5e-10
 Identities = 49/150 (32%), Positives = 72/150 (48%)

Query:   131 SRASLMTGKYPIHTGMQGPPIWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKW 183
             +RA+L+TG+ PI  G             P+    G+P +E+ LPE L++ GY +K +GKW
Sbjct:    10 ARAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKW 69

Query:   184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGH---DMRRNLST 238
             HLG  R ++ PL  GF+  FG  N     YD+         R  E+ G    +   NL T
Sbjct:    70 HLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKT 128

Query:   239 AWDTVGEY-ATDLFTKEAVQLIEDQPVDKP 267
                  GE   T ++ +EA+  I+ Q    P
Sbjct:   129 -----GEANLTQIYLQEALDFIKRQARHHP 153


>ASPGD|ASPL0000029545
            symbol:AN5449 species:162425 "Emericella nidulans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:BN001305 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AACD01000094 RefSeq:XP_663053.1
            ProteinModelPortal:Q5B1Y1 EnsemblFungi:CADANIAT00003640
            GeneID:2871741 KEGG:ani:AN5449.2 HOGENOM:HOG000217625 KO:K01133
            OMA:YIMADQM OrthoDB:EOG45F0XM InterPro:IPR017785 InterPro:IPR025863
            Pfam:PF12411 TIGRFAMs:TIGR03417 Uniprot:Q5B1Y1
        Length = 594

 Score = 123 (48.4 bits), Expect = 1.4e-09, Sum P(3) = 1.4e-09
 Identities = 32/92 (34%), Positives = 46/92 (50%)

Query:    96 LSFHGSNE-IPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+FH  +  I TPN++ LA  G++ ++ Y   P+C PSR  ++TG+ P   G        
Sbjct:    21 LAFHDKDSPIKTPNLNKLAEEGVVFDSAYCNSPLCAPSRFVMVTGQLPSKIGAYDN---A 77

Query:   154 AE-PRGVPLTERFLPEYLRELGYSTKAIGKWH 184
             A+ P  +P        YLR  GY T   GK H
Sbjct:    78 ADLPADIPT----YAHYLRREGYHTALAGKMH 105

 Score = 85 (35.0 bits), Expect = 1.4e-09, Sum P(3) = 1.4e-09
 Identities = 37/123 (30%), Positives = 54/123 (43%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             RR Y A    +D +VG ++  L   G+ +++II+F  D+G    E R       W     
Sbjct:   256 RRAYYAACTYVDTNVGKLLKVLDETGLRDDTIIVFTGDHGDMLGE-RGL-----W----- 304

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD-TSRLPLNI 422
             Y   K   +E   +VP I+ +P      R+S Q +   D LPT     G      LPL  
Sbjct:   305 Y---KMAWFENSARVPFIVNAPNRFAPARIS-QNVSTMDILPTFAELVGAPLVKELPL-- 358

Query:   423 DGL 425
             DG+
Sbjct:   359 DGV 361

 Score = 55 (24.4 bits), Expect = 1.4e-09, Sum P(3) = 1.4e-09
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDIS 590
             P  LF++ NDP E+ N+ +  PD S
Sbjct:   408 PPMLFDVQNDPLEKVNLVAGLPDPS 432


>UNIPROTKB|D6RDH0
            symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484
            HOGENOM:HOG000135354 HGNC:HGNC:32521 EMBL:AC011372 IPI:IPI00967848
            ProteinModelPortal:D6RDH0 SMR:D6RDH0 Ensembl:ENST00000509146
            ArrayExpress:D6RDH0 Bgee:D6RDH0 Uniprot:D6RDH0
        Length = 86

 Score = 149 (57.5 bits), Expect = 1.4e-09, P = 1.4e-09
 Identities = 29/88 (32%), Positives = 46/88 (52%)

Query:   180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
             +GKWHLGF+R+E  P  RGF++  G L G + YY +   D       + G D+    + A
Sbjct:     2 VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPG----VCGFDLHEGENVA 57

Query:   240 WDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             W   G+Y+T L+ + A  ++      +P
Sbjct:    58 WGLSGQYSTMLYAQRASHILASHSPQRP 85


>TIGR_CMR|CPS_2358
            symbol:CPS_2358 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            HOGENOM:HOG000014304 RefSeq:YP_269076.1 ProteinModelPortal:Q482E2
            STRING:Q482E2 GeneID:3518855 KEGG:cps:CPS_2358 PATRIC:21467803
            OMA:ETIRIDS BioCyc:CPSY167879:GI48-2421-MONOMER Uniprot:Q482E2
        Length = 499

 Score = 128 (50.1 bits), Expect = 1.9e-09, Sum P(3) = 1.9e-09
 Identities = 31/85 (36%), Positives = 45/85 (52%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPR 157
             +G+ ++ TPNID LA    +    Y+Q PVC PSR S++TG  P   G+        + R
Sbjct:    70 YGTAKVQTPNIDKLASQSTVFTRAYSQYPVCGPSRMSILTGLRPESNGIMNLK---DKIR 126

Query:   158 GVPLTERFLPEYLRELGYSTKAIGK 182
              V  +   LP++ +  GY T A GK
Sbjct:   127 DVNPSVITLPQFFKNNGYETAATGK 151

 Score = 89 (36.4 bits), Expect = 1.9e-09, Sum P(3) = 1.9e-09
 Identities = 31/106 (29%), Positives = 50/106 (47%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y A V  +D  VG ++  L++ G  EN++I+F  D+G          ++  WG       
Sbjct:   306 YFASVSFIDSLVGELLEELEKTGQAENTVIVFWGDHGF------HLGDHGLWG------- 352

Query:   367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
              K+T  E    VP I+  P  + N R + + + + D  P+L  AAG
Sbjct:   353 -KHTTMEQANHVPLIIKIPGSKAN-RYA-KPVELLDVFPSLTEAAG 395

 Score = 41 (19.5 bits), Expect = 1.9e-09, Sum P(3) = 1.9e-09
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   569 LFNLGNDPCEQNNIASS 585
             L++L NDP E  NI ++
Sbjct:   458 LYDLINDPLETKNIINT 474


>UNIPROTKB|Q148F3
            symbol:ARSK "Arylsulfatase K" species:9913 "Bos taurus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005576 GO:GO:0046872
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:BC118383 IPI:IPI00705188
            UniGene:Bt.16276 ProteinModelPortal:Q148F3 PRIDE:Q148F3
            Ensembl:ENSBTAT00000026877 GeneTree:ENSGT00400000022041
            HOGENOM:HOG000034080 HOVERGEN:HBG054703 InParanoid:Q148F3
            OMA:TYMLRTD OrthoDB:EOG42BX86 Uniprot:Q148F3
        Length = 540

 Score = 118 (46.6 bits), Expect = 4.1e-09, Sum P(3) = 4.1e-09
 Identities = 34/109 (31%), Positives = 58/109 (53%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL++ G+L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   280 RAFYYAMCAETDAMLGEIILALRQLGLLQKTIVIYTSDHGELAMEHRQF--Y-------- 329

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    VP ++  P IQ N +VS  ++ + D  PT+   AG
Sbjct:   330 ----KMSMYEASSHVPLLIMGPGIQANLQVS-SVVSLVDIYPTMLDIAG 373

 Score = 74 (31.1 bits), Expect = 4.1e-09, Sum P(3) = 4.1e-09
 Identities = 24/89 (26%), Positives = 40/89 (44%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F+ GS  +  P I+ +  +G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    50 LTFYPGSQVVKLPFINFMKAHGTSFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 108

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   109 LDPNYTTWMD-----VMEKHGYRTQKFGK 132

 Score = 65 (27.9 bits), Expect = 4.1e-09, Sum P(3) = 4.1e-09
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L +DP E  NIA+  P+++S L + L+
Sbjct:   443 LFDLSSDPDELTNIAAKFPEVTSSLDQKLR 472


>UNIPROTKB|D6RGC1
            symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 EMBL:AC104779 HGNC:HGNC:26286
            ChiTaRS:ARSJ IPI:IPI00966139 ProteinModelPortal:D6RGC1 SMR:D6RGC1
            Ensembl:ENST00000509829 HOGENOM:HOG000172533 ArrayExpress:D6RGC1
            Bgee:D6RGC1 Uniprot:D6RGC1
        Length = 133

 Score = 144 (55.7 bits), Expect = 4.7e-09, P = 4.7e-09
 Identities = 26/48 (54%), Positives = 34/48 (70%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGK 139
             G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGK
Sbjct:    87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGK 133


>ASPGD|ASPL0000046382
            symbol:AN11149 species:162425 "Emericella nidulans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012083
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF000972 GO:GO:0018958 EMBL:BN001307 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            HOGENOM:HOG000169239 ProteinModelPortal:C8VLL2
            EnsemblFungi:CADANIAT00007963 OMA:TENDPAN Uniprot:C8VLL2
        Length = 565

 Score = 115 (45.5 bits), Expect = 5.2e-09, Sum P(3) = 5.2e-09
 Identities = 43/152 (28%), Positives = 67/152 (44%)

Query:   120 NNMYAQPVCTPSRASLMTGKYPIHTGMQ--GPPIWGAEPRGVP--LTERFLPEYLRELGY 175
             N+     +C PSR SL TG+   +T +    PP +G  P+ V     E + P +L++ GY
Sbjct:    60 NHFVTTALCCPSRVSLWTGRQAHNTNVTWVAPP-YGGYPKFVSQGFNEDWFPLWLQDAGY 118

Query:   176 STKAIGKWHLGFFRREYT-PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
             +T  +GK         Y  P  +GF       NG     D +L D +  T        +R
Sbjct:   119 NTYYVGKLFNAHSVTTYNNPFVKGF-------NGS----DFLL-DPF--TYSYWNSSYQR 164

Query:   235 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
             N        G+Y TD+  ++A+  ++D   DK
Sbjct:   165 NHEAPKSYAGQYTTDVTEEKALGFVDDALEDK 196

 Score = 98 (39.6 bits), Expect = 5.2e-09, Sum P(3) = 5.2e-09
 Identities = 29/102 (28%), Positives = 51/102 (50%)

Query:   311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
             ++ +D+ V  ++  L+R G L N+ +I+ SDNG     +R            P  G K+T
Sbjct:   283 LQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHIGHHR-----------LP-PG-KST 329

Query:   371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
              +E  ++VP  +  P I+   +V+    HI D+ PT++   G
Sbjct:   330 SYEEDIRVPFFIRGPGIKSGGKVTQVTTHI-DFAPTIFELLG 370

 Score = 44 (20.5 bits), Expect = 5.2e-09, Sum P(3) = 5.2e-09
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query:   562 CTNGPCYLFNLGNDPCEQNNI 582
             CT G   LF+L  DP + +NI
Sbjct:   440 CT-GDHELFDLNTDPYQMHNI 459


>ZFIN|ZDB-GENE-061215-37
            symbol:ids "iduronate 2-sulfatase" species:7955
            "Danio rerio" [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0030512
            "negative regulation of transforming growth factor beta receptor
            signaling pathway" evidence=IMP] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0009790 "embryo development" evidence=IMP]
            [GO:0060536 "cartilage morphogenesis" evidence=IMP] [GO:0004423
            "iduronate-2-sulfatase activity" evidence=IDA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-061215-37 GO:GO:0005737 GO:GO:0009790
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0030512 GO:GO:0060536 OMA:CREGKNL
            GO:GO:0004423 GeneTree:ENSGT00640000091539 EMBL:CR774199
            IPI:IPI00495228 Ensembl:ENSDART00000106205 Bgee:F1R4Q5
            Uniprot:F1R4Q5
        Length = 561

 Score = 120 (47.3 bits), Expect = 7.3e-09, Sum P(3) = 7.3e-09
 Identities = 31/85 (36%), Positives = 46/85 (54%)

Query:   104 IPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLT 162
             + +PNID LA   ++ +N YAQ  VC PSR S +T + P  T +     +     G   T
Sbjct:    53 VKSPNIDQLASLSVVFHNAYAQQAVCGPSRVSFLTSRRPDTTKLYDFNSYWRVHAGNYTT 112

Query:   163 ERFLPEYLRELGYSTKAIGK-WHLG 186
                LP+Y +  GY+T ++GK +H G
Sbjct:   113 ---LPQYFKSNGYTTLSVGKVFHPG 134

 Score = 91 (37.1 bits), Expect = 7.3e-09, Sum P(3) = 7.3e-09
 Identities = 32/119 (26%), Positives = 56/119 (47%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSN 361
             R+ Y A V  +D  VG ++  L   G+ +N+I++  SD+G    E+ E + Y N+   + 
Sbjct:   291 RQHYFASVSYVDAQVGKILQTLDDVGLAKNTIVVLSSDHGWSLGEHGEWAKYSNFDVATR 350

Query:   362 YP---YR-GVKNTLWEGGVKV-PAILWSPQIQQN---PRVSLQMMHISDWLPTLYTAAG 412
              P   Y+ GV +     G K  P I      +++    ++   ++ + D  PTL   AG
Sbjct:   351 VPLMVYKAGVSSRRSRTGAKTFPFIDVFQDTREHFGKGKIVNSVVELLDVFPTLANLAG 409

 Score = 44 (20.5 bits), Expect = 7.3e-09, Sum P(3) = 7.3e-09
 Identities = 21/80 (26%), Positives = 35/80 (43%)

Query:   508 LNFNAIVESKTYQSLQQLSQNIFLP-ISNIDKM----RSTRQQATIHCGANPAPMTPSPC 562
             LN  A   S+  +    + +N  LP +++I  M    RS   + T+  G +P    P+  
Sbjct:   442 LNREAYSFSQYPRPSDSIQENSDLPNLADIRIMGYSIRSNDYRYTLWVGFDPLHCKPNMT 501

Query:   563 TNGPCYLFNLGNDPCEQNNI 582
                   L+ L  DP + NN+
Sbjct:   502 EIHAGELYILTEDPGQDNNL 521


>ZFIN|ZDB-GENE-050107-5
            symbol:gnsa "glucosamine (N-acetyl)-6-sulfatase a"
            species:7955 "Danio rerio" [GO:0030203 "glycosaminoglycan metabolic
            process" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 ZFIN:ZDB-GENE-050107-5 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0030203
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:BC097128 IPI:IPI00499007
            RefSeq:NP_001025379.1 UniGene:Dr.84802 ProteinModelPortal:Q4V902
            STRING:Q4V902 GeneID:566506 KEGG:dre:566506 CTD:566506
            InParanoid:Q4V902 NextBio:20888220 ArrayExpress:Q4V902
            Uniprot:Q4V902
        Length = 538

 Score = 109 (43.4 bits), Expect = 1.2e-08, Sum P(3) = 1.2e-08
 Identities = 48/168 (28%), Positives = 76/168 (45%)

Query:   116 GIILNNMY-AQPVCTPSRASLMTGKYP-----IHTGMQG---PPIW--GAEPRGVPLTER 164
             GI   N + A P+C PSRAS++TGKYP     ++  ++G      W  G EP   P    
Sbjct:    64 GITFTNAFVASPLCCPSRASILTGKYPHNHHVVNNTLEGNCSSTAWQKGQEPDAFPA--- 120

Query:   165 FLPEYLRELGYSTKAIGKW--HLGFFRR---EYTPLYRGFESHFGYLNGVISYYDHILSD 219
             FL ++     Y T   GK+    G  +    E+ PL  G++ H+  L     YY++ LS 
Sbjct:   121 FLQKHA---AYQTFFAGKYLNEYGSKKAGGVEHVPL--GWD-HWFALERNSKYYNYTLS- 173

Query:   220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
                    +NG   R   + + D    Y TD+    ++  +E++   +P
Sbjct:   174 -------VNGRAQRHGQNYSED----YLTDVLANVSIDFLENKSNRRP 210

 Score = 96 (38.9 bits), Expect = 1.2e-08, Sum P(3) = 1.2e-08
 Identities = 39/147 (26%), Positives = 70/147 (47%)

Query:   297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             +++ +  R+ +  ++  +DD V  ++  L  +G L N+ +IF SDNG  T ++       
Sbjct:   269 EFLDNAYRKRWRTLLS-VDDLVEKLVRKLDIRGELSNTYVIFTSDNGYHTGQF------- 320

Query:   357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG---G 413
                 + P    K  L+E  ++VP ++  P I+ N    L + ++ D  PT+   AG    
Sbjct:   321 ----SLPMD--KRQLYEFDIRVPLLVRGPNIKPNQTSPLPIANV-DLGPTILDIAGYNVN 373

Query:   414 DT-----SRLPLNIDGLDQ--WSSLLL 433
             DT     S LP+ +  L+   W S +L
Sbjct:   374 DTQMDGMSFLPIMVGELNSSVWRSDVL 400

 Score = 48 (22.0 bits), Expect = 1.2e-08, Sum P(3) = 1.2e-08
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
             ++NL +DP + +NIA S   I  ++ E  K + R ++ QS
Sbjct:   465 VYNLTSDPFQLSNIAKS---IDQEVLE--KMNHRLMMLQS 499


>WB|WBGene00006309
            symbol:sul-2 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HSSP:P15289 EMBL:FO080993 PIR:T29618 RefSeq:NP_505102.1
            ProteinModelPortal:Q18924 SMR:Q18924 PaxDb:Q18924
            EnsemblMetazoa:D1014.1 GeneID:179194 KEGG:cel:CELE_D1014.1
            UCSC:D1014.1 CTD:179194 WormBase:D1014.1 InParanoid:Q18924
            OMA:HITHHEP NextBio:904322 Uniprot:Q18924
        Length = 452

 Score = 162 (62.1 bits), Expect = 1.3e-08, P = 1.3e-08
 Identities = 40/117 (34%), Positives = 60/117 (51%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
             G+ D++ +G        +D +A  G      Y A  +C+PSRA  +TG+ PI  G+ G  
Sbjct:    44 GYGDIASYGHPTQEYTQVDRMAAEGTRFTQAYSADSMCSPSRAGFITGRLPIRLGIVGGR 103

Query:   151 -IWGA-EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
              ++   +  G+P +E  + E L+E GY+T  +GKWHLG      T     P  RGFE
Sbjct:   104 RVFVPYDIGGLPKSETTMAEMLQEAGYATGMVGKWHLGINENNATDGAHLPSKRGFE 160


>UNIPROTKB|F1NAA9
            symbol:ARSK "Arylsulfatase K" species:9031 "Gallus gallus"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484
            GeneTree:ENSGT00400000022041 OMA:TYMLRTD IPI:IPI00571159
            EMBL:AADN02058337 EMBL:AADN02058338 Ensembl:ENSGALT00000023643
            Uniprot:F1NAA9
        Length = 535

 Score = 111 (44.1 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
 Identities = 34/108 (31%), Positives = 57/108 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +ISALQ   +L+ +II+F SD+G   +E+R+   Y        
Sbjct:   277 RAFYYAMCAETDAMLGEIISALQDTDLLKKTIIMFTSDHGELAMEHRQF--Y-------- 326

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                 K +++EG   VP ++  P I++  +VS  ++ + D  PT+   A
Sbjct:   327 ----KMSMYEGSSHVPLLVMGPGIRKQQQVSA-VVSLVDIYPTMLDLA 369

 Score = 86 (35.3 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
 Identities = 42/168 (25%), Positives = 69/168 (41%)

Query:    96 LSFHGSNE-IPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F+  N+ +  P I+ +  +G +  N Y   P+C PSRA++ +G +  H         G
Sbjct:    47 LTFYPGNQTVDLPFINFMKRHGSVFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 105

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
              +P  V   +      +++ GY T+  GK        +YT    G  S    +       
Sbjct:   106 LDPDDVTWMD-----LMQKHGYYTQKYGKL-------DYTS---GHHSVSNRVEAWTRDV 150

Query:   214 DHILSDQYSRTVELNGHDMR-RNLSTAWDTVGEYATDLFTKEAVQLIE 260
             + +L  +    V L G     R + T W  V + A     KEAV L +
Sbjct:   151 EFLLRQEGRPKVNLTGDRRHVRVMKTDWQ-VTDKAVTWIKKEAVNLTQ 197

 Score = 54 (24.1 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query:   557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             +T S   + P  LF+L  DP E  N+A   P+    L ++L+
Sbjct:   428 ITYSDGVSVPPQLFDLSADPDELTNVAIKFPETVQSLDKILR 469


>UNIPROTKB|F1NI04
            symbol:GNS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
            PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:AADN02009911 IPI:IPI00596266
            Ensembl:ENSGALT00000016025 Uniprot:F1NI04
        Length = 546

 Score = 108 (43.1 bits), Expect = 2.6e-08, Sum P(3) = 2.6e-08
 Identities = 36/134 (26%), Positives = 66/134 (49%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ D  R+ +  ++  +DD V  ++  L+  G L+N+ I + SDNG  T ++  
Sbjct:   270 TNSSIQFLDDAYRKRWQTLLS-VDDLVEKLVKKLEIHGELDNTYIFYTSDNGFHTGQF-- 326

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   327 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTNKMLVANI-DLGPTILDIA 374

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   375 GYDLNKTQM--DGM 386

 Score = 94 (38.1 bits), Expect = 2.6e-08, Sum P(3) = 2.6e-08
 Identities = 45/163 (27%), Positives = 69/163 (42%)

Query:   105 PTPNIDAL-AYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVP 160
             P    +AL A  G+  +N Y    +C PSRAS++TGKYP +  +    + G  +      
Sbjct:    58 PMKKTNALIAQMGVTFSNAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKLWQK 117

Query:   161 LTE-RFLPEYLREL-GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYD-HIL 217
             + E    P  L+ + GY T   GK     +  EY     G  SH     G   +Y     
Sbjct:   118 IQEPNTFPALLKSMCGYQTFFAGK-----YLNEYGAEDAGGVSHVP--PGWSFWYALEKN 170

Query:   218 SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
             S  Y+ T+ +NG   R   + + D    Y TD+    ++  +E
Sbjct:   171 SKYYNYTLSVNGKARRHGENYSVD----YLTDVLANMSLDFLE 209

 Score = 48 (22.0 bits), Expect = 2.6e-08, Sum P(3) = 2.6e-08
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
             ++NL  DP + NNIA +   I  ++ E + Y  R ++ QS
Sbjct:   470 VYNLTADPHQINNIAKT---IDQEILEKMNY--RLMMLQS 504


>UNIPROTKB|Q5ZK90
            symbol:ARSK "Arylsulfatase K" species:9031 "Gallus gallus"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000034080
            HOVERGEN:HBG054703 OrthoDB:EOG42BX86 CTD:153642 KO:K12376
            EMBL:AJ720194 IPI:IPI00571159 RefSeq:NP_001026586.1
            UniGene:Gga.22351 ProteinModelPortal:Q5ZK90 GeneID:427116
            KEGG:gga:427116 InParanoid:Q5ZK90 NextBio:20828431 Uniprot:Q5ZK90
        Length = 535

 Score = 111 (44.1 bits), Expect = 2.9e-08, Sum P(3) = 2.9e-08
 Identities = 34/108 (31%), Positives = 57/108 (52%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +ISALQ   +L+ +II+F SD+G   +E+R+   Y        
Sbjct:   277 RAFYYAMCAETDAMLGEIISALQDTDLLKKTIIMFTSDHGELAMEHRQF--Y-------- 326

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                 K +++EG   VP ++  P I++  +VS  ++ + D  PT+   A
Sbjct:   327 ----KMSMYEGSSHVPLLVMGPGIRKQQQVSA-VVSLVDIYPTMLDLA 369

 Score = 84 (34.6 bits), Expect = 2.9e-08, Sum P(3) = 2.9e-08
 Identities = 42/168 (25%), Positives = 69/168 (41%)

Query:    96 LSFHGSNE-IPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F+  N+ +  P I+ +  +G +  N Y   P+C PSRA++ +G +  H         G
Sbjct:    47 LTFYPGNQTVDLPFINFMKRHGSVFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 105

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
              +P  V   +      +++ GY T+  GK        +YT    G  S    +       
Sbjct:   106 LDPDYVTWMD-----LMQKHGYYTQKYGKL-------DYTS---GHHSVSNRVEAWTRDV 150

Query:   214 DHILSDQYSRTVELNGHDMR-RNLSTAWDTVGEYATDLFTKEAVQLIE 260
             + +L  +    V L G     R + T W  V + A     KEAV L +
Sbjct:   151 EFLLRQEGRPKVNLTGDRRHVRVMKTDWQ-VTDKAVTWIKKEAVNLTQ 197

 Score = 54 (24.1 bits), Expect = 2.9e-08, Sum P(3) = 2.9e-08
 Identities = 14/42 (33%), Positives = 22/42 (52%)

Query:   557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             +T S   + P  LF+L  DP E  N+A   P+    L ++L+
Sbjct:   428 ITYSDGVSVPPQLFDLSADPDELTNVAIKFPETVQSLDKILR 469


>FB|FBgn0038660
            symbol:CG14291 species:7227 "Drosophila melanogaster"
            [GO:0016250 "N-sulfoglucosamine sulfohydrolase activity"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014297 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            HSSP:P15289 KO:K01565 OMA:RDPHETQ GO:GO:0016250 EMBL:AY071569
            RefSeq:NP_650760.1 UniGene:Dm.5859 SMR:Q9VE24 STRING:Q9VE24
            EnsemblMetazoa:FBtr0083724 GeneID:42266 KEGG:dme:Dmel_CG14291
            UCSC:CG14291-RA FlyBase:FBgn0038660 GeneTree:ENSGT00390000013080
            InParanoid:Q9VE24 OrthoDB:EOG49ZW4K GenomeRNAi:42266 NextBio:827964
            Uniprot:Q9VE24
        Length = 524

 Score = 105 (42.0 bits), Expect = 3.2e-08, Sum P(4) = 3.2e-08
 Identities = 30/85 (35%), Positives = 43/85 (50%)

Query:   106 TPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
             TPN+DALA  G++ NN +     C+PSR+ L+TG+    +GM G    G     V     
Sbjct:    45 TPNLDALAKRGLLFNNAFTSVSSCSPSRSQLLTGQAGHSSGMYGLH-QGVHNFNVLPDTG 103

Query:   165 FLPEYLRELGYS---TKAIGKWHLG 186
              LP  +R+       +  IGK H+G
Sbjct:   104 SLPNLIRDQSGGRILSGIIGKKHVG 128

 Score = 83 (34.3 bits), Expect = 3.2e-08, Sum P(4) = 3.2e-08
 Identities = 17/49 (34%), Positives = 29/49 (59%)

Query:   300 TDPNRRTYAAM---VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
             TD  R+  AA    + +LD  VG ++  L+  G+ + +++I+ SDNG P
Sbjct:   232 TDVVRQELAAQYMTISRLDQGVGLMLKELEAAGVADQTLVIYTSDNGPP 280

 Score = 58 (25.5 bits), Expect = 3.2e-08, Sum P(4) = 3.2e-08
 Identities = 13/49 (26%), Positives = 27/49 (55%)

Query:   363 PYRGVKNTLWEGGVKVPAILWSPQIQ-QNPRVSLQMMHISDWLPTLYTA 410
             P+ G +  L+E G++ P I+ SP  + ++   +  M+ + D  P++  A
Sbjct:   280 PFPGGRTNLYEHGIRSPLIISSPNKEDRHHEATAAMVSLLDIYPSVMDA 328

 Score = 47 (21.6 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 39/196 (19%), Positives = 78/196 (39%)

Query:   395 LQMMHISDWLPTLYTAAGGDT---SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             L+   ++D    +YT+  G      R  L   G+   S L++++P++ + + +      S
Sbjct:   260 LEAAGVADQTLVIYTSDNGPPFPGGRTNLYEHGIR--SPLIISSPNKEDRHHEATAAMVS 317

Query:   452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
             LL   PS  ++  + I     T  V      ++         D  +G    ++V +    
Sbjct:   318 LLDIYPSVMDA--LQIPRPNDTKIVGRSILPVLREEPPIKESDSVFGSHSYHEVTMAYPM 375

Query:   512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQ---ATIHCGANPAPMTPSPCTNGPCY 568
              +V ++ Y+ +  ++     PI        T QQ   AT+     P   +       P +
Sbjct:   376 RMVRNRRYKLIHNINYWADFPIDQDFYTSPTFQQILNATLRKQTLPWYRSLLQYYQRPEW 435

Query:   569 -LFNLGNDPCEQNNIA 583
              L+++  DP E+ N+A
Sbjct:   436 ELYDIKTDPLERFNLA 451

 Score = 40 (19.1 bits), Expect = 3.2e-08, Sum P(4) = 3.2e-08
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query:   506 PLLNFNAIVESKTYQSLQQLSQNIF 530
             PL  FN   ++K   +L+QL + +F
Sbjct:   444 PLERFNLADKAKYNGTLKQLREQLF 468


>UNIPROTKB|Q32KH0
            symbol:ARSK "Arylsulfatase K" species:9615 "Canis lupus
            familiaris" [GO:0005576 "extracellular region" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00400000022041 HOGENOM:HOG000034080
            HOVERGEN:HBG054703 OMA:TYMLRTD OrthoDB:EOG42BX86 EMBL:AAEX02030697
            EMBL:BN000767 RefSeq:NP_001041582.1 UniGene:Cfa.39082
            ProteinModelPortal:Q32KH0 Ensembl:ENSCAFT00000012646 GeneID:488903
            KEGG:cfa:488903 CTD:153642 InParanoid:Q32KH0 KO:K12376
            NextBio:20862170 Uniprot:Q32KH0
        Length = 535

 Score = 116 (45.9 bits), Expect = 3.3e-08, Sum P(3) = 3.3e-08
 Identities = 32/109 (29%), Positives = 58/109 (53%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL++  +L+N+I+I+ SD+G   +E+R+   Y        
Sbjct:   280 RAFYYAMCAETDAMLGEIILALRQLDLLQNTIVIYTSDHGELAMEHRQF--Y-------- 329

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    +P ++  P I+ N +VS  ++ + D  PT+   AG
Sbjct:   330 ----KMSMYEASAHIPLLMMGPGIKANQQVS-NVVSLVDIYPTMLDIAG 373

 Score = 76 (31.8 bits), Expect = 3.3e-08, Sum P(3) = 3.3e-08
 Identities = 24/89 (26%), Positives = 40/89 (44%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F+ GS  +  P I+ +  +G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    50 LTFYPGSQAVKLPFINLMKAHGTSFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 108

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   109 LDPNYTTWMD-----IMEKHGYRTQKFGK 132

 Score = 56 (24.8 bits), Expect = 3.3e-08, Sum P(3) = 3.3e-08
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L +DP E  NIA+  P+++  L + L+
Sbjct:   443 LFDLFSDPDELTNIATKFPEVTYSLDQKLR 472


>UNIPROTKB|Q6MX51
            symbol:Rv0296c "Sulfatase" species:83332 "Mycobacterium
            tuberculosis H37Rv" [GO:0004065 "arylsulfatase activity"
            evidence=IDA] [GO:0005618 "cell wall" evidence=IDA] [GO:0046872
            "metal ion binding" evidence=IDA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005618
            GenomeReviews:AL123456_GR GO:GO:0046872 EMBL:BX842573
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 KO:K01567 EMBL:CP003248
            PIR:F70837 RefSeq:YP_006513622.1 RefSeq:YP_177712.1
            ProteinModelPortal:Q6MX51 SMR:Q6MX51 PRIDE:Q6MX51
            EnsemblBacteria:EBMYCT00000002598 GeneID:13316285 GeneID:886600
            KEGG:mtu:Rv0296c KEGG:mtv:RVBD_0296c PATRIC:18149150
            TubercuList:Rv0296c HOGENOM:HOG000045150 OMA:DPGMAEP
            ProtClustDB:CLSK799699 Uniprot:Q6MX51
        Length = 465

 Score = 141 (54.7 bits), Expect = 6.5e-08, Sum P(3) = 6.5e-08
 Identities = 61/191 (31%), Positives = 87/191 (45%)

Query:    89 IVYGWNDLS-FHGSNEIP---TPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH 143
             ++  W+DL  + G    P   +P +D LA  GI+    +A  P+CTPSR SL TG+YP  
Sbjct:    14 LIVHWHDLGRYLGVYHHPDVYSPRLDRLAAEGILFTRAHATAPLCTPSRGSLFTGRYPQS 73

Query:   144 TGMQGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
              G+ G    G E R GV    + LP+ L E G+ +   G  H       Y P   GF+  
Sbjct:    74 NGLVGLAHHGWEYRTGV----QTLPQLLSESGWYSALFGMQH----ETSY-PKRLGFDE- 123

Query:   203 FGYLNGVISYYDHILSDQ-YSRTVELNGHDMRRNLSTA--WDTVGEYATDLFT---KEAV 256
             F   N    Y      D  ++R   L+G   +R L TA  ++T   Y  + +      AV
Sbjct:   124 FDVSNSYCEYVVAKAQDWLHNRVPALDG---QRFLLTAGFFETHRPYPHERYRPADSAAV 180

Query:   257 QLIEDQPVDKP 267
             +L +  P D P
Sbjct:   181 ELPDYLP-DTP 190

 Score = 50 (22.7 bits), Expect = 6.5e-08, Sum P(3) = 6.5e-08
 Identities = 8/29 (27%), Positives = 20/29 (68%)

Query:   315 DDSVGTVISALQRKGMLENSIIIFMSDNG 343
             D++VG ++  L   G+  ++ ++F++D+G
Sbjct:   207 DEAVGRLLDTLADTGLDASTWVVFVTDHG 235

 Score = 49 (22.3 bits), Expect = 6.5e-08, Sum P(3) = 6.5e-08
 Identities = 15/43 (34%), Positives = 22/43 (51%)

Query:   553 NPAPMTPSPCTNGPC---YLFNLGNDPCEQNNIASSRPDISSQ 592
             +PA M  +P    P     L++L  DP E NN+ +   D S+Q
Sbjct:   359 SPAGMAVAPLVKAPRPQRELYDLRADPTETNNLLAG--DDSTQ 399


>UNIPROTKB|H9L0P8
            symbol:H9L0P8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 GO:GO:0008484 EMBL:AADN02076197
            Ensembl:ENSGALT00000026883 OMA:NNTFVLF Uniprot:H9L0P8
        Length = 233

 Score = 147 (56.8 bits), Expect = 7.4e-08, P = 7.4e-08
 Identities = 41/121 (33%), Positives = 65/121 (53%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+ A+ +KG+ +N+++ F SD+G   +E R+    +  G N  YRG
Sbjct:    64 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGG-WLE-RQEGKRQLGGWNGIYRG 121

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    +V  +   + D  PT+   AGG   +  + IDG 
Sbjct:   122 GKAMGGWEGGIRVPGIFRWPGVLPAGKVISEPTSLMDIYPTVVHLAGGVVPQDRV-IDGR 180

Query:   426 D 426
             D
Sbjct:   181 D 181


>TIGR_CMR|SPO_1083
            symbol:SPO_1083 "choline sulfatase" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
            InterPro:IPR017785 InterPro:IPR025863 Pfam:PF12411
            TIGRFAMs:TIGR03417 ProtClustDB:CLSK864791 GO:GO:0047753
            RefSeq:YP_166334.1 ProteinModelPortal:Q5LUH1 GeneID:3195014
            KEGG:sil:SPO1083 PATRIC:23375467 OMA:QEAIILF Uniprot:Q5LUH1
        Length = 502

 Score = 101 (40.6 bits), Expect = 1.4e-07, Sum P(3) = 1.4e-07
 Identities = 35/99 (35%), Positives = 43/99 (43%)

Query:   107 PNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG-VPLTER 164
             PN+  LA       N Y A P+C P RAS M+G+ P  TG+       AE R  +P    
Sbjct:    30 PNLKRLAARSTRFANAYTASPLCAPGRASFMSGQLPSRTGVYDN---AAEFRSDIPT--- 83

Query:   165 FLPEYLRELGYSTKAIGKWHL-------GFFRREYTPLY 196
                 +LR  GY T   GK H        GF  R  T +Y
Sbjct:    84 -YAHHLRRAGYYTCLSGKMHFVGPDQLHGFEERLTTDIY 121

 Score = 90 (36.7 bits), Expect = 1.4e-07, Sum P(3) = 1.4e-07
 Identities = 35/122 (28%), Positives = 57/122 (46%)

Query:   303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
             +RR Y A +  LDD +G ++  L+     + +II+F+SD+G    E R       W    
Sbjct:   253 SRRAYFANISYLDDKLGEILEVLETTR--QEAIILFVSDHGDMLGE-RGL-----W---- 300

Query:   363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
                  K   +EG  +VP ++ +P ++   R+   +  I D  PTL   AG D + +    
Sbjct:   301 ----FKMNFYEGSARVPLMVAAPGMEPG-RIDTPVSTI-DVTPTLGELAGVDMAEIAPWT 354

Query:   423 DG 424
             DG
Sbjct:   355 DG 356

 Score = 51 (23.0 bits), Expect = 1.4e-07, Sum P(3) = 1.4e-07
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query:   562 CTNGPCYLFNLGNDPCEQNNIASSRPD 588
             C   P  LF+L  DP E  N+A   PD
Sbjct:   398 CALDPDQLFDLDADPHEMTNLAD-HPD 423


>UNIPROTKB|O60597
            symbol:IDS "Iduronate-2-sulfatase" species:9606 "Homo
            sapiens" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HSSP:P08842 EMBL:AC233288 UniGene:Hs.460960 HGNC:HGNC:5389
            ChiTaRS:IDS EMBL:AF050145 IPI:IPI00640469 SMR:O60597 STRING:O60597
            Ensembl:ENST00000428056 UCSC:uc011mxj.2 HOGENOM:HOG000207088
            HOVERGEN:HBG053054 Uniprot:O60597
        Length = 179

 Score = 130 (50.8 bits), Expect = 3.0e-07, P = 3.0e-07
 Identities = 33/93 (35%), Positives = 50/93 (53%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + ++  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    51 LGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 110

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLG 186
                G   T   +P+Y +E GY T ++GK +H G
Sbjct:   111 VHAGNFST---IPQYFKENGYVTMSVGKVFHPG 140


>MGI|MGI:1924291
            symbol:Arsk "arylsulfatase K" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:1924291 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00400000022041 HOGENOM:HOG000034080
            HOVERGEN:HBG054703 OrthoDB:EOG42BX86 CTD:153642 KO:K12376
            EMBL:AK046378 EMBL:AK013194 EMBL:AK086667 EMBL:AK019515
            EMBL:AK031147 EMBL:AK032812 EMBL:AK035464 EMBL:AK039765
            EMBL:AK083188 EMBL:AK084090 EMBL:AK144817 EMBL:AK165970
            EMBL:BC046790 EMBL:BC058351 EMBL:BN000751 IPI:IPI00112271
            RefSeq:NP_084123.2 UniGene:Mm.196399 ProteinModelPortal:Q9D2L1
            SMR:Q9D2L1 STRING:Q9D2L1 PhosphoSite:Q9D2L1 PRIDE:Q9D2L1
            Ensembl:ENSMUST00000120573 GeneID:77041 KEGG:mmu:77041
            UCSC:uc007rgh.1 InParanoid:Q8BL50 NextBio:346358 Bgee:Q9D2L1
            Genevestigator:Q9D2L1 Uniprot:Q9D2L1
        Length = 553

 Score = 109 (43.4 bits), Expect = 3.4e-07, Sum P(3) = 3.4e-07
 Identities = 32/109 (29%), Positives = 56/109 (51%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   274 RAFYYAMCAETDAMLGEIILALHKLDLLQKTIVIYTSDHGEMAMEHRQF--Y-------- 323

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E  V VP ++  P I+ N +V   ++ + D  PT+   AG
Sbjct:   324 ----KMSMYEASVHVPLLMMGPGIKANLQVP-SVVSLVDIYPTMLDIAG 367

 Score = 74 (31.1 bits), Expect = 3.4e-07, Sum P(3) = 3.4e-07
 Identities = 24/89 (26%), Positives = 39/89 (43%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F  GS  +  P I+ +  +G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    46 LTFQPGSQVVKLPFINFMRAHGTTFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 104

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   105 LDPNYTTWMD-----IMEKHGYQTQKFGK 128

 Score = 56 (24.8 bits), Expect = 3.4e-07, Sum P(3) = 3.4e-07
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L  DP E  NIA+  P+I+  L + L+
Sbjct:   437 LFDLSLDPDELTNIATEFPEITYSLDQKLR 466


>UNIPROTKB|I3L814
            symbol:ARSE "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 GO:GO:0008484
            Ensembl:ENSSSCT00000025117 OMA:CHIVALA Uniprot:I3L814
        Length = 448

 Score = 120 (47.3 bits), Expect = 3.6e-07, Sum P(3) = 3.6e-07
 Identities = 38/130 (29%), Positives = 58/130 (44%)

Query:   307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             Y   V+++D  VG V+  L  +G+   +++ F SD+G     +     Y  W  N  Y+G
Sbjct:   178 YGDNVEEMDWMVGQVLDVLDTEGLSNGTLVYFSSDHGGSLEAHFGNQQYGGW--NGIYKG 235

Query:   367 VKNTL-WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
              K    WEGG++VP I   P +    RV      I + +  +     G    + + IDG 
Sbjct:   236 GKGMGGWEGGIRVPGIFRWPGVLPAGRVLRAPQSIHEDVVAVAQEGAGYILHIRV-IDGR 294

Query:   426 DQWSSLLLNT 435
             D    LLL T
Sbjct:   295 DLLP-LLLGT 303

 Score = 68 (29.0 bits), Expect = 3.6e-07, Sum P(3) = 3.6e-07
 Identities = 21/69 (30%), Positives = 33/69 (47%)

Query:   566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY----HRRTLVPQSHEQPDLVQADPKR 621
             P  LF+L  DP E + +      +  Q+ E +K     H++TL P     P  +Q D  R
Sbjct:   369 PPLLFDLSRDPSEAHPLTPDTEPLFHQVVERVKEAVRDHQQTLSPV----P--LQLD--R 420

Query:   622 FNDTWSPWI 630
              ++ W PW+
Sbjct:   421 LDNVWKPWL 429

 Score = 46 (21.3 bits), Expect = 3.6e-07, Sum P(3) = 3.6e-07
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query:   181 GKWHLGFFRRE-----YTPLYRGFESHFG 204
             GKWHLG          + PL  GF+  +G
Sbjct:     3 GKWHLGLNCESSEDHCHHPLNHGFDLFYG 31


>ZFIN|ZDB-GENE-030131-4958
            symbol:sgsh "N-sulfoglucosamine sulfohydrolase
            (sulfamidase)" species:7955 "Danio rerio" [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-4958
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448
            HOGENOM:HOG000234731 HOVERGEN:HBG012598 KO:K01565 OMA:RDPHETQ
            OrthoDB:EOG4RXZ01 GeneTree:ENSGT00390000013080 EMBL:CU459096
            IPI:IPI00616379 RefSeq:NP_001116740.1 UniGene:Dr.80125
            Ensembl:ENSDART00000063147 GeneID:563849 KEGG:dre:563849
            NextBio:20885106 Uniprot:B0V3V9
        Length = 511

 Score = 87 (35.7 bits), Expect = 4.1e-07, Sum P(4) = 4.1e-07
 Identities = 28/97 (28%), Positives = 45/97 (46%)

Query:    92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQGP 149
             G  +   + +  + TP++ AL+   +I  N +     C+PSR++++TG  P H  GM G 
Sbjct:    34 GGFETDVYNNTVVQTPHLRALSKRSLIFKNAFTSVSSCSPSRSTILTG-LPQHQNGMYGL 92

Query:   150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
                G          + LP  L+     T  IGK H+G
Sbjct:    93 H-QGVHHFNSFDGVQSLPLLLKRANIHTGIIGKKHVG 128

 Score = 79 (32.9 bits), Expect = 4.1e-07, Sum P(4) = 4.1e-07
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query:   311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
             V +LD  +G V+  L++ G   ++++I+ SDNG P
Sbjct:   252 VSRLDQGIGLVLEELRKAGFENDTLVIYSSDNGIP 286

 Score = 61 (26.5 bits), Expect = 4.1e-07, Sum P(4) = 4.1e-07
 Identities = 15/46 (32%), Positives = 25/46 (54%)

Query:   363 PYRGVKNTLWEGGVKVPAILWSPQIQQN-PRVSLQMMHISDWLPTL 407
             P+   +  L+  GVK P +L SP+ QQ   ++S   + + D  PT+
Sbjct:   286 PFPNGRTNLYGSGVKEPMLLSSPEHQQRWGKLSQAYVSLLDITPTI 331

 Score = 49 (22.3 bits), Expect = 4.1e-07, Sum P(4) = 4.1e-07
 Identities = 18/50 (36%), Positives = 26/50 (52%)

Query:   569 LFNLGNDPCEQNNIASSRP--DISSQLYELL-KYHRRTLVPQSHEQPDLV 615
             LF++  DP E+ N+A      ++   L +LL K+  RT  P   E PD V
Sbjct:   447 LFDVRTDPMEKVNLAGDLDYSEVLESLKDLLLKWQWRTEDPWVCE-PDAV 495


>UNIPROTKB|F1NGI6
            symbol:SGSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AADN02053526
            IPI:IPI00570654 Ensembl:ENSGALT00000011369 OMA:CYNPAVS
            Uniprot:F1NGI6
        Length = 119

 Score = 125 (49.1 bits), Expect = 5.2e-07, P = 5.2e-07
 Identities = 35/90 (38%), Positives = 47/90 (52%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQGPPIWGAEP 156
             + ++ I TPN+DALA  G++  N +     C+PSRAS++TG  P H  GM G    G   
Sbjct:    26 YNNSAIRTPNLDALARRGLLFQNAFTSVSSCSPSRASVLTG-LPQHQNGMYGLH-QGVHH 83

Query:   157 RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
                    R LP  LR+    T  IGK H+G
Sbjct:    84 FNSFDAVRSLPGLLRQANIRTGIIGKKHVG 113


>RGD|1310182
            symbol:Arsk "arylsulfatase family, member K" species:10116
            "Rattus norvegicus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1310182 GO:GO:0005576 GO:GO:0046872
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000034080
            HOVERGEN:HBG054703 OMA:TYMLRTD OrthoDB:EOG42BX86 CTD:153642
            KO:K12376 EMBL:AABR03012430 EMBL:AABR03016345 EMBL:AABR03016385
            EMBL:BN000745 IPI:IPI00367917 RefSeq:NP_001041382.1
            UniGene:Rn.202360 ProteinModelPortal:Q32KJ2 STRING:Q32KJ2
            GeneID:365619 KEGG:rno:365619 UCSC:RGD:1310182 InParanoid:Q32KJ2
            NextBio:687770 Genevestigator:Q32KJ2 Uniprot:Q32KJ2
        Length = 563

 Score = 107 (42.7 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 31/109 (28%), Positives = 55/109 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   274 RAFYYAMCAETDAMLGEIILALHKLNLLQKTIVIYTSDHGEMAMEHRQF--Y-------- 323

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    VP ++  P I+ N +V   ++ + D  PT+   AG
Sbjct:   324 ----KMSMYEASAHVPILMMGPGIKANLQVP-SLVSLVDIYPTMLDIAG 367

 Score = 73 (30.8 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 24/89 (26%), Positives = 38/89 (42%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F  GS  +  P I+ +   G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    46 LTFQPGSQVVKLPFINFMRARGTTFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 104

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   105 LDPNYTTWMD-----VMEKHGYQTQKFGK 128

 Score = 56 (24.8 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L  DP E  NIA+  P+I+  L + L+
Sbjct:   437 LFDLSLDPDELTNIATEFPEITYSLDQQLR 466


>UNIPROTKB|Q32KJ2
            symbol:Arsk "Arylsulfatase K" species:10116 "Rattus
            norvegicus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 RGD:1310182 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000034080
            HOVERGEN:HBG054703 OMA:TYMLRTD OrthoDB:EOG42BX86 CTD:153642
            KO:K12376 EMBL:AABR03012430 EMBL:AABR03016345 EMBL:AABR03016385
            EMBL:BN000745 IPI:IPI00367917 RefSeq:NP_001041382.1
            UniGene:Rn.202360 ProteinModelPortal:Q32KJ2 STRING:Q32KJ2
            GeneID:365619 KEGG:rno:365619 UCSC:RGD:1310182 InParanoid:Q32KJ2
            NextBio:687770 Genevestigator:Q32KJ2 Uniprot:Q32KJ2
        Length = 563

 Score = 107 (42.7 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 31/109 (28%), Positives = 55/109 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   274 RAFYYAMCAETDAMLGEIILALHKLNLLQKTIVIYTSDHGEMAMEHRQF--Y-------- 323

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    VP ++  P I+ N +V   ++ + D  PT+   AG
Sbjct:   324 ----KMSMYEASAHVPILMMGPGIKANLQVP-SLVSLVDIYPTMLDIAG 367

 Score = 73 (30.8 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 24/89 (26%), Positives = 38/89 (42%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F  GS  +  P I+ +   G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    46 LTFQPGSQVVKLPFINFMRARGTTFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 104

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   105 LDPNYTTWMD-----VMEKHGYQTQKFGK 128

 Score = 56 (24.8 bits), Expect = 7.4e-07, Sum P(3) = 7.4e-07
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L  DP E  NIA+  P+I+  L + L+
Sbjct:   437 LFDLSLDPDELTNIATEFPEITYSLDQQLR 466


>UNIPROTKB|P51688
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0016250 "N-sulfoglucosamine sulfohydrolase
            activity" evidence=IEA] [GO:0006029 "proteoglycan metabolic
            process" evidence=TAS] [GO:0003824 "catalytic activity"
            evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0006027 "glycosaminoglycan catabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Reactome:REACT_116125 GO:GO:0003824
            GO:GO:0044281 GO:GO:0046872 GO:GO:0005975 GO:GO:0043202
            GO:GO:0006027 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GO:GO:0006029 EMBL:U30894 EMBL:U60111 EMBL:U60107 EMBL:U60108
            EMBL:U60109 EMBL:U60110 EMBL:AK291257 EMBL:BC047318 IPI:IPI00019988
            RefSeq:NP_000190.1 UniGene:Hs.31074 ProteinModelPortal:P51688
            SMR:P51688 IntAct:P51688 STRING:P51688 PhosphoSite:P51688
            DMDM:1711493 PaxDb:P51688 PRIDE:P51688 Ensembl:ENST00000326317
            GeneID:6448 KEGG:hsa:6448 UCSC:uc002jxz.4 CTD:6448
            GeneCards:GC17M078183 HGNC:HGNC:10818 HPA:HPA023436 HPA:HPA023451
            MIM:252900 MIM:605270 neXtProt:NX_P51688 Orphanet:79269
            PharmGKB:PA35726 HOGENOM:HOG000234731 HOVERGEN:HBG012598
            InParanoid:P51688 KO:K01565 OMA:RDPHETQ OrthoDB:EOG4RXZ01
            PhylomeDB:P51688 ChiTaRS:SGSH GenomeRNAi:6448 NextBio:25061
            ArrayExpress:P51688 Bgee:P51688 CleanEx:HS_SGSH
            Genevestigator:P51688 GermOnline:ENSG00000181523 GO:GO:0016250
            Uniprot:P51688
        Length = 502

 Score = 110 (43.8 bits), Expect = 8.5e-07, Sum P(2) = 8.5e-07
 Identities = 33/90 (36%), Positives = 45/90 (50%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQGPPIWGAEP 156
             + ++ I TP++DALA   ++  N +     C+PSRASL+TG  P H  GM G        
Sbjct:    40 YNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTG-LPQHQNGMYGLHQDVHHF 98

Query:   157 RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
                    R LP  L + G  T  IGK H+G
Sbjct:    99 NSFDKV-RSLPLLLSQAGVRTGIIGKKHVG 127

 Score = 83 (34.3 bits), Expect = 8.5e-07, Sum P(2) = 8.5e-07
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query:   311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
             V ++D  VG V+  L+  G+L ++++IF SDNG P
Sbjct:   243 VGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGIP 277


>UNIPROTKB|F1REY9 [details] [associations]
            symbol:ARSK "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 GeneTree:ENSGT00400000022041 CTD:153642 KO:K12376
            EMBL:FP016071 RefSeq:XP_003354254.2 UniGene:Ssc.4959
            Ensembl:ENSSSCT00000015443 GeneID:100627205 KEGG:ssc:100627205
            OMA:XNRVEAW Uniprot:F1REY9
        Length = 540

 Score = 103 (41.3 bits), Expect = 8.6e-07, Sum P(3) = 8.6e-07
 Identities = 31/109 (28%), Positives = 55/109 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   280 RAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIYTSDHGELAMEHRQF--Y-------- 329

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    VP ++  P I+ N +V   ++ + D  PT+   AG
Sbjct:   330 ----KMSMYEASAHVPLLIMGPGIKANLQVP-NVVSLVDIYPTMLDIAG 373

 Score = 67 (28.6 bits), Expect = 8.6e-07, Sum P(3) = 8.6e-07
 Identities = 22/89 (24%), Positives = 40/89 (44%)

Query:    96 LSFHGSNEI-PTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+F+  +++   P I+ +  +G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    50 LTFYPESQVVKLPFINFMKAHGTSFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 108

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      + + GY T+  GK
Sbjct:   109 LDPNYTTWMD-----VMEKHGYRTQKFGK 132

 Score = 65 (27.9 bits), Expect = 8.6e-07, Sum P(3) = 8.6e-07
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             LF+L +DP E  NIA+  P+++S L + L+
Sbjct:   443 LFDLSSDPDELTNIATKFPEVTSSLDQKLR 472


>UNIPROTKB|F1RZ89 [details] [associations]
            symbol:LOC100737146 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:CU914710
            Ensembl:ENSSSCT00000018673 ArrayExpress:F1RZ89 Uniprot:F1RZ89
        Length = 496

 Score = 109 (43.4 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 34/92 (36%), Positives = 46/92 (50%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQGPPIWGAEP 156
             + ++ I TP++DALA   I+  N +     C+PSRASL+TG  P H  GM G      + 
Sbjct:    46 YNNSAITTPHLDALARRSIVFRNAFTSVSSCSPSRASLLTG-LPQHQNGMYG---LHQDV 101

Query:   157 RGVPLTERF--LPEYLRELGYSTKAIGKWHLG 186
                   +R   LP  L   G  T  IGK H+G
Sbjct:   102 HHFNSFDRVQSLPLLLGRAGVRTGIIGKKHVG 133

 Score = 83 (34.3 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 20/61 (32%), Positives = 33/61 (54%)

Query:   289 PQET-INQFQYITDPNRRTYAAM---VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
             PQ+  +  F   T   R   AA    + ++D  +G V+  L+  G+L ++++IF SDNG 
Sbjct:   223 PQDVQVPYFVPDTPAARADLAAQYTTIGRMDQGIGLVLQELRGAGVLNDTLVIFTSDNGV 282

Query:   345 P 345
             P
Sbjct:   283 P 283


>UNIPROTKB|Q6UWY0 [details] [associations]
            symbol:ARSK "Arylsulfatase K" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000034080 HOVERGEN:HBG054703
            OrthoDB:EOG42BX86 CTD:153642 KO:K12376 EMBL:AY875939 EMBL:AY358596
            EMBL:AK303855 EMBL:BC036047 EMBL:BC130329 EMBL:AL832711
            IPI:IPI00296524 RefSeq:NP_937793.1 UniGene:Hs.585051
            ProteinModelPortal:Q6UWY0 SMR:Q6UWY0 STRING:Q6UWY0
            PhosphoSite:Q6UWY0 DMDM:74738157 PRIDE:Q6UWY0 DNASU:153642
            Ensembl:ENST00000380009 GeneID:153642 KEGG:hsa:153642
            UCSC:uc003kld.3 GeneCards:GC05P094917 HGNC:HGNC:25239 HPA:HPA042384
            MIM:610011 neXtProt:NX_Q6UWY0 PharmGKB:PA143485311
            InParanoid:Q6UWY0 OMA:RKDWENT PhylomeDB:Q6UWY0 GenomeRNAi:153642
            NextBio:87151 ArrayExpress:Q6UWY0 Bgee:Q6UWY0 CleanEx:HS_ARSK
            Genevestigator:Q6UWY0 GermOnline:ENSG00000164291 Uniprot:Q6UWY0
        Length = 536

 Score = 102 (41.0 bits), Expect = 1.1e-06, Sum P(3) = 1.1e-06
 Identities = 31/109 (28%), Positives = 55/109 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   276 RAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIYSSDHGELAMEHRQF--Y-------- 325

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
                 K +++E    VP ++  P I+   +VS  ++ + D  PT+   AG
Sbjct:   326 ----KMSMYEASAHVPLLMMGPGIKAGLQVS-NVVSLVDIYPTMLDIAG 369

 Score = 78 (32.5 bits), Expect = 1.1e-06, Sum P(3) = 1.1e-06
 Identities = 25/89 (28%), Positives = 38/89 (42%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+FH GS  +  P I+ +   G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    46 LTFHPGSQVVKLPFINFMKTRGTSFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 104

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      +   GY T+  GK
Sbjct:   105 LDPNYTTWMD-----VMERHGYRTQKFGK 128

 Score = 54 (24.1 bits), Expect = 1.1e-06, Sum P(3) = 1.1e-06
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELL 597
             LF+L +DP E  N+A   P+I+  L + L
Sbjct:   439 LFDLSSDPDELTNVAVKFPEITYSLDQKL 467


>FB|FBgn0260475 [details] [associations]
            symbol:CG30059 species:7227 "Drosophila melanogaster"
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0006044 "N-acetylglucosamine metabolic process"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GeneTree:ENSGT00400000022041 GO:GO:0030203
            KO:K01137 GO:GO:0008449 OrthoDB:EOG43TXB4 EMBL:AY061585
            RefSeq:NP_610872.1 UniGene:Dm.21320 SMR:Q95R73 STRING:Q95R73
            EnsemblMetazoa:FBtr0087715 GeneID:246425 KEGG:dme:Dmel_CG30059
            UCSC:CG30059-RA FlyBase:FBgn0260475 InParanoid:Q95R73 OMA:GNSQYYN
            GenomeRNAi:246425 NextBio:842420 Uniprot:Q95R73
        Length = 492

 Score = 150 (57.9 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 91/355 (25%), Positives = 143/355 (40%)

Query:   109 IDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWGA--EPRGVPLTE-R 164
             I+ L + G + +N Y   P+C P+R SL+TG Y  + G +   + G    P      E R
Sbjct:    48 IEMLGFGGALFHNAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPR 107

Query:   165 FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT 224
              LP  L++ GY+T   GK+   ++     P  +G+ + +G L+G   YY++ L +     
Sbjct:   108 ALPYILQQHGYNTFFGGKYLNQYWGAGDVP--KGWNNFYG-LHGNSRYYNYTLRENTG-- 162

Query:   225 VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED--QPVDKPXXXXXXXXXXXXXXX 282
                N H     LS   D + + A D F + A Q  E     V  P               
Sbjct:   163 ---NVHYESTYLS---DLLRDRAAD-FLRNATQSSEPFFAMVAPPAAHEPFTPAPRHEGV 215

Query:   283 XXXXEAPQE-TINQFQ----YITDPNRR-------TYAAMVKK-------LDDSVGTVIS 323
                 EA +  + NQ +    ++    RR       T     +K       +D+ V T++ 
Sbjct:   216 FSHIEALRTPSFNQVKQDKHWLVRAARRLPNETINTIDTYFQKRWETLLAVDELVVTLMG 275

Query:   324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
              L     LEN+ II+ SDNG    ++ +  + R      PY        E  + VP ++ 
Sbjct:   276 VLNDTQSLENTYIIYTSDNGYHVGQFAQPFDKRQ-----PY--------ETDINVPLLIR 322

Query:   384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              P I     +   +  + D  PT+   A  DT   P  +DG   +  LLLN   R
Sbjct:   323 GPGIAPESHIDTAVSLV-DLAPTILAWADIDT---PSYMDG-QSFHELLLNKRRR 372

 Score = 39 (18.8 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query:   570 FNLGNDPCEQNNIASSRPDISSQLYEL 596
             ++L  DP +  NIA     I   LY L
Sbjct:   451 YDLQLDPFQMTNIAYDLLPIERALYSL 477


>FB|FBgn0033836 [details] [associations]
            symbol:CG18278 species:7227 "Drosophila melanogaster"
            [GO:0006044 "N-acetylglucosamine metabolic process" evidence=ISS]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GeneTree:ENSGT00400000022041
            GO:GO:0030203 KO:K01137 GO:GO:0008449 OMA:MCGYQTF EMBL:BT021205
            RefSeq:NP_725289.1 UniGene:Dm.28273 SMR:Q5BIL9 STRING:Q5BIL9
            EnsemblMetazoa:FBtr0087716 GeneID:36487 KEGG:dme:Dmel_CG18278
            UCSC:CG18278-RA FlyBase:FBgn0033836 InParanoid:Q5BIL9
            OrthoDB:EOG43TXB4 GenomeRNAi:36487 NextBio:798808 Uniprot:Q5BIL9
        Length = 492

 Score = 149 (57.5 bits), Expect = 1.5e-06, Sum P(2) = 1.5e-06
 Identities = 85/351 (24%), Positives = 135/351 (38%)

Query:   109 IDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWGA--EPRGVPLTE-R 164
             I+ L + G + +N Y   P+C P+R SL+TG Y  + G +   + G    P      E R
Sbjct:    48 IEMLGFGGALFHNAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPR 107

Query:   165 FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT 224
              LP  L++ GY+T   GK+   ++     P  +G+ +HF  L+G   YY++ L +  S  
Sbjct:   108 ALPYILQQHGYNTFFGGKYLNQYWGAGDVP--KGW-NHFYGLHGNSRYYNYTLREN-SGN 163

Query:   225 VELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPXXXXXXXXXXXXX 280
             V        D+ R+ +  +      +++ F          +P    P             
Sbjct:   164 VHYESTYLTDLLRDRAADFLRNATQSSEPFFAMVAPPAAHEPFTPAPRHEGVFSHIEALR 223

Query:   281 XXX-------------XXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
                                  P ETIN           T  A+    D+ V T++  L  
Sbjct:   224 TPSFNQVKQDKHWLVRAARRLPNETINTIDTYFQKRWETLLAV----DELVVTLMGVLND 279

Query:   328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
                LEN+ II+ SDNG    ++ +  + R      PY        E  + VP ++  P I
Sbjct:   280 TQSLENTYIIYTSDNGYHVGQFAQPFDKRQ-----PY--------ETDINVPLLIRGPGI 326

Query:   388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
                  +   +  + D  PT+   A  DT   P  +DG   +  LLLN   R
Sbjct:   327 APESHIDTAVSLV-DLAPTILAWADIDT---PSYMDG-QSFHELLLNKRRR 372

 Score = 39 (18.8 bits), Expect = 1.5e-06, Sum P(2) = 1.5e-06
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query:   570 FNLGNDPCEQNNIASSRPDISSQLYEL 596
             ++L  DP +  NIA     I   LY L
Sbjct:   451 YDLQLDPFQMTNIAYDLLPIERALYSL 477


>UNIPROTKB|Q5LVA2 [details] [associations]
            symbol:SPO0800 "Choline sulfatase, putative" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
            ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
            PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
            Uniprot:Q5LVA2
        Length = 482

 Score = 105 (42.0 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
 Identities = 32/84 (38%), Positives = 41/84 (48%)

Query:   104 IPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIW-GAEP-RGVP 160
             + TPN+DALA  G +    Y   P+C P+RA+L TG +   TG      W  A P  G P
Sbjct:    28 VKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGDWIHRTGH-----WDSATPYAGQP 82

Query:   161 LTERFLPEYLRELGYSTKAIGKWH 184
                R     LR+ G    +IGK H
Sbjct:    83 ---RSWMHDLRDAGREVVSIGKLH 103

 Score = 85 (35.0 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
 Identities = 24/91 (26%), Positives = 42/91 (46%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             +  Y  +   +DD VG V++AL+  G  +N++++++SD+G    +      +  W     
Sbjct:   248 KAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSDHG----DMMGDQGF--W----- 296

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVS 394
                 K  ++E    VP I   P I    RVS
Sbjct:   297 ---TKQVMYEASAGVPMIAAGPGIPAGHRVS 324


>TIGR_CMR|SPO_0800 [details] [associations]
            symbol:SPO_0800 "choline sulfatase, putative"
            species:246200 "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur
            compound metabolic process" evidence=ISS] [GO:0047753
            "choline-sulfatase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
            ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
            PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
            Uniprot:Q5LVA2
        Length = 482

 Score = 105 (42.0 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
 Identities = 32/84 (38%), Positives = 41/84 (48%)

Query:   104 IPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIW-GAEP-RGVP 160
             + TPN+DALA  G +    Y   P+C P+RA+L TG +   TG      W  A P  G P
Sbjct:    28 VKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGDWIHRTGH-----WDSATPYAGQP 82

Query:   161 LTERFLPEYLRELGYSTKAIGKWH 184
                R     LR+ G    +IGK H
Sbjct:    83 ---RSWMHDLRDAGREVVSIGKLH 103

 Score = 85 (35.0 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
 Identities = 24/91 (26%), Positives = 42/91 (46%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             +  Y  +   +DD VG V++AL+  G  +N++++++SD+G    +      +  W     
Sbjct:   248 KAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSDHG----DMMGDQGF--W----- 296

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVS 394
                 K  ++E    VP I   P I    RVS
Sbjct:   297 ---TKQVMYEASAGVPMIAAGPGIPAGHRVS 324


>UNIPROTKB|H0YB91 [details] [associations]
            symbol:IDS "Iduronate 2-sulfatase 14 kDa chain"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            EMBL:AC233288 HGNC:HGNC:5389 ChiTaRS:IDS Ensembl:ENST00000464251
            Bgee:H0YB91 Uniprot:H0YB91
        Length = 106

 Score = 119 (46.9 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 29/78 (37%), Positives = 43/78 (55%)

Query:   106 TPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER 164
             +PNID LA + ++  N +AQ  VC PSR S +TG+ P  T +     +     G   T  
Sbjct:     2 SPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNFST-- 59

Query:   165 FLPEYLRELGYSTKAIGK 182
              +P+Y +E GY T ++GK
Sbjct:    60 -IPQYFKENGYVTMSVGK 76


>MGI|MGI:1922862 [details] [associations]
            symbol:Gns "glucosamine (N-acetyl)-6-sulfatase"
            species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005539 "glycosaminoglycan binding" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0008152 "metabolic
            process" evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=ISO] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0042340 "keratan sulfate catabolic process"
            evidence=ISO] [GO:0043199 "sulfate binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 MGI:MGI:1922862
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0042340
            GO:GO:0043199 GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239
            HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF GO:GO:0008449
            PANTHER:PTHR10342:SF5 ChiTaRS:GNS EMBL:AK030773 EMBL:AK049162
            EMBL:AK054046 EMBL:AK083597 EMBL:AK159562 EMBL:AK169485
            EMBL:AK165180 EMBL:AK170791 EMBL:BC055328 IPI:IPI00221426
            RefSeq:NP_083640.1 UniGene:Mm.207683 ProteinModelPortal:Q8BFR4
            SMR:Q8BFR4 STRING:Q8BFR4 PhosphoSite:Q8BFR4 PaxDb:Q8BFR4
            PRIDE:Q8BFR4 Ensembl:ENSMUST00000040344 GeneID:75612 KEGG:mmu:75612
            UCSC:uc007hfo.1 InParanoid:Q8BFR4 OMA:MCGYQTF NextBio:343508
            Bgee:Q8BFR4 CleanEx:MM_GNS Genevestigator:Q8BFR4 Uniprot:Q8BFR4
        Length = 544

 Score = 107 (42.7 bits), Expect = 2.6e-06, Sum P(3) = 2.6e-06
 Identities = 36/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  +++ D  RR +  ++  +DD V  ++  L   G L+N+ I + SDNG  T ++  
Sbjct:   270 TNSSIRFLDDAFRRRWQTLLS-VDDLVEKLVKRLDSTGELDNTYIFYTSDNGYHTGQF-- 326

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   327 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVSNI-DLGPTILDLA 374

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   375 GYDLNKTQM--DGM 386

 Score = 83 (34.3 bits), Expect = 2.6e-06, Sum P(3) = 2.6e-06
 Identities = 40/158 (25%), Positives = 68/158 (43%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTERF-LPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E +  P  L+
Sbjct:    70 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKAWQKIQEPYTFPAILK 129

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:   130 SVCGYQTFFAGK-----YLNEYGAPDAGGLE-HIP-LGWSYWYALEKNSKYYNYTLSING 182

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
                +   + + D    Y TD+    ++  ++ +   +P
Sbjct:   183 KARKHGENYSVD----YLTDVLANLSLDFLDYKSNSEP 216

 Score = 40 (19.1 bits), Expect = 2.6e-06, Sum P(3) = 2.6e-06
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++N+  DP +  NIA S  P++  ++ Y L+
Sbjct:   470 VYNITADPDQITNIAKSIDPELLGKMNYRLM 500


>RGD|1305877 [details] [associations]
            symbol:Gns "glucosamine (N-acetyl)-6-sulfatase" species:10116
            "Rattus norvegicus" [GO:0005539 "glycosaminoglycan binding"
            evidence=IPI] [GO:0005764 "lysosome" evidence=IDA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
            [GO:0042340 "keratan sulfate catabolic process" evidence=IDA]
            [GO:0043199 "sulfate binding" evidence=IPI] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0042340 GO:GO:0043199
            GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239 HOVERGEN:HBG005840
            KO:K01137 GO:GO:0008449 PANTHER:PTHR10342:SF5 UniGene:Rn.228654
            EMBL:BC087741 IPI:IPI00951484 RefSeq:NP_001011989.1 IntAct:Q5M918
            STRING:Q5M918 Ensembl:ENSRNOT00000064349 GeneID:299825
            KEGG:rno:299825 InParanoid:Q5M918 NextBio:645846
            Genevestigator:Q5M918 Uniprot:Q5M918
        Length = 519

 Score = 107 (42.7 bits), Expect = 2.7e-06, Sum P(3) = 2.7e-06
 Identities = 44/151 (29%), Positives = 70/151 (46%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  +++ D  RR +  ++  +DD V  ++  L   G L+N+ I + SDNG  T ++  
Sbjct:   270 TNSSIKFLDDAFRRRWQTLLS-VDDLVEKLVKRLDSTGELDNTYIFYTSDNGYHTGQF-- 326

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   327 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVSNI-DLGPTILDLA 374

Query:   412 GGD--------TSRLP-LNIDGLDQWSSLLL 433
             G D        TS LP L  DG   W S +L
Sbjct:   375 GYDLNKTQMDGTSLLPILKGDGNLTWRSDVL 405

 Score = 82 (33.9 bits), Expect = 2.7e-06, Sum P(3) = 2.7e-06
 Identities = 41/158 (25%), Positives = 68/158 (43%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTERF-LPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E +  P  L+
Sbjct:    70 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPYTFPAILK 129

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:   130 LVCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 182

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
                R   + + D    Y TD+    ++  ++ +   +P
Sbjct:   183 KARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEP 216

 Score = 40 (19.1 bits), Expect = 2.7e-06, Sum P(3) = 2.7e-06
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++N+  DP +  NIA S  P++  ++ Y L+
Sbjct:   470 VYNITADPDQITNIAKSIDPELLGKMNYRLM 500


>UNIPROTKB|E1BFX4 [details] [associations]
            symbol:SGSH "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448 KO:K01565
            OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:DAAA02049454
            RefSeq:NP_001095659.2 UniGene:Bt.12396 GeneID:535442
            KEGG:bta:535442 NextBio:20876750 IPI:IPI00907105
            ProteinModelPortal:E1BFX4 Ensembl:ENSBTAT00000020308
            ArrayExpress:E1BFX4 Uniprot:E1BFX4
        Length = 505

 Score = 107 (42.7 bits), Expect = 3.1e-06, Sum P(3) = 3.1e-06
 Identities = 33/92 (35%), Positives = 46/92 (50%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQGPPIWGAEP 156
             + ++ I TP++DALA   ++  N +     C+PSRASL+TG  P H  GM G      + 
Sbjct:    43 YNNSAISTPHLDALARRSLVFRNAFTSVSSCSPSRASLLTG-LPQHQNGMYG---LHQDV 98

Query:   157 RGVPLTERF--LPEYLRELGYSTKAIGKWHLG 186
                   +R   LP  L   G  T  IGK H+G
Sbjct:    99 HHFNSFDRVQSLPLLLGRAGIHTGIIGKKHVG 130

 Score = 81 (33.6 bits), Expect = 3.1e-06, Sum P(3) = 3.1e-06
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query:   311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
             + ++D  +G V+  L+  G+L ++++IF SDNG P
Sbjct:   246 IGRMDQGIGLVLQELRGAGVLNDTLVIFTSDNGIP 280

 Score = 40 (19.1 bits), Expect = 3.1e-06, Sum P(3) = 3.1e-06
 Identities = 11/30 (36%), Positives = 19/30 (63%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             L++   DP E +N+A+  P  + Q+ ELL+
Sbjct:   441 LYDRNQDPHETHNLAAD-PRYT-QVLELLQ 468


>UNIPROTKB|Q32KJ5 [details] [associations]
            symbol:Gns "Glucosamine (N-acetyl)-6-sulfatase"
            species:10116 "Rattus norvegicus" [GO:0005764 "lysosome"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012251
            InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0030203
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:BN000742
            IPI:IPI00366226 RefSeq:XP_003750373.1 UniGene:Rn.228654
            IntAct:Q32KJ5 STRING:Q32KJ5 Ensembl:ENSRNOT00000006566
            GeneID:100909505 KEGG:rno:100909505 InParanoid:Q32KJ5
            Genevestigator:Q32KJ5 Uniprot:Q32KJ5
        Length = 544

 Score = 107 (42.7 bits), Expect = 3.2e-06, Sum P(3) = 3.2e-06
 Identities = 44/151 (29%), Positives = 70/151 (46%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  +++ D  RR +  ++  +DD V  ++  L   G L+N+ I + SDNG  T ++  
Sbjct:   270 TNSSIKFLDDAFRRRWQTLLS-VDDLVEKLVKRLDSTGELDNTYIFYTSDNGYHTGQF-- 326

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   327 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVSNI-DLGPTILDLA 374

Query:   412 GGD--------TSRLP-LNIDGLDQWSSLLL 433
             G D        TS LP L  DG   W S +L
Sbjct:   375 GYDLNKTQMDGTSLLPILKGDGNLTWRSDVL 405

 Score = 82 (33.9 bits), Expect = 3.2e-06, Sum P(3) = 3.2e-06
 Identities = 41/158 (25%), Positives = 68/158 (43%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTERF-LPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E +  P  L+
Sbjct:    70 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPYTFPAILK 129

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:   130 LVCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 182

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
                R   + + D    Y TD+    ++  ++ +   +P
Sbjct:   183 KARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEP 216

 Score = 40 (19.1 bits), Expect = 3.2e-06, Sum P(3) = 3.2e-06
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++N+  DP +  NIA S  P++  ++ Y L+
Sbjct:   470 VYNITADPDQITNIAKSIDPELLGKMNYRLM 500


>UNIPROTKB|F5H260 [details] [associations]
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 GO:GO:0008449
            PANTHER:PTHR10342:SF5 EMBL:AC025262 HGNC:HGNC:4422 ChiTaRS:GNS
            IPI:IPI01010051 ProteinModelPortal:F5H260 SMR:F5H260
            Ensembl:ENST00000545471 ArrayExpress:F5H260 Bgee:F5H260
            Uniprot:F5H260
        Length = 344

 Score = 101 (40.6 bits), Expect = 3.4e-06, Sum P(2) = 3.4e-06
 Identities = 35/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   215 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF-- 271

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   272 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 319

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   320 GYDLNKTQM--DGM 331

 Score = 82 (33.9 bits), Expect = 3.4e-06, Sum P(2) = 3.4e-06
 Identities = 40/151 (26%), Positives = 64/151 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:    15 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 74

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:    75 SMCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 127

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
                +   + + D    Y TD+    ++  ++
Sbjct:   128 KARKHGENYSVD----YLTDVLANVSLDFLD 154


>TIGR_CMR|CPS_2367
            symbol:CPS_2367 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            RefSeq:YP_269085.1 ProteinModelPortal:Q482D3 STRING:Q482D3
            GeneID:3522074 KEGG:cps:CPS_2367 PATRIC:21467819
            HOGENOM:HOG000220675 OMA:TAGVCAP ProtClustDB:CLSK2525596
            BioCyc:CPSY167879:GI48-2430-MONOMER Uniprot:Q482D3
        Length = 558

 Score = 101 (40.6 bits), Expect = 3.4e-06, Sum P(4) = 3.4e-06
 Identities = 30/88 (34%), Positives = 41/88 (46%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTG---MQGPPIWGAE 155
             G     TP +D LA + +   N +    VC PSR SL+TG + I  G   M+      + 
Sbjct:    47 GDTVAKTPVLDELAKSSVRYPNTFTTAGVCAPSRTSLITGVHQITVGGQHMRTRSFKASN 106

Query:   156 PRGVPLTE-RFLPEYLRELGYSTKAIGK 182
              R VP  + +  PE LR+ GY T    K
Sbjct:   107 YRAVPAPDVKAFPELLRKSGYYTYVSSK 134

 Score = 66 (28.3 bits), Expect = 3.4e-06, Sum P(4) = 3.4e-06
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query:   311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
             +  +D  VG +++ L++ G+ +N+I+I+ +D+G
Sbjct:   230 IHAMDTQVGKLLAELKKDGLSDNTIVIWTTDHG 262

 Score = 57 (25.1 bits), Expect = 3.4e-06, Sum P(4) = 3.4e-06
 Identities = 21/71 (29%), Positives = 33/71 (46%)

Query:   359 GSNYPYRGVKNTLWEGGVKVPAIL-WS----PQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
             G + P RG K  +++ G+KVP I+ W     P    N  +  Q++   D  P++   A  
Sbjct:   262 GDSLP-RG-KREVYDSGLKVPMIIHWPDKYRPSKTVNGSIDSQLLSFVDIAPSILAMANI 319

Query:   414 DTSRLPLNIDG 424
             +T   P  I G
Sbjct:   320 NT---PAYIQG 327

 Score = 43 (20.2 bits), Expect = 3.4e-06, Sum P(4) = 3.4e-06
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query:   569 LFNLGNDPCEQNNIASSRPDISSQL 593
             L+++ NDP E NN+A  + +   QL
Sbjct:   421 LYDIINDPEEVNNLAE-KVEYQQQL 444


>UNIPROTKB|F1LLW8
            symbol:Ids "Protein Ids" species:10116 "Rattus norvegicus"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1560491 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 OMA:CREGRNL IPI:IPI00569342
            Ensembl:ENSRNOT00000042925 ArrayExpress:F1LLW8 Uniprot:F1LLW8
        Length = 544

 Score = 141 (54.7 bits), Expect = 3.5e-06, P = 3.5e-06
 Identities = 51/173 (29%), Positives = 76/173 (43%)

Query:    96 LSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGA 154
             L  +G   + +PNID LA + I+  N +AQ  VC PSR S +TG+ P  T +     +  
Sbjct:    44 LGCYGDKLVRSPNIDQLASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWR 103

Query:   155 EPRGVPLTERFLPEYLRELGYSTKAIGK-WHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
                G   T   +P+Y +E GY T ++GK +H G        +       + Y      Y 
Sbjct:   104 VHSGNFST---IPQYFKENGYVTMSVGKVFHPG--------ISSNHSDDYPYSWSFPPY- 151

Query:   214 DHILSDQYSRTVELNGHD--MRRNLSTAWDTV----GEYATDLFTKEAVQLIE 260
              H  S++Y  T    G D  +  NL    D      G       T+EA++L+E
Sbjct:   152 -HPSSEKYENTKTCKGQDGKLHTNLLCPVDVADVPEGTLPDKQSTEEAIRLLE 203


>UNIPROTKB|B4DYH8
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0030203
            HOVERGEN:HBG005840 GO:GO:0008449 PANTHER:PTHR10342:SF5
            EMBL:AC025262 UniGene:Hs.334534 HGNC:HGNC:4422 ChiTaRS:GNS
            EMBL:AK302443 IPI:IPI01011079 SMR:B4DYH8 STRING:B4DYH8
            Ensembl:ENST00000542058 UCSC:uc010ssr.2 Uniprot:B4DYH8
        Length = 532

 Score = 101 (40.6 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 35/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   258 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF-- 314

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   315 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 362

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   363 GYDLNKTQM--DGM 374

 Score = 87 (35.7 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 41/149 (27%), Positives = 63/149 (42%)

Query:   118 ILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLREL 173
             +L  MY    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR +
Sbjct:    60 VLGGMYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSM 119

Query:   174 -GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
              GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG  
Sbjct:   120 CGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSINGKA 172

Query:   232 MRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
              +   + + D    Y TD+    ++  ++
Sbjct:   173 RKHGENYSVD----YLTDVLANVSLDFLD 197

 Score = 39 (18.8 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++NL  DP +  NIA +  P++  ++ Y L+
Sbjct:   458 VYNLTADPDQITNIAKTIDPELLGKMNYRLM 488


>UNIPROTKB|H7C3P4
            symbol:GNS "Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo
            disease IIID), isoform CRA_b" species:9606 "Homo sapiens"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 EMBL:CH471054 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0030203 GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:AC025262
            UniGene:Hs.334534 HGNC:HGNC:4422 ChiTaRS:GNS SMR:H7C3P4
            Ensembl:ENST00000418919 Uniprot:H7C3P4
        Length = 496

 Score = 101 (40.6 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 35/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   222 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF-- 278

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   279 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 326

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   327 GYDLNKTQM--DGM 338

 Score = 82 (33.9 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 40/151 (26%), Positives = 64/151 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:    22 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 81

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:    82 SMCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 134

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
                +   + + D    Y TD+    ++  ++
Sbjct:   135 KARKHGENYSVD----YLTDVLANVSLDFLD 161

 Score = 39 (18.8 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++NL  DP +  NIA +  P++  ++ Y L+
Sbjct:   422 VYNLTADPDQITNIAKTIDPELLGKMNYRLM 452


>ZFIN|ZDB-GENE-030131-9242
            symbol:sulf1 "sulfatase 1" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0009986 "cell
            surface" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-9242
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 GeneTree:ENSGT00400000022041
            HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
            InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK EMBL:CR385071
            EMBL:CR382282 EMBL:AY332604 IPI:IPI00509599 RefSeq:NP_001003846.1
            UniGene:Dr.81473 Ensembl:ENSDART00000056081 GeneID:337298
            KEGG:dre:337298 InParanoid:Q6EFA1 NextBio:20812164 Uniprot:Q6EFA1
        Length = 1099

 Score = 138 (53.6 bits), Expect = 1.9e-05, P = 1.9e-05
 Identities = 83/356 (23%), Positives = 139/356 (39%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-----TGMQG--PPI 151
             GS ++       +   G    N +   P+C PSR+S++TGKY +H     T  +    P 
Sbjct:    57 GSLQVMNKTRKIMEDGGTSFTNAFVTTPMCCPSRSSMLTGKY-VHNHNTYTNNENCSSPS 115

Query:   152 WGA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
             W A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +   
Sbjct:   116 WQAQHEPRSFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWVGLIKNS 165

Query:   210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-- 262
               +Y++ +    ++  E +G D  ++  T     D++  + T   +F    V ++     
Sbjct:   166 -RFYNYTVCRNGNK--EKHGADYAKDYFTDLITNDSINYFRTSKRMFPHRPVMMVISHAA 222

Query:   263 ---PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMV-----KKL 314
                P D                      AP    +     T P +  +         K+L
Sbjct:   223 PHGPEDSAPQYSELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKPIHMEFTNYLHRKRL 282

Query:   315 ------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
                   DDSV  V +AL   G L+N+ II+ +D+G    ++         G + PY    
Sbjct:   283 QTLMSVDDSVEKVYNALVDTGELDNTYIIYTADHGYHIGQFGLVK-----GKSMPY---- 333

Query:   369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                 +  ++VP  +  P ++   R +  +++I D  PT+   AG DT   P ++DG
Sbjct:   334 ----DFDIRVPFFVRGPNVEPGARNNHVVLNI-DLAPTILDIAGLDT---PPDMDG 381


>UNIPROTKB|P15586
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=IEA] [GO:0006027 "glycosaminoglycan catabolic process"
            evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IDA] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0042339 "keratan sulfate metabolic process"
            evidence=TAS] [GO:0042340 "keratan sulfate catabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            [GO:0005515 "protein binding" evidence=IPI] Reactome:REACT_11123
            Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR012251
            InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036666 Reactome:REACT_116125 GO:GO:0046872
            GO:GO:0005975 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 CTD:2799
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:Z12173 EMBL:AK223484
            EMBL:AC025262 EMBL:BC012482 IPI:IPI00012102 PIR:S27164
            RefSeq:NP_002067.1 UniGene:Hs.334534 ProteinModelPortal:P15586
            SMR:P15586 IntAct:P15586 STRING:P15586 PhosphoSite:P15586
            DMDM:232126 PaxDb:P15586 PeptideAtlas:P15586 PRIDE:P15586
            DNASU:2799 Ensembl:ENST00000258145 GeneID:2799 KEGG:hsa:2799
            UCSC:uc001ssf.3 GeneCards:GC12M065107 H-InvDB:HIX0010785
            HGNC:HGNC:4422 HPA:CAB026011 HPA:HPA013695 MIM:252940 MIM:607664
            neXtProt:NX_P15586 Orphanet:79272 PharmGKB:PA28802
            InParanoid:P15586 PhylomeDB:P15586 BioCyc:MetaCyc:HS06046-MONOMER
            BRENDA:3.1.6.14 SABIO-RK:P15586 ChiTaRS:GNS GenomeRNAi:2799
            NextBio:11033 ArrayExpress:P15586 Bgee:P15586 CleanEx:HS_GNS
            Genevestigator:P15586 GermOnline:ENSG00000135677 Uniprot:P15586
        Length = 552

 Score = 101 (40.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 35/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   278 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF-- 334

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   335 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 382

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   383 GYDLNKTQM--DGM 394

 Score = 82 (33.9 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 40/151 (26%), Positives = 64/151 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:    78 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 137

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:   138 SMCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 190

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
                +   + + D    Y TD+    ++  ++
Sbjct:   191 KARKHGENYSVD----YLTDVLANVSLDFLD 217

 Score = 39 (18.8 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++NL  DP +  NIA +  P++  ++ Y L+
Sbjct:   478 VYNLTADPDQITNIAKTIDPELLGKMNYRLM 508


>UNIPROTKB|F1SBF1
            symbol:LOC100739059 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00400000022041 OMA:TENDPAN EMBL:FP339597
            RefSeq:XP_003484028.1 Ensembl:ENSSSCT00000008161 GeneID:100739169
            KEGG:ssc:100739169 Uniprot:F1SBF1
        Length = 527

 Score = 134 (52.2 bits), Expect = 1.9e-05, P = 1.9e-05
 Identities = 80/352 (22%), Positives = 136/352 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGFD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
               ++  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+ + L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
               NP + L +    D  PT+   AG D   +P ++DG      L    P+ R
Sbjct:   354 SLNPHIVLNI----DLAPTILDIAGLD---IPSDMDGKSILKLLDTERPANR 398


>UNIPROTKB|F1P6L7
            symbol:GNS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
            PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:AAEX03007000
            Ensembl:ENSCAFT00000000563 Uniprot:F1P6L7
        Length = 489

 Score = 97 (39.2 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 34/134 (25%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   215 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFNGELNNTYIFYTSDNGYHTGQF-- 271

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   272 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 319

Query:   412 GGDTSRLPLNIDGL 425
             G + ++  +  DG+
Sbjct:   320 GYNLNKTQM--DGM 331

 Score = 81 (33.6 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 41/158 (25%), Positives = 67/158 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:    15 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 74

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:    75 SMCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 127

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
                +   + + D    Y TD+    ++  ++ +   +P
Sbjct:   128 KARKHGENYSVD----YLTDVLANISLGFLDYKSNSEP 161

 Score = 42 (19.8 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++NL  DP +  NIA S  P++  ++ Y L+
Sbjct:   415 VYNLTADPDQITNIAKSIDPELLGKMNYRLM 445


>UNIPROTKB|F6S8M0
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0030203
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:AC025262 HGNC:HGNC:4422
            ChiTaRS:GNS OMA:MCGYQTF IPI:IPI00908404 ProteinModelPortal:F6S8M0
            SMR:F6S8M0 Ensembl:ENST00000543646 UCSC:uc010ssq.2
            ArrayExpress:F6S8M0 Bgee:F6S8M0 Uniprot:F6S8M0
        Length = 584

 Score = 101 (40.6 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
 Identities = 35/134 (26%), Positives = 65/134 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   310 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQF-- 366

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                      + P    K  L+E  +KVP ++  P I+ N    + + +I D  PT+   A
Sbjct:   367 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANI-DLGPTILDIA 414

Query:   412 GGDTSRLPLNIDGL 425
             G D ++  +  DG+
Sbjct:   415 GYDLNKTQM--DGM 426

 Score = 82 (33.9 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
 Identities = 40/151 (26%), Positives = 64/151 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:   110 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 169

Query:   172 EL-GYSTKAIGKWHLGFFRREY-TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
              + GY T   GK     +  EY  P   G E H   L     Y     S  Y+ T+ +NG
Sbjct:   170 SMCGYQTFFAGK-----YLNEYGAPDAGGLE-HVP-LGWSYWYALEKNSKYYNYTLSING 222

Query:   230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
                +   + + D    Y TD+    ++  ++
Sbjct:   223 KARKHGENYSVD----YLTDVLANVSLDFLD 249

 Score = 39 (18.8 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query:   569 LFNLGNDPCEQNNIASS-RPDISSQL-YELL 597
             ++NL  DP +  NIA +  P++  ++ Y L+
Sbjct:   510 VYNLTADPDQITNIAKTIDPELLGKMNYRLM 540


>TIGR_CMR|SPO_2214
            symbol:SPO_2214 "choline sulfatase" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
            ProtClustDB:CLSK864791 GO:GO:0047753 RefSeq:YP_167440.1
            ProteinModelPortal:Q5LRB5 GeneID:3194829 KEGG:sil:SPO2214
            PATRIC:23377781 OMA:LLIMADQ Uniprot:Q5LRB5
        Length = 498

 Score = 91 (37.1 bits), Expect = 3.1e-05, Sum P(3) = 3.1e-05
 Identities = 36/112 (32%), Positives = 43/112 (38%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
             G     T ++  LA   +   N Y   P+C P+R+  MTG Y   TG            G
Sbjct:    36 GGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTGLYTSTTGCYD--------NG 87

Query:   159 VPLTERFLP---EYLRELGYSTKAIGKWHL-------GFFRREYTPLY-RGF 199
              P    FLP    YL   GY T   GK H        GF RR    +Y  GF
Sbjct:    88 DPY-HSFLPTFAHYLTNAGYETVLSGKMHFIGADQLHGFQRRLNPDIYPSGF 138

 Score = 81 (33.6 bits), Expect = 3.1e-05, Sum P(3) = 3.1e-05
 Identities = 19/63 (30%), Positives = 32/63 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE---YRETSNYRNWGS 360
             RR +AA+   +DD +G ++  L   G  + ++II  SD+G    E    ++ S Y  W +
Sbjct:   267 RRGFAALAHYVDDKIGALLEVLDETGQRDETVIIVTSDHGEMLGEKGLIQKRSLYE-WSA 325

Query:   361 NYP 363
               P
Sbjct:   326 RIP 328

 Score = 47 (21.6 bits), Expect = 3.1e-05, Sum P(3) = 3.1e-05
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query:   562 CTNGPCYLFNLGNDPCEQNNIASSRPDIS 590
             C      L+NL  DP E +N A   PD++
Sbjct:   411 CHGSAPQLYNLARDPGEWHNRAGE-PDLA 438


>UNIPROTKB|G3XAE6
            symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0009986 "cell surface"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005794 EMBL:CH471077
            GO:GO:0009986 GO:GO:0005509 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AL034418
            InterPro:IPR024609 Pfam:PF12548 EMBL:AL354813 UniGene:Hs.162016
            HGNC:HGNC:20392 EMBL:AL121777 ProteinModelPortal:G3XAE6 SMR:G3XAE6
            PRIDE:G3XAE6 Ensembl:ENST00000361612 ArrayExpress:G3XAE6
            Bgee:G3XAE6 Uniprot:G3XAE6
        Length = 852

 Score = 133 (51.9 bits), Expect = 4.8e-05, P = 4.8e-05
 Identities = 77/337 (22%), Positives = 132/337 (39%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGSD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
               ++  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+ + L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   354 CLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>UNIPROTKB|G3T2L0
            symbol:SULF1 "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 OMA:QRKGDEC
            Ensembl:ENSLAFT00000008824 Uniprot:G3T2L0
        Length = 857

 Score = 133 (51.9 bits), Expect = 4.9e-05, P = 4.9e-05
 Identities = 82/370 (22%), Positives = 140/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    59 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 118

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   119 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 167

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     L+    + ++      
Sbjct:   168 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRLYPHRPIMMVISHAAP 225

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 285

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   286 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 335

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   336 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 388

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   389 LLDLEKPGNR 398


>UNIPROTKB|G1SJB8
            symbol:SULF1 "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 EMBL:AAGW02046925 EMBL:AAGW02046926
            Ensembl:ENSOCUT00000003251 Uniprot:G1SJB8
        Length = 869

 Score = 133 (51.9 bits), Expect = 4.9e-05, P = 4.9e-05
 Identities = 81/370 (21%), Positives = 140/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>UNIPROTKB|Q8IWU5
            symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005795 "Golgi stack" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=IMP;IDA] [GO:0005615 "extracellular space"
            evidence=NAS] [GO:0009986 "cell surface" evidence=IDA] [GO:0030201
            "heparan sulfate proteoglycan metabolic process" evidence=IDA;NAS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=IDA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IDA;IMP] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0014846 "esophagus smooth muscle contraction" evidence=ISS]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
            [GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
            factor receptor signaling pathway" evidence=ISS] [GO:0003094
            "glomerular filtration" evidence=ISS] [GO:0032836 "glomerular
            basement membrane development" evidence=ISS] [GO:0001822 "kidney
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 EMBL:AY101176 GO:GO:0005783 GO:GO:0005886
            EMBL:CH471077 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
            GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0048706 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 EMBL:AL034418 KO:K14607
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609
            Pfam:PF12548 EMBL:AB033073 EMBL:AY358461 EMBL:CR749319
            EMBL:AL354813 EMBL:BC020962 EMBL:BC110539 EMBL:AL133001
            IPI:IPI00297252 IPI:IPI00555879 RefSeq:NP_001155313.1
            RefSeq:NP_061325.1 RefSeq:NP_940998.2 UniGene:Hs.162016
            ProteinModelPortal:Q8IWU5 SMR:Q8IWU5 IntAct:Q8IWU5 STRING:Q8IWU5
            PhosphoSite:Q8IWU5 DMDM:33112446 PaxDb:Q8IWU5 PRIDE:Q8IWU5
            DNASU:55959 Ensembl:ENST00000359930 Ensembl:ENST00000467815
            Ensembl:ENST00000484875 GeneID:55959 KEGG:hsa:55959 UCSC:uc002xto.3
            UCSC:uc002xtr.3 CTD:55959 GeneCards:GC20M046285 H-InvDB:HIX0027735
            HGNC:HGNC:20392 HPA:HPA002325 MIM:610013 neXtProt:NX_Q8IWU5
            PharmGKB:PA134902131 InParanoid:Q8IWU5 OMA:PKYYGQG
            OrthoDB:EOG49KFPX PhylomeDB:Q8IWU5 GenomeRNAi:55959 NextBio:61367
            ArrayExpress:Q8IWU5 Bgee:Q8IWU5 CleanEx:HS_SULF2
            Genevestigator:Q8IWU5 GermOnline:ENSG00000196562 Uniprot:Q8IWU5
        Length = 870

 Score = 133 (51.9 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 77/337 (22%), Positives = 132/337 (39%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGSD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
               ++  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+ + L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   354 CLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>UNIPROTKB|G3WVX3
            symbol:SULF1 "Uncharacterized protein" species:9305
            "Sarcophilus harrisii" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 EMBL:AEFK01056197
            EMBL:AEFK01056198 EMBL:AEFK01056199 EMBL:AEFK01056200
            Ensembl:ENSSHAT00000019735 Uniprot:G3WVX3
        Length = 870

 Score = 133 (51.9 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 81/370 (21%), Positives = 141/370 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSDLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G L+N+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    VS  +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVSQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>UNIPROTKB|E9PGI0
            symbol:ARSK "Arylsulfatase K" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484
            HGNC:HGNC:25239 EMBL:AC008547 EMBL:AC090071 IPI:IPI00967230
            ProteinModelPortal:E9PGI0 SMR:E9PGI0 Ensembl:ENST00000504873
            ArrayExpress:E9PGI0 Bgee:E9PGI0 Uniprot:E9PGI0
        Length = 395

 Score = 95 (38.5 bits), Expect = 6.0e-05, Sum P(2) = 6.0e-05
 Identities = 29/104 (27%), Positives = 53/104 (50%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R  Y AM  + D  +G +I AL +  +L+ +I+I+ SD+G   +E+R+   Y        
Sbjct:   276 RAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIYSSDHGELAMEHRQF--Y-------- 325

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
                 K +++E    VP ++  P I+   +VS  ++ + D  PT+
Sbjct:   326 ----KMSMYEASAHVPLLMMGPGIKAGLQVS-NVVSLVDIYPTM 364

 Score = 78 (32.5 bits), Expect = 6.0e-05, Sum P(2) = 6.0e-05
 Identities = 25/89 (28%), Positives = 38/89 (42%)

Query:    96 LSFH-GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             L+FH GS  +  P I+ +   G    N Y   P+C PSRA++ +G +  H         G
Sbjct:    46 LTFHPGSQVVKLPFINFMKTRGTSFLNAYTNSPICCPSRAAMWSGLFT-HLTESWNNFKG 104

Query:   154 AEPRGVPLTERFLPEYLRELGYSTKAIGK 182
              +P      +      +   GY T+  GK
Sbjct:   105 LDPNYTTWMD-----VMERHGYRTQKFGK 128


>UNIPROTKB|G1LHX9
            symbol:SULF1 "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK EMBL:ACTA01145671
            EMBL:ACTA01153670 Ensembl:ENSAMET00000006800 Uniprot:G1LHX9
        Length = 868

 Score = 132 (51.5 bits), Expect = 6.4e-05, P = 6.4e-05
 Identities = 81/370 (21%), Positives = 139/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QATHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N   R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLHRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>FB|FBgn0035445
            symbol:CG12014 species:7227 "Drosophila melanogaster"
            [GO:0004423 "iduronate-2-sulfatase activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:AE014296
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 HSSP:P15289 KO:K01136 GO:GO:0004423
            GeneTree:ENSGT00640000091539 RefSeq:NP_647814.1 UniGene:Dm.15756
            ProteinModelPortal:Q9VZP8 STRING:Q9VZP8 PRIDE:Q9VZP8
            EnsemblMetazoa:FBtr0073077 GeneID:38423 KEGG:dme:Dmel_CG12014
            UCSC:CG12014-RA FlyBase:FBgn0035445 InParanoid:Q9VZP8 OMA:ERVIPAY
            OrthoDB:EOG45DV4P PhylomeDB:Q9VZP8 GenomeRNAi:38423 NextBio:808590
            ArrayExpress:Q9VZP8 Bgee:Q9VZP8 Uniprot:Q9VZP8
        Length = 512

 Score = 98 (39.6 bits), Expect = 7.3e-05, Sum P(2) = 7.3e-05
 Identities = 32/94 (34%), Positives = 44/94 (46%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYP--IHTGMQGPPIWGAE 155
             +G     TP +D  A    I   +Y+Q  +C PSR SL+TG+ P  +H        W   
Sbjct:    42 YGDTLASTPYLDNFARGSHIFTRVYSQQSLCAPSRNSLLTGRRPDTLHL-YDFYSYWRT- 99

Query:   156 PRGVPLTERF--LPEYLRELGYSTKAIGK-WHLG 186
                   T  F  LP+Y +E GY T + GK +H G
Sbjct:   100 -----FTGNFTTLPQYFKEHGYYTYSCGKVFHPG 128

 Score = 77 (32.2 bits), Expect = 7.3e-05, Sum P(2) = 7.3e-05
 Identities = 29/110 (26%), Positives = 49/110 (44%)

Query:   304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
             R++Y A V  +DD  G +I  L     L+ ++++ + D+G    E+ E + Y N+     
Sbjct:   287 RQSYYASVSYVDDLFGKLIGGLD----LDETVVVALGDHGWSLGEHAEWAKYSNF----- 337

Query:   364 YRGVKNTLWEGGVKVPAILWSPQIQ-QNPRVSLQMMHISDWLPTLYTAAG 412
                      E  ++VP I+ SPQ      +    +  + D  PTL   AG
Sbjct:   338 ---------EVALRVPLIIRSPQFPVAQTKYYHGITELLDVFPTLVDLAG 378


>UNIPROTKB|Q90XB6
            symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9091
            "Coturnix coturnix" [GO:0001502 "cartilage condensation"
            evidence=IDA] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=IDA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
            [GO:0009986 "cell surface" evidence=ISS;IDA] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=IDA] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=IDA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=IDA] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060070 "canonical Wnt receptor
            signaling pathway" evidence=IDA] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005795 GO:GO:0005509 GO:GO:0045121 GO:GO:0030336
            GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0001502 GO:GO:0060348 GO:GO:0060070 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 EMBL:AF410802 ProteinModelPortal:Q90XB6
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 Uniprot:Q90XB6
        Length = 867

 Score = 131 (51.2 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 79/350 (22%), Positives = 134/350 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIWGA--EPRGVPLTERFLPEYL 170
             +N     P+C PSR+S++TGKY     I+T  +    P W A  EPR   +       YL
Sbjct:    78 INAFVTTPMCCPSRSSMLTGKYVHNHNIYTNNENCSSPSWQATHEPRTFAV-------YL 130

Query:   171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
                GY T   GK+ L  +   Y P   G+    G +     +Y++ +S   ++  E +G 
Sbjct:   131 NNTGYRTAFFGKY-LNEYNGSYIP--PGWREWVGLVKNS-RFYNYTISRNGNK--EKHGF 184

Query:   231 DMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXX 280
             D  ++  T     +++  +     ++    + ++        P D               
Sbjct:   185 DYAKDYFTDLITNESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQH 244

Query:   281 XXXXXXEAPQETINQFQYITDP---------N--RRTYAAMVKKLDDSVGTVISALQRKG 329
                    AP    +     T P         N  +R     +  +DDS+  +   L   G
Sbjct:   245 ITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMG 304

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
              LEN+ II+ +D+G    ++         G + PY        +  ++VP  +  P ++ 
Sbjct:   305 ELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRVPFFIRGPSVEP 351

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
                V   +++I D  PT+   AG DT   P ++DG      L L  P  R
Sbjct:   352 GSVVPQIVLNI-DLAPTILDIAGLDT---PPDMDGKSVLKLLDLERPGNR 397


>UNIPROTKB|E1BRF7
            symbol:SULF1 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003094
            "glomerular filtration" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0001502 "cartilage
            condensation" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=ISS]
            [GO:0005794 "Golgi apparatus" evidence=ISS] [GO:0001937 "negative
            regulation of endothelial cell proliferation" evidence=ISS]
            [GO:0016525 "negative regulation of angiogenesis" evidence=ISS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISS] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISS] [GO:0030336 "negative regulation of cell
            migration" evidence=ISS] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=ISS] [GO:0048010 "vascular endothelial
            growth factor receptor signaling pathway" evidence=ISS] [GO:0004065
            "arylsulfatase activity" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005783
            "endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0007155
            "cell adhesion" evidence=ISS] [GO:0048661 "positive regulation of
            smooth muscle cell proliferation" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048706
            "embryonic skeletal system development" evidence=ISS] [GO:0001822
            "kidney development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK EMBL:AADN02048527 EMBL:AADN02048528 EMBL:AADN02048529
            EMBL:AADN02048530 EMBL:AADN02048531 EMBL:AADN02048532
            EMBL:AADN02048533 EMBL:AADN02048534 EMBL:AADN02048535
            EMBL:AADN02048536 EMBL:AADN02048537 EMBL:AADN02048538
            EMBL:AADN02048539 EMBL:AADN02048540 EMBL:AADN02048541
            IPI:IPI00571776 ProteinModelPortal:E1BRF7
            Ensembl:ENSGALT00000018383 ArrayExpress:E1BRF7 Uniprot:E1BRF7
        Length = 868

 Score = 131 (51.2 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 79/350 (22%), Positives = 134/350 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIWGA--EPRGVPLTERFLPEYL 170
             +N     P+C PSR+S++TGKY     I+T  +    P W A  EPR   +       YL
Sbjct:    78 INAFVTTPMCCPSRSSMLTGKYVHNHNIYTNNENCSSPSWQATHEPRTFAV-------YL 130

Query:   171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
                GY T   GK+ L  +   Y P   G+    G +     +Y++ +S   ++  E +G 
Sbjct:   131 NNTGYRTAFFGKY-LNEYNGSYIP--PGWREWVGLVKNS-RFYNYTISRNGNK--EKHGF 184

Query:   231 DMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXX 280
             D  ++  T     +++  +     ++    + ++        P D               
Sbjct:   185 DYAKDYFTDLITNESINYFRMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSELYPNASQH 244

Query:   281 XXXXXXEAPQETINQFQYITDP---------N--RRTYAAMVKKLDDSVGTVISALQRKG 329
                    AP    +     T P         N  +R     +  +DDS+  +   L   G
Sbjct:   245 ITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMG 304

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
              LEN+ II+ +D+G    ++         G + PY        +  ++VP  +  P ++ 
Sbjct:   305 ELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRVPFFIRGPSVEP 351

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
                V   +++I D  PT+   AG DT   P ++DG      L L  P  R
Sbjct:   352 GSVVPQIVLNI-DLAPTILDIAGLDT---PPDMDGKSVLKLLDLERPGNR 397


>UNIPROTKB|Q0C044
            symbol:HNE_2203 "Sulfatase domain protein" species:228405
            "Hyphomonas neptunium ATCC 15444" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484 EMBL:CP000158
            GenomeReviews:CP000158_GR RefSeq:YP_760899.1
            ProteinModelPortal:Q0C044 STRING:Q0C044 GeneID:4287652
            KEGG:hne:HNE_2203 PATRIC:32217253 eggNOG:NOG280633
            BioCyc:HNEP228405:GI69-2226-MONOMER Uniprot:Q0C044
        Length = 192

 Score = 101 (40.6 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05
 Identities = 27/67 (40%), Positives = 37/67 (55%)

Query:   368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGG-DTSRLPLNIDGL 425
             K+ L EGG++VP I+ W  ++      S Q+M   DW PTL + AGG   +R P   DG+
Sbjct:    33 KSDLLEGGLRVPTIVRWPNRVPAGS-TSDQVMITMDWYPTLLSVAGGAPDARFPS--DGM 89

Query:   426 DQWSSLL 432
             D    LL
Sbjct:    90 DLTDQLL 96

 Score = 57 (25.1 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05
 Identities = 11/39 (28%), Positives = 23/39 (58%)

Query:   568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
             +LFN+  DP E+ N+ +  PD+ + L +  +    +++P
Sbjct:   132 FLFNIVEDPRERANLKARLPDLFTLLQDKYQAWNASVLP 170


>UNIPROTKB|E1BIY5
            symbol:SULF2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0060384 "innervation" evidence=IEA] [GO:0060348 "bone
            development" evidence=IEA] [GO:0048706 "embryonic skeletal system
            development" evidence=IEA] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0032836 "glomerular basement
            membrane development" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=IEA]
            [GO:0014846 "esophagus smooth muscle contraction" evidence=IEA]
            [GO:0010575 "positive regulation vascular endothelial growth factor
            production" evidence=IEA] [GO:0009986 "cell surface" evidence=IEA]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
            filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0030201 KO:K14607
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG EMBL:DAAA02036810 IPI:IPI00698144
            RefSeq:NP_001179867.1 UniGene:Bt.90452 ProteinModelPortal:E1BIY5
            Ensembl:ENSBTAT00000009852 GeneID:533264 KEGG:bta:533264
            NextBio:20875979 Uniprot:E1BIY5
        Length = 862

 Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 79/352 (22%), Positives = 135/352 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGFD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
               ++  T     D+V  +     ++    V ++        P D                
Sbjct:   187 YSKDYLTDLITNDSVSFFRASKKMYPHRPVLMVLSHAAPHGPEDSAPQYSSLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+ + L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQTLLSVDDSMETIYNMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
               NP + L +    D  PT+   AG D   +P ++DG      L    P+ R
Sbjct:   354 SLNPHIVLNI----DLAPTILDIAGLD---IPSDMDGKSILKLLDTERPANR 398


>UNIPROTKB|F1RU06
            symbol:SULF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030513 "positive regulation of BMP signaling pathway"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0060348 "bone
            development" evidence=ISS] [GO:0001822 "kidney development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0048706
            "embryonic skeletal system development" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
            "positive regulation of smooth muscle cell proliferation"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
            "membrane raft" evidence=ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0004065 "arylsulfatase activity" evidence=ISS]
            [GO:0048010 "vascular endothelial growth factor receptor signaling
            pathway" evidence=ISS] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=ISS]
            [GO:0030336 "negative regulation of cell migration" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=ISS] [GO:0001937 "negative regulation
            of endothelial cell proliferation" evidence=ISS] [GO:0005794 "Golgi
            apparatus" evidence=ISS] [GO:0060686 "negative regulation of
            prostatic bud formation" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=ISS]
            [GO:0036022 "limb joint morphogenesis" evidence=ISS] [GO:0001502
            "cartilage condensation" evidence=ISS] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0003094 "glomerular filtration" evidence=IEA] [GO:0005509
            "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0010575 GO:GO:0045121 GO:GO:0030336
            GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0032836 GO:GO:0060384 GO:GO:0008449 GO:GO:0030201
            GO:GO:0014846 GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609
            Pfam:PF12548 OMA:SVRVTHK EMBL:CU179692 EMBL:CU302274
            Ensembl:ENSSSCT00000006792 Uniprot:F1RU06
        Length = 871

 Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 81/370 (21%), Positives = 138/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       +   G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    59 GSLQVMNKTRKIMELGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 118

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   119 QAVHEPRTFAV-------YLNSTGYRTAFFGKY-LNEYNGSYVP--PGWREWLGLIKNS- 167

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   168 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 225

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N   R    
Sbjct:   226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLHRKRLQ 285

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   286 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 335

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   336 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 388

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   389 LLDLEKPGNR 398


>UNIPROTKB|Q3L472
            symbol:Sulf2 "Protein Sulf2" species:10116 "Rattus
            norvegicus" [GO:0002063 "chondrocyte development" evidence=IEA]
            [GO:0003094 "glomerular filtration" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0005509 "calcium ion
            binding" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0032836
            "glomerular basement membrane development" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0048706 "embryonic skeletal system development"
            evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
            [GO:0060384 "innervation" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 RGD:1305078 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0009986 GO:GO:0005509
            GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 EMBL:CH474005
            GO:GO:0060384 GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG EMBL:AY742216 IPI:IPI00767654
            RefSeq:NP_001030099.1 UniGene:Rn.4228 STRING:Q3L472
            Ensembl:ENSRNOT00000008478 GeneID:311642 KEGG:rno:311642
            InParanoid:Q3L472 NextBio:663979 Genevestigator:Q3L472
            Uniprot:Q3L472
        Length = 875

 Score = 130 (50.8 bits), Expect = 0.00011, P = 0.00011
 Identities = 77/337 (22%), Positives = 130/337 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGMK--EKHGSD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
                +  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+   L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYDMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPSVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   354 SLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>UNIPROTKB|Q32KH2
            symbol:sulf1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0030201 "heparan sulfate proteoglycan
            metabolic process" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=ISS] [GO:0001822 "kidney development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
            "positive regulation of smooth muscle cell proliferation"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
            "membrane raft" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0005783 "endoplasmic reticulum" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
            factor receptor signaling pathway" evidence=ISS] [GO:0030513
            "positive regulation of BMP signaling pathway" evidence=ISS]
            [GO:0030336 "negative regulation of cell migration" evidence=ISS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISS] [GO:0001937 "negative regulation of endothelial cell
            proliferation" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0060348 "bone development" evidence=ISS]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic factor
            receptor signaling pathway" evidence=ISS] [GO:0002063 "chondrocyte
            development" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
            evidence=ISS] [GO:0001502 "cartilage condensation" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0005886 "plasma
            membrane" evidence=IEA] [GO:0003094 "glomerular filtration"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            OrthoDB:EOG4VT5WH EMBL:AAEX03015848 EMBL:BN000765
            RefSeq:NP_001041580.1 UniGene:Cfa.36649 Ensembl:ENSCAFT00000046451
            GeneID:486986 KEGG:cfa:486986 InParanoid:Q32KH2 NextBio:20860674
            Uniprot:Q32KH2
        Length = 869

 Score = 129 (50.5 bits), Expect = 0.00013, P = 0.00013
 Identities = 80/370 (21%), Positives = 139/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N   R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLHRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G L+N+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>MGI|MGI:1919293
            symbol:Sulf2 "sulfatase 2" species:10090 "Mus musculus"
            [GO:0001822 "kidney development" evidence=IGI] [GO:0002063
            "chondrocyte development" evidence=IMP] [GO:0003094 "glomerular
            filtration" evidence=IGI] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO]
            [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005615
            "extracellular space" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0006790 "sulfur compound metabolic process" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IGI] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISO;IMP] [GO:0032836 "glomerular basement
            membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=IDA]
            [GO:0040037 "negative regulation of fibroblast growth factor
            receptor signaling pathway" evidence=IGI] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0048706 "embryonic skeletal system
            development" evidence=IGI] [GO:0051216 "cartilage development"
            evidence=IMP] [GO:0060348 "bone development" evidence=IGI]
            [GO:0060384 "innervation" evidence=IGI] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 MGI:MGI:1919293 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
            GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
            GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:55959 OMA:PKYYGQG
            OrthoDB:EOG49KFPX EMBL:AY101177 EMBL:AK008108 EMBL:AK028874
            EMBL:AK034712 EMBL:AK036685 EMBL:AK049170 EMBL:AK081643
            EMBL:AK133336 EMBL:AK165183 EMBL:AL589873 EMBL:BC027238
            EMBL:BC141086 IPI:IPI00268030 RefSeq:NP_001239507.1
            RefSeq:NP_001239508.1 RefSeq:NP_082348.2 UniGene:Mm.1011
            ProteinModelPortal:Q8CFG0 SMR:Q8CFG0 STRING:Q8CFG0
            PhosphoSite:Q8CFG0 PRIDE:Q8CFG0 Ensembl:ENSMUST00000088086
            Ensembl:ENSMUST00000109249 GeneID:72043 KEGG:mmu:72043
            InParanoid:B2RUD5 NextBio:335292 Bgee:Q8CFG0 CleanEx:MM_SULF2
            Genevestigator:Q8CFG0 GermOnline:ENSMUSG00000006800 Uniprot:Q8CFG0
        Length = 875

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 77/337 (22%), Positives = 130/337 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGSD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
                +  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSTDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+   L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYDMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYILYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   354 SLNPHIVLNI----DLAPTILDIAGLD---IPADMDG 383


>ZFIN|ZDB-GENE-030131-5846
            symbol:gnsb "glucosamine (N-acetyl)-6-sulfatase
            (Sanfilippo disease IIID), b" species:7955 "Danio rerio"
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666
            ZFIN:ZDB-GENE-030131-5846 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GeneTree:ENSGT00400000022041 GO:GO:0030203
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:CU896586 IPI:IPI00971874
            Ensembl:ENSDART00000112103 ArrayExpress:F1QJ04 Bgee:F1QJ04
            Uniprot:F1QJ04
        Length = 507

 Score = 93 (37.8 bits), Expect = 0.00024, Sum P(2) = 0.00024
 Identities = 38/157 (24%), Positives = 67/157 (42%)

Query:   116 GIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTERF-LPEYLR 171
             G   +N +   P+C PSR+S ++G+YP +  +    + G  +        E F  P YL 
Sbjct:    64 GATFSNAFTSTPLCCPSRSSFLSGRYPHNHLVHNNSVEGNCSSAAWQKTAEPFAFPVYLN 123

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHI-LSDQYSRTVELNGH 230
             ++ Y T   GK     +  +Y     G  +H     G   ++  +  S  Y+ T+ +NG 
Sbjct:   124 KMRYQTFYCGK-----YLNQYGSKDAGGVAHVP--PGWDQWHALVGNSKYYNYTLSVNGK 176

Query:   231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             + +   S   D    Y TDL    ++  +E++    P
Sbjct:   177 EEKHGDSYEKD----YLTDLVLNRSLHFLEERSPSHP 209

 Score = 77 (32.2 bits), Expect = 0.00024, Sum P(2) = 0.00024
 Identities = 36/149 (24%), Positives = 68/149 (45%)

Query:   294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
             +   Y+ +  RR +  ++  +DD V  ++  L     L+N+ I + SD+G  T ++    
Sbjct:   265 SSIDYLDNAFRRRWQTLLS-VDDLVERLLKKLDSVKELDNTYIFYTSDHGYHTGQF---- 319

Query:   354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP-TLYTAAG 412
                    + P    K  L+E  +++P ++  P I+    +   +++I   LP T+   AG
Sbjct:   320 -------SLPID--KRQLYEFDIRIPLLVRGPGIKAKQTLQSPVLNID--LPMTILDIAG 368

Query:   413 GDTSRLPLNIDG---LDQWSSLLLNTPSR 438
              + S +  N+DG   L Q +  L N   R
Sbjct:   369 VNLSTV--NMDGQSFLPQMAPSLRNGTER 395


>UNIPROTKB|F6VXY6
            symbol:SULF1 "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
            EMBL:ACFV01096449 EMBL:ACFV01096450 EMBL:ACFV01096451
            EMBL:ACFV01096452 EMBL:ACFV01096453 EMBL:ACFV01096454
            EMBL:ACFV01096455 EMBL:ACFV01096456 EMBL:ACFV01096457
            EMBL:ACFV01096458 EMBL:ACFV01096459 EMBL:ACFV01096460
            EMBL:ACFV01096461 RefSeq:XP_002759021.1 Ensembl:ENSCJAT00000009824
            Ensembl:ENSCJAT00000053576 GeneID:100390937 Uniprot:F6VXY6
        Length = 869

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 79/355 (22%), Positives = 136/355 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNSTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     ++V  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESVNYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382


>MGI|MGI:2138563
            symbol:Sulf1 "sulfatase 1" species:10090 "Mus musculus"
            [GO:0001822 "kidney development" evidence=IGI] [GO:0001937
            "negative regulation of endothelial cell proliferation"
            evidence=ISO] [GO:0002063 "chondrocyte development" evidence=IMP]
            [GO:0003094 "glomerular filtration" evidence=IGI] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=ISO] [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0005794
            "Golgi apparatus" evidence=ISO] [GO:0005886 "plasma membrane"
            evidence=IDA] [GO:0006790 "sulfur compound metabolic process"
            evidence=ISO] [GO:0006915 "apoptotic process" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IGI] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISO;IMP] [GO:0030336 "negative regulation of cell
            migration" evidence=ISO] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=ISO] [GO:0032836 "glomerular basement
            membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=IDA]
            [GO:0040036 "regulation of fibroblast growth factor receptor
            signaling pathway" evidence=ISO] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISO;IGI;IDA] [GO:0045121 "membrane raft" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048010 "vascular
            endothelial growth factor receptor signaling pathway" evidence=ISO]
            [GO:0048706 "embryonic skeletal system development" evidence=IGI]
            [GO:0051216 "cartilage development" evidence=IMP] [GO:0060348 "bone
            development" evidence=IGI] [GO:0060384 "innervation" evidence=IGI]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=IDA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 MGI:MGI:2138563 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
            OrthoDB:EOG4VT5WH ChiTaRS:SULF1 EMBL:AY101178 EMBL:AK129278
            EMBL:AK028285 EMBL:AK045002 EMBL:BC034547 EMBL:BC049276
            IPI:IPI00111481 RefSeq:NP_001185494.1 RefSeq:NP_001185495.1
            RefSeq:NP_758498.1 UniGene:Mm.45563 ProteinModelPortal:Q8K007
            SMR:Q8K007 STRING:Q8K007 PhosphoSite:Q8K007 PRIDE:Q8K007
            Ensembl:ENSMUST00000088585 Ensembl:ENSMUST00000177608
            Ensembl:ENSMUST00000180062 GeneID:240725 KEGG:mmu:240725
            UCSC:uc007aia.2 NextBio:384701 Bgee:Q8K007 CleanEx:MM_SULF1
            Genevestigator:Q8K007 GermOnline:ENSMUSG00000016918 Uniprot:Q8K007
        Length = 870

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 80/370 (21%), Positives = 139/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       +   G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G L+N+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P I+    V   +++I D  PT+   AG D+   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDS---PSDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>UNIPROTKB|G1PHQ1
            symbol:SULF1 "Uncharacterized protein" species:59463
            "Myotis lucifugus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK
            EMBL:AAPE02021694 Ensembl:ENSMLUT00000011203 Uniprot:G1PHQ1
        Length = 871

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 80/370 (21%), Positives = 138/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    + ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPIMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N   R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLHRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P ++    V   +++I D  PT+   AG D    P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDP---PPDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397


>UNIPROTKB|G1KQZ3
            symbol:SULF1 "Uncharacterized protein" species:28377
            "Anolis carolinensis" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK Ensembl:ENSACAT00000015364 Uniprot:G1KQZ3
        Length = 878

 Score = 126 (49.4 bits), Expect = 0.00029, P = 0.00029
 Identities = 81/350 (23%), Positives = 133/350 (38%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIWGA--EPRGVPLTERFLPEYL 170
             +N     P+C PSR+S++TGKY     I+T  +    P W A  EPR   +       YL
Sbjct:    78 VNAFVTTPMCCPSRSSMLTGKYVHNHNIYTNNENCSSPSWQATHEPRTFAV-------YL 130

Query:   171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
                GY T   GK+ L  +   Y P   G+    G +     +Y++ +     +  E +G 
Sbjct:   131 NNTGYRTAFFGKY-LNEYNGSYIP--PGWREWVGLIKNS-RFYNYTVCRNGLK--EKHGF 184

Query:   231 DMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXX 280
             D  ++  T     D++  +     ++    + ++        P D               
Sbjct:   185 DYAKDYFTDLITNDSIHYFKMSKRIYPHRPIMMVISHAAPHGPEDSAPQFSKLYPNASQH 244

Query:   281 XXXXXXEAPQETINQFQYITDP---------N--RRTYAAMVKKLDDSVGTVISALQRKG 329
                    AP    +     T P         N  +R     +  +DDS+  +   L   G
Sbjct:   245 ITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQTLLSVDDSMERLYHMLVETG 304

Query:   330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
              LEN+ II+ +D+G    ++         G + PY        +  ++VP  +  P I+ 
Sbjct:   305 ELENTYIIYTADHGYHIGQFGLVK-----GKSMPY--------DFDIRVPFFIRGPSIEP 351

Query:   390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
                VS  +++I D  PT+   AG DT   P ++DG      L L  P  R
Sbjct:   352 GSVVSQIVLNI-DLAPTVLDIAGLDT---PPDMDGKSVLKLLDLEKPGNR 397


>UNIPROTKB|F7FJY3
            symbol:SULF1 "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001502 "cartilage condensation" evidence=ISS]
            [GO:0001822 "kidney development" evidence=ISS] [GO:0001937
            "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
            GO:GO:0005615 GO:GO:0009986 GO:GO:0048661 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
            GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:QRKGDEC Ensembl:ENSMMUT00000032744 Ensembl:ENSMMUT00000032745
            Uniprot:F7FJY3
        Length = 759

 Score = 125 (49.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 78/355 (21%), Positives = 136/355 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382


>UNIPROTKB|G3R9R9
            symbol:SULF1 "Uncharacterized protein" species:9595
            "Gorilla gorilla gorilla" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK RefSeq:XP_004047178.1 RefSeq:XP_004047179.1
            Ensembl:ENSGGOT00000012515 GeneID:101141420 Uniprot:G3R9R9
        Length = 869

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 78/355 (21%), Positives = 136/355 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382


>UNIPROTKB|Q8IWU6
            symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0006915 "apoptotic process" evidence=IEA] [GO:0005795 "Golgi
            stack" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=IMP;IDA] [GO:0005615 "extracellular space"
            evidence=IDA;NAS] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=IDA;NAS] [GO:0030336 "negative regulation of cell
            migration" evidence=IMP] [GO:0040036 "regulation of fibroblast
            growth factor receptor signaling pathway" evidence=IMP] [GO:0040037
            "negative regulation of fibroblast growth factor receptor signaling
            pathway" evidence=ISS;IMP] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=IMP] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IMP;IDA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=IDA] [GO:0045121 "membrane raft" evidence=IDA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] [GO:0048010 "vascular
            endothelial growth factor receptor signaling pathway" evidence=IDA]
            [GO:0002063 "chondrocyte development" evidence=ISS] [GO:0035860
            "glial cell-derived neurotrophic factor receptor signaling pathway"
            evidence=ISS] [GO:0051216 "cartilage development" evidence=ISS]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=ISS] [GO:0005794 "Golgi apparatus" evidence=ISS]
            [GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=ISS] [GO:0003094 "glomerular filtration" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=IDA] [GO:0001937 "negative regulation of endothelial cell
            proliferation" evidence=IDA] [GO:0014846 "esophagus smooth muscle
            contraction" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
            [GO:0001822 "kidney development" evidence=ISS] [GO:0060348 "bone
            development" evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 EMBL:AY101175 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
            GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 Orphanet:2496
            HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
            GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            EMBL:AF545571 EMBL:AB029000 EMBL:AK074873 IPI:IPI00293203
            RefSeq:NP_001121676.1 RefSeq:NP_001121677.1 RefSeq:NP_001121678.1
            RefSeq:NP_055985.2 UniGene:Hs.409602 ProteinModelPortal:Q8IWU6
            SMR:Q8IWU6 STRING:Q8IWU6 PhosphoSite:Q8IWU6 DMDM:33112447
            PaxDb:Q8IWU6 PRIDE:Q8IWU6 DNASU:23213 Ensembl:ENST00000260128
            Ensembl:ENST00000402687 Ensembl:ENST00000419716
            Ensembl:ENST00000458141 GeneID:23213 KEGG:hsa:23213 UCSC:uc003xyd.2
            GeneCards:GC08P070428 HGNC:HGNC:20391 MIM:610012 neXtProt:NX_Q8IWU6
            PharmGKB:PA134861022 InParanoid:Q8IWU6 OMA:SVRVTHK
            OrthoDB:EOG4VT5WH ChiTaRS:SULF1 GenomeRNAi:23213 NextBio:44771
            ArrayExpress:Q8IWU6 Bgee:Q8IWU6 CleanEx:HS_SULF1
            Genevestigator:Q8IWU6 Uniprot:Q8IWU6
        Length = 871

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 78/355 (21%), Positives = 136/355 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G   +N     P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QAMHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G LEN+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                +  ++VP  +  P ++    V   +++I D  PT+   AG DT   P ++DG
Sbjct:   335 ---DFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDT---PPDVDG 382


>UNIPROTKB|I3L4C9
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
            EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH
            ProteinModelPortal:I3L4C9 SMR:I3L4C9 Ensembl:ENST00000576941
            Bgee:I3L4C9 Uniprot:I3L4C9
        Length = 108

 Score = 98 (39.6 bits), Expect = 0.00041, P = 0.00041
 Identities = 21/59 (35%), Positives = 34/59 (57%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGK----YPIHTGMQGPPIW 152
             + ++ I TP++DALA   ++  N +     C+PSRASL+TG      P+    + PP+W
Sbjct:    40 YNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQAFLPLRRLPRPPPLW 98


>FB|FBgn0040271
            symbol:Sulf1 "Sulfated" species:7227 "Drosophila
            melanogaster" [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=ISS] [GO:0007389 "pattern specification process"
            evidence=IMP] [GO:0018741 "alkyl sulfatase activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0005783
            "endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0005795 "Golgi stack" evidence=ISS] [GO:0017015
            "regulation of transforming growth factor beta receptor signaling
            pathway" evidence=IMP] [GO:0030111 "regulation of Wnt receptor
            signaling pathway" evidence=IMP] [GO:0045880 "positive regulation
            of smoothened signaling pathway" evidence=IMP] [GO:0045879
            "negative regulation of smoothened signaling pathway" evidence=IMP]
            [GO:0042059 "negative regulation of epidermal growth factor
            receptor signaling pathway" evidence=IGI] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783
            EMBL:AE014297 GO:GO:0009986 GO:GO:0030111 GO:GO:0046872
            GO:GO:0005795 GO:GO:0042059 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0017015
            GO:GO:0045879 GO:GO:0045880 KO:K14607 InterPro:IPR024609
            Pfam:PF12548 EMBL:AY119658 EMBL:AF211192 RefSeq:NP_524987.1
            UniGene:Dm.13781 ProteinModelPortal:Q9VEX0 SMR:Q9VEX0
            DIP:DIP-21001N MINT:MINT-1598983 STRING:Q9VEX0 PaxDb:Q9VEX0
            PRIDE:Q9VEX0 EnsemblMetazoa:FBtr0083273 GeneID:53437
            KEGG:dme:Dmel_CG6725 UCSC:CG6725-RA CTD:23213 FlyBase:FBgn0040271
            InParanoid:Q9VEX0 OMA:QWILQVT OrthoDB:EOG4GB5N2 PhylomeDB:Q9VEX0
            GenomeRNAi:53437 NextBio:841154 Bgee:Q9VEX0 GermOnline:CG6725
            Uniprot:Q9VEX0
        Length = 1114

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 101/420 (24%), Positives = 163/420 (38%)

Query:    95 DLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
             D+     N +P   +  L   G    + Y   P+C P+R+SL+TG Y +H  M       
Sbjct:    65 DVELGSLNFMPR-TLRLLRDGGAEFRHAYTTTPMCCPARSSLLTGMY-VHNHMVFTNNDN 122

Query:   154 -AEPRGVPLTE-RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              + P+     E R    YL   GY T   GK+ L  +   Y P   G+   +G   G+I 
Sbjct:   123 CSSPQWQATHETRSYATYLSNAGYRTGYFGKY-LNKYNGSYIP--PGWRE-WG---GLI- 174

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY-ATD--LFTKEAVQLIEDQPV---- 264
                 + S  Y+ ++ LNG  ++     A D   +  A D   F + + Q  + +PV    
Sbjct:   175 ----MNSKYYNYSINLNGQKIKHGFDYAKDYYPDLIANDSIAFLRSSKQQNQRKPVLLTM 230

Query:   265 ---------DKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP-----NRRTYAAM 310
                      D                      AP         +T+P      R T   M
Sbjct:   231 SFPAPHGPEDSAPQYSHLFFNVTTHHTPSYDHAPNPDKQWILRVTEPMQPVHKRFTNLLM 290

Query:   311 VKKL------DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
              K+L      D +V  V + L+  G L+N+ I++ SD+G    ++         G ++P+
Sbjct:   291 TKRLQTLQSVDVAVERVYNELKELGELDNTYIVYTSDHGYHLGQFGLIK-----GKSFPF 345

Query:   365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
                     E  V+VP ++  P IQ +  V+  ++++ D  PT     G  T   P ++DG
Sbjct:   346 --------EFDVRVPFLIRGPGIQASKVVNEIVLNV-DLAPTFLDMGGVPT---PQHMDG 393

Query:   425 LDQWSSLLLNTPSRRNSNIDGLDQW-SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
                   LLL+    RN  +   D W  S L+ +  RR +    I E +    +   + KL
Sbjct:   394 RSILP-LLLS----RNRAVR--DNWPDSFLIESSGRRETAE-QIAESRARLQIERRNMKL 445


>UNIPROTKB|I3L2I6
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
            EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH Ensembl:ENST00000574505
            Uniprot:I3L2I6
        Length = 106

 Score = 97 (39.2 bits), Expect = 0.00053, P = 0.00053
 Identities = 22/52 (42%), Positives = 32/52 (61%)

Query:    99 HGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIH-TGMQG 148
             + ++ I TP++DALA   ++  N +     C+PSRASL+TG  P H  GM G
Sbjct:    22 YNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTG-LPQHQNGMYG 72


>UNIPROTKB|Q32KH1
            symbol:sulf2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060384 "innervation" evidence=IEA]
            [GO:0060348 "bone development" evidence=IEA] [GO:0048706 "embryonic
            skeletal system development" evidence=IEA] [GO:0040037 "negative
            regulation of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0035860 "glial cell-derived neurotrophic factor
            receptor signaling pathway" evidence=IEA] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0030201 "heparan
            sulfate proteoglycan metabolic process" evidence=IEA] [GO:0030177
            "positive regulation of Wnt receptor signaling pathway"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0009986 "cell surface"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
            filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
            GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG OrthoDB:EOG49KFPX EMBL:AAEX03013985
            EMBL:AAEX03013986 EMBL:BN000766 RefSeq:NP_001041555.1
            UniGene:Cfa.6393 STRING:Q32KH1 Ensembl:ENSCAFT00000017345
            GeneID:477254 KEGG:cfa:477254 InParanoid:Q32KH1 NextBio:20852774
            Uniprot:Q32KH1
        Length = 869

 Score = 132 (51.5 bits), Expect = 0.00060, Sum P(2) = 0.00060
 Identities = 77/337 (22%), Positives = 132/337 (39%)

Query:   119 LNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPIWGAEPRGVPLTERFLPEYLR 171
             +N     P+C PSR+S++TGKY +H     T  +    P W A+        R    YL 
Sbjct:    79 INAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPSWQAQHES-----RTFAVYLN 132

Query:   172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
               GY T   GK+ L  +   Y P   G++   G L     +Y++ L     +  E +G D
Sbjct:   133 STGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-RFYNYTLCRNGVK--EKHGFD 186

Query:   232 MRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ-----PVDKPXXXXXXXXXXXXXX 281
               ++  T     D+V  + T   ++    V ++        P D                
Sbjct:   187 YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSGLFPNASQHI 246

Query:   282 XXXXXEAPQETINQFQYITDPNR---RTYAAMVKK--------LDDSVGTVISALQRKGM 330
                   AP    +     T P +     +  M+++        +DDS+ T+ + L   G 
Sbjct:   247 TPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGE 306

Query:   331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ- 389
             L+N+ I++ +D+G    ++         G + PY        E  ++VP  +  P ++  
Sbjct:   307 LDNTYIVYTADHGYHIGQFGLVK-----GKSMPY--------EFDIRVPFYVRGPNVEAG 353

Query:   390 --NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   354 SLNPHIVLNI----DLAPTILDIAGLD---IPSDMDG 383

 Score = 38 (18.4 bits), Expect = 0.00060, Sum P(2) = 0.00060
 Identities = 6/23 (26%), Positives = 9/23 (39%)

Query:   607 QSHEQPDLVQADPKRFNDTWSPW 629
             Q  + PD+ +   K     W  W
Sbjct:   845 QRRKWPDMKRPSSKSLGQLWEGW 867


>UNIPROTKB|E1BZH8
            symbol:SULF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0003094 "glomerular filtration"
            evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005886
            "plasma membrane" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0032836
            "glomerular basement membrane development" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0048706 "embryonic skeletal system development"
            evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
            [GO:0060384 "innervation" evidence=IEA] [GO:0002063 "chondrocyte
            development" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
            GO:GO:0009986 GO:GO:0005509 GO:GO:0010575 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0040037
            GO:GO:0030201 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            OMA:PKYYGQG EMBL:AADN02019298 IPI:IPI00571119
            ProteinModelPortal:E1BZH8 Ensembl:ENSGALT00000007309 Uniprot:E1BZH8
        Length = 879

 Score = 123 (48.4 bits), Expect = 0.00061, P = 0.00061
 Identities = 79/357 (22%), Positives = 138/357 (38%)

Query:   100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIH-----TGMQG--PPI 151
             GS ++       + + G   +N     P+C PSR+S++TGKY +H     T  +    P 
Sbjct:    59 GSMQVMNKTRRIMEHGGAHFINAFVTTPMCCPSRSSILTGKY-VHNHNTYTNNENCSSPS 117

Query:   152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             W A+        R    YL   GY T   GK+ L  +   Y P   G++   G L     
Sbjct:   118 WQAQHE-----IRTFAVYLNNTGYRTAFFGKY-LNEYNGSYVP--PGWKEWVGLLKNS-R 168

Query:   212 YYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEY--ATDLFTKEAVQLIEDQ---- 262
             +Y++ L     +  E +G D  R+  T     D++  +  +  ++    V ++       
Sbjct:   169 FYNYTLCRNGVK--EKHGFDYSRDYLTDLITNDSITFFRISKKMYPHRPVLMVISHAAPH 226

Query:   263 -PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNR---RTYAAMVKK----- 313
              P D                      AP    +     T P +     +  M+++     
Sbjct:   227 GPEDSAPQYSHLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQT 286

Query:   314 ---LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
                +DDS+  + + L   G L+N+ II+ +D+G    ++         G + PY      
Sbjct:   287 LMSVDDSMEMIYNTLVETGELDNTYIIYTADHGYHIGQFGLVK-----GKSMPY------ 335

Query:   371 LWEGGVKVPAILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
               E  ++VP  +  P ++    NP + L +    D  PT+   AG D   +P ++DG
Sbjct:   336 --EFDIRVPFYVRGPNVEAGSLNPHIVLNI----DLAPTILDIAGLD---IPSDMDG 383


>UNIPROTKB|I3L643
            symbol:GNS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
            PANTHER:PTHR10342:SF5 EMBL:AEMK01192095 EMBL:FP700150
            Ensembl:ENSSSCT00000032527 OMA:FARAFAN Uniprot:I3L643
        Length = 369

 Score = 82 (33.9 bits), Expect = 0.00063, Sum P(2) = 0.00063
 Identities = 39/157 (24%), Positives = 66/157 (42%)

Query:   116 GIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPPIWG--AEPRGVPLTE-RFLPEYLR 171
             G+  ++ Y    +C PSRAS++TGKYP +  +    + G  +      + E    P  LR
Sbjct:    78 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIEEPNTFPAILR 137

Query:   172 EL-GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
              + GY T   GK     +  EY     G  +H   L     Y     S  Y+ T+ +NG 
Sbjct:   138 SVCGYQTFFAGK-----YLNEYGAPDAGGLAHVP-LGWSYWYALEKNSKYYNYTLSINGK 191

Query:   231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
               +   + + D    Y TD+    ++  ++ +   +P
Sbjct:   192 ARKHGENYSVD----YLTDVLANVSLDFLDYKSNSEP 224

 Score = 81 (33.6 bits), Expect = 0.00063, Sum P(2) = 0.00063
 Identities = 26/99 (26%), Positives = 48/99 (48%)

Query:   292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             T +  Q++ +  R+ +  ++  +DD V  ++  L+  G L N+ I + SDNG  T ++  
Sbjct:   278 TNSSIQFLDNAFRKRWQTLLS-VDDLVEKLVKRLEFNGELNNTYIFYTSDNGYHTGQF-- 334

Query:   352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
                      + P    K  L+E  +KVP ++  P I+ N
Sbjct:   335 ---------SLPID--KRQLYEFDIKVPLLVRGPGIKPN 362


>WB|WBGene00006308
            symbol:sul-1 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0015015 "heparan sulfate proteoglycan
            biosynthetic process, enzymatic modification" evidence=IMP]
            [GO:0017095 "heparan sulfate 6-O-sulfotransferase activity"
            evidence=IMP] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0009986
            GO:GO:0046872 GO:GO:0005795 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0015015
            GO:GO:0017095 EMBL:FO081118 PIR:T16584 RefSeq:NP_508560.1
            ProteinModelPortal:Q21376 SMR:Q21376 STRING:Q21376
            EnsemblMetazoa:K09C4.8 GeneID:180619 KEGG:cel:CELE_K09C4.8
            UCSC:K09C4.8 CTD:180619 WormBase:K09C4.8 HOGENOM:HOG000290161
            InParanoid:Q21376 KO:K14607 OMA:TVEDRWR NextBio:910136
            Uniprot:Q21376
        Length = 709

 Score = 105 (42.0 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 39/139 (28%), Positives = 65/139 (46%)

Query:   126 PVCTPSRASLMTGKYP----IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
             P+C PSR++++TG Y     +HT  Q     G E R V   ++ +  YL+E GY T  +G
Sbjct:    77 PICCPSRSTILTGLYVHNHHVHTNNQN--CTGVEWRKVH-EKKSIGVYLQEAGYRTAYLG 133

Query:   182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
             K+ L  +   Y P   G++     +           S  Y+ T+  NG   R    + ++
Sbjct:   134 KY-LNEYDGSYIP--PGWDEWHAIVKN---------SKFYNYTMNSNGE--REKFGSEYE 179

Query:   242 TVGEYATDLFTKEAVQLIE 260
                +Y TDL T  +++ I+
Sbjct:   180 K--DYFTDLVTNRSLKFID 196

 Score = 63 (27.2 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 22/103 (21%), Positives = 45/103 (43%)

Query:   300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
             TD   R     ++ +D+ +  + + L+    L N+  I+ SD+G    ++         G
Sbjct:   267 TDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDHGYHLGQFGLLK-----G 321

Query:   360 SNYPYR-GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHIS 401
              N PY   ++   +  G  +P  +   +I  N  ++  M+HI+
Sbjct:   322 KNMPYEFDIRVPFFMRGPGIPRNVTFNEIVTNVDIAPTMLHIA 364


>RGD|708554
            symbol:Sulf1 "sulfatase 1" species:10116 "Rattus norvegicus"
            [GO:0001502 "cartilage condensation" evidence=ISS] [GO:0001822
            "kidney development" evidence=ISO;ISS] [GO:0001937 "negative
            regulation of endothelial cell proliferation" evidence=IEA;ISO;ISS]
            [GO:0002063 "chondrocyte development" evidence=IEA;ISO;ISS]
            [GO:0003094 "glomerular filtration" evidence=IEA;ISO] [GO:0004065
            "arylsulfatase activity" evidence=IEA;ISO;ISS] [GO:0005509 "calcium
            ion binding" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA;ISO;NAS;IDA] [GO:0005794 "Golgi apparatus"
            evidence=IEA;IDA] [GO:0005795 "Golgi stack" evidence=NAS]
            [GO:0005886 "plasma membrane" evidence=IEA;ISO] [GO:0007155 "cell
            adhesion" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=NAS] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA;ISO;ISS] [GO:0009986 "cell surface"
            evidence=IEA;ISO;ISS;NAS;IDA] [GO:0010575 "positive regulation
            vascular endothelial growth factor production" evidence=IEA;ISO]
            [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA;ISO;ISS] [GO:0016525 "negative regulation of
            angiogenesis" evidence=IEA;ISO;ISS] [GO:0018741 "alkyl sulfatase
            activity" evidence=NAS] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IEA;ISO;ISS] [GO:0030201
            "heparan sulfate proteoglycan metabolic process"
            evidence=IEA;ISO;ISS] [GO:0030336 "negative regulation of cell
            migration" evidence=IEA;ISO;ISS] [GO:0030513 "positive regulation
            of BMP signaling pathway" evidence=IEA;ISO;ISS] [GO:0032836
            "glomerular basement membrane development" evidence=IEA;ISO]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA;ISO;ISS] [GO:0036022 "limb joint
            morphogenesis" evidence=ISS] [GO:0040036 "regulation of fibroblast
            growth factor receptor signaling pathway" evidence=ISO] [GO:0040037
            "negative regulation of fibroblast growth factor receptor signaling
            pathway" evidence=IEA;ISO;ISS] [GO:0045121 "membrane raft"
            evidence=IEA;ISO;ISS] [GO:0048010 "vascular endothelial growth
            factor receptor signaling pathway" evidence=IEA;ISO;ISS]
            [GO:0048661 "positive regulation of smooth muscle cell
            proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=IEA;ISO;ISS] [GO:0051216 "cartilage
            development" evidence=ISO;ISS] [GO:0060348 "bone development"
            evidence=IEA;ISO;ISS] [GO:0060384 "innervation"
            evidence=IEA;ISO;ISS] [GO:0060686 "negative regulation of prostatic
            bud formation" evidence=IEA;ISO;ISS] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0048706 GO:GO:0048010 GO:GO:0018741 GO:GO:0060686
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 CTD:23213 OrthoDB:EOG4VT5WH
            EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1 UniGene:Rn.161961
            ProteinModelPortal:Q8VI60 STRING:Q8VI60 GeneID:171396
            KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244 ArrayExpress:Q8VI60
            Genevestigator:Q8VI60 GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
        Length = 870

 Score = 129 (50.5 bits), Expect = 0.0010, Sum P(2) = 0.0010
 Identities = 82/370 (22%), Positives = 139/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QALHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G L N+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P I+    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDT---PSDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397

 Score = 39 (18.8 bits), Expect = 0.0010, Sum P(2) = 0.0010
 Identities = 8/17 (47%), Positives = 11/17 (64%)

Query:   483 LVLGTQENGTMDGYYGQ 499
             L +GT+E G  D + GQ
Sbjct:   847 LDVGTKEGGNYDPHRGQ 863


>UNIPROTKB|Q8VI60
            symbol:Sulf1 "Extracellular sulfatase Sulf-1" species:10116
            "Rattus norvegicus" [GO:0005509 "calcium ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554
            GO:GO:0005783 GO:GO:0005886 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
            GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0004065 GO:GO:0048706 GO:GO:0048010 GO:GO:0018741
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            OrthoDB:EOG4VT5WH EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1
            UniGene:Rn.161961 ProteinModelPortal:Q8VI60 STRING:Q8VI60
            GeneID:171396 KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244
            ArrayExpress:Q8VI60 Genevestigator:Q8VI60
            GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
        Length = 870

 Score = 129 (50.5 bits), Expect = 0.0010, Sum P(2) = 0.0010
 Identities = 82/370 (22%), Positives = 139/370 (37%)

Query:   100 GSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP----IHTGMQG--PPIW 152
             GS ++       + + G    N +   P+C PSR+S++TGKY     ++T  +    P W
Sbjct:    58 GSLQVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGKYVHNHNVYTNNENCSSPSW 117

Query:   153 GA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A  EPR   +       YL   GY T   GK+ L  +   Y P   G+    G +    
Sbjct:   118 QALHEPRTFAV-------YLNNTGYRTAFFGKY-LNEYNGSYIP--PGWREWLGLIKNS- 166

Query:   211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW---DTVGEYATD--LFTKEAVQLIEDQ--- 262
              +Y++ +     +  E +G D  ++  T     +++  +     ++    V ++      
Sbjct:   167 RFYNYTVCRNGIK--EKHGFDYAKDYFTDLITNESINYFKMSKRMYPHRPVMMVISHAAP 224

Query:   263 --PVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDP---------N--RRTYAA 309
               P D                      AP    +     T P         N  +R    
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNVLQRKRLQ 284

Query:   310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
              +  +DDSV  + + L   G L N+ II+ +D+G    ++         G + PY     
Sbjct:   285 TLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLVK-----GKSMPY----- 334

Query:   370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
                +  ++VP  +  P I+    V   +++I D  PT+   AG DT   P ++DG     
Sbjct:   335 ---DFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGLDT---PSDVDGKSVLK 387

Query:   430 SLLLNTPSRR 439
              L L  P  R
Sbjct:   388 LLDLEKPGNR 397

 Score = 39 (18.8 bits), Expect = 0.0010, Sum P(2) = 0.0010
 Identities = 8/17 (47%), Positives = 11/17 (64%)

Query:   483 LVLGTQENGTMDGYYGQ 499
             L +GT+E G  D + GQ
Sbjct:   847 LDVGTKEGGNYDPHRGQ 863


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.134   0.412    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      632       584   0.00082  120 3  11 22  0.42    34
                                                     36  0.45    37
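
Note on the score lines in this report: the bit values shown in parentheses appear consistent with the standard Karlin-Altschul conversion applied with the gapped parameters from the Q=9,R=2 row above (Lambda 0.244, K 0.0300), and the single-hit P values with P = 1 - exp(-E). The following is a minimal Python sketch under those assumptions. The function names are illustrative and are not part of any BLAST tool, and the sketch does not attempt to reproduce the Sum P(N) statistics.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of this report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(expect):
    """Poisson probability of at least one chance hit: P = 1 - exp(-E)."""
    # expm1 keeps precision when E is very small.
    return -math.expm1(-expect)


if __name__ == "__main__":
    # Raw scores taken from score lines earlier in this report.
    for raw in (98, 125, 132):
        print(raw, round(bit_score(raw), 1))
    print(p_from_expect(0.00041))
```

With these parameters, raw scores 98, 125, and 132 round to 39.6, 49.1, and 51.5 bits, which matches the score lines reported for the SGSH, Sulf1, and sulf2 hits.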


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  205
  No. of states in DFA:  629 (67 KB)
  Total size of DFA:  364 KB (2177 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:01
  No. of threads or processors used:  24
  Search cpu time:  47.77u 0.10s 47.87t   Elapsed:  00:00:10
  Total cpu time:  47.83u 0.10s 47.93t   Elapsed:  00:00:11
  Start:  Thu Aug 15 11:03:05 2013   End:  Thu Aug 15 11:03:16 2013
WARNINGS ISSUED:  1
