BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy1088
MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPP
HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP
IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES
HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE
PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE
QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG
IVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE
NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH
EYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI
SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID
DEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQ
CGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEE
EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIA
PCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK
KKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIF
GDDLK

High Scoring Gene Products

Symbol, full name Information P value
CG8646 protein from Drosophila melanogaster 5.1e-128
CG32191 protein from Drosophila melanogaster 4.3e-103
CG7402 protein from Drosophila melanogaster 9.1e-101
CG7408 protein from Drosophila melanogaster 8.5e-99
ARSB
Arylsulfatase B
protein from Homo sapiens 2.8e-96
ARSB
ARSB protein
protein from Bos taurus 1.8e-94
arsb
Arylsulfatase B
protein from Canis lupus familiaris 2.3e-92
Arsb
arylsulfatase B
gene from Rattus norvegicus 2.0e-91
Arsb
arylsulfatase B
protein from Mus musculus 2.0e-91
ARSB
Uncharacterized protein
protein from Gallus gallus 2.0e-90
ARSJ
Uncharacterized protein
protein from Bos taurus 1.1e-89
ARSJ
Arylsulfatase J
protein from Homo sapiens 2.2e-89
arsj
Uncharacterized protein
protein from Canis lupus familiaris 3.6e-89
Arsi
arylsulfatase family, member I
gene from Rattus norvegicus 2.5e-88
ARSI
Uncharacterized protein
protein from Bos taurus 8.3e-88
ARSI
Arylsulfatase I
protein from Homo sapiens 1.7e-87
ARSI
Arylsulfatase I
protein from Canis lupus familiaris 3.5e-87
LOC100517463
Uncharacterized protein
protein from Sus scrofa 1.3e-85
Arsj
arylsulfatase J
protein from Mus musculus 1.6e-85
Arsj
arylsulfatase family, member J
gene from Rattus norvegicus 2.0e-85
ARSI
Uncharacterized protein
protein from Gallus gallus 4.6e-84
Arsi
arylsulfatase i
protein from Mus musculus 9.5e-84
ARSJ
Uncharacterized protein
protein from Canis lupus familiaris 1.1e-69
ARSJ
Uncharacterized protein
protein from Sus scrofa 1.2e-69
ARSJ
Uncharacterized protein
protein from Gallus gallus 1.1e-66
sul-3 gene from Caenorhabditis elegans 3.1e-60
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Homo sapiens 2.1e-42
Galns
galactosamine (N-acetyl)-6-sulfate sulfatase
protein from Mus musculus 2.0e-41
F1S2F1
Uncharacterized protein
protein from Sus scrofa 4.1e-40
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Canis lupus familiaris 2.4e-39
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Canis lupus familiaris 2.4e-39
GALNS
Uncharacterized protein
protein from Gallus gallus 5.2e-39
Galns
galactosamine (N-acetyl)-6-sulfate sulfatase
gene from Rattus norvegicus 8.2e-39
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Sus scrofa 1.1e-38
GALNS
Uncharacterized protein
protein from Bos taurus 4.7e-38
galns
galactosamine (N-acetyl)-6-sulfate sulfatase
gene_product from Danio rerio 9.8e-37
F1RL71
Uncharacterized protein
protein from Sus scrofa 3.5e-34
ARSA
Arylsulfatase A
protein from Bos taurus 2.2e-33
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Sus scrofa 2.5e-32
arsa
arylsulfatase A
gene_product from Danio rerio 9.9e-32
Arsa
arylsulfatase A
gene from Rattus norvegicus 2.8e-30
ARSA
Arylsulfatase A
protein from Homo sapiens 8.4e-30
ARSA
Uncharacterized protein
protein from Canis lupus familiaris 2.2e-29
Arsa
arylsulfatase A
protein from Mus musculus 6.4e-28
Arse
arylsulfatase E (chondrodysplasia punctata 1)
gene from Rattus norvegicus 2.2e-26
STS
Uncharacterized protein
protein from Canis lupus familiaris 2.3e-26
STS
Uncharacterized protein
protein from Bos taurus 4.1e-26
STS
Uncharacterized protein
protein from Canis lupus familiaris 4.1e-26
STS
Steryl-sulfatase
protein from Homo sapiens 5.8e-26
aslA
arylsulfatase
protein from Escherichia coli K-12 1.6e-25
ARSA
Uncharacterized protein
protein from Gallus gallus 1.6e-25
GALNS
N-acetylgalactosamine-6-sulfatase
protein from Homo sapiens 2.1e-25
STS
Uncharacterized protein
protein from Gallus gallus 3.2e-25
ARSF
Uncharacterized protein
protein from Canis lupus familiaris 4.3e-25
sts
steroid sulfatase (microsomal), arylsulfatase C, isozyme S
gene_product from Danio rerio 4.4e-25
CPS_2368
Putative N-acetylglucosamine-6-sulfatase
protein from Colwellia psychrerythraea 34H 5.2e-25
CPS_2368
putative N-acetylglucosamine-6-sulfatase
protein from Colwellia psychrerythraea 34H 5.2e-25
STS
Uncharacterized protein
protein from Sus scrofa 7.1e-25
STS
Uncharacterized protein
protein from Sus scrofa 7.2e-25
Sts
steroid sulfatase (microsomal), isozyme S
gene from Rattus norvegicus 9.0e-25
SPO_3286
arylsulfatase
protein from Ruegeria pomeroyi DSS-3 1.1e-24
ARSH
Uncharacterized protein
protein from Gallus gallus 1.8e-24
ARSH
Uncharacterized protein
protein from Bos taurus 4.0e-24
ARSF
Arylsulfatase F
protein from Homo sapiens 1.1e-23
ARSE
Arylsulfatase E
protein from Homo sapiens 1.9e-23
Sts
steroid sulfatase
protein from Mus musculus 3.1e-23
CPS_2364
sulfatase family protein
protein from Colwellia psychrerythraea 34H 3.3e-23
ARSE
Arylsulfatase E
protein from Homo sapiens 6.9e-23
ARSH
Arylsulfatase H
protein from Canis lupus familiaris 1.1e-22
CPS_0660
sulfatase family protein
protein from Colwellia psychrerythraea 34H 1.6e-22
Arsg
arylsulfatase G
gene from Rattus norvegicus 1.8e-22
ARSH
Arylsulfatase H
protein from Homo sapiens 1.8e-22
ARSH
Arylsulfatase H
protein from Canis lupus familiaris 2.2e-22
arse
Arylsulfatase E
protein from Canis lupus familiaris 3.5e-22
Arsg
arylsulfatase G
protein from Mus musculus 3.8e-22
ARSE
Uncharacterized protein
protein from Bos taurus 9.0e-22
ARSG
Uncharacterized protein
protein from Gallus gallus 1.2e-21
ARSG
Arylsulfatase G
protein from Homo sapiens 2.3e-21
ARSD
Uncharacterized protein
protein from Canis lupus familiaris 3.5e-21
arsh
arylsulfatase H
gene_product from Danio rerio 6.8e-21
ARSD
Arylsulfatase D
protein from Homo sapiens 7.1e-21
orf19.1608 gene_product from Candida albicans 1.0e-20
ARSD
Uncharacterized protein
protein from Gallus gallus 1.1e-20
CPS_2985
sulfatase family protein
protein from Colwellia psychrerythraea 34H 1.2e-20
ARSE
Arylsulfatase E
protein from Homo sapiens 1.3e-20
arsg
arylsulfatase G
gene_product from Danio rerio 2.4e-20
CPS_2983
putative arylsulfatase
protein from Colwellia psychrerythraea 34H 3.9e-20
ARSG
Uncharacterized protein
protein from Bos taurus 4.4e-20
ARSE
Uncharacterized protein
protein from Gallus gallus 1.1e-19
ARSG
Arylsulfatase G
protein from Canis lupus familiaris 1.2e-19
CPS_2984
sulfatase family protein
protein from Colwellia psychrerythraea 34H 2.0e-19
ydeN
putative sulfatase
protein from Escherichia coli K-12 2.8e-19
LOC100521576
Uncharacterized protein
protein from Sus scrofa 5.0e-19
ARSE
Uncharacterized protein
protein from Canis lupus familiaris 5.2e-19

The BLAST search returned 5 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy1088
        (905 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

FB|FBgn0033763 - symbol:CG8646 species:7227 "Drosophila m...  1091  5.1e-128  3
FB|FBgn0052191 - symbol:CG32191 species:7227 "Drosophila ...   796  4.3e-103  4
FB|FBgn0036768 - symbol:CG7402 species:7227 "Drosophila m...   880  9.1e-101  3
FB|FBgn0036765 - symbol:CG7408 species:7227 "Drosophila m...   768  8.5e-99   4
UNIPROTKB|P15848 - symbol:ARSB "Arylsulfatase B" species:...   825  2.8e-96   4
UNIPROTKB|A6QLZ3 - symbol:ARSB "Uncharacterized protein" ...   843  1.8e-94   3
UNIPROTKB|Q32KI4 - symbol:arsb "Arylsulfatase B" species:...   829  2.3e-92   3
RGD|2158 - symbol:Arsb "arylsulfatase B" species:10116 "R...   824  2.0e-91   3
MGI|MGI:88075 - symbol:Arsb "arylsulfatase B" species:100...   823  2.0e-91   3
UNIPROTKB|F1P099 - symbol:ARSB "Uncharacterized protein" ...   779  2.0e-90   4
UNIPROTKB|E1BKH3 - symbol:ARSJ "Uncharacterized protein" ...   764  1.1e-89   4
UNIPROTKB|Q5FYB0 - symbol:ARSJ "Arylsulfatase J" species:...   766  2.2e-89   4
UNIPROTKB|Q32KH6 - symbol:arsj "Uncharacterized protein" ...   761  3.6e-89   4
RGD|1310242 - symbol:Arsi "arylsulfatase family, member I...   743  2.5e-88   4
UNIPROTKB|E1BIN3 - symbol:ARSI "Uncharacterized protein" ...   744  8.3e-88   4
UNIPROTKB|Q5FYB1 - symbol:ARSI "Arylsulfatase I" species:...   738  1.7e-87   4
UNIPROTKB|Q32KH7 - symbol:ARSI "Arylsulfatase I" species:...   738  3.5e-87   4
UNIPROTKB|F1RL69 - symbol:LOC100517463 "Uncharacterized p...   723  1.3e-85   4
MGI|MGI:2443513 - symbol:Arsj "arylsulfatase J" species:1...   758  1.6e-85   3
RGD|1307640 - symbol:Arsj "arylsulfatase family, member J...   757  2.0e-85   3
UNIPROTKB|F1P095 - symbol:ARSB "Uncharacterized protein" ...   779  4.5e-84   2
UNIPROTKB|F1NQP9 - symbol:ARSI "Uncharacterized protein" ...   747  4.6e-84   3
MGI|MGI:2670959 - symbol:Arsi "arylsulfatase i" species:1...   743  9.5e-84   3
UNIPROTKB|F1NT29 - symbol:ARSB "Uncharacterized protein" ...   779  1.9e-83   2
UNIPROTKB|F1P098 - symbol:ARSB "Uncharacterized protein" ...   779  2.5e-79   2
UNIPROTKB|F6PKT4 - symbol:ARSJ "Uncharacterized protein" ...   577  1.1e-69   4
UNIPROTKB|F1S147 - symbol:ARSJ "Uncharacterized protein" ...   575  1.2e-69   4
UNIPROTKB|F1NH07 - symbol:ARSJ "Uncharacterized protein" ...   560  1.1e-66   3
WB|WBGene00006310 - symbol:sul-3 species:6239 "Caenorhabd...   544  3.1e-60   3
UNIPROTKB|P34059 - symbol:GALNS "N-acetylgalactosamine-6-...   440  2.1e-42   2
MGI|MGI:1355303 - symbol:Galns "galactosamine (N-acetyl)-...   433  2.0e-41   2
UNIPROTKB|F1S2F1 - symbol:F1S2F1 "Uncharacterized protein...   435  4.1e-40   1
UNIPROTKB|F1PHF0 - symbol:GALNS "N-acetylgalactosamine-6-...   428  2.4e-39   1
UNIPROTKB|Q32KH5 - symbol:GALNS "N-acetylgalactosamine-6-...   428  2.4e-39   1
UNIPROTKB|F1NW57 - symbol:GALNS "Uncharacterized protein"...   409  5.2e-39   2
RGD|1565391 - symbol:Galns "galactosamine (N-acetyl)-6-su...   423  8.2e-39   1
UNIPROTKB|Q8WNQ7 - symbol:GALNS "N-acetylgalactosamine-6-...   422  1.1e-38   1
UNIPROTKB|F1MU84 - symbol:GALNS "Uncharacterized protein"...   416  4.7e-38   1
ZFIN|ZDB-GENE-070112-1152 - symbol:galns "galactosamine (...   391  9.8e-37   2
UNIPROTKB|F1RL71 - symbol:F1RL71 "Uncharacterized protein...   205  3.5e-34   5
UNIPROTKB|Q08DD1 - symbol:ARSA "Arylsulfatase A" species:...   358  2.2e-33   3
UNIPROTKB|F1S6M1 - symbol:GALNS "N-acetylgalactosamine-6-...   363  2.5e-32   1
ZFIN|ZDB-GENE-050320-118 - symbol:arsa "arylsulfatase A" ...   345  9.9e-32   3
UNIPROTKB|Q32KK2 - symbol:Arsa "Arylsulfatase A" species:...   339  1.1e-30   3
RGD|1310381 - symbol:Arsa "arylsulfatase A" species:10116...   339  2.8e-30   3
UNIPROTKB|P15289 - symbol:ARSA "Arylsulfatase A" species:...   349  8.4e-30   2
UNIPROTKB|F6PKZ1 - symbol:ARSA "Uncharacterized protein" ...   344  2.2e-29   2
MGI|MGI:88077 - symbol:Arsa "arylsulfatase A" species:100...   335  6.4e-28   2
RGD|1304917 - symbol:Arse "arylsulfatase E (chondrodyspla...   263  2.2e-26   3
UNIPROTKB|F1Q1V3 - symbol:STS "Uncharacterized protein" s...   246  2.3e-26   4
UNIPROTKB|F1MFZ8 - symbol:STS "Uncharacterized protein" s...   252  4.1e-26   3
UNIPROTKB|F1Q1V2 - symbol:STS "Uncharacterized protein" s...   246  4.1e-26   4
UNIPROTKB|P08842 - symbol:STS "Steryl-sulfatase" species:...   248  5.8e-26   4
UNIPROTKB|P25549 - symbol:aslA "arylsulfatase" species:83...   319  1.6e-25   1
UNIPROTKB|F1NWF7 - symbol:ARSA "Uncharacterized protein" ...   308  1.6e-25   2
UNIPROTKB|F5H325 - symbol:GALNS "N-acetylgalactosamine-6-...   285  2.1e-25   3
UNIPROTKB|F1NGC8 - symbol:STS "Uncharacterized protein" s...   239  3.2e-25   3
UNIPROTKB|F6PN86 - symbol:ARSF "Uncharacterized protein" ...   236  4.3e-25   3
ZFIN|ZDB-GENE-030717-5 - symbol:sts "steroid sulfatase (m...   236  4.4e-25   3
UNIPROTKB|Q482D2 - symbol:CPS_2368 "Putative N-acetylgluc...   275  5.2e-25   3
TIGR_CMR|CPS_2368 - symbol:CPS_2368 "putative N-acetylglu...   275  5.2e-25   3
UNIPROTKB|I3LBW8 - symbol:STS "Uncharacterized protein" s...   242  7.1e-25   3
UNIPROTKB|K7GLQ3 - symbol:STS "Uncharacterized protein" s...   242  7.2e-25   3
RGD|3783 - symbol:Sts "steroid sulfatase (microsomal), is...   251  9.0e-25   3
TIGR_CMR|SPO_3286 - symbol:SPO_3286 "arylsulfatase" speci...   254  1.1e-24   4
UNIPROTKB|F1NFQ0 - symbol:ARSH "Uncharacterized protein" ...   259  1.8e-24   3
UNIPROTKB|G3N2T7 - symbol:ARSH "Uncharacterized protein" ...   229  4.0e-24   3
UNIPROTKB|F1NFQ1 - symbol:ARSH "Uncharacterized protein" ...   259  5.0e-24   3
UNIPROTKB|P54793 - symbol:ARSF "Arylsulfatase F" species:...   233  1.1e-23   3
UNIPROTKB|F5GYY5 - symbol:ARSE "Arylsulfatase E" species:...   260  1.9e-23   3
MGI|MGI:98438 - symbol:Sts "steroid sulfatase" species:10...   243  3.1e-23   3
TIGR_CMR|CPS_2364 - symbol:CPS_2364 "sulfatase family pro...   295  3.3e-23   2
UNIPROTKB|P51690 - symbol:ARSE "Arylsulfatase E" species:...   254  6.9e-23   3
UNIPROTKB|Q32KH8 - symbol:ARSH "Arylsulfatase H" species:...   232  1.1e-22   3
POMBASE|SPBPB10D8.02c - symbol:SPBPB10D8.02c "arylsulfata...   233  1.6e-22   3
TIGR_CMR|CPS_0660 - symbol:CPS_0660 "sulfatase family pro...   293  1.6e-22   2
RGD|1306571 - symbol:Arsg "arylsulfatase G" species:10116...   264  1.8e-22   3
UNIPROTKB|Q5FYA8 - symbol:ARSH "Arylsulfatase H" species:...   230  1.8e-22   3
UNIPROTKB|F1PY85 - symbol:ARSH "Arylsulfatase H" species:...   232  2.2e-22   3
UNIPROTKB|Q32KI1 - symbol:arse "Uncharacterized protein" ...   244  3.5e-22   3
MGI|MGI:1921258 - symbol:Arsg "arylsulfatase G" species:1...   257  3.8e-22   2
UNIPROTKB|G5E629 - symbol:ARSE "Uncharacterized protein" ...   243  9.0e-22   3
UNIPROTKB|E1BU03 - symbol:ARSG "Uncharacterized protein" ...   283  1.2e-21   1
UNIPROTKB|Q96EG1 - symbol:ARSG "Arylsulfatase G" species:...   252  2.3e-21   2
UNIPROTKB|F1PYB4 - symbol:ARSD "Uncharacterized protein" ...   231  3.5e-21   3
ZFIN|ZDB-GENE-081104-120 - symbol:arsh "arylsulfatase H" ...   238  6.8e-21   3
UNIPROTKB|P51689 - symbol:ARSD "Arylsulfatase D" species:...   230  7.1e-21   3
CGD|CAL0006319 - symbol:orf19.1608 species:5476 "Candida ...   238  1.0e-20   4
UNIPROTKB|E1BYN0 - symbol:ARSD "Uncharacterized protein" ...   243  1.1e-20   3
TIGR_CMR|CPS_2985 - symbol:CPS_2985 "sulfatase family pro...   281  1.2e-20   2
UNIPROTKB|C9J5G7 - symbol:ARSE "Arylsulfatase E" species:...   254  1.3e-20   1
ZFIN|ZDB-GENE-060503-154 - symbol:arsg "arylsulfatase G" ...   274  2.4e-20   3
TIGR_CMR|CPS_2983 - symbol:CPS_2983 "putative arylsulfata...   270  3.9e-20   1
UNIPROTKB|F1N665 - symbol:ARSG "Uncharacterized protein" ...   249  4.4e-20   1
UNIPROTKB|F1NFL4 - symbol:F1NFL4 "Uncharacterized protein...   258  1.1e-19   1
UNIPROTKB|Q32KH9 - symbol:ARSG "Arylsulfatase G" species:...   266  1.2e-19   1
TIGR_CMR|CPS_2984 - symbol:CPS_2984 "sulfatase family pro...   250  2.0e-19   2
UNIPROTKB|P77318 - symbol:ydeN "putative sulfatase" speci...   257  2.8e-19   2
UNIPROTKB|F1RV22 - symbol:ARSG "Uncharacterized protein" ...   260  5.0e-19   1
UNIPROTKB|F1PYB3 - symbol:ARSE "Uncharacterized protein" ...   239  5.2e-19   1

WARNING:  Descriptions of 223 database sequences were not reported due to the
          limiting value of parameter V = 100.


>FB|FBgn0033763 [details] [associations]
            symbol:CG8646 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE013599 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            KO:K01135 GO:GO:0003943 GeneTree:ENSGT00560000077076 EMBL:AY071072
            RefSeq:NP_610807.3 UniGene:Dm.6132 HSSP:P15848 SMR:Q8SZ72
            STRING:Q8SZ72 EnsemblMetazoa:FBtr0301237 GeneID:36394
            KEGG:dme:Dmel_CG8646 FlyBase:FBgn0033763 InParanoid:Q8SZ72
            OMA:FRGSAQI OrthoDB:EOG4W6MBG GenomeRNAi:36394 NextBio:798315
            Uniprot:Q8SZ72
        Length = 562

 Score = 1091 (389.1 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
 Identities = 202/326 (61%), Positives = 243/326 (74%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             S   P+IIFILADDLG+NDVGFHG  +IPTPNIDALAYSGIIL  YY   +CTPSRSA+M
Sbjct:    22 SPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALM 81

Query:   116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             TGK+PIHTGMQH VLY  E  GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +
Sbjct:    82 TGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLY 141

Query:   176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
             RGF SH+G+W+GHQDY DH+A E   WGLDMR   + A+DLHG Y+TDV T  +V +I N
Sbjct:   142 RGFSSHVGFWSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIAN 201

Query:   236 HS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
             H+ T  PLFLY+AHAA HS+NPY PL  PD+ +    HI ++KR KFAA++ K+D SVG+
Sbjct:   202 HNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQ 261

Query:   295 VVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSP 354
             +V+ L +  ML NSII+F SD             SN+PL+GVKNTLWEGGVR AGL+WSP
Sbjct:   262 IVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSP 321

Query:   355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
             LL+    V+ Q +H+ DWLPTLL AA
Sbjct:   322 LLKKSQRVSNQTMHIIDWLPTLLEAA 347

 Score = 137 (53.3 bits), Expect = 3.2e-20, Sum P(3) = 3.2e-20
 Identities = 51/184 (27%), Positives = 83/184 (45%)

Query:   383 SDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY--ENG--THEYNPKYE 438
             S IPNY       ++ + +NS+ +  +   + N   +ENS   +  +NG     +N  + 
Sbjct:   238 SHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNM--LENSIIIFSSDNGGPAQGFNLNFA 295

Query:   439 NRYE-NGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYN-IPRLENSINGNG 495
             + Y   G    N  +E         ++ P  + +  R  N T H  + +P L  +  G  
Sbjct:   296 SNYPLKGVK--NTLWEGGVRAAGLMWS-PLLKKSQ-RVSNQTMHIIDWLPTLLEAAGGQP 351

Query:   496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
                N S       +IDG  +W  L +++ S R  +LHNIDD W  +AL+ G WKLVK  +
Sbjct:   352 ALSNLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN 405

Query:   556 INGN 559
               G+
Sbjct:   406 YRGS 409

 Score = 122 (48.0 bits), Expect = 3.2e-20, Sum P(3) = 3.2e-20
 Identities = 24/34 (70%), Positives = 25/34 (73%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
             MQH VLY  E  GLPL EKILPQYL ELGY + I
Sbjct:    91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124

 Score = 112 (44.5 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
 Identities = 20/45 (44%), Positives = 29/45 (64%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
             YP+V++ +  EL   N TAV P NKP D   DP+ +++ W+ FGD
Sbjct:   499 YPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 543

 Score = 112 (44.5 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
 Identities = 27/69 (39%), Positives = 40/69 (57%)

Query:   651 RKLRDAASIQC-GPVKE-VPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEV 706
             +++R AA++ C G   +   C      APCLF I++DPCE+ NLA +   + +N   TE+
Sbjct:   452 QRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLA-KQYPEVVNALMTEL 510

Query:   707 GRFNQIAYP 715
              RFN  A P
Sbjct:   511 ERFNATAVP 519

 Score = 90 (36.7 bits), Expect = 1.0e-125, Sum P(3) = 1.0e-125
 Identities = 28/74 (37%), Positives = 41/74 (55%)

Query:   757 RKLRDAASIQC-GPVKE-VPCEPQI--APCLFDIKNDPCEKNNLADR-SEVQRINHYTTE 811
             +++R AA++ C G   +   C      APCLF I++DPCE+ NLA +  EV  +N   TE
Sbjct:   452 QRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEV--VNALMTE 509

Query:   812 VGYLDPKQRFNQIA 825
             +      +RFN  A
Sbjct:   510 L------ERFNATA 517

 Score = 89 (36.4 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
 Identities = 16/41 (39%), Positives = 26/41 (63%)

Query:   569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 609
             ++   +IDG  +W  L +++ S R  +LHNIDD W  +AL+
Sbjct:   354 SNLSKQIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALS 394

 Score = 37 (18.1 bits), Expect = 3.0e-112, Sum P(2) = 3.0e-112
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   362 VAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
             +A+QY  V + L T L   N + +P           PR+ N
Sbjct:   495 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWN 535


>FB|FBgn0052191 [details] [associations]
            symbol:CG32191 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0003943 HSSP:P08842
            RefSeq:NP_730304.2 UniGene:Dm.15184 ProteinModelPortal:Q8IQS4
            SMR:Q8IQS4 MINT:MINT-943884 PRIDE:Q8IQS4 GeneID:317903
            KEGG:dme:Dmel_CG32191 UCSC:CG32191-RA FlyBase:FBgn0052191
            InParanoid:Q8IQS4 OrthoDB:EOG43FFBZ PhylomeDB:Q8IQS4
            GenomeRNAi:317903 NextBio:844132 Bgee:Q8IQS4 Uniprot:Q8IQS4
        Length = 554

 Score = 796 (285.3 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
 Identities = 162/345 (46%), Positives = 219/345 (63%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   L  V  D  A++  P+II I+ADD+G++DV F G  +  TPNIDALAY G +L   
Sbjct:     9 LLLCLQRVKSDESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRL 68

Query:   102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
             Y   +CTPSR A+++G++PIHTG QH V+   E   L L+  ++P+  KE GY T +VGK
Sbjct:    69 YAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGK 128

Query:   162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLH 217
             WHLGF + EYTPT RGF+ H GYW  + DYF   ++ M +     G D RR++E      
Sbjct:   129 WHLGFSRPEYTPTRRGFDYHFGYWGAYIDYFQRRSK-MPVANYSLGYDFRRNMELECRDR 187

Query:   218 GKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
             G Y TD+ TAEA  +I +H+  E PLFL L+H A H+AN  +PLQAP+  +    +I+D 
Sbjct:   188 GVYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDP 247

Query:   277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV 336
              R K+AA++ KLD+SVG+++ AL     L NSI++F SD             SN+PLRG 
Sbjct:   248 NRRKYAAMISKLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQ 307

Query:   337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
             KNT WEGGVR AG IWS  L++RG +  Q ++V+DWLPTL  AA+
Sbjct:   308 KNTPWEGGVRVAGAIWSSGLQARGSIFRQPLYVADWLPTLSRAAD 352

 Score = 117 (46.2 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
 Identities = 33/92 (35%), Positives = 51/92 (55%)

Query:   509 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
             ++DGID+W  LS   + P     ILH +DD W++SAL  G+WK V       NGT+ +  
Sbjct:   360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYV-------NGTTASGR 412

Query:   567 NDN--SYQNEIDGIDV----WSVLSRNEPSKR 592
              D+  +Y+ E+D +D     ++V  RN  + R
Sbjct:   413 YDSVLTYR-ELDDLDPRDSRYAVTVRNSATSR 443

 Score = 105 (42.0 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
 Identities = 25/76 (32%), Positives = 38/76 (50%)

Query:   623 RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKN 682
             RY V +        LS    R      +   R  A+++CG ++   C P +  CL+DI +
Sbjct:   431 RYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNPLLEECLYDILS 489

Query:   683 DPCEKNNL--ADRSED 696
             DPCE+NNL  ++R  D
Sbjct:   490 DPCEQNNLVYSERHSD 505

 Score = 100 (40.3 bits), Expect = 1.4e-102, Sum P(4) = 1.4e-102
 Identities = 17/37 (45%), Positives = 26/37 (70%)

Query:   760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
             R  A+++CG ++   C P +  CL+DI +DPCE+NNL
Sbjct:   462 RRLAAVRCGDLQR-SCNPLLEECLYDILSDPCEQNNL 497

 Score = 82 (33.9 bits), Expect = 2.0e-99, Sum P(4) = 2.0e-99
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query:   574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISAL 608
             ++DGID+W  LS   + P     ILH +DD W++SAL
Sbjct:   360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSAL 396

 Score = 58 (25.5 bits), Expect = 1.5e-11, Sum P(4) = 1.5e-11
 Identities = 12/34 (35%), Positives = 19/34 (55%)

Query:     2 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             QH V+   E   L L+  ++P+  KE GY T ++
Sbjct:    93 QHFVISNEEPWALTLNATLMPEIFKEAGYSTNLV 126

 Score = 47 (21.6 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
 Identities = 11/43 (25%), Positives = 19/43 (44%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIF 900
             + DVL+ + + +  +  +A  P N+      DP     AW  F
Sbjct:   503 HSDVLTALRRRVQELRASASRPGNRASMPEADPTLHTCAWESF 545


>FB|FBgn0036768 [details] [associations]
            symbol:CG7402 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 KO:K01135 GO:GO:0003943
            GeneTree:ENSGT00560000077076 HSSP:P15848 RefSeq:NP_649023.1
            UniGene:Dm.13635 ProteinModelPortal:Q9VVM4 STRING:Q9VVM4
            PRIDE:Q9VVM4 EnsemblMetazoa:FBtr0075143 GeneID:39994
            KEGG:dme:Dmel_CG7402 UCSC:CG7402-RA FlyBase:FBgn0036768
            InParanoid:Q9VVM4 OMA:LYWAGPG PhylomeDB:Q9VVM4 GenomeRNAi:39994
            NextBio:816457 ArrayExpress:Q9VVM4 Bgee:Q9VVM4 Uniprot:Q9VVM4
        Length = 579

 Score = 880 (314.8 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
 Identities = 180/412 (43%), Positives = 252/412 (61%)

Query:    57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
             S  P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L  +Y   LCTPSR+ ++T
Sbjct:    25 STKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLT 84

Query:   117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
             GK+PIHTGMQH V+   E  GLP  E+++P+  ++ GY T +VGKWHLGF++K+ TPT R
Sbjct:    85 GKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMR 144

Query:   177 GFESHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
             GF+ H GY+ G+ DY+DH    +      GLD RRDLEP  + +G Y+T+ FT+EA  II
Sbjct:   145 GFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRII 204

Query:   234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
               H   +PLF+ L+H A H+ N   P+QAP+  +    HI D KR  +A ++  LD+SV 
Sbjct:   205 EQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVA 264

Query:   294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
             + + AL+   ML+NSII+  SD             SN+P RG K + WEGG+R AG +WS
Sbjct:   265 QTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWS 324

Query:   354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP-NYVN---STVENIIPRYENSILR 406
             PLL+ RG V+ Q +H  DWLPTL  AA  S   D+P + +N       N  P+   +++ 
Sbjct:   325 PLLKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPK-PRTMIH 383

Query:   407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY-ENGTHEYNPKYENRYE 457
               +    Y+S        +Y NG+  +  +Y+    E  T+E +P  E+ YE
Sbjct:   384 VLDEVFGYSS--YMRDTLKYVNGS-SFKGRYDQWLGELETNEDDPLGES-YE 431

 Score = 103 (41.3 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
 Identities = 22/58 (37%), Positives = 34/58 (58%)

Query:   753 EEGMRKLRDAASIQCGPVK-EVP------CEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
             ++ +R++R  A+  C P++ + P      CEP  APC FD+  DPCE+ NLA    +Q
Sbjct:   450 KDRIRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ 507

 Score = 103 (41.3 bits), Expect = 6.0e-97, Sum P(2) = 6.0e-97
 Identities = 25/73 (34%), Positives = 38/73 (52%)

Query:   650 MRKLRDAASIQCGPVK-EVP------CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHY 702
             +R++R  A+  C P++ + P      CEP  APC FD+  DPCE+ NLA     Q +   
Sbjct:   453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ-LQQL 511

Query:   703 TTEVGRFNQIAYP 715
               E+ +  + A P
Sbjct:   512 ADELEQIRKTAIP 524

 Score = 102 (41.0 bits), Expect = 2.5e-11, Sum P(4) = 2.5e-11
 Identities = 28/90 (31%), Positives = 44/90 (48%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
             +DGI++W +LS NE  K  T++H +D+ +  S+  R   K V  +S  G      G  E 
Sbjct:   361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELET 420

Query:   565 RSND---NSYQNEIDGIDVWSVLSRNEPSK 591
               +D    SY+  +   DV S+L     +K
Sbjct:   421 NEDDPLGESYEQHVLASDVQSLLGNRGLTK 450

 Score = 73 (30.8 bits), Expect = 1.8e-08, Sum P(4) = 1.8e-08
 Identities = 13/33 (39%), Positives = 23/33 (69%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA 607
             +DGI++W +LS NE  K  T++H +D+ +  S+
Sbjct:   361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSS 393

 Score = 72 (30.4 bits), Expect = 2.5e-11, Sum P(4) = 2.5e-11
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             MQH V+   E  GLP  E+++P+  ++ GY T ++
Sbjct:    93 MQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLV 127

 Score = 50 (22.7 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
 Identities = 13/41 (31%), Positives = 18/41 (43%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPF-DKGGDPKNFDHAW 897
             YP  L Q+  EL  I +TA+     P  D   +P   +  W
Sbjct:   504 YPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHNGNW 544


>FB|FBgn0036765 [details] [associations]
            symbol:CG7408 species:7227 "Drosophila melanogaster"
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0042742 "defense response to bacterium" evidence=IMP]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014296 GO:GO:0042742 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0003943
            GeneTree:ENSGT00560000077076 HSSP:P15289 FlyBase:FBgn0036765
            RefSeq:NP_001163462.1 RefSeq:NP_001163463.1 RefSeq:NP_649020.1
            UniGene:Dm.13634 EnsemblMetazoa:FBtr0075142
            EnsemblMetazoa:FBtr0300281 EnsemblMetazoa:FBtr0300282 GeneID:39991
            KEGG:dme:Dmel_CG7408 UCSC:CG7408-RB InParanoid:Q9VVM1 OMA:TRENERD
            GenomeRNAi:39991 NextBio:816442 Uniprot:Q9VVM1
        Length = 585

 Score = 768 (275.4 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
 Identities = 154/339 (45%), Positives = 216/339 (63%)

Query:    53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
             +VA+S  P+II I+ADDLG++DV F G +   TPNIDALAYSG+IL N Y   +CTPSR+
Sbjct:    28 IVATSDKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRA 87

Query:   113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
             A++TGK+PI+TGMQH V+   +  GLPL+E  + +  +E GYRT ++GKWHLG  ++ +T
Sbjct:    88 ALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFT 147

Query:   173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
             PT RGF+ HLGY   + DY+  S E+      G D R  L+   D  G Y TD+ T  AV
Sbjct:   148 PTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAV 207

Query:   231 DIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
               I +H   ++ +PLFL L H A H+AN  +P+QAP   ++   +I +     +AA++ +
Sbjct:   208 KEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSR 267

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRG 347
             LD+SVG V++AL ++ ML NSII+F+SD             SN+PLRG KN+ WEG +R 
Sbjct:   268 LDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRS 327

Query:   348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             +  IWS   E  G V +Q +++ D LPTL +AA  S  P
Sbjct:   328 SAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 366

 Score = 114 (45.2 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
 Identities = 23/46 (50%), Positives = 30/46 (65%)

Query:   753 EEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNL 796
             E  + +LRD + I+C  P   V PC P   PCLFDI+ DPCE++NL
Sbjct:   461 ERNISELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL 506

 Score = 111 (44.1 bits), Expect = 1.8e-98, Sum P(4) = 1.8e-98
 Identities = 26/67 (38%), Positives = 39/67 (58%)

Query:   652 KLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRIN-HYTTEVGR 708
             +LRD + I+C  P   V PC P   PCLFDI+ DPCE++NL    ++  I     + + +
Sbjct:   466 ELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTIFLDLWSRIQQ 525

Query:   709 FNQIAYP 715
             F + A+P
Sbjct:   526 FAKQAHP 532

 Score = 90 (36.7 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
 Identities = 28/94 (29%), Positives = 43/94 (45%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TRGKWKLVKENSING--NG----- 560
             +DG+++WS L     S    I+H ID++     L  TRGKWK++   +  G  +G     
Sbjct:   370 LDGLNLWSALKYGYESVEREIVHVIDEDVAEPHLSYTRGKWKVISGTTNQGLYDGWLGHR 429

Query:   561 -TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
              TSE       Y+  +    VW  L +    +RN
Sbjct:   430 ETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERN 463

 Score = 73 (30.8 bits), Expect = 1.3e-11, Sum P(4) = 1.3e-11
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             MQH V+   +  GLPL+E  + +  +E GYRT ++
Sbjct:   100 MQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLL 134

 Score = 57 (25.1 bits), Expect = 2.4e-95, Sum P(4) = 2.4e-95
 Identities = 10/28 (35%), Positives = 17/28 (60%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNIDDE 602
             +DG+++WS L     S    I+H ID++
Sbjct:   370 LDGLNLWSALKYGYESVEREIVHVIDED 397

 Score = 52 (23.4 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
 Identities = 10/30 (33%), Positives = 17/30 (56%)

Query:   874 RTAVAPINKPFDKGGDPKNFDHAWSIFGDD 903
             + A  P NKP D   DP+ + + W+ + D+
Sbjct:   528 KQAHPPNNKPGDPNCDPRFYHNEWTWWQDE 557


>UNIPROTKB|P15848 [details] [associations]
            symbol:ARSB "Arylsulfatase B" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0005791 "rough endoplasmic reticulum"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0006914 "autophagy" evidence=IEA] [GO:0007417 "central nervous
            system development" evidence=IEA] [GO:0007584 "response to
            nutrient" evidence=IEA] [GO:0009268 "response to pH" evidence=IEA]
            [GO:0043627 "response to estrogen stimulus" evidence=IEA]
            [GO:0051597 "response to methylmercury" evidence=IEA] [GO:0005764
            "lysosome" evidence=TAS] [GO:0007041 "lysosomal transport"
            evidence=TAS] [GO:0007040 "lysosome organization" evidence=TAS]
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0030204 "chondroitin sulfate metabolic process"
            evidence=TAS] [GO:0030207 "chondroitin sulfate catabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0043687 "post-translational protein modification" evidence=TAS]
            [GO:0044267 "cellular protein metabolic process" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005739
            GO:GO:0005794 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            GO:GO:0005791 GO:GO:0006914 GO:GO:0006644 GO:GO:0007584
            GO:GO:0007417 GO:GO:0007040 GO:GO:0009268 GO:GO:0005788
            EMBL:CH471084 GO:GO:0043627 GO:GO:0043687 GO:GO:0043202
            GO:GO:0007041 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            MIM:272200 GO:GO:0004065 GO:GO:0006687 EMBL:J05225 EMBL:M32373
            EMBL:X72735 EMBL:X72736 EMBL:X72737 EMBL:X72738 EMBL:X72739
            EMBL:X72740 EMBL:X72741 EMBL:X72742 EMBL:AK314903 EMBL:AC020937
            EMBL:AC025755 EMBL:AC099485 EMBL:AC114963 EMBL:BC029051 EMBL:S57777
            IPI:IPI00306576 IPI:IPI00413690 PIR:S35990 RefSeq:NP_000037.2
            RefSeq:NP_942002.1 UniGene:Hs.149103 UniGene:Hs.604199 PDB:1FSU
            PDBsum:1FSU ProteinModelPortal:P15848 SMR:P15848 IntAct:P15848
            STRING:P15848 PhosphoSite:P15848 DMDM:114223 PaxDb:P15848
            PRIDE:P15848 Ensembl:ENST00000264914 Ensembl:ENST00000396151
            Ensembl:ENST00000565165 GeneID:411 KEGG:hsa:411 UCSC:uc003kfq.3
            CTD:411 GeneCards:GC05M078108 HGNC:HGNC:714 HPA:HPA037770
            HPA:HPA037771 MIM:253200 MIM:611542 neXtProt:NX_P15848
            Orphanet:276212 Orphanet:276223 PharmGKB:PA25006
            HOGENOM:HOG000135354 HOVERGEN:HBG004282 InParanoid:P15848 KO:K01135
            OMA:WLFDIDR OrthoDB:EOG4DV5M0 PhylomeDB:P15848
            BioCyc:MetaCyc:HS03665-MONOMER BRENDA:3.1.6.12 ChEMBL:CHEMBL2399
            EvolutionaryTrace:P15848 GenomeRNAi:411 NextBio:1737
            ArrayExpress:P15848 Bgee:P15848 CleanEx:HS_ARSB
            Genevestigator:P15848 GermOnline:ENSG00000113273 GO:GO:0003943
            GO:GO:0030207 Uniprot:P15848
        Length = 533

 Score = 825 (295.5 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
 Identities = 162/355 (45%), Positives = 225/355 (63%)

Query:    33 RIMAFAVLPLAFTLSMVFVDLVA-SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
             R++   VLPL   L +      A +S PPH++F+LADDLGWNDVGFHG  +I TP++DAL
Sbjct:    17 RLLLPVVLPLLLLLLLAPPGSGAGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDAL 75

Query:    92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
             A  G++L NYYT  LCTPSRS ++TG++ I TG+QH +++ C+   +PL EK+LPQ LKE
Sbjct:    76 AAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKE 135

Query:   152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLD 205
              GY T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H       A  +    LD
Sbjct:   136 AGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALD 195

Query:   206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
              R   E A      YST++FT  A+ +I NH  ++PLFLYLA  + H     EPLQ P+ 
Sbjct:   196 FRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEE 250

Query:   266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXX 325
             YL  +  I+D  R  +A ++  +DE+VG V  AL+   + +N++ +F +D          
Sbjct:   251 YLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGG- 309

Query:   326 XXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                +NWPLRG K +LWEGGVRG G + SPLL+ +G+   + +H+SDWLPTL+  A
Sbjct:   310 ---NNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361

 Score = 97 (39.2 bits), Expect = 4.6e-06, Sum P(4) = 4.6e-06
 Identities = 16/35 (45%), Positives = 25/35 (71%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH +++ C+   +PL EK+LPQ LKE GY T ++
Sbjct:   109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMV 143

 Score = 84 (34.6 bits), Expect = 1.1e-92, Sum P(3) = 1.1e-92
 Identities = 18/47 (38%), Positives = 24/47 (51%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-QISALTRGKWKLVKENS 555
             +DG DVW  +S   PS R  +LHNID  +   S   R      K++S
Sbjct:   371 LDGFDVWKTISEGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDS 417

 Score = 82 (33.9 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S   PS R  +LHNID
Sbjct:   371 LDGFDVWKTISEGSPSPRIELLHNID 396

 Score = 47 (21.6 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query:   770 VKEVPCE--PQIAPCLFDIKNDPCEKNNLA 797
             V E+P    P     LFDI  DP E+++L+
Sbjct:   459 VSEIPSSDPPTKTLWLFDIDRDPEERHDLS 488

 Score = 47 (21.6 bits), Expect = 1.1e-92, Sum P(3) = 1.1e-92
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query:   664 VKEVPCE--PQIAPCLFDIKNDPCEKNNLA 691
             V E+P    P     LFDI  DP E+++L+
Sbjct:   459 VSEIPSSDPPTKTLWLFDIDRDPEERHDLS 488

 Score = 46 (21.3 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
 Identities = 6/24 (25%), Positives = 14/24 (58%)

Query:   680 IKNDPCEKNNLADRSEDQRINHYT 703
             + + PC +N++A   +D  +  Y+
Sbjct:   400 VDSSPCPRNSMAPAKDDSSLPEYS 423

 Score = 38 (18.4 bits), Expect = 1.6e-91, Sum P(3) = 1.6e-91
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query:   786 IKNDPCEKNNLA---DRSEVQRINHYTTEV 812
             + + PC +N++A   D S +   + + T V
Sbjct:   400 VDSSPCPRNSMAPAKDDSSLPEYSAFNTSV 429

 Score = 38 (18.4 bits), Expect = 2.8e-84, Sum P(2) = 2.8e-84
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query:   357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             E R  ++ +Y H+   L + L   +K  +P Y
Sbjct:   482 EERHDLSREYPHIVTKLLSRLQFYHKHSVPVY 513


>UNIPROTKB|A6QLZ3 [details] [associations]
            symbol:ARSB "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282 KO:K01135
            OMA:WLFDIDR OrthoDB:EOG4DV5M0 GeneTree:ENSGT00560000077076
            EMBL:DAAA02027809 EMBL:DAAA02027810 EMBL:DAAA02027811
            EMBL:DAAA02027812 EMBL:DAAA02027813 EMBL:DAAA02027814
            EMBL:DAAA02027815 EMBL:DAAA02027816 EMBL:DAAA02027817 EMBL:BC148139
            IPI:IPI00710068 RefSeq:NP_001094645.1 UniGene:Bt.35850 SMR:A6QLZ3
            STRING:A6QLZ3 Ensembl:ENSBTAT00000010988 GeneID:538401
            KEGG:bta:538401 InParanoid:A6QLZ3 NextBio:20877344 Uniprot:A6QLZ3
        Length = 533

 Score = 843 (301.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
 Identities = 166/356 (46%), Positives = 224/356 (62%)

Query:    38 AVLPLAFTLSMVFVDLVAS----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
             A+LPL   L ++ +  + S    S PPH++F+LADDLGWNDVGFHG   I TP +DALA 
Sbjct:    19 AILPLGLLLLLLLLPPLGSGAGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAA 77

Query:    94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
              G++L NYYT  LCTPSRS ++TG++ IHTG+QH ++  C+   +PL EK+LPQ LKE G
Sbjct:    78 GGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAG 137

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMR 207
             Y T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H       A  +    LD R
Sbjct:   138 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFR 197

Query:   208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
                E A      YST+VFT  A  +I NH  ++PLFLYLA  + H     EPLQ P+ YL
Sbjct:   198 DGEEVATGYKNMYSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYL 252

Query:   268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
               +  I+D  R  +A +   +DE+VG V  ALE+R + +N++ +F +D            
Sbjct:   253 KPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAGG--- 309

Query:   328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              +NWPLRG K +LWEGGVRG G + SPLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct:   310 -NNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364

 Score = 95 (38.5 bits), Expect = 3.1e-05, Sum P(3) = 3.1e-05
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH ++  C+   +PL EK+LPQ LKE GY T ++
Sbjct:   109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMV 143

 Score = 88 (36.0 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGK-WKLVKENS 555
             +DG DVW+ +S   PS R  +LHNID  +  +A   G    L K+ S
Sbjct:   371 LDGFDVWNTISEGSPSPRMELLHNIDPNFVDTAPCPGNSMALAKDES 417

 Score = 84 (34.6 bits), Expect = 4.8e-94, Sum P(3) = 4.8e-94
 Identities = 14/26 (53%), Positives = 18/26 (69%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW+ +S   PS R  +LHNID
Sbjct:   371 LDGFDVWNTISEGSPSPRMELLHNID 396

 Score = 42 (19.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   677 LFDIKNDPCEKNNLA 691
             LFDI  DP E+++L+
Sbjct:   474 LFDIDQDPEERHDLS 488

 Score = 42 (19.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   783 LFDIKNDPCEKNNLA 797
             LFDI  DP E+++L+
Sbjct:   474 LFDIDQDPEERHDLS 488

 Score = 37 (18.1 bits), Expect = 4.4e-86, Sum P(2) = 4.4e-86
 Identities = 9/32 (28%), Positives = 15/32 (46%)

Query:   357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
             E R  ++ +Y H+   L + L    K  +P Y
Sbjct:   482 EERHDLSREYPHIVKKLLSRLQFYQKHSVPVY 513


>UNIPROTKB|Q32KI4 [details] [associations]
            symbol:arsb "Arylsulfatase B" species:9615 "Canis lupus
            familiaris" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
            KO:K01135 OMA:WLFDIDR OrthoDB:EOG4DV5M0 GO:GO:0003943
            GeneTree:ENSGT00560000077076 EMBL:AAEX03002118 EMBL:AAEX03002119
            EMBL:BN000753 RefSeq:NP_001041598.1 UniGene:Cfa.39080 SMR:Q32KI4
            STRING:Q32KI4 Ensembl:ENSCAFT00000014585 GeneID:610364
            KEGG:cfa:610364 InParanoid:Q32KI4 NextBio:20895924 Uniprot:Q32KI4
        Length = 535

 Score = 829 (296.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
 Identities = 158/334 (47%), Positives = 218/334 (65%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             ++GPPH++F+LADDLGW+DVGFHG  +I TP++DALA +G++L NYYT  LCTPSRS ++
Sbjct:    43 AAGPPHLVFVLADDLGWHDVGFHG-SRIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLL 101

Query:   116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             TG++ IHTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT 
Sbjct:   102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query:   176 RGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
             RGF+++ GY  G +DY+ H       A  +    LD R   E A      YST++FT  A
Sbjct:   162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERA 221

Query:   230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
               +I NH  ++PLFLYLA  + H     EPLQ P+ YL  +  I D  R  +A ++  +D
Sbjct:   222 TALISNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMD 276

Query:   290 ESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAG 349
             E+VG V  AL+   + +N++ VF +D             +NWPLRG K +LWEGGVRG G
Sbjct:   277 EAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLAGG----NNWPLRGRKWSLWEGGVRGVG 332

Query:   350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              + SPLL+ +G+ + + VH+SDWLPTL+  A  S
Sbjct:   333 FVASPLLKRKGVKSRELVHISDWLPTLVGLAGGS 366

 Score = 97 (39.2 bits), Expect = 7.4e-05, Sum P(3) = 7.4e-05
 Identities = 16/35 (45%), Positives = 25/35 (71%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH +++ C+   +PL EK+LPQ LKE GY T ++
Sbjct:   111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMV 145

 Score = 82 (33.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S   PS R  +LHNID
Sbjct:   373 LDGFDVWRTISEGSPSPRMELLHNID 398

 Score = 82 (33.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S   PS R  +LHNID
Sbjct:   373 LDGFDVWRTISEGSPSPRMELLHNID 398

 Score = 42 (19.8 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   677 LFDIKNDPCEKNNLA 691
             LFDI  DP E+++L+
Sbjct:   476 LFDIDQDPEERHDLS 490

 Score = 42 (19.8 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   783 LFDIKNDPCEKNNLA 797
             LFDI  DP E+++L+
Sbjct:   476 LFDIDQDPEERHDLS 490


>RGD|2158 [details] [associations]
            symbol:Arsb "arylsulfatase B" species:10116 "Rattus norvegicus"
          [GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
          evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO;TAS]
          [GO:0005739 "mitochondrion" evidence=IDA] [GO:0005764 "lysosome"
          evidence=IDA] [GO:0005791 "rough endoplasmic reticulum" evidence=IDA]
          [GO:0005794 "Golgi apparatus" evidence=IDA] [GO:0006914 "autophagy"
          evidence=IDA] [GO:0007417 "central nervous system development"
          evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
          [GO:0008152 "metabolic process" evidence=ISO] [GO:0008484 "sulfuric
          ester hydrolase activity" evidence=IDA] [GO:0009268 "response to pH"
          evidence=IDA] [GO:0043627 "response to estrogen stimulus"
          evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
          [GO:0051597 "response to methylmercury" evidence=IDA]
          InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
          RGD:2158 GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
          GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764 GO:GO:0009268
          GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
          eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
          GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
          KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
          GeneTree:ENSGT00560000077076 EMBL:AABR03012149 EMBL:AABR03015281
          EMBL:AABR03016930 EMBL:AABR03021723 EMBL:D49434 EMBL:BN000736
          IPI:IPI00198405 PIR:I54210 RefSeq:NP_254278.1 UniGene:Rn.94004
          ProteinModelPortal:P50430 SMR:P50430 IntAct:P50430 STRING:P50430
          PRIDE:P50430 Ensembl:ENSRNOT00000014860 GeneID:25227 KEGG:rno:25227
          UCSC:RGD:2158 InParanoid:P50430 OMA:ALMTARY NextBio:605779
          ArrayExpress:P50430 Genevestigator:P50430
          GermOnline:ENSRNOG00000011150 Uniprot:P50430
        Length = 528

 Score = 824 (295.1 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 158/350 (45%), Positives = 227/350 (64%)

Query:    40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
             LPL   L +       ++ PPH++F+LADDLGWND+GFHG   I TP++DALA  G++L 
Sbjct:    20 LPLLLLLLLWPARASDAAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLD 78

Query:   100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
             NYY   LCTPSRS ++TG++ IH G+QH ++  C+   +PL EK+LPQ LK+ GY T +V
Sbjct:    79 NYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMV 138

Query:   160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMK--MWGLDMRRDLEPA 213
             GKWHLG Y+KE  PT RGF+++ GY  G +DY+ H A    E +      LD+R   EPA
Sbjct:   139 GKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPA 198

Query:   214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
              +    YST++FT  A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+  +  I
Sbjct:   199 KEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFI 253

Query:   274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPL 333
             +D  R  +A ++  LDE+VG V +AL+ R + +N++++F +D             +NWPL
Sbjct:   254 QDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSGG----NNWPL 309

Query:   334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
             RG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL++ A  S
Sbjct:   310 RGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359

 Score = 73 (30.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
             +DG DVW  +S   PS R  +L NID ++
Sbjct:   366 LDGFDVWETISEGSPSPRVELLLNIDPDF 394

 Score = 73 (30.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
             +DG DVW  +S   PS R  +L NID ++
Sbjct:   366 LDGFDVWETISEGSPSPRVELLLNIDPDF 394

 Score = 47 (21.6 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 13/54 (24%), Positives = 27/54 (50%)

Query:   664 VKEVPC--EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             + EVP    P     LFDI  DP E+++++ R     + +  + +  +++ + P
Sbjct:   454 ISEVPSVDSPTKTLWLFDINRDPEERHDVS-REHPHIVQNLLSRLQYYHEHSVP 506

 Score = 45 (20.9 bits), Expect = 3.3e-91, Sum P(3) = 3.3e-91
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query:   770 VKEVPC--EPQIAPCLFDIKNDPCEKNNLA 797
             + EVP    P     LFDI  DP E+++++
Sbjct:   454 ISEVPSVDSPTKTLWLFDINRDPEERHDVS 483


>MGI|MGI:88075 [details] [associations]
            symbol:Arsb "arylsulfatase B" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0003943
            "N-acetylgalactosamine-4-sulfatase activity" evidence=IEA]
            [GO:0004065 "arylsulfatase activity" evidence=IDA] [GO:0005739
            "mitochondrion" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0005791 "rough endoplasmic reticulum" evidence=ISO] [GO:0005794
            "Golgi apparatus" evidence=ISO] [GO:0006914 "autophagy"
            evidence=ISO] [GO:0007417 "central nervous system development"
            evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IDA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
            pH" evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0043627 "response to estrogen stimulus" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0051597 "response
            to methylmercury" evidence=ISO] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88075
            GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
            GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764
            GO:GO:0009268 GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0051597 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 CTD:411 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
            EMBL:AK083309 EMBL:AK154098 EMBL:AK158312 EMBL:AC131739
            EMBL:AC136976 EMBL:M82877 EMBL:X92096 EMBL:BN000746 IPI:IPI00406459
            IPI:IPI00652358 RefSeq:NP_033842.3 UniGene:Mm.300178
            UniGene:Mm.472255 ProteinModelPortal:P50429 SMR:P50429
            STRING:P50429 PhosphoSite:P50429 PaxDb:P50429 PRIDE:P50429
            DNASU:11881 Ensembl:ENSMUST00000091403 GeneID:11881 KEGG:mmu:11881
            UCSC:uc007rlo.1 UCSC:uc011zcv.1 GeneTree:ENSGT00560000077076
            InParanoid:P50429 SABIO-RK:P50429 NextBio:279911 Bgee:P50429
            CleanEx:MM_ARSB Genevestigator:P50429 GermOnline:ENSMUSG00000042093
            Uniprot:P50429
        Length = 534

 Score = 823 (294.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 159/356 (44%), Positives = 229/356 (64%)

Query:    40 LPLAFTLSMVFVDLVA---SSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
             LPL   L  + + L++   +SG   PPH++F+LADDLGWND+GFHG   I TP++DALA 
Sbjct:    20 LPLLLLLLQLLLLLLSPARASGATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAA 78

Query:    94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
              G++L NYY   LCTPSRS ++TG++ IH G+QH ++  C+   +PL EK+LPQ LKE G
Sbjct:    79 GGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAG 138

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMK--MWGLDMR 207
             Y T +VGKWHLG Y+KE  PT RGF+++ GY  G +DY+ H A    E +      LD+R
Sbjct:   139 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLR 198

Query:   208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
                EPA + +  YST++FT  A  +I NH  ++PLFLYLA  + H     +PLQ P+ Y+
Sbjct:   199 DGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 253

Query:   268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
               +  I+D  R  +A ++  +DE+VG V +AL+   + +N++ +F +D            
Sbjct:   254 EPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSGG--- 310

Query:   328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
              +NWPLRG K TLWEGG+RG G + SPLL+ +G+ + + +H++DWLPTL+  A  S
Sbjct:   311 -NNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365

 Score = 90 (36.7 bits), Expect = 0.00079, Sum P(3) = 0.00079
 Identities = 16/35 (45%), Positives = 24/35 (68%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH ++  C+   +PL EK+LPQ LKE GY T ++
Sbjct:   110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMV 144

 Score = 77 (32.2 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
             +DG ++W  +S   PS R  +LHNID ++
Sbjct:   372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400

 Score = 77 (32.2 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
             +DG ++W  +S   PS R  +LHNID ++
Sbjct:   372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400

 Score = 44 (20.5 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
 Identities = 13/54 (24%), Positives = 27/54 (50%)

Query:   664 VKEVPC--EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             V E+P    P     LFDI  DP E+++++ R     + +  + +  +++ + P
Sbjct:   460 VSEIPPVGPPTKTLWLFDINQDPEERHDVS-REHPHIVQNLLSRLQYYHEHSVP 512

 Score = 42 (19.8 bits), Expect = 3.3e-91, Sum P(3) = 3.3e-91
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query:   770 VKEVPC--EPQIAPCLFDIKNDPCEKNNLA 797
             V E+P    P     LFDI  DP E+++++
Sbjct:   460 VSEIPPVGPPTKTLWLFDINQDPEERHDVS 489


>UNIPROTKB|F1P099 [details] [associations]
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0004065 OMA:WLFDIDR
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00822500 Ensembl:ENSGALT00000038612
            ArrayExpress:F1P099 Uniprot:F1P099
        Length = 527

 Score = 779 (279.3 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
 Identities = 149/332 (44%), Positives = 213/332 (64%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A+  PPH++ +LADDLGW DVG+HG   I TP +DAL   G+ LK Y T  LCTPSR  +
Sbjct:    34 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 91

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             + G + IHTG+QH +++ C+   LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE  PT
Sbjct:    92 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 151

Query:   175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
              RGF+++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT  
Sbjct:   152 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 211

Query:   229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
             A+D+I NH T++PLFLYLA  + H     EPL+    Y+  +  I+D KR ++A ++  +
Sbjct:   212 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 266

Query:   289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
             DE+VG + +AL++  + +N+++VF +D             +NWPLRG K TLWEGGVRG 
Sbjct:   267 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 322

Query:   349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             G + SPLL+ +G+ + + +H+SDWLPTL+  A
Sbjct:   323 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 354

 Score = 92 (37.4 bits), Expect = 0.00013, Sum P(4) = 0.00013
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH +++ C+   LPL EK+LP+ LK+ GY T ++
Sbjct:   102 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 136

 Score = 82 (33.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S   PS R  +LHNID
Sbjct:   364 LDGFDVWKTISEGRPSPRVELLHNID 389

 Score = 82 (33.9 bits), Expect = 2.0e-87, Sum P(3) = 2.0e-87
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S   PS R  +LHNID
Sbjct:   364 LDGFDVWKTISEGRPSPRVELLHNID 389

 Score = 45 (20.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query:   677 LFDIKNDPCEKNNLADR 693
             LFDI +DP EK  L+++
Sbjct:   468 LFDIVHDPEEKYELSEK 484

 Score = 45 (20.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LFDI +DP EK  L+++
Sbjct:   468 LFDIVHDPEEKYELSEK 484

 Score = 38 (18.4 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
 Identities = 6/11 (54%), Positives = 9/11 (81%)

Query:   541 SALTRGKWKLV 551
             +A+  GKWKL+
Sbjct:   425 AAIRHGKWKLL 435


>UNIPROTKB|E1BKH3 [details] [associations]
            symbol:ARSJ "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
            EMBL:DAAA02016458 EMBL:DAAA02016459 EMBL:DAAA02016460
            IPI:IPI00825946 RefSeq:XP_002688145.1 RefSeq:XP_611819.3
            UniGene:Bt.87496 ProteinModelPortal:E1BKH3
            Ensembl:ENSBTAT00000023672 GeneID:540514 KEGG:bta:540514
            NextBio:20878676 Uniprot:E1BKH3
        Length = 599

 Score = 764 (274.0 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
 Identities = 150/329 (45%), Positives = 205/329 (62%)

Query:    54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             V +   PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct:    71 VTALSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
              +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct:   130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189

Query:   174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
             T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct:   190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 249

Query:   232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
             I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct:   250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304

Query:   292 VGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI 351
             +  V  AL+     +NSII++ SD             SNWPLRG K T WEGG+R  G +
Sbjct:   305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAIGFV 360

Query:   352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              SPLL+++G V ++ VH++DW PTL+S A
Sbjct:   361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389

 Score = 75 (31.5 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG DVW  +S    S R  ILHNID  +  +    G W
Sbjct:   398 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 435

 Score = 73 (30.8 bits), Expect = 1.8e-89, Sum P(4) = 1.8e-89
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG DVW  +S    S R  ILHNID
Sbjct:   398 QLDGYDVWETISEGLRSPRVDILHNID 424

 Score = 59 (25.8 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
             YP ++ Q+ + L+  N+TAV     P D   +P+     W
Sbjct:   513 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 552

 Score = 39 (18.8 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
 Identities = 8/17 (47%), Positives = 13/17 (76%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L++R
Sbjct:   496 LFNITADPYERVDLSNR 512


>UNIPROTKB|Q5FYB0 [details] [associations]
            symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 KO:K12375 EMBL:AY875938 EMBL:AM049401
            EMBL:AY358647 EMBL:AC104779 EMBL:BC089445 EMBL:BC132879
            EMBL:BC132881 EMBL:BC144265 IPI:IPI00413865 RefSeq:NP_078866.3
            UniGene:Hs.22895 UniGene:Hs.700496 UniGene:Hs.712042
            ProteinModelPortal:Q5FYB0 SMR:Q5FYB0 STRING:Q5FYB0
            PhosphoSite:Q5FYB0 DMDM:74722580 PRIDE:Q5FYB0
            Ensembl:ENST00000315366 Ensembl:ENST00000541197 GeneID:79642
            KEGG:hsa:79642 UCSC:uc003ibq.1 CTD:79642 GeneCards:GC04M114821
            HGNC:HGNC:26286 HPA:HPA036482 MIM:610010 neXtProt:NX_Q5FYB0
            PharmGKB:PA143485310 InParanoid:Q5FYB0 OMA:AAGYGIW
            OrthoDB:EOG45HRX5 ChiTaRS:ARSJ GenomeRNAi:79642 NextBio:68769
            ArrayExpress:Q5FYB0 Bgee:Q5FYB0 CleanEx:HS_ARSJ
            Genevestigator:Q5FYB0 Uniprot:Q5FYB0
        Length = 599

 Score = 766 (274.7 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
 Identities = 150/327 (45%), Positives = 206/327 (62%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +
Sbjct:    72 STSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 130

Query:   116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT 
Sbjct:   131 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTR 190

Query:   176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
             RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+
Sbjct:   191 RGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQIL 250

Query:   234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
              +H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++ 
Sbjct:   251 ASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIN 305

Query:   294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
              V  AL+     +NSII++ SD             SNWPLRG K T WEGG+R  G + S
Sbjct:   306 NVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVHS 361

Query:   354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
             PLL+++G V ++ VH++DW PTL+S A
Sbjct:   362 PLLKNKGTVCKELVHITDWYPTLISLA 388

 Score = 74 (31.1 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG D+W  +S    S R  ILHNID  +  +    G W
Sbjct:   397 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 434

 Score = 72 (30.4 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG D+W  +S    S R  ILHNID
Sbjct:   397 QLDGYDIWETISEGLRSPRVDILHNID 423

 Score = 55 (24.4 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
             YP ++ ++ + L+  N+TAV     P D   +P+     W
Sbjct:   512 YPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 551

 Score = 39 (18.8 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
 Identities = 8/17 (47%), Positives = 13/17 (76%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L++R
Sbjct:   495 LFNITADPYERVDLSNR 511


>UNIPROTKB|Q32KH6 [details] [associations]
            symbol:arsj "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
            OrthoDB:EOG45HRX5 EMBL:AAEX03016834 EMBL:BN000761
            RefSeq:NP_001041581.1 UniGene:Cfa.28600 SMR:Q32KH6
            Ensembl:ENSCAFT00000048607 GeneID:487909 KEGG:cfa:487909
            InParanoid:Q32KH6 NextBio:20861390 Uniprot:Q32KH6
        Length = 598

 Score = 761 (272.9 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
 Identities = 149/327 (45%), Positives = 205/327 (62%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             ++  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +
Sbjct:    70 ATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 128

Query:   116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT 
Sbjct:   129 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTK 188

Query:   176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
             RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I+
Sbjct:   189 RGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQIL 248

Query:   234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
              +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++ 
Sbjct:   249 ASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIN 303

Query:   294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
              V  AL+     +NSII++ SD             SNWPLRG K T WEGG+R  G + S
Sbjct:   304 NVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVHS 359

Query:   354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
             PLL+++G V ++ VH++DW PTL+S A
Sbjct:   360 PLLKNKGTVCKELVHITDWYPTLISLA 386

 Score = 75 (31.5 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG DVW  +S    S R  ILHNID  +  +    G W
Sbjct:   395 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432

 Score = 73 (30.8 bits), Expect = 5.9e-89, Sum P(4) = 5.9e-89
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG DVW  +S    S R  ILHNID
Sbjct:   395 QLDGYDVWETISEGLRSPRVDILHNID 421

 Score = 59 (25.8 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
             YP ++ Q+ + L+  N+TAV     P D   +P+     W
Sbjct:   510 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549

 Score = 37 (18.1 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L+ R
Sbjct:   493 LFNITADPYERVDLSHR 509


>RGD|1310242 [details] [associations]
            symbol:Arsi "arylsulfatase family, member I" species:10116
            "Rattus norvegicus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1310242
            GO:GO:0005783 GO:GO:0005576 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
            CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE
            EMBL:AABR03109797 EMBL:BN000739 IPI:IPI00367540
            RefSeq:NP_001041346.1 UniGene:Rn.202490 ProteinModelPortal:Q32KJ8
            SMR:Q32KJ8 STRING:Q32KJ8 PhosphoSite:Q32KJ8
            Ensembl:ENSRNOT00000030966 GeneID:307404 KEGG:rno:307404
            UCSC:RGD:1310242 InParanoid:Q32KJ8 NextBio:657343
            Genevestigator:Q32KJ8 Uniprot:Q32KJ8
        Length = 573

 Score = 743 (266.6 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
 Identities = 147/323 (45%), Positives = 206/323 (63%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
             PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct:    46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 104

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct:   105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164

Query:   179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
             ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS
Sbjct:   165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224

Query:   238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
               +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct:   225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query:   298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
             AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + SPLL+
Sbjct:   280 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query:   358 SRGIVAEQYVHVSDWLPTLLSAA 380
              +   +   VH++DW PTL+  A
Sbjct:   336 KKRRTSRALVHITDWYPTLVGLA 358

 Score = 77 (32.2 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 77 (32.2 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 57 (25.1 bits), Expect = 2.2e-84, Sum P(3) = 2.2e-84
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LAD+  D  +      +  +N+ A P
Sbjct:   464 LFNISADPYEREDLADQRPDV-VRTLLARLADYNRTAIP 501

 Score = 54 (24.1 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
 Identities = 15/46 (32%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   LA+ NRTA+ P+  P        +F+  AW  +  D
Sbjct:   482 PDVVRTLLARLADYNRTAI-PVRYPAANPRAHPDFNGGAWGPWASD 526

 Score = 50 (22.7 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query:   783 LFDIKNDPCEKNNLAD-RSEVQR 804
             LF+I  DP E+ +LAD R +V R
Sbjct:   464 LFNISADPYEREDLADQRPDVVR 486


>UNIPROTKB|E1BIN3 [details] [associations]
            symbol:ARSI "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:DAAA02020627
            IPI:IPI00695273 Ensembl:ENSBTAT00000017050 Uniprot:E1BIN3
        Length = 572

 Score = 744 (267.0 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
 Identities = 147/323 (45%), Positives = 207/323 (64%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
             PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct:    47 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 105

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             + IHTG+QH+++   +   LPL +  LPQ L+ELGY T +VGKWHLGFY+KE  PT RGF
Sbjct:   106 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 165

Query:   179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
             ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS
Sbjct:   166 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 225

Query:   238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
               +PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct:   226 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 280

Query:   298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
             AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + SPLL+
Sbjct:   281 ALKRHGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336

Query:   358 SRGIVAEQYVHVSDWLPTLLSAA 380
              +   +   VH++DW PTL++ A
Sbjct:   337 RKRRTSRALVHITDWYPTLVALA 359

 Score = 77 (32.2 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   369 LDGYDVWPAISEGRASPRTEILHNID 394

 Score = 77 (32.2 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   369 LDGYDVWPAISEGRASPRTEILHNID 394

 Score = 54 (24.1 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   L + NRTA+ P+  P +      +F+  AW  +  D
Sbjct:   483 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 527

 Score = 47 (21.6 bits), Expect = 2.0e-83, Sum P(3) = 2.0e-83
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   465 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 502

 Score = 44 (20.5 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   465 LFNISADPYEREDLAGQRPDVVR 487


>UNIPROTKB|Q5FYB1 [details] [associations]
            symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6
            EMBL:AY875937 EMBL:AB448735 EMBL:AK122641 EMBL:BC129995
            EMBL:BC129996 IPI:IPI00257076 IPI:IPI00915442 RefSeq:NP_001012301.1
            UniGene:Hs.591252 ProteinModelPortal:Q5FYB1 SMR:Q5FYB1
            STRING:Q5FYB1 PhosphoSite:Q5FYB1 DMDM:74722581 PRIDE:Q5FYB1
            Ensembl:ENST00000328668 Ensembl:ENST00000515301 GeneID:340075
            KEGG:hsa:340075 UCSC:uc003lrv.2 GeneCards:GC05M149657
            HGNC:HGNC:32521 HPA:HPA038386 MIM:610009 neXtProt:NX_Q5FYB1
            PharmGKB:PA143485309 InParanoid:Q5FYB1 OMA:YHGSDIE
            GenomeRNAi:340075 NextBio:97681 ArrayExpress:Q5FYB1 Bgee:Q5FYB1
            CleanEx:HS_ARSI Genevestigator:Q5FYB1 GermOnline:ENSG00000183876
            Uniprot:Q5FYB1
        Length = 569

 Score = 738 (264.8 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
 Identities = 146/323 (45%), Positives = 205/323 (63%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
             PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct:    46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct:   105 YQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164

Query:   179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
             ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I+ +HS
Sbjct:   165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query:   238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
                PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct:   225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query:   298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
             AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + SPLL+
Sbjct:   280 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query:   358 SRGIVAEQYVHVSDWLPTLLSAA 380
              +   +   +H++DW PTL+  A
Sbjct:   336 RKQRTSRALMHITDWYPTLVGLA 358

 Score = 77 (32.2 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 77 (32.2 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 57 (25.1 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
 Identities = 15/46 (32%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   LA  NRTA+ P+  P +      +F+  AW  +  D
Sbjct:   482 PDVVRTLLARLAEYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 526

 Score = 52 (23.4 bits), Expect = 2.5e-83, Sum P(3) = 2.5e-83
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   464 LFNISADPYEREDLAGQRPDV-VRTLLARLAEYNRTAIP 501

 Score = 44 (20.5 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   464 LFNISADPYEREDLAGQRPDVVR 486


>UNIPROTKB|Q32KH7 [details] [associations]
            symbol:ARSI "Arylsulfatase I" species:9615 "Canis lupus
            familiaris" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 HSSP:P15289 EMBL:AAEX02012119
            EMBL:BN000760 RefSeq:NP_001041583.1 UniGene:Cfa.39081
            ProteinModelPortal:Q32KH7 Ensembl:ENSCAFT00000028793 GeneID:489186
            KEGG:cfa:489186 CTD:340075 InParanoid:Q32KH7 KO:K12375
            OrthoDB:EOG4DFPN6 NextBio:20862393 Uniprot:Q32KH7
        Length = 573

 Score = 738 (264.8 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
 Identities = 146/323 (45%), Positives = 204/323 (63%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
             PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++TG+
Sbjct:    47 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 105

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
             + IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  PT RGF
Sbjct:   106 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 165

Query:   179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
             ++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+ +HS
Sbjct:   166 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225

Query:   238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
                PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +  
Sbjct:   226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280

Query:   298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
             AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + SPLL+
Sbjct:   281 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336

Query:   358 SRGIVAEQYVHVSDWLPTLLSAA 380
              +   +   VH++DW PTL+  A
Sbjct:   337 RKRRTSRALVHITDWYPTLVGLA 359

 Score = 77 (32.2 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   369 LDGYDVWPAISEGRASPRTEILHNID 394

 Score = 77 (32.2 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   369 LDGYDVWPAISEGRASPRTEILHNID 394

 Score = 54 (24.1 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   L + NRTA+ P+  P +      +F+  AW  +  D
Sbjct:   483 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 527

 Score = 47 (21.6 bits), Expect = 8.3e-83, Sum P(3) = 8.3e-83
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   465 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 502

 Score = 44 (20.5 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   465 LFNISADPYEREDLAGQRPDVVR 487


>UNIPROTKB|F1RL69 [details] [associations]
            symbol:LOC100517463 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:FP102406
            Ensembl:ENSSSCT00000015795 Uniprot:F1RL69
        Length = 596

 Score = 723 (259.6 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
 Identities = 145/325 (44%), Positives = 203/325 (62%)

Query:    57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
             S  PHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++T
Sbjct:    69 SQQPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 127

Query:   117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
             G++ IHTG+QH+++   +   LPL +  LPQ L++LGY T +VGKWHLGFY+KE  PT R
Sbjct:   128 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRR 187

Query:   177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
             GF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++      I+  
Sbjct:   188 GFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAG 247

Query:   236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
             HS   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V  +
Sbjct:   248 HSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNI 302

Query:   296 VEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPL 355
               AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + SPL
Sbjct:   303 TGALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPL 358

Query:   356 LESRGIVAEQYVHVSDWLPTLLSAA 380
             L+     +   +H++DW PTL+  A
Sbjct:   359 LKRTRRTSRALLHITDWYPTLVGLA 383

 Score = 77 (32.2 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   393 LDGYDVWPAISEGRASPRTEILHNID 418

 Score = 77 (32.2 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   393 LDGYDVWPAISEGRASPRTEILHNID 418

 Score = 54 (24.1 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   L + NRTA+ P+  P +      +F+  AW  +  D
Sbjct:   507 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 551

 Score = 47 (21.6 bits), Expect = 3.1e-81, Sum P(3) = 3.1e-81
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   489 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 526

 Score = 44 (20.5 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   489 LFNISADPYEREDLAGQRPDVVR 511


>MGI|MGI:2443513 [details] [associations]
            symbol:Arsj "arylsulfatase J" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2443513 GO:GO:0005576
            GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
            OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:AK034454 EMBL:AK046410
            EMBL:AK052931 IPI:IPI00986759 RefSeq:NP_775627.1 UniGene:Mm.317021
            ProteinModelPortal:Q8BM89 SMR:Q8BM89 STRING:Q8BM89
            PhosphoSite:Q8BM89 PRIDE:Q8BM89 Ensembl:ENSMUST00000093976
            GeneID:271970 KEGG:mmu:271970 InParanoid:Q8BM89 NextBio:393532
            Bgee:Q8BM89 CleanEx:MM_ARSJ Genevestigator:Q8BM89
            GermOnline:ENSMUSG00000046561 Uniprot:Q8BM89
        Length = 598

 Score = 758 (271.9 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
 Identities = 149/328 (45%), Positives = 205/328 (62%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  
Sbjct:    69 AGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  PT
Sbjct:   128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187

Query:   175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
              RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     I
Sbjct:   188 KRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQI 247

Query:   233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             +  H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++
Sbjct:   248 LATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 302

Query:   293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
               V  AL++    +NSII++ SD             SNWPLRG K T WEGG+R  G + 
Sbjct:   303 HNVTLALKRYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 358

Query:   353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             SPLL+++G V ++ VH++DW PTL+S A
Sbjct:   359 SPLLKNKGTVCKELVHITDWYPTLISLA 386

 Score = 74 (31.1 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG D+W  +S    S R  ILHNID  +  +    G W
Sbjct:   395 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432

 Score = 72 (30.4 bits), Expect = 2.5e-85, Sum P(3) = 2.5e-85
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG D+W  +S    S R  ILHNID
Sbjct:   395 QLDGYDIWETISEGLRSPRVDILHNID 421

 Score = 56 (24.8 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
 Identities = 23/111 (20%), Positives = 41/111 (36%)

Query:   795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
             N A +S + R+ H+   T   GY D  P Q F+ +    + ++                 
Sbjct:   440 NTAIQSAI-RVQHWKLLTGNPGYSDWVPPQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498

Query:   847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
                        YP ++ ++ + L+  N+TAV     P D   +P+     W
Sbjct:   499 DPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549

 Score = 45 (20.9 bits), Expect = 2.2e-84, Sum P(3) = 2.2e-84
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +L+ R     +      + +FN+ A P
Sbjct:   493 LFNITADPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVP 530

 Score = 38 (18.4 bits), Expect = 1.2e-83, Sum P(3) = 1.2e-83
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L+ R
Sbjct:   493 LFNITADPYERVDLSSR 509


>RGD|1307640 [details] [associations]
            symbol:Arsj "arylsulfatase family, member J" species:10116
            "Rattus norvegicus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 RGD:1307640 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
            OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:CH473952 EMBL:BN000740
            IPI:IPI00777558 RefSeq:NP_001041352.1 UniGene:Rn.202364 SMR:Q32KJ7
            STRING:Q32KJ7 Ensembl:ENSRNOT00000055633 GeneID:311013
            KEGG:rno:311013 UCSC:RGD:1307640 InParanoid:Q32KJ7 NextBio:662880
            Genevestigator:Q32KJ7 Uniprot:Q32KJ7
        Length = 597

 Score = 757 (271.5 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
 Identities = 149/328 (45%), Positives = 206/328 (62%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A +  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  
Sbjct:    69 AVTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             +TGK+ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+K+  PT
Sbjct:   128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187

Query:   175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
              RGF++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T     I
Sbjct:   188 KRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQI 247

Query:   233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             + +H   +PLFLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++
Sbjct:   248 LASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 302

Query:   293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
               V  AL++    +NSII++ SD             SNWPLRG K T WEGG+R  G + 
Sbjct:   303 HNVTLALKRYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 358

Query:   353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             SPLL+++G V ++ VH++DW PTL+S A
Sbjct:   359 SPLLKNKGTVCKELVHITDWYPTLISLA 386

 Score = 74 (31.1 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG D+W  +S    S R  ILHNID  +  +    G W
Sbjct:   395 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432

 Score = 72 (30.4 bits), Expect = 3.2e-85, Sum P(3) = 3.2e-85
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG D+W  +S    S R  ILHNID
Sbjct:   395 QLDGYDIWETISEGLRSPRVDILHNID 421

 Score = 56 (24.8 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
 Identities = 23/111 (20%), Positives = 41/111 (36%)

Query:   795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
             N A +S + R+ H+   T   GY D  P Q F+ +    + ++                 
Sbjct:   440 NTAIQSAI-RVQHWKLLTGNPGYSDWVPPQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498

Query:   847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
                        YP ++ ++ + L+  N+TAV     P D   +P+     W
Sbjct:   499 DPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549

 Score = 45 (20.9 bits), Expect = 2.8e-84, Sum P(3) = 2.8e-84
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +L+ R     +      + +FN+ A P
Sbjct:   493 LFNITADPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVP 530

 Score = 38 (18.4 bits), Expect = 1.5e-83, Sum P(3) = 1.5e-83
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L+ R
Sbjct:   493 LFNITADPYERVDLSSR 509


>UNIPROTKB|F1P095 [details] [associations]
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00820595 Ensembl:ENSGALT00000038618
            ArrayExpress:F1P095 Uniprot:F1P095
        Length = 407

 Score = 779 (279.3 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
 Identities = 149/332 (44%), Positives = 213/332 (64%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A+  PPH++ +LADDLGW DVG+HG   I TP +DAL   G+ LK Y T  LCTPSR  +
Sbjct:    35 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 92

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             + G + IHTG+QH +++ C+   LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE  PT
Sbjct:    93 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 152

Query:   175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
              RGF+++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT  
Sbjct:   153 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 212

Query:   229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
             A+D+I NH T++PLFLYLA  + H     EPL+    Y+  +  I+D KR ++A ++  +
Sbjct:   213 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 267

Query:   289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
             DE+VG + +AL++  + +N+++VF +D             +NWPLRG K TLWEGGVRG 
Sbjct:   268 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 323

Query:   349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             G + SPLL+ +G+ + + +H+SDWLPTL+  A
Sbjct:   324 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 355

 Score = 92 (37.4 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH +++ C+   LPL EK+LP+ LK+ GY T ++
Sbjct:   103 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 137

 Score = 82 (33.9 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S   PS R  +LHNID
Sbjct:   365 LDGFDVWKTISEGRPSPRVELLHNID 390

 Score = 82 (33.9 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S   PS R  +LHNID
Sbjct:   365 LDGFDVWKTISEGRPSPRVELLHNID 390


>UNIPROTKB|F1NQP9 [details] [associations]
            symbol:ARSI "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:AADN02028629
            IPI:IPI00587142 Ensembl:ENSGALT00000009011 Uniprot:F1NQP9
        Length = 572

 Score = 747 (268.0 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
 Identities = 150/335 (44%), Positives = 212/335 (63%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A + PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS +
Sbjct:    43 AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 101

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             +TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFYKKE  PT
Sbjct:   102 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 161

Query:   175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
              RGF++ LG  TG+ DY+ + + +   + G D+      AWD  GKYST ++      I+
Sbjct:   162 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 221

Query:   234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
              +HS  EP+F+Y+A  A H+     PLQ+P  Y+  +R + +  R K+AA++  +DE+V 
Sbjct:   222 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 276

Query:   294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
              +  AL++     NS+IVF +D             SNWPLRG K T WEGGVRG G + S
Sbjct:   277 NITWALKKYGYYDNSVIVFSTDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGIGFVHS 332

Query:   354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
             PL++ +   +   VH++DW PTL+S A  N S++P
Sbjct:   333 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVP 367

 Score = 76 (31.8 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
 Identities = 20/56 (35%), Positives = 26/56 (46%)

Query:   545 RGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 600
             R  W LV          S  R N ++    +DG +VW  +S  + S R  ILHNID
Sbjct:   340 RTSWALVHITDWYPTLVSLARGNLSNVPG-LDGYNVWPAISEGKESPRTEILHNID 394

 Score = 73 (30.8 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG +VW  +S  + S R  ILHNID
Sbjct:   369 LDGYNVWPAISEGKESPRTEILHNID 394

 Score = 51 (23.0 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +L+++  D  +    T +  +N+ A P
Sbjct:   466 LFNITADPYERYDLSEQRPDV-VRALLTRLVHYNRTAIP 503

 Score = 50 (22.7 bits), Expect = 5.8e-84, Sum P(3) = 5.8e-84
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AW 897
             PDV+  +   L + NRTA+ P+  P +      +F+  AW
Sbjct:   484 PDVVRALLTRLVHYNRTAI-PVRYPAENPRAHPDFNGGAW 522

 Score = 40 (19.1 bits), Expect = 6.6e-83, Sum P(3) = 6.6e-83
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query:   783 LFDIKNDPCEKNNLAD-RSEVQR 804
             LF+I  DP E+ +L++ R +V R
Sbjct:   466 LFNITADPYERYDLSEQRPDVVR 488


>MGI|MGI:2670959 [details] [associations]
            symbol:Arsi "arylsulfatase i" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2670959 GO:GO:0005783
            GO:GO:0005576 EMBL:CH466528 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
            HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
            CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE EMBL:BC138970
            EMBL:BC141169 EMBL:BN000748 IPI:IPI00462991 RefSeq:NP_001033588.1
            UniGene:Mm.20147 ProteinModelPortal:Q32KI9 SMR:Q32KI9 STRING:Q32KI9
            PRIDE:Q32KI9 Ensembl:ENSMUST00000040359 GeneID:545260
            KEGG:mmu:545260 UCSC:uc008fbe.1 InParanoid:Q32KI9 NextBio:412424
            Bgee:Q32KI9 Genevestigator:Q32KI9 Uniprot:Q32KI9
        Length = 573

 Score = 743 (266.6 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
 Identities = 148/328 (45%), Positives = 207/328 (63%)

Query:    54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             VA   PPHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct:    41 VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             ++TG++ IHTG+QH+++   +   LPL +  LPQ L+E GY T +VGKWHLGFY+KE  P
Sbjct:   100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159

Query:   174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
             T RGF++ LG  TG+ DY+ + + +   + G D+      AW L G+YST ++   A  I
Sbjct:   160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219

Query:   233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             + +H+   PLFLY+A  A H+     PLQ+P  YL  +R + +  R K+AA++  +DE+V
Sbjct:   220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274

Query:   293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
               +  AL++    +NS+I+F SD             SNWPLRG K T WEGGVRG G + 
Sbjct:   275 RNITWALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVH 330

Query:   353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             SPLL+ +   +   VH++DW PTL+  A
Sbjct:   331 SPLLKKKRRTSRALVHITDWYPTLVGLA 358

 Score = 77 (32.2 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 77 (32.2 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   368 LDGYDVWPAISEGRASPRTEILHNID 393

 Score = 51 (23.0 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query:   859 PDVLSQMEKELANINRTAVAPINKP 883
             PDV+  +   LA+ NRTA+ P+  P
Sbjct:   482 PDVVRTLLARLADYNRTAI-PVRYP 505

 Score = 50 (22.7 bits), Expect = 1.2e-83, Sum P(3) = 1.2e-83
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   464 LFNISADPYEREDLAGQRPDV-VRTLLARLADYNRTAIP 501

 Score = 44 (20.5 bits), Expect = 5.1e-83, Sum P(3) = 5.1e-83
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   464 LFNISADPYEREDLAGQRPDVVR 486


>UNIPROTKB|F1NT29 [details] [associations]
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00582830 Ensembl:ENSGALT00000007062
            ArrayExpress:F1NT29 Uniprot:F1NT29
        Length = 395

 Score = 779 (279.3 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
 Identities = 149/332 (44%), Positives = 213/332 (64%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A+  PPH++ +LADDLGW DVG+HG   I TP +DAL   G+ LK Y T  LCTPSR  +
Sbjct:    41 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 98

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             + G + IHTG+QH +++ C+   LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE  PT
Sbjct:    99 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 158

Query:   175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
              RGF+++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT  
Sbjct:   159 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 218

Query:   229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
             A+D+I NH T++PLFLYLA  + H     EPL+    Y+  +  I+D KR ++A ++  +
Sbjct:   219 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 273

Query:   289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
             DE+VG + +AL++  + +N+++VF +D             +NWPLRG K TLWEGGVRG 
Sbjct:   274 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 329

Query:   349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             G + SPLL+ +G+ + + +H+SDWLPTL+  A
Sbjct:   330 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 361

 Score = 92 (37.4 bits), Expect = 0.00045, Sum P(2) = 0.00045
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH +++ C+   LPL EK+LP+ LK+ GY T ++
Sbjct:   109 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 143

 Score = 76 (31.8 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNI 534
             +DG DVW  +S   PS R  +LHNI
Sbjct:   371 LDGFDVWKTISEGRPSPRVELLHNI 395

 Score = 76 (31.8 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNI 599
             +DG DVW  +S   PS R  +LHNI
Sbjct:   371 LDGFDVWKTISEGRPSPRVELLHNI 395


>UNIPROTKB|F1P098 [details] [associations]
            symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
            EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
            EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
            EMBL:AADN02046150 IPI:IPI00820025 Ensembl:ENSGALT00000038614
            ArrayExpress:F1P098 Uniprot:F1P098
        Length = 388

 Score = 779 (279.3 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
 Identities = 149/332 (44%), Positives = 213/332 (64%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A+  PPH++ +LADDLGW DVG+HG   I TP +DAL   G+ LK Y T  LCTPSR  +
Sbjct:    41 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 98

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             + G + IHTG+QH +++ C+   LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE  PT
Sbjct:    99 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 158

Query:   175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
              RGF+++ GY  G +DY+ H       A+ +    LD R   E A      YST++FT  
Sbjct:   159 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 218

Query:   229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
             A+D+I NH T++PLFLYLA  + H     EPL+    Y+  +  I+D KR ++A ++  +
Sbjct:   219 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 273

Query:   289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
             DE+VG + +AL++  + +N+++VF +D             +NWPLRG K TLWEGGVRG 
Sbjct:   274 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 329

Query:   349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             G + SPLL+ +G+ + + +H+SDWLPTL+  A
Sbjct:   330 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 361

 Score = 37 (18.1 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
 Identities = 5/10 (50%), Positives = 7/10 (70%)

Query:   510 IDGIDVWSVL 519
             +DG DVW  +
Sbjct:   371 LDGFDVWKTI 380

 Score = 37 (18.1 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
 Identities = 5/10 (50%), Positives = 7/10 (70%)

Query:   575 IDGIDVWSVL 584
             +DG DVW  +
Sbjct:   371 LDGFDVWKTI 380


>UNIPROTKB|F6PKT4 [details] [associations]
            symbol:ARSJ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HOGENOM:HOG000135354 HOVERGEN:HBG004282
            GeneTree:ENSGT00560000077076 OrthoDB:EOG45HRX5 EMBL:AAEX03016834
            Ensembl:ENSCAFT00000019312 Uniprot:F6PKT4
        Length = 489

 Score = 577 (208.2 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
 Identities = 114/269 (42%), Positives = 163/269 (60%)

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             +++ ++ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  P
Sbjct:     1 LLSSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 60

Query:   174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
             T RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     
Sbjct:    61 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 120

Query:   232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
             I+ +H   +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE+
Sbjct:   121 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 175

Query:   292 VGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI 351
             +  V  AL+     +NSII++ SD             SNWPLRG K T WEGG+R  G +
Sbjct:   176 INNVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFV 231

Query:   352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              SPLL+++G V ++ VH++DW PTL+S A
Sbjct:   232 HSPLLKNKGTVCKELVHITDWYPTLISLA 260

 Score = 75 (31.5 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG DVW  +S    S R  ILHNID  +  +    G W
Sbjct:   269 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 306

 Score = 73 (30.8 bits), Expect = 1.8e-69, Sum P(4) = 1.8e-69
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG DVW  +S    S R  ILHNID
Sbjct:   269 QLDGYDVWETISEGLRSPRVDILHNID 295

 Score = 59 (25.8 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
             YP ++ Q+ + L+  N+TAV     P D   +P+     W
Sbjct:   384 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 423

 Score = 37 (18.1 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L+ R
Sbjct:   367 LFNITADPYERVDLSHR 383


>UNIPROTKB|F1S147 [details] [associations]
            symbol:ARSJ "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:CU694917
            Ensembl:ENSSSCT00000009989 Uniprot:F1S147
        Length = 467

 Score = 575 (207.5 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
 Identities = 114/268 (42%), Positives = 160/268 (59%)

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
             +  ++ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY+KE  PT
Sbjct:     1 LLSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPT 60

Query:   175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
              RGF++  G   G  DY+ H   +   M G D+  +   AWD  +G YST ++T     I
Sbjct:    61 KRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQI 120

Query:   233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             + +H    P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++
Sbjct:   121 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 175

Query:   293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
               V  AL+     +NSII++ SD             SNWPLRG K T WEGG+R  G + 
Sbjct:   176 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 231

Query:   353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             SPLL+++G V ++ VH++DW PTL+S A
Sbjct:   232 SPLLKNKGTVCKELVHITDWYPTLISLA 259

 Score = 75 (31.5 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG DVW  +S    S R  ILHNID  +  +    G W
Sbjct:   268 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 305

 Score = 74 (31.1 bits), Expect = 0.00074, Sum P(4) = 0.00074
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH+++   +   LPL    LPQ LKE+GY T ++
Sbjct:    11 LQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMV 45

 Score = 73 (30.8 bits), Expect = 1.9e-69, Sum P(4) = 1.9e-69
 Identities = 14/27 (51%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG DVW  +S    S R  ILHNID
Sbjct:   268 QLDGYDVWETISEGLRSPRVDILHNID 294

 Score = 60 (26.2 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query:   858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
             YP V+ Q+ + L+  N+TAV     P D   +P+     W
Sbjct:   383 YPGVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 422

 Score = 39 (18.8 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
 Identities = 8/17 (47%), Positives = 13/17 (76%)

Query:   783 LFDIKNDPCEKNNLADR 799
             LF+I  DP E+ +L++R
Sbjct:   366 LFNITADPYERVDLSNR 382


>UNIPROTKB|F1NH07 [details] [associations]
            symbol:ARSJ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:AADN02009321
            IPI:IPI00574604 Ensembl:ENSGALT00000019613 Uniprot:F1NH07
        Length = 472

 Score = 560 (202.2 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
 Identities = 111/265 (41%), Positives = 162/265 (61%)

Query:   118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
             ++ IHTG+QH+++   +   LPL    LPQ LKE+GY T +VGKWHLGFY++E  PT RG
Sbjct:     1 RYQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60

Query:   178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
             F++  G   G  DY+ H   +   + G D+  +   AWD  +G YST ++T +   I+ +
Sbjct:    61 FDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILAS 120

Query:   236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
             H+  +P+FLY+A+ A HS     PLQAP  Y   +R I +  R ++AA+L  LDE++  V
Sbjct:   121 HNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNV 175

Query:   296 VEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPL 355
               AL++     NSII++ SD             SNWPLRG K T WEGG+R  G + SPL
Sbjct:   176 TLALKKYGYYDNSIIIYSSDNGGQPMAGG----SNWPLRGSKGTYWEGGIRAVGFVHSPL 231

Query:   356 LESRGIVAEQYVHVSDWLPTLLSAA 380
             L+++G V ++ VH++DW PTL++ A
Sbjct:   232 LKNKGSVCKELVHITDWFPTLITLA 256

 Score = 79 (32.9 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
 Identities = 31/120 (25%), Positives = 47/120 (39%)

Query:   795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
             N A +S + R+NH+   T   GY D  P Q F+ +    + ++                 
Sbjct:   310 NTAIQSAI-RVNHWKLLTGNPGYSDWVPPQAFSNVGPNRWHNERVSWSAGKTVWLFNITA 368

Query:   847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSI-FGDDLK 905
                        YPDV+ Q+ + L+  N+TAV     P D   +PK     W   F +D K
Sbjct:   369 DPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWGPWFKEDEK 428

 Score = 77 (32.2 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
 Identities = 15/40 (37%), Positives = 21/40 (52%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             ++DG D+W  +S    S R  ILHNID  +  +    G W
Sbjct:   265 QLDGYDIWETISEGRRSPRVDILHNIDPIY--TKAKNGSW 302

 Score = 75 (31.5 bits), Expect = 1.7e-66, Sum P(3) = 1.7e-66
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
             ++DG D+W  +S    S R  ILHNID
Sbjct:   265 QLDGYDIWETISEGRRSPRVDILHNID 291

 Score = 72 (30.4 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query:     1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             +QH+++   +   LPL    LPQ LKE+GY T ++
Sbjct:     8 LQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMV 42

 Score = 49 (22.3 bits), Expect = 1.5e-63, Sum P(3) = 1.5e-63
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +L+ +  D  +      + +FN+ A P
Sbjct:   363 LFNITADPYERVDLSAKYPDV-VKQLLRRLSQFNKTAVP 400


>WB|WBGene00006310 [details] [associations]
            symbol:sul-3 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:FO080947
            UniGene:Cel.8880 GeneID:183778 KEGG:cel:CELE_C54D2.4 CTD:183778
            RefSeq:NP_001041231.1 ProteinModelPortal:H2KZF6 SMR:H2KZF6
            EnsemblMetazoa:C54D2.4a WormBase:C54D2.4a OMA:RGMMVSD
            Uniprot:H2KZF6
        Length = 488

 Score = 544 (196.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
 Identities = 132/375 (35%), Positives = 203/375 (54%)

Query:    31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
             RT +  F +L L     +  VD   ++  P+++FI+ADDLG++DV +     + TPN+  
Sbjct:     3 RTTLPTFLLL-LLHNHGITGVDGQTATQKPNVLFIMADDLGFSDVDWKD-STLHTPNLRH 60

Query:    91 LAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
             LA+  +  +L N Y  QLCTP+RSA MTG +P   G Q+ V    E  G+P     L + 
Sbjct:    61 LAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSEN 120

Query:   149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MW 202
             +++L Y T +VGKWHLG+ KKE+ PT RGF+   G++     YF+HSA+    E+K  + 
Sbjct:   121 MRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVK 180

Query:   203 GLDMRRDLE-----PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
             GLD+  ++      P +  +G YSTD+FT  A+ ++ NH+  +P F++L++ A H   P 
Sbjct:   181 GLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHNNSKPFFMFLSYQAVH---P- 236

Query:   258 EPLQAPDHYLNIHRHIE-DF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
              PLQ       I +  E  F     +    +L  +D ++G++VE L+   +  N++IVF 
Sbjct:   237 -PLQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIGRLVEYLKASNLYENTVIVFT 295

Query:   314 SDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
             SD             SN PLRG K+T+WEGG +    + SP+    G   +   HV DW 
Sbjct:   296 SDNGGTANFGA----SNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWH 351

Query:   374 PTLLSAANKSDIPNY 388
              T+LS     +I +Y
Sbjct:   352 ATILSITGL-EIDSY 365

 Score = 67 (28.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
 Identities = 18/57 (31%), Positives = 28/57 (49%)

Query:   511 DGIDVWSVLSRNEPS-KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
             DGI+ W  L    P  +R   ++NID+    SA+  G +KL+  N        +NR+
Sbjct:   367 DGINQWEYLKTGRPKFRRFQFVYNIDNHG--SAIRDGDYKLIVGNVDRKMSKDKNRT 421

 Score = 49 (22.3 bits), Expect = 2.4e-58, Sum P(3) = 2.4e-58
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query:   576 DGIDVWSVLSRNEPS-KRNTILHNIDD 601
             DGI+ W  L    P  +R   ++NID+
Sbjct:   367 DGINQWEYLKTGRPKFRRFQFVYNIDN 393

 Score = 47 (21.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
 Identities = 9/45 (20%), Positives = 22/45 (48%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDD 903
             P ++ ++  +L  + +     + KP    G P+ F+ ++S +  D
Sbjct:   441 PKIVRRLLAKLDQLKKFLHKNVRKPLSLNGSPERFNGSYSSYWCD 485

 Score = 37 (18.1 bits), Expect = 3.5e-59, Sum P(3) = 3.5e-59
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
             LF I  DP E  ++A RS  + +     ++ +  +  +
Sbjct:   423 LFRITTDPTESKDIA-RSNPKIVRRLLAKLDQLKKFLH 459


>UNIPROTKB|P34059 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0003943
            "N-acetylgalactosamine-4-sulfatase activity" evidence=TAS]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0005975 "carbohydrate metabolic process" evidence=TAS]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=TAS]
            [GO:0042339 "keratan sulfate metabolic process" evidence=TAS]
            [GO:0042340 "keratan sulfate catabolic process" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] [GO:0044281 "small
            molecule metabolic process" evidence=TAS] Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            GO:GO:0043202 DrugBank:DB00070 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0003943
            Orphanet:582 GO:GO:0042340 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
            GO:GO:0043890 EMBL:D17629 EMBL:U06088 EMBL:U06078 EMBL:U06079
            EMBL:U06080 EMBL:U06081 EMBL:U06082 EMBL:U06083 EMBL:U06084
            EMBL:U06085 EMBL:U06086 EMBL:U06087 EMBL:BC050684 EMBL:BC056151
            IPI:IPI00029605 PIR:JQ1299 RefSeq:NP_000503.1 UniGene:Hs.271383
            PDB:4FDI PDB:4FDJ PDBsum:4FDI PDBsum:4FDJ ProteinModelPortal:P34059
            SMR:P34059 STRING:P34059 PhosphoSite:P34059 DMDM:462148
            PaxDb:P34059 PRIDE:P34059 DNASU:2588 Ensembl:ENST00000268695
            GeneID:2588 KEGG:hsa:2588 UCSC:uc002fly.4 GeneCards:GC16M088880
            H-InvDB:HIX0134371 HGNC:HGNC:4122 HPA:CAB026404 MIM:253000
            MIM:612222 neXtProt:NX_P34059 PharmGKB:PA28535 InParanoid:P34059
            OMA:GAISHAF PhylomeDB:P34059 BioCyc:MetaCyc:HS06790-MONOMER
            BRENDA:3.1.6.4 ChiTaRS:Galns GenomeRNAi:2588 NextBio:10237
            ArrayExpress:P34059 Bgee:P34059 CleanEx:HS_GALNS
            Genevestigator:P34059 GermOnline:ENSG00000141012 Uniprot:P34059
        Length = 522

 Score = 440 (159.9 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
 Identities = 113/353 (32%), Positives = 178/353 (50%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct:    13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
             Y+   LC+PSR+A++TG+ PI  G      H  N     E  GG+P SE++LP+ LK+ G
Sbjct:    73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + W +  R  
Sbjct:   133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYY 191

Query:   210 LEPAWDLH-GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
              E   +L  G+ + T ++  EA+D I   +   P FLY A  ATH+     P+ A   +L
Sbjct:   192 EEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFL 246

Query:   268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
                +      R ++   + ++D+S+GK++E L+   +  N+ + F SD            
Sbjct:   247 GTSQ------RGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG 300

Query:   328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              SN P    K T +EGG+R   L W P   + G V+ Q   + D   T L+ A
Sbjct:   301 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALA 353

 Score = 41 (19.5 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query:   667 VPCEPQIAPCLFDIKN--DP-CEK 687
             VP +PQ+  C + + N   P CEK
Sbjct:   480 VPAQPQLNVCNWAVMNWAPPGCEK 503

 Score = 41 (19.5 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query:   773 VPCEPQIAPCLFDIKN--DP-CEK 793
             VP +PQ+  C + + N   P CEK
Sbjct:   480 VPAQPQLNVCNWAVMNWAPPGCEK 503


>MGI|MGI:1355303 [details] [associations]
            symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
            species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
            "metabolic process" evidence=ISO] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:1355303 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
            OrthoDB:EOG480HWH GO:GO:0043890 BRENDA:3.1.6.4 EMBL:AF111346
            EMBL:AF112242 EMBL:AF112230 EMBL:AF112231 EMBL:AF112233
            EMBL:AF112232 EMBL:AF112234 EMBL:AF112235 EMBL:AF112236
            EMBL:AF112237 EMBL:AF112238 EMBL:AF112239 EMBL:AF112240
            EMBL:AF112241 EMBL:AK220245 EMBL:AK159592 EMBL:BC004002
            IPI:IPI00310090 RefSeq:NP_001180574.1 RefSeq:NP_057931.3
            UniGene:Mm.34702 ProteinModelPortal:Q571E4 SMR:Q571E4 STRING:Q571E4
            PhosphoSite:Q571E4 PaxDb:Q571E4 PRIDE:Q571E4
            Ensembl:ENSMUST00000015171 GeneID:50917 KEGG:mmu:50917
            UCSC:uc012gmh.1 InParanoid:Q571E4 OMA:RKTGEAN NextBio:307919
            Bgee:Q571E4 CleanEx:MM_GALNS Genevestigator:Q571E4 Uniprot:Q571E4
        Length = 520

 Score = 433 (157.5 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
 Identities = 115/358 (32%), Positives = 178/358 (49%)

Query:    38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
             A   L   LS + +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++
Sbjct:     6 AAQQLLLVLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGML 65

Query:    98 LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCE-RGGLPLSEKILPQYL 149
               ++Y+   LC+PSR+A++TG+ PI  G      H  N     E  GG+P SE +LP+ L
Sbjct:    66 FPSFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELL 125

Query:   150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLD 205
             K+ GY  +IVGKWHLG ++ ++ P   GF+   G    H   +D+ A+      + W + 
Sbjct:   126 KKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMV 184

Query:   206 MRRDLE-PAWDLHGKYS-TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
              R   E P     G+ + T ++T EA+D I   H+   P FLY A  ATH+     P+ A
Sbjct:   185 GRFYEEFPINRKTGEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYA 239

Query:   263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXX 322
                +L          R ++   + ++D+SVGK++  L+   +  N+ + F SD       
Sbjct:   240 SRQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALIS 293

Query:   323 XXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                   SN P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A
Sbjct:   294 APNEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLA 351

 Score = 39 (18.8 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query:   675 PCLFDIKNDPCEKNNLADRSED-QRINHYTTEVGRFNQ 711
             P +F +  DP E+  L+  S++ Q     TT+V + +Q
Sbjct:   437 PLIFHLGRDPGERFPLSFHSDEYQDALSRTTQVVQEHQ 474


>UNIPROTKB|F1S2F1 [details] [associations]
            symbol:F1S2F1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CU468550
            Ensembl:ENSSSCT00000015408 Uniprot:F1S2F1
        Length = 151

 Score = 435 (158.2 bits), Expect = 4.1e-40, P = 4.1e-40
 Identities = 74/127 (58%), Positives = 98/127 (77%)

Query:    62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
             ++F+LADDLGWNDVGFHG  +I TP++DALA  G++L NYYT  LCTPSRS ++TG++ I
Sbjct:    15 LVFVLADDLGWNDVGFHG-SEIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQI 73

Query:   122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
             HTG+QH +++ C+   +PL EK+LPQ LKE GY T +VGKWHLG Y+KE  PT RGF+++
Sbjct:    74 HTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTY 133

Query:   182 LGYWTGH 188
              G    H
Sbjct:   134 FGNGNAH 140


>UNIPROTKB|F1PHF0 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9615 "Canis lupus familiaris" [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:RKTGEAN EMBL:AAEX03003965
            Ensembl:ENSCAFT00000031604 Uniprot:F1PHF0
        Length = 524

 Score = 428 (155.7 bits), Expect = 2.4e-39, P = 2.4e-39
 Identities = 113/354 (31%), Positives = 175/354 (49%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct:    14 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 73

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGM----QH-NVLYGCER--GGLPLSEKILPQYLKELG 153
             Y+   LC+PSR+A++TG+ PI  G     +H    Y  +   GG+P  E +LP+ LKE G
Sbjct:    74 YSANPLCSPSRAALLTGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAG 133

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + W +  R  
Sbjct:   134 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYY 192

Query:   210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
              E   +L  G+ + T V+  EA+D I    +   P FLY A  ATH+     P+ A   +
Sbjct:   193 EEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----PVYASRPF 247

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L   +      R ++   + ++D SVGK++  L+  R+  N+ + F SD           
Sbjct:   248 LGTSQ------RGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQ 301

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
               SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A
Sbjct:   302 GGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLA 355


>UNIPROTKB|Q32KH5 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9615 "Canis lupus familiaris" [GO:0005764 "lysosome"
            evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0046872 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 HSSP:P15289 EMBL:BN000762
            RefSeq:NP_001041585.1 UniGene:Cfa.37704 ProteinModelPortal:Q32KH5
            STRING:Q32KH5 PRIDE:Q32KH5 GeneID:489661 KEGG:cfa:489661 CTD:2588
            InParanoid:Q32KH5 KO:K01132 OrthoDB:EOG480HWH NextBio:20862813
            GO:GO:0043890 Uniprot:Q32KH5
        Length = 522

 Score = 428 (155.7 bits), Expect = 2.4e-39, P = 2.4e-39
 Identities = 113/354 (31%), Positives = 175/354 (49%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   +    +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct:    12 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGM----QH-NVLYGCER--GGLPLSEKILPQYLKELG 153
             Y+   LC+PSR+A++TG+ PI  G     +H    Y  +   GG+P  E +LP+ LKE G
Sbjct:    72 YSANPLCSPSRAALLTGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAG 131

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + W +  R  
Sbjct:   132 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYY 190

Query:   210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
              E   +L  G+ + T V+  EA+D I    +   P FLY A  ATH+     P+ A   +
Sbjct:   191 EEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----PVYASRPF 245

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L   +      R ++   + ++D SVGK++  L+  R+  N+ + F SD           
Sbjct:   246 LGTSQ------RGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQ 299

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
               SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A
Sbjct:   300 GGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLA 353


>UNIPROTKB|F1NW57 [details] [associations]
            symbol:GALNS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 OMA:DDQVGIL EMBL:AADN02054103
            IPI:IPI00577734 Ensembl:ENSGALT00000010149 Uniprot:F1NW57
        Length = 521

 Score = 409 (149.0 bits), Expect = 5.2e-39, Sum P(2) = 5.2e-39
 Identities = 103/337 (30%), Positives = 169/337 (50%)

Query:    57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIM 115
             + PP+++ +L DD+GW D+G  G     TPN+D +A  G++  ++Y    LC+PSR+A++
Sbjct:    26 AAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPSRAALL 85

Query:   116 TGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
             TG+ P+  G      +           GG+  SE +LP+ LK+ GY  +I+GKWHLG ++
Sbjct:    86 TGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLG-HR 144

Query:   169 KEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLH-GKYS-T 222
              ++ P   GF+   G    H   +D+ A       + W +  R   +   DL  G+ + T
Sbjct:   145 PQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTGEANLT 204

Query:   223 DVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
              ++  EA+D I    ++ +P FLY A  ATH+     P+ A  H+L   +      R ++
Sbjct:   205 QIYLQEALDFISKQQASQQPFFLYWAIDATHA-----PVYASKHFLGTSQ------RGRY 253

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLW 341
                + ++D+SVGK+++ L++  +  N+ + F SD             SN P    K T +
Sbjct:   254 GDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTF 313

Query:   342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
             EGG+R   + W P     G V+ Q   V D   T LS
Sbjct:   314 EGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLS 350

 Score = 41 (19.5 bits), Expect = 5.2e-39, Sum P(2) = 5.2e-39
 Identities = 10/27 (37%), Positives = 14/27 (51%)

Query:   670 EPQIAPCLFDIKNDPCEKNNLADRSED 696
             E    P LF +  DP EK  L+  S++
Sbjct:   433 EHSTLPLLFHLGRDPGEKYPLSFASDE 459

 Score = 39 (18.8 bits), Expect = 8.4e-39, Sum P(2) = 8.4e-39
 Identities = 10/26 (38%), Positives = 13/26 (50%)

Query:   776 EPQIAPCLFDIKNDPCEKNNLADRSE 801
             E    P LF +  DP EK  L+  S+
Sbjct:   433 EHSTLPLLFHLGRDPGEKYPLSFASD 458


>RGD|1565391 [details] [associations]
            symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
            species:10116 "Rattus norvegicus" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
            "metabolic process" evidence=RCA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA;ISO;RCA] [GO:0043890
            "N-acetylgalactosamine-6-sulfatase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1565391
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
            OrthoDB:EOG480HWH GO:GO:0043890 EMBL:AC134009 EMBL:BN000741
            IPI:IPI00359847 RefSeq:NP_001041316.1 UniGene:Rn.101398
            ProteinModelPortal:Q32KJ6 STRING:Q32KJ6 PRIDE:Q32KJ6
            Ensembl:ENSRNOT00000019528 GeneID:292073 KEGG:rno:292073
            UCSC:RGD:1565391 InParanoid:Q32KJ6 NextBio:633705
            Genevestigator:Q32KJ6 Uniprot:Q32KJ6
        Length = 524

 Score = 423 (154.0 bits), Expect = 8.2e-39, P = 8.2e-39
 Identities = 112/357 (31%), Positives = 178/357 (49%)

Query:    39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
             +LP+   L ++      +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++ 
Sbjct:    14 LLPVLSALGLL---AAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLF 70

Query:    99 KNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCE-RGGLPLSEKILPQYLK 150
              ++Y+   LC+PSR+A++TG+ PI  G      H  N     E  GG+P SE +LP+ LK
Sbjct:    71 PSFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLK 130

Query:   151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDM 206
             + GY  +IVGKWHLG ++ ++ P   GF+   G    H   +D+  +      + W +  
Sbjct:   131 KAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVG 189

Query:   207 RRDLEPAWDLH-GKYS-TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAP 263
             R   E   +L  G+ + T ++  EA+D I   H+   P FLY A  ATH+     P+ A 
Sbjct:   190 RFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVYAS 244

Query:   264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXX 323
               +L          R ++   + ++D+SVGK++  L+   +  N+ + F SD        
Sbjct:   245 KQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISA 298

Query:   324 XXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                  SN P    K T +EGG+R   + W P   + G V+ Q   + D   T LS A
Sbjct:   299 PKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLA 355


>UNIPROTKB|Q8WNQ7 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9823 "Sus scrofa" [GO:0005764 "lysosome" evidence=IEA]
            [GO:0043890 "N-acetylgalactosamine-6-sulfatase activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 HSSP:P15289 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
            GO:GO:0043890 EMBL:AF322917 RefSeq:NP_999120.1 UniGene:Ssc.4371
            ProteinModelPortal:Q8WNQ7 STRING:Q8WNQ7 GeneID:397000
            KEGG:ssc:397000 ArrayExpress:Q8WNQ7 Uniprot:Q8WNQ7
        Length = 522

 Score = 422 (153.6 bits), Expect = 1.1e-38, P = 1.1e-38
 Identities = 113/354 (31%), Positives = 175/354 (49%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct:    12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELG 153
             Y    LC+PSR+A++TG+ PI TG      H    Y  +   GG+P  E +LP+ LK  G
Sbjct:    72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + W +  R  
Sbjct:   132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFY 190

Query:   210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
              E   +L  G+ + T ++  EA+D I    +T  P FLY A  ATH+     P+ A   +
Sbjct:   191 EEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA-----PVYASRAF 245

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L   +      R ++   + ++D+SVG++V  L   ++  N+ + F SD           
Sbjct:   246 LGTSQ------RGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQ 299

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
               SN P    K T +EGG+R   + W P     G V+ Q   V D   T LS A
Sbjct:   300 GGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLA 353


>UNIPROTKB|F1MU84 [details] [associations]
            symbol:GALNS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:DAAA02046255 IPI:IPI00703141
            Ensembl:ENSBTAT00000006001 OMA:DDQVGIL Uniprot:F1MU84
        Length = 527

 Score = 416 (151.5 bits), Expect = 4.7e-38, P = 4.7e-38
 Identities = 111/358 (31%), Positives = 172/358 (48%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  N+
Sbjct:    17 LLLVLSAAELGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 76

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
             YT   LC+PSR+A++TG+ PI +G      H  N     E  GG+P SE +LP  LK  G
Sbjct:    77 YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 136

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + + RD E  
Sbjct:   137 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 191

Query:   214 WDLHGKYS----------TDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQA 262
                + ++           T ++  EA++ I    +   P FLY A  ATH+     P+ A
Sbjct:   192 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PIYA 246

Query:   263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXX 322
                +L   +      R ++   + +LD+SVG+++  L    +  N+ + F SD       
Sbjct:   247 SKPFLGTSQ------RGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 300

Query:   323 XXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
                   SN P    K T +EGG+R   + W P     G V+ Q   + D   T LS A
Sbjct:   301 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLA 358


>ZFIN|ZDB-GENE-070112-1152 [details] [associations]
            symbol:galns "galactosamine (N-acetyl)-6-sulfate
            sulfatase" species:7955 "Danio rerio" [GO:0008152 "metabolic
            process" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 ZFIN:ZDB-GENE-070112-1152 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CR376726 EMBL:BX248306
            EMBL:CR388041 IPI:IPI01023807 ProteinModelPortal:F8W261
            Ensembl:ENSDART00000149478 ArrayExpress:F8W261 Bgee:F8W261
            Uniprot:F8W261
        Length = 514

 Score = 391 (142.7 bits), Expect = 9.8e-37, Sum P(2) = 9.8e-37
 Identities = 105/343 (30%), Positives = 167/343 (48%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
             +SG P+II +L DD+GW D+G  G     TP +D +A  G++  N+YT   LC+PSR+A+
Sbjct:    18 TSGSPNIIIMLMDDMGWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAAL 77

Query:   115 MTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
             +TG+ P+  G      H    Y  +   GG+   E +LP+ LK   Y ++IVGKWHLG +
Sbjct:    78 LTGRLPVRNGFYTTNAHARNAYTPQEIVGGISADEILLPELLKNKHYVSKIVGKWHLG-H 136

Query:   168 KKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYS---- 221
             + +Y P   GF+   G    H   Y D S   + ++   +M+      ++++ K      
Sbjct:   137 RTQYLPLKHGFDEWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNL 196

Query:   222 TDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             T ++  E +D I   +    P FLY A  ATH+     P+ A   +L         +R +
Sbjct:   197 TQLYLKEGLDFISQQAMAQRPFFLYWAPDATHA-----PVYASKPFLG------KSQRGR 245

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
             +   + +LD+S+G+++  L    + +++++ F SD             SN P    K T 
Sbjct:   246 YGDAVMELDDSIGQILAHLVSLGIQNDTLVFFTSDNGAALMSGPLQSGSNAPFLCGKETT 305

Query:   341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
             +EGG+R   + W P     G V+ Q   V D   T LS A  S
Sbjct:   306 FEGGMREPAMAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVS 348

 Score = 38 (18.4 bits), Expect = 9.8e-37, Sum P(2) = 9.8e-37
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query:   670 EPQIAPCLFDIKNDPCEKNNLADRSEDQR--INHYTTEVGRFNQIAYP 715
             E  + P +F +  DP E+  L+ + ++ R      T  V +  ++  P
Sbjct:   426 EHTMQPLIFHLGRDPGERYPLSVQCKEYRDVFRRVTAVVEQHQKLLIP 473


>UNIPROTKB|F1RL71 [details] [associations]
            symbol:F1RL71 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:CU914366
            Ensembl:ENSSSCT00000015793 Uniprot:F1RL71
        Length = 561

 Score = 205 (77.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 38/69 (55%), Positives = 49/69 (71%)

Query:    57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
             S  PHIIFIL DD G++DVG+HG D I TP +D LA  G+ L+NYY   +CTPSRS ++T
Sbjct:    44 SQQPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102

Query:   117 GKHPIHTGM 125
             G H +  G+
Sbjct:   103 GSHSLDRGL 111

 Score = 199 (75.1 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 42/102 (41%), Positives = 60/102 (58%)

Query:   279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKN 338
             +K+AA++  +DE+V  +  AL+     +NS+I+F SD             SNWPLRG K 
Sbjct:   252 AKYAAMVTCMDEAVRNITGALKYG-FYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKG 306

Query:   339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             T WEGGVRG G + SPLL+     +   +H++DW PTL+  A
Sbjct:   307 TYWEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLVGLA 348

 Score = 77 (32.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S    S R  ILHNID
Sbjct:   358 LDGYDVWPAISEGRASPRTEILHNID 383

 Score = 77 (32.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S    S R  ILHNID
Sbjct:   358 LDGYDVWPAISEGRASPRTEILHNID 383

 Score = 54 (24.1 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query:   859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
             PDV+  +   L + NRTA+ P+  P +      +F+  AW  +  D
Sbjct:   472 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 516

 Score = 47 (21.6 bits), Expect = 8.2e-33, Sum P(4) = 8.2e-33
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
             LF+I  DP E+ +LA +  D  +      +  +N+ A P
Sbjct:   454 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 491

 Score = 44 (20.5 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:   783 LFDIKNDPCEKNNLA-DRSEVQR 804
             LF+I  DP E+ +LA  R +V R
Sbjct:   454 LFNISADPYEREDLAGQRPDVVR 476

 Score = 39 (18.8 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
 Identities = 18/75 (24%), Positives = 33/75 (44%)

Query:    76 GFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLCTPSRSAI---MTGKHPIHTGMQHNVLY 131
             G H LD+ +P      L+ S I+       Q     +S     +TG   +  G+  ++ +
Sbjct:   103 GSHSLDRGLPRLQPRELSPSCILTTKTALSQRTRNRKSPAGTRLTGVRDLGPGLTRSLPW 162

Query:   132 GCERGGL--PLSEKI 144
             G  RGG+  P  +++
Sbjct:   163 GRGRGGVLTPCGDEV 177


>UNIPROTKB|Q08DD1 [details] [associations]
            symbol:ARSA "Arylsulfatase A" species:9913 "Bos taurus"
            [GO:0005509 "calcium ion binding" evidence=ISS] [GO:0005764
            "lysosome" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0004098 "cerebroside-sulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005886 GO:GO:0005509 GO:GO:0005764
            GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649 EMBL:BC123816
            IPI:IPI00713745 RefSeq:NP_001068673.1 UniGene:Bt.1076
            ProteinModelPortal:Q08DD1 SMR:Q08DD1 STRING:Q08DD1 PRIDE:Q08DD1
            Ensembl:ENSBTAT00000021364 GeneID:505514 KEGG:bta:505514 CTD:410
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InParanoid:Q08DD1 KO:K01134 OMA:FGPSQMA
            OrthoDB:EOG4MKNG4 NextBio:20867174 GO:GO:0004098 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 Uniprot:Q08DD1
        Length = 507

 Score = 358 (131.1 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
 Identities = 115/363 (31%), Positives = 169/363 (46%)

Query:    44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             +TL++     +A++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y 
Sbjct:     5 WTLTLALAAGLAAASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV 64

Query:   104 -VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
              V LCTPSR+A++TG+ P+  G+   VL    RGGLPL E  L + L   GY T I GKW
Sbjct:    65 PVSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKW 124

Query:   163 HLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSA--EEMKMWGL-------D 205
             HLG   +  + P   GF   LG    H          F  +   E +   GL       +
Sbjct:   125 HLGVGPEGAFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLAN 184

Query:   206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPD 264
             +  + +P W L G  +   + A A D++ +      P FLY A   TH    +     P 
Sbjct:   185 LSVEAQPPW-LPGLEAR--YVAFARDLMTDAQHQGRPFFLYYASHHTHYPQ-FSGQSFPG 240

Query:   265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
             H            R  F   L +LD +VG ++ A+    +L  +++ F +D         
Sbjct:   241 HS----------GRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRMS 290

Query:   325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                 S   LR  K T +EGGVR   L + P   + G+  E    + D LPTL + A  + 
Sbjct:   291 HGGCSGL-LRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-AQ 347

Query:   385 IPN 387
             +PN
Sbjct:   348 LPN 350

 Score = 53 (23.7 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query:   675 PCLFDIKNDPCEKNNLADRSED 696
             P LFD+  DP E  NL D  ++
Sbjct:   426 PLLFDLSEDPGENYNLLDSVDE 447

 Score = 52 (23.4 bits), Expect = 2.8e-33, Sum P(3) = 2.8e-33
 Identities = 10/18 (55%), Positives = 11/18 (61%)

Query:   781 PCLFDIKNDPCEKNNLAD 798
             P LFD+  DP E  NL D
Sbjct:   426 PLLFDLSEDPGENYNLLD 443

 Score = 44 (20.5 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
 Identities = 15/59 (25%), Positives = 31/59 (52%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHN--IDDEWQ-ISALTRGKWK--LVKENSINGNGTSE 563
             +DG+D+  +L     S R+T+       DE + + A+  GK+K     + S++ + T++
Sbjct:   353 LDGVDLSPLLLGTGKSPRHTLFFYSAYPDEVRGVFAVRSGKYKAHFFTQGSVHSDTTAD 411


>UNIPROTKB|F1S6M1 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9823 "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:FP102571
            Ensembl:ENSSSCT00000002935 OMA:HISAGQX ArrayExpress:F1S6M1
            Uniprot:F1S6M1
        Length = 305

 Score = 363 (132.8 bits), Expect = 2.5e-32, P = 2.5e-32
 Identities = 94/289 (32%), Positives = 151/289 (52%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   LS   + +  +  PP+I+ +L DD+GW D+G +G     TPN+D +A  G++  ++
Sbjct:    12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
             Y    LC+PSR+A++TG+ PI TG      H  N     E  GG+P  E +LP+ LK  G
Sbjct:    72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
             Y ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ A       + W +  R  
Sbjct:   132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFY 190

Query:   210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
              E   +L  G+ + T ++  EA+D I    +T  P FLY A  ATH+     P+ A   +
Sbjct:   191 EEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA-----PVYASRAF 245

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             L   +      R ++   + ++D+SVG++V  L   ++  N+ + F SD
Sbjct:   246 LGTSQ------RGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSD 288


>ZFIN|ZDB-GENE-050320-118 [details] [associations]
            symbol:arsa "arylsulfatase A" species:7955 "Danio
            rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-050320-118
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 OrthoDB:EOG4MKNG4 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:CR936412
            IPI:IPI00488891 UniGene:Dr.91521 SMR:A5WV48
            Ensembl:ENSDART00000140193 Uniprot:A5WV48
        Length = 503

 Score = 345 (126.5 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
 Identities = 112/364 (30%), Positives = 164/364 (45%)

Query:    47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQ 105
             +++    V +S PP+ + + ADDLG+ D+G  G     TPN+D LA +G+   ++Y T  
Sbjct:    12 ALIAAHCVGAS-PPNFVLLFADDLGYGDLGCFGHPCSLTPNLDRLAANGLRFTDFYVTSP 70

Query:   106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +C+PSR+A++TG++   +G+   VLY   RGGLPL+E  + + LK  GY T IVGKWHLG
Sbjct:    71 VCSPSRAALLTGRYQTRSGIYPGVLYPGSRGGLPLNETTIAEVLKTQGYSTAIVGKWHLG 130

Query:   166 F-YKKEYTPTFRGFESHLGYWTGHQD----YFDHSAEEMKMWGL-DMRRDLEPAW--DLH 217
                   Y PT  GF+S+LG    H             ++K +GL D      P    ++ 
Sbjct:   131 VGLNGTYLPTRHGFDSYLGIPYSHDQGPCQNLSCFPPDVKCFGLCDQGVVTVPLLFNEII 190

Query:   218 GKYSTDVFTAE------AVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
              +   D    E      A   I +   D  P FLY     TH    Y P  A   Y    
Sbjct:   191 KQQPADFLQLEKAYGEFASQFISDSVKDNRPFFLYYPSHHTH----Y-PQYAGADYAG-- 243

Query:   271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
                    R  F   L + D +VGK+++ LE+  +++N++I F  D             + 
Sbjct:   244 ----KSPRGPFGDALMEFDGTVGKILQTLEETGVINNTLIFFTGDNGPELMRKSRGGNAG 299

Query:   331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
                 G K T +EGG+R   +   P     G V        D LPT    A  + +P    
Sbjct:   300 LMKCG-KGTTYEGGMREPAIAHWPGFIKPG-VTRALASSLDILPTFAKLAG-APLPEVQL 356

Query:   391 STVE 394
               VE
Sbjct:   357 DGVE 360

 Score = 56 (24.8 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
 Identities = 14/62 (22%), Positives = 27/62 (43%)

Query:   509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
             ++DG+++  +L    PSKR T+ +   D      +   +W+  K +           + D
Sbjct:   355 QLDGVEMTDILFNLGPSKRQTMFYYPTDPSVKYGVFAVRWENFKAHYYTRGAAHSESTPD 414

Query:   569 NS 570
             NS
Sbjct:   415 NS 416

 Score = 49 (22.3 bits), Expect = 5.3e-31, Sum P(3) = 5.3e-31
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILH 597
             ++DG+++  +L    PSKR T+ +
Sbjct:   355 QLDGVEMTDILFNLGPSKRQTMFY 378

 Score = 45 (20.9 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
 Identities = 8/16 (50%), Positives = 11/16 (68%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P LF+++ DP E  NL
Sbjct:   429 PLLFNLETDPSENYNL 444

 Score = 45 (20.9 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
 Identities = 8/16 (50%), Positives = 11/16 (68%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P LF+++ DP E  NL
Sbjct:   429 PLLFNLETDPSENYNL 444


>UNIPROTKB|Q32KK2 [details] [associations]
            symbol:Arsa "Arylsulfatase A" species:10116 "Rattus
            norvegicus" [GO:0004098 "cerebroside-sulfatase activity"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
            sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 RGD:1310381 GO:GO:0005886
            GO:GO:0005509 GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
            CTD:410 eggNOG:COG3119 GeneTree:ENSGT00560000076940
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA
            OrthoDB:EOG4MKNG4 GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 EMBL:CH474027 EMBL:BN000735 IPI:IPI00361483
            RefSeq:NP_001030105.2 UniGene:Rn.23323 SMR:Q32KK2 IntAct:Q32KK2
            STRING:Q32KK2 Ensembl:ENSRNOT00000017783 GeneID:315222
            KEGG:rno:315222 InParanoid:Q32KK2 NextBio:668936
            Genevestigator:Q32KK2 Uniprot:Q32KK2
        Length = 507

 Score = 339 (124.4 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
 Identities = 109/363 (30%), Positives = 166/363 (45%)

Query:    45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
             TL +     ++++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct:     6 TLVLALAAGLSTASPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65

Query:   104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             V LCTPSR+A++TG+ P+ +GM   VL    +GGLPL E  L + L   GY T + GKWH
Sbjct:    66 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125

Query:   164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
             LG   +  + P  +GF   LG    H     Q+      +     G D           +
Sbjct:   126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDITCSGGCDQGLVPIPLLANL 185

Query:   207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
               + +P W   L  +Y +  F+ + +          P FLY A   TH    Y       
Sbjct:   186 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 237

Query:   265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
                      +   R  F   L +LD +VG ++ A+    +L  ++++F +D         
Sbjct:   238 F-------TKRSGRGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNGPELMRMS 290

Query:   325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                 S   LR  K T +EGGVR   L++ P   + G+  E    + D LPTL +A   + 
Sbjct:   291 DGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 347

Query:   385 IPN 387
             +PN
Sbjct:   348 LPN 350

 Score = 56 (24.8 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query:   675 PCLFDIKNDPCEKNNLADRSED 696
             P L+D+  DP E  NL D +E+
Sbjct:   426 PLLYDLSKDPGENYNLLDSTEE 447

 Score = 54 (24.1 bits), Expect = 1.7e-30, Sum P(3) = 1.7e-30
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E  NL D +E
Sbjct:   426 PLLYDLSKDPGENYNLLDSTE 446

 Score = 45 (20.9 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHN--IDDEWQ-ISALTRGKWK 549
             +DG+D+  +L     S RN++       DE   + A+  GK+K
Sbjct:   353 LDGVDISPLLLGTGKSPRNSVFFYPPFPDEIHGVFAVRNGKYK 395

 Score = 38 (18.4 bits), Expect = 5.6e-30, Sum P(3) = 5.6e-30
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTI 595
             +DG+D+  +L     S RN++
Sbjct:   353 LDGVDISPLLLGTGKSPRNSV 373


>RGD|1310381 [details] [associations]
            symbol:Arsa "arylsulfatase A" species:10116 "Rattus norvegicus"
            [GO:0001669 "acrosomal vesicle" evidence=IDA] [GO:0004065
            "arylsulfatase activity" evidence=IDA] [GO:0005509 "calcium ion
            binding" evidence=ISO] [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0005768 "endosome" evidence=IDA]
            [GO:0005886 "plasma membrane" evidence=ISO] [GO:0006914 "autophagy"
            evidence=IDA] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=ISO] [GO:0007417 "central nervous system development"
            evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
            [GO:0009268 "response to pH" evidence=IDA] [GO:0016021 "integral to
            membrane" evidence=ISO] [GO:0031232 "extrinsic to external side of
            plasma membrane" evidence=IDA] [GO:0043627 "response to estrogen
            stimulus" evidence=IDA] [GO:0045471 "response to ethanol"
            evidence=IDA] [GO:0051597 "response to methylmercury" evidence=IDA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1310381 GO:GO:0005615 GO:GO:0045471 GO:GO:0005768
            GO:GO:0001669 GO:GO:0006914 GO:GO:0007584 GO:GO:0005509
            GO:GO:0007417 GO:GO:0005764 GO:GO:0009268 GO:GO:0007339
            GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0031232
            GO:GO:0051597 HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 IPI:IPI00361483 UniGene:Rn.23323
            EMBL:BC105852 ProteinModelPortal:Q3KR80 SMR:Q3KR80 IntAct:Q3KR80
            STRING:Q3KR80 ArrayExpress:Q3KR80 Genevestigator:Q3KR80
            Uniprot:Q3KR80
        Length = 497

 Score = 339 (124.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
 Identities = 109/363 (30%), Positives = 166/363 (45%)

Query:    45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
             TL +     ++++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct:     6 TLVLALAAGLSTASPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65

Query:   104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             V LCTPSR+A++TG+ P+ +GM   VL    +GGLPL E  L + L   GY T + GKWH
Sbjct:    66 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125

Query:   164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
             LG   +  + P  +GF   LG    H     Q+      +     G D           +
Sbjct:   126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDITCSGGCDQGLVPIPLLANL 185

Query:   207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
               + +P W   L  +Y +  F+ + +          P FLY A   TH    Y       
Sbjct:   186 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 237

Query:   265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
                      +   R  F   L +LD +VG ++ A+    +L  ++++F +D         
Sbjct:   238 F-------TKRSGRGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNGPELMRMS 290

Query:   325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                 S   LR  K T +EGGVR   L++ P   + G+  E    + D LPTL +A   + 
Sbjct:   291 DGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 347

Query:   385 IPN 387
             +PN
Sbjct:   348 LPN 350

 Score = 56 (24.8 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query:   675 PCLFDIKNDPCEKNNLADRSED 696
             P L+D+  DP E  NL D +E+
Sbjct:   416 PLLYDLSKDPGENYNLLDSTEE 437

 Score = 54 (24.1 bits), Expect = 4.5e-30, Sum P(3) = 4.5e-30
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E  NL D +E
Sbjct:   416 PLLYDLSKDPGENYNLLDSTE 436

 Score = 38 (18.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query:   510 IDGIDVWSVLSRNEPSKRNTI 530
             +DG+D+  +L     S RN++
Sbjct:   353 LDGVDISPLLLGTGKSPRNSV 373

 Score = 38 (18.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
 Identities = 7/21 (33%), Positives = 13/21 (61%)

Query:   575 IDGIDVWSVLSRNEPSKRNTI 595
             +DG+D+  +L     S RN++
Sbjct:   353 LDGVDISPLLLGTGKSPRNSV 373


>UNIPROTKB|P15289 [details] [associations]
            symbol:ARSA "Arylsulfatase A" species:9606 "Homo sapiens"
            [GO:0005509 "calcium ion binding" evidence=IDA] [GO:0004065
            "arylsulfatase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IDA] [GO:0004098 "cerebroside-sulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0043687 "post-translational protein modification" evidence=TAS]
            [GO:0044267 "cellular protein metabolic process" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886
            GO:GO:0044281 GO:GO:0006644 GO:GO:0005509 GO:GO:0005788
            GO:GO:0007339 GO:GO:0043687 GO:GO:0043202 Gene3D:3.40.720.10
            SUPFAM:SSF53649 CTD:410 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 KO:K01134 OrthoDB:EOG4MKNG4 GO:GO:0004098
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 EMBL:X52151
            EMBL:X52150 EMBL:AB448736 EMBL:CR456383 EMBL:AK315011 EMBL:AY271820
            EMBL:U62317 EMBL:BC014210 IPI:IPI00744184 PIR:S11031
            RefSeq:NP_000478.3 RefSeq:NP_001078894.2 RefSeq:NP_001078895.2
            RefSeq:NP_001078896.2 RefSeq:NP_001078897.1 UniGene:Hs.731715
            UniGene:Hs.88251 PDB:1AUK PDB:1E1Z PDB:1E2S PDB:1E33 PDB:1E3C
            PDB:1N2K PDB:1N2L PDB:2AIJ PDB:2AIK PDBsum:1AUK PDBsum:1E1Z
            PDBsum:1E2S PDBsum:1E33 PDBsum:1E3C PDBsum:1N2K PDBsum:1N2L
            PDBsum:2AIJ PDBsum:2AIK ProteinModelPortal:P15289 SMR:P15289
            IntAct:P15289 STRING:P15289 GlycoSuiteDB:P15289 PaxDb:P15289
            PRIDE:P15289 DNASU:410 Ensembl:ENST00000547307
            Ensembl:ENST00000547805 GeneID:410 KEGG:hsa:410 UCSC:uc003bna.4
            GeneCards:GC22M051063 HGNC:HGNC:713 HPA:CAB025183 HPA:HPA005554
            MIM:250100 MIM:272200 MIM:607574 neXtProt:NX_P15289 Orphanet:512
            Orphanet:751 PharmGKB:PA25005 InParanoid:P15289 PhylomeDB:P15289
            BRENDA:3.1.6.8 ChEMBL:CHEMBL2193 DrugBank:DB01141
            EvolutionaryTrace:P15289 GenomeRNAi:410 NextBio:1725
            PMAP-CutDB:P15289 ArrayExpress:P15289 Bgee:P15289 CleanEx:HS_ARSA
            Genevestigator:P15289 GermOnline:ENSG00000100299 GO:GO:0004065
            GO:GO:0006687 Uniprot:P15289
        Length = 507

 Score = 349 (127.9 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 111/353 (31%), Positives = 164/353 (46%)

Query:    54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
             +A + PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  V LCTPSR+
Sbjct:    15 LAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRA 74

Query:   113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EY 171
             A++TG+ P+  GM   VL    RGGLPL E  + + L   GY T + GKWHLG   +  +
Sbjct:    75 ALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAF 134

Query:   172 TPTFRGFESHLGYWTGHQD-------YFDHSAE-----EMKMWGLDMRRDL----EPAWD 215
              P  +GF   LG    H          F  +       +  +  + +  +L    +P W 
Sbjct:   135 LPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPW- 193

Query:   216 LHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
             L G  +   + A A D++ +    D P FLY A   TH    Y                E
Sbjct:   194 LPGLEAR--YMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSGQSF-------AE 240

Query:   275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR 334
                R  F   L +LD +VG ++ A+    +L  ++++F +D             S   LR
Sbjct:   241 RSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LR 299

Query:   335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
               K T +EGGVR   L + P   + G+  E    + D LPTL + A  + +PN
Sbjct:   300 CGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350

 Score = 44 (20.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P L+D+  DP E  NL
Sbjct:   426 PLLYDLSKDPGENYNL 441

 Score = 44 (20.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P L+D+  DP E  NL
Sbjct:   426 PLLYDLSKDPGENYNL 441


>UNIPROTKB|F6PKZ1 [details] [associations]
            symbol:ARSA "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 OMA:FGPSQMA OrthoDB:EOG4MKNG4 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03007117
            Ensembl:ENSCAFT00000000876 Uniprot:F6PKZ1
        Length = 508

 Score = 344 (126.2 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
 Identities = 117/369 (31%), Positives = 173/369 (46%)

Query:    38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
             A+ PL + L++     +A++GPP+I+ I ADDLG+ D+G +G     TPN+D LA  G+ 
Sbjct:     1 AMGPL-WALALASAVGLAAAGPPNIVLIFADDLGYGDLGCYGHPSSATPNLDQLAAGGLR 59

Query:    98 LKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
               ++Y    LCTPSR+A++TG+ P+  G+   VL    RGGLPL E  L + L   GY T
Sbjct:    60 FTDFYVPTSLCTPSRAALLTGRLPVRMGLYPGVLEPGSRGGLPLEEVTLAEVLAARGYLT 119

Query:   157 RIVGKWHLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSAE-----EMKMWG 203
              I GKWHLG      + P  +GF   LG    H          F  S       +  +  
Sbjct:   120 GIAGKWHLGVGPDGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVP 179

Query:   204 LDMRRDL----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
             + +  +L    +P W L G  +   + A A D++ +      P FLY A   TH    Y 
Sbjct:   180 IPLLANLSVEAQPPW-LPGLEAR--YVAFARDLMADAQRQGLPFFLYYASHHTH----Y- 231

Query:   259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
             P Q      + H       R  F   L +LD +VG ++ A+    +L  ++++F +D   
Sbjct:   232 P-QFGGQSFSGHSG-----RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNGP 285

Query:   319 XXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
                       S   LR  K T ++GGVR   L + P   + G+  E    + D LPTL S
Sbjct:   286 ETMRMSHGGCSGL-LRCGKGTTFDGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAS 343

Query:   379 AANKSDIPN 387
                 + +PN
Sbjct:   344 LTG-APLPN 351

 Score = 47 (21.6 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P LFD+  DP E  NL
Sbjct:   427 PLLFDLSEDPGENYNL 442

 Score = 47 (21.6 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P LFD+  DP E  NL
Sbjct:   427 PLLFDLSEDPGENYNL 442


>MGI|MGI:88077 [details] [associations]
            symbol:Arsa "arylsulfatase A" species:10090 "Mus musculus"
            [GO:0001669 "acrosomal vesicle" evidence=ISO] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=ISO] [GO:0004098 "cerebroside-sulfatase
            activity" evidence=IEA] [GO:0005509 "calcium ion binding"
            evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764 "lysosome"
            evidence=ISO] [GO:0005768 "endosome" evidence=ISO] [GO:0005886
            "plasma membrane" evidence=IDA] [GO:0006914 "autophagy"
            evidence=ISO] [GO:0007339 "binding of sperm to zona pellucida"
            evidence=IMP] [GO:0007417 "central nervous system development"
            evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
            pH" evidence=ISO] [GO:0016021 "integral to membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0031232
            "extrinsic to external side of plasma membrane" evidence=ISO]
            [GO:0043627 "response to estrogen stimulus" evidence=ISO]
            [GO:0045471 "response to ethanol" evidence=ISO] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0051597 "response to methylmercury"
            evidence=ISO] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88077 GO:GO:0016021
            GO:GO:0005886 GO:GO:0005509 GO:GO:0005764 GO:GO:0007339
            EMBL:CH466550 Gene3D:3.40.720.10 SUPFAM:SSF53649 CTD:410
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA OrthoDB:EOG4MKNG4
            GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            EMBL:X73230 EMBL:X73231 EMBL:AK004540 EMBL:AK132501 EMBL:BC011284
            EMBL:BC098075 EMBL:M82876 IPI:IPI00118039 PIR:A54190
            RefSeq:NP_033843.2 UniGene:Mm.620 ProteinModelPortal:P50428
            SMR:P50428 IntAct:P50428 STRING:P50428 PaxDb:P50428 PRIDE:P50428
            Ensembl:ENSMUST00000165199 GeneID:11883 KEGG:mmu:11883
            InParanoid:Q9DC66 SABIO-RK:P50428 NextBio:279915 Bgee:P50428
            CleanEx:MM_ARSA Genevestigator:P50428 GermOnline:ENSMUSG00000022620
            GO:GO:0008484 Uniprot:P50428
        Length = 506

 Score = 335 (123.0 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
 Identities = 108/363 (29%), Positives = 165/363 (45%)

Query:    45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
             TL +     ++++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y  
Sbjct:     5 TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYVP 64

Query:   104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             V LCTPSR+A++TG+ P+ +GM   VL    +GGLPL E  L + L   GY T + GKWH
Sbjct:    65 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 124

Query:   164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
             LG   +  + P  +GF   LG    H     Q+      +     G D           +
Sbjct:   125 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLANL 184

Query:   207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
               + +P W   L  +Y +  F+ + +          P FLY A   TH    Y       
Sbjct:   185 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 236

Query:   265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
                      +   R  F   L +LD +VG ++  +    +L  ++++F +D         
Sbjct:   237 F-------TKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMS 289

Query:   325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
                 S   LR  K T +EGGVR   L++ P   + G+  E    + D LPTL +A   + 
Sbjct:   290 NGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 346

Query:   385 IPN 387
             +PN
Sbjct:   347 LPN 349

 Score = 44 (20.5 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P L+D+  DP E  N+ +  E
Sbjct:   425 PLLYDLSQDPGENYNVLESIE 445

 Score = 44 (20.5 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E  N+ +  E
Sbjct:   425 PLLYDLSQDPGENYNVLESIE 445


>RGD|1304917 [details] [associations]
            symbol:Arse "arylsulfatase E (chondrodysplasia punctata 1)"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1304917
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 KO:K12374 CTD:415
            OMA:CHIVALA EMBL:BN000737 IPI:IPI00367421 RefSeq:NP_001041350.1
            UniGene:Rn.79118 STRING:Q32KK0 Ensembl:ENSRNOT00000033080
            GeneID:310326 KEGG:rno:310326 UCSC:RGD:1304917 InParanoid:Q32KK0
            NextBio:661844 Genevestigator:Q32KK0 Uniprot:Q32KK0
        Length = 611

 Score = 263 (97.6 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
 Identities = 69/182 (37%), Positives = 93/182 (51%)

Query:    38 AVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
             A L     L + +VD + S  P P+ + I+ADDLG  D+G +G   I TPNID LA  G+
Sbjct:    12 ATLLCIVLLGLQYVDALRSPPPRPNFLIIMADDLGIGDLGCYGNTSIRTPNIDRLAEDGV 71

Query:    97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLK 150
              L  Y   + +CTPSR+A +TG++PI +GM     H VL +    GGLP  E    + L+
Sbjct:    72 RLTQYLAAESVCTPSRAAFLTGRYPIRSGMTSGNGHRVLQWAAGAGGLPPKEITFARILQ 131

Query:   151 ELGYRTRIVGKWHLGFYKKEYT-----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
               GY T +VGKWHLG   +  +     P   GF   LG   G       +    K  GL+
Sbjct:   132 GQGYVTGLVGKWHLGLSCRTVSDLCHHPLNHGFHHFLGLPLGMMGDCAGAEPSEKRAGLE 191

Query:   206 MR 207
              R
Sbjct:   192 RR 193

 Score = 119 (46.9 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
 Identities = 42/160 (26%), Positives = 68/160 (42%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T +   EA D +  H    P  L+L+   TH+     PL     +     H       ++
Sbjct:   274 TPLLLREAKDFLRRHR-HAPFLLFLSLLHTHT-----PLVTSPEFRGRSAH------GRY 321

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX---XXXXXXXXXXSNWPLRGVKN 338
                + ++D  VG+++E LE   +  ++++ F SD                SN   RG K 
Sbjct:   322 GDNVEEMDWVVGQILEVLEHEGLTDSTLVHFTSDNGAWLEAQAGGEQLGGSNGVFRGGKG 381

Query:   339 TL-WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
                WEGG+R  G+   P +  RG V +Q V + D  PT++
Sbjct:   382 MGGWEGGIRVPGVFRWPGVLPRGRVLDQPVSLMDVFPTVV 421

 Score = 40 (19.1 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query:   762 AASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
             AA++ C  V +V  E    P LF++ +DP E   L   +++    + T  +
Sbjct:   500 AAAV-CPCVGKV--EEHDPPLLFELTSDPGEVRPLRAPAKMSEAPNLTAAI 547

 Score = 38 (18.4 bits), Expect = 3.6e-26, Sum P(3) = 3.6e-26
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query:   656 AASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
             AA++ C  V +V  E    P LF++ +DP E
Sbjct:   500 AAAV-CPCVGKV--EEHDPPLLFELTSDPGE 527


>UNIPROTKB|F1Q1V3 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026120 EMBL:AAEX03026121
            Ensembl:ENSCAFT00000017942 Uniprot:F1Q1V3
        Length = 594

 Score = 246 (91.7 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
 Identities = 52/131 (39%), Positives = 73/131 (55%)

Query:    42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             L   L ++  +    + P P+ + ++ADDLG  D G +G   + TPNID LA  G+ L  
Sbjct:    22 LRLLLLLLLCEAQGHAAPRPNFVLLMADDLGIGDPGCYGNTTLRTPNIDRLAAEGVKLTQ 81

Query:   101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGY 154
             +     LCTPSR+A MTG++PI +GM         ++    GGLP SE    + LK  GY
Sbjct:    82 HLAASPLCTPSRAAFMTGRYPIRSGMASQSFIGVFIFSASSGGLPTSEITFAKLLKNQGY 141

Query:   155 RTRIVGKWHLG 165
              T ++GKWHLG
Sbjct:   142 STALIGKWHLG 152

 Score = 125 (49.1 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
 Identities = 42/170 (24%), Positives = 77/170 (45%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   TA+A   I  ++   P  L L++   H+A       +PD +    +H        +
Sbjct:   275 TQRLTADAAQFIRRNA-GTPFLLLLSYLHVHTAL----FSSPD-FAGHSQH------GAY 322

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
                  +LD SVG+++  L++ ++ +N+++ F SD                 SN   +G K
Sbjct:   323 GDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAHVEEVTTKGEVHGGSNGIYKGGK 382

Query:   338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                WEGG+R  G++ W  ++++ G+V ++     D  PT+   A  S +P
Sbjct:   383 ANNWEGGIRIPGILRWPGVIQA-GLVIDEPTSNMDIFPTVAKLAG-SPLP 430

 Score = 56 (24.8 bits), Expect = 4.5e-06, Sum P(4) = 4.5e-06
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             ++    GGLP SE    + LK  GY T ++
Sbjct:   117 IFSASSGGLPTSEITFAKLLKNQGYSTALI 146

 Score = 48 (22.0 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E+  L+  +E
Sbjct:   514 PLLFDVAKDPGERTPLSPATE 534

 Score = 48 (22.0 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E+  L+  +E
Sbjct:   514 PLLFDVAKDPGERTPLSPATE 534

 Score = 43 (20.2 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query:   431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
             H+  P  + + ++  HE+   Y N Y N    +  P+N  +
Sbjct:   438 HDLMPLLQGKTQHSDHEFLFHYCNFYLNAVRWH--PRNSTS 476


>UNIPROTKB|F1MFZ8 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:DAAA02075641
            EMBL:DAAA02075642 EMBL:DAAA02075643 EMBL:DAAA02075644
            EMBL:DAAA02075645 IPI:IPI00693675 UniGene:Bt.63535
            Ensembl:ENSBTAT00000027703 Uniprot:F1MFZ8
        Length = 578

 Score = 252 (93.8 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
 Identities = 55/135 (40%), Positives = 78/135 (57%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
             ++  P+ + ++ADDLG  D G +G   + TPNID LA  G+ L  +     LCTPSR+A 
Sbjct:    18 AASKPNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLARGGVKLTQHLAASPLCTPSRAAF 77

Query:   115 MTGKHPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
             MTG++P+ +GM  Q  V   L+    GGLP SE    + LK+ GY T ++GKWHLG    
Sbjct:    78 MTGRYPVRSGMASQSQVGVFLFSASSGGLPPSEITFAKLLKDQGYSTALIGKWHLGISCH 137

Query:   170 E-----YTPTFRGFE 179
             +     + PT  GF+
Sbjct:   138 DPGDFCHHPTSHGFD 152

 Score = 119 (46.9 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
 Identities = 69/324 (21%), Positives = 129/324 (39%)

Query:   144 ILPQYLKELGYRTRIVGKWHLGFYKKE---YTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
             +LP  L  L   T +V KW LG ++     +   F      LG   G   YF      + 
Sbjct:   182 LLPMQLIALALLTLVVLKW-LGLFRAPPCAFLFLFLLATLLLGLLLGFLHYF----RPLN 236

Query:   201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
              + L   RD+     +     T   TA+A   +  ++ + P  L L+    H+A     L
Sbjct:   237 CF-LMRNRDITQQ-PMSYDNLTQRLTADAAHFLRRNA-ETPFLLVLSFLHMHTA-----L 288

Query:   261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXX 320
              +   +    +H        +     ++D SVG++++ L + ++ +N+++ F SD     
Sbjct:   289 FSSKDFAGKSQH------GSYGDAAEEMDWSVGQILDVLHELKLANNTLVYFSSDQGAHV 342

Query:   321 XXXXXXXX----SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
                         SN   +G K   WEGG+R  G++ W  ++++ G+  ++     D  PT
Sbjct:   343 EEVTVKGEVQGGSNGIYKGGKANNWEGGIRVPGIVRWPGVIQA-GLEIDEPTSNMDIFPT 401

Query:   376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNP 435
             +   A  S +P        +++P  +   +R +   HE+        N+ Y N    + P
Sbjct:   402 VAKLAG-SPLPQDRVIDGRDLMPLLQ---MRTQRSEHEF---LFHYCNS-YLNAVRWHPP 453

Query:   436 KYENRYENGTHEYNPKYENRYENG 459
                + ++     + PK+     NG
Sbjct:   454 NSTSIWK--AFFFTPKFSPEGANG 475

 Score = 58 (25.5 bits), Expect = 2.9e-05, Sum P(3) = 2.9e-05
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             L+    GGLP SE    + LK+ GY T ++
Sbjct:    98 LFSASSGGLPPSEITFAKLLKDQGYSTALI 127

 Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LF+I  DP E+N L    E
Sbjct:   495 PLLFEISRDPRERNPLTPTLE 515

 Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LF+I  DP E+N L    E
Sbjct:   495 PLLFEISRDPRERNPLTPTLE 515

 Score = 44 (20.5 bits), Expect = 1.0e-25, Sum P(3) = 1.0e-25
 Identities = 20/79 (25%), Positives = 31/79 (39%)

Query:   584 LSRNEPSKRNTILHNIDDE-WQISALTXXXXXXXXXXXXMRYQVDLTGGPDQVYLSGLSD 642
             +SR +P +RN +   ++   W+I                 R+   L   P+Q+ L  L  
Sbjct:   500 ISR-DPRERNPLTPTLEPRFWEI--------LEAMQEAAARHARTLQDVPNQLSLGNLMW 550

Query:   643 REWLALAMRKLRDAASIQC 661
             + WL L    L    S QC
Sbjct:   551 KPWLQLCCSSL--GLSCQC 567


>UNIPROTKB|F1Q1V2 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:AAEX03026120
            EMBL:AAEX03026121 Ensembl:ENSCAFT00000017943 Uniprot:F1Q1V2
        Length = 637

 Score = 246 (91.7 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
 Identities = 52/131 (39%), Positives = 73/131 (55%)

Query:    42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             L   L ++  +    + P P+ + ++ADDLG  D G +G   + TPNID LA  G+ L  
Sbjct:     3 LRLLLLLLLCEAQGHAAPRPNFVLLMADDLGIGDPGCYGNTTLRTPNIDRLAAEGVKLTQ 62

Query:   101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGY 154
             +     LCTPSR+A MTG++PI +GM         ++    GGLP SE    + LK  GY
Sbjct:    63 HLAASPLCTPSRAAFMTGRYPIRSGMASQSFIGVFIFSASSGGLPTSEITFAKLLKNQGY 122

Query:   155 RTRIVGKWHLG 165
              T ++GKWHLG
Sbjct:   123 STALIGKWHLG 133

 Score = 125 (49.1 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
 Identities = 42/170 (24%), Positives = 77/170 (45%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   TA+A   I  ++   P  L L++   H+A       +PD +    +H        +
Sbjct:   256 TQRLTADAAQFIRRNA-GTPFLLLLSYLHVHTAL----FSSPD-FAGHSQH------GAY 303

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
                  +LD SVG+++  L++ ++ +N+++ F SD                 SN   +G K
Sbjct:   304 GDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAHVEEVTTKGEVHGGSNGIYKGGK 363

Query:   338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                WEGG+R  G++ W  ++++ G+V ++     D  PT+   A  S +P
Sbjct:   364 ANNWEGGIRIPGILRWPGVIQA-GLVIDEPTSNMDIFPTVAKLAG-SPLP 411

 Score = 56 (24.8 bits), Expect = 6.5e-06, Sum P(4) = 6.5e-06
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             ++    GGLP SE    + LK  GY T ++
Sbjct:    98 IFSASSGGLPTSEITFAKLLKNQGYSTALI 127

 Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E+  L+  +E
Sbjct:   495 PLLFDVAKDPGERTPLSPATE 515

 Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E+  L+  +E
Sbjct:   495 PLLFDVAKDPGERTPLSPATE 515

 Score = 43 (20.2 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query:   431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
             H+  P  + + ++  HE+   Y N Y N    +  P+N  +
Sbjct:   419 HDLMPLLQGKTQHSDHEFLFHYCNFYLNAVRWH--PRNSTS 457


>UNIPROTKB|P08842 [details] [associations]
            symbol:STS "Steryl-sulfatase" species:9606 "Homo sapiens"
            [GO:0007565 "female pregnancy" evidence=IEA] [GO:0016021 "integral
            to membrane" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0016020 "membrane" evidence=TAS] [GO:0005764
            "lysosome" evidence=TAS] [GO:0005768 "endosome" evidence=TAS]
            [GO:0005783 "endoplasmic reticulum" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=TAS]
            [GO:0005794 "Golgi apparatus" evidence=TAS] [GO:0005886 "plasma
            membrane" evidence=TAS] [GO:0006706 "steroid catabolic process"
            evidence=TAS] [GO:0008544 "epidermis development" evidence=TAS]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0004773 "steryl-sulfatase activity" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0005789
            "endoplasmic reticulum membrane" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0016021
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005635 GO:GO:0043588
            GO:GO:0044281 GO:GO:0005789 GO:GO:0046872 GO:GO:0006706
            GO:GO:0008284 GO:GO:0005768 GO:GO:0043434 GO:GO:0006644
            GO:GO:0007565 GO:GO:0005764 GO:GO:0009268 GO:GO:0007611
            GO:GO:0005788 GO:GO:0043627 GO:GO:0043687 GO:GO:0008544
            DrugBank:DB00655 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0006687 OrthoDB:EOG4V4379
            EMBL:J04964 EMBL:M16505 EMBL:AK314034 EMBL:BC075030 EMBL:M23945
            EMBL:M23556 IPI:IPI00307433 PIR:A32641 RefSeq:NP_000342.2
            UniGene:Hs.522578 UniGene:Hs.700558 UniGene:Hs.700559
            UniGene:Hs.740067 PDB:1P49 PDBsum:1P49 ProteinModelPortal:P08842
            SMR:P08842 MINT:MINT-1177440 STRING:P08842 PhosphoSite:P08842
            DMDM:135006 PaxDb:P08842 PRIDE:P08842 Ensembl:ENST00000217961
            GeneID:412 KEGG:hsa:412 UCSC:uc004cry.4 CTD:412
            GeneCards:GC0XP007147 HGNC:HGNC:11425 HPA:HPA002904 MIM:300747
            MIM:308100 neXtProt:NX_P08842 Orphanet:461 PharmGKB:PA36225
            InParanoid:P08842 KO:K01131 OMA:GLSCQCD PhylomeDB:P08842
            BindingDB:P08842 ChEMBL:CHEMBL3559 EvolutionaryTrace:P08842
            GenomeRNAi:412 NextBio:1743 Bgee:P08842 CleanEx:HS_STS
            Genevestigator:P08842 GermOnline:ENSG00000101846 GO:GO:0004773
            Uniprot:P08842
        Length = 583

 Score = 248 (92.4 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
 Identities = 57/135 (42%), Positives = 76/135 (56%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+II ++ADDLG  D G +G   I TPNID LA  G+ L  +     LCTPSR+A MTG+
Sbjct:    27 PNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGR 86

Query:   119 HPIHTGM----QHNV-LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KKE 170
             +P+ +GM    +  V L+    GGLP  E    + LK+ GY T ++GKWHLG     K +
Sbjct:    87 YPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTD 146

Query:   171 YT--PTFRGFESHLG 183
             +   P   GF    G
Sbjct:   147 FCHHPLHHGFNYFYG 161

 Score = 112 (44.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
 Identities = 45/197 (22%), Positives = 82/197 (41%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   T EA   I  + T+ P  L L++   H+A     L +   +    +H        +
Sbjct:   261 TQRLTVEAAQFIQRN-TETPFLLVLSYLHVHTA-----LFSSKDFAGKSQH------GVY 308

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
                + ++D SVG+++  L++ R+ ++++I F SD                 SN   +G K
Sbjct:   309 GDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGK 368

Query:   338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENII 397
                WEGG+R  G++  P +   G   ++     D  PT+   A  + +P        +++
Sbjct:   369 ANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAG-APLPEDRIIDGRDLM 427

Query:   398 PRYENSILRYENGTHEY 414
             P  E    R +   HE+
Sbjct:   428 PLLEGKSQRSD---HEF 441

 Score = 58 (25.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFDI  DP E+N L   SE
Sbjct:   500 PLLFDISKDPRERNPLTPASE 520

 Score = 58 (25.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFDI  DP E+N L   SE
Sbjct:   500 PLLFDISKDPRERNPLTPASE 520

 Score = 54 (24.1 bits), Expect = 3.9e-05, Sum P(4) = 3.9e-05
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             L+    GGLP  E    + LK+ GY T ++
Sbjct:   103 LFTASSGGLPTDEITFAKLLKDQGYSTALI 132

 Score = 39 (18.8 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query:   435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
             P  E + +   HE+   Y N Y N    +  P+N  +
Sbjct:   428 PLLEGKSQRSDHEFLFHYCNAYLNAVRWH--PQNSTS 462


>UNIPROTKB|P25549 [details] [associations]
            symbol:aslA "arylsulfatase" species:83333 "Escherichia coli
            K-12" [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0042597 "periplasmic space" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:U00096
            EMBL:AP009048 GenomeReviews:AP009048_GR GenomeReviews:U00096_GR
            GO:GO:0046872 GO:GO:0042597 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 OMA:FGPSQMA InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 EMBL:M90498 EMBL:M87049 PIR:S30691
            RefSeq:NP_418245.1 RefSeq:YP_491641.1 ProteinModelPortal:P25549
            SMR:P25549 IntAct:P25549 EnsemblBacteria:EBESCT00000000559
            EnsemblBacteria:EBESCT00000017339 GeneID:12933611 GeneID:949015
            KEGG:ecj:Y75_p3377 KEGG:eco:b3801 PATRIC:32123099 EchoBASE:EB0087
            EcoGene:EG10089 HOGENOM:HOG000126460 KO:K01130
            ProtClustDB:CLSK880785 BioCyc:EcoCyc:ARYLSULFAT-MONOMER
            BioCyc:ECOL316407:JW3773-MONOMER BioCyc:MetaCyc:ARYLSULFAT-MONOMER
            Genevestigator:P25549 Uniprot:P25549
        Length = 551

 Score = 319 (117.4 bits), Expect = 1.6e-25, P = 1.6e-25
 Identities = 109/370 (29%), Positives = 173/370 (46%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQI---PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
             P+++  L DD+GW DVGF+G       PTP+IDA+A  G+IL + Y+    +P+R+ I+T
Sbjct:    86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145

Query:   117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT-- 174
             G++ IH G+    +YG + GGL      LPQ L + GY T+ +GKWH+G   KE  P   
Sbjct:   146 GQYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202

Query:   175 ----FRGFESHLGYWTGHQDYF---------DHSAEEMKM--WGLD----MRRDLEPAW- 214
                 FRGF S    +T  +D           D S E +K   +  D    +R   + A  
Sbjct:   203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS-EYIKQLPFSKDDVHAVRGGEQQAIA 261

Query:   215 DLHGKYSTDV---FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             D+  KY  D+   +    V  +   + +D+P FLY      H           D+Y N  
Sbjct:   262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----------DNYPNAK 311

Query:   271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
                    R+ +   + ++++    + + LE+   L N++IVF SD               
Sbjct:   312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369

Query:   331 WPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
              P RG K + WEGGVR    + W  +++ R   ++  V ++D  PT L      D+  + 
Sbjct:   370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTAL------DLAGHP 420

Query:   390 NSTVENIIPR 399
              + V N++P+
Sbjct:   421 GAKVANLVPK 430


>UNIPROTKB|F1NWF7 [details] [associations]
            symbol:ARSA "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
            sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:GFDENTI
            EMBL:AADN02075680 EMBL:AADN02075681 IPI:IPI00584710
            Ensembl:ENSGALT00000015860 Uniprot:F1NWF7
        Length = 493

 Score = 308 (113.5 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
 Identities = 108/377 (28%), Positives = 170/377 (45%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCT-PSRSA 113
             A+ GPP  + +LADDLG+ D+G +G     TPN+  LA +          + C  P R+A
Sbjct:    15 AAGGPPSFVLLLADDLGFGDLGSYGHPSSATPNLSCLARAA-------PYECCPYPCRAA 67

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYT 172
             ++TG+  + +G+   V Y   RGGLPLSE  + + LK  GY T IVGKWHLG   +  + 
Sbjct:    68 LLTGRFQMRSGIYPGVFYPGSRGGLPLSEVTIAEVLKAKGYATAIVGKWHLGLGARGSFL 127

Query:   173 PTFRGFESHLGYWTGHQD----YFDHSAEEMKMWGLDMRRDLEPA---WDLHGKYSTDVF 225
             P  +GF+  LG    H             ++K +G    + L P    W+        V 
Sbjct:   128 PIHQGFDHFLGVPYSHDQGPCQNLTCFPPDIKCFGT-CDQGLVPVPLFWN-QSIVQQPVS 185

Query:   226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH--YLNI--HRHIEDFKRSKF 281
               + V + +  + D     ++A  A     P+    A  H  Y       +    +R  F
Sbjct:   186 FPDLVPLYNKFARD-----FIADCARRGV-PFLLYYASHHTHYPQFASQEYAGRSQRGPF 239

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLW 341
                L + D SVG++++AL++  + + + + F SD             S   L+  K T +
Sbjct:   240 GDALSEFDGSVGQLLQALQENGLENTTFVFFTSDNGPSTMRMARGGSSGL-LKCGKGTTY 298

Query:   342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
             EGG+R   + + P   + G+  E      D LPTL + A  + +PN V+      +  Y+
Sbjct:   299 EGGMREPAVAYWPGRIAPGVTHE-LASTLDILPTLTALAGAA-LPN-VS------LDGYD 349

Query:   402 NSILRYENGTHEYNSPR 418
              S L +E+G     SPR
Sbjct:   350 LSPLLFESG----KSPR 362

 Score = 52 (23.4 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P LFD+++DP E  NL
Sbjct:   418 PLLFDLESDPAENYNL 433

 Score = 52 (23.4 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P LFD+++DP E  NL
Sbjct:   418 PLLFDLESDPAENYNL 433


>UNIPROTKB|F5H325 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HGNC:HGNC:4122 ChiTaRS:Galns IPI:IPI00978346
            ProteinModelPortal:F5H325 SMR:F5H325 Ensembl:ENST00000542788
            ArrayExpress:F5H325 Bgee:F5H325 Uniprot:F5H325
        Length = 447

 Score = 285 (105.4 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
 Identities = 77/251 (30%), Positives = 123/251 (49%)

Query:   136 GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS 195
             GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P   GF+   G    H   +D+ 
Sbjct:    40 GGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNK 98

Query:   196 AEE----MKMWGLDMRRDLEPAWDLH-GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHA 249
             A       + W +  R   E   +L  G+ + T ++  EA+D I   +   P FLY A  
Sbjct:    99 ARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVD 158

Query:   250 ATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
             ATH+     P+ A   +L   +      R ++   + ++D+S+GK++E L+   +  N+ 
Sbjct:   159 ATHA-----PVYASKPFLGTSQ------RGRYGDAVREIDDSIGKILELLQDLHVADNTF 207

Query:   310 IVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
             + F SD             SN P    K T +EGG+R   L W P   + G V+ Q   +
Sbjct:   208 VFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSI 267

Query:   370 SDWLPTLLSAA 380
              D   T L+ A
Sbjct:   268 MDLFTTSLALA 278

 Score = 68 (29.0 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query:    12 GGLPLSEKILPQYLKELGYRTRIM 35
             GG+P SE++LP+ LK+ GY ++I+
Sbjct:    40 GGIPDSEQLLPELLKKAGYVSKIV 63

 Score = 41 (19.5 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query:   667 VPCEPQIAPCLFDIKN--DP-CEK 687
             VP +PQ+  C + + N   P CEK
Sbjct:   405 VPAQPQLNVCNWAVMNWAPPGCEK 428

 Score = 41 (19.5 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query:   773 VPCEPQIAPCLFDIKN--DP-CEK 793
             VP +PQ+  C + + N   P CEK
Sbjct:   405 VPAQPQLNVCNWAVMNWAPPGCEK 428


>UNIPROTKB|F1NGC8 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017431 EMBL:AADN02017432
            EMBL:AADN02017433 IPI:IPI00584657 Ensembl:ENSGALT00000026830
            OMA:HTAMFAS Uniprot:F1NGC8
        Length = 471

 Score = 239 (89.2 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+++ ++ADDLG  D+G +G   + TP+ID LA  G+ L  +     LCTPSR+A +TG+
Sbjct:    38 PNVVLLIADDLGIGDLGCYGNRTLRTPHIDRLAKEGVTLTQHIAASPLCTPSRAAFLTGR 97

Query:   119 HPIHTGMQ--HNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +PI +GM     V   L+    GGLP  E    + LK+ GY T ++GKWHLG
Sbjct:    98 YPIRSGMAAFSRVGVFLFSASSGGLPSEEITFSKLLKQRGYATALIGKWHLG 149

 Score = 127 (49.8 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
 Identities = 49/199 (24%), Positives = 86/199 (43%)

Query:   222 TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             T   T EAV  I  NH+   P  L L++   H+A     L A   +    RH        
Sbjct:   272 TQRLTTEAVRFIERNHNA--PFLLVLSYLHVHTA-----LYASKMFRGKSRH------GL 318

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGV 336
             +   + ++D SVG++++ LE   + + S++ F SD                  N   +G 
Sbjct:   319 YGDAVEEMDWSVGQILDVLENYNLSNRSLVYFSSDQGAHIEEISSSGEVHGGCNGIYKGG 378

Query:   337 KNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
             K+T WEGG+R  GL+ W  ++ + G   +      D  PT++  A  + +P        +
Sbjct:   379 KSTNWEGGIRVPGLLRWPGVIHA-GTYIDDPTSNMDIFPTIVKLAG-AQLPYDRIIDGHD 436

Query:   396 IIPRYENSILRYENGTHEY 414
             ++P  +  ++R +   HE+
Sbjct:   437 LMPLLQGKVIRSK---HEF 452

 Score = 55 (24.4 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             L+    GGLP  E    + LK+ GY T ++
Sbjct:   114 LFSASSGGLPSEEITFSKLLKQRGYATALI 143

 Score = 39 (18.8 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
 Identities = 10/38 (26%), Positives = 16/38 (42%)

Query:   431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
             H+  P  + +     HE+   Y N Y N    +  P+N
Sbjct:   435 HDLMPLLQGKVIRSKHEFLFHYCNAYLNAVRWH--PRN 470


>UNIPROTKB|F6PN86 [details] [associations]
            symbol:ARSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OrthoDB:EOG4V4379 OMA:LKPCCGV
            EMBL:AAEX03026108 Ensembl:ENSCAFT00000017756 Uniprot:F6PN86
        Length = 584

 Score = 236 (88.1 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
 Identities = 53/136 (38%), Positives = 76/136 (55%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ DDLG  D+G  G D I TPNID LA  G+ L ++     +CTPSR+A +TG+
Sbjct:    30 PNIVLMMVDDLGIGDLGCFGNDTIRTPNIDRLAREGVQLNHHIAAASMCTPSRAAFLTGR 89

Query:   119 HPIHTGMQHN------VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-----Y 167
             +PI +GM  N      +  G    GLP +E      LK+ GY T ++GKWH G      Y
Sbjct:    90 YPIRSGMVSNAVDRVIITLGAP-AGLPHNETTFAALLKKQGYSTALIGKWHQGLNCQSRY 148

Query:   168 KKEYTPTFRGFESHLG 183
              + + P   GF+ + G
Sbjct:   149 DQCHHPYHYGFDYYYG 164

 Score = 129 (50.5 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
 Identities = 40/143 (27%), Positives = 66/143 (46%)

Query:   277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--L 333
             K   +   + ++D  VGK+++A++   + + +++ F SD                W    
Sbjct:   307 KHGLYGDNVQEMDSMVGKILDAIDNFHLKNRTLVYFTSDHGGHLESRVGHSQRGGWNGIY 366

Query:   334 RGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
             RG K    WEGG+R  GLI WS  L + G V E+   + D  PTL +A + S +P     
Sbjct:   367 RGGKGMAGWEGGIRVPGLIRWSGRLPA-GKVIEEPTSLMDIFPTL-AAVSGSSVPQDRVI 424

Query:   392 TVENIIPRYENSILRYENGTHEY 414
                N++P  +  + R E   HE+
Sbjct:   425 DGRNLMPLLQGEVQRSE---HEF 444

 Score = 46 (21.3 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E   L   +E
Sbjct:   503 PLLFDLTRDPSESTPLTQDTE 523

 Score = 46 (21.3 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E   L   +E
Sbjct:   503 PLLFDLTRDPSESTPLTQDTE 523

 Score = 42 (19.8 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:    13 GLPLSEKILPQYLKELGYRTRIM 35
             GLP +E      LK+ GY T ++
Sbjct:   113 GLPHNETTFAALLKKQGYSTALI 135

 Score = 38 (18.4 bits), Expect = 2.7e-15, Sum P(2) = 2.7e-15
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query:   389 VNSTVENIIPRYENSILRYENGTHE--YNSP 417
             V  TV N +  +  SIL  +    E  Y+SP
Sbjct:   529 VIQTVANAVKEHRKSILPVQQQLSELNYDSP 559


>ZFIN|ZDB-GENE-030717-5 [details] [associations]
            symbol:sts "steroid sulfatase (microsomal),
            arylsulfatase C, isozyme S" species:7955 "Danio rerio" [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030717-5
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:CT990606
            EMBL:BX901898 IPI:IPI00963580 Ensembl:ENSDART00000075252
            ArrayExpress:F1Q8F9 Bgee:F1Q8F9 Uniprot:F1Q8F9
        Length = 587

 Score = 236 (88.1 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
 Identities = 62/164 (37%), Positives = 86/164 (52%)

Query:    36 AFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
             +F  +P  F L +   D  A SG  P+ +F++ DDLG  D+G +G   + TPNID LA  
Sbjct:    11 SFQWIPCTFCLLLYTAD--AGSGTKPNFVFMMVDDLGIGDLGCYGNTTLRTPNIDRLALE 68

Query:    95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHT---GMQ-HN----VLYGCERGGLPLSEKIL 145
             G+ L  +     LCTPSR+A +TG++PI +   GM  H      L+    GGLP  E   
Sbjct:    69 GVKLTQHIAAAPLCTPSRAAFLTGRYPIRSDAKGMAAHGHMGVFLFSASSGGLPQEEITF 128

Query:   146 PQYLKELGYRTR-IVGKWHLGFYKKE-----YTPTFRGFESHLG 183
              + +K  GY T  IVGKWHLG   ++     + P   GF+   G
Sbjct:   129 AKAVKVQGYSTAVIVGKWHLGLNCEDSSDHCHHPNSHGFDYFYG 172

 Score = 126 (49.4 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
 Identities = 44/193 (22%), Positives = 88/193 (45%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   T+EA++ +  +S + P  L+ +    H+     PL     +    +H        +
Sbjct:   269 TQRMTSEAIEFLERNS-ETPFLLFFSFIQVHTGVFASPL-----FRGRSQH------GLY 316

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV----K 337
                + ++D SVG++++ LE+  +  N+++   SD              +    G+    K
Sbjct:   317 GDAVMEVDWSVGQIMQTLERLNLKDNTLVYMTSDQGPHLEEISVHGEMHGGYSGIYKAGK 376

Query:   338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
             +T WEGG+R  G++ W  +L +  I+ E   ++ D  PT+L+ A  S IP+       ++
Sbjct:   377 STNWEGGIRIPGILSWPGVLPAGNIIDEPTSNM-DIFPTVLNLAGAS-IPDDRVIDGHDL 434

Query:   397 IPRYENSILRYEN 409
             +P  +  + R E+
Sbjct:   435 LPLLQGQVKRSEH 447

 Score = 49 (22.3 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query:   675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
             P L+D+  DP E   L+  +E Q   H   EV R
Sbjct:   508 PLLYDLSKDPTESTPLSPDTEPQF--HSVLEVIR 539

 Score = 48 (22.0 bits), Expect = 4.4e-05, Sum P(3) = 4.4e-05
 Identities = 10/30 (33%), Positives = 15/30 (50%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             L+    GGLP  E    + +K  GY T ++
Sbjct:   113 LFSASSGGLPQEEITFAKAVKVQGYSTAVI 142

 Score = 47 (21.6 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:   781 PCLFDIKNDPCEKNNLADRSEVQ 803
             P L+D+  DP E   L+  +E Q
Sbjct:   508 PLLYDLSKDPTESTPLSPDTEPQ 530


>UNIPROTKB|Q482D2 [details] [associations]
            symbol:CPS_2368 "Putative N-acetylglucosamine-6-sulfatase"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
            "metabolic process" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
            ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
            KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
            ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
            Uniprot:Q482D2
        Length = 537

 Score = 275 (101.9 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 77/230 (33%), Positives = 120/230 (52%)

Query:    40 LPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
             L L F++S +   +  +     ++I+IL DD  +++VGF    +I TPN+D LA  G+  
Sbjct:    15 LSLCFSVSSLSATVNKTVKQKKNVIYILTDDQRYDEVGFLN-PRIDTPNMDKLAAGGVYF 73

Query:    99 KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYR 155
             KN + T  LC+PSR+ I+TG++       HN  +G      P  E  +  P YL+E+GY 
Sbjct:    74 KNAFVTTALCSPSRATILTGQY------MHN--HGVVDNNNPAKESSVYFPSYLQEVGYE 125

Query:   156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
             T   GKWH+G +     P   GF+  L +  G   Y+    ++ +   +++  +     D
Sbjct:   126 TSFFGKWHMGGHGDSPQP---GFDHWLSF-AGQGHYYPKKDKKGRTNKININGERV---D 178

Query:   216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
               G Y TD  T  AVD +    +D+P F+YL+H A HS N ++P  AP H
Sbjct:   179 QKG-YITDELTDYAVDWLDKRDSDKPFFMYLSHKAVHS-N-FDP--APRH 223

 Score = 69 (29.3 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 31/144 (21%), Positives = 64/144 (44%)

Query:   273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
             ++++KR    A L  +D+S+G+V++ L+   + +++I++ + D                 
Sbjct:   272 VQEYKRQYHRA-LSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID----- 325

Query:   333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
                 K   +E  +R   L ++P     G V ++ V   D  PT+L  A     P + +  
Sbjct:   326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAGAKK-PAHFDG- 379

Query:   393 VENIIPRYENSILRY--ENGTHEY 414
              ++ +P  +N  +    EN  +EY
Sbjct:   380 -DSWLPLAKNKEVNQWRENFLYEY 402

 Score = 59 (25.8 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   677 LFDIKNDPCEKNNLADRSEDQ 697
             L+D+KNDP E NNL +  + Q
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQ 456

 Score = 57 (25.1 bits), Expect = 8.3e-25, Sum P(3) = 8.3e-25
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   783 LFDIKNDPCEKNNLADRSEVQ 803
             L+D+KNDP E NNL +  + Q
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQ 456


>TIGR_CMR|CPS_2368 [details] [associations]
            symbol:CPS_2368 "putative N-acetylglucosamine-6-sulfatase"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
            "metabolic process" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
            ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
            KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
            ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
            Uniprot:Q482D2
        Length = 537

 Score = 275 (101.9 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 77/230 (33%), Positives = 120/230 (52%)

Query:    40 LPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
             L L F++S +   +  +     ++I+IL DD  +++VGF    +I TPN+D LA  G+  
Sbjct:    15 LSLCFSVSSLSATVNKTVKQKKNVIYILTDDQRYDEVGFLN-PRIDTPNMDKLAAGGVYF 73

Query:    99 KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYR 155
             KN + T  LC+PSR+ I+TG++       HN  +G      P  E  +  P YL+E+GY 
Sbjct:    74 KNAFVTTALCSPSRATILTGQY------MHN--HGVVDNNNPAKESSVYFPSYLQEVGYE 125

Query:   156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
             T   GKWH+G +     P   GF+  L +  G   Y+    ++ +   +++  +     D
Sbjct:   126 TSFFGKWHMGGHGDSPQP---GFDHWLSF-AGQGHYYPKKDKKGRTNKININGERV---D 178

Query:   216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
               G Y TD  T  AVD +    +D+P F+YL+H A HS N ++P  AP H
Sbjct:   179 QKG-YITDELTDYAVDWLDKRDSDKPFFMYLSHKAVHS-N-FDP--APRH 223

 Score = 69 (29.3 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 31/144 (21%), Positives = 64/144 (44%)

Query:   273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
             ++++KR    A L  +D+S+G+V++ L+   + +++I++ + D                 
Sbjct:   272 VQEYKRQYHRA-LSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID----- 325

Query:   333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
                 K   +E  +R   L ++P     G V ++ V   D  PT+L  A     P + +  
Sbjct:   326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAGAKK-PAHFDG- 379

Query:   393 VENIIPRYENSILRY--ENGTHEY 414
              ++ +P  +N  +    EN  +EY
Sbjct:   380 -DSWLPLAKNKEVNQWRENFLYEY 402

 Score = 59 (25.8 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   677 LFDIKNDPCEKNNLADRSEDQ 697
             L+D+KNDP E NNL +  + Q
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQ 456

 Score = 57 (25.1 bits), Expect = 8.3e-25, Sum P(3) = 8.3e-25
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   783 LFDIKNDPCEKNNLADRSEVQ 803
             L+D+KNDP E NNL +  + Q
Sbjct:   436 LYDLKNDPKEMNNLINTPKHQ 456


>UNIPROTKB|I3LBW8 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:FP102981
            EMBL:FP339595 Ensembl:ENSSSCT00000032160 Uniprot:I3LBW8
        Length = 579

 Score = 242 (90.2 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
 Identities = 50/112 (44%), Positives = 69/112 (61%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ + ++ADDLG  D G +G   + TPNID LA  G+ L  +     LCTPSR+A +TG+
Sbjct:    23 PNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGR 82

Query:   119 HPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +PI +GM  Q+ V   ++    GGLP SE    + LK  GY T ++GKWHLG
Sbjct:    83 YPIRSGMAAQNQVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLG 134

 Score = 116 (45.9 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
 Identities = 40/164 (24%), Positives = 75/164 (45%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   TA+AV  I  ++ + P  L L+    H+A     L +   +    +H        +
Sbjct:   257 TQRLTADAVRFIQRNA-ESPFLLVLSFLHVHTA-----LFSSKIFAGKSKH------GAY 304

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
                  ++D SVG++++ L++ ++ +N++I F SD                 SN   +G K
Sbjct:   305 GDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHGGSNGIYKGGK 364

Query:   338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              T WEGG+R  G++ W  ++++ G+  +      D  PT+ + A
Sbjct:   365 ATNWEGGIRVPGILRWPGVIQA-GLELDAPTSNMDLFPTVANLA 407

 Score = 54 (24.1 bits), Expect = 9.5e-05, Sum P(3) = 9.5e-05
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             ++    GGLP SE    + LK  GY T ++
Sbjct:    99 IFSASSGGLPPSEITFAKLLKSQGYTTALI 128

 Score = 50 (22.7 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFDI  DP E + L   SE
Sbjct:   496 PLLFDISQDPRETDPLTPTSE 516

 Score = 50 (22.7 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFDI  DP E + L   SE
Sbjct:   496 PLLFDISQDPRETDPLTPTSE 516


>UNIPROTKB|K7GLQ3 [details] [associations]
            symbol:STS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 EMBL:FP102981 EMBL:FP339595
            Ensembl:ENSSSCT00000035627 Uniprot:K7GLQ3
        Length = 580

 Score = 242 (90.2 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
 Identities = 50/112 (44%), Positives = 69/112 (61%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ + ++ADDLG  D G +G   + TPNID LA  G+ L  +     LCTPSR+A +TG+
Sbjct:    24 PNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGR 83

Query:   119 HPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +PI +GM  Q+ V   ++    GGLP SE    + LK  GY T ++GKWHLG
Sbjct:    84 YPIRSGMAAQNQVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLG 135

 Score = 116 (45.9 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
 Identities = 40/164 (24%), Positives = 75/164 (45%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T   TA+AV  I  ++ + P  L L+    H+A     L +   +    +H        +
Sbjct:   258 TQRLTADAVRFIQRNA-ESPFLLVLSFLHVHTA-----LFSSKIFAGKSKH------GAY 305

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
                  ++D SVG++++ L++ ++ +N++I F SD                 SN   +G K
Sbjct:   306 GDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHGGSNGIYKGGK 365

Query:   338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              T WEGG+R  G++ W  ++++ G+  +      D  PT+ + A
Sbjct:   366 ATNWEGGIRVPGILRWPGVIQA-GLELDAPTSNMDLFPTVANLA 408

 Score = 54 (24.1 bits), Expect = 9.5e-05, Sum P(3) = 9.5e-05
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query:     6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
             ++    GGLP SE    + LK  GY T ++
Sbjct:   100 IFSASSGGLPPSEITFAKLLKSQGYTTALI 129

 Score = 50 (22.7 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFDI  DP E + L   SE
Sbjct:   497 PLLFDISQDPRETDPLTPTSE 517

 Score = 50 (22.7 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFDI  DP E + L   SE
Sbjct:   497 PLLFDISQDPRETDPLTPTSE 517


>RGD|3783 [details] [associations]
            symbol:Sts "steroid sulfatase (microsomal), isozyme S"
          species:10116 "Rattus norvegicus" [GO:0004773 "steryl-sulfatase
          activity" evidence=IDA] [GO:0005635 "nuclear envelope" evidence=IDA]
          [GO:0005789 "endoplasmic reticulum membrane" evidence=IDA]
          [GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning or
          memory" evidence=IMP] [GO:0008202 "steroid metabolic process"
          evidence=IEA] [GO:0008284 "positive regulation of cell proliferation"
          evidence=IMP] [GO:0008484 "sulfuric ester hydrolase activity"
          evidence=IEA;ISO] [GO:0009268 "response to pH" evidence=IDA]
          [GO:0014070 "response to organic cyclic compound" evidence=IDA]
          [GO:0016021 "integral to membrane" evidence=IDA] [GO:0043231
          "intracellular membrane-bounded organelle" evidence=IDA] [GO:0043434
          "response to peptide hormone stimulus" evidence=IDA] [GO:0043588
          "skin development" evidence=IEP] [GO:0043627 "response to estrogen
          stimulus" evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
          InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
          RGD:3783 GO:GO:0016021 GO:GO:0005635 GO:GO:0043588 GO:GO:0005789
          GO:GO:0008202 GO:GO:0046872 GO:GO:0008284 GO:GO:0043434 GO:GO:0007565
          GO:GO:0009268 GO:GO:0007611 GO:GO:0043627 Gene3D:3.40.720.10
          SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
          HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
          OrthoDB:EOG4V4379 CTD:412 KO:K01131 GO:GO:0004773 EMBL:U37138
          IPI:IPI00210494 PIR:S05414 RefSeq:NP_036793.1 UniGene:Rn.6312
          ProteinModelPortal:P15589 SMR:P15589 STRING:P15589 PRIDE:P15589
          GeneID:24800 KEGG:rno:24800 InParanoid:P15589 BindingDB:P15589
          ChEMBL:CHEMBL3531 NextBio:604458 Genevestigator:P15589
          GermOnline:ENSRNOG00000032487 Uniprot:P15589
        Length = 577

 Score = 251 (93.4 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
 Identities = 61/154 (39%), Positives = 84/154 (54%)

Query:    42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             LA  LS +     A  GP P+ + I+ADDLG  D+G +G   + TP+ID LA  G+ L  
Sbjct:     7 LALLLSQLNFLCAARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQ 66

Query:   101 YYTVQ-LCTPSRSAIMTGKHPIHTGM-QHN----VLYGCERGGLPLSEKILPQYLKELGY 154
             +     LCTPSR+A +TG++P+ +GM  H      L+    GGLP +E    + LK  GY
Sbjct:    67 HLAAAPLCTPSRAAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGY 126

Query:   155 RTRIVGKWHLGFYKKE-----YTPTFRGFESHLG 183
              T +VGKWHLG   +      + P   GF+  LG
Sbjct:   127 TTGLVGKWHLGLSCQAASDFCHHPGRHGFDRFLG 160

 Score = 105 (42.0 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
 Identities = 41/169 (24%), Positives = 72/169 (42%)

Query:   222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
             T    +EA D +  +  D P  L+L+    H+A+   P  A     ++H          +
Sbjct:   260 TQRLASEAGDFLRRNR-DTPFLLFLSFMHVHTAHFANPEFAGQ---SLH--------GAY 307

Query:   282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX----XXXXSNWPLRGVK 337
                + ++D +VG+V+  L++  + +N+++   SD                 SN   RG K
Sbjct:   308 GDAVEEMDWAVGQVLATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGK 367

Query:   338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                WEGG+R  GL+  P +   G   E+     D  PT+   A  +++P
Sbjct:   368 ANTWEGGIRVPGLVRWPGVIVPGQEVEEPTSNMDVFPTVARLAG-AELP 415

 Score = 50 (22.7 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFDI  DP E++ L   +E
Sbjct:   499 PLLFDIARDPRERHPLTPETE 519

 Score = 50 (22.7 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFDI  DP E++ L   +E
Sbjct:   499 PLLFDIARDPRERHPLTPETE 519

 Score = 39 (18.8 bits), Expect = 1.2e-23, Sum P(3) = 1.2e-23
 Identities = 8/34 (23%), Positives = 17/34 (50%)

Query:   628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC 661
             L   P+Q+ +S ++ + WL L +       + +C
Sbjct:   540 LEEAPNQLSMSNVAWKPWLQLCLPSKPHPLACRC 573


>TIGR_CMR|SPO_3286 [details] [associations]
            symbol:SPO_3286 "arylsulfatase" species:246200 "Ruegeria
            pomeroyi DSS-3" [GO:0004065 "arylsulfatase activity" evidence=ISS]
            [GO:0006790 "sulfur compound metabolic process" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:CP000031 GenomeReviews:CP000031_GR
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
            RefSeq:YP_168482.1 ProteinModelPortal:Q5LNC6 GeneID:3193868
            KEGG:sil:SPO3286 PATRIC:23380015 Uniprot:Q5LNC6
        Length = 535

 Score = 254 (94.5 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
 Identities = 76/213 (35%), Positives = 105/213 (49%)

Query:    57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
             S  P+II ILADDLG+ D+G  G  +I TPNID LA  G +L   Y    C P+R++++T
Sbjct:     2 SRKPNIILILADDLGFADLGCTG-SEIRTPNIDGLARDGALLTAMYNCARCCPTRASLLT 60

Query:   117 GKHPIHTGMQH-NVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLG--FYKKEY 171
             G +P + G+ H     G    RG L      + ++L+  GYRT + GKWH+G  F  +E 
Sbjct:    61 GLYPHNAGIGHMGADLGTPAYRGFLRNDCATIAEHLRAAGYRTCMSGKWHVGGDFMAREV 120

Query:   172 -----------TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
                        TP  RGF+   G   G   +F        M   D R +  P  D    Y
Sbjct:   121 DSWRVGDVDHPTPRQRGFDRFYGIVDGVTHFFSPHY----MLEDDTRVETFPD-DF---Y 172

Query:   221 STDVFTAEAVDIIHNH-STDEPLFLYLAHAATH 252
              TD  T +A+ ++      ++P FLYLAH A H
Sbjct:   173 FTDAITDKAIGMVEEAVEMEQPFFLYLAHTAPH 205

 Score = 85 (35.0 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
 Identities = 15/46 (32%), Positives = 32/46 (69%)

Query:   270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             H+  E  K + +AA++ ++D+S+G ++ AL++     N++I+F+SD
Sbjct:   264 HKDWEARKMATYAAMVDRMDQSIGTLLAALKRMGQFDNTLILFLSD 309

 Score = 63 (27.2 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
 Identities = 19/57 (33%), Positives = 26/57 (45%)

Query:   329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
             SN P R  K+ + EGG+    +   P   +  +      HV D LPT+L AA    I
Sbjct:   363 SNAPFRKFKHYVHEGGISTPLIAHWPGRIAAPVPLHAACHVVDILPTILEAAGAPPI 419

 Score = 38 (18.4 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query:   677 LFDIKNDPCEKNNLADRSEDQRINHYTTE 705
             L+DI+ D  E N+L  R E  R      E
Sbjct:   477 LYDIEADRTELNDLI-RGEPDRAKALVAE 504


>UNIPROTKB|F1NFQ0 [details] [associations]
            symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00587215
            Ensembl:ENSGALT00000026860 OMA:GHYKAVF ArrayExpress:F1NFQ0
            Uniprot:F1NFQ0
        Length = 590

 Score = 259 (96.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
 Identities = 57/135 (42%), Positives = 82/135 (60%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ + +LADDLG  DVG +G D I TPNID LA  G+ L  + T   LCTPSR+A++TG+
Sbjct:    35 PNFVLLLADDLGIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGR 94

Query:   119 HPIHTGMQ--HN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF---YKKE 170
             +PI +GM   +N   + +    GGLP +E    + L++ GY T ++GKWHLG    ++ +
Sbjct:    95 YPIRSGMDAVNNYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRND 154

Query:   171 YT--PTFRGFESHLG 183
             +   P   GFE   G
Sbjct:   155 HCHHPLNHGFEYFYG 169

 Score = 101 (40.6 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
 Identities = 42/199 (21%), Positives = 86/199 (43%)

Query:   221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             +T     E++  I  +   +P  L+L+   +H+     PL   + +L    H        
Sbjct:   268 TTSFILRESISFIERNK-HKPFLLFLSFLHSHT-----PLLTTEKFLGKSGH------GL 315

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX-XXXXXXXXXXSNWP--LRGVK 337
             +   + ++D  VG+V++A++++ +  N+++ F SD                W    RG K
Sbjct:   316 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGGWLERQEGKRQLGGWNGIYRGGK 375

Query:   338 NTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
                 WEGG+R  G+  W  +L + G V  +   + D  PT++  A    +P        +
Sbjct:   376 AMGGWEGGIRVPGIFRWPGVLPA-GKVINEPTSLMDIYPTVVHLAG-GVVPQDRVIDGRD 433

Query:   396 IIPRYENSILRYENGTHEY 414
             ++P  + ++   E+  H++
Sbjct:   434 LMPLLQGTV---EHSEHKF 449

 Score = 43 (20.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P L+D+  DP E   L+  +E
Sbjct:   508 PLLYDLSRDPSESQPLSADTE 528

 Score = 43 (20.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E   L+  +E
Sbjct:   508 PLLYDLSRDPSESQPLSADTE 528


>UNIPROTKB|G3N2T7 [details] [associations]
            symbol:ARSH "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:DAAA02075309
            EMBL:DAAA02075310 EMBL:DAAA02075311 Ensembl:ENSBTAT00000063647
            Uniprot:G3N2T7
        Length = 557

 Score = 229 (85.7 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
 Identities = 50/136 (36%), Positives = 78/136 (57%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct:     2 PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 61

Query:   119 HPIHTGM------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY--KKE 170
             +P+ +GM        +V++    GGLP +E    + L+  GYRT ++GKWH G     ++
Sbjct:    62 YPVRSGMASSSNLNRDVVWLGGSGGLPPNETTFAKLLQHRGYRTGLIGKWHQGLSCASRD 121

Query:   171 ---YTPTFRGFESHLG 183
                Y P   GF+   G
Sbjct:   122 DHCYHPLNHGFDYFYG 137

 Score = 124 (48.7 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
 Identities = 46/163 (28%), Positives = 77/163 (47%)

Query:   266 YLNIHRHI---EDFK-RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
             +L++H  +   E F   SKF      + ++D  VGKV+EAL++ R+ +++++ F SD   
Sbjct:   262 FLHVHTPLVTKEKFVGHSKFGLYGDNVEEMDWMVGKVLEALDRERLANHTLVYFTSDNGG 321

Query:   319 XXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
                          W    RG +    WEGG+R  G+  W  +LE+ G V ++   + D  
Sbjct:   322 RLEAQDRSGQLGGWNGRYRGGRGMAGWEGGIRVPGIFRWPTVLEA-GKVIDEPTSLMDIF 380

Query:   374 PTLLSAANKSDIPNYVNSTVE--NIIPRYENSILRYENGTHEY 414
             PTL        IP  +   ++  N++P  E  + R E   HE+
Sbjct:   381 PTLSYIGG--GIPP-LGRVIDGRNLMPLLEGRVSRSE---HEF 417

 Score = 50 (22.7 bits), Expect = 4.6e-05, Sum P(3) = 4.6e-05
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query:    12 GGLPLSEKILPQYLKELGYRTRIM 35
             GGLP +E    + L+  GYRT ++
Sbjct:    85 GGLPPNETTFAKLLQHRGYRTGLI 108

 Score = 48 (22.0 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
 Identities = 12/32 (37%), Positives = 14/32 (43%)

Query:   668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
             PC   +     P LFDI  DP E   L   +E
Sbjct:   464 PCSGDVTYHDPPLLFDISRDPSESRPLNPDNE 495

 Score = 48 (22.0 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
 Identities = 12/32 (37%), Positives = 14/32 (43%)

Query:   774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
             PC   +     P LFDI  DP E   L   +E
Sbjct:   464 PCSGDVTYHDPPLLFDISRDPSESRPLNPDNE 495


>UNIPROTKB|F1NFQ1 [details] [associations]
            symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00600266
            Ensembl:ENSGALT00000026858 ArrayExpress:F1NFQ1 Uniprot:F1NFQ1
        Length = 579

 Score = 259 (96.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
 Identities = 57/135 (42%), Positives = 82/135 (60%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ + +LADDLG  DVG +G D I TPNID LA  G+ L  + T   LCTPSR+A++TG+
Sbjct:    22 PNFVLLLADDLGIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGR 81

Query:   119 HPIHTGMQ--HN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF---YKKE 170
             +PI +GM   +N   + +    GGLP +E    + L++ GY T ++GKWHLG    ++ +
Sbjct:    82 YPIRSGMDAVNNYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRND 141

Query:   171 YT--PTFRGFESHLG 183
             +   P   GFE   G
Sbjct:   142 HCHHPLNHGFEYFYG 156

 Score = 96 (38.9 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
 Identities = 41/201 (20%), Positives = 87/201 (43%)

Query:   221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             +T     E++  I  +   +P  L+L+   +H+     PL   + +L    H        
Sbjct:   255 TTSFILRESISFIERNK-HKPFLLFLSFLHSHT-----PLLTTEKFLGKSGH------GL 302

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX-XXXXXXXXXXSNWP----LRG 335
             +   + ++D  VG+V++A++++ +  N+++ F SD                W     ++G
Sbjct:   303 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGGWLERQEGKRQLGGWNGIYRVKG 362

Query:   336 VKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTV 393
              K    WEGG+R  G+  W  +L + G V  +   + D  PT++  A    +P       
Sbjct:   363 GKAMGGWEGGIRVPGIFRWPGVLPA-GKVINEPTSLMDIYPTVVHLAG-GVVPQDRVIDG 420

Query:   394 ENIIPRYENSILRYENGTHEY 414
              +++P  + ++   E+  H++
Sbjct:   421 RDLMPLLQGTV---EHSEHKF 438

 Score = 43 (20.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P L+D+  DP E   L+  +E
Sbjct:   497 PLLYDLSRDPSESQPLSADTE 517

 Score = 43 (20.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E   L+  +E
Sbjct:   497 PLLYDLSRDPSESQPLSADTE 517


>UNIPROTKB|P54793 [details] [associations]
            symbol:ARSF "Arylsulfatase F" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
            OrthoDB:EOG4V4379 EMBL:X97868 EMBL:AC112653 EMBL:BC022389
            IPI:IPI00008405 PIR:A56217 RefSeq:NP_001188467.1
            RefSeq:NP_001188468.1 RefSeq:NP_004033.2 UniGene:Hs.101674
            ProteinModelPortal:P54793 SMR:P54793 IntAct:P54793 STRING:P54793
            PhosphoSite:P54793 DMDM:259016386 PaxDb:P54793 PRIDE:P54793
            Ensembl:ENST00000359361 Ensembl:ENST00000381127
            Ensembl:ENST00000537104 GeneID:416 KEGG:hsa:416 UCSC:uc004cre.2
            CTD:416 GeneCards:GC0XP002978 H-InvDB:HIX0016636 HGNC:HGNC:721
            HPA:HPA000549 MIM:300003 neXtProt:NX_P54793 PharmGKB:PA25012
            InParanoid:P54793 OMA:LKPCCGV PhylomeDB:P54793 GenomeRNAi:416
            NextBio:1759 Bgee:P54793 CleanEx:HS_ARSF Genevestigator:P54793
            GermOnline:ENSG00000062096 Uniprot:P54793
        Length = 590

 Score = 233 (87.1 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 49/112 (43%), Positives = 69/112 (61%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ I+ DDLG  D+G +G D + TP+ID LA  G+ L  + +   LC+PSRSA +TG+
Sbjct:    30 PNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGR 89

Query:   119 HPIHTGM----QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +PI +GM       V+       GLPL+E  L   LK+ GY T ++GKWH G
Sbjct:    90 YPIRSGMVSSGNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQG 141

 Score = 118 (46.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 42/195 (21%), Positives = 82/195 (42%)

Query:   224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
             +   EA+  +  HS  E   L+ +    H+     PL   D +    +H        +  
Sbjct:   266 IMVKEAISFLERHSK-ETFLLFFSFLHVHT-----PLPTTDDFTGTSKH------GLYGD 313

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL 340
              + ++D  VGK+++A++   + +N+++ F SD                W    +G K   
Sbjct:   314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMG 373

Query:   341 -WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
              WEGG+R  G++  P     G + ++   + D LPT+ S +  S +P        +++P 
Sbjct:   374 GWEGGIRVPGIVRWPGKVPAGRLIKEPTSLMDILPTVASVSGGS-LPQDRVIDGRDLMPL 432

Query:   400 YENSILRYENGTHEY 414
              + ++   E   HE+
Sbjct:   433 LQGNVRHSE---HEF 444

 Score = 52 (23.4 bits), Expect = 0.00020, Sum P(3) = 0.00020
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query:    13 GLPLSEKILPQYLKELGYRTRIM 35
             GLPL+E  L   LK+ GY T ++
Sbjct:   113 GLPLNETTLAALLKKQGYSTGLI 135

 Score = 47 (21.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E   L   +E
Sbjct:   504 PLLFDLSRDPSESTPLTPATE 524

 Score = 47 (21.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 9/21 (42%), Positives = 11/21 (52%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E   L   +E
Sbjct:   504 PLLFDLSRDPSESTPLTPATE 524


>UNIPROTKB|F5GYY5 [details] [associations]
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 OMA:PVINRCA IPI:IPI00020005 ProteinModelPortal:F5GYY5
            SMR:F5GYY5 Ensembl:ENST00000545496 UCSC:uc011mhh.2
            ArrayExpress:F5GYY5 Bgee:F5GYY5 Uniprot:F5GYY5
        Length = 614

 Score = 260 (96.6 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
 Identities = 71/197 (36%), Positives = 107/197 (54%)

Query:     5 VLYGCERGGLPLSEKILPQYLKE--LGYRTRI-MAF-AVLP--LAFTLSMV-FVDLVASS 57
             V+  C  G L L   +LPQ   E  + +  +I + F + LP  LA  LS+        S+
Sbjct:     4 VINRCAPGSLDL---MLPQAASEGIVFHSLQISLCFRSWLPAMLAVLLSLAPSASSDISA 60

Query:    58 GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMT 116
               P+I+ ++ADDLG  D+G +G + + TPNID LA  G+ L  + +   LCTPSR+A +T
Sbjct:    61 SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 120

Query:   117 GKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE- 170
             G++P+ +GM  ++ Y   +     GGLP +E    + LKE GY T ++GKWHLG   +  
Sbjct:   121 GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 180

Query:   171 ----YTPTFRGFESHLG 183
                 + P   GF+   G
Sbjct:   181 SDHCHHPLHHGFDHFYG 197

 Score = 85 (35.0 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
 Identities = 25/107 (23%), Positives = 50/107 (46%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
             + ++D  VG++++ L+   + ++++I F SD                W    +G K    
Sbjct:   348 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 407

Query:   341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             WEGG+R  G+  W  +L +  ++ E    + D  PT++  A   ++P
Sbjct:   408 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 452

 Score = 49 (22.3 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E + L   SE
Sbjct:   536 PLLFDLSRDPSETHILTPASE 556

 Score = 49 (22.3 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E + L   SE
Sbjct:   536 PLLFDLSRDPSETHILTPASE 556


>MGI|MGI:98438 [details] [associations]
            symbol:Sts "steroid sulfatase" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0004773
            "steryl-sulfatase activity" evidence=ISO] [GO:0005635 "nuclear
            envelope" evidence=ISO] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005789 "endoplasmic reticulum membrane"
            evidence=ISO] [GO:0006629 "lipid metabolic process" evidence=IEA]
            [GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning
            or memory" evidence=ISO] [GO:0008152 "metabolic process"
            evidence=ISO] [GO:0008202 "steroid metabolic process" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISO] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISO] [GO:0009268 "response to pH" evidence=ISO]
            [GO:0014070 "response to organic cyclic compound" evidence=ISO]
            [GO:0016020 "membrane" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=ISO] [GO:0043627 "response to estrogen stimulus"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 MGI:MGI:98438 GO:GO:0016021 GO:GO:0005789
            GO:GO:0008202 GO:GO:0046872 GO:GO:0007565 Gene3D:3.40.720.10
            SUPFAM:SSF53649 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 CTD:412 KO:K01131
            GO:GO:0004773 EMBL:U37545 IPI:IPI00118038 RefSeq:NP_033319.1
            UniGene:Mm.423011 ProteinModelPortal:P50427 SMR:P50427
            PhosphoSite:P50427 PRIDE:P50427 GeneID:20905 KEGG:mmu:20905
            NextBio:299773 CleanEx:MM_STS Genevestigator:P50427 Uniprot:P50427
        Length = 624

 Score = 243 (90.6 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
 Identities = 55/136 (40%), Positives = 77/136 (56%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
             PP+ + I+ADDLG  D+G +G   + TP++D LA  G+ L  +     LCTPSR+A +TG
Sbjct:    34 PPNFLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTG 93

Query:   118 KHPIHTGMQ-HN----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
             ++P  +GM  H      L+    GGLP SE  + + LK  GY T ++GKWHLG   +  T
Sbjct:    94 RYPPRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGAT 153

Query:   173 -----PTFRGFESHLG 183
                  P   GF+  LG
Sbjct:   154 DFCHHPLRHGFDRFLG 169

 Score = 102 (41.0 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
 Identities = 44/161 (27%), Positives = 69/161 (42%)

Query:   266 YLNIHR-HIED--FK-RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
             +L++H  H  D  F  RS   A    + ++D  VG+V+ AL++  +   +++ F SD   
Sbjct:   294 FLHVHTAHFADPGFAGRSLHGAYGDSVEEMDWGVGRVLAALDELGLARETLVYFTSDHGA 353

Query:   319 XXXXXX----XXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
                           SN   RG K   WEGGVR   L+ W   L    +VAE    + D  
Sbjct:   354 HVEELGPRGERMGGSNGVFRGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEP-TSLMDVF 412

Query:   374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
             PT+   A  +++P        +++P       R E   HE+
Sbjct:   413 PTVARLAG-AELPGDRVIDGRDLMPLLRGDAQRSE---HEF 449

 Score = 49 (22.3 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
 Identities = 12/34 (35%), Positives = 16/34 (47%)

Query:   662 GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 695
             GP      +P   P LFD+  DP E+  L   +E
Sbjct:   497 GPAHVTAHDP---PLLFDLTRDPGERRPLTPEAE 527

 Score = 49 (22.3 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
 Identities = 12/34 (35%), Positives = 16/34 (47%)

Query:   768 GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 801
             GP      +P   P LFD+  DP E+  L   +E
Sbjct:   497 GPAHVTAHDP---PLLFDLTRDPGERRPLTPEAE 527


>TIGR_CMR|CPS_2364 [details] [associations]
            symbol:CPS_2364 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0008484 RefSeq:YP_269082.1
            ProteinModelPortal:Q482D6 STRING:Q482D6 GeneID:3521400
            KEGG:cps:CPS_2364 PATRIC:21467813 OMA:MEIAVIN
            BioCyc:CPSY167879:GI48-2427-MONOMER Uniprot:Q482D6
        Length = 492

 Score = 295 (108.9 bits), Expect = 3.3e-23, Sum P(2) = 3.3e-23
 Identities = 91/305 (29%), Positives = 143/305 (46%)

Query:    35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
             + F+ L L FT S        S+  P+++ +L DD G  D+  +G +   TPNID LA  
Sbjct:     7 LLFSGLSL-FTCSQAVATPDKSTSKPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAAD 65

Query:    95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
             G+   N Y     C PSR AI +G +P   G+      G  +  LPLS     ++LKE G
Sbjct:    66 GMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQGERVG--KHHLPLSAVTFGEHLKEAG 123

Query:   154 YRTRIVGKWHLGFYKKEYTPTFRGFESHL--GYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
             Y+T  +GKWHLG  K+   PT +GF+S +  G+W     Y+     +M   G +  +   
Sbjct:   124 YQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYY-FPYTKMSKSGKN--KGFA 178

Query:   212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
                    +Y TD  T EA+  I     D+P  L LAH A H+    +P     +   + +
Sbjct:   179 KVEGSEEEYLTDRLTDEALTFIEQKK-DQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKK 237

Query:   272 H-------------IED---FKRS-----KFAAILHKLDESVGKVVEALEQRRMLSNSII 310
                           I+D   + ++      +AA++  +D SVG++ + L++  +  N+II
Sbjct:   238 LGIANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTII 297

Query:   311 VFVSD 315
             +  SD
Sbjct:   298 ILTSD 302

 Score = 45 (20.9 bits), Expect = 3.3e-23, Sum P(2) = 3.3e-23
 Identities = 17/71 (23%), Positives = 30/71 (42%)

Query:   511 DGIDVWSVLSRNEPSKRNTILHNI------DDEWQISALTRGKWKLVKENSINGNGTSEN 564
             DG+   + L+ +E  ++    H+         +   SA+  G+WKL+   S  G     N
Sbjct:   382 DGVSYLAALNSDETPRKAMFWHSPAARPSKTGDTNSSAIIEGEWKLLDFWS-TGKVELYN 440

Query:   565 RSNDNSYQNEI 575
               +D S  N +
Sbjct:   441 LKDDKSEANNL 451

 Score = 44 (20.5 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   677 LFDIKNDPCEKNNLA 691
             L+++K+D  E NNLA
Sbjct:   438 LYNLKDDKSEANNLA 452

 Score = 44 (20.5 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
 Identities = 8/15 (53%), Positives = 12/15 (80%)

Query:   783 LFDIKNDPCEKNNLA 797
             L+++K+D  E NNLA
Sbjct:   438 LYNLKDDKSEANNLA 452

 Score = 42 (19.8 bits), Expect = 6.7e-23, Sum P(2) = 6.7e-23
 Identities = 17/72 (23%), Positives = 33/72 (45%)

Query:   534 IDDEWQISAL-TRGKWKL--VKENSINGNGTSENRSNDNSYQ-----NEIDGIDVWSVLS 585
             I+ EW++    + GK +L  +K++    N  ++      +       N  D ID  +V  
Sbjct:   421 IEGEWKLLDFWSTGKVELYNLKDDKSEANNLAKLMPEKTAEMLAKLTNWKDDIDAHTVKK 480

Query:   586 RNEPSKRNTILH 597
             +N+ SK+ +  H
Sbjct:   481 KNKKSKKKSKSH 492

 Score = 39 (18.8 bits), Expect = 1.4e-22, Sum P(2) = 1.4e-22
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query:   511 DGIDVWSVLSRNEPSKRNTILH 532
             D ID  +V  +N+ SK+ +  H
Sbjct:   471 DDIDAHTVKKKNKKSKKKSKSH 492


>UNIPROTKB|P51690 [details] [associations]
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005795 "Golgi
            stack" evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=TAS] [GO:0004065 "arylsulfatase activity" evidence=TAS]
            [GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0044281
            GO:GO:0046872 GO:GO:0006644 GO:GO:0005795 GO:GO:0005788
            GO:GO:0001501 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOVERGEN:HBG004283 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687
            KO:K12374 OrthoDB:EOG4V4379 EMBL:X83573 EMBL:AK223183 EMBL:AK223199
            IPI:IPI01014058 PIR:I37187 RefSeq:NP_000038.2 UniGene:Hs.386975
            ProteinModelPortal:P51690 SMR:P51690 IntAct:P51690
            MINT:MINT-1382153 STRING:P51690 PhosphoSite:P51690 DMDM:77416850
            PaxDb:P51690 PRIDE:P51690 DNASU:415 Ensembl:ENST00000381134
            GeneID:415 KEGG:hsa:415 UCSC:uc004crc.4 CTD:415
            GeneCards:GC0XM002846 HGNC:HGNC:719 MIM:300180 MIM:302950
            neXtProt:NX_P51690 Orphanet:79345 PharmGKB:PA25010
            InParanoid:P51690 GenomeRNAi:415 NextBio:1755 ArrayExpress:P51690
            Bgee:P51690 CleanEx:HS_ARSE Genevestigator:P51690
            GermOnline:ENSG00000157399 Uniprot:P51690
        Length = 589

 Score = 254 (94.5 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
 Identities = 54/139 (38%), Positives = 82/139 (58%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
             S+  P+I+ ++ADDLG  D+G +G + + TPNID LA  G+ L  + +   LCTPSR+A 
Sbjct:    34 SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93

Query:   115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
             +TG++P+ +GM  ++ Y   +     GGLP +E    + LKE GY T ++GKWHLG   +
Sbjct:    94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153

Query:   170 E-----YTPTFRGFESHLG 183
                   + P   GF+   G
Sbjct:   154 SASDHCHHPLHHGFDHFYG 172

 Score = 85 (35.0 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
 Identities = 25/107 (23%), Positives = 50/107 (46%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
             + ++D  VG++++ L+   + ++++I F SD                W    +G K    
Sbjct:   323 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 382

Query:   341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             WEGG+R  G+  W  +L +  ++ E    + D  PT++  A   ++P
Sbjct:   383 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 427

 Score = 49 (22.3 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E + L   SE
Sbjct:   511 PLLFDLSRDPSETHILTPASE 531

 Score = 49 (22.3 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E + L   SE
Sbjct:   511 PLLFDLSRDPSETHILTPASE 531


>UNIPROTKB|Q32KH8 [details] [associations]
            symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
            familiaris" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0016021 GO:GO:0046872 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12374 OrthoDB:EOG4V4379
            EMBL:AAEX01047377 EMBL:BN000759 RefSeq:NP_001041588.1
            UniGene:Cfa.39079 HSSP:P15289 ProteinModelPortal:Q32KH8 SMR:Q32KH8
            GeneID:491720 KEGG:cfa:491720 CTD:347527 InParanoid:Q32KH8
            NextBio:20864464 Uniprot:Q32KH8
        Length = 562

 Score = 232 (86.7 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
 Identities = 52/136 (38%), Positives = 78/136 (57%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct:     7 PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66

Query:   119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KK 169
             +PI +GM   +N+  G       GGLP +E    + L+  GYRT ++GKWH G     + 
Sbjct:    67 YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126

Query:   170 E--YTPTFRGFESHLG 183
             +  Y P   GF+   G
Sbjct:   127 DHCYHPLNHGFDYFYG 142

 Score = 109 (43.4 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
 Identities = 47/192 (24%), Positives = 82/192 (42%)

Query:   228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
             EA+  I  +    P  L+++    H+     PL   D ++  H      K   +   + +
Sbjct:   248 EALAFIDRYKRG-PFLLFVSFLHVHT-----PLITKDKFVG-HS-----KYGLYGDNVEE 295

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX---SNWPLRGVKNTL-WEG 343
             +D  VGK++E L+Q R+ +++++ F SD                SN   +G +    WEG
Sbjct:   296 MDWMVGKILETLDQERLTNHTLVYFTSDNGGRLEVQEGEVQLGGSNGIYKGGQGMGGWEG 355

Query:   344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
             G+R  G+  W  +L++ G V  +   + D  PTL S      +P        N++P  E 
Sbjct:   356 GIRVPGIFRWPTVLQA-GKVINEPTSLMDIYPTL-SYIGGGMLPQDRVIDGRNLMPLLEG 413

Query:   403 SILRYENGTHEY 414
                R  +  HE+
Sbjct:   414 ---RVSHSDHEF 422

 Score = 46 (21.3 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query:   668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
             PC   +     P LFD+  DP E   L   +E
Sbjct:   469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500

 Score = 46 (21.3 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query:   774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
             PC   +     P LFD+  DP E   L   +E
Sbjct:   469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500


>POMBASE|SPBPB10D8.02c [details] [associations]
            symbol:SPBPB10D8.02c "arylsulfatase (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0004065 "arylsulfatase
            activity" evidence=ISS] [GO:0005634 "nucleus" evidence=IDA]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PomBase:SPBPB10D8.02c GO:GO:0005829
            GO:GO:0005634 GO:GO:0046872 EMBL:CU329671 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006790 KO:K01130
            RefSeq:NP_595046.1 HSSP:P51691 ProteinModelPortal:Q9C0V7 SMR:Q9C0V7
            STRING:Q9C0V7 EnsemblFungi:SPBPB10D8.02c.1 GeneID:2541396
            KEGG:spo:SPBPB10D8.02c HOGENOM:HOG000135353 OMA:IEWTNIS
            OrthoDB:EOG4DJP4T NextBio:20802503 Uniprot:Q9C0V7
        Length = 554

 Score = 233 (87.1 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
 Identities = 70/232 (30%), Positives = 113/232 (48%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A S  P+ + I+ADDLGW+DV   G  +I TPNI+ LA  G+ L N++T   C+P+RS +
Sbjct:     7 AESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSPTRSML 65

Query:   115 MTG--KHPIHTGMQHNVLYGCER--GGLP-----LSEKI--LPQYLKELGYRTRIVGKWH 163
             ++G   H    G     +    +  GG P     L++++  LP+ L+E GY T + GKWH
Sbjct:    66 LSGTDNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTTMSGKWH 125

Query:   164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WD-LHGKYS 221
             LG     Y P+ RGF+       G  ++F +     +   +     L     D +  K  
Sbjct:   126 LGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDPVDHKSL 184

Query:   222 TDVFTAE--AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
              + +++   A  +I      E    + A+    +  P+ PLQ+P  Y+N +R
Sbjct:   185 KNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTA--PHWPLQSPKEYINKYR 234

 Score = 84 (34.6 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
 Identities = 25/79 (31%), Positives = 40/79 (50%)

Query:   332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
             P R  K  + EGG+R   +I  P L    I+++++V V D LPT+L  A     P +   
Sbjct:   377 PSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTILELAEVPH-PGHKFQ 435

Query:   392 TVENIIPRYENSILRYENG 410
               + +IPR +  I  + +G
Sbjct:   436 GRDVVIPRGKPWIDHFVHG 454

 Score = 68 (29.0 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
 Identities = 12/35 (34%), Positives = 25/35 (71%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +AA++  LD ++G+V++ L+    L N+ ++F+SD
Sbjct:   293 YAAMVELLDLNIGRVIDYLKTIGELDNTFVIFMSD 327

 Score = 37 (18.1 bits), Expect = 5.9e-15, Sum P(2) = 5.9e-15
 Identities = 7/13 (53%), Positives = 7/13 (53%)

Query:   432 EYNPKYENRYENG 444
             EY  KY  RY  G
Sbjct:   228 EYINKYRGRYSEG 240

 Score = 37 (18.1 bits), Expect = 5.9e-15, Sum P(2) = 5.9e-15
 Identities = 7/13 (53%), Positives = 7/13 (53%)

Query:   447 EYNPKYENRYENG 459
             EY  KY  RY  G
Sbjct:   228 EYINKYRGRYSEG 240


>TIGR_CMR|CPS_0660 [details] [associations]
            symbol:CPS_0660 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_267410.1 ProteinModelPortal:Q488V4 STRING:Q488V4
            GeneID:3519819 KEGG:cps:CPS_0660 PATRIC:21464645 OMA:NISAYTH
            BioCyc:CPSY167879:GI48-747-MONOMER Uniprot:Q488V4
        Length = 525

 Score = 293 (108.2 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
 Identities = 100/347 (28%), Positives = 159/347 (45%)

Query:    60 PHIIFILADDLGWNDVGF--HGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
             P+I+ I  DD+G +++    HG+    T NID +A  G++  +YY    CT  R+A +TG
Sbjct:    39 PNILAIWGDDIGQSNISAYTHGMMGYKTTNIDRIAKEGVLFTDYYGENSCTAGRAAFITG 98

Query:   118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
             ++P+ TG+    L G ++G L   +  + + LK+ GY T   GK HLG  K E+ PT  G
Sbjct:    99 QYPVRTGLTKVGLPGSDKG-LRAEDVTIAELLKDRGYVTGQFGKNHLGD-KDEFLPTNHG 156

Query:   178 FESHLG--YWTG------HQDYFDHSAEEMKMWG----LDMRRD--LEPAWDLHGK-YST 222
             F+  LG  Y         H DY    A + K +G    +    D  +E +  L  K   T
Sbjct:   157 FDEFLGNLYHLNAEEEPEHPDYPKDQAYK-KRFGPRGVIHSFADGKIEDSGPLTKKRMET 215

Query:   223 --DVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
               D F A     I   H  ++P F++      H    +  L+     L+    I      
Sbjct:   216 IDDEFLAATTKFIDKAHKNNKPFFVWFNATRMHI---WTHLKEESKGLSKRGGI------ 266

Query:   280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNT 339
              +   + + D  VG +++ L++  +  N+I+++ +D                P +G KNT
Sbjct:   267 -YGDGMMEHDYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPDGGTI--PFKGEKNT 323

Query:   340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
              WEGG R   ++ W   + +     E   H+ DW PTLL+AA  +DI
Sbjct:   324 TWEGGFRVPAMVRWPGKITAGDAKIEMVSHM-DWAPTLLAAAGVTDI 369

 Score = 43 (20.2 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
 Identities = 19/67 (28%), Positives = 29/67 (43%)

Query:   628 LTGGPDQV----YLSGLSDREWLALAMRKLRDAASIQ-CGPVKE--VPCEPQIAPCLFDI 680
             LTG  D+     YL      +  A+    ++   SIQ C  +     P  P  AP L ++
Sbjct:   397 LTGATDEAPRPSYLYFTDGGDLSAVRFGDMKLQYSIQECEGLNVWICPLTPLRAPLLTNL 456

Query:   681 KNDPCEK 687
             + DP E+
Sbjct:   457 RQDPYER 463

 Score = 42 (19.8 bits), Expect = 2.1e-22, Sum P(2) = 2.1e-22
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query:   774 PCEPQIAPCLFDIKNDPCEK 793
             P  P  AP L +++ DP E+
Sbjct:   444 PLTPLRAPLLTNLRQDPYER 463


>RGD|1306571 [details] [associations]
            symbol:Arsg "arylsulfatase G" species:10116 "Rattus norvegicus"
            [GO:0004065 "arylsulfatase activity" evidence=ISO;ISS] [GO:0005615
            "extracellular space" evidence=IEA;ISO] [GO:0005764 "lysosome"
            evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA;ISO] [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1306571 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ
            GO:GO:0006790 EMBL:AABR03073953 EMBL:AABR03074766 EMBL:AABR03075952
            EMBL:AABR03076519 EMBL:AABR03076696 EMBL:BN000738 IPI:IPI00361303
            RefSeq:NP_001041342.1 UniGene:Rn.221856 ProteinModelPortal:Q32KJ9
            PRIDE:Q32KJ9 Ensembl:ENSRNOT00000005257 GeneID:303631
            KEGG:rno:303631 InParanoid:Q32KJ9 OMA:WHYPHYS NextBio:651782
            Genevestigator:Q32KJ9 GermOnline:ENSRNOG00000003931 Uniprot:Q32KJ9
        Length = 526

 Score = 264 (98.0 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 54/139 (38%), Positives = 84/139 (60%)

Query:    50 FVDLVASS---GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV- 104
             FVD   S     P P+I+ ILADD+GW D+G +  +   T N+D +A  G+   +++   
Sbjct:    22 FVDFSISGETRAPRPNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAA 81

Query:   105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
               C+PSR++++TG+  +  G+ HN       GGLPL+E  L + L++ GY T ++GKWHL
Sbjct:    82 STCSPSRASLLTGRLGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTAMIGKWHL 140

Query:   165 GFYKKEYTPTFRGFESHLG 183
             G +   Y P+FRGF+ + G
Sbjct:   141 GHHGS-YHPSFRGFDYYFG 158

 Score = 72 (30.4 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 43/177 (24%), Positives = 67/177 (37%)

Query:   225 FTAEAVDIIHNHSTD-EPLFLYLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             +   AV+ I   ST   P  LY+  A  H   S  P  PL  P     ++R         
Sbjct:   223 YAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--PLANPQSQ-RLYR--------- 270

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV---- 336
               A L ++D  VG++ + ++      N+++ F  D             S  P  G+    
Sbjct:   271 --ASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNGPWAQKCELAG-SMGPFSGLWQTH 326

Query:   337 ------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
                   K T WEGG R   L + P      + +   + + D  PT+++ A  S  PN
Sbjct:   327 QGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAGASLPPN 383

 Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query:   774 PCEPQIAPCLFDIKNDPCEKNNLADRS-EVQRINHYTTEV 812
             P +  ++P +F++++D  E + L   S E Q +    T V
Sbjct:   446 PEQHHVSPLIFNLEDDAAESSPLQKGSPEYQELLPKVTRV 485

 Score = 41 (19.5 bits), Expect = 2.9e-22, Sum P(3) = 2.9e-22
 Identities = 11/40 (27%), Positives = 21/40 (52%)

Query:   668 PCEPQIAPCLFDIKNDPCEKNNLADRS-EDQRINHYTTEV 706
             P +  ++P +F++++D  E + L   S E Q +    T V
Sbjct:   446 PEQHHVSPLIFNLEDDAAESSPLQKGSPEYQELLPKVTRV 485

 Score = 39 (18.8 bits), Expect = 4.7e-22, Sum P(3) = 4.7e-22
 Identities = 26/130 (20%), Positives = 48/130 (36%)

Query:   574 EIDGIDVWSVLSRNEPSKRNTILH-NIDDEWQISALTXXXXXXXXXXXXMRYQVDLTGG- 631
             + DG+DV  VL     +    + H N     +  AL                     GG 
Sbjct:   385 KFDGVDVSEVLFGKSQTGHRVLFHPNSGAAGEYGALQTVRLDRYKAFYITGGAKACDGGV 444

Query:   632 -PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
              P+Q ++S L     +        +++ +Q G  +     P++   L D+  D  + N+ 
Sbjct:   445 GPEQHHVSPL-----IFNLEDDAAESSPLQKGSPEYQELLPKVTRVLADVLQDIADDNSS 499

Query:   691 -ADRSEDQRI 699
              AD ++D  +
Sbjct:   500 QADYTQDPSV 509


>UNIPROTKB|Q5FYA8 [details] [associations]
            symbol:ARSH "Arylsulfatase H" species:9606 "Homo sapiens"
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0016021 GO:GO:0044281 GO:GO:0046872
            GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
            OrthoDB:EOG4V4379 CTD:347527 EMBL:AY875940 IPI:IPI00233062
            RefSeq:NP_001011719.1 UniGene:Hs.351533 HSSP:P08842
            ProteinModelPortal:Q5FYA8 SMR:Q5FYA8 STRING:Q5FYA8 DMDM:74722579
            PRIDE:Q5FYA8 DNASU:347527 Ensembl:ENST00000381130 GeneID:347527
            KEGG:hsa:347527 UCSC:uc011mhj.2 GeneCards:GC0XP002919
            HGNC:HGNC:32488 HPA:HPA050011 MIM:300586 neXtProt:NX_Q5FYA8
            PharmGKB:PA143485308 InParanoid:Q5FYA8 OMA:ATVWKVH
            GenomeRNAi:347527 NextBio:99177 Bgee:Q5FYA8 CleanEx:HS_ARSH
            Genevestigator:Q5FYA8 Uniprot:Q5FYA8
        Length = 562

 Score = 230 (86.0 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 48/115 (41%), Positives = 69/115 (60%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct:     7 PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 66

Query:   119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +PI +GM     Y   R        GGLP +E    + L+  GYRT ++GKWHLG
Sbjct:    67 YPIRSGMVS--AYNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLG 119

 Score = 112 (44.5 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 37/131 (28%), Positives = 62/131 (47%)

Query:   258 EPLQAPDHYLNIHRHIEDFK----RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSII 310
             EP      +L++H  +   K    RSK+      + ++D  VGK+++AL+Q R+ +++++
Sbjct:   259 EPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKILDALDQERLANHTLV 318

Query:   311 VFVSDXXXXXX-XXXXXXXSNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQ 365
              F SD                W    +G K    WEGG+R  G+  W  +LE+ G V  +
Sbjct:   319 YFTSDNGGHLEPLDGAVQLGGWNGIYKGGKGMGGWEGGIRVPGIFRWPSVLEA-GRVINE 377

Query:   366 YVHVSDWLPTL 376
                + D  PTL
Sbjct:   378 PTSLMDIYPTL 388

 Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 8/12 (66%), Positives = 8/12 (66%)

Query:   675 PCLFDIKNDPCE 686
             P LFDI  DP E
Sbjct:   480 PLLFDISRDPSE 491

 Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 8/12 (66%), Positives = 8/12 (66%)

Query:   781 PCLFDIKNDPCE 792
             P LFDI  DP E
Sbjct:   480 PLLFDISRDPSE 491


>UNIPROTKB|F1PY85 [details] [associations]
            symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
            familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:AAEX03026108
            Ensembl:ENSCAFT00000017754 Uniprot:F1PY85
        Length = 562

 Score = 232 (86.7 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
 Identities = 52/136 (38%), Positives = 78/136 (57%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ADDLG  D+  +G + + TPNID LA  G+ L  +     +CTPSR+A +TG+
Sbjct:     7 PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66

Query:   119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KK 169
             +PI +GM   +N+  G       GGLP +E    + L+  GYRT ++GKWH G     + 
Sbjct:    67 YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126

Query:   170 E--YTPTFRGFESHLG 183
             +  Y P   GF+   G
Sbjct:   127 DHCYHPLNHGFDYFYG 142

 Score = 106 (42.4 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
 Identities = 46/192 (23%), Positives = 82/192 (42%)

Query:   228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
             EA+  I  +    P  L+++    H+     PL   D ++  H      K   +   + +
Sbjct:   248 EALAFIDRYKRG-PFLLFVSFLHVHT-----PLITKDKFVG-HS-----KYGLYGDNVEE 295

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX---SNWPLRGVKNTL-WEG 343
             +D  VG+++E L+Q R+ +++++ F SD                SN   +G +    WEG
Sbjct:   296 MDWMVGRILETLDQERLTNHTLVYFTSDNGGRLEVQEGEVQLGGSNGIYKGGQGMGGWEG 355

Query:   344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
             G+R  G+  W  +L++ G V  +   + D  PTL S      +P        N++P  E 
Sbjct:   356 GIRVPGIFRWPTVLQA-GKVINEPTSLMDIYPTL-SYIGGGMLPQDRVIDGRNLMPLLEG 413

Query:   403 SILRYENGTHEY 414
                R  +  HE+
Sbjct:   414 ---RVSHSDHEF 422

 Score = 46 (21.3 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query:   668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
             PC   +     P LFD+  DP E   L   +E
Sbjct:   469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500

 Score = 46 (21.3 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query:   774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
             PC   +     P LFD+  DP E   L   +E
Sbjct:   469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500


>UNIPROTKB|Q32KI1 [details] [associations]
            symbol:arse "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0004065 "arylsulfatase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 KO:K12374 OrthoDB:EOG4V4379 CTD:415
            EMBL:AAEX03026107 OMA:VCFQIMA EMBL:BN000756 RefSeq:NP_001041587.1
            UniGene:Cfa.28960 SMR:Q32KI1 STRING:Q32KI1
            Ensembl:ENSCAFT00000045735 GeneID:491719 KEGG:cfa:491719
            InParanoid:Q32KI1 NextBio:20864462 Uniprot:Q32KI1
        Length = 585

 Score = 244 (91.0 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
 Identities = 49/116 (42%), Positives = 73/116 (62%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
             S   P+I+ ++ADD G  D+G +G + I TPNID LA  G++L  +     +CTPSR+A 
Sbjct:    30 SGSRPNILLLMADDFGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAF 89

Query:   115 MTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +TG++P+ +GM     + VL +    GGLP +E    + LK+ GY T ++GKWHLG
Sbjct:    90 LTGRYPLRSGMVSSNGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLG 145

 Score = 88 (36.0 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
 Identities = 34/145 (23%), Positives = 65/145 (44%)

Query:   255 NPYEPLQAPDHYLNIHRHI---EDFKRSKFAAILH-----KLDESVGKVVEALEQRRMLS 306
             N + P      +L++H  +   E F R K A  L+     ++D  VG++++ L+   + +
Sbjct:   282 NKHRPFLLFVSFLHVHTPLITTEKF-RGKSAHGLYGDNTEEMDWMVGQILDTLDMEGLTN 340

Query:   307 NSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGI 361
             ++++ F SD                W    +G K    WEGG+R  G+  W  +L++ G 
Sbjct:   341 STLVYFTSDHGGSLEAQLGKEQYGGWNGIYKGGKGMGGWEGGIRVPGIFRWPGVLQA-GR 399

Query:   362 VAEQYVHVSDWLPTLLSAANKSDIP 386
             V  +   + D  PT++      ++P
Sbjct:   400 VIHEPTSLMDVFPTVVQLGG-GEVP 423

 Score = 50 (22.7 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
 Identities = 16/46 (34%), Positives = 20/46 (43%)

Query:   668 PCE-PQIA----PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
             PC   Q+A    P LFD+  DP E + L   +E     H    V R
Sbjct:   495 PCSGDQVAHHDPPLLFDLSRDPSEAHALTPDTEPS-FYHVMDTVAR 539

 Score = 49 (22.3 bits), Expect = 4.5e-22, Sum P(3) = 4.5e-22
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:   774 PCE-PQIA----PCLFDIKNDPCEKNNLADRSE 801
             PC   Q+A    P LFD+  DP E + L   +E
Sbjct:   495 PCSGDQVAHHDPPLLFDLSRDPSEAHALTPDTE 527


>MGI|MGI:1921258 [details] [associations]
            symbol:Arsg "arylsulfatase G" species:10090 "Mus musculus"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0006790
            "sulfur compound metabolic process" evidence=ISO] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 MGI:MGI:1921258 GO:GO:0005783 GO:GO:0005615
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ GO:GO:0006790 EMBL:AK018132
            EMBL:AK158726 EMBL:AL645791 EMBL:BC022158 EMBL:BC039629
            EMBL:BC084731 EMBL:AK173082 EMBL:BN000747 IPI:IPI00135805
            IPI:IPI00648999 RefSeq:NP_001159649.1 RefSeq:NP_082986.3
            UniGene:Mm.482224 ProteinModelPortal:Q3TYD4 SMR:Q3TYD4
            STRING:Q3TYD4 PaxDb:Q3TYD4 PRIDE:Q3TYD4 Ensembl:ENSMUST00000020928
            Ensembl:ENSMUST00000106696 Ensembl:ENSMUST00000106697 GeneID:74008
            KEGG:mmu:74008 UCSC:uc007mcn.1 UCSC:uc007mcp.2 InParanoid:B1AT67
            OMA:GNTFLWF NextBio:339520 Bgee:Q3TYD4 CleanEx:MM_ARSG
            Genevestigator:Q3TYD4 GermOnline:ENSMUSG00000020604 Uniprot:Q3TYD4
        Length = 525

 Score = 257 (95.5 bits), Expect = 3.8e-22, Sum P(2) = 3.8e-22
 Identities = 48/125 (38%), Positives = 78/125 (62%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct:    36 PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               +  G+ HN       GGLP++E  L + L++ GY T ++GKWHLG +   Y P FRGF
Sbjct:    96 LGLRNGVTHNFAV-TSVGGLPVNETTLAEVLRQEGYVTAMIGKWHLGHHGS-YHPNFRGF 153

Query:   179 ESHLG 183
             + + G
Sbjct:   154 DYYFG 158

 Score = 79 (32.9 bits), Expect = 3.8e-22, Sum P(2) = 3.8e-22
 Identities = 42/176 (23%), Positives = 67/176 (38%)

Query:   225 FTAEAVDIIHNHSTD-EPLFLYLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSK 280
             +   AV+ I   ST   P  LY+  A  H   S  P  PL  P             ++S 
Sbjct:   223 YAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--PLAHPQ------------RQSL 268

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN-----WPL-R 334
             + A L ++D  VG++ + ++      N+++ F  D                    W   +
Sbjct:   269 YRASLREMDSLVGQIKDKVDHVAR-ENTLLWFTGDNGPWAQKCELAGSVGPFFGLWQTHQ 327

Query:   335 G---VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
             G    K T WEGG R   L + P      + +   + + D  PT+++ A  S  PN
Sbjct:   328 GGSPTKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPTVIALAGASLPPN 383


>UNIPROTKB|G5E629 [details] [associations]
            symbol:ARSE "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:DAAA02075311 EMBL:DAAA02075312
            EMBL:DAAA02075313 UniGene:Bt.6471 Ensembl:ENSBTAT00000050377
            OMA:VCFQIMA Uniprot:G5E629
        Length = 583

 Score = 243 (90.6 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
 Identities = 54/115 (46%), Positives = 72/115 (62%)

Query:    58 GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIM 115
             GP P+I+ ++ADDLG  DVG +G   I TPNID LA  G+ L  +     LCTPSR+A +
Sbjct:    31 GPRPNILLLMADDLGIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFL 90

Query:   116 TGKHPIHTGMQHN----VL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             TG++P+ +GM  +    VL +    GGLP SE    + LK  GY T ++GKWHLG
Sbjct:    91 TGRYPLRSGMVSSQGLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLG 145

 Score = 89 (36.4 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
 Identities = 32/133 (24%), Positives = 61/133 (45%)

Query:   266 YLNIHRHI---EDFK-RSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
             +L++H  +   E+F+ RS    +     ++D  VG+++E L+   + +++++ F SD   
Sbjct:   294 FLHVHTPLVTTENFRGRSPHGLYGDNTEEMDWMVGQILETLDTEGLTNSTLVYFTSDHGG 353

Query:   319 XXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
                          W    +G K    WEGG+R  G+  W  +L + G V  +   + D  
Sbjct:   354 SLEARFGNNQYGGWNGIYKGGKGMAGWEGGIRVPGIFRWPGVLPA-GRVIHEPTSLMDIF 412

Query:   374 PTLLSAANKSDIP 386
             PT++  A    +P
Sbjct:   413 PTVVHLAG-GQVP 424

 Score = 46 (21.3 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E + L   +E
Sbjct:   505 PLLFDLSRDPSEAHALTPDTE 525

 Score = 46 (21.3 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E + L   +E
Sbjct:   505 PLLFDLSRDPSEAHALTPDTE 525


>UNIPROTKB|E1BU03 [details] [associations]
            symbol:ARSG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0006790 "sulfur compound metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:AADN02030038
            EMBL:AADN02030039 EMBL:AADN02030040 IPI:IPI00574852
            ProteinModelPortal:E1BU03 Ensembl:ENSGALT00000006665 OMA:SDEYIYW
            Uniprot:E1BU03
        Length = 505

 Score = 283 (104.7 bits), Expect = 1.2e-21, P = 1.2e-21
 Identities = 103/365 (28%), Positives = 152/365 (41%)

Query:    41 PLAFTLSMVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
             P +  L  V V L  S    G P+ I ILADDLGW D+G +  +   TP++D LA  G  
Sbjct:     6 PWSVLLLAVLVGLCTSPVAQGKPNFIVILADDLGWGDLGANWAETKETPHLDELAAEGTR 65

Query:    98 LKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
               ++++    C+PSR++++TG+  +  G+ HN       GGLPL+E  L + L+  GY T
Sbjct:    66 FVDFHSAASTCSPSRASLLTGRLGVRNGVTHNFAISSV-GGLPLNETTLAEVLRAAGYST 124

Query:   157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
               +GKWHLG +   + P FRGF+ + G    H    D    +   + +       P    
Sbjct:   125 AAIGKWHLGHHGHHH-PIFRGFDYYFGIPYSH----DMGCTDTPGYNVPPC----PPCPQ 175

Query:   217 HGKYSTDVFTA--EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNI-HRH 272
             HG  + DV     E + II        L      AA            P   YL + H H
Sbjct:   176 HGAATRDVALPLFENLTIIQQPVDLSSLVEQYMEAAARFIQQARDSSRPFFLYLALAHMH 235

Query:   273 IE-----DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXX--XXXXXXXX 325
             +         R  + A L ++D  VG V + L       ++++ F  D            
Sbjct:   236 VPLQIAPPPDRGIYGAALREMDALVGHV-KHLADSCGKGSTLLWFTGDNGPWMQKSPTQG 294

Query:   326 XXXSNWPLRG---VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
                +   L G    K T WEGG R   L + P        +   +   D  PTL++ A  
Sbjct:   295 TLSALLSLAGGSPAKQTTWEGGHRVPALAYWPGHVPAKRSSHAMLSTLDVFPTLVALAGA 354

Query:   383 SDIPN 387
             +  PN
Sbjct:   355 TLPPN 359


>UNIPROTKB|Q96EG1 [details] [associations]
            symbol:ARSG "Arylsulfatase G" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IDA;TAS] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0006790 "sulfur compound metabolic process" evidence=IDA]
            [GO:0005783 "endoplasmic reticulum" evidence=IDA] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
            "phospholipid metabolic process" evidence=TAS] [GO:0006665
            "sphingolipid metabolic process" evidence=TAS] [GO:0006687
            "glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005615
            GO:GO:0044281 GO:GO:0046872 GO:GO:0006644 GO:GO:0005764
            GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0006687 CTD:22901 KO:K12381 OMA:LPQDRHF OrthoDB:EOG4J9MZJ
            GO:GO:0006790 EMBL:AB023218 EMBL:AY358380 EMBL:BC012375
            IPI:IPI00402293 RefSeq:NP_001254656.1 RefSeq:NP_055775.2
            UniGene:Hs.437249 ProteinModelPortal:Q96EG1 SMR:Q96EG1
            STRING:Q96EG1 DMDM:74731559 PaxDb:Q96EG1 PeptideAtlas:Q96EG1
            PRIDE:Q96EG1 Ensembl:ENST00000448504 Ensembl:ENST00000570630
            GeneID:22901 KEGG:hsa:22901 UCSC:uc002jhc.2 GeneCards:GC17P066255
            HGNC:HGNC:24102 HPA:HPA023245 HPA:HPA023285 MIM:610008
            neXtProt:NX_Q96EG1 PharmGKB:PA143485307 InParanoid:Q96EG1
            PhylomeDB:Q96EG1 SABIO-RK:Q96EG1 GenomeRNAi:22901 NextBio:43535
            ArrayExpress:Q96EG1 Bgee:Q96EG1 CleanEx:HS_ARSG
            Genevestigator:Q96EG1 GermOnline:ENSG00000141337 Uniprot:Q96EG1
        Length = 525

 Score = 252 (93.8 bits), Expect = 2.3e-21, Sum P(2) = 2.3e-21
 Identities = 49/130 (37%), Positives = 77/130 (59%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+ + ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct:    36 PNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               +  G+  N       GGLPL+E  L + L++ GY T I+GKWHLG +   Y P FRGF
Sbjct:    96 LGLRNGVTRNFAV-TSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGS-YHPNFRGF 153

Query:   179 ESHLGYWTGH 188
             + + G    H
Sbjct:   154 DYYFGIPYSH 163

 Score = 77 (32.2 bits), Expect = 2.3e-21, Sum P(2) = 2.3e-21
 Identities = 41/169 (24%), Positives = 62/169 (36%)

Query:   225 FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
             +  +A   I   ST   P  LY+A A  H   P   L A               RS + A
Sbjct:   223 YAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPR-----------GRSLYGA 271

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN-----WPLR--G- 335
              L ++D  VG++ + ++   +  N+ + F  D                    W  R  G 
Sbjct:   272 GLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNGPWAQKCELAGSVGPFTGFWQTRQGGS 330

Query:   336 -VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
               K T WEGG R   L + P      + +   + V D  PT+++ A  S
Sbjct:   331 PAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQAS 379


>UNIPROTKB|F1PYB4 [details] [associations]
            symbol:ARSD "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:RSWIPSG EMBL:AAEX03026107
            EMBL:AAEX03026106 Ensembl:ENSCAFT00000017716 Uniprot:F1PYB4
        Length = 597

 Score = 231 (86.4 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
 Identities = 48/117 (41%), Positives = 72/117 (61%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
             A++  P+I+ I+ADDLG  D+G +G   + TPNID LA  G+ L  +     LCTPSRS+
Sbjct:    39 ANAFKPNILLIMADDLGIGDLGCYGNSTLRTPNIDRLAEEGVRLTQHLAAAPLCTPSRSS 98

Query:   114 IMTGKHPIHTGMQ-HN----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
              +TG+H   +GM+ H+    + +    GGLP +E    + L++ GY T ++GKWH G
Sbjct:    99 FLTGRHSFRSGMEAHDGYRALQWNGASGGLPENETTFARILQQQGYATGLIGKWHQG 155

 Score = 96 (38.9 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
 Identities = 49/224 (21%), Positives = 88/224 (39%)

Query:   199 MKMWGLDMRRD---LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
             ++ W   + R+    E   DL  + +T     EAV  I  +   +P  L+L+    H   
Sbjct:   254 VRRWNCILMRNHDVTEQPMDL--ERTTSHMLREAVSYIERNK-HQPFLLFLSLLHVHI-- 308

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
                PL     +L   +H        +   + ++D  VG+V+ A+E+  + + +   F SD
Sbjct:   309 ---PLVTTKQFLGKSQH------GLYGDNVEEMDWLVGEVLNAIEENGLKNTTFTYFTSD 359

Query:   316 XXXXXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVS 370
                             W    RG K    WEGG+R  G+  W  +L + G V  +   + 
Sbjct:   360 HGGHLEARDERGQLGGWNGIFRGGKGMGGWEGGIRVPGIFRWPGVLPA-GRVIHEPTSLM 418

Query:   371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
             D  PT++      ++P        +++P    +    E+  HE+
Sbjct:   419 DVFPTVVQLGG-GEVPQDRVIDGRSLVPLLRGAA---EHSAHEF 458

 Score = 47 (21.6 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query:   675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVG 707
             P LF++  DP E   L+  SE    N    +VG
Sbjct:   517 PLLFELSRDPSEARPLSPDSEPL-YNMVVAQVG 548

 Score = 47 (21.6 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query:   781 PCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG 813
             P LF++  DP E   L+  SE    N    +VG
Sbjct:   517 PLLFELSRDPSEARPLSPDSE-PLYNMVVAQVG 548


>ZFIN|ZDB-GENE-081104-120 [details] [associations]
            symbol:arsh "arylsulfatase H" species:7955 "Danio
            rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
            "catalytic activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-081104-120 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12374 EMBL:CR407703 EMBL:FP236869
            IPI:IPI00506361 RefSeq:XP_003199313.1 Ensembl:ENSDART00000032992
            GeneID:100332997 KEGG:dre:100332997 Uniprot:F8VNP0
        Length = 583

 Score = 238 (88.8 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
 Identities = 49/112 (43%), Positives = 69/112 (61%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ + ++ DDLG  D+G +G   I TPNID LA  G+ L ++ +   LCTPSR+A MTG+
Sbjct:    34 PNFVLMMVDDLGIGDIGCYGNTTIRTPNIDRLASDGVKLTHHLSAAPLCTPSRTAFMTGR 93

Query:   119 HPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +P+  GM        +L+    GGLP +E    + L++ GY T IVGKWHLG
Sbjct:    94 YPLRAGMGSTGRVQVILFLAGSGGLPPNETTFAKLLQKQGYTTGIVGKWHLG 145

 Score = 92 (37.4 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
 Identities = 43/189 (22%), Positives = 78/189 (41%)

Query:   228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
             EA   +  H  D P  L+++    H+     P+   + +    RH        +   + +
Sbjct:   274 EAEQFMERHR-DGPFLLFVSFPQVHT-----PMLVTEGFAGKSRH------GLYGDNVEE 321

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL-WEGGVR 346
             +D  VG+VV+ +++  +   +++ F SD              N   RG K    W+GG+R
Sbjct:   322 VDWMVGRVVDTIDRLGLTEKTLLYFTSDHGGGIEEGPRGGW-NGIYRGGKAMGGWDGGIR 380

Query:   347 GAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSIL 405
               G+  W   L +   VAE    + D  PT++  A   ++P        +++P  E S  
Sbjct:   381 VPGIFRWPGRLAAGREVAEP-TSLMDVFPTVVKLAG-GELPKDRLLDGHDLMPLLEGSSS 438

Query:   406 RYENGTHEY 414
             R +   HE+
Sbjct:   439 RSQ---HEF 444

 Score = 40 (19.1 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query:   675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
             P +F I +DP E   L +++ D R+      V R
Sbjct:   503 PLVFLISSDPSESVPLTEQT-DPRVPEVLQRVQR 535


>UNIPROTKB|P51689 [details] [associations]
            symbol:ARSD "Arylsulfatase D" species:9606 "Homo sapiens"
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0006644 "phospholipid metabolic process"
            evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0044281 GO:GO:0046872 GO:GO:0006644
            GO:GO:0005764 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 EMBL:X83572
            EMBL:AF160499 EMBL:AC005295 EMBL:BC020229 IPI:IPI00019989
            IPI:IPI00028695 IPI:IPI00914575 PIR:I37186 RefSeq:NP_001660.2
            UniGene:Hs.528631 ProteinModelPortal:P51689 SMR:P51689
            STRING:P51689 DMDM:212276422 PaxDb:P51689 PRIDE:P51689 DNASU:414
            Ensembl:ENST00000381154 GeneID:414 KEGG:hsa:414 UCSC:uc004cqy.3
            CTD:414 GeneCards:GC0XM002818 HGNC:HGNC:717 HPA:HPA004694
            MIM:300002 neXtProt:NX_P51689 PharmGKB:PA25008 InParanoid:P51689
            KO:K12374 OMA:RSWIPSG OrthoDB:EOG4V4379 ChiTaRS:ARSD GenomeRNAi:414
            NextBio:1749 Bgee:P51689 CleanEx:HS_ARSD Genevestigator:P51689
            GermOnline:ENSG00000006756 Uniprot:P51689
        Length = 593

 Score = 230 (86.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
 Identities = 48/117 (41%), Positives = 71/117 (60%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
             A++  P+I+ I+ADDLG  D+G +G + + TPNID LA  G+ L  +     LCTPSR+A
Sbjct:    36 ANAFKPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAA 95

Query:   114 IMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
              +TG+H   +GM     +  L +    GGLP +E    + L++ GY T ++GKWH G
Sbjct:    96 FLTGRHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQG 152

 Score = 93 (37.8 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
 Identities = 42/192 (21%), Positives = 75/192 (39%)

Query:   228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
             EAV  I  H    P  L+L+    H      PL     +L   +H        +   + +
Sbjct:   281 EAVSYIERHKHG-PFLLFLSLLHVHI-----PLVTTSAFLGKSQH------GLYGDNVEE 328

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEG 343
             +D  +GKV+ A+E   + +++   F SD                W    +G K    WEG
Sbjct:   329 MDWLIGKVLNAIEDNGLKNSTFTYFTSDHGGHLEARDGHSQLGGWNGIYKGGKGMGGWEG 388

Query:   344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
             G+R  G+  W  +L +  ++ E    + D  PT++      ++P        +++P  + 
Sbjct:   389 GIRVPGIFHWPGVLPAGRVIGEP-TSLMDVFPTVVQLVG-GEVPQDRVIDGHSLVPLLQG 446

Query:   403 SILRYENGTHEY 414
             +  R     HE+
Sbjct:   447 AEAR---SAHEF 455

 Score = 48 (22.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
 Identities = 10/21 (47%), Positives = 11/21 (52%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E   L   SE
Sbjct:   514 PLLFDLSRDPSEARPLTPDSE 534

 Score = 48 (22.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
 Identities = 10/21 (47%), Positives = 11/21 (52%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E   L   SE
Sbjct:   514 PLLFDLSRDPSEARPLTPDSE 534


>CGD|CAL0006319 [details] [associations]
            symbol:orf19.1608 species:5476 "Candida albicans" [GO:0005634
            "nucleus" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 CGD:CAL0006319 EMBL:AACQ01000014 EMBL:AACQ01000013
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 KO:K01130
            RefSeq:XP_721567.1 RefSeq:XP_721687.1 ProteinModelPortal:Q5AJI4
            GeneID:3636617 GeneID:3636713 KEGG:cal:CaO19.1608
            KEGG:cal:CaO19.9176 Uniprot:Q5AJI4
        Length = 588

 Score = 238 (88.8 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
 Identities = 76/245 (31%), Positives = 122/245 (49%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRS 112
             +SS  P+ + I+ADDLG+ D+   G  +I TPN++ LA   +G+ L +++T   C+P+RS
Sbjct:     6 SSSKQPNFLIIVADDLGFTDLSPFG-GEINTPNLNKLATGANGVRLTDFHTASACSPTRS 64

Query:   113 AIMTG--KHPIHTGM------QHNVLYGCERG--GLPLSEKI--LPQYLKELGYRTRIVG 160
              +++G   H    G       +H   +  + G  G  L++K+  LP+ L++ GY T I G
Sbjct:    65 MLLSGTDNHIAGLGQMAEFAQRHPEKFNNQPGYEGY-LNDKVVALPEILQDNGYHTFISG 123

Query:   161 KWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEMKMWGL------DMRRDLE 211
             KWHLG  KK Y P  RGF        G   H  Y    ++  ++  L      D +  L+
Sbjct:   124 KWHLGL-KKPYWPNKRGFNKSFTLLPGAGNHYKYITRDSQGNQIPFLPAIYVEDDKELLQ 182

Query:   212 PAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             P  +L    YST+ FT +A++ I      +P F  + + A     P+ P QAP   +  +
Sbjct:   183 PEIELPDDFYSTNYFTDKAIEFIKETPQGKPFFGMITYTA-----PHWPYQAPQDKIAKY 237

Query:   271 RHIED 275
               + D
Sbjct:   238 NGVYD 242

 Score = 72 (30.4 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
 Identities = 13/35 (37%), Positives = 26/35 (74%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +AA++  LDE++G++++ L     L+N+ I+F+SD
Sbjct:   296 YAAMVEILDENIGRLIDHLNSIDELNNTFILFMSD 330

 Score = 53 (23.7 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query:   360 GIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR---YENSILRYENGTHEYNS 416
             G + +++  V D LPT+L  AN S  P       + + PR   + N ++   +  H+ N+
Sbjct:   428 GKILKEFTTVMDILPTILELANVSH-PGETYKGRQVVKPRGKSWVNYLINKTDQVHDENT 486

 Score = 44 (20.5 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query:   783 LFDIKNDPCEKNNLADRS-EVQRI 805
             LF+I  DP E N+L++ S E Q I
Sbjct:   519 LFNIIEDPGEINDLSESSSEYQTI 542

 Score = 44 (20.5 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query:   525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
             S+  TIL+ + D W + A   G  +L  +  +      E  SN+  Y+  +D
Sbjct:   537 SEYQTILNELLDHWAVYAAETGLIELGSD--LFEKEKIEGESNEVVYRTILD 586

 Score = 43 (20.2 bits), Expect = 1.3e-20, Sum P(4) = 1.3e-20
 Identities = 9/20 (45%), Positives = 14/20 (70%)

Query:   677 LFDIKNDPCEKNNLADRSED 696
             LF+I  DP E N+L++ S +
Sbjct:   519 LFNIIEDPGEINDLSESSSE 538

 Score = 41 (19.5 bits), Expect = 4.7e-16, Sum P(3) = 4.7e-16
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query:   463 YNGPKNE--NTNPRYENGTHEYNIPRLENSIN 492
             Y  P+++    N  Y+NG  E    RL+++ N
Sbjct:   227 YQAPQDKIAKYNGVYDNGPEELRQKRLQSAKN 258

 Score = 39 (18.8 bits), Expect = 7.5e-16, Sum P(3) = 7.5e-16
 Identities = 9/37 (24%), Positives = 20/37 (54%)

Query:   386 PNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENS 422
             P++     ++ I +Y N +  Y+NG  E    R++++
Sbjct:   223 PHWPYQAPQDKIAKY-NGV--YDNGPEELRQKRLQSA 256

 Score = 37 (18.1 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 7/16 (43%), Positives = 8/16 (50%)

Query:   436 KYENRYENGTHEYNPK 451
             KY   Y+NG  E   K
Sbjct:   236 KYNGVYDNGPEELRQK 251


>UNIPROTKB|E1BYN0 [details] [associations]
            symbol:ARSD "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 CTD:414 KO:K12374 OMA:RSWIPSG
            EMBL:AADN02017596 IPI:IPI00570897 RefSeq:XP_416855.2
            ProteinModelPortal:E1BYN0 Ensembl:ENSGALT00000026880 GeneID:418658
            KEGG:gga:418658 NextBio:20821812 Uniprot:E1BYN0
        Length = 596

 Score = 243 (90.6 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
 Identities = 56/146 (38%), Positives = 81/146 (55%)

Query:    49 VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LC 107
             +F    A    P+I+  LADDLG  DVG +G + I TPNID LA  G+ L  +     LC
Sbjct:    31 IFGFSTAVDSKPNILLFLADDLGIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLC 90

Query:   108 TPSRSAIMTGKHPIHTGM----QHNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
             TPSR+A +TG++PI +GM    ++  L +    GGLP +E    + L++ GY T ++GKW
Sbjct:    91 TPSRAAFLTGRYPIRSGMASSNRYRALQWNAGSGGLPANETTFARLLQQQGYTTGLIGKW 150

Query:   163 HLGFYKKEYT-----PTFRGFESHLG 183
             H G   + ++     P   GF+   G
Sbjct:   151 HQGVNCESFSDHCHHPLNHGFDYFYG 176

 Score = 82 (33.9 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
 Identities = 35/158 (22%), Positives = 65/158 (41%)

Query:   228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
             EA+  I   + + P  L+++    H+     PL     +L    H        +   + +
Sbjct:   282 EAISFI-KRNRNGPFLLFVSFLHVHT-----PLFTTVKFLGKSHH------GLYGDNVEE 329

Query:   288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEG 343
             +D  VGK+++ L++  + +++   F SD                W    +G K    WEG
Sbjct:   330 MDWMVGKILDLLDKEGLKNHTFTYFASDHGGHLEAQDGSAQMGGWNGIYKGGKGMGGWEG 389

Query:   344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
             G+R  G+  W  +L + G V  +   + D  PT++  A
Sbjct:   390 GIRVPGVFRWPGVLPA-GTVINEPTSLMDIFPTVVHLA 426

 Score = 43 (20.2 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P L+D+  DP E   L+  +E
Sbjct:   515 PLLYDLSRDPSESQPLSADTE 535

 Score = 43 (20.2 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P L+D+  DP E   L+  +E
Sbjct:   515 PLLYDLSRDPSESQPLSADTE 535


>TIGR_CMR|CPS_2985 [details] [associations]
            symbol:CPS_2985 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_269685.1 ProteinModelPortal:Q47ZT4 STRING:Q47ZT4
            GeneID:3523028 KEGG:cps:CPS_2985 PATRIC:21468987 OMA:RNEFLPT
            BioCyc:CPSY167879:GI48-3034-MONOMER Uniprot:Q47ZT4
        Length = 502

 Score = 281 (104.0 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
 Identities = 97/359 (27%), Positives = 165/359 (45%)

Query:    39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLG-WNDVGFH-GLDQIPTPNIDALAYSGI 96
             VL L+   +         +  P+I+ I  DD+G +N   ++ G+    TPNID +A  GI
Sbjct:    11 VLGLSLIAASSAAMATTDTAKPNILAIWGDDIGPFNISAYNRGIMGYKTPNIDRIANEGI 70

Query:    97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
             I  + Y  Q CT  R+  +TG+HP+ TG+    L G + G L   +  + + LK  GY T
Sbjct:    71 IFTDSYGDQSCTAGRAGFITGQHPMRTGLTKVGLPGAKEG-LNKKDPTIAELLKPHGYMT 129

Query:   157 RIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
                GK HLG  + E+ PT  GF+   G  Y    +D  +H  +  K      R    P  
Sbjct:   130 GQFGKNHLGD-QDEHLPTNHGFDEFFGNLYHLNAEDEPEHP-DYPKDPAFKKR--FGPRG 185

Query:   215 DLH----GKYS-TDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEPLQAPDHYLN 268
              +H    GK + T   T + ++ I        L F+  AHAA      +        +  
Sbjct:   186 AIHSFADGKITDTGPVTKKRMETIDEEFLGAALKFIDKAHAAKKPFFVWFNSTRMHVWTR 245

Query:   269 IHRHIEDFK-RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
             +    +    +  +A  + + D  VG++++ +++  +  N+II++ +D            
Sbjct:   246 LKPESDGVTGQGLYADGMVEHDGHVGQLLDKIDKLGIAENTIIMYTTDNGAELALWPDGG 305

Query:   328 XSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
              +  P RG KNT WEGG R   ++ W+  ++    V+ + + + DW+PT+L+ A  ++I
Sbjct:   306 YT--PFRGEKNTNWEGGYRVPMMVKWAGKIKPNQ-VSNEMISLIDWMPTILAVAGDTNI 361

 Score = 37 (18.1 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
 Identities = 6/14 (42%), Positives = 11/14 (78%)

Query:   674 APCLFDIKNDPCEK 687
             AP +F+++ DP E+
Sbjct:   442 APKIFNLRMDPYER 455

 Score = 37 (18.1 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
 Identities = 6/14 (42%), Positives = 11/14 (78%)

Query:   780 APCLFDIKNDPCEK 793
             AP +F+++ DP E+
Sbjct:   442 APKIFNLRMDPYER 455


>UNIPROTKB|C9J5G7 [details] [associations]
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 IPI:IPI00640709 ProteinModelPortal:C9J5G7 SMR:C9J5G7
            STRING:C9J5G7 Ensembl:ENST00000438544 HOGENOM:HOG000213821
            ArrayExpress:C9J5G7 Bgee:C9J5G7 Uniprot:C9J5G7
        Length = 178

 Score = 254 (94.5 bits), Expect = 1.3e-20, P = 1.3e-20
 Identities = 54/139 (38%), Positives = 82/139 (58%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
             S+  P+I+ ++ADDLG  D+G +G + + TPNID LA  G+ L  + +   LCTPSR+A 
Sbjct:    34 SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93

Query:   115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
             +TG++P+ +GM  ++ Y   +     GGLP +E    + LKE GY T ++GKWHLG   +
Sbjct:    94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153

Query:   170 E-----YTPTFRGFESHLG 183
                   + P   GF+   G
Sbjct:   154 SASDHCHHPLHHGFDHFYG 172


>ZFIN|ZDB-GENE-060503-154 [details] [associations]
            symbol:arsg "arylsulfatase G" species:7955 "Danio
            rerio" [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-060503-154 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:CR926135 EMBL:CABZ01038699
            EMBL:CABZ01038700 IPI:IPI00502628 Ensembl:ENSDART00000091423
            Bgee:F1QQI9 Uniprot:F1QQI9
        Length = 526

 Score = 274 (101.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
 Identities = 87/283 (30%), Positives = 137/283 (48%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
             P+ I ILADD+GW D+  +  D   PTP +D+L   G    ++++    C+PSR++I+TG
Sbjct:    35 PNFIIILADDIGWGDLWLNRPDNSTPTPWLDSLMLKGKRFTDFHSPASTCSPSRASILTG 94

Query:   118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
             +H +  G+ HN   G   GGLPL+E    Q L + GY T ++GKWHLG +   Y+P  RG
Sbjct:    95 RHGLRNGVTHNFAVGSV-GGLPLNETTFAQLLHDEGYYTAMIGKWHLG-HNGSYSPVHRG 152

Query:   178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD-----VFTAEAVDI 232
             F+ +LG    +    D    +    GLD+     P   +H +YS +      +T   + +
Sbjct:   153 FDYYLGIPYSN----DMGCTDKP--GLDLPC-CPPC--VHSQYSINKKHEGCYTKVGLPL 203

Query:   233 IHNHST-DEPLFLY-----LAHAA-----THSANPYEPLQAPDHYLNI-HRHIEDFKRS- 279
               N    ++PL  +      A AA     T S    +P      Y+ + H H+  F  + 
Sbjct:   204 FENEKIIEQPLDTWSLKDRYATAAVQQIFTASVTKKQPFLL---YVALAHMHVPLFHNTF 260

Query:   280 -------KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
                     + A L  +D  VG +++AL   + L N++I F  D
Sbjct:   261 LNVTTEDPYTASLSDMDSLVGNIMQALITEQ-LENTLIWFTGD 302

 Score = 42 (19.8 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query:   658 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQR 698
             S+ C   +  P +    P +FD+  D  E+  L   S++ R
Sbjct:   437 SVACDG-ESGPQQHHDPPLIFDLSQDEAEETPLDPESKEFR 476

 Score = 41 (19.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
 Identities = 7/22 (31%), Positives = 13/22 (59%)

Query:   511 DGIDVWSVLSRNEPSKRNTILH 532
             DGID+  VL  +  +   +++H
Sbjct:   387 DGIDITDVLLNDSETGHESLMH 408

 Score = 41 (19.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
 Identities = 7/22 (31%), Positives = 13/22 (59%)

Query:   576 DGIDVWSVLSRNEPSKRNTILH 597
             DGID+  VL  +  +   +++H
Sbjct:   387 DGIDITDVLLNDSETGHESLMH 408

 Score = 38 (18.4 bits), Expect = 6.1e-20, Sum P(3) = 6.1e-20
 Identities = 10/38 (26%), Positives = 17/38 (44%)

Query:   764 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 801
             S+ C   +  P +    P +FD+  D  E+  L   S+
Sbjct:   437 SVACDG-ESGPQQHHDPPLIFDLSQDEAEETPLDPESK 473


>TIGR_CMR|CPS_2983 [details] [associations]
            symbol:CPS_2983 "putative arylsulfatase" species:167879
            "Colwellia psychrerythraea 34H" [GO:0004065 "arylsulfatase
            activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 OMA:DDQVGIL
            HOGENOM:HOG000135355 RefSeq:YP_269683.1 ProteinModelPortal:Q47ZT6
            STRING:Q47ZT6 GeneID:3520535 KEGG:cps:CPS_2983 PATRIC:21468983
            BioCyc:CPSY167879:GI48-3032-MONOMER Uniprot:Q47ZT6
        Length = 522

 Score = 270 (100.1 bits), Expect = 3.9e-20, P = 3.9e-20
 Identities = 100/381 (26%), Positives = 167/381 (43%)

Query:    27 ELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFH--GLDQIP 84
             E+  R + +A  +  LA   S        +   P+++ I  DD+G+ ++  +  G+    
Sbjct:     2 EMNNRLKKLALGIGVLAIATSAA---ATTNKAKPNVLAIWGDDIGYYNISAYNQGMMGYQ 58

Query:    85 TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
             TPNID +A  G +  ++Y  Q CT  R++ + G+ P  TG+    + G   G +P     
Sbjct:    59 TPNIDRIADEGALFTHHYAQQSCTAGRASFILGQEPFRTGLLTIGMPGSTHG-IPDWTPT 117

Query:   145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQD-----YFDHSAE 197
             +   LKE GY T   GK HLG   K + PT  GF+   G  Y    ++     Y+    E
Sbjct:   118 IADLLKEKGYMTAQFGKNHLGDQDK-HLPTNHGFDEFFGNLYHLNAEEEPETYYYPKDKE 176

Query:   198 EMKMWGLDMRRDLEPAWDLHGKY-STDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSAN 255
               K +G   R  +    D  GK  +T   T + ++          L F+  AH A     
Sbjct:   177 FHKKYG--PRGVIHSFAD--GKIENTGSMTRKRMETADGEFLAGTLKFIDKAHKAK---K 229

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILH-----KLDESVGKVVEALEQRRMLSNSII 310
             P+    +    +++   +++  R K    L      + D+ VG +++ L+  ++  N+I+
Sbjct:   230 PFFIWHSSTR-MHVWTRLQEKYRGKSGVSLTADGMLEHDDQVGILLDKLDDLKIADNTIV 288

Query:   311 VFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHV 369
             ++ +D             S  P RG K T  EGG+R   L+ W   +++         H 
Sbjct:   289 IYSTDNGAEKFTWPDGGTS--PFRGEKGTTTEGGMRVPQLVRWPGTIKAGSKFNNMMSH- 345

Query:   370 SDWLPTLLSAANKSDIPNYVN 390
              DW+PTLL+AA +   PN VN
Sbjct:   346 EDWMPTLLAAAGE---PNIVN 363


>UNIPROTKB|F1N665 [details] [associations]
            symbol:ARSG "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:DAAA02049729
            EMBL:DAAA02049730 EMBL:DAAA02049731 EMBL:DAAA02049732
            EMBL:DAAA02049733 IPI:IPI00867152 UniGene:Bt.103824
            ProteinModelPortal:F1N665 Ensembl:ENSBTAT00000014061 OMA:GHARNAF
            Uniprot:F1N665
        Length = 328

 Score = 249 (92.7 bits), Expect = 4.4e-20, P = 4.4e-20
 Identities = 49/130 (37%), Positives = 75/130 (57%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+ + ILADD+GW D+G +      T N+D +A  G    +++     C+PSR+A++TG+
Sbjct:    36 PNFVIILADDMGWGDLGANWAGTKDTANLDRMAAEGTRFVDFHAAASTCSPSRAALLTGR 95

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               +  G+ HN       GGLPL+E  L + L+  GY T ++GKWHLG +   + P FRGF
Sbjct:    96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLRGAGYVTGMIGKWHLGHHGSHH-PNFRGF 153

Query:   179 ESHLGYWTGH 188
             + + G    H
Sbjct:   154 DYYFGVPYSH 163


>UNIPROTKB|F1NFL4 [details] [associations]
            symbol:F1NFL4 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 OMA:RWNDWKA EMBL:AADN02017596
            IPI:IPI00586912 Ensembl:ENSGALT00000026882 Uniprot:F1NFL4
        Length = 374

 Score = 258 (95.9 bits), Expect = 1.1e-19, P = 1.1e-19
 Identities = 97/347 (27%), Positives = 160/347 (46%)

Query:    60 PHIIFILADDLGWND--VGFHGLDQI---PTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
             P+ + ILADDLG  D  +  H +D I    TP+ID LA  G+ L  +     +CTPSR+A
Sbjct:     2 PNFLLILADDLGIGDTSIKMH-IDMIFLFRTPHIDGLAKEGVRLTQHIAAAAVCTPSRAA 60

Query:   114 IMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
              +TG++PI +  +  +L+  GC  GGLP +E    + L + GY T +VGKWHLG   K +
Sbjct:    61 FLTGRYPIRS--ERRILFWNGCS-GGLPPNETTFARVLHQQGYSTALVGKWHLGVNCKSH 117

Query:   172 T-----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY-STDVF 225
                   P   GFE    Y+ G      +  +     G D     + + D    Y S   F
Sbjct:   118 RDHCHHPLNHGFE----YFYGMSFTILNECQ-----GTDDPELAKSSQD--NLYCSAYAF 166

Query:   226 TAEAVDII----HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI---EDFK- 277
               +   +I     N+S  + L+  L   +    N + P       L++H  +   ++F  
Sbjct:   167 VWKTYPLILSKMENNSMCDHLWSPLVSFSGKVRNKHRPFLLFLSLLHVHTPLITTKEFLG 226

Query:   278 RSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR 334
             RS+   +   + ++D  VG++++ +++  + + + I F SD             S +   
Sbjct:   227 RSRHGLYGDNVEEMDWMVGRLLDVIDKEGLKNTTFIYFASDHKENLTNCPNVYTSKFSSE 286

Query:   335 GVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
              +    WEGG+R  G++ W   L + GIV  +   + D  PT++  A
Sbjct:   287 IMGG--WEGGIRVPGIVRWPGALPA-GIVISEPTSIMDIFPTVVHLA 330


>UNIPROTKB|Q32KH9 [details] [associations]
            symbol:ARSG "Arylsulfatase G" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=ISS] [GO:0004065
            "arylsulfatase activity" evidence=ISS] [GO:0006790 "sulfur compound
            metabolic process" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 EMBL:AAEX02034846 EMBL:BN000758
            RefSeq:NP_001041563.1 UniGene:Cfa.37363 ProteinModelPortal:Q32KH9
            STRING:Q32KH9 Ensembl:ENSCAFT00000017623 GeneID:480460
            KEGG:cfa:480460 CTD:22901 InParanoid:Q32KH9 KO:K12381 OMA:LPQDRHF
            OrthoDB:EOG4J9MZJ NextBio:20855470 GO:GO:0006790 Uniprot:Q32KH9
        Length = 535

 Score = 266 (98.7 bits), Expect = 1.2e-19, P = 1.2e-19
 Identities = 78/272 (28%), Positives = 129/272 (47%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+ + ILADD+GW D+G +  +   T N+D +A  G+   +++     C+PSR++++TG+
Sbjct:    36 PNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGR 95

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               +  G+ HN       GGLPL+E  L + L++ GY T ++GKWHLG +   Y P FRGF
Sbjct:    96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLGHHGP-YHPNFRGF 153

Query:   179 ESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA--EAVDIIHN 235
             + + G    H     D             R D  P+  L     TDV     E ++I+  
Sbjct:   154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPRGD-RPSRSLERDCYTDVALPLYENLNIVEQ 212

Query:   236 --------HSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHR-HIEDFKRSK--FAA 283
                     H   E    ++ HA+  S  P+       H ++ I R  +    R +  + A
Sbjct:   213 PVNLSSLAHKYAEKAIQFIQHASA-SGRPFLLYMGLAHMHVPISRTQLSAVLRGRRPYGA 271

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
              L ++D  VG++ + ++ R    N+ + F  D
Sbjct:   272 GLREMDSLVGQIKDKVD-RTAKENTFLWFTGD 302


>TIGR_CMR|CPS_2984 [details] [associations]
            symbol:CPS_2984 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
            RefSeq:YP_269684.1 ProteinModelPortal:Q47ZT5 STRING:Q47ZT5
            GeneID:3520029 KEGG:cps:CPS_2984 PATRIC:21468985 OMA:NGPHANT
            BioCyc:CPSY167879:GI48-3033-MONOMER Uniprot:Q47ZT5
        Length = 512

 Score = 250 (93.1 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 89/359 (24%), Positives = 158/359 (44%)

Query:    47 SMVFVDLVASSGPPHIIFILADDLGWNDVGF--HGLDQIPTPNIDALAYSGIILKNYYTV 104
             S++      ++  P+I+F   DD+G  ++    HG+    TPNID +A  G++  +YY  
Sbjct:    14 SLIATASATAAEKPNILFFWGDDIGRTNISAYSHGIMGFKTPNIDRIAKEGMMFTDYYAD 73

Query:   105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
             Q CT  RS  +TG+  + TGM    L G + G +   +  + + LK  GY T   GK HL
Sbjct:    74 QSCTAGRSTFITGQSGLRTGMTKVGLPGAKEG-IQDRDITIAEMLKAKGYTTGQFGKNHL 132

Query:   165 GFYKKEYTPTFRGFESHLG--YWTGHQ------DYFDHSAEEMKMW--GL-----DMR-R 208
             G  K E+ P+  GF+   G  Y    +      DY    A + K    G+     D +  
Sbjct:   133 GD-KDEHLPSNHGFDEFFGNLYHLNAEEEPEDPDYPKDPAFKKKFGPRGVIHSYADGKIE 191

Query:   209 DLEPAWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
             D  P      + + D F A A+  +       +P F+++  A  H      P        
Sbjct:   192 DTGPLTKKRMETADDEFVAAAMKFVDKAVKAKKPFFVWVNTAGMHFRTHINP-------- 243

Query:   268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
                +H+    +  +  ++   D  VG +++ L++ ++  ++I+++ +D            
Sbjct:   244 ---KHVGLSGQGFYNDVMVAHDNHVGMMLDQLDKLKVTDSTIVMYSTDNGVHYNTWPDAG 300

Query:   328 XSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
              +  P  G KN+  EG  R   ++ W   +++  +  E   H+ DW+PTL +AA  + +
Sbjct:   301 IT--PFDGEKNSEKEGAYRVPMMVRWPGKIKAGEVSNEMMAHL-DWMPTLAAAAGDTKL 356

 Score = 60 (26.2 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 19/59 (32%), Positives = 31/59 (52%)

Query:   498 ENRSNDNSYQNEIDGIDVWSVLS-RNEPSKRNTILHNIDDEWQISALTRGKWKLV-KEN 554
             + R  +   +  +DG ++   L+ + E S RN I H ++DE    A+  G WK+V  EN
Sbjct:   364 KRRFGNKQSKIHLDGYNMLPHLTGKTEKSPRN-IYHYLNDEGFPVAIRIGDWKMVYAEN 421


>UNIPROTKB|P77318 [details] [associations]
            symbol:ydeN "putative sulfatase" species:83333 "Escherichia
            coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
            GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            HOGENOM:HOG000135352 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 PIR:E64903 RefSeq:NP_416015.2
            RefSeq:YP_489763.1 ProteinModelPortal:P77318 SMR:P77318
            DIP:DIP-11682N IntAct:P77318 PhosSite:P0810453 PRIDE:P77318
            EnsemblBacteria:EBESCT00000001979 EnsemblBacteria:EBESCT00000015602
            GeneID:12931856 GeneID:945957 KEGG:ecj:Y75_p1474 KEGG:eco:b1498
            PATRIC:32118290 EchoBASE:EB3557 EcoGene:EG13796 KO:K01138
            OMA:PVINRCA ProtClustDB:CLSK880035 BioCyc:EcoCyc:G6788-MONOMER
            BioCyc:ECOL316407:JW5243-MONOMER Genevestigator:P77318
            Uniprot:P77318
        Length = 560

 Score = 257 (95.5 bits), Expect = 2.8e-19, Sum P(2) = 2.8e-19
 Identities = 105/390 (26%), Positives = 166/390 (42%)

Query:    82 QIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
             Q  TP + +L   G+   N Y    +  PSR+AIMTG+ P   G+  N      + G+PL
Sbjct:   106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPL 162

Query:   141 SEKILPQYLKELGYRTRIVGKWHLGFYK----------KEYTPTFRGFESHLGYWTGHQ- 189
             +E  LP+  +  GY T  VGKWHL              ++Y   F  F +    W     
Sbjct:   163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE--EWQPQNR 220

Query:   190 --DYFD--HSAEEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPL 242
               DYF   H+A         L   R+  PA    G Y +D  T EA+ ++    T D+P 
Sbjct:   221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPA---KG-YISDQLTDEAIGVVDRAKTLDQPF 276

Query:   243 FLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
              LYLA+ A H  N   P  APD Y            + +A++ + +D+ V +++E L++ 
Sbjct:   277 MLYLAYNAPHLPND-NP--APDQYQKQFNTGSQTADNYYASV-YSVDQGVKRILEQLKKN 332

Query:   303 RMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
                 N+II+F SD              N   +G K+  + GG      +W       G  
Sbjct:   333 GQYDNTIILFTSDNGAVIDGPLPL---NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389

Query:   363 AEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-----------ILRYENGT 411
              ++ +   D+ PT L AA+ S IP  +     +++P  ++            I  Y +  
Sbjct:   390 -DKLISAMDFYPTALDAADIS-IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447

Query:   412 HEYNSPRIENSN--TRYENGTHEYNPKYEN 439
              E N P  +N +   R+++  + +NP  E+
Sbjct:   448 DEENIPFWDNYHKFVRHQSDDYPHNPNTED 477

 Score = 53 (23.7 bits), Expect = 2.8e-19, Sum P(2) = 2.8e-19
 Identities = 18/64 (28%), Positives = 29/64 (45%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFH--GLDQIPTPNIDAL-AYSGIILKNYYTVQLCTPSR 111
             ++ G P+II +  DDLG+  + F     D     N + +  Y   I K     Q  TP+ 
Sbjct:    53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112

Query:   112 SAIM 115
              ++M
Sbjct:   113 LSLM 116


>UNIPROTKB|F1RV22 [details] [associations]
            symbol:ARSG "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006790 "sulfur compound metabolic process"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 KO:K12381 OMA:LPQDRHF GO:GO:0006790
            EMBL:FP085458 EMBL:FP085465 EMBL:FP067366 RefSeq:XP_003131311.1
            UniGene:Ssc.62110 Ensembl:ENSSSCT00000018790 GeneID:100521576
            KEGG:ssc:100521576 Uniprot:F1RV22
        Length = 525

 Score = 260 (96.6 bits), Expect = 5.0e-19, P = 5.0e-19
 Identities = 78/275 (28%), Positives = 134/275 (48%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+ + ILADD+GW D+G +  +   T N+D LA  G+   +++     C+PSR++++TG+
Sbjct:    36 PNFVIILADDMGWGDLGANWAETKDTANLDKLAAEGMRFVDFHAAASTCSPSRASLLTGR 95

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
               +  G+ HN       GGLPL+E  L + L+  GY T ++GKWHLG +   + P FRGF
Sbjct:    96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQRAGYITGMIGKWHLGHHGS-FHPNFRGF 153

Query:   179 ESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN-H 236
             + + G    H     D             RR  +P+ +L     +DV    A+ +  N +
Sbjct:   154 DYYFGIPYSHDMGCTDTPGYNYPPCPACPRRH-QPSRNLERDCYSDV----ALPLYENLN 208

Query:   237 STDEPLFLY-LAHAATHSANPY-EPLQAPDH----YLNI-HRHIE---------DFKRSK 280
               ++P+ L  LA      A  + +  +A       Y+ + H H+           + R  
Sbjct:   209 IVEQPVNLSGLARKYAEKATQFIQQARASGRPFLLYVGLAHMHVPLSRPQRSAGPWDRRP 268

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +AA L ++D  VG++ + ++ R   +N+ + F  D
Sbjct:   269 YAAGLREMDRLVGQIKDKVD-RTAKNNTFLWFTGD 302


>UNIPROTKB|F1PYB3 [details] [associations]
            symbol:ARSE "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026107
            Ensembl:ENSCAFT00000017722 Uniprot:F1PYB3
        Length = 253

 Score = 239 (89.2 bits), Expect = 5.2e-19, P = 5.2e-19
 Identities = 47/113 (41%), Positives = 72/113 (63%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
             P+I+ ++ADD G  D+G +G + I TPNID LA  G++L  +     +CTPSR+A +TG+
Sbjct:     6 PNILLLMADDFGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGR 65

Query:   119 HPIHTGMQ-----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             +P+ +G+      + VL +    GGLP +E    + LK+ GY T ++GKWHLG
Sbjct:    66 YPLRSGLSSLINGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLG 118


>TIGR_CMR|CPS_3032 [details] [associations]
            symbol:CPS_3032 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0008484 KO:K01130 RefSeq:YP_269731.1
            ProteinModelPortal:Q47ZN9 STRING:Q47ZN9 GeneID:3518391
            KEGG:cps:CPS_3032 PATRIC:21469075 HOGENOM:HOG000135355 OMA:RWNDWKA
            BioCyc:CPSY167879:GI48-3081-MONOMER Uniprot:Q47ZN9
        Length = 522

 Score = 258 (95.9 bits), Expect = 8.2e-19, P = 8.2e-19
 Identities = 97/366 (26%), Positives = 161/366 (43%)

Query:    32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLG-WNDVGF-HGLDQIPTPNID 89
             T+   FA+     T S   +    +S  P+I+ I  DD+G +N   + HG+    TPNID
Sbjct:     5 TKFTQFAIALGMLTASATALATTDTS-KPNILAIWGDDIGIYNISAYNHGMMGYQTPNID 63

Query:    90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
              +A  G +  + Y  Q CT  RSA + G+ P  TG+    + G   G +P     +    
Sbjct:    64 RIANEGALFTDQYAQQSCTAGRSAFILGQEPFRTGLLTIGMPGSTHG-IPDWAPTIGDVA 122

Query:   150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMR 207
             K+ GY T   GK HLG   K + PT  GF+   G       Y  ++ EE + +    D R
Sbjct:   123 KDNGYMTAQFGKNHLGDQDK-HLPTKHGFDEFFGNL-----YHLNAEEEPETYYYPKDPR 176

Query:   208 --RDLEPAWDLH----GKYS-TDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEP 259
               +   P   LH    G+   T   T + ++          L F+  AH A     P+  
Sbjct:   177 FKKKFGPRGVLHTFADGRMEDTGALTRKRMETADEEFLGATLKFIDKAHKAD---KPFF- 232

Query:   260 LQAPDHYLNIHRHIEDFKRSK-----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
             +      +++H  +++  + K     +A  + + DE VG +++ L+  ++  N+I+++ +
Sbjct:   233 IWYNSTRMHVHTRLQEKWQGKSGISIYADGMLEHDEHVGVLLDKLDDLKIADNTIVIYTT 292

Query:   315 DXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
             D              N P  G K T +EGG+R   L+ W   ++    +     H+ DW+
Sbjct:   293 DNGAETFTWPDG--GNTPFHGEKGTTYEGGMRVPQLVRWPGTIKPGSKMNSMMSHI-DWM 349

Query:   374 PTLLSA 379
             PTL +A
Sbjct:   350 PTLAAA 355


>UNIPROTKB|I3LCI6 [details] [associations]
            symbol:I3LCI6 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:CU469102
            EMBL:AEMK01103856 EMBL:AEMK01167009 Ensembl:ENSSSCT00000031398
            Uniprot:I3LCI6
        Length = 121

 Score = 176 (67.0 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
 Identities = 30/55 (54%), Positives = 40/55 (72%)

Query:   329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
             +NWPLRG K +LWEGGVRG G +  PLL+ +G+   + +H+SDWLPTL+  A  S
Sbjct:    19 NNWPLRGRKWSLWEGGVRGVGFVAGPLLKRKGVKNRELIHISDWLPTLVKLAGGS 73

 Score = 83 (34.3 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNID 535
             +DG DVW  +S   PS R  +LHNID
Sbjct:    80 LDGFDVWKTISEGSPSPRMELLHNID 105

 Score = 83 (34.3 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query:   575 IDGIDVWSVLSRNEPSKRNTILHNID 600
             +DG DVW  +S   PS R  +LHNID
Sbjct:    80 LDGFDVWKTISEGSPSPRMELLHNID 105


>ASPGD|ASPL0000001694 [details] [associations]
            symbol:AN6847 species:162425 "Emericella nidulans"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:BN001301 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:IEWTNIS
            ProteinModelPortal:C8V2I8 EnsemblFungi:CADANIAT00007645
            Uniprot:C8V2I8
        Length = 616

 Score = 144 (55.7 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
 Identities = 26/58 (44%), Positives = 41/58 (70%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
             P+ + I+ADDLG++D+G +G  +I TPNID LA  G+   +++    C+P+R+ IMTG
Sbjct:     7 PNFLVIVADDLGFSDIGCYG-SEIRTPNIDKLAQKGVRFTDFHAAAACSPTRAMIMTG 63

 Score = 138 (53.6 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
 Identities = 57/173 (32%), Positives = 72/173 (41%)

Query:   140 LSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE---SHLGYWTGHQDYFDH 194
             L+E++  LP+ L++ GY T + GKWHLG    E +P  RGF+   +HL   + H  Y   
Sbjct:   106 LNERVVALPEILRDAGYHTLMSGKWHLGL-TPERSPYKRGFDRSLAHLPACSNHYAYEPQ 164

Query:   195 SAE--------EMKMWGLDMR-----RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE- 240
               +        E     L M      R L   W     Y  D      VD   N   DE 
Sbjct:   165 LRDQDETPTFLEASYIALHMEDDKYVRSLPEGWYSSNGYG-DKMREYLVDWHKNKKEDED 223

Query:   241 -PLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDFKRSKFAAILHKL 288
              P F YL   A     P+ PLQAP    DHY  ++    D  R K  A L KL
Sbjct:   224 KPFFAYLPFTA-----PHWPLQAPREYIDHYRGVYDDGPDALRLKRLASLKKL 271

 Score = 68 (29.0 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
 Identities = 13/35 (37%), Positives = 23/35 (65%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             FA ++  +D +VGK+V+ L+    L N+ + F+SD
Sbjct:   311 FAGMVECIDANVGKIVDYLDSIGELDNTFVCFMSD 345

 Score = 42 (19.8 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
 Identities = 11/39 (28%), Positives = 22/39 (56%)

Query:   783 LFDIKNDPCEKNNLADR--SEVQRI----NHYTTEVGYL 815
             L+++  DP E N+LA++    +Q++    + Y  E G +
Sbjct:   523 LYNLVEDPGEINDLAEKYPERLQKLLKLWDQYVLETGVI 561

 Score = 39 (18.8 bits), Expect = 9.5e-18, Sum P(4) = 9.5e-18
 Identities = 7/17 (41%), Positives = 13/17 (76%)

Query:   677 LFDIKNDPCEKNNLADR 693
             L+++  DP E N+LA++
Sbjct:   523 LYNLVEDPGEINDLAEK 539


>TIGR_CMR|CPS_2381 [details] [associations]
            symbol:CPS_2381 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000014304 RefSeq:YP_269099.1
            ProteinModelPortal:Q482B9 STRING:Q482B9 GeneID:3523329
            KEGG:cps:CPS_2381 PATRIC:21467845 OMA:VAPKKYF
            ProtClustDB:CLSK494238 BioCyc:CPSY167879:GI48-2444-MONOMER
            Uniprot:Q482B9
        Length = 511

 Score = 175 (66.7 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
 Identities = 53/132 (40%), Positives = 70/132 (53%)

Query:    42 LAFTLSMV-FVDLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
             LAF ++ V    L  SS    +++FI  DDL  ND+G +G   + +PNIDALA  GI   
Sbjct:    21 LAFVVNSVGAAQLKKSSTLSMNVLFITIDDLN-NDLGAYGHHLVKSPNIDALAKKGIRFD 79

Query:    99 KNYYTVQLCTPSRSAIMTGKHPIHTGM----QHNVLYGCERGGLPLSEKILPQYLKELGY 154
             K Y    +CTPSRS+ MTG +P  TG+     H  +    R  +P     LPQ  K  GY
Sbjct:    80 KAYSQSPMCTPSRSSFMTGLYPDQTGIIAHGSHTQMTAHFREHIP-KVTTLPQLFKNNGY 138

Query:   155 RTRIVGK-WHLG 165
              +  VGK +H G
Sbjct:   139 FSGRVGKIYHQG 150

 Score = 109 (43.4 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
 Identities = 36/146 (24%), Positives = 67/146 (45%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
             +AA+ + +D  VG+V++AL+Q+ +  N+I+VF+SD               W     K +L
Sbjct:   305 YAAVSY-VDAQVGRVLDALKQQDLSDNTIVVFLSDHGYELGQHGL-----WQ----KGSL 354

Query:   341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRY 400
             +EG  R   +I++P ++  G V    V + D  PTL         P Y+    +++ P  
Sbjct:   355 FEGSARAPLIIYAPNVKDNGRVVTSPVELVDIYPTLAKLTGLV-APEYLAG--KDLTPAL 411

Query:   401 ENSILRYENGTHEYNSPRIENSNTRY 426
              +   +   G +     R +  N ++
Sbjct:   412 NDVDFQVRKGAYSAILNRNKGDNNQF 437

 Score = 59 (25.8 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query:   783 LFDIKNDPCEKNNLADRSEVQRI 805
             L+D KNDP E  NLAD+  ++ +
Sbjct:   466 LYDHKNDPQELKNLADKVSLESV 488

 Score = 56 (24.8 bits), Expect = 1.5e-17, Sum P(3) = 1.5e-17
 Identities = 11/17 (64%), Positives = 13/17 (76%)

Query:   677 LFDIKNDPCEKNNLADR 693
             L+D KNDP E  NLAD+
Sbjct:   466 LYDHKNDPQELKNLADK 482


>UNIPROTKB|I3LM95 [details] [associations]
            symbol:ARSD "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000027390
            OMA:INIGGHE Uniprot:I3LM95
        Length = 580

 Score = 196 (74.1 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
 Identities = 46/135 (34%), Positives = 72/135 (53%)

Query:    40 LPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
             LP   T+ ++     + +GP   P+I+ I+ADDLG  D+G +G D +  P +     +G 
Sbjct:    45 LPGLLTVCLLLPTCASKAGPAFKPNILLIMADDLGIGDLGCYGNDTLRYPGLGLRVGAGT 104

Query:    97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLK 150
              L        +CTPSR+A +TG+H + +G      + VL +    GGLP +E    + L+
Sbjct:   105 RLSAXLAAAPVCTPSRAAFLTGRHALRSGRWKGDGYRVLRWNGGSGGLPQNETTFARILQ 164

Query:   151 ELGYRTRIVGKWHLG 165
               GY T ++GKWH G
Sbjct:   165 RQGYATGLIGKWHQG 179

 Score = 86 (35.3 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
 Identities = 30/135 (22%), Positives = 57/135 (42%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
             + ++D  VG ++ A+E+  + + ++  F SD                W    RG K    
Sbjct:   312 VEEMDGLVGDILNAIEEHGLKNTTLTYFTSDHGGHLEAIDGHVQLGGWNGIYRGGKGMGG 371

Query:   341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
             WEGG+R  G+  W  +L + G V ++   + D  PT++       +P        +++P 
Sbjct:   372 WEGGIRVPGIFRWPGVLPA-GRVIQEPTSLMDVFPTVVQLGG-GQVPQDRVIDGRSLVPL 429

Query:   400 YENSILRYENGTHEY 414
              +      E+  HE+
Sbjct:   430 LQGET---EHSAHEF 441

 Score = 52 (23.4 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E   LA  SE
Sbjct:   500 PLLFDLSGDPSEAQPLAPGSE 520

 Score = 52 (23.4 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E   LA  SE
Sbjct:   500 PLLFDLSGDPSEAQPLAPGSE 520


>UNIPROTKB|F5H324 [details] [associations]
            symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
            HGNC:HGNC:719 IPI:IPI01015579 ProteinModelPortal:F5H324 SMR:F5H324
            Ensembl:ENST00000540563 UCSC:uc011mhi.2 ArrayExpress:F5H324
            Bgee:F5H324 Uniprot:F5H324
        Length = 544

 Score = 193 (73.0 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
 Identities = 43/110 (39%), Positives = 62/110 (56%)

Query:    85 TPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-----GGL 138
             TPNID LA  G+ L  + +   LCTPSR+A +TG++P+ +GM  ++ Y   +     GGL
Sbjct:    18 TPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGASGGL 77

Query:   139 PLSEKILPQYLKELGYRTRIVGKWHLGFYKKE-----YTPTFRGFESHLG 183
             P +E    + LKE GY T ++GKWHLG   +      + P   GF+   G
Sbjct:    78 PTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFDHFYG 127

 Score = 85 (35.0 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
 Identities = 25/107 (23%), Positives = 50/107 (46%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
             + ++D  VG++++ L+   + ++++I F SD                W    +G K    
Sbjct:   278 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 337

Query:   341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
             WEGG+R  G+  W  +L +  ++ E    + D  PT++  A   ++P
Sbjct:   338 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 382

 Score = 49 (22.3 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   675 PCLFDIKNDPCEKNNLADRSE 695
             P LFD+  DP E + L   SE
Sbjct:   466 PLLFDLSRDPSETHILTPASE 486

 Score = 49 (22.3 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   781 PCLFDIKNDPCEKNNLADRSE 801
             P LFD+  DP E + L   SE
Sbjct:   466 PLLFDLSRDPSETHILTPASE 486


>UNIPROTKB|I3LUP9 [details] [associations]
            symbol:ARSA "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016021 "integral to membrane" evidence=IEA]
            [GO:0007339 "binding of sperm to zona pellucida" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0005509 "calcium
            ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
            GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000031954
            OMA:GFDENTI Uniprot:I3LUP9
        Length = 486

 Score = 231 (86.4 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
 Identities = 86/291 (29%), Positives = 134/291 (46%)

Query:    44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
             + L++     +A++ PP+I+ I ADDLG+ D+G +G     TPN+D LA  G+   ++Y 
Sbjct:     5 WALTLALASGLAATSPPNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV 64

Query:   104 -VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
              V LCTPSR+A++TG+ P+  G     LY       P +E +  +     GY T + GKW
Sbjct:    65 PVSLCTPSRAALLTGRLPVRMG-----LY-------PGAEVLAAR-----GYLTGMAGKW 107

Query:   163 HLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSA--EEMKMWGL-------D 205
             HLG   +  + P   GF   LG    H          F  S   +     GL       +
Sbjct:   108 HLGVGPEGAFLPPHXGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVPVPLLAN 167

Query:   206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPD 264
             +  + +P W L G  +   + A A D++ +      P FLY A   TH    Y P Q   
Sbjct:   168 LSVEAQPPW-LPGLEAR--YVAFARDLMADAQRQGRPFFLYYASHHTH----Y-P-QFSG 218

Query:   265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
                + H       R  F   L +LD +VG ++ A+    +L  ++++F +D
Sbjct:   219 QSFSGHSG-----RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTAD 264

 Score = 47 (21.6 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P LFD+  DP E  NL
Sbjct:   405 PLLFDLSEDPGENYNL 420

 Score = 47 (21.6 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
 Identities = 9/16 (56%), Positives = 10/16 (62%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P LFD+  DP E  NL
Sbjct:   405 PLLFDLSEDPGENYNL 420


>UNIPROTKB|P95059 [details] [associations]
            symbol:atsA "POSSIBLE ARYLSULFATASE ATSA (ARYL-SULFATE
            SULPHOHYDROLASE) (ARYLSULPHATASE)" species:1773 "Mycobacterium
            tuberculosis" [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0010033 "response to organic substance" evidence=IEP]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005886 GenomeReviews:AL123456_GR GO:GO:0010033
            EMBL:BX842574 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 HSSP:P15289 KO:K01130
            HOGENOM:HOG000042725 EMBL:AL123456 PIR:B70643 RefSeq:NP_215225.1
            RefSeq:YP_006514055.1 ProteinModelPortal:P95059 SMR:P95059
            PRIDE:P95059 EnsemblBacteria:EBMYCT00000001675 GeneID:13318600
            GeneID:888394 KEGG:mtu:Rv0711 KEGG:mtv:RVBD_0711 PATRIC:18150088
            TubercuList:Rv0711 OMA:FAGFLEH ProtClustDB:CLSK790691
            Uniprot:P95059
        Length = 787

 Score = 196 (74.1 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
 Identities = 66/223 (29%), Positives = 104/223 (46%)

Query:    54 VASSGPPHIIFILADDLG---WNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
             VA    P+I++++ DD+G   W+  G  GL ++P   +  +A  G+ L  ++T  LC+P+
Sbjct:    31 VAPEHSPNILYLVWDDVGIATWDCFG--GLVEMPA--MTRVAERGVRLSQFHTTALCSPT 86

Query:   111 RSAIMTGKHPIHTGMQ--HNVLYG---CERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
             R++++TG++    GM        G   C  G +P    +LP+ L E GY T  VGKWHL 
Sbjct:    87 RASLLTGRNATTVGMATIEEFTDGFPNCN-GRIPADTALLPEVLAEHGYNTYCVGKWHLT 145

Query:   166 FYK-------KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD---LEPAWD 215
               +       K + PT RGFE   G+  G  D           W  D+  D   + P   
Sbjct:   146 PLEESNMASTKRHWPTSRGFERFYGFLGGETD----------QWYPDLVYDNHPVSPPGT 195

Query:   216 LHGKY--STDVFTAEAVDIIHNHST---DEPLFLYLAHAATHS 253
               G Y  S D+   + ++ I +      D+P F Y+   A H+
Sbjct:   196 PEGGYHLSKDI-ADKTIEFIRDAKVIAPDKPWFSYVCPGAGHA 237

 Score = 73 (30.8 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
 Identities = 14/35 (40%), Positives = 22/35 (62%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             FA  L   D  +G++++ LE+   L N+IIV +SD
Sbjct:   324 FAGFLSYTDAQIGRILDYLEESGQLDNTIIVVISD 358

 Score = 62 (26.9 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query:   342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
             EGG+    +I W   + + G + + YV+VSD  PT+
Sbjct:   425 EGGIADPAIISWPNGIAAHGEIRDNYVNVSDITPTV 460

 Score = 37 (18.1 bits), Expect = 2.4e-13, Sum P(3) = 2.4e-13
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query:   877 VAPINKPFDKGGDPKNFDH 895
             VA   K FD  G P+ ++H
Sbjct:   384 VAESMKLFDHLGGPQTYNH 402


>UNIPROTKB|F1S048 [details] [associations]
            symbol:F1S048 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00560000077076 EMBL:FP104542
            Ensembl:ENSSSCT00000018625 OMA:MAPRDFA Uniprot:F1S048
        Length = 142

 Score = 205 (77.2 bits), Expect = 2.3e-15, P = 2.3e-15
 Identities = 39/72 (54%), Positives = 52/72 (72%)

Query:    54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
             V +S  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS 
Sbjct:    74 VTASSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 132

Query:   114 IMTGKHPIHTGM 125
              +TGK+  H+G+
Sbjct:   133 FITGKY--HSGI 142


>UNIPROTKB|D6RGC1 [details] [associations]
            symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 EMBL:AC104779 HGNC:HGNC:26286
            ChiTaRS:ARSJ IPI:IPI00966139 ProteinModelPortal:D6RGC1 SMR:D6RGC1
            Ensembl:ENST00000509829 HOGENOM:HOG000172533 ArrayExpress:D6RGC1
            Bgee:D6RGC1 Uniprot:D6RGC1
        Length = 133

 Score = 194 (73.4 bits), Expect = 3.4e-14, P = 3.4e-14
 Identities = 36/63 (57%), Positives = 46/63 (73%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
             S+  PH+IFILADD G+ DVG+HG  +I TP +D LA  G+ L+NYY   +CTPSRS  +
Sbjct:    72 STSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 130

Query:   116 TGK 118
             TGK
Sbjct:   131 TGK 133


>UNIPROTKB|D6RDH0 [details] [associations]
            symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484
            HOGENOM:HOG000135354 HGNC:HGNC:32521 EMBL:AC011372 IPI:IPI00967848
            ProteinModelPortal:D6RDH0 SMR:D6RDH0 Ensembl:ENST00000509146
            ArrayExpress:D6RDH0 Bgee:D6RDH0 Uniprot:D6RDH0
        Length = 86

 Score = 191 (72.3 bits), Expect = 7.2e-14, P = 7.2e-14
 Identities = 37/86 (43%), Positives = 53/86 (61%)

Query:   158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
             +VGKWHLGFY+KE  PT RGF++ LG  TG+ DY+ + + +   + G D+      AW L
Sbjct:     1 MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60

Query:   217 HGKYSTDVFTAEAVDIIHNHSTDEPL 242
              G+YST ++   A  I+ +HS   PL
Sbjct:    61 SGQYSTMLYAQRASHILASHSPQRPL 86


>UNIPROTKB|H3BP66 [details] [associations]
            symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
            HGNC:HGNC:4122 ChiTaRS:Galns Ensembl:ENST00000562831 Bgee:H3BP66
            Uniprot:H3BP66
        Length = 170

 Score = 188 (71.2 bits), Expect = 1.5e-13, P = 1.5e-13
 Identities = 53/157 (33%), Positives = 83/157 (52%)

Query:   110 SRSAIMTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKW 162
             +R+A++TG+ PI  G      H    Y  +   GG+P SE++LP+ LK+ GY ++IVGKW
Sbjct:    10 ARAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKW 69

Query:   163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRDLEPAWDLH- 217
             HLG ++ ++ P   GF+   G    H   +D+ A       + W +  R   E   +L  
Sbjct:    70 HLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKT 128

Query:   218 GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
             G+ + T ++  EA+D I   +   P FLY A  ATH+
Sbjct:   129 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA 165


>UNIPROTKB|Q2KEF7 [details] [associations]
            symbol:MGCH7_ch7g1079 "Putative uncharacterized protein"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484 EMBL:CM000230
            ProteinModelPortal:Q2KEF7 SMR:Q2KEF7 Uniprot:Q2KEF7
        Length = 480

 Score = 216 (81.1 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 88/284 (30%), Positives = 133/284 (46%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRSAIMTG 117
             P  + I+ADDLG++D    G  +I TPN+  L    +G +L N++T   C+P+RS + +G
Sbjct:     9 PKFLIIVADDLGYSDTSPFG-GEINTPNLARLVSDGNGRLLTNFHTASACSPTRSMLFSG 67

Query:   118 --KHPIHTG-MQHNV-----LY----GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
                H    G M  N+     LY    G E G L      L +  ++ GY+T + GKWHLG
Sbjct:    68 TDNHIAGLGQMAENMRAHADLYRDKPGYE-GYLNFRVAALSEVFQDAGYQTLMTGKWHLG 126

Query:   166 FYKKEYTPTFRGFE-SHLGYWTG-HQDY-FDHSAEE-----------MKMWGLDMRRDLE 211
                +E +P  RGFE SH+ + +G H  Y F+   E+            K W ++  R L+
Sbjct:   127 L-TRETSPHARGFERSHV-FLSGCHNHYNFEPQLEDPAHGLGDVISQAKFW-MEDDRFLD 183

Query:   212 PAWDLHGK-YSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
                DL    YS+  +  +    +   + +D P F YL   A     P+ PLQAP   +  
Sbjct:   184 RTKDLPKDFYSSTFYGNKMAQYLRERAGSDRPFFAYLPFTA-----PHWPLQAPADLVAK 238

Query:   270 HRHIEDFKRSKFAAI-LHKLDESVGKVVEALEQRRMLSNSIIVF 312
             ++ + D   S   A  L +L E +G V    E   M+   I V+
Sbjct:   239 YKGVYDDGPSALRARRLERLVE-LGIVKAGTEPAPMVGRKIRVW 281

 Score = 38 (18.4 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 18/54 (33%), Positives = 22/54 (40%)

Query:   332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
             P RG K  +  GG+R   ++  P    RG    Q    S   PT  S A   DI
Sbjct:   388 PSRGFKTWITGGGIRCPCIVRYPG-SGRG--QAQSREKSQPTPTTDSFATVMDI 438


>WB|WBGene00006309 [details] [associations]
            symbol:sul-2 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HSSP:P15289 EMBL:FO080993 PIR:T29618 RefSeq:NP_505102.1
            ProteinModelPortal:Q18924 SMR:Q18924 PaxDb:Q18924
            EnsemblMetazoa:D1014.1 GeneID:179194 KEGG:cel:CELE_D1014.1
            UCSC:D1014.1 CTD:179194 WormBase:D1014.1 InParanoid:Q18924
            OMA:HITHHEP NextBio:904322 Uniprot:Q18924
        Length = 452

 Score = 208 (78.3 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
 Identities = 46/128 (35%), Positives = 69/128 (53%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+I+ ++ DDLG+ D+  +G        +D +A  G      Y+   +C+PSR+  +TG+
Sbjct:    33 PNIVILMIDDLGYGDIASYGHPTQEYTQVDRMAAEGTRFTQAYSADSMCSPSRAGFITGR 92

Query:   119 HPIHTGMQ--HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT---- 172
              PI  G+     V    + GGLP SE  + + L+E GY T +VGKWHLG  +   T    
Sbjct:    93 LPIRLGIVGGRRVFVPYDIGGLPKSETTMAEMLQEAGYATGMVGKWHLGINENNATDGAH 152

Query:   173 -PTFRGFE 179
              P+ RGFE
Sbjct:   153 LPSKRGFE 160

 Score = 42 (19.8 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query:   675 PCLFDIKNDPCEKNNLADRSEDQRI 699
             P +FD+  DP E+  L +  + Q I
Sbjct:   353 PLVFDLIRDPYEQYPLQNTVKSQEI 377

 Score = 40 (19.1 bits), Expect = 6.7e-13, Sum P(2) = 6.7e-13
 Identities = 9/25 (36%), Positives = 14/25 (56%)

Query:   781 PCLFDIKNDPCEKNNLADRSEVQRI 805
             P +FD+  DP E+  L +  + Q I
Sbjct:   353 PLVFDLIRDPYEQYPLQNTVKSQEI 377


>FB|FBgn0038660 [details] [associations]
            symbol:CG14291 species:7227 "Drosophila melanogaster"
            [GO:0016250 "N-sulfoglucosamine sulfohydrolase activity"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 EMBL:AE014297 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            HSSP:P15289 KO:K01565 OMA:RDPHETQ GO:GO:0016250 EMBL:AY071569
            RefSeq:NP_650760.1 UniGene:Dm.5859 SMR:Q9VE24 STRING:Q9VE24
            EnsemblMetazoa:FBtr0083724 GeneID:42266 KEGG:dme:Dmel_CG14291
            UCSC:CG14291-RA FlyBase:FBgn0038660 GeneTree:ENSGT00390000013080
            InParanoid:Q9VE24 OrthoDB:EOG49ZW4K GenomeRNAi:42266 NextBio:827964
            Uniprot:Q9VE24
        Length = 524

 Score = 148 (57.2 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 39/115 (33%), Positives = 66/115 (57%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
             S+GP +++ +LADD G+    +  L++   TPN+DALA  G++  N +T V  C+PSRS 
Sbjct:    17 SAGPQNVLLLLADDAGFESGAY--LNKFCQTPNLDALAKRGLLFNNAFTSVSSCSPSRSQ 74

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL-GYR--TRIVGKWHLG 165
             ++TG+    +GM + +  G     +      LP  +++  G R  + I+GK H+G
Sbjct:    75 LLTGQAGHSSGM-YGLHQGVHNFNVLPDTGSLPNLIRDQSGGRILSGIIGKKHVG 128

 Score = 83 (34.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 38/149 (25%), Positives = 67/149 (44%)

Query:   275 DFKRSKFAA---ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNW 331
             D  R + AA    + +LD+ VG +++ LE   +   +++++ SD             +  
Sbjct:   233 DVVRQELAAQYMTISRLDQGVGLMLKELEAAGVADQTLVIYTSD-------------NGP 279

Query:   332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLLSAANKSDIPNYVN 390
             P  G +  L+E G+R   +I SP  E R   A    V + D  P+++ A  +   PN   
Sbjct:   280 PFPGGRTNLYEHGIRSPLIISSPNKEDRHHEATAAMVSLLDIYPSVMDAL-QIPRPNDTK 338

Query:   391 STVENIIP--RYENSILRYEN--GTHEYN 415
                 +I+P  R E  I   ++  G+H Y+
Sbjct:   339 IVGRSILPVLREEPPIKESDSVFGSHSYH 367

 Score = 62 (26.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query:   677 LFDIKNDPCEKNNLADRSE 695
             L+DIK DP E+ NLAD+++
Sbjct:   437 LYDIKTDPLERFNLADKAK 455

 Score = 62 (26.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query:   783 LFDIKNDPCEKNNLADRSE 801
             L+DIK DP E+ NLAD+++
Sbjct:   437 LYDIKTDPLERFNLADKAK 455


>UNIPROTKB|P31447 [details] [associations]
            symbol:yidJ "putative sulfatase" species:83333 "Escherichia
            coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
            GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
            EMBL:L10328 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            OMA:RKDWENT KO:K01138 PIR:G65169 RefSeq:NP_418134.1
            RefSeq:YP_491756.1 ProteinModelPortal:P31447 SMR:P31447
            PRIDE:P31447 EnsemblBacteria:EBESCT00000001975
            EnsemblBacteria:EBESCT00000016174 GeneID:12932459 GeneID:948188
            KEGG:ecj:Y75_p3496 KEGG:eco:b3678 PATRIC:32122847 EchoBASE:EB1656
            EcoGene:EG11705 HOGENOM:HOG000126316 ProtClustDB:CLSK880765
            BioCyc:EcoCyc:EG11705-MONOMER BioCyc:ECOL316407:JW3654-MONOMER
            Genevestigator:P31447 Uniprot:P31447
        Length = 497

 Score = 190 (71.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 62/218 (28%), Positives = 96/218 (44%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+ +F++ D    N VG +    + T NID+LA  GI   + YT   +CTP+R+ + TG 
Sbjct:     4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR-G 177
             +   +G   N +      G  +S   + +Y K+ GY T  +GKWHL  +  +Y  T    
Sbjct:    64 YANQSGPWTNNV----APGKNIST--MGRYFKDAGYHTCYIGKWHLDGH--DYFGTGECP 115

Query:   178 FESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDLHGKYSTDVFT------AEAV 230
              E    YW    +Y     E E+ +W    R  L    DL   +  + FT        AV
Sbjct:   116 PEWDADYWFDGANYLSELTEKEISLW----RNGLNSVEDLQANHIDETFTWAHRISNRAV 171

Query:   231 DIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
             D +   +  DEP  + +++       P+ P   P  YL
Sbjct:   172 DFLQQPARADEPFLMVVSYD-----EPHHPFTCPVEYL 204

 Score = 49 (22.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query:   288 LDESVGKVVEAL--EQRRMLSNSIIVFVSD 315
             +D+ +G+V+ AL  EQR    N+ +++ SD
Sbjct:   258 VDDQIGRVINALTPEQRE---NTWVIYTSD 284

 Score = 49 (22.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query:   677 LFDIKNDPCEKNNLAD--RSEDQRINHYTTEVGRFNQIAYP 715
             L+D +NDP E +NL D  R  D R   +   +   ++I  P
Sbjct:   401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441

 Score = 48 (22.0 bits), Expect = 2.3e-12, Sum P(3) = 2.3e-12
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query:   783 LFDIKNDPCEKNNLAD 798
             L+D +NDP E +NL D
Sbjct:   401 LYDRRNDPNEMHNLID 416


>ZFIN|ZDB-GENE-050107-5 [details] [associations]
            symbol:gnsa "glucosamine (N-acetyl)-6-sulfatase a"
            species:7955 "Danio rerio" [GO:0030203 "glycosaminoglycan metabolic
            process" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 ZFIN:ZDB-GENE-050107-5 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0030203
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:BC097128 IPI:IPI00499007
            RefSeq:NP_001025379.1 UniGene:Dr.84802 ProteinModelPortal:Q4V902
            STRING:Q4V902 GeneID:566506 KEGG:dre:566506 CTD:566506
            InParanoid:Q4V902 NextBio:20888220 ArrayExpress:Q4V902
            Uniprot:Q4V902
        Length = 538

 Score = 175 (66.7 bits), Expect = 2.6e-11, Sum P(2) = 2.6e-11
 Identities = 71/253 (28%), Positives = 111/253 (43%)

Query:    34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-A 92
             ++   ++ +  TL  V +    ++  P+I+ IL DDL   DV   G+  IP      L  
Sbjct:     7 VLLHCIIVICVTLHCVNLAAAKTNPKPNIVLILTDDL---DVSIGGM--IPLVKTKKLIG 61

Query:    93 YSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEK--ILPQY 148
              +GI   N +    LC PSR++I+TGK+P +  + +N L G C        ++    P +
Sbjct:    62 DAGITFTNAFVASPLCCPSRASILTGKYPHNHHVVNNTLEGNCSSTAWQKGQEPDAFPAF 121

Query:   149 L-KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
             L K   Y+T   GK     Y  EY     G   H+     H    + +++    + L + 
Sbjct:   122 LQKHAAYQTFFAGK-----YLNEYGSKKAGGVEHVPLGWDHWFALERNSKYYN-YTLSVN 175

Query:   208 -RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS---ANP-YE---P 259
              R      +    Y TDV    ++D + N S   P F+ ++  A HS   A P Y+   P
Sbjct:   176 GRAQRHGQNYSEDYLTDVLANVSIDFLENKSNRRPFFMMVSTPAPHSPWTAAPQYDSSFP 235

Query:   260 -LQAP-DHYLNIH 270
              L+AP D   NIH
Sbjct:   236 DLKAPRDPNFNIH 248

 Score = 63 (27.2 bits), Expect = 2.6e-11, Sum P(2) = 2.6e-11
 Identities = 14/43 (32%), Positives = 27/43 (62%)

Query:   273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             +++  R ++  +L  +D+ V K+V  L+ R  LSN+ ++F SD
Sbjct:   271 LDNAYRKRWRTLL-SVDDLVEKLVRKLDIRGELSNTYVIFTSD 312


>TIGR_CMR|CPS_0841 [details] [associations]
            symbol:CPS_0841 "arylsulfatase" species:167879 "Colwellia
            psychrerythraea 34H" [GO:0004065 "arylsulfatase activity"
            evidence=ISS] [GO:0006790 "sulfur compound metabolic process"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
            RefSeq:YP_267590.1 ProteinModelPortal:Q488C5 STRING:Q488C5
            GeneID:3522242 KEGG:cps:CPS_0841 PATRIC:21464977 OMA:SSRIMEV
            BioCyc:CPSY167879:GI48-927-MONOMER Uniprot:Q488C5
        Length = 584

 Score = 170 (64.9 bits), Expect = 4.6e-11, Sum P(2) = 4.6e-11
 Identities = 49/178 (27%), Positives = 89/178 (50%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
             A +  P+I+ ++ADD  + D+G +G  ++ TPN++ +A +GI   N++   +C+ +RS +
Sbjct:    32 ADAKKPNILLLVADDTAFGDIGAYG-SEVHTPNMNEIANAGIRFTNFHVSPVCSVTRSML 90

Query:   115 MTGKHPIHTGM---QHNVLYGCERG-----GLPLSEKI-LPQYLKELGYRTRIVGKWHLG 165
              TG   I  G+    ++V Y   RG     G    + + + + L + GY     GKWHLG
Sbjct:    91 FTGNDNIEVGLGSFDYSV-YPATRGKKGYEGYLTKDAVTISELLNDDGYEVYKSGKWHLG 149

Query:   166 FYKKEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDLEPAWDLHGK 219
               +     P   GF    G  +G  ++++  A         GL+++R  +  W L+G+
Sbjct:   150 GEESGGKGPLEWGFTKEFGILSGGSNHWNDLAMTPNFKDPNGLNVKR--KENWTLNGE 205

 Score = 67 (28.6 bits), Expect = 4.6e-11, Sum P(2) = 4.6e-11
 Identities = 16/71 (22%), Positives = 38/71 (53%)

Query:   246 LAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILHKLDESVGKVVEALEQRRM 304
             ++H AT +  P+  L      L+     +  K  + +AA++   D  +G++++ L +   
Sbjct:   286 ISHEATEA--PFNNLTKKWQDLSQENKEKQAKIMATYAAMIEDQDNRIGQILDYLRESGQ 343

Query:   305 LSNSIIVFVSD 315
             L N+++V+++D
Sbjct:   344 LDNTLVVYMTD 354


>UNIPROTKB|F1NGI6 [details] [associations]
            symbol:SGSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AADN02053526
            IPI:IPI00570654 Ensembl:ENSGALT00000011369 OMA:CYNPAVS
            Uniprot:F1NGI6
        Length = 119

 Score = 162 (62.1 bits), Expect = 9.1e-11, P = 9.1e-11
 Identities = 42/115 (36%), Positives = 66/115 (57%)

Query:    58 GPP--HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
             G P  +++ +LADD G+   G +    I TPN+DALA  G++ +N +T V  C+PSR+++
Sbjct:     5 GAPARNVLLLLADDGGFES-GAYNNSAIRTPNLDALARRGLLFQNAFTSVSSCSPSRASV 63

Query:   115 MTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYLKELGYRTRIVGKWHLG 165
             +TG  P H     N +YG  +G    +     + LP  L++   RT I+GK H+G
Sbjct:    64 LTGL-PQH----QNGMYGLHQGVHHFNSFDAVRSLPGLLRQANIRTGIIGKKHVG 113


>UNIPROTKB|O65931 [details] [associations]
            symbol:atsB "Arylsulfatase" species:83332 "Mycobacterium
            tuberculosis H37Rv" [GO:0005829 "cytosol" evidence=IDA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005829 GenomeReviews:AL123456_GR EMBL:BX842582
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 HSSP:P15289 KO:K01130
            EMBL:CP003248 PIR:E70533 RefSeq:NP_217816.1 RefSeq:YP_006516776.1
            ProteinModelPortal:O65931 PRIDE:O65931
            EnsemblBacteria:EBMYCT00000000058 GeneID:13318122 GeneID:887500
            KEGG:mtu:Rv3299c KEGG:mtv:RVBD_3299c PATRIC:18155953
            TubercuList:Rv3299c HOGENOM:HOG000042725 OMA:EIMGSRA
            ProtClustDB:CLSK792415 InterPro:IPR009200 Pfam:PF06897
            Uniprot:O65931
        Length = 970

 Score = 167 (63.8 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 61/215 (28%), Positives = 100/215 (46%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
             P+++ +L DD G+      G   I TP +  LA +G+I   ++   +C+P+R+A++TG++
Sbjct:   212 PNVLIVLIDDAGFGGPDTFG-GAIRTPTLSRLAQNGLIYNRFHVTAVCSPTRAALLTGRN 270

Query:   120 PIHTGMQHNVLYG--CERGG--------LPLSEKILPQYLKELGYRTRIVGKWHL----- 164
               H    H V +G  CE  G         P S   LP+ L++ GY T   GKWHL     
Sbjct:   271 --H----HRVGFGSVCEFPGPYPGYSAVRPRSCAALPRILRDNGYVTGAFGKWHLTPDNV 324

Query:   165 -GFYKK-EYTPTFRGFESHLGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYS 221
              G     +  P   GF+   G+ +G    +D   +++  + G+      E   D    Y 
Sbjct:   325 QGAAGPFDNWPLGWGFDHFWGFPSGAAGQYDPIISQDNSVIGIPEGSG-E---DGRPYYF 380

Query:   222 TDVFTAEAVDIIHN---HSTDEPLFLYLAHAATHS 253
              D  T +A++ +H     +  +P  LY A  ATH+
Sbjct:   381 PDDLTDKAIEWLHTVRAQNATKPWMLYYATGATHA 415

 Score = 72 (30.4 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 41/185 (22%), Positives = 73/185 (39%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR-GVKNTLWEG 343
             L+ LD    + +E +EQ       I     +             SN PL+ G +     G
Sbjct:   540 LNGLDLDAERQLELIEQY----GGIAALGDEFTAPHFASAWAHASNTPLQWGKQMASHLG 595

Query:   344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
             G R   ++ W   +   G V  Q+ H  D  PT+L+A    + P +V+   +   P    
Sbjct:   596 GTRDPLVVAWPARIRPDGRVRSQFTHCIDIAPTVLAAIGLPE-PTHVDGFEQE--PMDGT 652

Query:   403 SILR-YENGTHE--YNSPRIENSNTR--YENGTHEYNPKYENRYENGTHEYNPKYENRYE 457
             S +R +++   E  +     EN  +R  Y++G          R +    + +P+   R+ 
Sbjct:   653 SFVRTFDDAEAEDRHTVQYFENFGSRAIYKDGWWACA-----RLDKAPWDLSPETMRRFA 707

Query:   458 NGTHE 462
              GT++
Sbjct:   708 PGTYD 712

 Score = 47 (21.6 bits), Expect = 4.6e-08, Sum P(2) = 4.6e-08
 Identities = 8/33 (24%), Positives = 19/33 (57%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
             FA      D +VG++++A+E      N+++ ++
Sbjct:   486 FAGFSENADWNVGRLLDAIEDLGESDNTLVFYI 518


>UNIPROTKB|P51688 [details] [associations]
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0016250 "N-sulfoglucosamine sulfohydrolase
            activity" evidence=IEA] [GO:0006029 "proteoglycan metabolic
            process" evidence=TAS] [GO:0003824 "catalytic activity"
            evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0006027 "glycosaminoglycan catabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Reactome:REACT_116125 GO:GO:0003824
            GO:GO:0044281 GO:GO:0046872 GO:GO:0005975 GO:GO:0043202
            GO:GO:0006027 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GO:GO:0006029 EMBL:U30894 EMBL:U60111 EMBL:U60107 EMBL:U60108
            EMBL:U60109 EMBL:U60110 EMBL:AK291257 EMBL:BC047318 IPI:IPI00019988
            RefSeq:NP_000190.1 UniGene:Hs.31074 ProteinModelPortal:P51688
            SMR:P51688 IntAct:P51688 STRING:P51688 PhosphoSite:P51688
            DMDM:1711493 PaxDb:P51688 PRIDE:P51688 Ensembl:ENST00000326317
            GeneID:6448 KEGG:hsa:6448 UCSC:uc002jxz.4 CTD:6448
            GeneCards:GC17M078183 HGNC:HGNC:10818 HPA:HPA023436 HPA:HPA023451
            MIM:252900 MIM:605270 neXtProt:NX_P51688 Orphanet:79269
            PharmGKB:PA35726 HOGENOM:HOG000234731 HOVERGEN:HBG012598
            InParanoid:P51688 KO:K01565 OMA:RDPHETQ OrthoDB:EOG4RXZ01
            PhylomeDB:P51688 ChiTaRS:SGSH GenomeRNAi:6448 NextBio:25061
            ArrayExpress:P51688 Bgee:P51688 CleanEx:HS_SGSH
            Genevestigator:P51688 GermOnline:ENSG00000181523 GO:GO:0016250
            Uniprot:P51688
        Length = 502

 Score = 154 (59.3 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
 Identities = 42/130 (32%), Positives = 72/130 (55%)

Query:    41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             P+    +++ V  +  + P + + +LADD G+   G +    I TP++DALA   ++ +N
Sbjct:     4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62

Query:   101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYR 155
              +T V  C+PSR++++TG  P H     N +YG  +     +  +K+  LP  L + G R
Sbjct:    63 AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117

Query:   156 TRIVGKWHLG 165
             T I+GK H+G
Sbjct:   118 TGIIGKKHVG 127

 Score = 77 (32.2 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
 Identities = 27/92 (29%), Positives = 44/92 (47%)

Query:   287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
             ++D+ VG V++ L    +L++++++F SD               +P  G  N  W G   
Sbjct:   245 RMDQGVGLVLQELRDAGVLNDTLVIFTSDNGIP-----------FP-SGRTNLYWPGTAE 292

Query:   347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
                L+ SP    R G V+E YV + D  PT+L
Sbjct:   293 PL-LVSSPEHPKRWGQVSEAYVSLLDLTPTIL 323

 Score = 41 (19.5 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query:   239 DEPLFLYLAHAATHSANPYEP 259
             D P FLY+A    H     +P
Sbjct:   168 DRPFFLYVAFHDPHRCGHSQP 188

 Score = 38 (18.4 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
 Identities = 8/15 (53%), Positives = 9/15 (60%)

Query:   677 LFDIKNDPCEKNNLA 691
             L+D   DP E  NLA
Sbjct:   438 LYDRSRDPHETQNLA 452

 Score = 38 (18.4 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
 Identities = 8/15 (53%), Positives = 9/15 (60%)

Query:   783 LFDIKNDPCEKNNLA 797
             L+D   DP E  NLA
Sbjct:   438 LYDRSRDPHETQNLA 452


>ASPGD|ASPL0000046382 [details] [associations]
            symbol:AN11149 species:162425 "Emericella nidulans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012083
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF000972 GO:GO:0018958 EMBL:BN001307 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            HOGENOM:HOG000169239 ProteinModelPortal:C8VLL2
            EnsemblFungi:CADANIAT00007963 OMA:TENDPAN Uniprot:C8VLL2
        Length = 565

 Score = 143 (55.4 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 64/249 (25%), Positives = 109/249 (43%)

Query:    40 LPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
             + L+  +++V V  ++ + P   P+ +F+  DD    D+  + ++ +P      +   G+
Sbjct:     1 MKLSSLVALVGVSALSEASPRPKPNFVFVFTDD---QDLTMNSVEYMPHV-AGRIRDRGL 56

Query:    97 ILKNYY-TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-GGLP------LSEKILPQY 148
                N++ T  LC PSR ++ TG+   +T    NV +     GG P       +E   P +
Sbjct:    57 DFTNHFVTTALCCPSRVSLWTGRQAHNT----NVTWVAPPYGGYPKFVSQGFNEDWFPLW 112

Query:   149 LKELGYRTRIVGKWHLGFYKKEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
             L++ GY T  VGK         Y  P  +GF     +     D F +S      W    +
Sbjct:   113 LQDAGYNTYYVGKLFNAHSVTTYNNPFVKGFNGS-DFLL---DPFTYS-----YWNSSYQ 163

Query:   208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE--PLFLYLAHAATH-------SAN--P 256
             R+ E      G+Y+TDV   +A+  + +   D+  P FL +A  A H       S++  P
Sbjct:   164 RNHEAPKSYAGQYTTDVTEEKALGFVDDALEDKERPFFLTVAPIAPHFEQDPGHSSDTPP 223

Query:   257 YEPLQAPDH 265
               P+ AP H
Sbjct:   224 QAPIPAPRH 232

 Score = 88 (36.0 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 51/188 (27%), Positives = 83/188 (44%)

Query:   274 EDF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
             EDF  R +  A L  +DE V K+++ LE+   L+N+ +++ SD              +  
Sbjct:   272 EDFFYRQRLRA-LQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHI--------GHHR 322

Query:   333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT---LLSAANKSDIPNYV 389
             L   K+T +E  +R    I  P ++S G V +   H+ D+ PT   LL    +SD     
Sbjct:   323 LPPGKSTSYEEDIRVPFFIRGPGIKSGGKVTQVTTHI-DFAPTIFELLGLPPRSDFDGTP 381

Query:   390 NSTVEN--IIPRYENSILRYENG-----THEYNSPRIENSNTRYENGTHEYNPKYENRYE 442
                +++   IP +E+ I+ Y        T   N+ R+ N  T Y++     + KY   Y 
Sbjct:   382 MRIMKDSAAIP-HEHVIVEYWGQALMMVTAPTNTDRMPN--TTYKS-VRLLSEKYNLFYA 437

Query:   443 ---NGTHE 447
                 G HE
Sbjct:   438 VWCTGDHE 445

 Score = 43 (20.2 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query:   677 LFDIKNDPCEKNNL---ADRSEDQRIN 700
             LFD+  DP + +N+   A RS   R++
Sbjct:   446 LFDLNTDPYQMHNIYNTASRSFKNRLD 472

 Score = 42 (19.8 bits), Expect = 2.9e-10, Sum P(3) = 2.9e-10
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query:   783 LFDIKNDPCEKNNL---ADRSEVQRIN 806
             LFD+  DP + +N+   A RS   R++
Sbjct:   446 LFDLNTDPYQMHNIYNTASRSFKNRLD 472


>UNIPROTKB|F1NFI0 [details] [associations]
            symbol:IDS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 EMBL:AADN02013672 EMBL:AADN02013673
            EMBL:AADN02013674 EMBL:AADN02013675 IPI:IPI00579251
            Ensembl:ENSGALT00000014910 OMA:SELDYAY Uniprot:F1NFI0
        Length = 525

 Score = 162 (62.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 61/216 (28%), Positives = 98/216 (45%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
             +++FI+ DDL    +G +G + + +PNID LA   I+  N Y  Q +C PSR + +TG+ 
Sbjct:     3 NVLFIVVDDLR-PVLGCYGDNLVKSPNIDQLASQSIVFSNAYAQQAVCAPSRVSFLTGRR 61

Query:   120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLGF---YKKEYTPTF 175
             P  T +     Y     G   +   +PQY KE GY T  VGK +H G    Y  +Y  ++
Sbjct:    62 PDTTRLYDFYSYWRVHSG---NYSTMPQYFKENGYVTMSVGKVFHPGISSNYSDDYPYSW 118

Query:   176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE-AVDIIH 234
                  H        D      +      L    D+    ++ G    D+ T E A+ +++
Sbjct:   119 SIPPFHPSTEKYENDKTCRGKDGRLYANLVCPIDVT---EMPGGTLPDIETTEEAIRLLN 175

Query:   235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
                T +  F +LA    H   P+ PL+ P  +L ++
Sbjct:   176 VMKTKKQKF-FLA-VGYHK--PHIPLRYPQEFLKLY 207

 Score = 67 (28.6 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 20/60 (33%), Positives = 35/60 (58%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY PL  PD +  + R      +S +AA+ + LD  VG ++ AL+   + +++I+VF +D
Sbjct:   249 PYGPL--PDDFQRLIR------QSYYAAVSY-LDMQVGLLLNALDYVGLSNSTIVVFTAD 299


>TIGR_CMR|CPS_2367 [details] [associations]
            symbol:CPS_2367 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            RefSeq:YP_269085.1 ProteinModelPortal:Q482D3 STRING:Q482D3
            GeneID:3522074 KEGG:cps:CPS_2367 PATRIC:21467819
            HOGENOM:HOG000220675 OMA:TAGVCAP ProtClustDB:CLSK2525596
            BioCyc:CPSY167879:GI48-2430-MONOMER Uniprot:Q482D3
        Length = 558

 Score = 142 (55.0 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
 Identities = 45/156 (28%), Positives = 69/156 (44%)

Query:    42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
             L   L++  V   A    P+I+ I+A+D+    VG  G     TP +D LA S +   N 
Sbjct:    11 LCLALALSSVTSFAKEQRPNILLIVAEDMSAK-VGAFGDTVAKTPVLDELAKSSVRYPNT 69

Query:   102 YTVQ-LCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSE-KILPQYLKELGYRT 156
             +T   +C PSR++++TG H I  G QH             +P  + K  P+ L++ GY T
Sbjct:    70 FTTAGVCAPSRTSLITGVHQITVGGQHMRTRSFKASNYRAVPAPDVKAFPELLRKSGYYT 129

Query:   157 RIVGKWHLGFYKKE-YTPTFR--GFESHLGYWTGHQ 189
              +  K    F     +T  F    +E     W G +
Sbjct:   130 YVSSKLDYQFSNTSPHTGPFTIWNYEGKKPTWRGRE 165

 Score = 67 (28.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
 Identities = 37/163 (22%), Positives = 68/163 (41%)

Query:   285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGG 344
             +H +D  VGK++  L++  +  N+I+++ +D              + P RG K  +++ G
Sbjct:   230 IHAMDTQVGKLLAELKKDGLSDNTIVIWTTDHG-----------DSLP-RG-KREVYDSG 276

Query:   345 VRGAGLI-WS----PLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
             ++   +I W     P     G +  Q +   D  P++L+ AN  + P Y+       IP 
Sbjct:   277 LKVPMIIHWPDKYRPSKTVNGSIDSQLLSFVDIAPSILAMAN-INTPAYIQGKAR--IPN 333

Query:   400 YENSILRYENGTHEYNSP-RIENSNTRYENGTHEYNPKYENRY 441
               N+  +     + Y S  R++    R E        KY   Y
Sbjct:   334 -NNATNKIAKREYIYASKDRLDEFPFR-ERAVRNNKFKYIKNY 374

 Score = 64 (27.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
 Identities = 13/22 (59%), Positives = 17/22 (77%)

Query:   783 LFDIKNDPCEKNNLADRSEVQR 804
             L+DI NDP E NNLA++ E Q+
Sbjct:   421 LYDIINDPEEVNNLAEKVEYQQ 442

 Score = 64 (27.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query:   677 LFDIKNDPCEKNNLADRSE-DQRIN 700
             L+DI NDP E NNLA++ E  Q++N
Sbjct:   421 LYDIINDPEEVNNLAEKVEYQQQLN 445

 Score = 39 (18.8 bits), Expect = 9.0e-08, Sum P(3) = 9.0e-08
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query:   499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 539
             NR  +  Y    D  +V ++  + E  ++  I+ N   EWQ
Sbjct:   415 NRPGEELYDIINDPEEVNNLAEKVEYQQQLNIMRNALKEWQ 455

 Score = 39 (18.8 bits), Expect = 9.0e-08, Sum P(3) = 9.0e-08
 Identities = 11/41 (26%), Positives = 19/41 (46%)

Query:   564 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
             NR  +  Y    D  +V ++  + E  ++  I+ N   EWQ
Sbjct:   415 NRPGEELYDIINDPEEVNNLAEKVEYQQQLNIMRNALKEWQ 455


>UNIPROTKB|F6PP52 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001502 "cartilage condensation" evidence=ISS]
            [GO:0001822 "kidney development" evidence=ISS] [GO:0001937
            "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
            GO:GO:0005615 GO:GO:0009986 GO:GO:0048661 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
            GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK Ensembl:ENSECAT00000019009 Uniprot:F6PP52
        Length = 1129

 Score = 161 (61.7 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 69 (29.3 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
 Identities = 28/131 (21%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L + R L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETRELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 52 (23.4 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
 Identities = 13/52 (25%), Positives = 25/52 (48%)

Query:   448 YNPKYENRYENGTHEYNGPKNEN-TNPRYENGTHEYNIPRLENSINGNGTSE 498
             ++P+ +  + N   E +   +E+  N + E       + RLE   +GNG +E
Sbjct:   986 FSPESKLEWNNNIPEVSRLNSEHWRNHKTEKWMEHEELNRLETDFSGNGMTE 1037

 Score = 52 (23.4 bits), Expect = 8.9e-08, Sum P(2) = 8.9e-08
 Identities = 19/87 (21%), Positives = 35/87 (40%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
             +G  +  P++   Y N +    P Y    N  ++   +Y GP     +  + N  H   +
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283

Query:   485 PRL---ENSING--NGTSENRSNDNSY 506
               L   ++S+    N   E R  +N+Y
Sbjct:   284 QTLMSVDDSVERLYNMLVETRELENTY 310


>UNIPROTKB|G3WVX3 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9305
            "Sarcophilus harrisii" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 EMBL:AEFK01056197
            EMBL:AEFK01056198 EMBL:AEFK01056199 EMBL:AEFK01056200
            Ensembl:ENSSHAT00000019735 Uniprot:G3WVX3
        Length = 870

 Score = 166 (63.5 bits), Expect = 6.2e-10, Sum P(2) = 6.2e-10
 Identities = 63/223 (28%), Positives = 97/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q  D Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSDLYPNASQHI 245

 Score = 65 (27.9 bits), Expect = 6.2e-10, Sum P(2) = 6.2e-10
 Identities = 27/131 (20%), Positives = 53/131 (40%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV++  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVSQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 48 (22.0 bits), Expect = 3.5e-08, Sum P(2) = 3.5e-08
 Identities = 10/42 (23%), Positives = 19/42 (45%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++ + Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSDLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>ASPGD|ASPL0000029545 [details] [associations]
            symbol:AN5449 species:162425 "Emericella nidulans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:BN001305 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0008484 EMBL:AACD01000094 RefSeq:XP_663053.1
            ProteinModelPortal:Q5B1Y1 EnsemblFungi:CADANIAT00003640
            GeneID:2871741 KEGG:ani:AN5449.2 HOGENOM:HOG000217625 KO:K01133
            OMA:YIMADQM OrthoDB:EOG45F0XM InterPro:IPR017785 InterPro:IPR025863
            Pfam:PF12411 TIGRFAMs:TIGR03417 Uniprot:Q5B1Y1
        Length = 594

 Score = 161 (61.7 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
 Identities = 65/254 (25%), Positives = 111/254 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTG 117
             P+I++I+AD +    + FH  D  I TPN++ LA  G++  + Y    LC PSR  ++TG
Sbjct:     6 PNILYIMADQMAAPLLAFHDKDSPIKTPNLNKLAEEGVVFDSAYCNSPLCAPSRFVMVTG 65

Query:   118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
             + P   G   N         LP        YL+  GY T + GK H  F   +      G
Sbjct:    66 QLPSKIGAYDNA------ADLPADIPTYAHYLRREGYHTALAGKMH--FCGPDQ---LHG 114

Query:   178 FESHLGYWTGHQDY-FDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV--FTAEAV---- 230
             +E  L       DY +  + +E  +  LD   ++    D      T+   F  E +    
Sbjct:   115 YEQRLTSDIYPGDYGWSVNWDEPDV-RLDYYHNMSSVMDAGPVVRTNQLDFDEEVIYKSK 173

Query:   231 DIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH- 286
               +++H    TD+P  L ++   TH   P++P      + +++  +E     K AAI H 
Sbjct:   174 QYLYDHVRQRTDQPFCLTVS--MTH---PHDPYAMTKEFWDLYEDVE-IPLPKHAAIPHD 227

Query:   287 KLDESVGKVVEALE 300
             + D    ++++ ++
Sbjct:   228 QQDPHSQRILKCID 241

 Score = 61 (26.5 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query:   675 PCLFDIKNDPCEKNNL 690
             P LFD++NDP EK NL
Sbjct:   409 PMLFDVQNDPLEKVNL 424

 Score = 61 (26.5 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query:   781 PCLFDIKNDPCEKNNL 796
             P LFD++NDP EK NL
Sbjct:   409 PMLFDVQNDPLEKVNL 424

 Score = 47 (21.6 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
 Identities = 13/45 (28%), Positives = 22/45 (48%)

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT---RGKWKLV 551
             +DG+ +   L+  +  K +T+L     E   S +    RG+WK V
Sbjct:   358 LDGVSLVPYLTGEDGVKTDTVLGEYMGEGTQSPVVMIRRGRWKFV 402


>UNIPROTKB|F1RZ89 [details] [associations]
            symbol:LOC100737146 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:CU914710
            Ensembl:ENSSSCT00000018673 ArrayExpress:F1RZ89 Uniprot:F1RZ89
        Length = 496

 Score = 150 (57.9 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
 Identities = 44/131 (33%), Positives = 70/131 (53%)

Query:    41 PLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
             P+ + L +      A  G   +++ ILADD G+   G +    I TP++DALA   I+ +
Sbjct:     9 PVGWVLLLALGLCCAQGGRRRNVLLILADDGGFES-GAYNNSAITTPHLDALARRSIVFR 67

Query:   100 NYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGY 154
             N +T V  C+PSR++++TG  P H     N +YG  +     +  +++  LP  L   G 
Sbjct:    68 NAFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGV 122

Query:   155 RTRIVGKWHLG 165
             RT I+GK H+G
Sbjct:   123 RTGIIGKKHVG 133

 Score = 75 (31.5 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
 Identities = 26/92 (28%), Positives = 44/92 (47%)

Query:   287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
             ++D+ +G V++ L    +L++++++F SD               +P  G  N  W G   
Sbjct:   251 RMDQGIGLVLQELRGAGVLNDTLVIFTSDNGVP-----------FP-SGRTNLYWPGAAE 298

Query:   347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
                L+ SP    R G V+E YV + D  PT+L
Sbjct:   299 PL-LVSSPEHPQRWGQVSEAYVSLLDLTPTVL 329

 Score = 41 (19.5 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query:   239 DEPLFLYLAHAATHSANPYEP 259
             D P FLY+A    H     +P
Sbjct:   174 DRPFFLYVAFHDPHRCGHSQP 194


>MGI|MGI:96417 [details] [associations]
            symbol:Ids "iduronate 2-sulfatase" species:10090 "Mus
            musculus" [GO:0003824 "catalytic activity" evidence=IEA]
            [GO:0004423 "iduronate-2-sulfatase activity" evidence=IEA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
            ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 MGI:MGI:96417 GO:GO:0046872
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            CTD:3423 HOVERGEN:HBG006120 KO:K01136 OrthoDB:EOG49078W ChiTaRS:IDS
            GO:GO:0004423 EMBL:AK166178 EMBL:BX294168 EMBL:L07921 EMBL:BN000750
            IPI:IPI00125815 PIR:A47153 RefSeq:NP_034628.2 UniGene:Mm.233083
            ProteinModelPortal:Q08890 SMR:Q08890 STRING:Q08890
            PhosphoSite:Q08890 PRIDE:Q08890 DNASU:15931
            Ensembl:ENSMUST00000101509 GeneID:15931 KEGG:mmu:15931
            GeneTree:ENSGT00640000091539 InParanoid:Q32KI7 NextBio:288652
            Bgee:Q08890 CleanEx:MM_IDS Genevestigator:Q08890
            GermOnline:ENSMUSG00000035847 Uniprot:Q08890
        Length = 552

 Score = 144 (55.7 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
 Identities = 38/108 (35%), Positives = 57/108 (52%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
             +I+ I+ DDL    +G +G   + +PNID LA   ++ +N +  Q +C PSR + +TG+ 
Sbjct:    40 NILLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSVLFQNAFAQQAVCAPSRVSFLTGRR 98

Query:   120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
             P  T +   N  +    G        +PQY KE GY T  VGK +H G
Sbjct:    99 PDTTRLYDFNSYWRVHSGNF----STIPQYFKENGYVTMSVGKVFHPG 142

 Score = 82 (33.9 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
 Identities = 21/46 (45%), Positives = 29/46 (63%)

Query:   274 EDFKR----SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             EDF+R    S FA++ + LD  VG V+ AL+  R+  N+II F SD
Sbjct:   292 EDFQRKIRQSYFASVSY-LDTQVGHVLSALDDLRLAHNTIIAFTSD 336


>UNIPROTKB|F6PNP7 [details] [associations]
            symbol:IDS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 HOGENOM:HOG000014304 HOVERGEN:HBG006120 OMA:CREGKNL
            OrthoDB:EOG49078W GeneTree:ENSGT00640000091539 EMBL:AAEX03027034
            Ensembl:ENSCAFT00000030323 Uniprot:F6PNP7
        Length = 468

 Score = 150 (57.9 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
 Identities = 39/113 (34%), Positives = 60/113 (53%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
             ++ P +++ I+ DDL    +G +G   + +PNID LA   ++ +N +  Q +C PSR + 
Sbjct:    31 TTAPLNVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSF 89

Query:   115 MTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
             +TG+ P  T +   N  +    G        LPQY KE GY T  VGK +H G
Sbjct:    90 LTGRRPDTTRLYDFNSYWRVHAGNF----STLPQYFKENGYVTMSVGKVFHPG 138

 Score = 73 (30.8 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
 Identities = 22/60 (36%), Positives = 37/60 (61%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY P+  P   ++  R I   ++S FA+I + LD  VG ++ AL+  ++ +++IIVF SD
Sbjct:   282 PYGPI--P---VDFQRKI---RQSYFASISY-LDTQVGHLLSALDDLQLANSTIIVFASD 332


>UNIPROTKB|Q48QH2 [details] [associations]
            symbol:betC "Choline sulfatase" species:264730 "Pseudomonas
            syringae pv. phaseolicola 1448A" [GO:0006790 "sulfur compound
            metabolic process" evidence=ISS] [GO:0030104 "water homeostasis"
            evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000058
            GenomeReviews:CP000058_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030104 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149
            GO:GO:0006790 HOGENOM:HOG000217625 KO:K01133 InterPro:IPR017785
            InterPro:IPR025863 Pfam:PF12411 TIGRFAMs:TIGR03417
            RefSeq:YP_272344.1 ProteinModelPortal:Q48QH2 STRING:Q48QH2
            GeneID:3556452 KEGG:psp:PSPPH_0030 PATRIC:19969019 OMA:MIRRGAY
            ProtClustDB:CLSK864791 GO:GO:0047753 Uniprot:Q48QH2
        Length = 501

 Score = 133 (51.9 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
 Identities = 32/104 (30%), Positives = 49/104 (47%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGKH 119
             +I+FI+AD +    + F+    I  PN+  LA  G++  + Y    LC PSR  +++G+ 
Sbjct:     5 NILFIMADQMAAPMLPFYSRSPILMPNLSRLAADGVVFDSAYCNSPLCAPSRFTLVSGQL 64

Query:   120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
             P   G   N          P        YL+ LGY+T + GK H
Sbjct:    65 PSKIGAYDNA------ADFPADIPTYAHYLRALGYKTALAGKMH 102

 Score = 76 (31.8 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
 Identities = 28/109 (25%), Positives = 50/109 (45%)

Query:   273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
             I D +R+ F A  + +D +VGK+++ L++  +  ++I+VF  D              +W 
Sbjct:   248 IRDARRAYFGACSY-IDLNVGKLMQTLDEVGLAEDTIVVFSGDHGDMLGEKGLWYKMHW- 305

Query:   333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
                     +E   R   +++SP     G V+   V  +D LPT +  AN
Sbjct:   306 --------FEMAARVPLVVYSPGQFKPGRVSAS-VSTADLLPTFVEMAN 345

 Score = 58 (25.5 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query:   675 PCL-FDIKNDPCEKNNLADRSEDQRI-NHYTTE 705
             PCL FD+K DP E+ +L+     +++ N +  E
Sbjct:   402 PCLLFDVKKDPKEQKDLSQSPAHEKLFNDFLAE 434

 Score = 56 (24.8 bits), Expect = 1.3e-09, Sum P(3) = 1.3e-09
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query:   781 PCL-FDIKNDPCEKNNLADRSEVQRI-NHYTTE 811
             PCL FD+K DP E+ +L+     +++ N +  E
Sbjct:   402 PCLLFDVKKDPKEQKDLSQSPAHEKLFNDFLAE 434


>RGD|1560491 [details] [associations]
            symbol:Ids "iduronate 2-sulfatase" species:10116 "Rattus
            norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=ISO] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1560491
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            HOGENOM:HOG000014304 HOVERGEN:HBG006120 OrthoDB:EOG49078W
            GO:GO:0004423 EMBL:BN000743 IPI:IPI00764641
            ProteinModelPortal:Q32KJ4 STRING:Q32KJ4 PhosphoSite:Q32KJ4
            InParanoid:Q32KJ4 Genevestigator:Q32KJ4 Uniprot:Q32KJ4
        Length = 543

 Score = 149 (57.5 bits), Expect = 8.1e-10, Sum P(2) = 8.1e-10
 Identities = 45/137 (32%), Positives = 71/137 (51%)

Query:    33 RIMAFAVLPLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
             R ++F++L   F +++V      S+    +I+ I+ DDL    +G +G   + +PNID L
Sbjct:     2 RQLSFSLLLGFFCIALVSAAQGNSATDALNILLIIVDDLR-PSLGCYGDKLVRSPNIDQL 60

Query:    92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYL 149
             A   I+ +N +  Q +C PSR + +TG+ P  T +   N  +    G        +PQY 
Sbjct:    61 ASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHSGNF----STIPQYF 116

Query:   150 KELGYRTRIVGK-WHLG 165
             KE GY T  VGK +H G
Sbjct:   117 KENGYVTMSVGKVFHPG 133

 Score = 76 (31.8 bits), Expect = 8.1e-10, Sum P(2) = 8.1e-10
 Identities = 22/60 (36%), Positives = 36/60 (60%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY P+  P   ++  R I   ++S FA++ + LD  VG ++ AL+  R+  N+II F+SD
Sbjct:   277 PYGPI--P---VDFQRKI---RQSYFASVSY-LDTQVGHLLSALDDLRLAHNTIIAFMSD 327


>ZFIN|ZDB-GENE-030131-4958 [details] [associations]
            symbol:sgsh "N-sulfoglucosamine sulfohydrolase
            (sulfamidase)" species:7955 "Danio rerio" [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-4958
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448
            HOGENOM:HOG000234731 HOVERGEN:HBG012598 KO:K01565 OMA:RDPHETQ
            OrthoDB:EOG4RXZ01 GeneTree:ENSGT00390000013080 EMBL:CU459096
            IPI:IPI00616379 RefSeq:NP_001116740.1 UniGene:Dr.80125
            Ensembl:ENSDART00000063147 GeneID:563849 KEGG:dre:563849
            NextBio:20885106 Uniprot:B0V3V9
        Length = 511

 Score = 138 (53.6 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
 Identities = 45/136 (33%), Positives = 67/136 (49%)

Query:    35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
             MAF        L + F D V      +++ I+ADD G+ +   +    + TP++ AL+  
Sbjct:     1 MAFVFAWTLLCLLLCF-D-VGGCRSRNVLLIIADDGGF-ETDVYNNTVVQTPHLRALSKR 57

Query:    95 GIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYL 149
              +I KN +T V  C+PSRS I+TG  P H     N +YG  +G    +     + LP  L
Sbjct:    58 SLIFKNAFTSVSSCSPSRSTILTGL-PQH----QNGMYGLHQGVHHFNSFDGVQSLPLLL 112

Query:   150 KELGYRTRIVGKWHLG 165
             K     T I+GK H+G
Sbjct:   113 KRANIHTGIIGKKHVG 128

 Score = 72 (30.4 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
 Identities = 32/128 (25%), Positives = 61/128 (47%)

Query:   257 YEP-LQAPDHYLNIHRHIEDFK--RSKFAA---ILHKLDESVGKVVEALEQRRMLSNSII 310
             +EP   +PD  + +   I D    R+  AA    + +LD+ +G V+E L +    +++++
Sbjct:   219 WEPKYYSPDQ-VKVPYFIPDTPAARADIAAQYTTVSRLDQGIGLVLEELRKAGFENDTLV 277

Query:   311 VFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQYVHV 369
             ++ SD             +  P    +  L+  GV+   L+ SP  + R G +++ YV +
Sbjct:   278 IYSSD-------------NGIPFPNGRTNLYGSGVKEPMLLSSPEHQQRWGKLSQAYVSL 324

Query:   370 SDWLPTLL 377
              D  PT+L
Sbjct:   325 LDITPTIL 332

 Score = 58 (25.5 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
 Identities = 14/23 (60%), Positives = 16/23 (69%)

Query:   783 LFDIKNDPCEKNNLA---DRSEV 802
             LFD++ DP EK NLA   D SEV
Sbjct:   447 LFDVRTDPMEKVNLAGDLDYSEV 469

 Score = 54 (24.1 bits), Expect = 2.2e-09, Sum P(4) = 2.2e-09
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query:   677 LFDIKNDPCEKNNLA 691
             LFD++ DP EK NLA
Sbjct:   447 LFDVRTDPMEKVNLA 461

 Score = 37 (18.1 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
 Identities = 7/21 (33%), Positives = 10/21 (47%)

Query:   239 DEPLFLYLAHAATHSANPYEP 259
             + P FLY+A    H     +P
Sbjct:   177 ERPFFLYVAFHDPHRCGHSQP 197


>FB|FBgn0033836 [details] [associations]
            symbol:CG18278 species:7227 "Drosophila melanogaster"
            [GO:0006044 "N-acetylglucosamine metabolic process" evidence=ISS]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GeneTree:ENSGT00400000022041
            GO:GO:0030203 KO:K01137 GO:GO:0008449 OMA:MCGYQTF EMBL:BT021205
            RefSeq:NP_725289.1 UniGene:Dm.28273 SMR:Q5BIL9 STRING:Q5BIL9
            EnsemblMetazoa:FBtr0087716 GeneID:36487 KEGG:dme:Dmel_CG18278
            UCSC:CG18278-RA FlyBase:FBgn0033836 InParanoid:Q5BIL9
            OrthoDB:EOG43TXB4 GenomeRNAi:36487 NextBio:798808 Uniprot:Q5BIL9
        Length = 492

 Score = 174 (66.3 bits), Expect = 1.2e-09, P = 1.2e-09
 Identities = 70/253 (27%), Positives = 117/253 (46%)

Query:    39 VLPLAFTLSMVFVDL--VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSG 95
             ++ LA  + +V   L   AS   P+I+ IL+DD    DV   G+   P  + I+ L + G
Sbjct:     1 MISLAPLIILVLACLGNTASEKLPNILLILSDD---QDVELRGM--FPMEHTIEMLGFGG 55

Query:    96 IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-VLYGC--ERGGLPLSEKILPQYLKE 151
              +  N YT   +C P+R++++TG +  + G ++N V  GC        L  + LP  L++
Sbjct:    56 ALFHNAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPRALPYILQQ 115

Query:   152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
              GY T   GK+   ++     P  +G+    G   G+  Y++++          +R   E
Sbjct:   116 HGYNTFFGGKYLNQYWGAGDVP--KGWNHFYGLH-GNSRYYNYT----------LR---E 159

Query:   212 PAWDLH--GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
              + ++H    Y TD+    A D + N + + EP F  +A  A H   P+ P  AP H   
Sbjct:   160 NSGNVHYESTYLTDLLRDRAADFLRNATQSSEPFFAMVAPPAAHE--PFTP--APRHE-G 214

Query:   269 IHRHIEDFKRSKF 281
             +  HIE  +   F
Sbjct:   215 VFSHIEALRTPSF 227


>UNIPROTKB|F1N2D5 [details] [associations]
            symbol:IDS "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 EMBL:DAAA02068060 EMBL:DAAA02068061
            IPI:IPI00709383 Ensembl:ENSBTAT00000014683 OMA:CREGRNL
            Uniprot:F1N2D5
        Length = 546

 Score = 149 (57.5 bits), Expect = 1.3e-09, Sum P(2) = 1.3e-09
 Identities = 39/113 (34%), Positives = 60/113 (53%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
             ++ P +++ I+ DDL    +G +G   I +PNID LA   ++ +N +  Q +C PSR + 
Sbjct:    30 ATDPLNVLLIIVDDLR-PSLGCYGNKLIRSPNIDQLASRSLLFQNAFAQQAVCAPSRVSF 88

Query:   115 MTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
             +TG+ P  T +   N  +    G        +PQY KE GY T  VGK +H G
Sbjct:    89 LTGRRPDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 137

 Score = 74 (31.1 bits), Expect = 1.3e-09, Sum P(2) = 1.3e-09
 Identities = 20/60 (33%), Positives = 35/60 (58%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY P+ A     +  R I   ++S FA + + LD  VG+++ AL+  ++ S++I+ F SD
Sbjct:   281 PYGPIPA-----DFQRKI---RQSYFACVSY-LDTQVGRLLSALDDLQLASSTIVAFTSD 331


>UNIPROTKB|E1BFX4 [details] [associations]
            symbol:SGSH "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448 KO:K01565
            OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:DAAA02049454
            RefSeq:NP_001095659.2 UniGene:Bt.12396 GeneID:535442
            KEGG:bta:535442 NextBio:20876750 IPI:IPI00907105
            ProteinModelPortal:E1BFX4 Ensembl:ENSBTAT00000020308
            ArrayExpress:E1BFX4 Uniprot:E1BFX4
        Length = 505

 Score = 147 (56.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 39/112 (34%), Positives = 63/112 (56%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
             P +++ ILADD G+   G +    I TP++DALA   ++ +N +T V  C+PSR++++TG
Sbjct:    25 PRNVLLILADDGGFES-GAYNNSAISTPHLDALARRSLVFRNAFTSVSSCSPSRASLLTG 83

Query:   118 KHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYRTRIVGKWHLG 165
               P H     N +YG  +     +  +++  LP  L   G  T I+GK H+G
Sbjct:    84 L-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGIHTGIIGKKHVG 130

 Score = 74 (31.1 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 26/92 (28%), Positives = 44/92 (47%)

Query:   287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
             ++D+ +G V++ L    +L++++++F SD               +P  G  N  W G   
Sbjct:   248 RMDQGIGLVLQELRGAGVLNDTLVIFTSDNGIP-----------FP-SGRTNLYWPGTAE 295

Query:   347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
                L+ SP    R G V+E YV + D  PT+L
Sbjct:   296 PM-LVSSPEHPKRWGQVSEAYVSLLDLTPTIL 326

 Score = 41 (19.5 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query:   239 DEPLFLYLAHAATHSANPYEP 259
             D P FLY+A    H     +P
Sbjct:   171 DRPFFLYVAFHDPHRCGHSQP 191

 Score = 39 (18.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query:   677 LFDIKNDPCEKNNLA 691
             L+D   DP E +NLA
Sbjct:   441 LYDRNQDPHETHNLA 455

 Score = 39 (18.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query:   783 LFDIKNDPCEKNNLA 797
             L+D   DP E +NLA
Sbjct:   441 LYDRNQDPHETHNLA 455


>TIGR_CMR|SPO_A0121 [details] [associations]
            symbol:SPO_A0121 "sulfatase family protein"
            species:246200 "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic
            process" evidence=ISS] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 EMBL:CP000032 GenomeReviews:CP000032_GR
            HOGENOM:HOG000230030 RefSeq:YP_164953.1 ProteinModelPortal:Q5LLA5
            GeneID:3196629 KEGG:sil:SPOA0121 PATRIC:23381566 OMA:FDYLSCY
            ProtClustDB:CLSK867183 Uniprot:Q5LLA5
        Length = 552

 Score = 139 (54.0 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 33/107 (30%), Positives = 61/107 (57%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGKH 119
             +I++I+ D L ++ +  +G +++ TPNID LA  G+   N Y    +C PSR +  TG++
Sbjct:     6 NILWIMCDQLRFDYLSCYGHERLNTPNIDKLAKRGVRFTNAYVQATVCGPSRMSAYTGRY 65

Query:   120 PIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHL 164
              + +       +G  + G+PL   E  L  +L+++G R  ++GK H+
Sbjct:    66 -VRS-------HGSTQNGIPLRVGEPTLGDHLRDVGMRNVLIGKTHM 104

 Score = 71 (30.1 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 22/103 (21%), Positives = 49/103 (47%)

Query:   281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
             +  ++ ++D+ +G++   +++R +  N++IVF +D              +W   G K   
Sbjct:   294 YMGLIKQIDDQLGQLFAFMQERGLDENTMIVFTADHGDYLG-------DHW--MGEKYLF 344

Query:   341 WEGGVRGAGLIWSPLLES---RGIVAEQYVHVSDWLPTLLSAA 380
             +E   +   +I+ P  ++   RG V++  V + D  PT +  A
Sbjct:   345 YEAAAKVPLIIYDPSDKADATRGTVSDALVEMIDLAPTFVDYA 387

 Score = 48 (22.0 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 18/61 (29%), Positives = 30/61 (49%)

Query:   204 LD-MRR-DLEPAWDLHGKYSTDVFTA-EAVDIIH--NHSTDEPLFL-YLAHAATHSANPY 257
             LD M+R  ++P  ++  +     F A +  D +H   +   EP +  YL HA   + NP+
Sbjct:   108 LDGMKRLGIDPDSEIGARVGEGGFDAFDRDDGVHPTGYRKKEPAYNDYLRHAGFQAENPW 167

Query:   258 E 258
             E
Sbjct:   168 E 168

 Score = 46 (21.3 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query:   674 APCLFDIKNDPCEKNNLA-DRSEDQ 697
             AP LFD++ DP E  +L  D S ++
Sbjct:   458 APILFDLEVDPDELKDLGRDPSAEE 482

 Score = 46 (21.3 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
 Identities = 10/31 (32%), Positives = 15/31 (48%)

Query:   780 APCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
             AP LFD++ DP E  +L      + +    T
Sbjct:   458 APILFDLEVDPDELKDLGRDPSAEEVRQRLT 488

 Score = 41 (19.5 bits), Expect = 1.4e-05, Sum P(3) = 1.4e-05
 Identities = 14/41 (34%), Positives = 18/41 (43%)

Query:   224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
             VFTA+  D + +H   E    Y A AA      Y+P    D
Sbjct:   324 VFTADHGDYLGDHWMGEKYLFYEA-AAKVPLIIYDPSDKAD 363


>RGD|708554 [details] [associations]
            symbol:Sulf1 "sulfatase 1" species:10116 "Rattus norvegicus"
            [GO:0001502 "cartilage condensation" evidence=ISS] [GO:0001822
            "kidney development" evidence=ISO;ISS] [GO:0001937 "negative
            regulation of endothelial cell proliferation" evidence=IEA;ISO;ISS]
            [GO:0002063 "chondrocyte development" evidence=IEA;ISO;ISS]
            [GO:0003094 "glomerular filtration" evidence=IEA;ISO] [GO:0004065
            "arylsulfatase activity" evidence=IEA;ISO;ISS] [GO:0005509 "calcium
            ion binding" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA;ISO;NAS;IDA] [GO:0005794 "Golgi apparatus"
            evidence=IEA;IDA] [GO:0005795 "Golgi stack" evidence=NAS]
            [GO:0005886 "plasma membrane" evidence=IEA;ISO] [GO:0007155 "cell
            adhesion" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=NAS] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA;ISO;ISS] [GO:0009986 "cell surface"
            evidence=IEA;ISO;ISS;NAS;IDA] [GO:0010575 "positive regulation
            vascular endothelial growth factor production" evidence=IEA;ISO]
            [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA;ISO;ISS] [GO:0016525 "negative regulation of
            angiogenesis" evidence=IEA;ISO;ISS] [GO:0018741 "alkyl sulfatase
            activity" evidence=NAS] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IEA;ISO;ISS] [GO:0030201
            "heparan sulfate proteoglycan metabolic process"
            evidence=IEA;ISO;ISS] [GO:0030336 "negative regulation of cell
            migration" evidence=IEA;ISO;ISS] [GO:0030513 "positive regulation
            of BMP signaling pathway" evidence=IEA;ISO;ISS] [GO:0032836
            "glomerular basement membrane development" evidence=IEA;ISO]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA;ISO;ISS] [GO:0036022 "limb joint
            morphogenesis" evidence=ISS] [GO:0040036 "regulation of fibroblast
            growth factor receptor signaling pathway" evidence=ISO] [GO:0040037
            "negative regulation of fibroblast growth factor receptor signaling
            pathway" evidence=IEA;ISO;ISS] [GO:0045121 "membrane raft"
            evidence=IEA;ISO;ISS] [GO:0048010 "vascular endothelial growth
            factor receptor signaling pathway" evidence=IEA;ISO;ISS]
            [GO:0048661 "positive regulation of smooth muscle cell
            proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=IEA;ISO;ISS] [GO:0051216 "cartilage
            development" evidence=ISO;ISS] [GO:0060348 "bone development"
            evidence=IEA;ISO;ISS] [GO:0060384 "innervation"
            evidence=IEA;ISO;ISS] [GO:0060686 "negative regulation of prostatic
            bud formation" evidence=IEA;ISO;ISS] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0048706 GO:GO:0048010 GO:GO:0018741 GO:GO:0060686
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 CTD:23213 OrthoDB:EOG4VT5WH
            EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1 UniGene:Rn.161961
            ProteinModelPortal:Q8VI60 STRING:Q8VI60 GeneID:171396
            KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244 ArrayExpress:Q8VI60
            Genevestigator:Q8VI60 GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
        Length = 870

 Score = 162 (62.1 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
 Identities = 63/223 (28%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    L E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
 Identities = 28/124 (22%), Positives = 51/124 (41%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A   D P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DTP 377

Query:   387 NYVN 390
             + V+
Sbjct:   378 SDVD 381

 Score = 47 (21.6 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>UNIPROTKB|Q8VI60 [details] [associations]
            symbol:Sulf1 "Extracellular sulfatase Sulf-1" species:10116
            "Rattus norvegicus" [GO:0005509 "calcium ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554
            GO:GO:0005783 GO:GO:0005886 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
            GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0004065 GO:GO:0048706 GO:GO:0048010 GO:GO:0018741
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            OrthoDB:EOG4VT5WH EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1
            UniGene:Rn.161961 ProteinModelPortal:Q8VI60 STRING:Q8VI60
            GeneID:171396 KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244
            ArrayExpress:Q8VI60 Genevestigator:Q8VI60
            GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
        Length = 870

 Score = 162 (62.1 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
 Identities = 63/223 (28%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    L E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
 Identities = 28/124 (22%), Positives = 51/124 (41%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A   D P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DTP 377

Query:   387 NYVN 390
             + V+
Sbjct:   378 SDVD 381

 Score = 47 (21.6 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>FB|FBgn0260475 [details] [associations]
            symbol:CG30059 species:7227 "Drosophila melanogaster"
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0006044 "N-acetylglucosamine metabolic process"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GeneTree:ENSGT00400000022041 GO:GO:0030203
            KO:K01137 GO:GO:0008449 OrthoDB:EOG43TXB4 EMBL:AY061585
            RefSeq:NP_610872.1 UniGene:Dm.21320 SMR:Q95R73 STRING:Q95R73
            EnsemblMetazoa:FBtr0087715 GeneID:246425 KEGG:dme:Dmel_CG30059
            UCSC:CG30059-RA FlyBase:FBgn0260475 InParanoid:Q95R73 OMA:GNSQYYN
            GenomeRNAi:246425 NextBio:842420 Uniprot:Q95R73
        Length = 492

 Score = 171 (65.3 bits), Expect = 2.6e-09, P = 2.6e-09
 Identities = 68/249 (27%), Positives = 114/249 (45%)

Query:    41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSGIILK 99
             PL   L +  +   AS   P+I+ IL+DD    DV   G+   P  + I+ L + G +  
Sbjct:     6 PL-IVLVLACLGNTASEKLPNILLILSDD---QDVELRGM--FPMEHTIEMLGFGGALFH 59

Query:   100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-VLYGC--ERGGLPLSEKILPQYLKELGYR 155
             N YT   +C P+R++++TG +  + G ++N V  GC        L  + LP  L++ GY 
Sbjct:    60 NAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPRALPYILQQHGYN 119

Query:   156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
             T   GK+   ++     P  +G+ +  G   G+  Y++++          +R   E   +
Sbjct:   120 TFFGGKYLNQYWGAGDVP--KGWNNFYGLH-GNSRYYNYT----------LR---ENTGN 163

Query:   216 LH--GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
             +H    Y +D+    A D + N + + EP F  +A  A H   P+ P  AP H   +  H
Sbjct:   164 VHYESTYLSDLLRDRAADFLRNATQSSEPFFAMVAPPAAHE--PFTP--APRHE-GVFSH 218

Query:   273 IEDFKRSKF 281
             IE  +   F
Sbjct:   219 IEALRTPSF 227


>UNIPROTKB|Q32KH2 [details] [associations]
            symbol:sulf1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0030201 "heparan sulfate proteoglycan
            metabolic process" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=ISS] [GO:0001822 "kidney development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
            "positive regulation of smooth muscle cell proliferation"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
            "membrane raft" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0005783 "endoplasmic reticulum" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
            factor receptor signaling pathway" evidence=ISS] [GO:0030513
            "positive regulation of BMP signaling pathway" evidence=ISS]
            [GO:0030336 "negative regulation of cell migration" evidence=ISS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISS] [GO:0001937 "negative regulation of endothelial cell
            proliferation" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0060348 "bone development" evidence=ISS]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic factor
            receptor signaling pathway" evidence=ISS] [GO:0002063 "chondrocyte
            development" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
            evidence=ISS] [GO:0001502 "cartilage condensation" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0005886 "plasma
            membrane" evidence=IEA] [GO:0003094 "glomerular filtration"
            evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            OrthoDB:EOG4VT5WH EMBL:AAEX03015848 EMBL:BN000765
            RefSeq:NP_001041580.1 UniGene:Cfa.36649 Ensembl:ENSCAFT00000046451
            GeneID:486986 KEGG:cfa:486986 InParanoid:Q32KH2 NextBio:20860674
            Uniprot:Q32KH2
        Length = 869

 Score = 161 (61.7 bits), Expect = 4.3e-09, Sum P(2) = 4.3e-09
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 4.3e-09, Sum P(2) = 4.3e-09
 Identities = 27/131 (20%), Positives = 51/131 (38%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 49 (22.3 bits), Expect = 9.4e-08, Sum P(2) = 9.4e-08
 Identities = 19/87 (21%), Positives = 34/87 (39%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
             +G  +  P++   Y N +    P Y    N  ++   +Y GP     +  + N  H   +
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283

Query:   485 PRL---ENSING--NGTSENRSNDNSY 506
               L   ++S+    N   E    DN+Y
Sbjct:   284 QTLMSVDDSVERLYNMLVETGELDNTY 310


>UNIPROTKB|G1PHQ1 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:59463
            "Myotis lucifugus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK
            EMBL:AAPE02021694 Ensembl:ENSMLUT00000011203 Uniprot:G1PHQ1
        Length = 871

 Score = 161 (61.7 bits), Expect = 4.4e-09, Sum P(2) = 4.4e-09
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 4.4e-09, Sum P(2) = 4.4e-09
 Identities = 27/131 (20%), Positives = 51/131 (38%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDPPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>UNIPROTKB|F1Q233 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0009986 "cell surface" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0005783
            "endoplasmic reticulum" evidence=IEA] [GO:0005509 "calcium ion
            binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005794 GO:GO:0009986
            GO:GO:0005509 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 GeneTree:ENSGT00400000022041
            InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK EMBL:AAEX03015848
            Ensembl:ENSCAFT00000012295 Uniprot:F1Q233
        Length = 891

 Score = 161 (61.7 bits), Expect = 4.6e-09, Sum P(2) = 4.6e-09
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 4.6e-09, Sum P(2) = 4.6e-09
 Identities = 27/131 (20%), Positives = 51/131 (38%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 49 (22.3 bits), Expect = 1.0e-07, Sum P(2) = 1.0e-07
 Identities = 19/87 (21%), Positives = 34/87 (39%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
             +G  +  P++   Y N +    P Y    N  ++   +Y GP     +  + N  H   +
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283

Query:   485 PRL---ENSING--NGTSENRSNDNSY 506
               L   ++S+    N   E    DN+Y
Sbjct:   284 QTLMSVDDSVERLYNMLVETGELDNTY 310


>UNIPROTKB|G3T2L0 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 OMA:QRKGDEC
            Ensembl:ENSLAFT00000008824 Uniprot:G3T2L0
        Length = 857

 Score = 159 (61.0 bits), Expect = 5.4e-09, Sum P(2) = 5.4e-09
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    44 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 154

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   155 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 204

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   205 KMSKRLYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 246

 Score = 63 (27.2 bits), Expect = 5.4e-09, Sum P(2) = 5.4e-09
 Identities = 27/131 (20%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   269 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 328

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   329 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 379

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   380 DVDGKSVLKLL 390

 Score = 47 (21.6 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 267


>UNIPROTKB|F6VXY6 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
            EMBL:ACFV01096449 EMBL:ACFV01096450 EMBL:ACFV01096451
            EMBL:ACFV01096452 EMBL:ACFV01096453 EMBL:ACFV01096454
            EMBL:ACFV01096455 EMBL:ACFV01096456 EMBL:ACFV01096457
            EMBL:ACFV01096458 EMBL:ACFV01096459 EMBL:ACFV01096460
            EMBL:ACFV01096461 RefSeq:XP_002759021.1 Ensembl:ENSCJAT00000009824
            Ensembl:ENSCJAT00000053576 GeneID:100390937 Uniprot:F6VXY6
        Length = 869

 Score = 159 (61.0 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
 Identities = 63/223 (28%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNSTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+V+  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESVNYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
 Identities = 27/131 (20%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266

 Score = 43 (20.2 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
 Identities = 26/101 (25%), Positives = 42/101 (41%)

Query:   408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH--EYNPK-YENRYENGTHEYN 464
             E+G     S R          GT +Y P++ +  +  +   E+  + Y+   E    +  
Sbjct:   508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQTRSLSVEFEGEIYDINLEEEELQVL 567

Query:   465 GPKNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
              P+N     R++ G HE   PR L+ S  GN  G   + SN
Sbjct:   568 HPRN--IAKRHDEG-HEE--PRGLQASSGGNRGGMLADSSN 603


>UNIPROTKB|F1RU06 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030513 "positive regulation of BMP signaling pathway"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0060348 "bone
            development" evidence=ISS] [GO:0001822 "kidney development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0048706
            "embryonic skeletal system development" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
            "positive regulation of smooth muscle cell proliferation"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
            "membrane raft" evidence=ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=ISS] [GO:0004065 "arylsulfatase activity" evidence=ISS]
            [GO:0048010 "vascular endothelial growth factor receptor signaling
            pathway" evidence=ISS] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=ISS]
            [GO:0030336 "negative regulation of cell migration" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=ISS] [GO:0001937 "negative regulation
            of endothelial cell proliferation" evidence=ISS] [GO:0005794 "Golgi
            apparatus" evidence=ISS] [GO:0060686 "negative regulation of
            prostatic bud formation" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=ISS]
            [GO:0036022 "limb joint morphogenesis" evidence=ISS] [GO:0001502
            "cartilage condensation" evidence=ISS] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0003094 "glomerular filtration" evidence=IEA] [GO:0005509
            "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0010575 GO:GO:0045121 GO:GO:0030336
            GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0032836 GO:GO:0060384 GO:GO:0008449 GO:GO:0030201
            GO:GO:0014846 GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609
            Pfam:PF12548 OMA:SVRVTHK EMBL:CU179692 EMBL:CU302274
            Ensembl:ENSSSCT00000006792 Uniprot:F1RU06
        Length = 871

 Score = 159 (61.0 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    44 PNIILVLTDD---QDVELGSL-QVMNKTRKIMELGGATFTNAFVTTPMCCPSRSSMLTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKIL-PQ----YLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P  + +  P+    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNVYTNNENCSSPSWQAVHEPRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   155 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 204

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   205 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 246

 Score = 62 (26.9 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
 Identities = 27/131 (20%), Positives = 51/131 (38%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   269 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 328

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   329 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 379

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   380 DVDGKSVLKLL 390

 Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 267


>MGI|MGI:2138563 [details] [associations]
            symbol:Sulf1 "sulfatase 1" species:10090 "Mus musculus"
            [GO:0001822 "kidney development" evidence=IGI] [GO:0001937
            "negative regulation of endothelial cell proliferation"
            evidence=ISO] [GO:0002063 "chondrocyte development" evidence=IMP]
            [GO:0003094 "glomerular filtration" evidence=IGI] [GO:0003824
            "catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=ISO] [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0005794
            "Golgi apparatus" evidence=ISO] [GO:0005886 "plasma membrane"
            evidence=IDA] [GO:0006790 "sulfur compound metabolic process"
            evidence=ISO] [GO:0006915 "apoptotic process" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IGI] [GO:0016525 "negative regulation of angiogenesis"
            evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISO;IMP] [GO:0030336 "negative regulation of cell
            migration" evidence=ISO] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=ISO] [GO:0032836 "glomerular basement
            membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=IDA]
            [GO:0040036 "regulation of fibroblast growth factor receptor
            signaling pathway" evidence=ISO] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISO;IGI;IDA] [GO:0045121 "membrane raft" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048010 "vascular
            endothelial growth factor receptor signaling pathway" evidence=ISO]
            [GO:0048706 "embryonic skeletal system development" evidence=IGI]
            [GO:0051216 "cartilage development" evidence=IMP] [GO:0060348 "bone
            development" evidence=IGI] [GO:0060384 "innervation" evidence=IGI]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=IDA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 MGI:MGI:2138563 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
            KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
            GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
            OrthoDB:EOG4VT5WH ChiTaRS:SULF1 EMBL:AY101178 EMBL:AK129278
            EMBL:AK028285 EMBL:AK045002 EMBL:BC034547 EMBL:BC049276
            IPI:IPI00111481 RefSeq:NP_001185494.1 RefSeq:NP_001185495.1
            RefSeq:NP_758498.1 UniGene:Mm.45563 ProteinModelPortal:Q8K007
            SMR:Q8K007 STRING:Q8K007 PhosphoSite:Q8K007 PRIDE:Q8K007
            Ensembl:ENSMUST00000088585 Ensembl:ENSMUST00000177608
            Ensembl:ENSMUST00000180062 GeneID:240725 KEGG:mmu:240725
            UCSC:uc007aia.2 NextBio:384701 Bgee:Q8K007 CleanEx:MM_SULF1
            Genevestigator:Q8K007 GermOnline:ENSMUSG00000016918 Uniprot:Q8K007
        Length = 870

 Score = 158 (60.7 bits), Expect = 7.2e-09, Sum P(2) = 7.2e-09
 Identities = 62/223 (27%), Positives = 95/223 (42%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 63 (27.2 bits), Expect = 7.2e-09, Sum P(2) = 7.2e-09
 Identities = 28/124 (22%), Positives = 51/124 (41%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A   D P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DSP 377

Query:   387 NYVN 390
             + V+
Sbjct:   378 SDVD 381

 Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>DICTYBASE|DDB_G0287225 [details] [associations]
            symbol:DDB_G0287225 species:44689 "Dictyostelium
            discoideum" [GO:0051082 "unfolded protein binding" evidence=IEA]
            [GO:0006457 "protein folding" evidence=IEA] InterPro:IPR002939
            InterPro:IPR008971 Pfam:PF01556 InterPro:IPR001623
            InterPro:IPR018253 dictyBase:DDB_G0287225 Pfam:PF00226
            GO:GO:0006457 eggNOG:COG0484 Gene3D:1.10.287.110 PRINTS:PR00625
            SMART:SM00271 SUPFAM:SSF46565 SUPFAM:SSF49493 PROSITE:PS00636
            PROSITE:PS50076 EMBL:AAFI02000099 RefSeq:XP_637319.1
            ProteinModelPortal:Q54KN8 EnsemblProtists:DDB0187373 GeneID:8626017
            KEGG:ddi:DDB_G0287225 OMA:CDYTSIN Uniprot:Q54KN8
        Length = 701

 Score = 169 (64.5 bits), Expect = 7.6e-09, P = 7.6e-09
 Identities = 48/191 (25%), Positives = 81/191 (42%)

Query:   387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
             N  NS++ +++    NS     N  +   +    NSN    N  + YN    N   N  +
Sbjct:    50 NSNNSSISSLVNNSNNSDNNNNNNNNNNKNKNNNNSNNNNSNNNNNYNNNNNNNNNNNNN 109

Query:   447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
               N    N+  N  +  N   N N N    N  + YN     NS N N +S N++N NS 
Sbjct:   110 NNNNNNNNKNNNNKNNNN---NNNNNYNNNNNNNNYNYNYNNNSNNSNNSS-NKNNSNSN 165

Query:   507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI--SALTRGKWKL--VKENSINGNGTS 562
              N ++ ++ +  +     +  +    N  D+ +I  S + + K     +  NSIN N ++
Sbjct:   166 SNSLNDLNQFMEIKEAYETLMDPTRKNKYDKSEILNSVILKHKSDFLPISLNSINNNISN 225

Query:   563 ENRSNDNSYQN 573
              N +N+N+  N
Sbjct:   226 NNNNNNNNNNN 236

 Score = 133 (51.9 bits), Expect = 5.9e-05, P = 5.9e-05
 Identities = 51/213 (23%), Positives = 89/213 (41%)

Query:   373 LPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHE 432
             + +L++ +N SD  N  N+   N      NS     N  + YN+    N+N    N  + 
Sbjct:    56 ISSLVNNSNNSDNNNNNNNN-NNKNKNNNNSNNNNSNNNNNYNNNNNNNNNNNNNNNNNN 114

Query:   433 YNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSIN 492
              N K  N  +N  +  N  Y N   N  + YN   N N +    N +++ N     NS+N
Sbjct:   115 NNNKNNNN-KNNNNNNNNNYNNNNNNNNYNYNYNNNSNNS---NNSSNKNNSNSNSNSLN 170

Query:   493 G-NGTSENRSN-----DNSYQNEIDGIDVW-SVLSRNEPSKRNTILHNIDDEWQISALTR 545
               N   E +       D + +N+ D  ++  SV+ +++       L++I++   IS    
Sbjct:   171 DLNQFMEIKEAYETLMDPTRKNKYDKSEILNSVILKHKSDFLPISLNSINNN--ISNNNN 228

Query:   546 GKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
                     N+ N N  + N +N+N+  N  + I
Sbjct:   229 NNNNNNNNNNNNNNNNNNNNNNNNNSNNSNNNI 261


>UNIPROTKB|G1LHX9 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK EMBL:ACTA01145671
            EMBL:ACTA01153670 Ensembl:ENSAMET00000006800 Uniprot:G1LHX9
        Length = 868

 Score = 159 (61.0 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09
 Identities = 61/223 (27%), Positives = 95/223 (42%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 61 (26.5 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09
 Identities = 26/131 (19%), Positives = 51/131 (38%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +    K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   +V +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>UNIPROTKB|F7FJY3 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001502 "cartilage condensation" evidence=ISS]
            [GO:0001822 "kidney development" evidence=ISS] [GO:0001937
            "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
            GO:GO:0005615 GO:GO:0009986 GO:GO:0048661 GO:GO:0010575
            GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
            GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
            Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:QRKGDEC Ensembl:ENSMMUT00000032744 Ensembl:ENSMMUT00000032745
            Uniprot:F7FJY3
        Length = 759

 Score = 158 (60.7 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 27/131 (20%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 2.2e-07, Sum P(2) = 2.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266

 Score = 40 (19.1 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 25/101 (24%), Positives = 41/101 (40%)

Query:   408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
             E+G     S R          GT +Y P++ +  +  T   + ++E   Y+    E    
Sbjct:   508 ESGYRASGSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565

Query:   467 --KNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
               +  N   R++ G H+   PR L+ S  GN  G   + SN
Sbjct:   566 VLQPRNIAKRHDEG-HKG--PRDLQASSGGNRGGMLADSSN 603


>UNIPROTKB|O60597 [details] [associations]
            symbol:IDS "Iduronate-2-sulfatase" species:9606 "Homo
            sapiens" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            HSSP:P08842 EMBL:AC233288 UniGene:Hs.460960 HGNC:HGNC:5389
            ChiTaRS:IDS EMBL:AF050145 IPI:IPI00640469 SMR:O60597 STRING:O60597
            Ensembl:ENST00000428056 UCSC:uc011mxj.2 HOGENOM:HOG000207088
            HOVERGEN:HBG053054 Uniprot:O60597
        Length = 179

 Score = 142 (55.0 bits), Expect = 1.2e-08, P = 1.2e-08
 Identities = 37/108 (34%), Positives = 57/108 (52%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
             +++ I+ DDL    +G +G   + +PNID LA   ++ +N +  Q +C PSR + +TG+ 
Sbjct:    38 NVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRR 96

Query:   120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
             P  T +   N  +    G        +PQY KE GY T  VGK +H G
Sbjct:    97 PDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 140


>UNIPROTKB|G1SJB8 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
            GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 EMBL:AAGW02046925 EMBL:AAGW02046926
            Ensembl:ENSOCUT00000003251 Uniprot:G1SJB8
        Length = 869

 Score = 161 (61.7 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
 Identities = 26/131 (19%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   +V +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 5.1e-07, Sum P(3) = 5.1e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266

 Score = 42 (19.8 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
 Identities = 21/98 (21%), Positives = 37/98 (37%)

Query:   408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
             E+G     S R          GT +Y P++ +  +  T   + ++E   Y+    E    
Sbjct:   508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565

Query:   467 --KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
               +  N   R++ G       +  +S NG G   + SN
Sbjct:   566 VLQPRNIAKRHDEGHRGLRGRQAGSSGNGAGMLADSSN 603

 Score = 39 (18.8 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
 Identities = 11/28 (39%), Positives = 14/28 (50%)

Query:   749 YSNEEEGMRKLRDAASIQCGPVKEVPCE 776
             Y N+E+G+RK     S    P KE   E
Sbjct:   681 YYNKEKGVRKQEKLKS-HLHPFKEAAQE 707


>UNIPROTKB|G3R9R9 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9595
            "Gorilla gorilla gorilla" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
            GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK RefSeq:XP_004047178.1 RefSeq:XP_004047179.1
            Ensembl:ENSGGOT00000012515 GeneID:101141420 Uniprot:G3R9R9
        Length = 869

 Score = 158 (60.7 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 27/131 (20%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266

 Score = 41 (19.5 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 25/101 (24%), Positives = 41/101 (40%)

Query:   408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
             E+G     S R          GT +Y P++ +  +  T   + ++E   Y+    E    
Sbjct:   508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565

Query:   467 --KNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
               +  N   R++ G H+   PR L+ S  GN  G   + SN
Sbjct:   566 VLQPRNIAKRHDEG-HKR--PRDLQASSGGNRGGMLADSSN 603


>UNIPROTKB|P22304 [details] [associations]
            symbol:IDS "Iduronate 2-sulfatase" species:9606 "Homo
            sapiens" [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004423
            "iduronate-2-sulfatase activity" evidence=TAS] [GO:0005975
            "carbohydrate metabolic process" evidence=TAS] [GO:0006027
            "glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=TAS] [GO:0030204
            "chondroitin sulfate metabolic process" evidence=TAS] [GO:0030207
            "chondroitin sulfate catabolic process" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0044281 "small molecule
            metabolic process" evidence=TAS] Reactome:REACT_111217
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
            EMBL:CH471171 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0030207 EMBL:AF011889 EMBL:M58342 EMBL:L13329 EMBL:L13321
            EMBL:L13322 EMBL:L13323 EMBL:L13324 EMBL:L13325 EMBL:L13326
            EMBL:L13327 EMBL:L13328 EMBL:L04586 EMBL:L04578 EMBL:L04579
            EMBL:L04580 EMBL:L04581 EMBL:L04583 EMBL:L04582 EMBL:L04584
            EMBL:L04585 EMBL:L40586 EMBL:AC233288 EMBL:BC006170 IPI:IPI00006121
            IPI:IPI00013771 IPI:IPI00026104 PIR:A47535 RefSeq:NP_000193.1
            RefSeq:NP_006114.1 UniGene:Hs.460960 ProteinModelPortal:P22304
            IntAct:P22304 STRING:P22304 PhosphoSite:P22304 DMDM:124174
            PRIDE:P22304 Ensembl:ENST00000340855 Ensembl:ENST00000370441
            Ensembl:ENST00000370443 Ensembl:ENST00000466323 GeneID:3423
            KEGG:hsa:3423 UCSC:uc004fcw.4 UCSC:uc011mxh.2 CTD:3423
            GeneCards:GC0XM148558 HGNC:HGNC:5389 MIM:300823 MIM:309900
            neXtProt:NX_P22304 Orphanet:217085 Orphanet:217093 PharmGKB:PA29636
            HOGENOM:HOG000014304 HOVERGEN:HBG006120 InParanoid:P22304 KO:K01136
            OMA:CREGKNL OrthoDB:EOG49078W PhylomeDB:P22304
            BioCyc:MetaCyc:HS00286-MONOMER ChiTaRS:IDS GenomeRNAi:3423
            NextBio:13500 PMAP-CutDB:P22304 ArrayExpress:P22304 Bgee:P22304
            CleanEx:HS_IDS Genevestigator:P22304 GermOnline:ENSG00000010404
            GO:GO:0004423 Uniprot:P22304
        Length = 550

 Score = 142 (55.0 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 37/108 (34%), Positives = 57/108 (52%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
             +++ I+ DDL    +G +G   + +PNID LA   ++ +N +  Q +C PSR + +TG+ 
Sbjct:    38 NVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRR 96

Query:   120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
             P  T +   N  +    G        +PQY KE GY T  VGK +H G
Sbjct:    97 PDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 140

 Score = 71 (30.1 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 20/60 (33%), Positives = 37/60 (61%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY P+  P   ++  R I   ++S FA++ + LD  VG+++ AL+  ++ +++II F SD
Sbjct:   284 PYGPI--P---VDFQRKI---RQSYFASVSY-LDTQVGRLLSALDDLQLANSTIIAFTSD 334


>UNIPROTKB|F1NI04 [details] [associations]
            symbol:GNS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
            PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:AADN02009911 IPI:IPI00596266
            Ensembl:ENSGALT00000016025 Uniprot:F1NI04
        Length = 546

 Score = 174 (66.3 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
 Identities = 73/253 (28%), Positives = 109/253 (43%)

Query:    31 RTRIM---AFAVLP--LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
             R RIM   A A L   LA    +V     A+   P+++ IL DD    DV   G+  +  
Sbjct:     5 RRRIMSRSALAALARGLALAALLVLSPAQAARQRPNVVLILTDD---QDVFLGGMTPMKK 61

Query:    86 PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE 142
              N   +A  G+   N Y    LC PSR++I+TGK+P +  + +N L G C  +    + E
Sbjct:    62 TNA-LIAQMGVTFSNAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKLWQKIQE 120

Query:   143 -KILPQYLKEL-GYRTRIVGKWHLGFYKKEYTPTFRGFESHL----GYWTGHQDYFDHSA 196
                 P  LK + GY+T   GK     Y  EY     G  SH+     +W   +    +  
Sbjct:   121 PNTFPALLKSMCGYQTFFAGK-----YLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYN 175

Query:   197 EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
               + + G   R     + D    Y TDV    ++D +   S  EP F+ ++  A HS   
Sbjct:   176 YTLSVNGKARRHGENYSVD----YLTDVLANMSLDFLEYKSNFEPFFMMISTPAPHSPWT 231

Query:   257 YEPLQAPDHYLNI 269
               P Q  + +LN+
Sbjct:   232 AAP-QYKNDFLNV 243

 Score = 37 (18.1 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query:   450 PKYENRYENGTHEYNGPKNENTNPRYEN 477
             P+Y+N + N     + P+N N N   +N
Sbjct:   234 PQYKNDFLN----VSAPRNSNFNIHGKN 257


>UNIPROTKB|Q8IWU6 [details] [associations]
            symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0006915 "apoptotic process" evidence=IEA] [GO:0005795 "Golgi
            stack" evidence=IEA] [GO:0004065 "arylsulfatase activity"
            evidence=IMP;IDA] [GO:0005615 "extracellular space"
            evidence=IDA;NAS] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=IDA;NAS] [GO:0030336 "negative regulation of cell
            migration" evidence=IMP] [GO:0040036 "regulation of fibroblast
            growth factor receptor signaling pathway" evidence=IMP] [GO:0040037
            "negative regulation of fibroblast growth factor receptor signaling
            pathway" evidence=ISS;IMP] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=IMP] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IMP;IDA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=IDA] [GO:0045121 "membrane raft" evidence=IDA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] [GO:0048010 "vascular
            endothelial growth factor receptor signaling pathway" evidence=IDA]
            [GO:0002063 "chondrocyte development" evidence=ISS] [GO:0035860
            "glial cell-derived neurotrophic factor receptor signaling pathway"
            evidence=ISS] [GO:0051216 "cartilage development" evidence=ISS]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=ISS] [GO:0005794 "Golgi apparatus" evidence=ISS]
            [GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=ISS] [GO:0003094 "glomerular filtration" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
            evidence=IDA] [GO:0001937 "negative regulation of endothelial cell
            proliferation" evidence=IDA] [GO:0014846 "esophagus smooth muscle
            contraction" evidence=ISS] [GO:0048706 "embryonic skeletal system
            development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
            [GO:0001822 "kidney development" evidence=ISS] [GO:0060348 "bone
            development" evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 EMBL:AY101175 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
            GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 Orphanet:2496
            HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
            GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:23213
            EMBL:AF545571 EMBL:AB029000 EMBL:AK074873 IPI:IPI00293203
            RefSeq:NP_001121676.1 RefSeq:NP_001121677.1 RefSeq:NP_001121678.1
            RefSeq:NP_055985.2 UniGene:Hs.409602 ProteinModelPortal:Q8IWU6
            SMR:Q8IWU6 STRING:Q8IWU6 PhosphoSite:Q8IWU6 DMDM:33112447
            PaxDb:Q8IWU6 PRIDE:Q8IWU6 DNASU:23213 Ensembl:ENST00000260128
            Ensembl:ENST00000402687 Ensembl:ENST00000419716
            Ensembl:ENST00000458141 GeneID:23213 KEGG:hsa:23213 UCSC:uc003xyd.2
            GeneCards:GC08P070428 HGNC:HGNC:20391 MIM:610012 neXtProt:NX_Q8IWU6
            PharmGKB:PA134861022 InParanoid:Q8IWU6 OMA:SVRVTHK
            OrthoDB:EOG4VT5WH ChiTaRS:SULF1 GenomeRNAi:23213 NextBio:44771
            ArrayExpress:Q8IWU6 Bgee:Q8IWU6 CleanEx:HS_SULF1
            Genevestigator:Q8IWU6 Uniprot:Q8IWU6
        Length = 871

 Score = 158 (60.7 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
 Identities = 62/223 (27%), Positives = 96/223 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   LG     + +++++       G+  +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 64 (27.6 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
 Identities = 27/131 (20%), Positives = 52/131 (39%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+SV ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   IV +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DVDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266

 Score = 38 (18.4 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
 Identities = 22/92 (23%), Positives = 37/92 (40%)

Query:   408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YE---NGTHEY 463
             E+G     S R          GT +Y P++ +  +  T   + ++E   Y+       E 
Sbjct:   508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEEEL 565

Query:   464 NGPKNENTNPRYENGTHEYNIPR-LENSINGN 494
                +  N   R++ G H+   PR L+ S  GN
Sbjct:   566 QVLQPRNIAKRHDEG-HKG--PRDLQASSGGN 594


>UNIPROTKB|G1KQZ3 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:28377
            "Anolis carolinensis" [GO:0001502 "cartilage condensation"
            evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
            reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=ISS] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK Ensembl:ENSACAT00000015364 Uniprot:G1KQZ3
        Length = 878

 Score = 147 (56.8 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
 Identities = 59/223 (26%), Positives = 93/223 (41%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMESGGATFVNAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN+    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   +G     + +++++       GL  +   + A D    Y TD+ T +++   
Sbjct:   154 P--GWREWVGLIKNSR-FYNYTVCRN---GLKEKHGFDYAKD----YFTDLITNDSIHYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    Y N  +HI
Sbjct:   204 KMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245

 Score = 60 (26.2 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
 Identities = 25/131 (19%), Positives = 53/131 (40%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+S+ ++   L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLLSVDDSMERLYHMLVETGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   +V++  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSVVSQIVLNI-DLAPTVLDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DMDGKSVLKLL 389

 Score = 53 (23.7 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
 Identities = 23/112 (20%), Positives = 41/112 (36%)

Query:   402 NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNP---KYENRYEN 458
             + IL     T   +S    N     +    +Y      R        NP   KY+ R+ +
Sbjct:   480 SDILAIRKRTRSIHSQGYNNQENECDCREADYRSSRTQRKNQRAFMRNPSMPKYKPRFVH 539

Query:   459 GTHEYNGPKNENTNPRYE-NGTHEYNIPRLENSINGNGTSENRSNDNSYQNE 509
              T +      E     Y+ N   E +IP+ ++ +  +G+     +DN  Q +
Sbjct:   540 -TRQTRSLSVEFEGEIYDINLEEELHIPQPKSIVKRHGSYSEEDDDNEDQEQ 590

 Score = 47 (21.6 bits), Expect = 4.8e-06, Sum P(2) = 4.8e-06
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>UNIPROTKB|Q90XB6 [details] [associations]
            symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9091
            "Coturnix coturnix" [GO:0001502 "cartilage condensation"
            evidence=IDA] [GO:0001822 "kidney development" evidence=ISS]
            [GO:0001937 "negative regulation of endothelial cell proliferation"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus"
            evidence=ISS] [GO:0007155 "cell adhesion" evidence=IDA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
            [GO:0009986 "cell surface" evidence=ISS;IDA] [GO:0014846 "esophagus
            smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
            regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=ISS]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=IDA] [GO:0030336 "negative regulation of cell migration"
            evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
            pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
            factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
            joint morphogenesis" evidence=IDA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
            "vascular endothelial growth factor receptor signaling pathway"
            evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
            cell proliferation" evidence=IDA] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060070 "canonical Wnt receptor
            signaling pathway" evidence=IDA] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
            GO:GO:0005795 GO:GO:0005509 GO:GO:0045121 GO:GO:0030336
            GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
            GO:GO:0001502 GO:GO:0060348 GO:GO:0060070 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
            GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
            GO:GO:0008449 GO:GO:0030201 EMBL:AF410802 ProteinModelPortal:Q90XB6
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
            InterPro:IPR024609 Pfam:PF12548 Uniprot:Q90XB6
        Length = 867

 Score = 150 (57.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
 Identities = 59/223 (26%), Positives = 94/223 (42%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN+    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   +G    +  +++++       G   +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWVGL-VKNSRFYNYTISRN---GNKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q  + Y N  +HI
Sbjct:   204 RMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSELYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
 Identities = 25/131 (19%), Positives = 53/131 (40%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+S+ ++ + L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   +V +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DMDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSELYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>UNIPROTKB|E1BRF7 [details] [associations]
            symbol:SULF1 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003094
            "glomerular filtration" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0001502 "cartilage
            condensation" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
            evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=ISS]
            [GO:0005794 "Golgi apparatus" evidence=ISS] [GO:0001937 "negative
            regulation of endothelial cell proliferation" evidence=ISS]
            [GO:0016525 "negative regulation of angiogenesis" evidence=ISS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISS] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISS] [GO:0030336 "negative regulation of cell
            migration" evidence=ISS] [GO:0030513 "positive regulation of BMP
            signaling pathway" evidence=ISS] [GO:0048010 "vascular endothelial
            growth factor receptor signaling pathway" evidence=ISS] [GO:0004065
            "arylsulfatase activity" evidence=ISS] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005783
            "endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0007155
            "cell adhesion" evidence=ISS] [GO:0048661 "positive regulation of
            smooth muscle cell proliferation" evidence=ISS] [GO:0014846
            "esophagus smooth muscle contraction" evidence=ISS] [GO:0048706
            "embryonic skeletal system development" evidence=ISS] [GO:0001822
            "kidney development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
            "negative regulation of prostatic bud formation" evidence=ISS]
            InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
            GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
            GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
            GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
            GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
            OMA:SVRVTHK EMBL:AADN02048527 EMBL:AADN02048528 EMBL:AADN02048529
            EMBL:AADN02048530 EMBL:AADN02048531 EMBL:AADN02048532
            EMBL:AADN02048533 EMBL:AADN02048534 EMBL:AADN02048535
            EMBL:AADN02048536 EMBL:AADN02048537 EMBL:AADN02048538
            EMBL:AADN02048539 EMBL:AADN02048540 EMBL:AADN02048541
            IPI:IPI00571776 ProteinModelPortal:E1BRF7
            Ensembl:ENSGALT00000018383 ArrayExpress:E1BRF7 Uniprot:E1BRF7
        Length = 868

 Score = 150 (57.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
 Identities = 59/223 (26%), Positives = 94/223 (42%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN+    E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   +G    +  +++++       G   +   + A D    Y TD+ T E+++  
Sbjct:   154 P--GWREWVGL-VKNSRFYNYTISRN---GNKEKHGFDYAKD----YFTDLITNESINYF 203

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q  + Y N  +HI
Sbjct:   204 RMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSELYPNASQHI 245

 Score = 62 (26.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
 Identities = 25/131 (19%), Positives = 53/131 (40%)

Query:   267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
             L IH    +  + K    L  +D+S+ ++ + L +   L N+ I++ +D           
Sbjct:   268 LPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADHGYHIGQFGLV 327

Query:   327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
                + P        ++  +R    I  P +E   +V +  +++ D  PT+L  A     P
Sbjct:   328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378

Query:   387 NYVNSTVENII 397
             +    +V  ++
Sbjct:   379 DMDGKSVLKLL 389

 Score = 47 (21.6 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
 Identities = 10/42 (23%), Positives = 18/42 (42%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
             +G  +  P++   Y N +    P Y    N  ++   +Y GP
Sbjct:   225 HGPEDSAPQFSELYPNASQHITPSYNYAPNMDKHWIMQYTGP 266


>DICTYBASE|DDB_G0281179 [details] [associations]
            symbol:clkA "protein kinase, CMGC group"
            species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
            "ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
            kinase activity" evidence=IEA;ISS] [GO:0004672 "protein kinase
            activity" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
            activity" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000719 InterPro:IPR008271
            InterPro:IPR011009 InterPro:IPR017441 Pfam:PF00069 PROSITE:PS00107
            PROSITE:PS00108 PROSITE:PS50011 dictyBase:DDB_G0281179
            GO:GO:0005524 GenomeReviews:CM000152_GR eggNOG:COG0515
            SUPFAM:SSF56112 EMBL:AAFI02000040 GO:GO:0004674 KO:K08287
            HSSP:P49761 RefSeq:XP_640867.1 ProteinModelPortal:Q54UA9
            EnsemblProtists:DDB0230105 GeneID:8622923 KEGG:ddi:DDB_G0281179
            OMA:ICNENDY ProtClustDB:CLSZ2846791 Uniprot:Q54UA9
        Length = 932

 Score = 162 (62.1 bits), Expect = 6.5e-08, P = 6.5e-08
 Identities = 48/165 (29%), Positives = 71/165 (43%)

Query:   409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
             N ++ YN     N+N  Y N  + YN  Y N   N ++ +N  Y N   N  + YN   N
Sbjct:   369 NNSNNYNHNN-SNNNGGYNNYNNGYN-NYNNNNSNNSN-HNSSYNN---NNNNNYNNNNN 422

Query:   469 ENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
              N N    N  +  N     N+ N N  + N +N+N+  N  +  ++    S N  S  N
Sbjct:   423 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNI----SNN--SNNN 476

Query:   529 TILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
                +N D++   S    G +     NS N N  + N +N NSY N
Sbjct:   477 NFNYNNDNDRNNS---NGNYN---NNSSNINNNNNNNNNSNSYHN 515

 Score = 146 (56.5 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 50/202 (24%), Positives = 79/202 (39%)

Query:   378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
             S  N ++  NY NS   N    Y +    Y +  + YN+    N+N    N ++  +   
Sbjct:   216 SVNNNNNNRNYSNSYNNN---NYNDGNNNYNSNNYNYNNNNNNNNNINNNNNSNSNSNSN 272

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
              N   N     N    N Y N  + YN  K+ N   RY +           N+ N N   
Sbjct:   273 SNSNSNSNSNSNSN-NNNYNN--YGYNNHKSNNGGNRYSDDDDNVFNNNNNNNNNNNNNY 329

Query:   498 ENRSNDNSYQNEIDGIDVW--SVLSRNEPSKRNTIL---HNIDDEWQISALTRGKWKLVK 552
              N +++N+Y N+ D  D    ++ SRN  +  N      +N ++    ++   G +    
Sbjct:   330 NNYNSNNNYNNDYDYNDGKRANIYSRNNSNNNNNSKSGNNNSNNYNHNNSNNNGGYNNYN 389

Query:   553 ENSINGNGTSENRSNDNS-YQN 573
                 N N  + N SN NS Y N
Sbjct:   390 NGYNNYNNNNSNNSNHNSSYNN 411

 Score = 137 (53.3 bits), Expect = 3.2e-05, P = 3.2e-05
 Identities = 49/201 (24%), Positives = 89/201 (44%)

Query:   381 NKSDIPNYVNSTVENIIPR-YENSILRYENGTHEYNSPRIENSNTR--YENGTHEYNPKY 437
             N ++I N  NS   N      EN+  + EN +++  +    +S  R   +N  H  N   
Sbjct:    99 NNNNINNNGNSNNNNNNSNGSENNYFQSENQSNKDQNSYFNSSYLRNPVDNYNHNNNNHN 158

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRL--EN---SIN 492
              N ++N  +      +  Y+N  +  +   N+N N   +    +Y+I ++  EN   S+N
Sbjct:   159 NNAFDNNNYNTQNLGDYSYKNDGYNNDNNNNDNNNSYGDTDREKYSIEKICNENDYDSVN 218

Query:   493 GNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
              N  + N SN  +  N  DG + ++  + N  +  N   +NI++    ++ +        
Sbjct:   219 NNNNNRNYSNSYNNNNYNDGNNNYNSNNYNYNNNNNNN-NNINNNNNSNSNSNSN----- 272

Query:   553 ENSINGNGTSENRSNDNSYQN 573
              NS N N  S + SN+N+Y N
Sbjct:   273 SNS-NSNSNSNSNSNNNNYNN 292

 Score = 135 (52.6 bits), Expect = 5.2e-05, P = 5.2e-05
 Identities = 39/168 (23%), Positives = 70/168 (41%)

Query:   407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
             Y N  + YN+ +  N   RY +          N   N  + YN    N   N  ++YN  
Sbjct:   290 YNN--YGYNNHKSNNGGNRYSDDDDNVFNNNNNNNNNNNNNYNNYNSNNNYNNDYDYNDG 347

Query:   467 KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS-YQNEIDGIDVWSVLSRNEPS 525
             K  N   R  N ++  N  +  N+ N N  + N SN+N  Y N  +G + ++  + N  +
Sbjct:   348 KRANIYSR--NNSNNNNNSKSGNN-NSNNYNHNNSNNNGGYNNYNNGYNNYNNNNSNNSN 404

Query:   526 KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
               ++  +N ++ +  +            N+ N N  + N +N+N+  N
Sbjct:   405 HNSSYNNNNNNNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 452

 Score = 130 (50.8 bits), Expect = 0.00018, P = 0.00018
 Identities = 49/213 (23%), Positives = 85/213 (39%)

Query:   387 NYVNSTVENIIPRYEN-SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY---ENRYE 442
             N+ N+  +N     +N     Y+N  +  ++   +N+N+  +    +Y+ +    EN Y+
Sbjct:   156 NHNNNAFDNNNYNTQNLGDYSYKNDGYNNDNNNNDNNNSYGDTDREKYSIEKICNENDYD 215

Query:   443 N-GTHEYNPKYENRYENGTHEYNGPKNENTNP-RYENGTHEYNIPRLENSINGNGTSENR 500
             +   +  N  Y N Y N  +  +G  N N+N   Y N  +  N     N+ N N  S + 
Sbjct:   216 SVNNNNNNRNYSNSYNNNNYN-DGNNNYNSNNYNYNNNNNNNNNINNNNNSNSNSNSNSN 274

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             SN NS  N     + ++    N     N      DD+  +             N+ N   
Sbjct:   275 SNSNSNSNSNSNNNNYNNYGYNNHKSNNGGNRYSDDDDNVFNNNNNN-NNNNNNNYNNYN 333

Query:   561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
             ++ N +ND  Y    DG    ++ SRN  +  N
Sbjct:   334 SNNNYNNDYDYN---DGKRA-NIYSRNNSNNNN 362

 Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
 Identities = 39/169 (23%), Positives = 59/169 (34%)

Query:   407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
             Y N  + Y +    N N  Y N  H  N  Y N   N        + N   N  +  N  
Sbjct:    16 YSNNDYGYYNNNCSNVN--YNNDIHYKNNNYNNNNNNNNSNSGNNFNNNNNNNNNNNNNN 73

Query:   467 KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 526
              N N N    N  + Y      NS N N    N  N N+  N  +G +     S N+ +K
Sbjct:    74 NNNNNNNN-NNNNYTYGNNNNNNSNNNNNNINNNGNSNNNNNNSNGSENNYFQSENQSNK 132

Query:   527 -RNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
              +N+  ++      +             N+ + N  +     D SY+N+
Sbjct:   133 DQNSYFNSSYLRNPVDNYNHNN-NNHNNNAFDNNNYNTQNLGDYSYKND 180

 Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
 Identities = 49/196 (25%), Positives = 73/196 (37%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N S+  N+ +S   N    Y N+     N  +  N+    N+N    N  +  N    N 
Sbjct:   398 NNSNNSNHNSSYNNNNNNNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 457

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
               N  +  N    N   N    YN   + N +    NG   YN     NS N N  + N 
Sbjct:   458 NNNNNNNNNNNISNNSNNNNFNYNNDNDRNNS----NGN--YN----NNSSNINNNNNNN 507

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             +N NSY N           S+N  S +N    N +++ Q +A            S N   
Sbjct:   508 NNSNSYHNSCISYSNGGSNSKN--SNKN----NYNNQ-QSNANGNHVGNSKNNESCNNTN 560

Query:   561 TSENRSNDNSYQNEID 576
             T+  +SN + + +E D
Sbjct:   561 TNIEKSNKSMWDDEND 576

 Score = 127 (49.8 bits), Expect = 0.00038, P = 0.00038
 Identities = 43/194 (22%), Positives = 75/194 (38%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N ++I N  NS   +      NS     + ++  N      +N +  NG + Y+   +N 
Sbjct:   255 NNNNINNNNNSNSNSNSNSNSNSNSNSNSNSNNNNYNNYGYNNHKSNNGGNRYSDDDDNV 314

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
             + N  +  N    N Y N    YN   N N +  Y +G    NI    NS N N +    
Sbjct:   315 FNNNNNNNNNN-NNNYNN----YNSNNNYNNDYDYNDGKRA-NIYSRNNSNNNNNSKSGN 368

Query:   501 SNDNSYQ-NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
             +N N+Y  N  +    ++  +    +  N   +N +     +      +     N+ N N
Sbjct:   369 NNSNNYNHNNSNNNGGYNNYNNGYNNYNNNNSNNSNHNSSYNNNNNNNYNNNNNNNNNNN 428

Query:   560 GTSENRSNDNSYQN 573
               + N +N+N+  N
Sbjct:   429 NNNNNNNNNNNNNN 442

 Score = 127 (49.8 bits), Expect = 0.00038, P = 0.00038
 Identities = 52/214 (24%), Positives = 78/214 (36%)

Query:   388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
             Y N+   N+   Y N I  Y+N  + YN+    N++    N  +  N    N   N  + 
Sbjct:    23 YYNNNCSNV--NYNNDI-HYKN--NNYNNNNNNNNSNSGNNFNNNNNNNNNNNNNNNNNN 77

Query:   448 YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN--DNS 505
              N    N Y  G +  N   N N N    NG    N      S N    SEN+SN   NS
Sbjct:    78 NNNNNNNNYTYGNNNNNNSNNNNNNIN-NNGNSNNNNNNSNGSENNYFQSENQSNKDQNS 136

Query:   506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
             Y N     +     + N  +  N    N  + +    L  G +    +   N N    N 
Sbjct:   137 YFNSSYLRNPVDNYNHNNNNHNNNAFDN--NNYNTQNL--GDYSYKNDGYNNDNN---NN 189

Query:   566 SNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHN 598
              N+NSY   + +   +  + + N+    N   +N
Sbjct:   190 DNNNSYGDTDREKYSIEKICNENDYDSVNNNNNN 223


>UNIPROTKB|I3L2L4 [details] [associations]
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
            EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH Ensembl:ENST00000570427
            Bgee:I3L2L4 Uniprot:I3L2L4
        Length = 188

 Score = 135 (52.6 bits), Expect = 6.7e-08, Sum P(2) = 6.7e-08
 Identities = 42/136 (30%), Positives = 72/136 (52%)

Query:    41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             P+    +++ V  +  + P + + +LADD G+   G +    I TP++DALA   ++ +N
Sbjct:     4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62

Query:   101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYR 155
              +T V  C+PSR++++TG  P H     N +YG  +     +  +K+  LP  L + G R
Sbjct:    63 AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117

Query:   156 TR------IVGKWHLG 165
             T       I+GK H+G
Sbjct:   118 TGLSSRPGIIGKKHVG 133

 Score = 37 (18.1 bits), Expect = 6.7e-08, Sum P(2) = 6.7e-08
 Identities = 7/14 (50%), Positives = 8/14 (57%)

Query:   239 DEPLFLYLAHAATH 252
             D P FLY+A    H
Sbjct:   174 DRPFFLYVAFHDPH 187


>RGD|1305877 [details] [associations]
            symbol:Gns "glucosamine (N-acetyl)-6-sulfatase" species:10116
            "Rattus norvegicus" [GO:0005539 "glycosaminoglycan binding"
            evidence=IPI] [GO:0005764 "lysosome" evidence=IDA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
            [GO:0042340 "keratan sulfate catabolic process" evidence=IDA]
            [GO:0043199 "sulfate binding" evidence=IPI] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877
            GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 PROSITE:PS00149 GO:GO:0042340 GO:GO:0043199
            GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239 HOVERGEN:HBG005840
            KO:K01137 GO:GO:0008449 PANTHER:PTHR10342:SF5 UniGene:Rn.228654
            EMBL:BC087741 IPI:IPI00951484 RefSeq:NP_001011989.1 IntAct:Q5M918
            STRING:Q5M918 Ensembl:ENSRNOT00000064349 GeneID:299825
            KEGG:rno:299825 InParanoid:Q5M918 NextBio:645846
            Genevestigator:Q5M918 Uniprot:Q5M918
        Length = 519

 Score = 156 (60.0 bits), Expect = 1.2e-07, P = 1.2e-07
 Identities = 69/235 (29%), Positives = 104/235 (44%)

Query:    29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
             G  +R+ A  +LPL   LS   + LV ++  P+++ +L DD    D    G+   P    
Sbjct:    12 GCPSRLPALLLLPL---LSGC-LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62

Query:    89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
              AL    G+   + Y    LC PSR++I+TGK+P +  + +N L G C  +    + E  
Sbjct:    63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPY 122

Query:   145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
               P  LK + GY+T   GK     Y  EY  P   G E   LG  YW   +    +    
Sbjct:   123 TFPAILKLVCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYT 177

Query:   199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
             + + G   R     + D    Y TDV    ++D +   S  EP F+ ++  A HS
Sbjct:   178 LSINGKARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228


>TIGR_CMR|CPS_2358 [details] [associations]
            symbol:CPS_2358 "sulfatase family protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
            GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            HOGENOM:HOG000014304 RefSeq:YP_269076.1 ProteinModelPortal:Q482E2
            STRING:Q482E2 GeneID:3518855 KEGG:cps:CPS_2358 PATRIC:21467803
            OMA:ETIRIDS BioCyc:CPSY167879:GI48-2421-MONOMER Uniprot:Q482E2
        Length = 499

 Score = 135 (52.6 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 39/103 (37%), Positives = 53/103 (51%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALA-YSGIILKNYYTVQLCTPSRSAIMTGK 118
             P+I+FI  DDL    +  +G  ++ TPNID LA  S +  + Y    +C PSR +I+TG 
Sbjct:    53 PNILFIAVDDLK-PLIRDYGTAKVQTPNIDKLASQSTVFTRAYSQYPVCGPSRMSILTGL 111

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
              P   G+ +  L    R   P S   LPQ+ K  GY T   GK
Sbjct:   112 RPESNGIMN--LKDKIRDVNP-SVITLPQFFKNNGYETAATGK 151

 Score = 65 (27.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 26/105 (24%), Positives = 45/105 (42%)

Query:   219 KYSTDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEPLQAPDHYLNIH------- 270
             KY  D+++ E+ D+    S  E     YL H        Y+P       +  +       
Sbjct:   239 KYY-DLYSRESFDLASYQSAPEDADTTYLFHK-NQELRGYKPTPIKGGEIKPYPKGKLSS 296

Query:   271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
              H ++     FA++   +D  VG+++E LE+     N++IVF  D
Sbjct:   297 AHQKELLHGYFASVSF-IDSLVGELLEELEKTGQAENTVIVFWGD 340

 Score = 45 (20.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   677 LFDIKNDPCEKNNLADRSE 695
             L+D+ NDP E  N+ +  E
Sbjct:   458 LYDLINDPLETKNIINTPE 476

 Score = 45 (20.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   783 LFDIKNDPCEKNNLADRSE 801
             L+D+ NDP E  N+ +  E
Sbjct:   458 LYDLINDPLETKNIINTPE 476


>UNIPROTKB|I3L643 [details] [associations]
            symbol:GNS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
            PANTHER:PTHR10342:SF5 EMBL:AEMK01192095 EMBL:FP700150
            Ensembl:ENSSSCT00000032527 OMA:FARAFAN Uniprot:I3L643
        Length = 369

 Score = 146 (56.5 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
 Identities = 63/229 (27%), Positives = 98/229 (42%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRS 112
             A S  P+++ +LADD    D    G+   P     AL    G+   + Y    LC PSR+
Sbjct:    42 ADSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRA 96

Query:   113 AIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYK 168
             +I+TGK+P +  + +N L G C  +    + E    P  L+ + GY+T   GK     Y 
Sbjct:    97 SILTGKYPHNHHVVNNTLEGNCSSKSWQKIEEPNTFPAILRSVCGYQTFFAGK-----YL 151

Query:   169 KEYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
              EY     G  +H  LG  YW   +    +    + + G   +     + D    Y TDV
Sbjct:   152 NEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDV 207

Query:   225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                 ++D +   S  EP F+ ++  A HS     P  A   Y N  +++
Sbjct:   208 LANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNTFQNV 251

 Score = 52 (23.4 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   290 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 326

 Score = 40 (19.1 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query:   450 PKYENRYENGTHEYNGPKNENTN 472
             P+Y+N ++N       P+N+N N
Sbjct:   242 PQYQNTFQN----VFAPRNKNFN 260

 Score = 40 (19.1 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
 Identities = 18/62 (29%), Positives = 26/62 (41%)

Query:   516 WSVLSRNEPSKRNTI--LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDN 569
             W +     P   ++I  L N     WQ + ++     KLVK    NG  N T    ++DN
Sbjct:   268 WLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDN 327

Query:   570 SY 571
              Y
Sbjct:   328 GY 329


>UNIPROTKB|Q32KJ5 [details] [associations]
            symbol:Gns "Glucosamine (N-acetyl)-6-sulfatase"
            species:10116 "Rattus norvegicus" [GO:0005764 "lysosome"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic
            process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012251
            InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877 GO:GO:0005764
            Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0030203
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:BN000742
            IPI:IPI00366226 RefSeq:XP_003750373.1 UniGene:Rn.228654
            IntAct:Q32KJ5 STRING:Q32KJ5 Ensembl:ENSRNOT00000006566
            GeneID:100909505 KEGG:rno:100909505 InParanoid:Q32KJ5
            Genevestigator:Q32KJ5 Uniprot:Q32KJ5
        Length = 544

 Score = 156 (60.0 bits), Expect = 1.3e-07, P = 1.3e-07
 Identities = 69/235 (29%), Positives = 104/235 (44%)

Query:    29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
             G  +R+ A  +LPL   LS   + LV ++  P+++ +L DD    D    G+   P    
Sbjct:    12 GCPSRLPALLLLPL---LSGC-LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62

Query:    89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
              AL    G+   + Y    LC PSR++I+TGK+P +  + +N L G C  +    + E  
Sbjct:    63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPY 122

Query:   145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
               P  LK + GY+T   GK     Y  EY  P   G E   LG  YW   +    +    
Sbjct:   123 TFPAILKLVCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYT 177

Query:   199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
             + + G   R     + D    Y TDV    ++D +   S  EP F+ ++  A HS
Sbjct:   178 LSINGKARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228


>DICTYBASE|DDB_G0287057 [details] [associations]
            symbol:gtaN "GATA zinc finger domain-containing
            protein 14" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679 Pfam:PF00320
            PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401
            dictyBase:DDB_G0287057 GenomeReviews:CM000153_GR GO:GO:0046872
            GO:GO:0043565 GO:GO:0008270 GO:GO:0003700 eggNOG:COG5641
            EMBL:AAFI02000096 HSSP:P17679 RefSeq:XP_637400.1
            ProteinModelPortal:Q54KX0 EnsemblProtists:DDB0220469 GeneID:8625931
            KEGG:ddi:DDB_G0287057 OMA:GANEDHL Uniprot:Q54KX0
        Length = 953

 Score = 159 (61.0 bits), Expect = 1.4e-07, P = 1.4e-07
 Identities = 56/229 (24%), Positives = 104/229 (45%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             NK++  N  N+   N    + N+   + N  + +N+    N+   + N  + +N    N 
Sbjct:   448 NKNNHNNNHNNNNHNN-NNHNNNNNNHNNNNNNHNNNNNHNNQNNHNNQNNNHNNNQNNN 506

Query:   441 YENG-THEYNPK-YENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
             Y N   + YNP  Y N Y N  + YN   N N NP   N  + +N     N+ N N  + 
Sbjct:   507 YNNNQNNNYNPNNYGNNY-NPNNNYN---NSN-NPNNMNNNYNHN-QNNNNNNNNNNQNY 560

Query:   499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
             N +++N + N+ + I   S  ++N  ++ N   HN +++   +          + N+ N 
Sbjct:   561 NNNHNNQFNNQNNQIHNQSN-NQNNYNQNNN--HNNNNQNNNNNNQNNNNNNNQNNNNNN 617

Query:   559 NGTSENRSNDNSYQNEIDGIDVWSVLSRNE-P--SKRNTIL-HNIDDEW 603
             N  + N +N+N+  N   G+   +  S++  P  S  N+ L +N ++E+
Sbjct:   618 NNINNNNNNNNNNNNGNTGLSSSTNNSKHSSPRSSPNNSPLNYNTNEEY 666

 Score = 145 (56.1 bits), Expect = 4.5e-06, P = 4.5e-06
 Identities = 48/197 (24%), Positives = 77/197 (39%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYE-NGTHEYNPKY-E 438
             N ++  N  NS + N      NS    +N  +  N+  + + N+    NG +    K  E
Sbjct:   282 NNNNNNNSSNSNINNNNNNSNNSNNNIDNSNNNNNNNNVRSGNSNVNANGHNRLKRKSKE 341

Query:   439 NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYEN--GTHEYNIPRLENSINGNGT 496
             N Y N     N +  N+  N  + +N   N N N    N   T++ NI    N  N N  
Sbjct:   342 NIYNNNNQNNNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNH 401

Query:   497 SENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
               N + +N+YQN  +     S  + N+    N    N +      +  + K      N  
Sbjct:   402 QNNNNQNNNYQNNNNQN---SGNNNNQNHHNNKFNQNNNHNQNNHSNNQNK-NNHNNNHN 457

Query:   557 NGNGTSENRSNDNSYQN 573
             N N  + N +N+N+  N
Sbjct:   458 NNNHNNNNHNNNNNNHN 474

 Score = 141 (54.7 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 45/214 (21%), Positives = 88/214 (41%)

Query:   387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEY--NPKYENRYENG 444
             N  N+   NI    +N+I    N  +  N     N N  Y+N  ++   N   +N + N 
Sbjct:   372 NNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNNYQNNNNQNSGNNNNQNHHNNK 431

Query:   445 THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
              ++ N   +N + N  ++ N   N N N  + N  H  N     N+ N +  + N +N N
Sbjct:   432 FNQNNNHNQNNHSNNQNKNNHNNNHNNN-NHNNNNHNNNNNNHNNNNNNHNNNNNHNNQN 490

Query:   505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
             ++ N+ +  +     + N     N   +N  + +  +            N++N N  + N
Sbjct:   491 NHNNQNNNHNNNQNNNYNNNQNNNYNPNNYGNNYNPNNNYNNS---NNPNNMNNN-YNHN 546

Query:   565 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
             ++N+N+  N       ++    N+ + +N  +HN
Sbjct:   547 QNNNNNNNNNNQN---YNNNHNNQFNNQNNQIHN 577

 Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 54/248 (21%), Positives = 94/248 (37%)

Query:   354 PLLESRGIVAEQYVHVSDWLPTLLSAANK--SDIPNYVNSTVENIIPRYENSILRYENGT 411
             P++ ++ I+       S  L  + S  N    D PN  N+T  N    +  S     + T
Sbjct:   163 PIINTKSIIPSASQLQSQNLNIINSINNNFSKDSPNSQNNTSFNEDTIFIASTTYGSSNT 222

Query:   412 HEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
                N+  I N+N+   N ++  N    N   N  +  N    N + N  +  N   N N 
Sbjct:   223 PNNNNNNINNNNSNNNNNSNNSNNN-NNSTNNNNNSSNINSPNDFNNNHNNNNNNNNNNN 281

Query:   472 NPRYENGTHEYNIPRLEN-SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 530
             N    N +   NI    N S N N   +N +N+N+  N   G    +    N   +++  
Sbjct:   282 NNNNNNNSSNSNINNNNNNSNNSNNNIDNSNNNNNNNNVRSGNSNVNANGHNRLKRKSK- 340

Query:   531 LHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 590
               NI   +  +          + N+ N N  + + +N N+ QN I   +  ++ + +   
Sbjct:   341 -ENI---YNNNNQNNNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQ 396

Query:   591 KRNTILHN 598
               N   +N
Sbjct:   397 NNNNHQNN 404

 Score = 135 (52.6 bits), Expect = 5.4e-05, P = 5.4e-05
 Identities = 52/207 (25%), Positives = 85/207 (41%)

Query:   381 NKSDIPNYVNSTVENIIPR-YENSIL---RYENGTHEYNSPRIENSNTRYENGTHEYNPK 436
             N +   NY N+   N  P  Y N+      Y N  +  N     N N    N  +  N  
Sbjct:   500 NNNQNNNYNNNQNNNYNPNNYGNNYNPNNNYNNSNNPNNMNNNYNHNQNNNNNNNNNNQN 559

Query:   437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPR--LENSINGN 494
             Y N + N  +  N +  N+  N  + YN   N N N +  N  ++ N       N+ N N
Sbjct:   560 YNNNHNNQFNNQNNQIHNQ-SNNQNNYNQNNNHNNNNQNNNNNNQNNNNNNNQNNNNNNN 618

Query:   495 GTSENRSNDNSYQNEIDGIDVWSVLSRNE-P--SKRNTIL-HNIDDEWQISALTRGKWKL 550
               + N +N+N+  N   G+   +  S++  P  S  N+ L +N ++E+  S  +      
Sbjct:   619 NINNNNNNNNNNNNGNTGLSSSTNNSKHSSPRSSPNNSPLNYNTNEEYYNSGSSSPSSPG 678

Query:   551 VKENSI----NGNGTSENRSNDNSYQN 573
                +SI    +GN    N++N N+  N
Sbjct:   679 SPNSSILQITDGNNGFNNQNNLNNGNN 705

 Score = 134 (52.2 bits), Expect = 6.9e-05, P = 6.9e-05
 Identities = 44/218 (20%), Positives = 90/218 (41%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N +   N  N+   N    + N+    +N     N   I+N++ +  N  H+ N    N 
Sbjct:   351 NNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNN 410

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
             Y+N  ++ N    N   +  +++N   N N N  + N  ++ N     N+ N N  + N 
Sbjct:   411 YQNNNNQ-NSGNNNNQNHHNNKFNQNNNHNQN-NHSNNQNKNNHNNNHNNNNHNNNNHNN 468

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             +N+N   N  +  +  +  ++N  + +N   HN +            +   + N+ N N 
Sbjct:   469 NNNNHNNNNNNHNNNNNHNNQNNHNNQNNN-HNNNQN--------NNYNNNQNNNYNPNN 519

Query:   561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
                N + +N+Y N  +  ++ +  + N+ +  N   +N
Sbjct:   520 YGNNYNPNNNYNNSNNPNNMNNNYNHNQNNNNNNNNNN 557

 Score = 129 (50.5 bits), Expect = 0.00024, P = 0.00024
 Identities = 47/196 (23%), Positives = 76/196 (38%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N +   NY N+  +N      N+   + N     N+   +N+++  +N  +  N    N 
Sbjct:   404 NNNQNNNYQNNNNQN---SGNNNNQNHHNNKFNQNNNHNQNNHSNNQNKNNHNNNHNNNN 460

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
             + N  H  N    N   N  H  N   N   N   +N  H  N    +N+ N N    N 
Sbjct:   461 HNNNNHNNNNNNHNN-NNNNHNNNNNHNNQNNHNNQNNNHNNN----QNN-NYNNNQNNN 514

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING-N 559
              N N+Y N  +  + ++  S N  +  N   HN ++    +      +     N  N  N
Sbjct:   515 YNPNNYGNNYNPNNNYNN-SNNPNNMNNNYNHNQNNN-NNNNNNNQNYNNNHNNQFNNQN 572

Query:   560 GTSENRSND-NSY-QN 573
                 N+SN+ N+Y QN
Sbjct:   573 NQIHNQSNNQNNYNQN 588

 Score = 124 (48.7 bits), Expect = 0.00082, P = 0.00082
 Identities = 41/194 (21%), Positives = 80/194 (41%)

Query:   380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN 439
             +N ++  N V S   N+     N + R ++  + YN+   +N+N +  N  + +N  + N
Sbjct:   311 SNNNNNNNNVRSGNSNVNANGHNRLKR-KSKENIYNNNN-QNNNNQNNNQNNNHNNNHNN 368

Query:   440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
              + N  +      +N  +N     +  +N N +    N  + Y     +NS  GN  ++N
Sbjct:   369 NHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNNYQNNNNQNS--GNNNNQN 426

Query:   500 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
               N+   QN     +  S  ++N+ +  N   HN ++    +            N+ N N
Sbjct:   427 HHNNKFNQNNNHNQNNHSN-NQNKNNHNNN--HNNNNHNNNNHNNNNNNHNNNNNNHNNN 483

Query:   560 GTSENRSNDNSYQN 573
                 N++N N+  N
Sbjct:   484 NNHNNQNNHNNQNN 497


>DICTYBASE|DDB_G0288501 [details] [associations]
            symbol:ddx42 "DEAD/DEAH box helicase" species:44689
            "Dictyostelium discoideum" [GO:0008026 "ATP-dependent helicase
            activity" evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA]
            [GO:0004386 "helicase activity" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000629
            InterPro:IPR001650 InterPro:IPR011545 Pfam:PF00270 Pfam:PF00271
            PROSITE:PS00039 PROSITE:PS51194 SMART:SM00490
            dictyBase:DDB_G0288501 GO:GO:0005524 GO:GO:0005634
            GenomeReviews:CM000154_GR GO:GO:0003723 InterPro:IPR014001
            SMART:SM00487 PROSITE:PS51192 EMBL:AAFI02000112 GO:GO:0008026
            eggNOG:COG0513 InterPro:IPR014014 PROSITE:PS51195 HSSP:P09052
            KO:K12835 RefSeq:XP_636700.1 ProteinModelPortal:Q54IV3
            STRING:Q54IV3 EnsemblProtists:DDB0233432 GeneID:8626657
            KEGG:ddi:DDB_G0288501 OMA:DRDKRGG Uniprot:Q54IV3
        Length = 986

 Score = 159 (61.0 bits), Expect = 1.5e-07, P = 1.5e-07
 Identities = 54/202 (26%), Positives = 82/202 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             NK    N  +S   N I    NS     N T+   +    NSN+      + YN  Y+N 
Sbjct:   774 NKFSNNNSGSSNDRNSINYRNNSFNNNSNNTNNSGNSNFNNSNSNNGYSNNNYNNNYKNN 833

Query:   441 --YENGTHEYNPKYENRYENGTHE--YNGPKNENTNPR--YENGTHE--YNIPRL--ENS 490
               Y N  +  N  Y N   N  +   YN   N N N    Y NG +   YN       N+
Sbjct:   834 SNYNNSNNNNNSYYNNNNSNNNNNSNYNNSSNNNNNNNNNYRNGNNNNNYNNNNYYNNNN 893

Query:   491 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRGKW 548
              N N ++ N SN+NS  N  +  +  +  + N+ S  N  L  ++ ++    S      +
Sbjct:   894 SNNNSSNNNNSNNNSSNNNFNN-NFNNNNNNNDNSNFNRALPFNDFNNNNNNSNNNNFNY 952

Query:   549 KLVKENSINGNGTSENRSNDNS 570
                  NS N N ++  ++N+NS
Sbjct:   953 NNNFNNSYNANNSNHYKNNNNS 974

 Score = 149 (57.5 bits), Expect = 1.7e-06, P = 1.7e-06
 Identities = 48/177 (27%), Positives = 71/177 (40%)

Query:   402 NSILRYENGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGT 460
             NSI  Y N +   NS    NS N+ + N     N  Y N   N  ++ N  Y N   N  
Sbjct:   788 NSI-NYRNNSFNNNSNNTNNSGNSNFNNSNS--NNGYSNNNYNNNYKNNSNYNNSNNNNN 844

Query:   461 HEYNGPK-NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
               YN    N N N  Y N ++  N     N  NGN  + N +N+N Y N     +  +  
Sbjct:   845 SYYNNNNSNNNNNSNYNNSSNNNNNNN-NNYRNGNNNN-NYNNNNYYNNNNSNNNSSNNN 902

Query:   520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
             + N  S  N   +N ++    +  +    + +  N  N N  + N +N N Y N  +
Sbjct:   903 NSNNNSSNNNFNNNFNNNNNNNDNSNFN-RALPFNDFNNNNNNSNNNNFN-YNNNFN 957

 Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 47/200 (23%), Positives = 81/200 (40%)

Query:   401 ENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGT 460
             +NS +  EN     N  +  N+N+   N  +  N +  N + N ++  N    + + N +
Sbjct:   758 DNSEINNENEKSINNENKFSNNNSGSSNDRNSINYR-NNSFNNNSNNTNNSGNSNFNN-S 815

Query:   461 HEYNGPKNENTNPRYENGTHEYNIPRLENSI--NGNGTSENRSNDNSYQNEIDGIDVWSV 518
             +  NG  N N N  Y+N ++  N     NS   N N  + N SN N+  N  +  +    
Sbjct:   816 NSNNGYSNNNYNNNYKNNSNYNNSNNNNNSYYNNNNSNNNNNSNYNNSSNNNNNNNNNYR 875

Query:   519 LSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
                N  +  N   +N ++    S+           N+ N N  + N +NDNS  N     
Sbjct:   876 NGNNNNNYNNNNYYNNNNSNNNSSNNNNSNNNSSNNNFNNNFNNNNNNNDNSNFNRALPF 935

Query:   579 DVWSVLSRNEPSKRNTILHN 598
             + ++  + N  S  N   +N
Sbjct:   936 NDFN--NNNNNSNNNNFNYN 953


>UNIPROTKB|E1BZH8 [details] [associations]
            symbol:SULF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0003094 "glomerular filtration"
            evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005886
            "plasma membrane" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0032836
            "glomerular basement membrane development" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0048706 "embryonic skeletal system development"
            evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
            [GO:0060384 "innervation" evidence=IEA] [GO:0002063 "chondrocyte
            development" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
            GO:GO:0009986 GO:GO:0005509 GO:GO:0010575 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0040037
            GO:GO:0030201 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            OMA:PKYYGQG EMBL:AADN02019298 IPI:IPI00571119
            ProteinModelPortal:E1BZH8 Ensembl:ENSGALT00000007309 Uniprot:E1BZH8
        Length = 879

 Score = 145 (56.1 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
 Identities = 60/222 (27%), Positives = 90/222 (40%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       + + G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEHGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
             + +H    +     C         +I     YL   GYRT   GK+ L  Y   Y P   
Sbjct:   100 Y-VHNHNTYTNNENCSSPSWQAQHEIRTFAVYLNNTGYRTAFFGKY-LNEYNGSYVPP-- 155

Query:   177 GFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD--- 231
             G++  +G     + Y +      +K   G D  RD          Y TD+ T +++    
Sbjct:   156 GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRD----------YLTDLITNDSITFFR 205

Query:   232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
             I        P+ + ++HAA H      P Q    + N  +HI
Sbjct:   206 ISKKMYPHRPVLMVISHAAPHGPEDSAP-QYSHLFPNASQHI 246

 Score = 65 (27.9 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
 Identities = 46/231 (19%), Positives = 89/231 (38%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P + +L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSHLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  +   L +   L N+ I++ +D              + P        +E 
Sbjct:   286 TLMSVDDSMEMIYNTLVETGELDNTYIIYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+  +     +++ D  PT+L  A   DIP+ ++   ++I+   ++ 
Sbjct:   338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDSE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
               R  N  H     ++   +   E G   +  K EN   +   E + PKY+
Sbjct:   394 --RPVNRFHLKKKVKVWRDSFLVERGKLLH--KRENEKVDAQEENFLPKYQ 440

 Score = 43 (20.2 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
 Identities = 19/79 (24%), Positives = 33/79 (41%)

Query:   429 GTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
             GT    PKY NR    + + N + EN Y+     + G + +  + +    ++  N     
Sbjct:   489 GTSNLLPKYYNR---NSEDCNCE-ENEYKLS---HTGRRKKLFSKKKYKPSYARNRSTRS 541

Query:   489 NSINGNGTSENRSNDNSYQ 507
              S+  NG   N   ++ YQ
Sbjct:   542 VSVELNGAMFNLGLEDGYQ 560

 Score = 40 (19.1 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05
 Identities = 13/56 (23%), Positives = 28/56 (50%)

Query:   387 NYVNSTVENIIPRYENSI--LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N VN+   +++ +    +  LR   G  + N PR  N +   ++G++E   +++ R
Sbjct:   802 NAVNTLDRDVLNQLHVQLMELRSCKGYKQCN-PRTRNIDLGLKDGSYEQYRQFQRR 856


>DICTYBASE|DDB_G0275173 [details] [associations]
            symbol:hbx2 "putative homeobox transcription factor"
            species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0007275 "multicellular
            organismal development" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
            InterPro:IPR017970 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
            SMART:SM00389 dictyBase:DDB_G0275173 GO:GO:0007275 GO:GO:0005634
            GO:GO:0043565 GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
            EMBL:AAFI02000013 Gene3D:1.10.10.60 SUPFAM:SSF46689 EMBL:AF036171
            RefSeq:XP_643746.1 ProteinModelPortal:Q869W0
            EnsemblProtists:DDB0185105 GeneID:8619790 KEGG:ddi:DDB_G0275173
            eggNOG:NOG301813 InParanoid:Q869W0 OMA:HAPENIK
            ProtClustDB:CLSZ2846877 Uniprot:Q869W0
        Length = 942

 Score = 158 (60.7 bits), Expect = 1.8e-07, P = 1.8e-07
 Identities = 48/189 (25%), Positives = 77/189 (40%)

Query:   388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH- 446
             Y N+   N      N I   ++   +Y S    N+N  Y NG + YN    N   N  + 
Sbjct:   750 YFNNNNNNNNNNNNNRIS--DSSDDQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFNNNYM 807

Query:   447 -EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
               YN  Y N   N  + YN   N N N  + N  +  N    +N+ N N  ++  +N+ +
Sbjct:   808 NNYNNNYNNNNYNNNNSYN---NSNGNNNFNNNNNNNN----QNNNNNNNNNQYNNNNKN 860

Query:   506 YQNEIDGIDVWSVLSRNEPSKRNTILHN-IDDEWQISALTRGKWKLVKENSINGNGTSEN 564
             Y N I       +    E  +RN++ ++ I + +          K    N+ N NG   N
Sbjct:   861 YLNNIPSSKKHQLQGNYE--RRNSLPNSQIQNNFNGDNNNNNNNKNNNNNNQNNNGNGNN 918

Query:   565 RSNDNSYQN 573
              +N+N+  N
Sbjct:   919 NNNNNNDNN 927

 Score = 145 (56.1 bits), Expect = 4.4e-06, P = 4.4e-06
 Identities = 54/228 (23%), Positives = 96/228 (42%)

Query:   353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTH 412
             +P+LES  +  +Q  +++ +    +   NK+ +P+   S   N      N+     N  +
Sbjct:   592 TPILESLNV--KQQNNINFFKNNNMDNNNKN-VPHLSLSNNNNNNNNNNNN--NNNNNNN 646

Query:   413 EYNSPRIENSNTRYENGTHEYNPKYENRYEN----GTHEYNP---KYENRYENGTHEYNG 465
               N+ R  N+N  Y N  +  N    NR +N    G+ + +    ++ N   N ++ YN 
Sbjct:   647 NNNNNRNRNNNNIYNNNNNNNNNNSNNRGKNFSDSGSSDSDSELNRHNNNNNNNSNNYNN 706

Query:   466 PKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
               + + N R  N  + YN     N IN N  + N +  +    E D     +  + N  +
Sbjct:   707 GNSNSNNNRNNNNNYNYN-----NYINNNNYNNNNNRQHCDDEEEDEQYFNNNNNNNNNN 761

Query:   526 KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
               N I  + DD++  S  T        +N  NGN    N +N+N++ N
Sbjct:   762 NNNRISDSSDDQY-FSDDTNNN----NDNYNNGNNNYNNNNNNNNFNN 804

 Score = 139 (54.0 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 52/200 (26%), Positives = 94/200 (47%)

Query:   409 NGTHEYNSPRI-ENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
             N  +  N+ RI ++S+ +Y +   + N   +N Y NG + YN    N   N  +  N   
Sbjct:   756 NNNNNNNNNRISDSSDDQYFSD--DTNNNNDN-YNNGNNNYNNNNNNNNFNNNYMNNYNN 812

Query:   468 NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKR 527
             N N N  Y N  + YN     N+ N N  + N++N+N+  N     +  + L+    SK+
Sbjct:   813 NYNNN-NYNNN-NSYNNSNGNNNFNNNNNNNNQNNNNNNNNNQYNNNNKNYLNNIPSSKK 870

Query:   528 NTILHNIDDEWQISALTRGKWKLVKENSING--NGTSENRSNDNSYQNEI-DGIDVWSVL 584
             + +  N +     ++L   +     +N+ NG  N  + N++N+N+ QN   +G +  +  
Sbjct:   871 HQLQGNYERR---NSLPNSQI----QNNFNGDNNNNNNNKNNNNNNQNNNGNGNNNNNNN 923

Query:   585 SRNEPSKRNTILHNIDDEWQ 604
             + N   KR    H++DD+ Q
Sbjct:   924 NDNNIYKRR---HSMDDDCQ 940

 Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
 Identities = 40/177 (22%), Positives = 74/177 (41%)

Query:   380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHE--YNSPRIENSNTRYENGTHEYNPKY 437
             ++ SD   Y +    N    Y N    Y N  +   +N+  + N N  Y N  +  N  Y
Sbjct:   767 SDSSD-DQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFNNNYMNNYNNNYNNNNYNNNNSY 825

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYEN-----------GTHEY--NI 484
              N   NG + +N    N  +N  +  N  +  N N  Y N           G +E   ++
Sbjct:   826 NN--SNGNNNFNNNNNNNNQNNNNNNNNNQYNNNNKNYLNNIPSSKKHQLQGNYERRNSL 883

Query:   485 P--RLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 539
             P  +++N+ NG+  + N + +N+  N+ +  +  +  + N  +      H++DD+ Q
Sbjct:   884 PNSQIQNNFNGDNNNNNNNKNNNNNNQNNNGNGNNNNNNNNDNNIYKRRHSMDDDCQ 940


>DICTYBASE|DDB_G0292550 [details] [associations]
            symbol:DDB_G0292550 "protein kinase, CMGC group"
            species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
            "ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
            kinase activity" evidence=IEA] [GO:0004672 "protein kinase
            activity" evidence=IEA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0004693 "cyclin-dependent protein serine/threonine kinase
            activity" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
            activity" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000719 InterPro:IPR002290
            InterPro:IPR008271 InterPro:IPR011009 InterPro:IPR017441
            Pfam:PF00069 PROSITE:PS00107 PROSITE:PS00108 PROSITE:PS50011
            SMART:SM00220 dictyBase:DDB_G0292550 GO:GO:0005524 eggNOG:COG0515
            SUPFAM:SSF56112 GO:GO:0007049 EMBL:AAFI02000190 GO:GO:0004693
            HSSP:P24941 RefSeq:XP_629621.1 ProteinModelPortal:Q54D75
            EnsemblProtists:DDB0229424 GeneID:8628684 KEGG:ddi:DDB_G0292550
            InParanoid:Q54D75 OMA:RRETSEY Uniprot:Q54D75
        Length = 1397

 Score = 159 (61.0 bits), Expect = 2.2e-07, P = 2.2e-07
 Identities = 55/227 (24%), Positives = 86/227 (37%)

Query:   382 KSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTR-YENGTHEYNPKYENR 440
             ++ +PN++   V  +    +  ILR +      N+  + N+N   Y N  H  N    N 
Sbjct:   379 QTSVPNHIYKEVYEVNQLLKQYILRLKQQKVNLNNNNLNNNNNNLYGNNNHNNNNNNNNN 438

Query:   441 YENGTHE--YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
               N  +   YN    N   N  H+ N   N N N  Y+N  +  N     NS N N  + 
Sbjct:   439 NNNNNNNNNYNNNNNNHNNNYNHDNNNNNNYNNN-NYKNNNNSNNNFSFNNSNNNNNNNN 497

Query:   499 NRS----NDNSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDEWQISALTRGKWKLVKE 553
             N +    N+N+  N  +  + ++  S N     N     N +D           +  V  
Sbjct:   498 NNNRNNRNNNNNNNNNNNNNNYNNNSNNNSYNNNFNNGFNNNDNINDDNNNNNSYNNVNN 557

Query:   554 NSINGNGTSENRSND-NSYQNEIDGIDV-WSVLSRNEPSKRNTILHN 598
             N+IN N  + N  N  N+Y N  +  +   +    N  S  NT   N
Sbjct:   558 NNINNNNNNNNGFNGFNNYGNNFNNSNNNGNQFGANNNSFNNTDFSN 604

 Score = 137 (53.3 bits), Expect = 5.2e-05, P = 5.2e-05
 Identities = 46/190 (24%), Positives = 72/190 (37%)

Query:   387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE-NGT 445
             N  N+   N      N+     N  H  N     N+N  Y N  ++ N    N +  N +
Sbjct:   430 NNNNNNNNNNNNNNNNNNYNNNNNNHNNNYNHDNNNNNNYNNNNYKNNNNSNNNFSFNNS 489

Query:   446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHE--YNIPRLENSINGNGTSENRSND 503
             +  N    N   N  +  N   N N N  Y N ++   YN     N  NG   ++N ++D
Sbjct:   490 NNNNNNNNNNNRNNRNNNNNNNNNNNNNNYNNNSNNNSYN----NNFNNGFNNNDNINDD 545

Query:   504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSE 563
             N+  N  + ++  ++ + N  +      +N  + +  S    G       NS N    S 
Sbjct:   546 NNNNNSYNNVNNNNINNNNNNNNGFNGFNNYGNNFNNSN-NNGNQFGANNNSFNNTDFS- 603

Query:   564 NRSNDNSYQN 573
             N SN  SY N
Sbjct:   604 NDSNYGSYCN 613

 Score = 129 (50.5 bits), Expect = 0.00037, P = 0.00037
 Identities = 52/205 (25%), Positives = 75/205 (36%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N +   NY +    N    Y N+   Y+N  +  N+    NSN    N  +       N 
Sbjct:   452 NNNHNNNYNHDNNNN--NNYNNN--NYKNNNNSNNNFSFNNSNNNNNNNNNNNRNNRNNN 507

Query:   441 YENGTHEYNPKYENRYENGTHE--YNGPKNENTNPRYENGTHE-YNIPRLENSINGNGTS 497
               N  +  N  Y N   N ++   +N   N N N   +N  +  YN     N+IN N  +
Sbjct:   508 NNNNNNNNNNNYNNNSNNNSYNNNFNNGFNNNDNINDDNNNNNSYNNVN-NNNINNNNNN 566

Query:   498 ENRSND-NSYQNEIDGIDV-WSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
              N  N  N+Y N  +  +   +    N  S  NT   N D  +   +   G   L+  NS
Sbjct:   567 NNGFNGFNNYGNNFNNSNNNGNQFGANNNSFNNTDFSN-DSNY--GSYCNGLMDLINNNS 623

Query:   556 I--NGNGTSENRSNDNSYQNEIDGI 578
             +   GN    N S     Q  I  I
Sbjct:   624 MYNGGNYYMNNASFHQRIQEHIQKI 648


>DICTYBASE|DDB_G0269922 [details] [associations]
            symbol:xrn2 "CCHC-type zinc finger-containing
            protein" species:44689 "Dictyostelium discoideum" [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0006139 "nucleobase-containing
            compound metabolic process" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005622 "intracellular" evidence=IEA] [GO:0004534
            "5'-3' exoribonuclease activity" evidence=IEA] [GO:0004527
            "exonuclease activity" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001878 InterPro:IPR004859
            InterPro:IPR017151 Pfam:PF03159 PIRSF:PIRSF037239 PROSITE:PS50158
            SMART:SM00343 dictyBase:DDB_G0269922 GO:GO:0005634
            EMBL:AAFI02000005 GenomeReviews:CM000150_GR GO:GO:0008270
            GO:GO:0003676 Gene3D:4.10.60.10 eggNOG:COG5049 InterPro:IPR027073
            PANTHER:PTHR12341 GO:GO:0004534 KO:K12619 RefSeq:XP_646407.1
            STRING:Q55CS4 EnsemblProtists:DDB0237528 GeneID:8617364
            KEGG:ddi:DDB_G0269922 InParanoid:Q55CS4 Uniprot:Q55CS4
        Length = 1190

 Score = 165 (63.1 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 54/211 (25%), Positives = 88/211 (41%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYEN-GTHEYNPKYEN 439
             N+ +  NY N+   N      N+     N  +  N+    N+N  Y N   +  N  Y+N
Sbjct:   919 NRFNNQNYNNNRYNNNNNNNNNN--NNNNNNNNNNNNNNNNNNNNYNNYNNYNNNNNYKN 976

Query:   440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
                N    YN    N Y N  + YN     N N  Y NG + YN     N+ NGNG + N
Sbjct:   977 NNYNNNGNYNGNNSNNYNNNNN-YNNSNYNNYNNSYNNGNN-YN----NNNNNGNGYNSN 1030

Query:   500 RSND--NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSIN 557
              +N+  N+Y N  +  + ++    N  +  N    N ++    +    G +     N  N
Sbjct:  1031 YNNNYNNNYNNGNNNGNNYNNNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYD--NNNGFN 1088

Query:   558 GNGTSENRSNDN-SYQNEIDGIDVWSVLSRN 587
              N  + N +N+N SY  + + ++  S++  N
Sbjct:  1089 NNNNNNNNNNNNNSYNYDFNNLNDPSLIDIN 1119

 Score = 144 (55.7 bits), Expect = 4.1e-05, Sum P(2) = 4.1e-05
 Identities = 48/196 (24%), Positives = 75/196 (38%)

Query:   407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
             Y N    YN  R  N N  Y N  +  N    N   N  +  N    N   N  + YN  
Sbjct:   911 YNNKQIGYN--RFNNQN--YNNNRYNNNNNNNNNNNNNNNNNNNNNNNN-NNNNNNYNNY 965

Query:   467 KNENTNPRYENGTHEYNIPRLENSING--NGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
              N N N  Y+N  +  N     N+ N   N  + N SN N+Y N  +  + ++  + N  
Sbjct:   966 NNYNNNNNYKNNNYNNNGNYNGNNSNNYNNNNNYNNSNYNNYNNSYNNGNNYNNNNNNGN 1025

Query:   525 SKRNTILHNIDDEWQISALTRGKWKLVKENSIN-GNGTSENRSNDNSYQNE-IDGIDVWS 582
                +   +N ++ +         +     N+ N GN    N +N+N+Y N    G D  +
Sbjct:  1026 GYNSNYNNNYNNNYNNGNNNGNNYNNNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYDNNN 1085

Query:   583 VLSRNEPSKRNTILHN 598
               + N  +  N   +N
Sbjct:  1086 GFNNNNNNNNNNNNNN 1101

 Score = 44 (20.5 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 14/53 (26%), Positives = 24/53 (45%)

Query:    23 QYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDV 75
             QY++E   + +  +  + P       +F+D VA S   ++   L  D  W DV
Sbjct:   149 QYMEEKKNKFKFDSNCITP-----GTLFMDRVAESLRTYVAEKLTTDPAWKDV 196


>UNIPROTKB|F1SBF1 [details] [associations]
            symbol:LOC100739059 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00400000022041 OMA:TENDPAN EMBL:FP339597
            RefSeq:XP_003484028.1 Ensembl:ENSSSCT00000008161 GeneID:100739169
            KEGG:ssc:100739169 Uniprot:F1SBF1
        Length = 527

 Score = 141 (54.7 bits), Expect = 3.0e-07, Sum P(2) = 3.0e-07
 Identities = 61/225 (27%), Positives = 89/225 (39%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
                G++  +G     + Y +      +K   G D  +D          Y TD+ T ++V 
Sbjct:   155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202

Query:   232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                         P+ + ++HAA H      P Q    + N  +HI
Sbjct:   203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246

 Score = 59 (25.8 bits), Expect = 3.0e-07, Sum P(2) = 3.0e-07
 Identities = 48/233 (20%), Positives = 89/233 (38%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  +   L +   L N+ IV+ +D              + P        +E 
Sbjct:   286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+  +     +++ D  PT+L  A   DIP+ ++   ++I+   +  
Sbjct:   338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDTE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGT--HEYNP-KYENRYENGTHEYNPKYE 453
               R  N  H     R+   +   E G   H+ +  K + + EN    + PKY+
Sbjct:   394 --RPANRFHLKKKMRVWRDSFLVERGKLLHKRDSDKVDAQEEN----FLPKYQ 440


>DICTYBASE|DDB_G0273013 [details] [associations]
            symbol:uglB "uracil glycosylase" species:44689
            "Dictyostelium discoideum" [GO:0006284 "base-excision repair"
            evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA] [GO:0004844
            "uracil DNA N-glycosylase activity" evidence=IEA]
            InterPro:IPR002043 dictyBase:DDB_G0273013 HAMAP:MF_00148
            Pfam:PF03167 GO:GO:0005737 GO:GO:0006284 GenomeReviews:CM000151_GR
            EMBL:AAFI02000008 GO:GO:0004844 Gene3D:3.40.470.10
            InterPro:IPR005122 SMART:SM00986 SUPFAM:SSF52141 eggNOG:COG0692
            KO:K03648 PANTHER:PTHR11264 TIGRFAMs:TIGR00628
            RefSeq:XP_001134629.1 ProteinModelPortal:Q1ZXM2
            EnsemblProtists:DDB0232990 GeneID:8618639 KEGG:ddi:DDB_G0273013
            InParanoid:Q1ZXM2 OMA:FININEP Uniprot:Q1ZXM2
        Length = 597

 Score = 151 (58.2 bits), Expect = 3.9e-07, Sum P(2) = 3.9e-07
 Identities = 48/192 (25%), Positives = 76/192 (39%)

Query:   391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTR------YENGTHEYNPKYENRYENG 444
             ST  NI+ + + S + ++    + N+  I N N        Y N  +  N    N   N 
Sbjct:   405 STANNILIQSQQSPIDWDLDNIDCNNNNINNKNKNLNLNVDYNNNNNNNNNNNNNNNNNN 464

Query:   445 THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
              +  N    N   N  +  N   N N N    N  +  N     N+IN N T+ N+SNDN
Sbjct:   465 NNNNNNNNNNNNNNNNNNNNNNNNNNINNNNHNNNNNNNNNNTNNNINNN-TNNNKSNDN 523

Query:   505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
             +  N  +  +  +  S N  +K N    N + ++  +            N+ N N ++ N
Sbjct:   524 NNHNNTNNNNT-NNNSNNIDNKENNNEENENVDFNNNNNNNNN-NNNNNNNNNINNSNNN 581

Query:   565 RSNDNSYQNEID 576
              +ND S    ID
Sbjct:   582 TNNDKSNSKSID 593

 Score = 122 (48.0 bits), Expect = 0.00049, Sum P(2) = 0.00049
 Identities = 41/160 (25%), Positives = 65/160 (40%)

Query:   384 DIPNYVNSTVENIIPRYENSILR--YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
             D+ N ++    NI  + +N  L   Y N  +  N+    N+N    N  +  N    N  
Sbjct:   422 DLDN-IDCNNNNINNKNKNLNLNVDYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 480

Query:   442 ENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH--EYNIPRLENSINGNGTSEN 499
              N  +  N    N   N  +  N   N NTN    N T+  + N     N+ N N T+ N
Sbjct:   481 NNNNNNNNNNINNNNHNNNNNNN---NNNTNNNINNNTNNNKSNDNNNHNNTNNNNTNNN 537

Query:   500 RSN-DNSYQN--EIDGIDVWSVLSRNEPSKRNTILHNIDD 536
              +N DN   N  E + +D  +  + N  +  N   +NI++
Sbjct:   538 SNNIDNKENNNEENENVDFNNNNNNNNNNNNNNNNNNINN 577

 Score = 49 (22.3 bits), Expect = 3.9e-07, Sum P(2) = 3.9e-07
 Identities = 18/71 (25%), Positives = 33/71 (46%)

Query:    98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
             L N +T  +     + I+  K  +H   ++N +   E      SE I    L E G+R+R
Sbjct:   144 LFNGFTSPIDQNINNNIINNKR-LHLNNENNFININEPTNENFSENIKKSLL-EKGWRSR 201

Query:   158 IVGKWHLGFYK 168
             + G++   ++K
Sbjct:   202 LQGEFEKDYFK 212


>ZFIN|ZDB-GENE-061215-37 [details] [associations]
            symbol:ids "iduronate 2-sulfatase" species:7955
            "Danio rerio" [GO:0008152 "metabolic process" evidence=IEA]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0030512
            "negative regulation of transforming growth factor beta receptor
            signaling pathway" evidence=IMP] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0009790 "embryo development" evidence=IMP]
            [GO:0060536 "cartilage morphogenesis" evidence=IMP] [GO:0004423
            "iduronate-2-sulfatase activity" evidence=IDA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            ZFIN:ZDB-GENE-061215-37 GO:GO:0005737 GO:GO:0009790
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00149 GO:GO:0030512 GO:GO:0060536 OMA:CREGKNL
            GO:GO:0004423 GeneTree:ENSGT00640000091539 EMBL:CR774199
            IPI:IPI00495228 Ensembl:ENSDART00000106205 Bgee:F1R4Q5
            Uniprot:F1R4Q5
        Length = 561

 Score = 135 (52.6 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
 Identities = 38/113 (33%), Positives = 57/113 (50%)

Query:    55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
             A S   ++++++ADDL    +G +    + +PNID LA   ++  N Y  Q +C PSR +
Sbjct:    26 AKSKDFNVLYLIADDLR-PSLGCYSDPVVKSPNIDQLASLSVVFHNAYAQQAVCGPSRVS 84

Query:   114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
              +T + P  T +     Y     G   +   LPQY K  GY T  VGK +H G
Sbjct:    85 FLTSRRPDTTKLYDFNSYWRVHAG---NYTTLPQYFKSNGYTTLSVGKVFHPG 134

 Score = 69 (29.3 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
 Identities = 19/60 (31%), Positives = 32/60 (53%)

Query:   256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             PY P+   D  L I +H        FA++ + +D  VGK+++ L+   +  N+I+V  SD
Sbjct:   278 PYGPIPK-DFQLRIRQHY-------FASVSY-VDAQVGKILQTLDDVGLAKNTIVVLSSD 328

 Score = 44 (20.5 bits), Expect = 0.00012, Sum P(3) = 0.00012
 Identities = 12/46 (26%), Positives = 22/46 (47%)

Query:   226 TAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
             T EA+ ++ +   + +P FL     A     P+ P + P  YL ++
Sbjct:   196 TEEAIRLLRSMKGSQKPFFL-----AVGFYKPHIPFRIPQEYLKLY 236

 Score = 38 (18.4 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
 Identities = 11/41 (26%), Positives = 18/41 (43%)

Query:   775 CEPQIAPC----LFDIKNDPCEKNNLADR-SEVQRINHYTT 810
             C+P +       L+ +  DP + NNL D       +N + T
Sbjct:   496 CKPNMTEIHAGELYILTEDPGQDNNLFDEFGHAALLNKFGT 536

 Score = 37 (18.1 bits), Expect = 5.2e-07, Sum P(3) = 5.2e-07
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query:   669 CEPQIAPC----LFDIKNDPCEKNNLAD 692
             C+P +       L+ +  DP + NNL D
Sbjct:   496 CKPNMTEIHAGELYILTEDPGQDNNLFD 523


>UNIPROTKB|Q1LZH9 [details] [associations]
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9913
            "Bos taurus" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            PROSITE:PS00149 GO:GO:0030203 EMBL:BC115990 IPI:IPI00703612
            RefSeq:NP_001069030.1 UniGene:Bt.20235 ProteinModelPortal:Q1LZH9
            STRING:Q1LZH9 PRIDE:Q1LZH9 GeneID:512444 KEGG:bta:512444 CTD:2799
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 InParanoid:Q1LZH9 KO:K01137
            OrthoDB:EOG4NGGMF NextBio:20870390 GO:GO:0008449
            PANTHER:PTHR10342:SF5 Uniprot:Q1LZH9
        Length = 560

 Score = 147 (56.8 bits), Expect = 4.3e-07, Sum P(2) = 4.3e-07
 Identities = 63/228 (27%), Positives = 97/228 (42%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSA 113
             SS  P+++ +LADD    D    G+   P     AL    G+   + Y    LC PSR++
Sbjct:    51 SSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRAS 105

Query:   114 IMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKK 169
             I+TGK+P +  + +N L G C  +    + E    P  L+ + GY+T   GK     Y  
Sbjct:   106 ILTGKYPHNLHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK-----YLN 160

Query:   170 EYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
             EY     G   H  LG  YW   +    +    + + G   +     + D    Y TDV 
Sbjct:   161 EYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDVL 216

Query:   226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                ++D +   S  EP F+ ++  A HS     P  A   Y N  +++
Sbjct:   217 ANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNAFQNV 259

 Score = 52 (23.4 bits), Expect = 4.3e-07, Sum P(2) = 4.3e-07
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   298 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 334

 Score = 41 (19.5 bits), Expect = 5.8e-06, Sum P(2) = 5.8e-06
 Identities = 27/105 (25%), Positives = 42/105 (40%)

Query:   473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI-- 530
             P+Y+N       PR +N  N +GT+++                W +     P   ++I  
Sbjct:   250 PQYQNAFQNVFAPRNKN-FNIHGTNKH----------------WLIRQAKTPMTNSSIQF 292

Query:   531 LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDNSY 571
             L N     WQ + ++     KLVK    NG  N T    ++DN Y
Sbjct:   293 LDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDNGY 337

 Score = 40 (19.1 bits), Expect = 7.3e-06, Sum P(2) = 7.3e-06
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query:   450 PKYENRYENGTHEYNGPKNENTN 472
             P+Y+N ++N       P+N+N N
Sbjct:   250 PQYQNAFQN----VFAPRNKNFN 268


>DICTYBASE|DDB_G0277905 [details] [associations]
            symbol:snfA "AMP-activated protein kinase alpha
            subunit" species:44689 "Dictyostelium discoideum" [GO:0046956
            "positive phototaxis" evidence=IMP] [GO:0008283 "cell
            proliferation" evidence=IMP] [GO:0007005 "mitochondrion
            organization" evidence=IMP] [GO:0006754 "ATP biosynthetic process"
            evidence=IMP] [GO:0016772 "transferase activity, transferring
            phosphorus-containing groups" evidence=IEA] [GO:0006468 "protein
            phosphorylation" evidence=IEA;ISS] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004674 "protein serine/threonine kinase
            activity" evidence=IEA] [GO:0004672 "protein kinase activity"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0007165 "signal transduction" evidence=ISS] [GO:0004679
            "AMP-activated protein kinase activity" evidence=ISS] [GO:0016740
            "transferase activity" evidence=IEA] [GO:0016310 "phosphorylation"
            evidence=IEA] [GO:0016301 "kinase activity" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000719
            InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR011009
            InterPro:IPR017441 Pfam:PF00069 PROSITE:PS00107 PROSITE:PS00108
            PROSITE:PS50011 SMART:SM00220 dictyBase:DDB_G0277905
            InterPro:IPR001772 GO:GO:0005524 GO:GO:0007165
            GenomeReviews:CM000152_GR eggNOG:COG0515 GO:GO:0008283
            EMBL:AAFI02000023 SUPFAM:SSF56112 GO:GO:0004679 GO:GO:0006754
            HSSP:P06782 GO:GO:0007005 EMBL:AF118151 RefSeq:XP_642250.1
            ProteinModelPortal:Q54YF2 SMR:Q54YF2 STRING:Q54YF2
            EnsemblProtists:DDB0215396 GeneID:8621459 KEGG:ddi:DDB_G0277905
            OMA:KREANSI GO:GO:0046956 Pfam:PF02149 PROSITE:PS50032
            Uniprot:Q54YF2
        Length = 727

 Score = 153 (58.9 bits), Expect = 4.3e-07, P = 4.3e-07
 Identities = 52/208 (25%), Positives = 79/208 (37%)

Query:   375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
             T  + +N + I N  N+   N      N+     N     N+  I N+N    N  +  N
Sbjct:   386 TGFNPSNSNSISNNNNNNNNNNNNTTNNNNNTTNNNNSIINNNNINNNNINNNNNNNNNN 445

Query:   435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNI-PRLENSING 493
                 N   N  +  N    N   N  +  N   N N N     GT  ++I P L NS N 
Sbjct:   446 INNNNIINNNNNNNN----NNNNNNNNNNNNNNNNNNNSSISGGTEVFSISPNLNNSYNS 501

Query:   494 NGT-SENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
             N + + N SN N+  N     D  +  + N  +  N   +N ++    + +      L  
Sbjct:   502 NSSGNSNGSNSNNNSNNNTNNDNNNNNNNNNNNNNNNNNNNNNNNNNNNCIDSVNNSLNN 561

Query:   553 ENSING---NGTSENRSNDNSYQNEIDG 577
             EN +N    N  + N S+D S  N  +G
Sbjct:   562 ENDVNNSNINNNNNNNSDDGSNNNSYEG 589

 Score = 137 (53.3 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 50/222 (22%), Positives = 87/222 (39%)

Query:   378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
             S  N+ + PN V+     I+   + S + +   T  +N P   NS +   N  +  N   
Sbjct:   353 SYENEINSPNLVSPITTPIMSSAQKSPIMFTTTTG-FN-PSNSNSISNNNNNNNNNNNNT 410

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
              N   N T+  N    N   N  +  N   N N N    N  +  N     N+ N N  +
Sbjct:   411 TNNNNNTTNNNNSIINNNNINNNNINNNNNNNNNNINNNNIINNNNNNNNNNNNNNNNNN 470

Query:   498 ENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
              N +N+N+  +   G +V+S+    N     N+  ++       ++           N+ 
Sbjct:   471 NNNNNNNNNSSISGGTEVFSISPNLNNSYNSNSSGNSNGSNSNNNSNNNTNNDNNNNNNN 530

Query:   557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
             N N  + N +N+N+  N  + ID  +    NE    N+ ++N
Sbjct:   531 NNNNNNNNNNNNNNNNNNNNCIDSVNNSLNNENDVNNSNINN 572


>UNIPROTKB|Q32KH1 [details] [associations]
            symbol:sulf2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060384 "innervation" evidence=IEA]
            [GO:0060348 "bone development" evidence=IEA] [GO:0048706 "embryonic
            skeletal system development" evidence=IEA] [GO:0040037 "negative
            regulation of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0035860 "glial cell-derived neurotrophic factor
            receptor signaling pathway" evidence=IEA] [GO:0032836 "glomerular
            basement membrane development" evidence=IEA] [GO:0030201 "heparan
            sulfate proteoglycan metabolic process" evidence=IEA] [GO:0030177
            "positive regulation of Wnt receptor signaling pathway"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
            growth factor production" evidence=IEA] [GO:0009986 "cell surface"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
            filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
            GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG OrthoDB:EOG49KFPX EMBL:AAEX03013985
            EMBL:AAEX03013986 EMBL:BN000766 RefSeq:NP_001041555.1
            UniGene:Cfa.6393 STRING:Q32KH1 Ensembl:ENSCAFT00000017345
            GeneID:477254 KEGG:cfa:477254 InParanoid:Q32KH1 NextBio:20852774
            Uniprot:Q32KH1
        Length = 869

 Score = 142 (55.0 bits), Expect = 4.6e-07, Sum P(2) = 4.6e-07
 Identities = 61/225 (27%), Positives = 89/225 (39%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
                G++  +G     + Y +      +K   G D  +D          Y TD+ T ++V 
Sbjct:   155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202

Query:   232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                         P+ + ++HAA H      P Q    + N  +HI
Sbjct:   203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSGLFPNASQHI 246

 Score = 62 (26.9 bits), Expect = 4.6e-07, Sum P(2) = 4.6e-07
 Identities = 40/188 (21%), Positives = 75/188 (39%)

Query:   269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX 328
             IH    +  + K    L  +D+S+  +   L +   L N+ IV+ +D             
Sbjct:   271 IHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKG 330

Query:   329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
              + P        +E  +R    +  P +E+  +     +++ D  PT+L  A   DIP+ 
Sbjct:   331 KSMP--------YEFDIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSD 380

Query:   389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGT 445
             ++   ++I+   +    R  N  H     R+   +   E G   H+  N K + + EN  
Sbjct:   381 MDG--KSILKLLDTE--RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN-- 434

Query:   446 HEYNPKYE 453
               + PKY+
Sbjct:   435 --FLPKYQ 440

 Score = 44 (20.5 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
 Identities = 10/42 (23%), Positives = 15/42 (35%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYENRYENGTH---EYNGP 466
             +G  +  P+Y   + N +    P Y        H    Y GP
Sbjct:   226 HGPEDSAPQYSGLFPNASQHITPSYNYAPNPDKHWIMRYTGP 267


>UNIPROTKB|E9PJL8 [details] [associations]
            symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9606
            "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] [GO:0002063 "chondrocyte development" evidence=IEA]
            [GO:0003094 "glomerular filtration" evidence=IEA] [GO:0005886
            "plasma membrane" evidence=IEA] [GO:0010575 "positive regulation
            vascular endothelial growth factor production" evidence=IEA]
            [GO:0014846 "esophagus smooth muscle contraction" evidence=IEA]
            [GO:0030201 "heparan sulfate proteoglycan metabolic process"
            evidence=IEA] [GO:0032836 "glomerular basement membrane
            development" evidence=IEA] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=IEA]
            [GO:0040037 "negative regulation of fibroblast growth factor
            receptor signaling pathway" evidence=IEA] [GO:0048706 "embryonic
            skeletal system development" evidence=IEA] [GO:0060348 "bone
            development" evidence=IEA] [GO:0060384 "innervation" evidence=IEA]
            [GO:0060686 "negative regulation of prostatic bud formation"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005794
            GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GO:GO:0048706 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
            GO:GO:0032836 GO:GO:0060384 GO:GO:0030201 EMBL:AC091047
            GO:GO:0014846 GO:GO:0035860 HGNC:HGNC:20391 ChiTaRS:SULF1
            EMBL:AC013746 EMBL:AC022790 IPI:IPI00978157
            ProteinModelPortal:E9PJL8 SMR:E9PJL8 Ensembl:ENST00000525999
            ArrayExpress:E9PJL8 Bgee:E9PJL8 Uniprot:E9PJL8
        Length = 172

 Score = 127 (49.8 bits), Expect = 5.0e-07, P = 5.0e-07
 Identities = 43/130 (33%), Positives = 59/130 (45%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV    L Q+       + + G    N + T  +C PSRS+++TGK
Sbjct:    43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98

Query:   119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HNV    E    P    + E +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153

Query:   174 TFRGFESHLG 183
                G+   LG
Sbjct:   154 P--GWREWLG 161


>WB|WBGene00006308 [details] [associations]
            symbol:sul-1 species:6239 "Caenorhabditis elegans"
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
            ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0015015 "heparan sulfate proteoglycan
            biosynthetic process, enzymatic modification" evidence=IMP]
            [GO:0017095 "heparan sulfate 6-O-sulfotransferase activity"
            evidence=IMP] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0009986
            GO:GO:0046872 GO:GO:0005795 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0015015
            GO:GO:0017095 EMBL:FO081118 PIR:T16584 RefSeq:NP_508560.1
            ProteinModelPortal:Q21376 SMR:Q21376 STRING:Q21376
            EnsemblMetazoa:K09C4.8 GeneID:180619 KEGG:cel:CELE_K09C4.8
            UCSC:K09C4.8 CTD:180619 WormBase:K09C4.8 HOGENOM:HOG000290161
            InParanoid:Q21376 KO:K14607 OMA:TVEDRWR NextBio:910136
            Uniprot:Q21376
        Length = 709

 Score = 148 (57.2 bits), Expect = 5.2e-07, Sum P(2) = 5.2e-07
 Identities = 65/233 (27%), Positives = 104/233 (44%)

Query:    37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
             F ++P+  T S+ FVD        ++I IL DD    D+    +D +P  +   +   G 
Sbjct:    18 FLIIPIKVT-SIHFVD-----SQHNVILILTDD---QDIELGSMDFMPKTS-QIMKERGT 67

Query:    97 -ILKNYYTVQLCTPSRSAIMTG----KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
                  Y T  +C PSRS I+TG     H +HT  Q+    G E   +   +K +  YL+E
Sbjct:    68 EFTSGYVTTPICCPSRSTILTGLYVHNHHVHTNNQNCT--GVEWRKVH-EKKSIGVYLQE 124

Query:   152 LGYRTRIVGKWHLGFYKKEYTPTFRGFES-H-LGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
              GYRT  +GK+ L  Y   Y P   G++  H +   +   +Y  +S  E + +G +  +D
Sbjct:   125 AGYRTAYLGKY-LNEYDGSYIPP--GWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKD 181

Query:   210 LEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEP 259
                       Y TD+ T  ++  I  H      +P  L +++ A H   P +P
Sbjct:   182 ----------YFTDLVTNRSLKFIDKHIKIRAWQPFALIISYPAPHG--PEDP 222

 Score = 53 (23.7 bits), Expect = 5.2e-07, Sum P(2) = 5.2e-07
 Identities = 26/121 (21%), Positives = 45/121 (37%)

Query:   260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXX 319
             LQ      ++H    D    +    L  +DE + ++   L +   L N+  ++ SD    
Sbjct:   253 LQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDHGYH 312

Query:   320 XXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
                          L+G KN  +E  +R    +  P +  R +   + V   D  PT+L  
Sbjct:   313 LGQFGL-------LKG-KNMPYEFDIRVPFFMRGPGIP-RNVTFNEIVTNVDIAPTMLHI 363

Query:   380 A 380
             A
Sbjct:   364 A 364


>UNIPROTKB|F1MXZ0 [details] [associations]
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9913
            "Bos taurus" [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GeneTree:ENSGT00400000022041 GO:GO:0030203 IPI:IPI00703612
            UniGene:Bt.20235 GO:GO:0008449 PANTHER:PTHR10342:SF5 OMA:MCGYQTF
            EMBL:DAAA02013337 ProteinModelPortal:F1MXZ0
            Ensembl:ENSBTAT00000023218 ArrayExpress:F1MXZ0 Uniprot:F1MXZ0
        Length = 560

 Score = 146 (56.5 bits), Expect = 5.5e-07, Sum P(2) = 5.5e-07
 Identities = 63/228 (27%), Positives = 97/228 (42%)

Query:    56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSA 113
             SS  P+++ +LADD    D    G+   P     AL    G+   + Y    LC PSR++
Sbjct:    51 SSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRAS 105

Query:   114 IMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKK 169
             I+TGK+P +  + +N L G C  +    + E    P  L+ + GY+T   GK     Y  
Sbjct:   106 ILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK-----YLN 160

Query:   170 EYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
             EY     G   H  LG  YW   +    +    + + G   +     + D    Y TDV 
Sbjct:   161 EYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDVL 216

Query:   226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                ++D +   S  EP F+ ++  A HS     P  A   Y N  +++
Sbjct:   217 ANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNAFQNV 259

 Score = 52 (23.4 bits), Expect = 5.5e-07, Sum P(2) = 5.5e-07
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   298 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 334

 Score = 41 (19.5 bits), Expect = 7.4e-06, Sum P(2) = 7.4e-06
 Identities = 27/105 (25%), Positives = 42/105 (40%)

Query:   473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI-- 530
             P+Y+N       PR +N  N +GT+++                W +     P   ++I  
Sbjct:   250 PQYQNAFQNVFAPRNKN-FNIHGTNKH----------------WLIRQAKTPMTNSSIQF 292

Query:   531 LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDNSY 571
             L N     WQ + ++     KLVK    NG  N T    ++DN Y
Sbjct:   293 LDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDNGY 337

 Score = 40 (19.1 bits), Expect = 9.4e-06, Sum P(2) = 9.4e-06
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query:   450 PKYENRYENGTHEYNGPKNENTN 472
             P+Y+N ++N       P+N+N N
Sbjct:   250 PQYQNAFQN----VFAPRNKNFN 268


>MGI|MGI:1922862 [details] [associations]
            symbol:Gns "glucosamine (N-acetyl)-6-sulfatase"
            species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005539 "glycosaminoglycan binding" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0008152 "metabolic
            process" evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=ISO] [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=IEA] [GO:0042340 "keratan sulfate catabolic process"
            evidence=ISO] [GO:0043199 "sulfate binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 MGI:MGI:1922862
            GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0042340
            GO:GO:0043199 GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239
            HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF GO:GO:0008449
            PANTHER:PTHR10342:SF5 ChiTaRS:GNS EMBL:AK030773 EMBL:AK049162
            EMBL:AK054046 EMBL:AK083597 EMBL:AK159562 EMBL:AK169485
            EMBL:AK165180 EMBL:AK170791 EMBL:BC055328 IPI:IPI00221426
            RefSeq:NP_083640.1 UniGene:Mm.207683 ProteinModelPortal:Q8BFR4
            SMR:Q8BFR4 STRING:Q8BFR4 PhosphoSite:Q8BFR4 PaxDb:Q8BFR4
            PRIDE:Q8BFR4 Ensembl:ENSMUST00000040344 GeneID:75612 KEGG:mmu:75612
            UCSC:uc007hfo.1 InParanoid:Q8BFR4 OMA:MCGYQTF NextBio:343508
            Bgee:Q8BFR4 CleanEx:MM_GNS Genevestigator:Q8BFR4 Uniprot:Q8BFR4
        Length = 544

 Score = 150 (57.9 bits), Expect = 5.9e-07, P = 5.9e-07
 Identities = 66/235 (28%), Positives = 101/235 (42%)

Query:    29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
             G   R+ A  +LPL        + LV ++  P+++ +L DD    D    G+   P    
Sbjct:    12 GRPRRLPALLLLPLLGGC----LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62

Query:    89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
              AL    G+   + Y    LC PSR++I+TGK+P +  + +N L G C  +    + E  
Sbjct:    63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKAWQKIQEPY 122

Query:   145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
               P  LK + GY+T   GK     Y  EY  P   G E   LG  YW   +    +    
Sbjct:   123 TFPAILKSVCGYQTFFAGK-----YLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYT 177

Query:   199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
             + + G   +     + D    Y TDV    ++D +   S  EP F+ ++  A HS
Sbjct:   178 LSINGKARKHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228


>TIGR_CMR|SPO_3593 [details] [associations]
            symbol:SPO_3593 "sulfatase family protein" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0008484 HOGENOM:HOG000230030 ProtClustDB:CLSK867183
            RefSeq:YP_168788.1 ProteinModelPortal:Q5LMH0 GeneID:3195684
            KEGG:sil:SPO3593 PATRIC:23380663 OMA:MNILFIM Uniprot:Q5LMH0
        Length = 552

 Score = 121 (47.7 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
 Identities = 32/107 (29%), Positives = 55/107 (51%)

Query:    61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKH 119
             +I+FI+ D L W+ +  +G   + TP+ID LA  G+   + Y    +C  SR +  TG++
Sbjct:     2 NILFIMFDQLRWDYLSCYGHKTLNTPHIDRLAAKGVRFDRAYIQSPICGSSRMSTYTGRY 61

Query:   120 PIHTGMQHNVLYGCERGGLPLS--EKILPQYLKELGYRTRIVGKWHL 164
              +H+       +G    G+PL   E  +  +L+  G    +VGK H+
Sbjct:    62 -VHS-------HGASWNGIPLKVGEMTMGDHLRAAGMGCWLVGKTHM 100

 Score = 69 (29.3 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
 Identities = 28/131 (21%), Positives = 54/131 (41%)

Query:   271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
             + + D     +  ++ + D+ +G++ + LE    + +++IV  SD              +
Sbjct:   282 QEVRDAVIPAYMGLIKQADDQMGRLFKWLEDTGRMQDTMIVLTSDHGDFLG-------DH 334

Query:   331 WPLRGVKNTLWEGGVRGAGLIWSPLLES---RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
             W   G K    +   R   +I+ P  E+   RG V +  V   D  PT + AA      +
Sbjct:   335 W--MGEKTFFHDASTRVPLIIYDPRPEADATRGSVCDALVESIDLAPTFVEAAGGKPAMH 392

Query:   388 YVNSTVENIIP 398
              +    E++IP
Sbjct:   393 ILEG--ESLIP 401

 Score = 51 (23.0 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query:   665 KEVPCEPQIAPCLFDIKNDPCE 686
             K +  E    P LFD+KNDP E
Sbjct:   448 KLIHFEADPRPMLFDLKNDPQE 469

 Score = 51 (23.0 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query:   771 KEVPCEPQIAPCLFDIKNDPCE 792
             K +  E    P LFD+KNDP E
Sbjct:   448 KLIHFEADPRPMLFDLKNDPQE 469


>DICTYBASE|DDB_G0286855 [details] [associations]
            symbol:gtaD "GATA zinc finger domain-containing
            protein 4" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR000679
            PROSITE:PS00344 PROSITE:PS50114 dictyBase:DDB_G0286855
            GenomeReviews:CM000153_GR GO:GO:0046872 GO:GO:0043565 GO:GO:0008270
            GO:GO:0003700 EMBL:AAFI02000090 RefSeq:XP_637531.1 PRIDE:Q54L72
            EnsemblProtists:DDB0220503 GeneID:8625829 KEGG:ddi:DDB_G0286855
            eggNOG:NOG258313 Uniprot:Q54L72
        Length = 530

 Score = 149 (57.5 bits), Expect = 7.3e-07, P = 7.3e-07
 Identities = 46/185 (24%), Positives = 75/185 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N ++  N  NS   N    Y N+  + + G    N+    N+N  + N  +  N  Y N 
Sbjct:   267 NNNNNNNNNNSNNNNNNNNYFNNNKKNKIGDCNSNNSN-NNNNNNHNNNNNNNNYNYNNN 325

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS-EN 499
               N  +  N    N   N  +  N   N N N    N  +  NI    N+ N N  +  N
Sbjct:   326 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNKNNNNINNNNNNNNNNNNNINN 385

Query:   500 RSNDNSYQNEIDGIDVWSVLS-RNEPSKRNTILHNIDDE--WQISALTRGKWKLVKENSI 556
              +N+NS  N I+  + ++  +  N     N++ +N  +   W+ S+       L+KE S+
Sbjct:   386 NNNNNSINNIINNNNNFNNNNINNNLFNNNSMNYNKKENYNWESSSSEEDNNNLIKEQSV 445

Query:   557 NGNGT 561
               N T
Sbjct:   446 KKNET 450

 Score = 145 (56.1 bits), Expect = 2.0e-06, P = 2.0e-06
 Identities = 46/200 (23%), Positives = 76/200 (38%)

Query:   374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEY 433
             PT++S+ +     N  N+   N      N+     N  +  N+    N+N  Y N   + 
Sbjct:   236 PTIISSNSPLKTRNKNNNN--NYNNNNNNNNNNNNNNNNNNNNSNNNNNNNNYFNNNKK- 292

Query:   434 NPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSING 493
             N   +    N  +  N  + N   N  + YN   N N N    N  +  N     N+ N 
Sbjct:   293 NKIGDCNSNNSNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 352

Query:   494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
             N  + N SN+N   N I+        + N  +  N  ++N ++   I+ +          
Sbjct:   353 NNNNNNNSNNNKNNNNINN-------NNNNNNNNNNNINNNNNNNSINNIINNNNNF-NN 404

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+IN N  + N  N N  +N
Sbjct:   405 NNINNNLFNNNSMNYNKKEN 424

 Score = 137 (53.3 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 43/190 (22%), Positives = 73/190 (38%)

Query:   386 PNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
             P    ST   II        R +N  + YN+    N+N    N  +  N    N   N  
Sbjct:   228 PQSSQSTTPTIISSNSPLKTRNKNNNNNYNN---NNNNNNNNNNNNNNNNNNSNNNNNNN 284

Query:   446 HEYNPKYENRYEN-GTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
             + +N   +N+  +  ++  N   N N N    N  + YN     N+ N N  + N +N+N
Sbjct:   285 NYFNNNKKNKIGDCNSNNSNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNN 344

Query:   505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
             +  N  +  +  +  S N  +  N   +N ++    + +          N IN N    N
Sbjct:   345 NNNNNNNNNNNNNNNSNNNKNNNNINNNNNNNNNNNNNINNNNNNNSINNIINNNNNFNN 404

Query:   565 RS-NDNSYQN 573
              + N+N + N
Sbjct:   405 NNINNNLFNN 414

 Score = 128 (50.1 bits), Expect = 0.00014, P = 0.00014
 Identities = 47/230 (20%), Positives = 89/230 (38%)

Query:   375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
             T+   ++ S  P +     ENI     ++++   N  + Y+ P    S+          N
Sbjct:   185 TICIPSHHSPSPQFPVYYTENINNATPSTVV--SNSPNNYSQPISPQSSQSTTPTIISSN 242

Query:   435 PKYENRYENGTHEYNPKYENRYENGTHEYNG---PKNENTNPRYENGTHEYNIP--RLEN 489
                + R +N  + YN    N   N  +  N      N N N  Y N   +  I      N
Sbjct:   243 SPLKTRNKNNNNNYNNNNNNNNNNNNNNNNNNNNSNNNNNNNNYFNNNKKNKIGDCNSNN 302

Query:   490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
             S N N  + N +N+N+  N  +  +  +  + N  +  N   +N ++    +        
Sbjct:   303 SNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNN 362

Query:   550 LVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNI 599
                 N+IN N  + N +N+N+  N  +   + ++++ N     N I +N+
Sbjct:   363 NKNNNNINNNNNNNN-NNNNNINNNNNNNSINNIINNNNNFNNNNINNNL 411


>UNIPROTKB|F1LLW8 [details] [associations]
            symbol:Ids "Protein Ids" species:10116 "Rattus norvegicus"
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 RGD:1560491 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            GeneTree:ENSGT00640000091539 OMA:CREGRNL IPI:IPI00569342
            Ensembl:ENSRNOT00000042925 ArrayExpress:F1LLW8 Uniprot:F1LLW8
        Length = 544

 Score = 149 (57.5 bits), Expect = 7.6e-07, P = 7.6e-07
 Identities = 45/137 (32%), Positives = 71/137 (51%)

Query:    33 RIMAFAVLPLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
             R ++F++L   F +++V      S+    +I+ I+ DDL    +G +G   + +PNID L
Sbjct:     2 RQLSFSLLLGFFCIALVSAAQGNSATDALNILLIIVDDLR-PSLGCYGDKLVRSPNIDQL 60

Query:    92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYL 149
             A   I+ +N +  Q +C PSR + +TG+ P  T +   N  +    G        +PQY 
Sbjct:    61 ASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHSGNF----STIPQYF 116

Query:   150 KELGYRTRIVGK-WHLG 165
             KE GY T  VGK +H G
Sbjct:   117 KENGYVTMSVGKVFHPG 133


>TIGR_CMR|SPO_1083 [details] [associations]
            symbol:SPO_1083 "choline sulfatase" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
            InterPro:IPR017785 InterPro:IPR025863 Pfam:PF12411
            TIGRFAMs:TIGR03417 ProtClustDB:CLSK864791 GO:GO:0047753
            RefSeq:YP_166334.1 ProteinModelPortal:Q5LUH1 GeneID:3195014
            KEGG:sil:SPO1083 PATRIC:23375467 OMA:QEAIILF Uniprot:Q5LUH1
        Length = 502

 Score = 109 (43.4 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
 Identities = 31/105 (29%), Positives = 46/105 (43%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+I+ ++ D L          D +  PN+  LA       N YT   LC P R++ M+G+
Sbjct:     4 PNILILMVDQLNGTLFPDGPADWLHAPNLKRLAARSTRFANAYTASPLCAPGRASFMSGQ 63

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
              P  TG+  N      R  +P        +L+  GY T + GK H
Sbjct:    64 LPSRTGVYDNAAEF--RSDIPT----YAHHLRRAGYYTCLSGKMH 102

 Score = 76 (31.8 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
 Identities = 34/123 (27%), Positives = 57/123 (46%)

Query:   274 EDFKRSKFA--AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNW 331
             ED ++S+ A  A +  LD+ +G+++E LE  R    +II+FVSD               W
Sbjct:   248 EDIRKSRRAYFANISYLDDKLGEILEVLETTRQ--EAIILFVSDHGDMLGERGL-----W 300

Query:   332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYV 389
                  K   +EG  R   ++ +P +E   I  +  V   D  PTL  L+  + ++I  + 
Sbjct:   301 ----FKMNFYEGSARVPLMVAAPGMEPGRI--DTPVSTIDVTPTLGELAGVDMAEIAPWT 354

Query:   390 NST 392
             + T
Sbjct:   355 DGT 357

 Score = 54 (24.1 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query:   677 LFDIKNDPCEKNNLADRSEDQ 697
             LFD+  DP E  NLAD  + Q
Sbjct:   405 LFDLDADPHEMTNLADHPDHQ 425

 Score = 52 (23.4 bits), Expect = 1.3e-06, Sum P(3) = 1.3e-06
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query:   783 LFDIKNDPCEKNNLADRSEVQ 803
             LFD+  DP E  NLAD  + Q
Sbjct:   405 LFDLDADPHEMTNLADHPDHQ 425


>UNIPROTKB|Q3L472 [details] [associations]
            symbol:Sulf2 "Protein Sulf2" species:10116 "Rattus
            norvegicus" [GO:0002063 "chondrocyte development" evidence=IEA]
            [GO:0003094 "glomerular filtration" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0005509 "calcium ion
            binding" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
            signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0032836
            "glomerular basement membrane development" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
            of fibroblast growth factor receptor signaling pathway"
            evidence=IEA] [GO:0048706 "embryonic skeletal system development"
            evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
            [GO:0060384 "innervation" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 RGD:1305078 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005794 GO:GO:0009986 GO:GO:0005509
            GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
            GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
            GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 EMBL:CH474005
            GO:GO:0060384 GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG EMBL:AY742216 IPI:IPI00767654
            RefSeq:NP_001030099.1 UniGene:Rn.4228 STRING:Q3L472
            Ensembl:ENSRNOT00000008478 GeneID:311642 KEGG:rno:311642
            InParanoid:Q3L472 NextBio:663979 Genevestigator:Q3L472
            Uniprot:Q3L472
        Length = 875

 Score = 141 (54.7 bits), Expect = 9.5e-07, Sum P(2) = 9.5e-07
 Identities = 58/223 (26%), Positives = 93/223 (41%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G++  +G     + +++++   +   G+  +   + + D    Y TD+ T ++V   
Sbjct:   155 P--GWKEWVGLLKNSR-FYNYT---LCRNGMKEKHGSDYSTD----YLTDLITNDSVSFF 204

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    + N  +HI
Sbjct:   205 RTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246

 Score = 60 (26.2 bits), Expect = 9.5e-07, Sum P(2) = 9.5e-07
 Identities = 48/231 (20%), Positives = 87/231 (37%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  + + L +   L N+ IV+ +D              + P        +E 
Sbjct:   286 TLMSVDDSMETIYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+  +     +++ D  PT+L  A   DIP  ++   ++I+   ++ 
Sbjct:   338 DIRVPFYVRGPSVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPADMDG--KSILKLLDSE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
               R  N  H     R+   +   E G   +  K E    N   E + PKY+
Sbjct:   394 --RPVNRFHLKKKLRVWRDSFLVERGKLLH--KREGDKVNAQEENFLPKYQ 440


>UNIPROTKB|I3L2I6 [details] [associations]
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
            EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH Ensembl:ENST00000574505
            Uniprot:I3L2I6
        Length = 106

 Score = 124 (48.7 bits), Expect = 1.0e-06, P = 1.0e-06
 Identities = 35/103 (33%), Positives = 57/103 (55%)

Query:    59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
             P + + +LADD G+   G +    I TP++DALA   ++ +N +T V  C+PSR++++TG
Sbjct:     4 PRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTG 62

Query:   118 KHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYRT 156
               P H     N +YG  +     +  +K+  LP  L + G RT
Sbjct:    63 L-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVRT 100


>UNIPROTKB|E1BIY5 [details] [associations]
            symbol:SULF2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0060384 "innervation" evidence=IEA] [GO:0060348 "bone
            development" evidence=IEA] [GO:0048706 "embryonic skeletal system
            development" evidence=IEA] [GO:0040037 "negative regulation of
            fibroblast growth factor receptor signaling pathway" evidence=IEA]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=IEA] [GO:0032836 "glomerular basement
            membrane development" evidence=IEA] [GO:0030201 "heparan sulfate
            proteoglycan metabolic process" evidence=IEA] [GO:0030177 "positive
            regulation of Wnt receptor signaling pathway" evidence=IEA]
            [GO:0014846 "esophagus smooth muscle contraction" evidence=IEA]
            [GO:0010575 "positive regulation vascular endothelial growth factor
            production" evidence=IEA] [GO:0009986 "cell surface" evidence=IEA]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
            "arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
            filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
            evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
            GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
            GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
            GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0030201 KO:K14607
            GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 OMA:PKYYGQG EMBL:DAAA02036810 IPI:IPI00698144
            RefSeq:NP_001179867.1 UniGene:Bt.90452 ProteinModelPortal:E1BIY5
            Ensembl:ENSBTAT00000009852 GeneID:533264 KEGG:bta:533264
            NextBio:20875979 Uniprot:E1BIY5
        Length = 862

 Score = 143 (55.4 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 62/225 (27%), Positives = 89/225 (39%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
                G++  +G     + Y +      +K   G D  +D          Y TD+ T ++V 
Sbjct:   155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202

Query:   232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                         P+ + L+HAA H      P Q    + N  +HI
Sbjct:   203 FFRASKKMYPHRPVLMVLSHAAPHGPEDSAP-QYSSLFPNASQHI 246

 Score = 57 (25.1 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 48/233 (20%), Positives = 89/233 (38%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSSLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  +   L +   L N+ IV+ +D              + P        +E 
Sbjct:   286 TLLSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+  +     +++ D  PT+L  A   DIP+ ++   ++I+   +  
Sbjct:   338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDTE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGT--HEYNP-KYENRYENGTHEYNPKYE 453
               R  N  H     R+   +   E G   H+ +  K + + EN    + PKY+
Sbjct:   394 --RPANRFHLKKKLRVWRDSFLVERGKLLHKRDSDKVDAQEEN----FLPKYQ 440


>UNIPROTKB|G3XAE6 [details] [associations]
            symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0009986 "cell surface"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005794 EMBL:CH471077
            GO:GO:0009986 GO:GO:0005509 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AL034418
            InterPro:IPR024609 Pfam:PF12548 EMBL:AL354813 UniGene:Hs.162016
            HGNC:HGNC:20392 EMBL:AL121777 ProteinModelPortal:G3XAE6 SMR:G3XAE6
            PRIDE:G3XAE6 Ensembl:ENST00000361612 ArrayExpress:G3XAE6
            Bgee:G3XAE6 Uniprot:G3XAE6
        Length = 852

 Score = 139 (54.0 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 61/225 (27%), Positives = 89/225 (39%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
                G++  +G     + Y +      +K   G D  +D          Y TD+ T ++V 
Sbjct:   155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSKD----------YLTDLITNDSVS 202

Query:   232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                         P+ + ++HAA H      P Q    + N  +HI
Sbjct:   203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246

 Score = 61 (26.5 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 51/233 (21%), Positives = 87/233 (37%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  +   L +   L N+ IV+ +D              + P        +E 
Sbjct:   286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+ G +    V   D  PT+L  A   DIP  ++   ++I+   +  
Sbjct:   338 DIRVPFYVRGPNVEA-GCLNPHIVLNIDLAPTILDIAGL-DIPADMDG--KSILKLLDTE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGTHEYNPKYE 453
               R  N  H     R+   +   E G   H+  N K + + EN    + PKY+
Sbjct:   394 --RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN----FLPKYQ 440


>UNIPROTKB|Q8IWU5 [details] [associations]
            symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
            "Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005795 "Golgi stack" evidence=IEA] [GO:0004065 "arylsulfatase
            activity" evidence=IMP;IDA] [GO:0005615 "extracellular space"
            evidence=NAS] [GO:0009986 "cell surface" evidence=IDA] [GO:0030201
            "heparan sulfate proteoglycan metabolic process" evidence=IDA;NAS]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=IDA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=IDA;IMP] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0002063 "chondrocyte development" evidence=ISS]
            [GO:0014846 "esophagus smooth muscle contraction" evidence=ISS]
            [GO:0035860 "glial cell-derived neurotrophic factor receptor
            signaling pathway" evidence=ISS] [GO:0048706 "embryonic skeletal
            system development" evidence=ISS] [GO:0051216 "cartilage
            development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
            [GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
            factor receptor signaling pathway" evidence=ISS] [GO:0003094
            "glomerular filtration" evidence=ISS] [GO:0032836 "glomerular
            basement membrane development" evidence=ISS] [GO:0001822 "kidney
            development" evidence=ISS] [GO:0060348 "bone development"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 EMBL:AY101176 GO:GO:0005783 GO:GO:0005886
            EMBL:CH471077 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
            GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GO:GO:0048706 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
            GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 EMBL:AL034418 KO:K14607
            HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609
            Pfam:PF12548 EMBL:AB033073 EMBL:AY358461 EMBL:CR749319
            EMBL:AL354813 EMBL:BC020962 EMBL:BC110539 EMBL:AL133001
            IPI:IPI00297252 IPI:IPI00555879 RefSeq:NP_001155313.1
            RefSeq:NP_061325.1 RefSeq:NP_940998.2 UniGene:Hs.162016
            ProteinModelPortal:Q8IWU5 SMR:Q8IWU5 IntAct:Q8IWU5 STRING:Q8IWU5
            PhosphoSite:Q8IWU5 DMDM:33112446 PaxDb:Q8IWU5 PRIDE:Q8IWU5
            DNASU:55959 Ensembl:ENST00000359930 Ensembl:ENST00000467815
            Ensembl:ENST00000484875 GeneID:55959 KEGG:hsa:55959 UCSC:uc002xto.3
            UCSC:uc002xtr.3 CTD:55959 GeneCards:GC20M046285 H-InvDB:HIX0027735
            HGNC:HGNC:20392 HPA:HPA002325 MIM:610013 neXtProt:NX_Q8IWU5
            PharmGKB:PA134902131 InParanoid:Q8IWU5 OMA:PKYYGQG
            OrthoDB:EOG49KFPX PhylomeDB:Q8IWU5 GenomeRNAi:55959 NextBio:61367
            ArrayExpress:Q8IWU5 Bgee:Q8IWU5 CleanEx:HS_SULF2
            Genevestigator:Q8IWU5 GermOnline:ENSG00000196562 Uniprot:Q8IWU5
        Length = 870

 Score = 139 (54.0 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
 Identities = 61/225 (27%), Positives = 89/225 (39%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
                G++  +G     + Y +      +K   G D  +D          Y TD+ T ++V 
Sbjct:   155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSKD----------YLTDLITNDSVS 202

Query:   232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                         P+ + ++HAA H      P Q    + N  +HI
Sbjct:   203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246

 Score = 61 (26.5 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
 Identities = 51/233 (21%), Positives = 87/233 (37%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  +   L +   L N+ IV+ +D              + P        +E 
Sbjct:   286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+ G +    V   D  PT+L  A   DIP  ++   ++I+   +  
Sbjct:   338 DIRVPFYVRGPNVEA-GCLNPHIVLNIDLAPTILDIAGL-DIPADMDG--KSILKLLDTE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGTHEYNPKYE 453
               R  N  H     R+   +   E G   H+  N K + + EN    + PKY+
Sbjct:   394 --RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN----FLPKYQ 440


>DICTYBASE|DDB_G0282469 [details] [associations]
            symbol:gnt13 "putative
            beta-1,3-N-acetylglucosaminyltransferase" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003674 "molecular_function" evidence=ND] [GO:0016021 "integral
            to membrane" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            dictyBase:DDB_G0282469 GO:GO:0016021 EMBL:AAFI02000047
            GenomeReviews:CM000152_GR RefSeq:XP_640067.1
            EnsemblProtists:DDB0231851 GeneID:8623596 KEGG:ddi:DDB_G0282469
            eggNOG:NOG279004 InParanoid:Q54SH2 ProtClustDB:CLSZ2430453
            Uniprot:Q54SH2
        Length = 635

 Score = 147 (56.8 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 39/153 (25%), Positives = 62/153 (40%)

Query:   421 NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH 480
             N+++ + N  +      EN Y    +  N   +N Y N  +  N   N N N    N  +
Sbjct:   277 NTDSEFNNINYNMENLNENEYLKNINNNNNNNDNNYNNNNNNNNNNNNNNNNNNNNNNNN 336

Query:   481 EYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 540
               N     N+ N N  + N + DN+  N+ID ID       N     N I  NI++   I
Sbjct:   337 NNNNNNNNNN-NNNNNNNNNNIDNNIDNKIDNID-------NNIDNNNNI-DNINNNNNI 387

Query:   541 SALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
             + +          N+ N N  + N +N+N+  N
Sbjct:   388 NNIDNNNSNYNDNNNNNNNNNNNNNNNNNNNNN 420


>DICTYBASE|DDB_G0287637 [details] [associations]
            symbol:mybD "myb domain-containing protein"
            species:44689 "Dictyostelium discoideum" [GO:0003677 "DNA binding"
            evidence=IEA;ISS] [GO:0003682 "chromatin binding" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR001005 InterPro:IPR009057 Pfam:PF00249
            SMART:SM00717 dictyBase:DDB_G0287637 GO:GO:0005634 GO:GO:0006355
            GO:GO:0003677 GO:GO:0006351 GO:GO:0003682 GenomeReviews:CM000154_GR
            EMBL:AAFI02000103 Gene3D:1.10.10.60 SUPFAM:SSF46689
            InterPro:IPR017930 PROSITE:PS51294 RefSeq:XP_637145.1
            ProteinModelPortal:Q54K19 EnsemblProtists:DDB0220512 GeneID:8626240
            KEGG:ddi:DDB_G0287637 eggNOG:NOG321969 OMA:HNNYINH
            ProtClustDB:CLSZ2846665 Uniprot:Q54K19
        Length = 595

 Score = 146 (56.5 bits), Expect = 1.9e-06, P = 1.9e-06
 Identities = 45/191 (23%), Positives = 84/191 (43%)

Query:   384 DIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN-RYE 442
             D+ +  N+   NI     NSI  YEN  +    P+  N N +Y++   + N  +++   +
Sbjct:    16 DLSDNYNNNNSNINTNNNNSINDYENQNNGLVVPQ-SNQNQQYQD---DQNDSFDDDSMD 71

Query:   443 NGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
              G  + N   +   +N  +  N   +EN N    N +   NI   EN+I+ N  + N +N
Sbjct:    72 EGEEKSNLIIDESQQNSLNNNNN-NSENNNI---NNSENNNINNSENNIHNNNNNNNNNN 127

Query:   503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTS 562
             +N+  N  +  +  +  + N  +  N  ++N ++   I+            N+IN N   
Sbjct:   128 NNNNNNNNNNNNNNNNNNNNNNNNNNNTINNNNNNNNINNNINNNNNNYNNNNINNNNNI 187

Query:   563 ENRSNDNSYQN 573
              N +N+N+  N
Sbjct:   188 NNNNNNNNENN 198

 Score = 134 (52.2 bits), Expect = 3.7e-05, P = 3.7e-05
 Identities = 45/196 (22%), Positives = 75/196 (38%)

Query:   376 LLSAANKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
             ++  + ++ + N  N++  N I   EN+ I   EN  H  N+    N+N    N  +  N
Sbjct:    80 IIDESQQNSLNNNNNNSENNNINNSENNNINNSENNIHNNNNNNNNNNNNNNNNNNNNNN 139

Query:   435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSIN-G 493
                 N   N  +  N    N   N  +  N   N N N    N  +  NI    N+ N  
Sbjct:   140 NNNNNNNNNNNNNNNTINNNNNNNNIN--NNINNNNNNYNNNNINNNNNINNNNNNNNEN 197

Query:   494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
             N  +EN +N+N   N+  G  +      N  +  N   +N ++    +   +        
Sbjct:   198 NNNNENNNNNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNNNKNN----NN 253

Query:   554 NSINGNGTSENRSNDN 569
             N+ N N  + NR  D+
Sbjct:   254 NNNNNNNNNNNRKFDD 269

 Score = 131 (51.2 bits), Expect = 7.8e-05, P = 7.8e-05
 Identities = 43/174 (24%), Positives = 69/174 (39%)

Query:   401 ENSILRYENGTHEYNSPRIENSN-TRYENGTHEYNPKYENRYENGTHEYNPKYENRYENG 459
             +NS+    N +   N    EN+N    EN  H  N    N   N  +  N    N   N 
Sbjct:    86 QNSLNNNNNNSENNNINNSENNNINNSENNIHNNNNNNNNNNNNNNNNNNNNNNNNNNN- 144

Query:   460 THEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
                 N   N N N    N  +  NI    N+IN N  + N +N N+  N I+  +  +  
Sbjct:   145 ----NNNNNNNNNNTINNNNNNNNI---NNNINNNNNNYNNNNINN-NNNINNNNNNNNE 196

Query:   520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
             + N     N   +N ++  +      G+ ++   N+ N N  + N +N+N+  N
Sbjct:   197 NNNNNENNN---NNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNN 247

 Score = 127 (49.8 bits), Expect = 0.00021, P = 0.00021
 Identities = 44/197 (22%), Positives = 77/197 (39%)

Query:   413 EYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTN 472
             + NS    N+N+   N  +  N    N  EN  H  N    N   N  +  N   N N N
Sbjct:    85 QQNSLNNNNNNSENNNINNSENNNINNS-ENNIHNNNNNNNNNNNNNNNNNNNNNNNNNN 143

Query:   473 PRYENGTHEYNIPRLENSINGNGTSEN-RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL 531
                 N  +  N   + N+ N N  + N  +N+N+Y N  + I+  + ++ N  +      
Sbjct:   144 NNNNNNNNNNNT--INNNNNNNNINNNINNNNNNYNN--NNINNNNNINNNNNNNNENNN 199

Query:   532 HNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 591
             +N ++        +     + E  IN N  + N +N+N+  N  +  +  +  + N  + 
Sbjct:   200 NNENNNNNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNNNKNNNNNNNNNN 259

Query:   592 RNTILHNIDDEWQISAL 608
              N      DD+  I  L
Sbjct:   260 NNNNNRKFDDQQIIKDL 276

 Score = 121 (47.7 bits), Expect = 0.00094, P = 0.00094
 Identities = 42/187 (22%), Positives = 67/187 (35%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N ++  N  N+   N      N+     N  +  N+    N+N    N  +  N  Y N 
Sbjct:   120 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTINNNNNNNNINNNINNNNNNYNNN 179

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
               N  +  N    N  EN  +      NEN N   EN       P  E  IN N  + N 
Sbjct:   180 NINNNNNINNNNNNNNENNNN------NENNNNNNENNNKFIGSPMGEPQINNNNNNNNN 233

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             +N+N+  N  +  +  +  + N  +  N      DD+  I  L     +  K N +    
Sbjct:   234 NNNNNNNNNNNNNNNKNNNNNNNNNNNNNNNRKFDDQQIIKDLENRLKEAKKTNQLLDEK 293

Query:   561 TSENRSN 567
              ++ + N
Sbjct:   294 CNQLKKN 300


>UNIPROTKB|Q6MX51 [details] [associations]
            symbol:Rv0296c "Sulfatase" species:83332 "Mycobacterium
            tuberculosis H37Rv" [GO:0004065 "arylsulfatase activity"
            evidence=IDA] [GO:0005618 "cell wall" evidence=IDA] [GO:0046872
            "metal ion binding" evidence=IDA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005618
            GenomeReviews:AL123456_GR GO:GO:0046872 EMBL:BX842573
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 KO:K01567 EMBL:CP003248
            PIR:F70837 RefSeq:YP_006513622.1 RefSeq:YP_177712.1
            ProteinModelPortal:Q6MX51 SMR:Q6MX51 PRIDE:Q6MX51
            EnsemblBacteria:EBMYCT00000002598 GeneID:13316285 GeneID:886600
            KEGG:mtu:Rv0296c KEGG:mtv:RVBD_0296c PATRIC:18149150
            TubercuList:Rv0296c HOGENOM:HOG000045150 OMA:DPGMAEP
            ProtClustDB:CLSK799699 Uniprot:Q6MX51
        Length = 465

 Score = 133 (51.9 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
 Identities = 36/104 (34%), Positives = 56/104 (53%)

Query:    72 WNDVG-FHGLDQIP---TPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQ 126
             W+D+G + G+   P   +P +D LA  GI+  + + T  LCTPSR ++ TG++P   G+ 
Sbjct:    18 WHDLGRYLGVYHHPDVYSPRLDRLAAEGILFTRAHATAPLCTPSRGSLFTGRYPQSNGLV 77

Query:   127 HNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
                 +G E R G+    + LPQ L E G+ + + G  H   Y K
Sbjct:    78 GLAHHGWEYRTGV----QTLPQLLSESGWYSALFGMQHETSYPK 117

 Score = 58 (25.5 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
 Identities = 17/55 (30%), Positives = 25/55 (45%)

Query:   636 YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
             Y+   + R  L L        A +   P+ + P  PQ    L+D++ DP E NNL
Sbjct:   340 YIENYAPRPLLDLPWDIQESPAGMAVAPLVKAP-RPQRE--LYDLRADPTETNNL 391

 Score = 52 (23.4 bits), Expect = 7.8e-06, Sum P(2) = 7.8e-06
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query:   763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
             A +   P+ + P  PQ    L+D++ DP E NNL
Sbjct:   361 AGMAVAPLVKAP-RPQRE--LYDLRADPTETNNL 391


>CGD|CAL0006287 [details] [associations]
            symbol:SHE3 species:5476 "Candida albicans" [GO:0008298
            "intracellular mRNA localization" evidence=IMP] [GO:0003729 "mRNA
            binding" evidence=IDA] [GO:0001897 "cytolysis by symbiont of host
            cells" evidence=IMP] [GO:0030447 "filamentous growth" evidence=IMP]
            [GO:0009267 "cellular response to starvation" evidence=IMP]
            [GO:0071216 "cellular response to biotic stimulus" evidence=IMP]
            [GO:0005934 "cellular bud tip" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0036170 "filamentous growth of a
            population of unicellular organisms in response to starvation"
            evidence=IMP] [GO:0036180 "filamentous growth of a population of
            unicellular organisms in response to biotic stimulus" evidence=IMP]
            [GO:0048309 "endoplasmic reticulum inheritance" evidence=IEA]
            [GO:0007533 "mating type switching" evidence=IEA] CGD:CAL0006287
            GO:GO:0071216 GO:GO:0036180 GO:GO:0005789 GO:GO:0003729
            GO:GO:0009267 GO:GO:0036170 GO:GO:0008298 GO:GO:0051028
            EMBL:AACQ01000034 EMBL:AACQ01000033 RefSeq:XP_719156.1
            RefSeq:XP_719272.1 GeneID:3639162 GeneID:3639277
            KEGG:cal:CaO19.13040 KEGG:cal:CaO19.5595 eggNOG:NOG245845
            GO:GO:0001897 Uniprot:Q5ABV6
        Length = 519

 Score = 145 (56.1 bits), Expect = 1.9e-06, P = 1.9e-06
 Identities = 39/126 (30%), Positives = 61/126 (48%)

Query:   409 NGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
             NG +  N+ R  NS ++R +NG H  N  Y++R ++G H   P  +N   N  + YN   
Sbjct:   390 NGNNNINNHRRNNSVDSRSDNGQHRRNNSYDSRSDHGQHRRQPSQQNNNYNNNN-YNNNN 448

Query:   468 NENTNPRYENG--THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
             N N N    NG      ++  + N  N NG + N  N N++ N+      ++  + N  S
Sbjct:   449 NNNNN-NSNNGFVKRSGSVRNVNNYNNNNGNANN--NGNNHGNKSKRRSTYNN-NNNNNS 504

Query:   526 KRNTIL 531
             KRN+ L
Sbjct:   505 KRNSQL 510


>UNIPROTKB|Q5ABV6 [details] [associations]
            symbol:SHE3 "SWI5-dependent HO expression protein 3"
            species:237561 "Candida albicans SC5314" [GO:0001897 "cytolysis by
            symbiont of host cells" evidence=IMP] [GO:0003729 "mRNA binding"
            evidence=IDA] [GO:0008298 "intracellular mRNA localization"
            evidence=IMP] [GO:0009267 "cellular response to starvation"
            evidence=IMP] [GO:0030447 "filamentous growth" evidence=IMP]
            [GO:0036170 "filamentous growth of a population of unicellular
            organisms in response to starvation" evidence=IMP] [GO:0036180
            "filamentous growth of a population of unicellular organisms in
            response to biotic stimulus" evidence=IMP] [GO:0071216 "cellular
            response to biotic stimulus" evidence=IMP] CGD:CAL0006287
            GO:GO:0071216 GO:GO:0036180 GO:GO:0005789 GO:GO:0003729
            GO:GO:0009267 GO:GO:0036170 GO:GO:0008298 GO:GO:0051028
            EMBL:AACQ01000034 EMBL:AACQ01000033 RefSeq:XP_719156.1
            RefSeq:XP_719272.1 GeneID:3639162 GeneID:3639277
            KEGG:cal:CaO19.13040 KEGG:cal:CaO19.5595 eggNOG:NOG245845
            GO:GO:0001897 Uniprot:Q5ABV6
        Length = 519

 Score = 145 (56.1 bits), Expect = 1.9e-06, P = 1.9e-06
 Identities = 39/126 (30%), Positives = 61/126 (48%)

Query:   409 NGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
             NG +  N+ R  NS ++R +NG H  N  Y++R ++G H   P  +N   N  + YN   
Sbjct:   390 NGNNNINNHRRNNSVDSRSDNGQHRRNNSYDSRSDHGQHRRQPSQQNNNYNNNN-YNNNN 448

Query:   468 NENTNPRYENG--THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
             N N N    NG      ++  + N  N NG + N  N N++ N+      ++  + N  S
Sbjct:   449 NNNNN-NSNNGFVKRSGSVRNVNNYNNNNGNANN--NGNNHGNKSKRRSTYNN-NNNNNS 504

Query:   526 KRNTIL 531
             KRN+ L
Sbjct:   505 KRNSQL 510


>ZFIN|ZDB-GENE-030131-9242 [details] [associations]
            symbol:sulf1 "sulfatase 1" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0009986 "cell
            surface" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-9242
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 GeneTree:ENSGT00400000022041
            HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
            InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK EMBL:CR385071
            EMBL:CR382282 EMBL:AY332604 IPI:IPI00509599 RefSeq:NP_001003846.1
            UniGene:Dr.81473 Ensembl:ENSDART00000056081 GeneID:337298
            KEGG:dre:337298 InParanoid:Q6EFA1 NextBio:20812164 Uniprot:Q6EFA1
        Length = 1099

 Score = 136 (52.9 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
 Identities = 57/223 (25%), Positives = 93/223 (41%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II I+ DD    DV    L Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    42 PNIILIMTDD---QDVELGSL-QVMNKTRKIMEDGGTSFTNAFVTTPMCCPSRSSMLTGK 97

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:    98 Y-VHN---HNTYTNNENCSSPSWQAQHEPRSFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 152

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   +G     + +++++       G   +   + A D    Y TD+ T ++++  
Sbjct:   153 P--GWREWVGLIKNSR-FYNYTVCRN---GNKEKHGADYAKD----YFTDLITNDSINYF 202

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q  + + N  +HI
Sbjct:   203 RTSKRMFPHRPVMMVISHAAPHGPEDSAP-QYSELFPNASQHI 244

 Score = 63 (27.2 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query:   269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             IH    ++   K    L  +D+SV KV  AL     L N+ I++ +D
Sbjct:   269 IHMEFTNYLHRKRLQTLMSVDDSVEKVYNALVDTGELDNTYIIYTAD 315

 Score = 52 (23.4 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
 Identities = 18/87 (20%), Positives = 34/87 (39%)

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
             +G  +  P+Y   + N +    P Y    N  ++   +Y GP     +  + N  H   +
Sbjct:   224 HGPEDSAPQYSELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKP-IHMEFTNYLHRKRL 282

Query:   485 PRL---ENSING--NGTSENRSNDNSY 506
               L   ++S+    N   +    DN+Y
Sbjct:   283 QTLMSVDDSVEKVYNALVDTGELDNTY 309

 Score = 47 (21.6 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query:   537 EWQISALTRGKWKLVK-ENSINGNGTSENRS-NDNSYQN 573
             +W       GKW+L K + S+        RS    SY N
Sbjct:   461 KWHCVEEVSGKWRLQKCKGSLKEGSKKRTRSLRSRSYDN 499


>DICTYBASE|DDB_G0280253 [details] [associations]
            symbol:DDB_G0280253 "putative GTPase activating
            protein (GAP)" species:44689 "Dictyostelium discoideum" [GO:0032851
            "positive regulation of Rab GTPase activity" evidence=IEA]
            [GO:0032313 "regulation of Rab GTPase activity" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0005097 "Rab GTPase
            activator activity" evidence=IEA] [GO:0043547 "positive regulation
            of GTPase activity" evidence=IEA] [GO:0005096 "GTPase activator
            activity" evidence=IEA] InterPro:IPR000195 Pfam:PF00566
            PROSITE:PS50086 SMART:SM00164 dictyBase:DDB_G0280253
            GenomeReviews:CM000152_GR GO:GO:0005622 EMBL:AAFI02000035
            eggNOG:COG5210 GO:GO:0005097 GO:GO:0032851 SUPFAM:SSF47923
            RefSeq:XP_641332.1 ProteinModelPortal:Q54VM3
            EnsemblProtists:DDB0235314 GeneID:8622466 KEGG:ddi:DDB_G0280253
            InParanoid:Q54VM3 OMA:ISHDISR ProtClustDB:CLSZ2846777
            Uniprot:Q54VM3
        Length = 1173

 Score = 149 (57.5 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 52/205 (25%), Positives = 89/205 (43%)

Query:   380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRI---ENSNT-RYENGTHEYNP 435
             +N S+  N  N+ + N   R  N+     N    Y    +   EN+++  Y +  + ++ 
Sbjct:    74 SNNSNNSNNNNNNINNNNNRNNNNFNNNNNNNVNYFEQDVDFGENAHSSNYGDNNNIFSD 133

Query:   436 KYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
             +  N Y N  +  +    N Y+N  + YN   NEN N  Y N  +  N     N+ N N 
Sbjct:   134 E-SNNYNNNNNNNDYNNNNYYDN--NNYNENYNENYNENYNNNNNNNNNNNNNNN-NNNN 189

Query:   496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
              + N +N+N+Y NE +           +  ++N   +N ++E+ I+           +NS
Sbjct:   190 NNNNNNNNNNYYNENNN---------QQQLQQNYSNNNYNNEY-INNFNNN------DNS 233

Query:   556 INGNGTSENR-SNDNSYQNEIDGID 579
              N N  + N  SN N+Y N  +G D
Sbjct:   234 YNNNNNNNNNNSNFNNYNNNNNGYD 258

 Score = 135 (52.6 bits), Expect = 6.9e-05, P = 6.9e-05
 Identities = 37/137 (27%), Positives = 54/137 (39%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N  D  NY  +  EN    Y N+     N  +  N+    N+N    N  +  N   +  
Sbjct:   151 NYYDNNNYNENYNENYNENYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNYYNENNNQQQL 210

Query:   441 YEN-GTHEYNPKYENRYENGTHEYNGPKNENTNP----RYENGTHEYNIPRLENSINGN- 494
              +N   + YN +Y N + N  + YN   N N N      Y N  + Y+     NS N N 
Sbjct:   211 QQNYSNNNYNNEYINNFNNNDNSYNNNNNNNNNNSNFNNYNNNNNGYD-NSYSNSNNNNY 269

Query:   495 --GTSENRSNDNSYQNE 509
                ++ N  NDN Y  +
Sbjct:   270 YDNSNNNSKNDNQYNQQ 286


>DICTYBASE|DDB_G0267636 [details] [associations]
            symbol:mybM "putative myb transcription factor"
            species:44689 "Dictyostelium discoideum" [GO:0003682 "chromatin
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001005
            InterPro:IPR009057 SMART:SM00717 dictyBase:DDB_G0267636
            GO:GO:0005634 GenomeReviews:CM000150_GR GO:GO:0006355 GO:GO:0003677
            EMBL:AAFI02000003 GO:GO:0006351 GO:GO:0003682 Gene3D:1.10.10.60
            SUPFAM:SSF46689 InterPro:IPR017930 PROSITE:PS51294
            InterPro:IPR017877 PROSITE:PS50090 HSSP:P06876 RefSeq:XP_647181.1
            ProteinModelPortal:Q55GK3 EnsemblProtists:DDB0220517 GeneID:8615985
            KEGG:ddi:DDB_G0267636 eggNOG:NOG244606 OMA:KRICKRT Uniprot:Q55GK3
        Length = 669

 Score = 146 (56.5 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 47/209 (22%), Positives = 85/209 (40%)

Query:   365 QYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNT 424
             +Y+ ++    ++L   N+S++ + +NS+  N      N  L+Y+    +    + +    
Sbjct:   218 RYLQLTGKGGSILPPLNQSNVSS-LNSSSANTF----NQQLQYQQQQQQQQQQQQQQQQQ 272

Query:   425 RYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
             + +   + YN  Y N   N  + YN  + N+  N  + +N   N        N  + YN 
Sbjct:   273 QQQQMNNNYNNNYNNNNNNINNNYNNNHNNQNNNNNNNHNHYNNHYNQMNNNNNNNHYN- 331

Query:   485 PRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 544
                 N+ N N    N +N+N YQ   +     S  + N  S   + L +I D    S   
Sbjct:   332 ----NNNNNNNNINNNNNNNMYQMNNNN----SNSNNNNKSHNLSPLSSIIDSNTSSPSF 383

Query:   545 RGKWKLVKENSINGNGTSENRSNDNSYQN 573
              G       N+ N N  + N +N+N+  N
Sbjct:   384 EGCEDNNNNNNNNNNNNNNNNNNNNNNNN 412

 Score = 104 (41.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 35/133 (26%), Positives = 52/133 (39%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N ++I N  N+   N      N+   Y N  H YN     N+N  Y N  +  N    N 
Sbjct:   288 NNNNINNNYNNNHNNQNNNNNNNHNHYNN--H-YNQMNNNNNNNHYNNNNNNNN-NINNN 343

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNE----NTN-PRYENGTHEYNIPRLENSINGNG 495
               N  ++ N    N   N       P +     NT+ P +E G  + N     N+ N N 
Sbjct:   344 NNNNMYQMNNNNSNSNNNNKSHNLSPLSSIIDSNTSSPSFE-GCEDNNNNNNNNNNNNNN 402

Query:   496 TSENRSNDNSYQN 508
              + N +N+N+  N
Sbjct:   403 NNNNNNNNNNSNN 415

 Score = 74 (31.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 32/116 (27%), Positives = 52/116 (44%)

Query:   490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
             SIN    SEN +N+N+  N+ D I V      NEP  +       + E+    + + K  
Sbjct:   503 SINNIIDSENNNNNNN--NDNDNIKVEDN-GCNEPVMKKV---RSNGEFYYQPI-KNKLN 555

Query:   550 LVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
                 N+ N N  + N +N+N+  N  +G +  S  S N  S  + +L  + +  QI
Sbjct:   556 NNNNNNNNNNNNNNNNNNNNNNNNNNNGNNTLSYNSDN--SSDDDMLPKLKNNKQI 609


>UNIPROTKB|P15586 [details] [associations]
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
            evidence=IEA] [GO:0006027 "glycosaminoglycan catabolic process"
            evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IDA] [GO:0005975 "carbohydrate metabolic process"
            evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
            evidence=TAS] [GO:0042339 "keratan sulfate metabolic process"
            evidence=TAS] [GO:0042340 "keratan sulfate catabolic process"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0044281 "small molecule metabolic process" evidence=TAS]
            [GO:0005515 "protein binding" evidence=IPI] Reactome:REACT_11123
            Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR012251
            InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036666 Reactome:REACT_116125 GO:GO:0046872
            GO:GO:0005975 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 CTD:2799
            HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:Z12173 EMBL:AK223484
            EMBL:AC025262 EMBL:BC012482 IPI:IPI00012102 PIR:S27164
            RefSeq:NP_002067.1 UniGene:Hs.334534 ProteinModelPortal:P15586
            SMR:P15586 IntAct:P15586 STRING:P15586 PhosphoSite:P15586
            DMDM:232126 PaxDb:P15586 PeptideAtlas:P15586 PRIDE:P15586
            DNASU:2799 Ensembl:ENST00000258145 GeneID:2799 KEGG:hsa:2799
            UCSC:uc001ssf.3 GeneCards:GC12M065107 H-InvDB:HIX0010785
            HGNC:HGNC:4422 HPA:CAB026011 HPA:HPA013695 MIM:252940 MIM:607664
            neXtProt:NX_P15586 Orphanet:79272 PharmGKB:PA28802
            InParanoid:P15586 PhylomeDB:P15586 BioCyc:MetaCyc:HS06046-MONOMER
            BRENDA:3.1.6.14 SABIO-RK:P15586 ChiTaRS:GNS GenomeRNAi:2799
            NextBio:11033 ArrayExpress:P15586 Bgee:P15586 CleanEx:HS_GNS
            Genevestigator:P15586 GermOnline:ENSG00000135677 Uniprot:P15586
        Length = 552

 Score = 141 (54.7 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 65/228 (28%), Positives = 98/228 (42%)

Query:    36 AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYS 94
             A  +L L   L  VF  + A +  P+++ +L DD    D    G+   P     AL    
Sbjct:    25 ALLLLVLGGCLG-VF-GVAAGTRRPNVVLLLTDD---QDEVLGGMT--PLKKTKALIGEM 77

Query:    95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLK 150
             G+   + Y    LC PSR++I+TGK+P +  + +N L G C  +    + E    P  L+
Sbjct:    78 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 137

Query:   151 EL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEEMKMWGLD 205
              + GY+T   GK     Y  EY  P   G E   LG  YW   +    +    + + G  
Sbjct:   138 SMCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKA 192

Query:   206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
              +     + D    Y TDV    ++D +   S  EP F+ +A  A HS
Sbjct:   193 RKHGENYSVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHS 236

 Score = 51 (23.0 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   290 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 326


>DICTYBASE|DDB_G0282835 [details] [associations]
            symbol:srfB "putative MADS-box transcription factor"
            species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=ISS] [GO:0019933
            "cAMP-mediated signaling" evidence=IMP] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA;ISS] [GO:0003700
            "sequence-specific DNA binding transcription factor activity"
            evidence=ISS] [GO:0046983 "protein dimerization activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR002100 Pfam:PF00319 PRINTS:PR00404 PROSITE:PS00350
            PROSITE:PS50066 SMART:SM00432 dictyBase:DDB_G0282835 GO:GO:0005634
            EMBL:AAFI02000047 GenomeReviews:CM000152_GR GO:GO:0019933
            GO:GO:0043565 GO:GO:0003700 GO:GO:0006351 eggNOG:COG5068
            SUPFAM:SSF55455 HSSP:P11831 ProtClustDB:CLSZ2430546
            RefSeq:XP_639351.1 ProteinModelPortal:Q54RY6 SMR:Q54RY6
            PRIDE:Q54RY6 EnsemblProtists:DDB0220492 GeneID:8623792
            KEGG:ddi:DDB_G0282835 OMA:WSTASSC Uniprot:Q54RY6
        Length = 467

 Score = 117 (46.2 bits), Expect = 3.0e-06, Sum P(2) = 3.0e-06
 Identities = 37/137 (27%), Positives = 57/137 (41%)

Query:   375 TLLSAA-NKSDIP--NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
             TL+ A  N  D+P  +  +    N      N+     N ++  N+    N NT   NG +
Sbjct:   108 TLIQACLNTPDVPPVSKDDGNNNNGNNSNNNNNSNNNNSSNNNNNGNNNNGNTNNNNGNN 167

Query:   432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
               N    N   N  +  N  Y N   N  +  N   N N N + E   +  N  + +N+I
Sbjct:   168 N-NSNNNNSGNNNNNNNNNSYNNNNNNNNNNNNN--NNNNNCKEEQNMNIPNERKSKNNI 224

Query:   492 NGNGTSENRSNDNSYQN 508
             N N  ++N +N N+ QN
Sbjct:   225 NNNNNNQN-NNQNNNQN 240

 Score = 73 (30.8 bits), Expect = 3.0e-06, Sum P(2) = 3.0e-06
 Identities = 27/85 (31%), Positives = 36/85 (42%)

Query:   491 INGNGTSENRSNDNSYQN--EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
             INGNG + N +N+N+  N  E   + + S    N  S  N   +N       +  T    
Sbjct:   316 INGNGMNGNNNNNNNSNNIPEYGQVIIQSYRGSN--SGGNNSSNNTSTNTNTNTNTNTNN 373

Query:   549 KLVKENSINGNGTSENRSNDNSYQN 573
                  NS NGN  S N SN+   QN
Sbjct:   374 NNNNSNSSNGNN-SNNNSNNILPQN 397


>MGI|MGI:1919293 [details] [associations]
            symbol:Sulf2 "sulfatase 2" species:10090 "Mus musculus"
            [GO:0001822 "kidney development" evidence=IGI] [GO:0002063
            "chondrocyte development" evidence=IMP] [GO:0003094 "glomerular
            filtration" evidence=IGI] [GO:0003824 "catalytic activity"
            evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO]
            [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005615
            "extracellular space" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0006790 "sulfur compound metabolic process" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
            [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
            [GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
            regulation vascular endothelial growth factor production"
            evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
            evidence=IGI] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0030177 "positive regulation of Wnt receptor signaling pathway"
            evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
            process" evidence=ISO;IMP] [GO:0032836 "glomerular basement
            membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
            neurotrophic factor receptor signaling pathway" evidence=IDA]
            [GO:0040037 "negative regulation of fibroblast growth factor
            receptor signaling pathway" evidence=IGI] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0048706 "embryonic skeletal system
            development" evidence=IGI] [GO:0051216 "cartilage development"
            evidence=IMP] [GO:0060348 "bone development" evidence=IGI]
            [GO:0060384 "innervation" evidence=IGI] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 MGI:MGI:1919293 GO:GO:0005783
            GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
            GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
            GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
            GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
            GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
            GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:55959 OMA:PKYYGQG
            OrthoDB:EOG49KFPX EMBL:AY101177 EMBL:AK008108 EMBL:AK028874
            EMBL:AK034712 EMBL:AK036685 EMBL:AK049170 EMBL:AK081643
            EMBL:AK133336 EMBL:AK165183 EMBL:AL589873 EMBL:BC027238
            EMBL:BC141086 IPI:IPI00268030 RefSeq:NP_001239507.1
            RefSeq:NP_001239508.1 RefSeq:NP_082348.2 UniGene:Mm.1011
            ProteinModelPortal:Q8CFG0 SMR:Q8CFG0 STRING:Q8CFG0
            PhosphoSite:Q8CFG0 PRIDE:Q8CFG0 Ensembl:ENSMUST00000088086
            Ensembl:ENSMUST00000109249 GeneID:72043 KEGG:mmu:72043
            InParanoid:B2RUD5 NextBio:335292 Bgee:Q8CFG0 CleanEx:MM_SULF2
            Genevestigator:Q8CFG0 GermOnline:ENSMUSG00000006800 Uniprot:Q8CFG0
        Length = 875

 Score = 140 (54.3 bits), Expect = 3.1e-06, Sum P(2) = 3.1e-06
 Identities = 58/223 (26%), Positives = 93/223 (41%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II +L DD    DV   G  Q+       +   G    N + T  +C PSRS+I+TGK
Sbjct:    44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99

Query:   119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             + +H    HN     E    P        +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G++  +G     + +++++   +   G+  +   + + D    Y TD+ T ++V   
Sbjct:   155 P--GWKEWVGLLKNSR-FYNYT---LCRNGVKEKHGSDYSTD----YLTDLITNDSVSFF 204

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    + N  +HI
Sbjct:   205 RTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246

 Score = 56 (24.8 bits), Expect = 3.1e-06, Sum P(2) = 3.1e-06
 Identities = 47/231 (20%), Positives = 87/231 (37%)

Query:   234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
             H      P +  L  +A+ H    Y     PD H++         IH    +  + K   
Sbjct:   226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285

Query:   284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
              L  +D+S+  + + L +   L N+ I++ +D              + P        +E 
Sbjct:   286 TLMSVDDSMETIYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMP--------YEF 337

Query:   344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
              +R    +  P +E+  +     +++ D  PT+L  A   DIP  ++   ++I+   ++ 
Sbjct:   338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPADMDG--KSILKLLDSE 393

Query:   404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
               R  N  H     R+   +   E G   +  K E    N   E + PKY+
Sbjct:   394 --RPVNRFHLKKKLRVWRDSFLVERGKLLH--KREGDKVNAQEENFLPKYQ 440


>DICTYBASE|DDB_G0275409 [details] [associations]
            symbol:DDB_G0275409 "RNA-binding region RNP-1
            domain-containing protein" species:44689 "Dictyostelium discoideum"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
            SMART:SM00360 dictyBase:DDB_G0275409 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 EMBL:AAFI02000013
            RefSeq:XP_001134599.1 ProteinModelPortal:Q1ZXL1
            EnsemblProtists:DDB0233346 GeneID:8619946 KEGG:ddi:DDB_G0275409
            eggNOG:NOG288151 InParanoid:Q1ZXL1 OMA:DIKNGYA Uniprot:Q1ZXL1
        Length = 737

 Score = 145 (56.1 bits), Expect = 3.2e-06, P = 3.2e-06
 Identities = 56/208 (26%), Positives = 91/208 (43%)

Query:   377 LSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPK 436
             L+  N ++  N VN + +    R   SI R+ +G +  N+ R  N+N  Y+N  + YN  
Sbjct:   453 LNDTNGNNTDNGVNYSQQRDRSR-SRSIERFRDGRNNRNNFR-NNNNNNYQNNNN-YNRN 509

Query:   437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTN-PRYENGT-HEYNIPRLENS---I 491
               N   N   +YN    NR       YNG  N+  N  RY N   H  N  +  N+    
Sbjct:   510 NINN-SNNNRDYNNSDRNR-----EFYNGNDNDRNNGDRYSNNNRHNINFNKRNNNDRNY 563

Query:   492 NGNGTSENRSNDNSYQNEIDGIDV-WSVLSRNEPSK--RNTILHNIDDEWQISALTRGKW 548
             N N    N +N+N+  +  +G DV ++ ++ N  +   R+    N ++E++ +  T    
Sbjct:   564 NNNNNRFNNNNNNNNNSSNNGRDVDFNGINNNNNNNNYRDDNNFNNNEEFENNRRTYNND 623

Query:   549 KLVKENSINGNGTSENRSND---NSYQN 573
             K    +   G   S + S D   N+Y N
Sbjct:   624 KKRSRSHSRGRSRSRSHSGDRRNNNYNN 651


>DICTYBASE|DDB_G0287645 [details] [associations]
            symbol:DDB_G0287645 "DUF1222 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0287645 EMBL:AAFI02000103 eggNOG:NOG81106
            InterPro:IPR009613 Pfam:PF06762 RefSeq:XP_637151.1
            EnsemblProtists:DDB0238347 GeneID:8626246 KEGG:ddi:DDB_G0287645
            Uniprot:Q54K13
        Length = 771

 Score = 145 (56.1 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 48/190 (25%), Positives = 75/190 (39%)

Query:   403 SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHE 462
             S+L Y   T +     I     RY+  T   N    N  +N  +  N    N   N  + 
Sbjct:   535 SLLEYSPFTTDKPPIYIRAQKYRYKFTTFN-NENINNNNDNNNNNDNNNNNNNNNNNNNN 593

Query:   463 YNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRN 522
              N   N N N    N  +  N     N+ N N  + N SN+N+Y N  +  D     + N
Sbjct:   594 NNNNNNNNNNNNNNNNNNNNN----NNNNNNNNNNNNDSNNNNYSNNNNNND-----NNN 644

Query:   523 EPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEI--DGIDV 580
             + + +N   +N ++    +            N+ N N  + N +NDN+ QN +  D  + 
Sbjct:   645 DNNNKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDNNSQNPVSYDNNEE 704

Query:   581 WSVLSRNEPS 590
                +S NEPS
Sbjct:   705 DRNISTNEPS 714

 Score = 133 (51.9 bits), Expect = 6.7e-05, P = 6.7e-05
 Identities = 38/147 (25%), Positives = 58/147 (39%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N +D  N  N+   N      N+     N  +  N+    N+N    N  +  N    N 
Sbjct:   576 NNNDNNNNNNNNNNN---NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDSNNNN 632

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
             Y N  +  +   +N  +N  +  N   N N N    N  +  N     N+ N N  + N 
Sbjct:   633 YSNNNNNNDNNNDNNNKNNNNNNNNNNNNNNNNNNNNNNNNNN-----NNNNNNNNNNNN 687

Query:   501 SNDNSYQNEI--DGIDVWSVLSRNEPS 525
             +NDN+ QN +  D  +    +S NEPS
Sbjct:   688 NNDNNSQNPVSYDNNEEDRNISTNEPS 714


>UNIPROTKB|F1RZ87 [details] [associations]
            symbol:SGSH "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
            evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            GeneTree:ENSGT00390000013080 EMBL:CU655945
            Ensembl:ENSSSCT00000018675 OMA:LCRAHRA Uniprot:F1RZ87
        Length = 231

 Score = 133 (51.9 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 44/133 (33%), Positives = 69/133 (51%)

Query:    41 PLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
             P+ + L +      A  G   +++ ILADD G+   G +    I TP++DALA   I+ +
Sbjct:     6 PVGWVLLLALGLCCAQGGRRRNVLLILADDGGFES-GAYNNSAITTPHLDALARRSIVFR 64

Query:   100 NYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGY 154
             N +T V  C+PSR++++TG  P H     N +YG  +     +  +++  LP  L   G 
Sbjct:    65 NAFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGV 119

Query:   155 RT--RIVGKWHLG 165
             RT  R  GK H+G
Sbjct:   120 RTGSRHHGKKHVG 132

 Score = 41 (19.5 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query:   239 DEPLFLYLAHAATHSANPYEP 259
             D P FLY+A    H     +P
Sbjct:   173 DRPFFLYVAFHDPHRCGHSQP 193


>RGD|1306654 [details] [associations]
            symbol:Bub3 "budding uninhibited by benzimidazoles 3 homolog (S.
            cerevisiae)" species:10116 "Rattus norvegicus" [GO:0000070 "mitotic
            sister chromatid segregation" evidence=ISO] [GO:0000776
            "kinetochore" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO]
            [GO:0007059 "chromosome segregation" evidence=ISO] [GO:0008608
            "attachment of spindle microtubules to kinetochore" evidence=ISO]
            [GO:0051983 "regulation of chromosome segregation" evidence=ISO]
            [GO:0071173 "spindle assembly checkpoint" evidence=ISO] [GO:0005730
            "nucleolus" evidence=ISO] InterPro:IPR017986 InterPro:IPR001680
            InterPro:IPR015943 Pfam:PF00400 PROSITE:PS50082 PROSITE:PS50294
            SMART:SM00320 RGD:1306654 Gene3D:2.130.10.10 SUPFAM:SSF50978
            HOVERGEN:HBG002942 EMBL:AY325173 IPI:IPI00382243 UniGene:Rn.6897
            ProteinModelPortal:Q7TP72 IntAct:Q7TP72 PRIDE:Q7TP72
            UCSC:RGD:1306654 InParanoid:Q7TP72 ArrayExpress:Q7TP72
            Genevestigator:Q7TP72 Uniprot:Q7TP72
        Length = 628

 Score = 143 (55.4 bits), Expect = 4.2e-06, P = 4.2e-06
 Identities = 57/230 (24%), Positives = 86/230 (37%)

Query:   347 GAGLIWSPLLESRGIVAEQYV--HVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSI 404
             G G   SP  +   ++A Q+   H S+      S  N S+  N  N+   N      NS 
Sbjct:   369 GEGKSGSPKSQKHFLLALQFFIWHNSNSNNNNNSNNNNSNNNNSNNNNNSNNSSS-NNS- 426

Query:   405 LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN 464
                 N ++  NS     SN    N ++  N    N   N ++  N    +   N  +  N
Sbjct:   427 --NSNNSNSNNSSSNSTSNNSNSNNSNSNNSNNNNNNSNNSNSNNSNSNSNNSNNKNSNN 484

Query:   465 GPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
                N N+N    N  +  N     N+ N N  S N +N N+  N        S  S N+ 
Sbjct:   485 NSNNNNSNSNSNNSNNSSNNSSSSNNSNSNNNSNNSNNSNNSSNNSSS----SNNSNNKN 540

Query:   525 SKRNTILHNIDDEWQISALTRGKWKL-VKENSINGNGTSENRSNDNSYQN 573
             +  N   +N ++    S+            +S N N +S N SN+NS  N
Sbjct:   541 NSNNNNSNNNNNSNSSSSNNNSNSNNNSNSSSSNNNSSSNNNSNNNSNNN 590

 Score = 139 (54.0 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 47/190 (24%), Positives = 72/190 (37%)

Query:   409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
             N ++  NS    NSN    N ++  N    N   N T   +    +   N  +  N   N
Sbjct:   405 NNSNNNNSNNNNNSNNSSSNNSNSNNSNSNNSSSNSTSNNSNSNNSNSNNSNNNNNNSNN 464

Query:   469 ENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
              N+N    N  +  N     NS N N  S + +++NS  N     +  S  + N  +  N
Sbjct:   465 SNSNNSNSNSNNSNNKNSNNNSNNNNSNSNSNNSNNSSNNSSSSNN--SNSNNNSNNSNN 522

Query:   529 TILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
             +  +N  +    S  +  K      NS N N ++ + SN+NS  N        S  S N 
Sbjct:   523 S--NNSSNNSSSSNNSNNKNNSNNNNSNNNNNSNSSSSNNNSNSNNNSN----SSSSNNN 576

Query:   589 PSKRNTILHN 598
              S  N   +N
Sbjct:   577 SSSNNNSNNN 586


>DICTYBASE|DDB_G0281825 [details] [associations]
            symbol:comB "Rab GTPase domain-containing protein"
            species:44689 "Dictyostelium discoideum" [GO:0005525 "GTP binding"
            evidence=ISS] [GO:0031154 "culmination involved in sorocarp
            development" evidence=IMP] [GO:0016021 "integral to membrane"
            evidence=ISS] [GO:0015031 "protein transport" evidence=IEA]
            [GO:0007264 "small GTPase mediated signal transduction"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR001806 InterPro:IPR003579 InterPro:IPR005225
            InterPro:IPR011990 Pfam:PF00071 PRINTS:PR00449 SMART:SM00175
            dictyBase:DDB_G0281825 TIGRFAMs:TIGR00231 GO:GO:0016021
            GO:GO:0007264 GO:GO:0000166 GenomeReviews:CM000152_GR GO:GO:0015031
            Gene3D:1.25.40.10 EMBL:AAFI02000043 GO:GO:0031154 eggNOG:COG1100
            InterPro:IPR025697 Pfam:PF13236 RefSeq:XP_640497.1
            ProteinModelPortal:Q54T92 EnsemblProtists:DDB0214836 GeneID:8623311
            KEGG:ddi:DDB_G0281825 InParanoid:Q54T92 OMA:NELASKF Uniprot:Q54T92
        Length = 2107

 Score = 149 (57.5 bits), Expect = 4.2e-06, P = 4.2e-06
 Identities = 57/264 (21%), Positives = 107/264 (40%)

Query:   332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
             PL    + ++ G + G G     LL S G+  EQ    S+ L    ++   S + N V+S
Sbjct:   312 PLNSSNHYIFSGSISG-GSNRDQLLSSNGL-REQD---SNSLSVNSNSGLASSV-NSVSS 365

Query:   392 TVE--NIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN 449
             T    N++    +S+    N ++  N+  + N+N    N  +  N    N   N     N
Sbjct:   366 TSSGSNLLTSSNSSVNNNSNNSNSINNNNV-NNNININNNNNTNNTNNNNIINNNNININ 424

Query:   450 PKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNE 509
                 +   +     N   N N+N    N  +  N     N+ N N  + N +N N+  N 
Sbjct:   425 ENSTSGINSNNSGNNINNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNN 484

Query:   510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
              + I+  +  + N  +  N   +N ++    S+++         N+ N N  + N +N+N
Sbjct:   485 TNSINNNNNNNNNNNNNNNNNNNNNNNN-NNSSISNNNNNNNNNNNNNNNNNNNNNNNNN 543

Query:   570 SYQNEIDGIDVWSVLSRNEPSKRN 593
             +  +  + I+  ++ + N  S  N
Sbjct:   544 NSSSSNNNINNNNINTDNNSSNNN 567

 Score = 140 (54.3 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 49/203 (24%), Positives = 79/203 (38%)

Query:   376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY--ENGTHEY 433
             LL+++N S   N  NS   N      N  +   N T+  N+  I N+N     EN T   
Sbjct:   372 LLTSSNSSVNNNSNNSNSINNNNVNNNININNNNNTNNTNNNNIINNNNININENSTSGI 431

Query:   434 NPKYE-NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYN--IPRLENS 490
             N     N   N  +  N    N   N  +  N   N N N    N ++  N     + N+
Sbjct:   432 NSNNSGNNINNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNNTNSINNN 491

Query:   491 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKL 550
              N N  + N +N+N+  N  +     S  + N  +  N   +N ++    +  +      
Sbjct:   492 NNNNNNNNNNNNNNNNNNNNNNNSSISNNNNNNNNNNNNNNNNNNNNNNNNNNSSSSNNN 551

Query:   551 VKENSINGNGTSENRSNDNSYQN 573
             +  N+IN +  S N +N+N   N
Sbjct:   552 INNNNINTDNNSSNNNNNNMNNN 574

 Score = 47 (21.6 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 14/45 (31%), Positives = 24/45 (53%)

Query:   559 NGTSENRSNDNSYQNEIDG--IDVWSVLSRNEPSKRN--TILHNI 599
             + T+ N SN+N+  N++        S LS N+P   N  T ++N+
Sbjct:   770 SSTNNNNSNNNNNNNQLQPPQTPTSSSLSVNQPFNLNSSTNINNL 814


>DICTYBASE|DDB_G0269328 [details] [associations]
            symbol:DDB_G0269328 species:44689 "Dictyostelium
            discoideum" [GO:0016021 "integral to membrane" evidence=IEA]
            InterPro:IPR004240 Pfam:PF02990 dictyBase:DDB_G0269328
            GO:GO:0016021 EMBL:AAFI02000005 RefSeq:XP_645881.2
            EnsemblProtists:DDB0190180 GeneID:8616821 KEGG:ddi:DDB_G0269328
            eggNOG:KOG1277 OMA:RVINECK Uniprot:Q55EA0
        Length = 1140

 Score = 146 (56.5 bits), Expect = 4.4e-06, P = 4.4e-06
 Identities = 49/194 (25%), Positives = 80/194 (41%)

Query:   400 YENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENG 459
             Y+    +Y N  + +N+  I NS     N  +  N    N   N  +  N    N   N 
Sbjct:   680 YKFKYNQYFNYYNTFNNNSINNSINN-NNNINNINSIINNNNNNNNNNNNNNNNNNNNNN 738

Query:   460 THEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
              +  N   N N N    N ++  N     NS N N  + N +N+N+  N  + ID  +++
Sbjct:   739 NNNNNNNNNNNNNNNNNNNSNS-NSSSNSNSNNNNNNNNNNNNNNNNNNNNNSIDNNNII 797

Query:   520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
             + N     N    NI++      +    +K V  N+IN    S + +N NS  N I+ I+
Sbjct:   798 NNNNNIISNINNSNINNNISSDGINTNGYK-VNNNNINN---SNDVNNINSATN-INNIN 852

Query:   580 VWSVLSRNEPSKRN 593
             + +  + N  S  N
Sbjct:   853 ISNGNNNNNNSINN 866

 Score = 133 (51.9 bits), Expect = 0.00026, Sum P(2) = 0.00026
 Identities = 49/194 (25%), Positives = 74/194 (38%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N + I N +N+   NI     NSI+   N  +  N+    N+N    N  +  N    N 
Sbjct:   695 NNNSINNSINNN-NNI--NNINSIINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 751

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
               N  +  +    N   N  +  N   N N N    N + + N     N IN N    + 
Sbjct:   752 NNNNNNSNSNSSSNSNSNNNNNNNNNNNNNNNNNNNNNSIDNN-----NIINNNNNIISN 806

Query:   501 SNDNSYQNEI--DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
              N+++  N I  DGI+       N     +  ++NI+    I+ +          NSIN 
Sbjct:   807 INNSNINNNISSDGINTNGYKVNNNNINNSNDVNNINSATNINNINISNGNNNNNNSINN 866

Query:   559 NG-TSENR-SNDNS 570
             N   S N  S +NS
Sbjct:   867 NNIVSGNIISGNNS 880

 Score = 47 (21.6 bits), Expect = 0.00026, Sum P(2) = 0.00026
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             NS N N  S N +N+N+Y N
Sbjct:  1043 NSNNNNNNS-NSNNNNNYNN 1061


>DICTYBASE|DDB_G0280599 [details] [associations]
            symbol:fhkB "forkhead-associated kinase protein B"
            species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
            "ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
            kinase activity" evidence=IEA] [GO:0004672 "protein kinase
            activity" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
            activity" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000253 InterPro:IPR000719
            InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR008984
            InterPro:IPR011009 InterPro:IPR017441 Pfam:PF00069 Pfam:PF00498
            PROSITE:PS00107 PROSITE:PS00108 PROSITE:PS50006 PROSITE:PS50011
            SMART:SM00220 SMART:SM00240 dictyBase:DDB_G0280599 GO:GO:0005524
            GenomeReviews:CM000152_GR eggNOG:COG0515 SUPFAM:SSF56112
            GO:GO:0004674 Gene3D:2.60.200.20 SUPFAM:SSF49879 EMBL:AAFI02000037
            InterPro:IPR020636 PANTHER:PTHR24347 HSSP:O43293
            RefSeq:XP_001134559.1 ProteinModelPortal:Q1ZXH2
            EnsemblProtists:DDB0233266 GeneID:8622630 KEGG:ddi:DDB_G0280599
            InParanoid:Q1ZXH2 Uniprot:Q1ZXH2
        Length = 1142

 Score = 146 (56.5 bits), Expect = 4.4e-06, P = 4.4e-06
 Identities = 35/135 (25%), Positives = 55/135 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE-N 439
             N ++  N   + + N    Y NS   + N  H +N     N N    N  H +N  +  N
Sbjct:   993 NNNNNNNTNTNNINNNNNNYNNSH-NHNNNNHNHN----HNLNNHNHNNNHHHNHNHNHN 1047

Query:   440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
                N  H +N  + + + +  H  N   N N N    N  +  N     N+ N N  + N
Sbjct:  1048 HNHNHNHNHNHNHNHNHNHNNHNNNNNNNNNNNNNNNNNNNNNNN---NNNNNNNNNNNN 1104

Query:   500 RSNDNSYQNEIDGID 514
              +N+N Y N I+ I+
Sbjct:  1105 NNNNNYYNNNINNIN 1119

 Score = 137 (53.3 bits), Expect = 4.1e-05, P = 4.1e-05
 Identities = 35/131 (26%), Positives = 53/131 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN- 439
             N ++  N  N+   N I    N+I    N  +  N+    N+N    N  +  N  Y N 
Sbjct:   958 NNNNNNNNNNNNNNNNINNNNNNI--NNNNINNNNNNNNNNNNNTNTNNINNNNNNYNNS 1015

Query:   440 -RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-NGTHEYNIPRLENSINGNGTS 497
               + N  H +N    N   N  H +N   N N N  +  N  H +N     N+ N N  +
Sbjct:  1016 HNHNNNNHNHNHNLNNHNHNNNHHHNHNHNHNHNHNHNHNHNHNHNHNHNHNNHNNNNNN 1075

Query:   498 ENRSNDNSYQN 508
              N +N+N+  N
Sbjct:  1076 NNNNNNNNNNN 1086


>DICTYBASE|DDB_G0275313 [details] [associations]
            symbol:dhx9 "ATP-dependent RNA helicase"
            species:44689 "Dictyostelium discoideum" [GO:0008026 "ATP-dependent
            helicase activity" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004386 "helicase activity" evidence=IEA]
            [GO:0003725 "double-stranded RNA binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0005737 "cytoplasm"
            evidence=ISS] [GO:0005634 "nucleus" evidence=ISS] [GO:0004004
            "ATP-dependent RNA helicase activity" evidence=ISS] [GO:0004003
            "ATP-dependent DNA helicase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR001159 InterPro:IPR001650
            InterPro:IPR002464 InterPro:IPR007502 InterPro:IPR011545
            Pfam:PF00035 Pfam:PF00270 Pfam:PF00271 Pfam:PF04408 PROSITE:PS00690
            PROSITE:PS50137 PROSITE:PS51194 SMART:SM00358 SMART:SM00490
            SMART:SM00847 dictyBase:DDB_G0275313 GO:GO:0005524 GO:GO:0005634
            GO:GO:0005737 GenomeReviews:CM000151_GR EMBL:AAFI02000013
            GO:GO:0003725 GO:GO:0004003 InterPro:IPR014001 SMART:SM00487
            PROSITE:PS51192 eggNOG:COG1643 InterPro:IPR011709 Pfam:PF07717
            GO:GO:0004004 RefSeq:XP_643861.1 ProteinModelPortal:Q869Z1
            EnsemblProtists:DDB0233740 GeneID:8619912 KEGG:ddi:DDB_G0275313
            InParanoid:Q869Z1 OMA:CEYLLEN Uniprot:Q869Z1
        Length = 1472

 Score = 147 (56.8 bits), Expect = 4.6e-06, P = 4.6e-06
 Identities = 49/222 (22%), Positives = 85/222 (38%)

Query:   387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
             +Y NS   N      NS     N  +  N+    N+N    N  + YN  + +   +  +
Sbjct:     6 SYNNSYNSN--NNNNNSYNSNNNNNNNNNNNNNNNNNNNSNNNNNNYNNNFSSGGRSNYN 63

Query:   447 EYNP--KYENRYENGTHEYNGPK---NENTNPRYENGTHEYNIPRLE---NSINGNGTSE 498
              YN    Y N + N  + Y G     N+N +   + G   YN        N+ N N  + 
Sbjct:    64 NYNNYNSYNNDFNNSNNNYRGNSVFGNKNNSYLNKGGNKVYNTSNSNINYNNNNNNNNNN 123

Query:   499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE--NSI 556
             N +N+NS  N  +     +  + N  +  N   +N   +   +        ++    NS+
Sbjct:   124 NNNNNNSNNNNNNNNQGQTSYNNNNNNSNNNNNNNNQGQTSYNNNNNINNNIINSGLNSL 183

Query:   557 NGNGTSENRSNDNSYQNEIDGIDVWS-VLSRNEPSKRNTILH 597
             N N  + N +N +  +N I+       +L + +P   N I H
Sbjct:   184 NNNNNNNNNNNYSGLENNINNYQQTPPILQQQQPLLSNPINH 225


>DICTYBASE|DDB_G0287625 [details] [associations]
            symbol:DDB_G0287625 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            dictyBase:DDB_G0287625 GO:GO:0005615 EMBL:AAFI02000103
            RefSeq:XP_637128.1 EnsemblProtists:DDB0187557 GeneID:8626223
            KEGG:ddi:DDB_G0287625 eggNOG:NOG285146 OMA:ESTERNE Uniprot:Q54K36
        Length = 981

 Score = 145 (56.1 bits), Expect = 4.7e-06, P = 4.7e-06
 Identities = 47/208 (22%), Positives = 88/208 (42%)

Query:   390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE---NGTH 446
             NST ENI    ++ +L+ +N   + N  + + +  + E+   E N    +R++   + T 
Sbjct:   566 NSTPENISTDLDSPLLK-KN--QQLNLIKEQTNKLKTEDSIDESNNNGNDRFKTKCSSTE 622

Query:   447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
               N   EN   N  +  N P N N N    N  +  N     N+ N N  + N +N+N+ 
Sbjct:   623 NENKNRENEKNNSENSKNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 682

Query:   507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTRGKWKLVKENSINGNGTSENR 565
              N  +  +  +  + N  +  N   +N ++     +       K + +N+ N +  S N 
Sbjct:   683 NNNNNNNNNNNNNNNNSNNNNNPNNYNNNNPNNNPNNNNNNNNKNINKNNSNNSNNSNNS 742

Query:   566 SNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
             SN  +  N  +  +  + L+ N P+  N
Sbjct:   743 SNSRNNSNNSNNNNNNNNLNNNNPNNNN 770

 Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 40/167 (23%), Positives = 65/167 (38%)

Query:   409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
             N  +  N+    N+N    N  +  N    N   N  +  N    +   N  + YN   N
Sbjct:   654 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNPNNYNN-NN 712

Query:   469 ENTNPRYENGTHEYNIPR--LENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 526
              N NP   N  +  NI +    NS N N +S +R+N N+  N  +  +    L+ N P+ 
Sbjct:   713 PNNNPNNNNNNNNKNINKNNSNNSNNSNNSSNSRNNSNNSNNNNNNNN----LNNNNPNN 768

Query:   527 RNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
              N   +N ++    +            N+ N N  + N +N+N   N
Sbjct:   769 NNPNNNNPNNNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNKNNN 815

 Score = 133 (51.9 bits), Expect = 9.1e-05, P = 9.1e-05
 Identities = 40/161 (24%), Positives = 61/161 (37%)

Query:   380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN 439
             +N ++ PN  N+   N  P   N+        ++ NS    NSN    N  +  N    N
Sbjct:   699 SNNNNNPNNYNNNNPNNNPNNNNN--NNNKNINKNNSNNSNNSNNS-SNSRNNSNNSNNN 755

Query:   440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
                N  +  NP   N   N  +  N P N N N    N  +  N     N+ N N    N
Sbjct:   756 NNNNNLNNNNPNNNNPNNNNPNN-NNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNKNN 814

Query:   500 RSNDNSYQNEIDGIDVWSVLSRNEPS--KRNTILHNIDDEW 538
              +N+NS+  E +     + +    P   K N  +  ++ EW
Sbjct:   815 NNNNNSFSEEEEEEGSLNQVRNISPKIGKYNEDISFLEKEW 855

 Score = 131 (51.2 bits), Expect = 0.00015, P = 0.00015
 Identities = 45/187 (24%), Positives = 67/187 (35%)

Query:   391 STVENIIPRYENSILRYENGTHE--YNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
             S+ EN     EN     EN  +    N+P   N+N    N  +  N    N   N  +  
Sbjct:   619 SSTENENKNRENEKNNSENSKNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 678

Query:   449 NPKYENRYENGTHEYNGPKNENTN--PRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
             N    N   N  +  N   N N N  P   N  +  N P   N+ N    ++N SN+++ 
Sbjct:   679 NNNNNNNNNNNNNNNNNNNNSNNNNNPNNYNNNNPNNNPNNNNNNNNKNINKNNSNNSNN 738

Query:   507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
              N        S  S N  +  N  L+N +                  N+ N N  + N +
Sbjct:   739 SNNSSNSRNNSNNSNNNNNNNN--LNNNNPNNNNPNNNNPNNNNPNNNNPNNNNNNNNNN 796

Query:   567 NDNSYQN 573
             N+N+  N
Sbjct:   797 NNNNNNN 803


>DICTYBASE|DDB_G0283357 [details] [associations]
            symbol:DDB_G0283357 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0283357 EMBL:AAFI02000054 RefSeq:XP_639110.1
            ProteinModelPortal:Q54R73 EnsemblProtists:DDB0302625 GeneID:8624043
            KEGG:ddi:DDB_G0283357 OMA:NSANEND Uniprot:Q54R73
        Length = 1247

 Score = 146 (56.5 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 59/208 (28%), Positives = 78/208 (37%)

Query:   374 PTLLSAANKSDIPNYV---NSTVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYE 427
             PT  S    SD  N V   N+   N  P      NS     N  +  N+    NS+    
Sbjct:   152 PTFKSLDLSSDTVNSVGAANNGSSNSSPTINGISNSNTMNNNNNNNNNNNNNSNSSNNNN 211

Query:   428 NGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN-GPKNENTNPRYENGTHEYNIPR 486
             NG +  N  Y N + N T   N    N Y N T+  N G  N N N    N     N   
Sbjct:   212 NGNNNNNNNY-NSFVNITKNNNNTNSNNYNNSTNSNNNGYNNNNNNNSISNSNSNSNSNS 270

Query:   487 LENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRG 546
               NS N N  S + SN NS  N     +  S  S N  +  N   +N ++    S+ + G
Sbjct:   271 NSNS-NSNSNSNSNSNSNSNSNSNSSSN--SSSSSNNNNNNNNNNNNNNNSSSSSSNSNG 327

Query:   547 KWKLVKENSINGNGTSENRSNDN-SYQN 573
                    N+ +  G S ++ N   SY N
Sbjct:   328 N----NNNNYHSYGYSNSKYNQQKSYNN 351

 Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 52/204 (25%), Positives = 89/204 (43%)

Query:   398 PRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE--YNPKYEN- 454
             P + N I   +N T++  S +  N++  Y N     N  + N Y NG++   YN    N 
Sbjct:     4 PIFSNDIDSIKNNTYQAKSYQKYNNSNNYNNNN---NNSFNN-YSNGSNYGGYNNSGNNS 59

Query:   455 RYENGTHEYNGPK--NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDG 512
              Y N  + YN     N N N    N  +  N     N+IN N ++ N +N+N+  N  + 
Sbjct:    60 NYNNNNNLYNNNNINNNNNNNNNNNINNNNNNINNNNNINNNNSNNNNNNNNNNSNSNNS 119

Query:   513 IDVWSVLSRNEPSKRNTILHN---IDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
             I+  S    N P++     H+   I+    +   T     L   +++N  G + N S+++
Sbjct:   120 INSNSY-KVNTPTQNGKSSHSPPLINANANVVFPTFKSLDL-SSDTVNSVGAANNGSSNS 177

Query:   570 SYQNEIDGIDVWSVLSRNEPSKRN 593
             S    I+GI   + ++ N  +  N
Sbjct:   178 S--PTINGISNSNTMNNNNNNNNN 199

 Score = 125 (49.1 bits), Expect = 0.00088, P = 0.00088
 Identities = 51/202 (25%), Positives = 74/202 (36%)

Query:   402 NSILRYENGTHEYNSPRIE---NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYEN 458
             NS+    NG+   +SP I    NSNT   N  +  N    +   N  +  N    N Y +
Sbjct:   165 NSVGAANNGSSN-SSPTINGISNSNTMNNNNNNNNNNNNNSNSSNNNNNGNNNNNNNYNS 223

Query:   459 GTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSV 518
               +      N N+N  Y N T+  N     N+ N N  S + SN NS  N     +  S 
Sbjct:   224 FVNITKNNNNTNSN-NYNNSTNSNN-NGYNNNNNNNSISNSNSNSNSNSNSNSNSNSNSN 281

Query:   519 LSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
              + N  S  N+   N       S+           N+ N N +S + SN N   N     
Sbjct:   282 SNSNSNSNSNS---NSSSNSSSSSNNNNN---NNNNNNNNNNSSSSSSNSNGNNNNNYHS 335

Query:   579 DVWSVLSRNEPSKRNTILHNID 600
               +S    N+    N   H ++
Sbjct:   336 YGYSNSKYNQQKSYNNAPHQLN 357


>DICTYBASE|DDB_G0277589 [details] [associations]
            symbol:gtaC "GATA zinc finger domain-containing
            protein 3" species:44689 "Dictyostelium discoideum" [GO:0005634
            "nucleus" evidence=IDA] [GO:0031149 "sorocarp stalk cell
            differentiation" evidence=IMP] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0046872 "metal ion
            binding" evidence=IEA] InterPro:IPR000679 InterPro:IPR013088
            Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401
            dictyBase:DDB_G0277589 GO:GO:0005737 GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 Gene3D:3.30.50.10 GenomeReviews:CM000151_GR
            GO:GO:0003700 EMBL:AAFI02000020 eggNOG:COG5641 GO:GO:0031149
            RefSeq:XP_642533.1 HSSP:P17678 ProteinModelPortal:Q75JZ1
            EnsemblProtists:DDB0220470 GeneID:8621095 KEGG:ddi:DDB_G0277589
            OMA:SNIRVEE Uniprot:Q75JZ1
        Length = 587

 Score = 142 (55.0 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 53/235 (22%), Positives = 101/235 (42%)

Query:   372 WLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
             ++P+ + +   S + N VN ++ N+     N+   Y N  + YN+  I N+N    N  +
Sbjct:     5 YIPSPIYSDQNSGVHN-VNKSLHNLNINNGNNNYNYSN--NNYNN-NINNNNN-INNNIN 59

Query:   432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
               N    N   N  ++Y+  + ++Y +     +   N N N    N  +  NI    N+I
Sbjct:    60 NNNNNNNNNNNNNINQYHQNHYDQYSDNNCNNSNSNNINNNNNINNNINNNNINNNNNNI 119

Query:   492 NGNGTSENRSNDNSYQN--EIDGIDVW--SVLSRNEPSKRNTILHNIDDEWQISALTRGK 547
             N N  + N +N+N+  N  +I  +++    V   N  S  N + + I  +  +S +    
Sbjct:   120 NSNNNNNNNNNNNNNNNLLKIPQLNISPNGVGGGNGISNGNGV-NKIFSKLDLSKVPNS- 177

Query:   548 WKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE--PSKRNTILHNID 600
             ++L   +S+  + TS N S        ++   + S+L      P+   +  HN D
Sbjct:   178 YQLAHNSSMPNSPTSSNISPSTPTSMALNLSSLKSILDSPPAAPAHSASSSHNND 232


>DICTYBASE|DDB_G0283697 [details] [associations]
            symbol:DDB_G0283697 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0283697 EMBL:AAFI02000056 RefSeq:XP_638971.1
            HSSP:Q9BYW2 ProteinModelPortal:Q54QQ2 EnsemblProtists:DDB0237901
            GeneID:8624217 KEGG:ddi:DDB_G0283697 OMA:PPKENFF Uniprot:Q54QQ2
        Length = 853

 Score = 144 (55.7 bits), Expect = 5.0e-06, P = 5.0e-06
 Identities = 40/183 (21%), Positives = 74/183 (40%)

Query:   414 YNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYE-NRYENGTHEYNGPKNENTN 472
             Y    ++ +N    N ++  +    N + N  + YN     N   + T  YN   N N+N
Sbjct:   502 YRDDSLQQNNDNNNNSSNNNSNNSNNNFNNDNNPYNNSNNYNMNNSNTSPYNNSNNSNSN 561

Query:   473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILH 532
               Y N     N     N+ N N  + N +N+N+  N  +  + ++  + N    +    +
Sbjct:   562 SSYYNDNDYNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNFNNSNSNSSESKPNYFN 621

Query:   533 NIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVL--SRNEPS 590
             N+ + +  + +T+   +  K N  N N  + N  N N  +  ID +     L  S++ P 
Sbjct:   622 NLSNVF--NQITK-PLENYKNNGKNENNNNNNNKNKNEDEKRIDLVQTKLSLKSSKSTPQ 678

Query:   591 KRN 593
               N
Sbjct:   679 TYN 681

 Score = 130 (50.8 bits), Expect = 0.00016, P = 0.00016
 Identities = 35/172 (20%), Positives = 74/172 (43%)

Query:   398 PRYENSILR-YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRY 456
             P ++  + + Y + + + N+   +N+N    N ++  N  + N  +N  +  +  Y N  
Sbjct:   492 PNFQIPLSKPYRDDSLQQNN---DNNNNSSNNNSNNSNNNFNN--DNNPYNNSNNY-NMN 545

Query:   457 ENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVW 516
              + T  YN   N N+N  Y N     N     N+ N N  + N +N+N+  N  +  + +
Sbjct:   546 NSNTSPYNNSNNSNSNSSYYNDNDYNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNF 605

Query:   517 SVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
             +  + N    +    +N+ + +         +K   +N  N N  ++N++ D
Sbjct:   606 NNSNSNSSESKPNYFNNLSNVFNQITKPLENYKNNGKNENNNNNNNKNKNED 657


>DICTYBASE|DDB_G0292046 [details] [associations]
            symbol:DDB_G0292046 "Ubiquitin carboxyl-terminal
            hydrolase 34" species:44689 "Dictyostelium discoideum" [GO:0006511
            "ubiquitin-dependent protein catabolic process" evidence=IEA]
            [GO:0004221 "ubiquitin thiolesterase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000626 InterPro:IPR001394
            InterPro:IPR018200 Pfam:PF00443 PROSITE:PS00972 PROSITE:PS00973
            PROSITE:PS50235 SMART:SM00213 dictyBase:DDB_G0292046
            EMBL:AAFI02000187 GO:GO:0008234 GO:GO:0006511 GO:GO:0004221
            eggNOG:COG5077 RefSeq:XP_629788.1 ProteinModelPortal:Q54DT4
            EnsemblProtists:DDB0184183 GeneID:8628465 KEGG:ddi:DDB_G0292046
            InParanoid:Q54DT4 OMA:ISKECTH Uniprot:Q54DT4
        Length = 3240

 Score = 150 (57.9 bits), Expect = 5.3e-06, P = 5.3e-06
 Identities = 48/194 (24%), Positives = 77/194 (39%)

Query:   406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNG 465
             + +N  +  N+    N+N    N     N   EN   N  + YN  Y N Y N     N 
Sbjct:   427 KQDNNNNNNNNNNNNNNNNNNNNNNVNCNFNSENS-NNNNNNYNNNYNNNYNNSNSSSNN 485

Query:   466 PKNENTNPR-YENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
               N N N     NG +  N     N+ N N  + N +N+N+  N  +        + N  
Sbjct:   486 NNNSNDNGNGNSNGINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN-------NNNNN 538

Query:   525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 584
             +  N   +N ++    ++++ G       NS N N ++ + SN NS  N  +  +  +V 
Sbjct:   539 NNNNNNNNNNNNNNNGNSISNGNNN--SNNSNNSNNSNNSNSNSNSNNNNSNNNNNSNVN 596

Query:   585 SRNEPSKRNTILHN 598
             S N     + IL N
Sbjct:   597 SPNPQILYDWILKN 610

 Score = 140 (54.3 bits), Expect = 0.00023, Sum P(2) = 0.00023
 Identities = 51/194 (26%), Positives = 72/194 (37%)

Query:   382 KSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
             K D  N  N+   N      N+     N    +NS   ENSN    N  + YN  Y N Y
Sbjct:   427 KQDNNNNNNNNNNN---NNNNNNNNNNNVNCNFNS---ENSN----NNNNNYNNNYNNNY 476

Query:   442 ENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS 501
              N     N    N  +NG    NG  N N N    N  +  N     N+ N N  + N +
Sbjct:   477 NNSNSSSNNN-NNSNDNGNGNSNGINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 535

Query:   502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGT 561
             N+N+  N  +        + N  +  N+I +  ++    +            NS N N  
Sbjct:   536 NNNNNNNNNNN-------NNNNNNNGNSISNGNNNSNNSNNSNNSNNSNSNSNSNNNNSN 588

Query:   562 SENRSNDNSYQNEI 575
             + N SN NS   +I
Sbjct:   589 NNNNSNVNSPNPQI 602

 Score = 50 (22.7 bits), Expect = 0.00023, Sum P(2) = 0.00023
 Identities = 14/43 (32%), Positives = 23/43 (53%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE--PSKRNT 594
             N+ N N  + N +N+N+  N  +G +  S+ S +E  P   NT
Sbjct:  1592 NNNNNNNNNNNNNNNNNNNNNSNG-NSNSLTSSSERMPGTPNT 1633


>DICTYBASE|DDB_G0288611 [details] [associations]
            symbol:DDB_G0288611 species:44689 "Dictyostelium
            discoideum" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            dictyBase:DDB_G0288611 GO:GO:0000166 Gene3D:3.30.70.330
            GO:GO:0003676 EMBL:AAFI02000118 RefSeq:XP_636639.1
            EnsemblProtists:DDB0220605 GeneID:8626713 KEGG:ddi:DDB_G0288611
            eggNOG:NOG283861 InParanoid:Q54IP7 Uniprot:Q54IP7
        Length = 524

 Score = 141 (54.7 bits), Expect = 5.3e-06, P = 5.3e-06
 Identities = 38/135 (28%), Positives = 59/135 (43%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN---PKY 437
             N+++  NY NS+ +N    Y N+   Y N  +  N      + +  +   +E N   P  
Sbjct:   280 NRNNRDNYNNSSRDNYNNNYNNNYNNYNNNNNNNNDDSYRGAVSFNDENNNEENSIVPNN 339

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
             EN+  +    Y   YE+ Y +G+   N   N N N  Y N   EYN  + +NS N    +
Sbjct:   340 ENKSNDFNKGYGAFYESDYYDGSQFNNNNNNRNINNDYNN---EYN--KHKNSYNSENNN 394

Query:   498 ENRSNDNSYQNEIDG 512
              N  N+N+  N   G
Sbjct:   395 NNNYNNNNNNNNNGG 409

 Score = 121 (47.7 bits), Expect = 0.00079, P = 0.00079
 Identities = 48/178 (26%), Positives = 73/178 (41%)

Query:   408 ENGTHEYNSPRIENSNTRYE--NGTHEY-NPKYENRYENGTHEYNPKYENRYENGTHEYN 464
             EN + E N P+    +  Y+  +G  E  N    N  +N  +     Y N Y N  + YN
Sbjct:   249 EN-SFENNKPKHSQFSKEYQFLDGLIENDNRNNRNNRDNYNNSSRDNYNNNYNNNYNNYN 307

Query:   465 GPKNENTNPRYENGTHEYNIPRL--ENSINGNGTSENRSND-NSYQNEIDGIDVWSVLSR 521
                N N +  Y  G   +N      ENSI  N  +EN+SND N         D +     
Sbjct:   308 NNNNNNNDDSYR-GAVSFNDENNNEENSIVPN--NENKSNDFNKGYGAFYESDYYDGSQF 364

Query:   522 NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
             N  +    I ++ ++E+      + K     EN+ N N  + N +N+N    E +G D
Sbjct:   365 NNNNNNRNINNDYNNEYN-----KHKNSYNSENNNNNNYNNNNNNNNNGGYGE-EGYD 416


>DICTYBASE|DDB_G0279041 [details] [associations]
            symbol:DDB_G0279041 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0279041 EMBL:AAFI02000026 RefSeq:XP_641933.1
            EnsemblProtists:DDB0266471 GeneID:8621845 KEGG:ddi:DDB_G0279041
            OMA:NNGMMNQ Uniprot:Q54XC8
        Length = 637

 Score = 142 (55.0 bits), Expect = 5.5e-06, P = 5.5e-06
 Identities = 57/224 (25%), Positives = 97/224 (43%)

Query:   383 SDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
             +D P++++   +++IP+Y N     + N    YN+    N+N    N  + YN    N +
Sbjct:    11 NDSPSFLS---DDLIPQYNNQFQSLQQNPQLNYNNNN-NNNNNNNNNNNNNYNNNNNNNF 66

Query:   442 ENGTHEYNPK---YENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
             +N     N     ++N   N  +      N N N  + N  +  N     N+IN   T  
Sbjct:    67 KNDNLFQNSNLFVFQNDLNNNINININNNNFNNNNNFNNNINFNNFNN--NNINNGFTYS 124

Query:   499 NRSNDNSYQNEIDGIDV----WSVLSRNEPS----KRNTILHNIDDEWQISALTRGKWKL 550
             N  N+N   N  +G DV     SV+S    S      N  ++N+++    +  T     L
Sbjct:   125 NNQNNNFKPNN-NGCDVEYSDHSVISTPTSSIYNENENNNINNLNNNINNTDNTCNI--L 181

Query:   551 VKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRN-EPSKRN 593
                N+ N N  + N +N+++ QNE+  I+  S +S N E   +N
Sbjct:   182 NNNNNSNNNDMNNNNNNNSNNQNEVTNIN--SNISPNYENQNQN 223

 Score = 134 (52.2 bits), Expect = 4.1e-05, P = 4.1e-05
 Identities = 55/203 (27%), Positives = 82/203 (40%)

Query:   390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN 449
             N    N I    N+I   +N  +  N+    NSN    N  +  N   +N   N     +
Sbjct:   157 NENENNNINNLNNNINNTDNTCNILNNNN--NSNNNDMNNNNNNNSNNQNEVTNINSNIS 214

Query:   450 PKYENRYENGTHEYNGPKNENTNPR---YENGTHEY----NI------PRL---ENSING 493
             P YEN+ +N     N   N N  P     EN T++     NI      P+L   EN IN 
Sbjct:   215 PNYENQNQNQNENENNSNNNNNKPNDNLVENNTNQITNPNNIDQQQEQPQLNQVENKINN 274

Query:   494 NGTSENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
             N  + N +N+N+   E     V+ V  + NE S    IL    D    ++  +G   ++ 
Sbjct:   275 NSNNNNINNNNNNSGEFCPDYVYFVNKQLNEFSNCLPILEK--DMPDFASTIKG---IIS 329

Query:   553 ENSINGNGTSENRSNDNSYQNEI 575
              N +  +  +EN+S  NS    I
Sbjct:   330 PNIVGSSIKNENKSTPNSTSTSI 352


>UNIPROTKB|H0YB91 [details] [associations]
            symbol:IDS "Iduronate 2-sulfatase 14 kDa chain"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
            EMBL:AC233288 HGNC:HGNC:5389 ChiTaRS:IDS Ensembl:ENST00000464251
            Bgee:H0YB91 Uniprot:H0YB91
        Length = 106

 Score = 117 (46.2 bits), Expect = 5.9e-06, P = 5.9e-06
 Identities = 29/79 (36%), Positives = 41/79 (51%)

Query:    85 TPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSE 142
             +PNID LA   ++ +N +  Q +C PSR + +TG+ P  T +   N  +    G      
Sbjct:     2 SPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNF---- 57

Query:   143 KILPQYLKELGYRTRIVGK 161
               +PQY KE GY T  VGK
Sbjct:    58 STIPQYFKENGYVTMSVGK 76


>FB|FBgn0040271 [details] [associations]
            symbol:Sulf1 "Sulfated" species:7227 "Drosophila
            melanogaster" [GO:0008449 "N-acetylglucosamine-6-sulfatase
            activity" evidence=ISS] [GO:0007389 "pattern specification process"
            evidence=IMP] [GO:0018741 "alkyl sulfatase activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0005783
            "endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
            evidence=ISS] [GO:0005795 "Golgi stack" evidence=ISS] [GO:0017015
            "regulation of transforming growth factor beta receptor signaling
            pathway" evidence=IMP] [GO:0030111 "regulation of Wnt receptor
            signaling pathway" evidence=IMP] [GO:0045880 "positive regulation
            of smoothened signaling pathway" evidence=IMP] [GO:0045879
            "negative regulation of smoothened signaling pathway" evidence=IMP]
            [GO:0042059 "negative regulation of epidermal growth factor
            receptor signaling pathway" evidence=IGI] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783
            EMBL:AE014297 GO:GO:0009986 GO:GO:0030111 GO:GO:0046872
            GO:GO:0005795 GO:GO:0042059 Gene3D:3.40.720.10 SUPFAM:SSF53649
            eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0017015
            GO:GO:0045879 GO:GO:0045880 KO:K14607 InterPro:IPR024609
            Pfam:PF12548 EMBL:AY119658 EMBL:AF211192 RefSeq:NP_524987.1
            UniGene:Dm.13781 ProteinModelPortal:Q9VEX0 SMR:Q9VEX0
            DIP:DIP-21001N MINT:MINT-1598983 STRING:Q9VEX0 PaxDb:Q9VEX0
            PRIDE:Q9VEX0 EnsemblMetazoa:FBtr0083273 GeneID:53437
            KEGG:dme:Dmel_CG6725 UCSC:CG6725-RA CTD:23213 FlyBase:FBgn0040271
            InParanoid:Q9VEX0 OMA:QWILQVT OrthoDB:EOG4GB5N2 PhylomeDB:Q9VEX0
            GenomeRNAi:53437 NextBio:841154 Bgee:Q9VEX0 GermOnline:CG6725
            Uniprot:Q9VEX0
        Length = 1114

 Score = 122 (48.0 bits), Expect = 6.3e-06, Sum P(2) = 6.3e-06
 Identities = 54/222 (24%), Positives = 92/222 (41%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+II IL DD    DV    L+ +P   +  L   G   ++ YT   +C P+RS+++TG 
Sbjct:    54 PNIILILTDD---QDVELGSLNFMPR-TLRLLRDGGAEFRHAYTTTPMCCPARSSLLTGM 109

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
             + +H  M       C       + +      YL   GYRT   GK+ L  Y   Y P   
Sbjct:   110 Y-VHNHMVFTNNDNCSSPQWQATHETRSYATYLSNAGYRTGYFGKY-LNKYNGSYIPP-- 165

Query:   177 GFESHLGYWTG---HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
             G+      W G   +  Y+++S   + + G  ++   + A D    Y  D+   +++  +
Sbjct:   166 GWRE----WGGLIMNSKYYNYS---INLNGQKIKHGFDYAKD----YYPDLIANDSIAFL 214

Query:   234 HNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
              +       +P+ L ++  A H      P Q    + N+  H
Sbjct:   215 RSSKQQNQRKPVLLTMSFPAPHGPEDSAP-QYSHLFFNVTTH 255

 Score = 74 (31.1 bits), Expect = 6.3e-06, Sum P(2) = 6.3e-06
 Identities = 34/149 (22%), Positives = 62/149 (41%)

Query:   252 HSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
             H+ NP +   L+  +    +H+   +   +K    L  +D +V +V   L++   L N+ 
Sbjct:   262 HAPNPDKQWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTY 321

Query:   310 IVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
             IV+ SD              ++P        +E  VR   LI  P +++  +V E  ++V
Sbjct:   322 IVYTSDHGYHLGQFGLIKGKSFP--------FEFDVRVPFLIRGPGIQASKVVNEIVLNV 373

Query:   370 SDWLPTLLSAANKSDIPNYVNSTVENIIP 398
              D  PT L       +P   +    +I+P
Sbjct:   374 -DLAPTFLDMGG---VPTPQHMDGRSILP 398

 Score = 54 (24.1 bits), Expect = 0.00069, Sum P(2) = 0.00069
 Identities = 14/51 (27%), Positives = 26/51 (50%)

Query:   381 NKSDIPNYVNSTVENIIPRYENS--ILRYENGTHEYNSPRIENSNTRYENG 429
             +K D+P   N T+  +I + +++  IL  +   HE ++    +S   YE G
Sbjct:   674 SKRDLPASSNETIAQVIQQIQSTLEILELKFNEHELHASN--SSGNSYERG 722


>DICTYBASE|DDB_G0273645 [details] [associations]
            symbol:hbx5-2 "putative homeobox transcription
            factor" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003700
            "sequence-specific DNA binding transcription factor activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007275
            "multicellular organismal development" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] InterPro:IPR001356 InterPro:IPR002048
            InterPro:IPR009057 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
            PROSITE:PS50222 SMART:SM00389 dictyBase:DDB_G0273645
            dictyBase:DDB_G0273127 GO:GO:0007275 GO:GO:0005634 GO:GO:0043565
            GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
            EMBL:AAFI02000011 EMBL:AAFI02000009 Gene3D:1.10.10.60
            SUPFAM:SSF46689 RefSeq:XP_644439.1 RefSeq:XP_644811.1
            ProteinModelPortal:Q557C9 EnsemblProtists:DDB0220481
            EnsemblProtists:DDB0266662 GeneID:8618913 GeneID:8619064
            KEGG:ddi:DDB_G0273127 KEGG:ddi:DDB_G0273645 OMA:THHINIF
            ProtClustDB:CLSZ2431129 Uniprot:Q557C9
        Length = 1723

 Score = 153 (58.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 48/204 (23%), Positives = 82/204 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENG---THEYNPK 436
             N ++   ++N+   N      +S  + ++N +  +N+    NSN    N    +++YN  
Sbjct:    85 NNNNNNQHMNNQYSNSFHNNNSSGFMAFQNNSSNFNNQNNNNSNNNNNNNNINSYDYNNS 144

Query:   437 YENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN 494
               N Y N   TH  N    N   N  + +N   N N N    N  +  N     N+ N N
Sbjct:   145 NNNNYNNNNNTHSNNSNNNNNNNNSNY-WNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSN 203

Query:   495 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK--RNTILHNIDD---EWQISALTRGKWK 549
               + N +N+N + +            +++P+    N I HN +D     Q +    G   
Sbjct:   204 NNNNNNNNNNHHHHHHQ--------QQSQPTSPYNNPIQHNPNDMKFNGQHNPFN-GNQM 254

Query:   550 LVKENSINGNGTSENRSNDNSYQN 573
             ++  N+ N N  + N  N NS  N
Sbjct:   255 VMDNNNNNNNNNNSNVFNSNSNSN 278

 Score = 134 (52.2 bits), Expect = 0.00065, Sum P(2) = 0.00065
 Identities = 47/188 (25%), Positives = 76/188 (40%)

Query:   387 NYVNSTVENIIPRYENSILRY-ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
             ++V+  ++N  P+       Y +NG   YN     NSN    N  H  N +Y N + N  
Sbjct:    52 SFVSPNLDNNNPQIHVQSNNYNQNGFVGYN-----NSNNNNNNNQH-MNNQYSNSFHNNN 105

Query:   446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
                   ++N   N  ++ N   N N N    N +++YN     N  N N T  N SN+N+
Sbjct:   106 SSGFMAFQNNSSNFNNQNNNNSNNNNNNNNIN-SYDYNNSNNNNYNNNNNTHSNNSNNNN 164

Query:   506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
               N  +    W+  + N  +  N   +N ++                 N+ N N  S N 
Sbjct:   165 NNNNSN---YWNNNNNNNNNNNNNNNNNNNNN----------------NNNNNNNNSNNN 205

Query:   566 SNDNSYQN 573
             +N+N+  N
Sbjct:   206 NNNNNNNN 213

 Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDG 577
             N+ N N  + N +N+N   N + G
Sbjct:   552 NNNNNNNNNNNNNNNNITNNPLSG 575

 Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
             N+ N N  + N +N+N+  N  + I   ++ + NE
Sbjct:  1689 NNNNNNNNNNNNNNNNNNNNNNNNIINNNITTINE 1723

 Score = 44 (20.5 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
 Identities = 21/109 (19%), Positives = 39/109 (35%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTXXXX 613
             N+ N N  + N +N+N+  N        +  S N      T   N+   +Q ++ +    
Sbjct:   898 NNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTVQSGTTSNSNL--VFQQTSNSNTLS 955

Query:   614 XXXXXXXXMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
                      + Q  + G         LSD ++  L +     +A+  CG
Sbjct:   956 PSQQQQQQTQQQQSINGSST----GSLSDAQYQDLGIHLDTSSANSGCG 1000

 Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N   +N +N+N+  N
Sbjct:  1347 NNQNNNNNDQNNNNNNNNNN 1366

 Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 9/42 (21%), Positives = 20/42 (47%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 595
             N+ N N  + N +N+N+  N  +  +  +  +    +  NT+
Sbjct:   892 NNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTV 933

 Score = 42 (19.8 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N  + N +N+N+  N
Sbjct:   551 NNNNNNNNNNNNNNNNNITN 570

 Score = 39 (18.8 bits), Expect = 3.5e-05, Sum P(2) = 3.5e-05
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDG 577
             +SIN N  + N  N N+  N  +G
Sbjct:  1119 SSINSNINNVNNCNINNNSNSNNG 1142

 Score = 37 (18.1 bits), Expect = 5.6e-05, Sum P(2) = 5.6e-05
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N  S   SN N+  N
Sbjct:  1360 NNNNNNNNSTTNSNVNNNNN 1379


>DICTYBASE|DDB_G0273127 [details] [associations]
            symbol:hbx5-1 "putative homeobox transcription
            factor" species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003700
            "sequence-specific DNA binding transcription factor activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007275
            "multicellular organismal development" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] InterPro:IPR001356 InterPro:IPR002048
            InterPro:IPR009057 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
            PROSITE:PS50222 SMART:SM00389 dictyBase:DDB_G0273645
            dictyBase:DDB_G0273127 GO:GO:0007275 GO:GO:0005634 GO:GO:0043565
            GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
            EMBL:AAFI02000011 EMBL:AAFI02000009 Gene3D:1.10.10.60
            SUPFAM:SSF46689 RefSeq:XP_644439.1 RefSeq:XP_644811.1
            ProteinModelPortal:Q557C9 EnsemblProtists:DDB0220481
            EnsemblProtists:DDB0266662 GeneID:8618913 GeneID:8619064
            KEGG:ddi:DDB_G0273127 KEGG:ddi:DDB_G0273645 OMA:THHINIF
            ProtClustDB:CLSZ2431129 Uniprot:Q557C9
        Length = 1723

 Score = 153 (58.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 48/204 (23%), Positives = 82/204 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENG---THEYNPK 436
             N ++   ++N+   N      +S  + ++N +  +N+    NSN    N    +++YN  
Sbjct:    85 NNNNNNQHMNNQYSNSFHNNNSSGFMAFQNNSSNFNNQNNNNSNNNNNNNNINSYDYNNS 144

Query:   437 YENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN 494
               N Y N   TH  N    N   N  + +N   N N N    N  +  N     N+ N N
Sbjct:   145 NNNNYNNNNNTHSNNSNNNNNNNNSNY-WNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSN 203

Query:   495 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK--RNTILHNIDD---EWQISALTRGKWK 549
               + N +N+N + +            +++P+    N I HN +D     Q +    G   
Sbjct:   204 NNNNNNNNNNHHHHHHQ--------QQSQPTSPYNNPIQHNPNDMKFNGQHNPFN-GNQM 254

Query:   550 LVKENSINGNGTSENRSNDNSYQN 573
             ++  N+ N N  + N  N NS  N
Sbjct:   255 VMDNNNNNNNNNNSNVFNSNSNSN 278

 Score = 134 (52.2 bits), Expect = 0.00065, Sum P(2) = 0.00065
 Identities = 47/188 (25%), Positives = 76/188 (40%)

Query:   387 NYVNSTVENIIPRYENSILRY-ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
             ++V+  ++N  P+       Y +NG   YN     NSN    N  H  N +Y N + N  
Sbjct:    52 SFVSPNLDNNNPQIHVQSNNYNQNGFVGYN-----NSNNNNNNNQH-MNNQYSNSFHNNN 105

Query:   446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
                   ++N   N  ++ N   N N N    N +++YN     N  N N T  N SN+N+
Sbjct:   106 SSGFMAFQNNSSNFNNQNNNNSNNNNNNNNIN-SYDYNNSNNNNYNNNNNTHSNNSNNNN 164

Query:   506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
               N  +    W+  + N  +  N   +N ++                 N+ N N  S N 
Sbjct:   165 NNNNSN---YWNNNNNNNNNNNNNNNNNNNNN----------------NNNNNNNNSNNN 205

Query:   566 SNDNSYQN 573
             +N+N+  N
Sbjct:   206 NNNNNNNN 213

 Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDG 577
             N+ N N  + N +N+N   N + G
Sbjct:   552 NNNNNNNNNNNNNNNNITNNPLSG 575

 Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
             N+ N N  + N +N+N+  N  + I   ++ + NE
Sbjct:  1689 NNNNNNNNNNNNNNNNNNNNNNNNIINNNITTINE 1723

 Score = 44 (20.5 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
 Identities = 21/109 (19%), Positives = 39/109 (35%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTXXXX 613
             N+ N N  + N +N+N+  N        +  S N      T   N+   +Q ++ +    
Sbjct:   898 NNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTVQSGTTSNSNL--VFQQTSNSNTLS 955

Query:   614 XXXXXXXXMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
                      + Q  + G         LSD ++  L +     +A+  CG
Sbjct:   956 PSQQQQQQTQQQQSINGSST----GSLSDAQYQDLGIHLDTSSANSGCG 1000

 Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N   +N +N+N+  N
Sbjct:  1347 NNQNNNNNDQNNNNNNNNNN 1366

 Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 9/42 (21%), Positives = 20/42 (47%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 595
             N+ N N  + N +N+N+  N  +  +  +  +    +  NT+
Sbjct:   892 NNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTV 933

 Score = 42 (19.8 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N  + N +N+N+  N
Sbjct:   551 NNNNNNNNNNNNNNNNNITN 570

 Score = 39 (18.8 bits), Expect = 3.5e-05, Sum P(2) = 3.5e-05
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDG 577
             +SIN N  + N  N N+  N  +G
Sbjct:  1119 SSINSNINNVNNCNINNNSNSNNG 1142

 Score = 37 (18.1 bits), Expect = 5.6e-05, Sum P(2) = 5.6e-05
 Identities = 8/20 (40%), Positives = 10/20 (50%)

Query:   554 NSINGNGTSENRSNDNSYQN 573
             N+ N N  S   SN N+  N
Sbjct:  1360 NNNNNNNNSTTNSNVNNNNN 1379


>DICTYBASE|DDB_G0291424 [details] [associations]
            symbol:DDB_G0291424 "Transcription factor SKN7"
            species:44689 "Dictyostelium discoideum" [GO:0035556 "intracellular
            signal transduction" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0000160
            "phosphorelay signal transduction system" evidence=IEA] [GO:0000156
            "phosphorelay response regulator activity" evidence=IEA]
            InterPro:IPR001789 Pfam:PF00072 PROSITE:PS50110 SMART:SM00448
            dictyBase:DDB_G0291424 EMBL:AAFI02000177 GO:GO:0006355
            GO:GO:0035556 GO:GO:0005622 GO:GO:0000156 InterPro:IPR011006
            SUPFAM:SSF52172 eggNOG:COG0784 RefSeq:XP_635201.1
            ProteinModelPortal:Q54EN9 EnsemblProtists:DDB0183884 GeneID:8628146
            KEGG:ddi:DDB_G0291424 InParanoid:Q54EN9 OMA:MCANITD
            ProtClustDB:CLSZ2429563 Uniprot:Q54EN9
        Length = 902

 Score = 143 (55.4 bits), Expect = 6.9e-06, P = 6.9e-06
 Identities = 53/235 (22%), Positives = 95/235 (40%)

Query:   390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE-NGTHEY 448
             N++  N I    N+ +   NG + YN+    N+N  Y N  + +N  Y N Y  N + +Y
Sbjct:   131 NNSSNNNINNNNNNNINNNNGDN-YNNYNNNNNNNNYNN--NNFNNNYNNNYNGNNSFDY 187

Query:   449 NPKYE----------NRYENGTHEYNGPKNENTNPRYENGTH---EYNIPRLENSINGNG 495
             N              N Y N  ++YN   N NTN      T+     N     N+   N 
Sbjct:   188 NNNNNSNVYFNNDRGNNYNNSYNDYNNNNNNNTNTNTNTNTNTNTNTNTNTNTNTNTNNN 247

Query:   496 TSENRSNDNSYQNEIDGIDVWSVLSRNEP-----SKRNTILHNIDDEWQISALTRGKWKL 550
              S N +N+N+  N  +    ++    N+P     +  N   +N ++    +   R K   
Sbjct:   248 NSFNNNNNNNNNNNFNNSSNYNYDYNNKPYVNSNNNNNNNNNNFNNNINNNNNNRNKSPP 307

Query:   551 VK-ENSING---NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD 601
              + +  I+    N   + + N+ +   E++  D   +L++N   K+   +H + D
Sbjct:   308 PQYQTQISQQQPNNQQQQQLNNKNENLELEDEDENLILNQNRKPKKTKTIHRLMD 362

 Score = 127 (49.8 bits), Expect = 0.00036, P = 0.00036
 Identities = 46/207 (22%), Positives = 86/207 (41%)

Query:   409 NGTHEYNSPRIENSN-TRYENGTHEYNPKYENRYENGTHEYNPKYENRYE-NGTHEYNGP 466
             N  +  N+  I N+N   Y N  +  N    N Y N  + +N  Y N Y  N + +YN  
Sbjct:   136 NNINNNNNNNINNNNGDNYNNYNNNNN---NNNYNN--NNFNNNYNNNYNGNNSFDYNNN 190

Query:   467 KNENT---NPR---YENGTHEYNIPRLENSINGNGTSEN-RSNDNSYQNEIDGIDVWSVL 519
              N N    N R   Y N  ++YN     N+     T+ N  +N N+  N     +  +  
Sbjct:   191 NNSNVYFNNDRGNNYNNSYNDYNNNNNNNTNTNTNTNTNTNTNTNTNTNTNTNTNNNNSF 250

Query:   520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
             + N  +  N   +N    +      +        N+ N N  + N +N+N+ +N+     
Sbjct:   251 NNNNNNNNNNNFNN-SSNYNYDYNNKPYVNSNNNNNNNNNNFNNNINNNNNNRNKSPPPQ 309

Query:   580 VWSVLSRNEPS-KRNTILHNIDDEWQI 605
               + +S+ +P+ ++   L+N ++  ++
Sbjct:   310 YQTQISQQQPNNQQQQQLNNKNENLEL 336


>DICTYBASE|DDB_G0282019 [details] [associations]
            symbol:DDB_G0282019 species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000315
            Pfam:PF00643 dictyBase:DDB_G0282019 GO:GO:0008270 GO:GO:0005622
            EMBL:AAFI02000044 RefSeq:XP_640410.1 ProteinModelPortal:Q54T41
            EnsemblProtists:DDB0205090 GeneID:8623365 KEGG:ddi:DDB_G0282019
            InParanoid:Q54T41 OMA:CNYSYNC ProtClustDB:CLSZ2846638
            Uniprot:Q54T41
        Length = 402

 Score = 138 (53.6 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 56/240 (23%), Positives = 91/240 (37%)

Query:   367 VHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY 426
             + + ++  +LL   N + I N      +N +  ++N IL +    +      IEN    Y
Sbjct:   164 IEMDEYQKSLLILNNNNIIDN------DNKLKDFKNQILSFN---YSLIKNIIENFKLIY 214

Query:   427 ENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPR 486
               G +  N    N   N  +  N    N   N  +  N   N N+N  Y N  + YN   
Sbjct:   215 SFGDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNYSY-NCNYSYNCNY 273

Query:   487 LENSINGNGTSENRSNDNSYQN-EIDGIDVWSVLSRNEPSKRNTILHNID-----DEWQI 540
               N  N N    N SN NS  + + +  +  ++ S +     N I ++ D     D +  
Sbjct:   274 SYNCNNNNNYRNNNSNSNSNNSYDCNNDNNNNIFSNSNGHNDNDIGNDFDNDNDNDSYID 333

Query:   541 SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 600
                  G +   K N+ N N  + N +N+N+  N  +        + N  S  N  L N D
Sbjct:   334 DDNNDGDYNNNKNNNYNNNNNNNNNNNNNNNNNNKN--------NNNNNSNNNNKLSNAD 385


>DICTYBASE|DDB_G0284321 [details] [associations]
            symbol:DDB_G0284321 "putative polypyrimidine tract
            binding protein (PTBP1)" species:44689 "Dictyostelium discoideum"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000504 InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360
            dictyBase:DDB_G0284321 GO:GO:0000166 Gene3D:3.30.70.330
            GO:GO:0003676 EMBL:AAFI02000064 eggNOG:NOG263741 OMA:DATENEI
            RefSeq:XP_638677.1 ProteinModelPortal:Q54PW8 SMR:Q54PW8
            EnsemblProtists:DDB0233645 GeneID:8624506 KEGG:ddi:DDB_G0284321
            InParanoid:Q54PW8 Uniprot:Q54PW8
        Length = 892

 Score = 142 (55.0 bits), Expect = 8.7e-06, P = 8.7e-06
 Identities = 50/214 (23%), Positives = 86/214 (40%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYEN-GTHEYNPKYEN 439
             NK +  N  N++  N I   + +    EN   E +    EN N   EN  T++ + K EN
Sbjct:    81 NKKNNNNNNNNSSSNNIKETDGNKNDVENEISEVDFEGSENEN---ENKNTNQNDIKNEN 137

Query:   440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
               +N  +  N    N   N  +  N   N N N    N     N    EN       +EN
Sbjct:   138 ENDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNENENENENENENENENENEN 197

Query:   500 ---RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
                + N+N  + E D  D  +  S+  P ++   L   ++    ++ +         N+ 
Sbjct:   198 ENAKENENENEKEKDNED--NKESKTSPPQKIKNLDESNNNSNSNSNSNNNNNNNNNNNN 255

Query:   557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 590
             N N  + N +N+N+  N  +  ++  V++ N+ S
Sbjct:   256 NNNNNNNNNNNNNNNNNNKNNKNLNGVINENKRS 289

 Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 47/229 (20%), Positives = 89/229 (38%)

Query:   376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNP 435
             L+S  N  +    +N+   N     + +     N +   N    + +    EN   E + 
Sbjct:    57 LISEPNNRNNSETLNNNNNNNNKNNKKNNNNNNNNSSSNNIKETDGNKNDVENEISEVDF 116

Query:   436 K-YENRYEN-GTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSING 493
             +  EN  EN  T++ + K EN  +N  +  N   N N N    N  +  N     N+ N 
Sbjct:   117 EGSENENENKNTNQNDIKNENENDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNS 176

Query:   494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
             N       N+N  +NE +  +  +  + NE  K      N + +       +   +    
Sbjct:   177 NENENENENENENENENENENENAKENENENEKEKDNEDNKESKTSPPQKIKNLDESNNN 236

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
             ++ N N  + N +N+N+  N  +  +  +  + N  +K N  L+ + +E
Sbjct:   237 SNSNSNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKNNKNLNGVINE 285


>DICTYBASE|DDB_G0291348 [details] [associations]
            symbol:DDB_G0291348 "fungal transcriptional
            regulatory protein, N-terminal domain-containing protein"
            species:44689 "Dictyostelium discoideum" [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0006357 "regulation of
            transcription from RNA polymerase II promoter" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0000981
            "sequence-specific DNA binding RNA polymerase II transcription
            factor activity" evidence=IEA] InterPro:IPR001138 Pfam:PF00172
            PROSITE:PS00463 PROSITE:PS50048 SMART:SM00066
            dictyBase:DDB_G0291348 GO:GO:0005634 EMBL:AAFI02000177
            GO:GO:0008270 GO:GO:0006357 GO:GO:0006366 GO:GO:0000981
            Gene3D:4.10.240.10 SUPFAM:SSF57701 RefSeq:XP_635156.1
            ProteinModelPortal:Q54ET4 EnsemblProtists:DDB0220623 GeneID:8628102
            KEGG:ddi:DDB_G0291348 eggNOG:NOG295150 InParanoid:Q54ET4
            ProtClustDB:CLSZ2429552 Uniprot:Q54ET4
        Length = 771

 Score = 141 (54.7 bits), Expect = 9.2e-06, P = 9.2e-06
 Identities = 40/197 (20%), Positives = 74/197 (37%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N +   N  +   +N    + +S +   N  ++ N   I N+N  Y N  +  N    N 
Sbjct:    93 NNNHSHNNCHDNNQNNSHNHNHSNIISNNIQNQINGNLITNNNNNYNNNNNNNNDNNNNN 152

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
               N  ++ N    N   N  +  N   N N N  Y N    +N      + N N  + + 
Sbjct:   153 NNNNNNDNNNNNNNNNNNNNNNNNNNNNNNNNNNYNNLNENFNNQNFNQNFNQNFNNVDN 212

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
              ++  + N  + ++  SV ++   +   T+  N + +                N+ N N 
Sbjct:   213 MHNQLFNNSNNYLNNNSVKTKQNENLIETLSKNKNKQNLNINNNNNNNNNNNNNNNNNNN 272

Query:   561 TSENRSNDNSYQNEIDG 577
              + N +N+N+  N  DG
Sbjct:   273 NNNNNNNNNNNNNNGDG 289

 Score = 139 (54.0 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 53/202 (26%), Positives = 84/202 (41%)

Query:   378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYE-NGTHEYNPK 436
             S +N S   NY N    N    + N     +N +H +N   I ++N + + NG    N  
Sbjct:    78 SQSNHSQ-SNY-NHNHTNNNHSHNNCHDNNQNNSHNHNHSNIISNNIQNQINGNLITNNN 135

Query:   437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGT 496
               N Y N  +  N    N   N  ++ N   N N N    N  +  N     N+ N N  
Sbjct:   136 --NNYNNNNNNNNDNNNNNNNNNNNDNNNNNNNNNNNNNNNNNNNNNN---NNNNNYNNL 190

Query:   497 SENRSNDNSYQN---EIDGID-VWSVLSRNEPSK-RNTILHNIDDEWQISALTRGKWKLV 551
             +EN +N N  QN     + +D + + L  N  +   N  +    +E  I  L++ K K  
Sbjct:   191 NENFNNQNFNQNFNQNFNNVDNMHNQLFNNSNNYLNNNSVKTKQNENLIETLSKNKNK-- 248

Query:   552 KENSINGNGTSENRSNDNSYQN 573
             +  +IN N  + N +N+N+  N
Sbjct:   249 QNLNINNNNNNNNNNNNNNNNN 270

 Score = 136 (52.9 bits), Expect = 3.2e-05, P = 3.2e-05
 Identities = 51/222 (22%), Positives = 88/222 (39%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N S   N+ N    NI  +   +++   N  + YN+    N++    N  +  N    N 
Sbjct:   107 NNSHNHNHSNIISNNIQNQINGNLIT--NNNNNYNNNNNNNNDNNNNNNNNNNNDNNNNN 164

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN-GTSEN 499
               N  +  N    N   N  + YN   NEN N +  N     N   ++N  N     S N
Sbjct:   165 NNNNNNNNNNNNNNNNNNNNNNYNN-LNENFNNQNFNQNFNQNFNNVDNMHNQLFNNSNN 223

Query:   500 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
               N+NS + +    ++   LS+N+ +K+N  ++N ++    +            N+ N N
Sbjct:   224 YLNNNSVKTK-QNENLIETLSKNK-NKQNLNINNNNNNNNNNNNNNNNNN--NNNNNNNN 279

Query:   560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPS-KRNTILHNID 600
               + N + D +  N I    + + L  N+ + K      NID
Sbjct:   280 NNNNNNNGDGNNGNNIVKSPILNFLVNNQNAMKTQKTQSNID 321

 Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
 Identities = 47/219 (21%), Positives = 91/219 (41%)

Query:   385 IPNYVNSTVENIIPRYEN-SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
             I N   +T  N      N SI   +N     ++    N N  + N  H +N  ++N  +N
Sbjct:    49 IKNNQTTTTTNSTTNPNNQSIKNIQNQNQSQSNHSQSNYNHNHTNNNHSHNNCHDNN-QN 107

Query:   444 GTHEYNPKYENRYENGT-HEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
              +H +N  + N   N   ++ NG    N N  Y N  +  N     N+ N N  + N +N
Sbjct:   108 NSHNHN--HSNIISNNIQNQINGNLITNNNNNYNNNNNNNNDNNNNNNNNNNNDNNNNNN 165

Query:   503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRG-KWKLVKENSINGN-- 559
             +N+  N  +        + N  +  N   +N+++ +      +         ++++    
Sbjct:   166 NNNNNNNNNN-------NNNNNNNNNNNYNNLNENFNNQNFNQNFNQNFNNVDNMHNQLF 218

Query:   560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
               S N  N+NS + +    ++   LS+N+ +K+N  ++N
Sbjct:   219 NNSNNYLNNNSVKTK-QNENLIETLSKNK-NKQNLNINN 255


>ZFIN|ZDB-GENE-030131-775 [details] [associations]
            symbol:sulf2l "sulfatase 2, like" species:7955
            "Danio rerio" [GO:0003824 "catalytic activity" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0008152 "metabolic
            process" evidence=IEA] [GO:0009986 "cell surface" evidence=IEA]
            [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0008484
            "sulfuric ester hydrolase activity" evidence=IEA] [GO:0005783
            "endoplasmic reticulum" evidence=IEA] InterPro:IPR000917
            InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
            Pfam:PF00884 PIRSF:PIRSF036665 ZFIN:ZDB-GENE-030131-775
            GO:GO:0005783 GO:GO:0005794 GO:GO:0009986 GO:GO:0005509
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 GO:GO:0008484 HOGENOM:HOG000290161 KO:K14607
            HOVERGEN:HBG056431 InterPro:IPR024609 Pfam:PF12548
            OrthoDB:EOG49KFPX EMBL:AY332607 IPI:IPI00499289
            RefSeq:NP_001003833.2 UniGene:Dr.12108 ProteinModelPortal:Q6EF98
            GeneID:322056 KEGG:dre:322056 CTD:322056 NextBio:20807645
            ArrayExpress:Q6EF98 Uniprot:Q6EF98
        Length = 885

 Score = 125 (49.1 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 56/223 (25%), Positives = 91/223 (40%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
             P+II IL DD    D+   G  Q+       +   G    N + T  +C PSRS+++TGK
Sbjct:    46 PNIILILTDD---QDIEL-GSMQVMNKTRRIMEQGGTHFSNAFVTTPMCCPSRSSMLTGK 101

Query:   119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
             +  H    HN     E    P  +     +    YL   GYRT   GK+ L  Y   Y P
Sbjct:   102 YA-HN---HNTYTNNENCSSPSWQAQHEPRTFGVYLNNTGYRTAFFGKY-LNEYNGTYIP 156

Query:   174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
                G+   +     +  +++++   +   G+  +   E   D    Y TD+ T ++++  
Sbjct:   157 P--GWREWVAM-VKNSRFYNYT---LCRNGVREKHGFEYPKD----YLTDLITNDSINYF 206

Query:   234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                       P+ + ++HAA H      P Q    + N  +HI
Sbjct:   207 RMSKKIYPHRPVLMVISHAAPHGPEDAAP-QYTTAFPNASQHI 248

 Score = 66 (28.3 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 28/118 (23%), Positives = 47/118 (39%)

Query:   269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX 328
             IH    +  + K    L  +D+SV KV   L     L N+ +++ +D             
Sbjct:   273 IHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADHGYHIGQFGLVKG 332

Query:   329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
              + P        +E  +R    I  P +E+ GI     +++ D  PT+L  A   D+P
Sbjct:   333 KSMP--------YEFDIRVPFYIRGPNVEAGGINPHIVLNI-DLAPTILDIAGM-DVP 380


>UNIPROTKB|I3L4C9 [details] [associations]
            symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
            species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
            activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
            EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH
            ProteinModelPortal:I3L4C9 SMR:I3L4C9 Ensembl:ENST00000576941
            Bgee:I3L4C9 Uniprot:I3L4C9
        Length = 108

 Score = 114 (45.2 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 25/78 (32%), Positives = 47/78 (60%)

Query:    41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             P+    +++ V  +  + P + + +LADD G+   G +    I TP++DALA   ++ +N
Sbjct:     4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62

Query:   101 YYT-VQLCTPSRSAIMTG 117
              +T V  C+PSR++++TG
Sbjct:    63 AFTSVSSCSPSRASLLTG 80


>GENEDB_PFALCIPARUM|PFL1370w [details] [associations]
            symbol:Pfnek-1 "NIMA-related protein kinase,
            Pfnek-1" species:5833 "Plasmodium falciparum" [GO:0007067 "mitosis"
            evidence=ISS] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000719 InterPro:IPR002290 InterPro:IPR008271
            InterPro:IPR011009 Pfam:PF00069 PROSITE:PS00108 PROSITE:PS50011
            SMART:SM00220 GO:GO:0005524 SUPFAM:SSF56112 GO:GO:0004674
            EMBL:AE014188 KO:K08286 GenomeReviews:AE014188_GR HSSP:Q00535
            RefSeq:XP_001350680.1 ProteinModelPortal:Q8I5D5 IntAct:Q8I5D5
            MINT:MINT-1689491 EnsemblProtists:PFL1370w:mRNA GeneID:811326
            KEGG:pfa:PFL1370w EuPathDB:PlasmoDB:PF3D7_1228300
            HOGENOM:HOG000281114 OMA:CINDEEN Uniprot:Q8I5D5
        Length = 1057

 Score = 141 (54.7 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 45/211 (21%), Positives = 91/211 (43%)

Query:   371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENG 429
             D + +LL   + + I N    +  N      N+      +G +  ++   + SNT  ENG
Sbjct:   664 DEINSLLKKKSINTISNKNTQSYSNSSTHINNNYNVVNCHGAYNNHNTLSQYSNTSVENG 723

Query:   430 THEYNPKYENRYENGTHE-YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
              ++Y  KY+    N + + YN   ++ Y +  +E     N++ N +  N  +  N+  + 
Sbjct:   724 KYKYENKYQGNIRNTSKDVYNENMDSAYRSPKYEKGYDDNKSVNKKKMNSNNMGNMNNMN 783

Query:   489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRG 546
             N  N N  + N SN+N+  N  +     + ++ N  +  N     + I + +   +  R 
Sbjct:   784 NMNNMNNNNNNNSNNNNNSNNSNS----NYMNNNHHTNNNNSCTSNRISNMYFNDSSRRS 839

Query:   547 KWKLVKENSINGNGTSENRSNDNSY-QNEID 576
                +   N+++   +S    +DN Y QN ++
Sbjct:   840 VSAMPNVNNVSRRKSSVYLCDDNMYNQNNVE 870


>UNIPROTKB|Q8I5D5 [details] [associations]
            symbol:nek-1 "NIMA-related protein kinase, Pfnek-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000719
            InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR011009
            Pfam:PF00069 PROSITE:PS00108 PROSITE:PS50011 SMART:SM00220
            GO:GO:0005524 SUPFAM:SSF56112 GO:GO:0004674 EMBL:AE014188 KO:K08286
            GenomeReviews:AE014188_GR HSSP:Q00535 RefSeq:XP_001350680.1
            ProteinModelPortal:Q8I5D5 IntAct:Q8I5D5 MINT:MINT-1689491
            EnsemblProtists:PFL1370w:mRNA GeneID:811326 KEGG:pfa:PFL1370w
            EuPathDB:PlasmoDB:PF3D7_1228300 HOGENOM:HOG000281114 OMA:CINDEEN
            Uniprot:Q8I5D5
        Length = 1057

 Score = 141 (54.7 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 45/211 (21%), Positives = 91/211 (43%)

Query:   371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENG 429
             D + +LL   + + I N    +  N      N+      +G +  ++   + SNT  ENG
Sbjct:   664 DEINSLLKKKSINTISNKNTQSYSNSSTHINNNYNVVNCHGAYNNHNTLSQYSNTSVENG 723

Query:   430 THEYNPKYENRYENGTHE-YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
              ++Y  KY+    N + + YN   ++ Y +  +E     N++ N +  N  +  N+  + 
Sbjct:   724 KYKYENKYQGNIRNTSKDVYNENMDSAYRSPKYEKGYDDNKSVNKKKMNSNNMGNMNNMN 783

Query:   489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRG 546
             N  N N  + N SN+N+  N  +     + ++ N  +  N     + I + +   +  R 
Sbjct:   784 NMNNMNNNNNNNSNNNNNSNNSNS----NYMNNNHHTNNNNSCTSNRISNMYFNDSSRRS 839

Query:   547 KWKLVKENSINGNGTSENRSNDNSY-QNEID 576
                +   N+++   +S    +DN Y QN ++
Sbjct:   840 VSAMPNVNNVSRRKSSVYLCDDNMYNQNNVE 870


>TIGR_CMR|SPO_2214 [details] [associations]
            symbol:SPO_2214 "choline sulfatase" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
            ProtClustDB:CLSK864791 GO:GO:0047753 RefSeq:YP_167440.1
            ProteinModelPortal:Q5LRB5 GeneID:3194829 KEGG:sil:SPO2214
            PATRIC:23377781 OMA:LLIMADQ Uniprot:Q5LRB5
        Length = 498

 Score = 103 (41.3 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
 Identities = 55/225 (24%), Positives = 87/225 (38%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+I+ I+AD +    +   G     T ++  LA   +   N YT   +C P+RS  MTG 
Sbjct:    17 PNILLIMADQMTPFMLEACGGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTGL 76

Query:   119 HPIHTGMQHNVLYGCERGGLPLSEKILP---QYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             +   TG        C   G P     LP    YL   GY T + GK H  F   +     
Sbjct:    77 YTSTTG--------CYDNGDPY-HSFLPTFAHYLTNAGYETVLSGKMH--FIGADQ---L 122

Query:   176 RGFESHLG---YWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
              GF+  L    Y +G    +      D S +        +  ++ P W    +Y  +   
Sbjct:   123 HGFQRRLNPDIYPSGFLWSYPLPPDGDASFQAFDFTPQYLAENIGPGWSKELQYDEET-Q 181

Query:   227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
               A++ +  H+ D P  L ++       NP+ P   P  Y  +++
Sbjct:   182 FRALEYLR-HAPDTPWMLTVSFT-----NPHPPYVVPRPYWEMYK 220

 Score = 77 (32.2 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
 Identities = 16/64 (25%), Positives = 33/64 (51%)

Query:   252 HSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV 311
             H+   +  L    H +   R++   +R  FAA+ H +D+ +G ++E L++      ++I+
Sbjct:   242 HALRRWHGLHQRGHEVRDPRNLIAMRRG-FAALAHYVDDKIGALLEVLDETGQRDETVII 300

Query:   312 FVSD 315
               SD
Sbjct:   301 VTSD 304

 Score = 46 (21.3 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query:   751 NEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEV 802
             +E  G   +R +  ++ G  K   C    AP L+++  DP E +N A   ++
Sbjct:   387 SEYHGEGIMRPSFMVRLGDWKYHYCHGS-APQLYNLARDPGEWHNRAGEPDL 437

 Score = 43 (20.2 bits), Expect = 3.3e-05, Sum P(3) = 3.3e-05
 Identities = 18/62 (29%), Positives = 31/62 (50%)

Query:   653 LRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA---DRSEDQ-RINHYTTEVGR 708
             +R +  ++ G  K   C    AP L+++  DP E +N A   D +E + R++   T  G 
Sbjct:   395 MRPSFMVRLGDWKYHYCHGS-APQLYNLARDPGEWHNRAGEPDLAETEARLDRVITG-GS 452

Query:   709 FN 710
             F+
Sbjct:   453 FD 454


>DICTYBASE|DDB_G0271052 [details] [associations]
            symbol:snf2b "SNF2-related protein Snf2a"
            species:44689 "Dictyostelium discoideum" [GO:0016818 "hydrolase
            activity, acting on acid anhydrides, in phosphorus-containing
            anhydrides" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0005524 "ATP binding" evidence=IEA] [GO:0004386 "helicase
            activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0006357
            "regulation of transcription from RNA polymerase II promoter"
            evidence=ISS] [GO:0005654 "nucleoplasm" evidence=ISS]
            InterPro:IPR000330 InterPro:IPR001487 InterPro:IPR001650
            InterPro:IPR014978 Pfam:PF00176 Pfam:PF00271 PRINTS:PR00503
            PROSITE:PS50014 PROSITE:PS51194 SMART:SM00297 SMART:SM00490
            SMART:SM00951 dictyBase:DDB_G0271052 GO:GO:0005524 GO:GO:0005654
            EMBL:AAFI02000005 GO:GO:0003677 GO:GO:0006357 GO:GO:0004386
            InterPro:IPR011050 SUPFAM:SSF51126 eggNOG:COG0553
            InterPro:IPR014001 SMART:SM00487 PROSITE:PS51192 SUPFAM:SSF47370
            KO:K11647 InterPro:IPR014012 PROSITE:PS51204 RefSeq:XP_646649.1
            ProteinModelPortal:Q55C32 EnsemblProtists:DDB0220695 GeneID:8617621
            KEGG:ddi:DDB_G0271052 InParanoid:Q55C32 OMA:NINDNPN Uniprot:Q55C32
        Length = 3247

 Score = 144 (55.7 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 56/217 (25%), Positives = 78/217 (35%)

Query:   362 VAEQYVHVSDWL-P-TLLSAANKSDIP-NYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
             + E+Y  +     P T ++ ++ S +  N  NS V N      NS +   N     NS  
Sbjct:   632 ITEEYYGILQLAHPSTFINQSSPSVVQMNTNNSNVNNNNNNNSNSNMNNNNMNSNNNSNM 691

Query:   419 IENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENG 478
               N+N    NG +  N    N   N     N    N   N     N   N N N    N 
Sbjct:   692 --NNNMNNNNGVNNMNNNMNNNNTNNNSNNNNMNHNNMNNNNGMNNNMNNNNNNNNNMNN 749

Query:   479 THEYNIPRLENSINGNGTSENRSNDNSY--QNEIDGIDVWSVLSRNEPSKRNTILHNIDD 536
                 NI    NS N    S N SN+N     N I+ I   +  S N  +  N   +N ++
Sbjct:   750 NTNSNINSNNNSGNSTNNSANISNNNGNIGNNNINNISYNNNNSNNNSNNNNNSNNNSNN 809

Query:   537 EWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
                 S  +         NS N N  + N +N N+  N
Sbjct:   810 NNNSSGNSNSNSNN-NSNSNNNNNNNNNNNNSNTSGN 845

 Score = 57 (25.1 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 13/49 (26%), Positives = 25/49 (51%)

Query:   554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
             N+ N N  + N +N+N YQ+            R++P+K+    + +DD+
Sbjct:  3181 NNYNNNNYNSNHNNNNQYQHH--SYQQQQHQQRHQPNKKQRF-NPLDDD 3226


>GENEDB_PFALCIPARUM|PF11_0176 [details] [associations]
            symbol:PF11_0176 "hypothetical protein"
            species:5833 "Plasmodium falciparum" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR016196 SUPFAM:SSF103473 EMBL:AE014186
            RefSeq:XP_001347847.2 ProteinModelPortal:Q8IIJ7
            EnsemblProtists:PF11_0176:mRNA GeneID:810723 KEGG:pfa:PF11_0176
            EuPathDB:PlasmoDB:PF3D7_1117000 Uniprot:Q8IIJ7
        Length = 1283

 Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 66/229 (28%), Positives = 95/229 (41%)

Query:   389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
             +N+T  N     EN+     N  +E N+   EN+N    N  +E N   EN   N  +  
Sbjct:   754 INTTTTNNNNNNENNNNNENNNNNENNNNN-ENNNNNENNNNNENNNNNENNNNNENNNN 812

Query:   449 NPKYENRYENGTHEYNGPKNE-NTNPRY-ENGTHEYNI--PRLENS-INGNGTSENRSND 503
             N    N   N  +E N   N  N N  + +N  H  NI  P  +N  IN   T+E   N 
Sbjct:   813 NENNNNNENNNNNENNNNNNHHNHNHNHNQNNHHNQNINYPNPQNERINYPFTNEFIHNH 872

Query:   504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA---LTRGKWKLVKENSINGN- 559
             + Y N I        L+  +    NTIL N  ++  I+     T  +  L+KEN I  + 
Sbjct:   873 HEYVNNI-------ALTPKQQIIDNTILENKQNDEDINKKKLTTHSQKNLLKENLIITDE 925

Query:   560 ---GTSENRSNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
                 T  N+  +N+  QN I  +     L R+E  +   I  NI +E Q
Sbjct:   926 YFINTDTNQYMNNAQNQNNIC-LPKGIYLDRSEECEPKNIW-NIQNESQ 972

 Score = 125 (49.1 bits), Expect = 0.00091, P = 0.00091
 Identities = 52/183 (28%), Positives = 81/183 (44%)

Query:   392 TVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
             T+ NII +     N+  +  NG+   N+    N+N    N  +E N   EN   N  +  
Sbjct:   730 TLNNIITQSNIPINNTNQNINGS-PINTTTTNNNNNNENNNNNENNNNNENN-NNNENNN 787

Query:   449 NPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS-NDNSYQ 507
             N +  N  EN  +  N   NEN N    N  +E N    EN+ N N  + N + N N++ 
Sbjct:   788 NNENNNNNENNNNNENNNNNENNNNNENNNNNENNNNN-ENNNNNNHHNHNHNHNQNNHH 846

Query:   508 NEIDGIDVWSVLSR--NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
             N+   I+  +  +   N P   N  +HN  +     ALT  K +++ +N+I      EN+
Sbjct:   847 NQ--NINYPNPQNERINYPFT-NEFIHNHHEYVNNIALTP-KQQII-DNTI-----LENK 896

Query:   566 SND 568
              ND
Sbjct:   897 QND 899


>UNIPROTKB|Q8IIJ7 [details] [associations]
            symbol:PF11_0176 "Conserved Plasmodium membrane protein"
            species:36329 "Plasmodium falciparum 3D7" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR016196 SUPFAM:SSF103473 EMBL:AE014186
            RefSeq:XP_001347847.2 ProteinModelPortal:Q8IIJ7
            EnsemblProtists:PF11_0176:mRNA GeneID:810723 KEGG:pfa:PF11_0176
            EuPathDB:PlasmoDB:PF3D7_1117000 Uniprot:Q8IIJ7
        Length = 1283

 Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 66/229 (28%), Positives = 95/229 (41%)

Query:   389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
             +N+T  N     EN+     N  +E N+   EN+N    N  +E N   EN   N  +  
Sbjct:   754 INTTTTNNNNNNENNNNNENNNNNENNNNN-ENNNNNENNNNNENNNNNENNNNNENNNN 812

Query:   449 NPKYENRYENGTHEYNGPKNE-NTNPRY-ENGTHEYNI--PRLENS-INGNGTSENRSND 503
             N    N   N  +E N   N  N N  + +N  H  NI  P  +N  IN   T+E   N 
Sbjct:   813 NENNNNNENNNNNENNNNNNHHNHNHNHNQNNHHNQNINYPNPQNERINYPFTNEFIHNH 872

Query:   504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA---LTRGKWKLVKENSINGN- 559
             + Y N I        L+  +    NTIL N  ++  I+     T  +  L+KEN I  + 
Sbjct:   873 HEYVNNI-------ALTPKQQIIDNTILENKQNDEDINKKKLTTHSQKNLLKENLIITDE 925

Query:   560 ---GTSENRSNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
                 T  N+  +N+  QN I  +     L R+E  +   I  NI +E Q
Sbjct:   926 YFINTDTNQYMNNAQNQNNIC-LPKGIYLDRSEECEPKNIW-NIQNESQ 972

 Score = 125 (49.1 bits), Expect = 0.00091, P = 0.00091
 Identities = 52/183 (28%), Positives = 81/183 (44%)

Query:   392 TVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
             T+ NII +     N+  +  NG+   N+    N+N    N  +E N   EN   N  +  
Sbjct:   730 TLNNIITQSNIPINNTNQNINGS-PINTTTTNNNNNNENNNNNENNNNNENN-NNNENNN 787

Query:   449 NPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS-NDNSYQ 507
             N +  N  EN  +  N   NEN N    N  +E N    EN+ N N  + N + N N++ 
Sbjct:   788 NNENNNNNENNNNNENNNNNENNNNNENNNNNENNNNN-ENNNNNNHHNHNHNHNQNNHH 846

Query:   508 NEIDGIDVWSVLSR--NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
             N+   I+  +  +   N P   N  +HN  +     ALT  K +++ +N+I      EN+
Sbjct:   847 NQ--NINYPNPQNERINYPFT-NEFIHNHHEYVNNIALTP-KQQII-DNTI-----LENK 896

Query:   566 SND 568
              ND
Sbjct:   897 QND 899


>ZFIN|ZDB-GENE-030131-5846 [details] [associations]
            symbol:gnsb "glucosamine (N-acetyl)-6-sulfatase
            (Sanfilippo disease IIID), b" species:7955 "Danio rerio"
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] InterPro:IPR000917
            InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666
            ZFIN:ZDB-GENE-030131-5846 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 GeneTree:ENSGT00400000022041 GO:GO:0030203
            GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:CU896586 IPI:IPI00971874
            Ensembl:ENSDART00000112103 ArrayExpress:F1QJ04 Bgee:F1QJ04
            Uniprot:F1QJ04
        Length = 507

 Score = 136 (52.9 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 58/226 (25%), Positives = 93/226 (41%)

Query:    41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             PL   +   F    A S   +II IL DD    D    G+  +     + +  +G    N
Sbjct:    14 PLKLLVLFFFFFTCAFSSKNNIILILTDD---QDEQMGGMTPMKKTR-ELIGDAGATFSN 69

Query:   101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHN--VLYGCERGGLPLSEK--ILPQYLKELGYR 155
              +T   LC PSRS+ ++G++P H  + HN  V   C       + +    P YL ++ Y+
Sbjct:    70 AFTSTPLCCPSRSSFLSGRYP-HNHLVHNNSVEGNCSSAAWQKTAEPFAFPVYLNKMRYQ 128

Query:   156 TRIVGKWHLGFYKKEYTPTFRGFESHL--GY--W---TGHQDYFDHSAEEMKMWGLDMRR 208
             T   GK     Y  +Y     G  +H+  G+  W    G+  Y++++        L +  
Sbjct:   129 TFYCGK-----YLNQYGSKDAGGVAHVPPGWDQWHALVGNSKYYNYT--------LSVNG 175

Query:   209 DLEPAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
               E   D + K Y TD+    ++  +   S   P F+ L   A HS
Sbjct:   176 KEEKHGDSYEKDYLTDLVLNRSLHFLEERSPSHPFFMMLCPPAPHS 221


>UNIPROTKB|F5H260 [details] [associations]
            symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
            "Homo sapiens" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR015981 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 GO:GO:0008449
            PANTHER:PTHR10342:SF5 EMBL:AC025262 HGNC:HGNC:4422 ChiTaRS:GNS
            IPI:IPI01010051 ProteinModelPortal:F5H260 SMR:F5H260
            Ensembl:ENST00000545471 ArrayExpress:F5H260 Bgee:F5H260
            Uniprot:F5H260
        Length = 344

 Score = 126 (49.4 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
 Identities = 48/161 (29%), Positives = 71/161 (44%)

Query:   101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRT 156
             Y    LC PSR++I+TGK+P +  + +N L G C  +    + E    P  L+ + GY+T
Sbjct:    22 YVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQT 81

Query:   157 RIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
                GK     Y  EY  P   G E   LG  YW   +    +    + + G   +     
Sbjct:    82 FFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENY 136

Query:   213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
             + D    Y TDV    ++D +   S  EP F+ +A  A HS
Sbjct:   137 SVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHS 173

 Score = 51 (23.0 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   227 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 263


>DICTYBASE|DDB_G0278425 [details] [associations]
            symbol:DDB_G0278425 species:44689 "Dictyostelium
            discoideum" [GO:0016779 "nucleotidyltransferase activity"
            evidence=IEA] InterPro:IPR002934 Pfam:PF01909
            dictyBase:DDB_G0278425 Pfam:PF03828 EMBL:AAFI02000023
            eggNOG:COG5260 InterPro:IPR002058 GO:GO:0016779 RefSeq:XP_642359.1
            EnsemblProtists:DDB0205447 GeneID:8621564 KEGG:ddi:DDB_G0278425
            InParanoid:Q54Y43 OMA:TEESINT Uniprot:Q54Y43
        Length = 1090

 Score = 147 (56.8 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
 Identities = 51/205 (24%), Positives = 91/205 (44%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N  D PN +N+   N   +  N    + +  H +N+    N+N    N ++  N    N 
Sbjct:    43 NGVDNPNDINNGNNNSHHKKNNHHNHHYH--HHHNNNNNNNNNNNNNNNSNNNNNNNSNN 100

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH-EYNIPRLENSINGNGTSEN 499
               N  +  N    N   NG H +N  +N N+N  Y+  T  +++I    N+IN N  + N
Sbjct:   101 NNNNNNNNNNNNNNN-NNG-HHHNNTQN-NSNFTYQPKTKKDHHIQNNNNNINNNNINNN 157

Query:   500 RSND-NSYQNEIDGIDVWSVLSRNEPSKRNTI----LHNIDDEWQISALTRGKWKLVKEN 554
               N+ N+  N  +G +V  ++S N  +  N      ++N ++    + +  G   +  + 
Sbjct:   158 NINNINNNINTNNGNEVGHIVSNNNNNNNNNNNNNNINNNNNNINNNTINGGNSNINNQF 217

Query:   555 SINGNGTSENRSNDNSYQNEIDGID 579
               N N  + N ++D +Y  E DGI+
Sbjct:   218 D-NENNNNNNINDDGNYIYE-DGIE 240

 Score = 135 (52.6 bits), Expect = 0.00038, Sum P(2) = 0.00038
 Identities = 50/219 (22%), Positives = 90/219 (41%)

Query:   387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
             +Y N+   N      N+    +NG    N     N+N+ ++   H +N  Y + + N  +
Sbjct:    21 HYKNNNNNNNNNNNNNNNKNNQNGVDNPNDINNGNNNSHHKKNNH-HNHHYHHHHNNNNN 79

Query:   447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI--NGNGTSENRSN-D 503
               N    N   N  +  N   N N N    N  +  N     N+   N N T + ++  D
Sbjct:    80 NNNNNNNNNNSNNNNNNNSNNNNNNNNNNNNNNNNNNNGHHHNNTQNNSNFTYQPKTKKD 139

Query:   504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSE 563
             +  QN  + I+  ++ + N  +  N I  N  +E  +  +          N+ N N  + 
Sbjct:   140 HHIQNNNNNINNNNINNNNINNINNNINTNNGNE--VGHIVSNN---NNNNNNNNNNNNI 194

Query:   564 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
             N +N+N   N I+G +  S ++ N+    N   +NI+D+
Sbjct:   195 NNNNNNINNNTINGGN--SNIN-NQFDNENNNNNNINDD 230

 Score = 43 (20.2 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
 Identities = 7/15 (46%), Positives = 10/15 (66%)

Query:   670 EPQIAPCLFDIKNDP 684
             +P I PCL ++ N P
Sbjct:   934 QPPILPCLQELANGP 948

 Score = 43 (20.2 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
 Identities = 7/15 (46%), Positives = 10/15 (66%)

Query:   776 EPQIAPCLFDIKNDP 790
             +P I PCL ++ N P
Sbjct:   934 QPPILPCLQELANGP 948


>ZFIN|ZDB-GENE-040426-759 [details] [associations]
            symbol:sulf2 "sulfatase 2" species:7955 "Danio
            rerio" [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0009986 "cell surface" evidence=IEA] [GO:0005509
            "calcium ion binding" evidence=IEA] [GO:0008484 "sulfuric ester
            hydrolase activity" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036665 ZFIN:ZDB-GENE-040426-759 GO:GO:0005783
            GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
            KO:K14607 HOVERGEN:HBG056431 InterPro:IPR024609 Pfam:PF12548
            CTD:55959 EMBL:BC045403 IPI:IPI00482734 RefSeq:NP_957230.1
            UniGene:Dr.75551 ProteinModelPortal:Q7ZVU8 PRIDE:Q7ZVU8
            GeneID:393910 KEGG:dre:393910 InParanoid:Q7ZVU8 NextBio:20814887
            ArrayExpress:Q7ZVU8 Bgee:Q7ZVU8 Uniprot:Q7ZVU8
        Length = 873

 Score = 120 (47.3 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
 Identities = 53/221 (23%), Positives = 89/221 (40%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTG 117
             P++I IL DD    D+    +  +  T  I  +   G    N + T  +C PSRS I+TG
Sbjct:    46 PNMILILTDD---QDIELGSMQAMNKTKRI--MMQGGTHFSNAFATTPMCCPSRSTILTG 100

Query:   118 KHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
             K+ +H    +     C         +      +L   GYRT   GK+ L  Y   Y P  
Sbjct:   101 KY-VHNHHTYTNNENCSSPSWQAHHEPHTFAVHLNNSGYRTAFFGKY-LNEYNGSYVPP- 157

Query:   176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
              G+   +     +  +++++   +   G+  +   +   D    Y TDV T ++++    
Sbjct:   158 -GWREWVAL-VKNSRFYNYT---LCRNGIREKHGTQYPKD----YLTDVITNDSINFFRM 208

Query:   236 HST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
                     P+ + L+HAA H      P Q    + N  +HI
Sbjct:   209 SKRMYPHRPVMMVLSHAAPHGPEDAAP-QYSSAFPNASQHI 248

 Score = 75 (31.5 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
 Identities = 41/190 (21%), Positives = 76/190 (40%)

Query:   245 YLAHAATHSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
             ++  +  H+ NP +   L+       +H    +  + +    L  +D+SV KV   L + 
Sbjct:   247 HITPSYNHAPNPDKHWILRYTGPMKPVHMQFTNMLQRRRLQTLLSVDDSVEKVYNMLVET 306

Query:   303 RMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
               L N+ I+++SD              + P        +E  +R    +  P +E+ G +
Sbjct:   307 GELDNTYIIYMSDHGYHIGQFGLVKGKSMP--------YEFDIRIPFYVRGPNVEA-GAI 357

Query:   363 AEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIP--RYENSILRYENGTHEYNSPRI 419
                 V  +D  PTLL  A   DIP  ++  ++  ++   R  NS  R+    H Y   ++
Sbjct:   358 NPHIVLNTDLAPTLLDMAG-IDIPQDMDGKSILKLLETERPVNSFTRF----HSYKKAKL 412

Query:   420 ENSNTRYENG 429
                +   E G
Sbjct:   413 WRDSFLVERG 422

 Score = 38 (18.4 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query:   678 FDIKNDPCEKNNLADRSEDQRINH 701
             FD+  DP +  N     +   +NH
Sbjct:   790 FDLNTDPYQLMNGVSTLDRDAVNH 813

 Score = 37 (18.1 bits), Expect = 2.6e-05, Sum P(3) = 2.6e-05
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query:   784 FDIKNDPCEKNNLADRSEVQRINH 807
             FD+  DP +  N     +   +NH
Sbjct:   790 FDLNTDPYQLMNGVSTLDRDAVNH 813


>GENEDB_PFALCIPARUM|MAL13P1.44 [details] [associations]
            symbol:MAL13P1.44 "protein phosphatase 2c-like
            protein, putative" species:5833 "Plasmodium falciparum" [GO:0008287
            "protein serine/threonine phosphatase complex" evidence=ISS]
            [GO:0004722 "protein serine/threonine phosphatase activity"
            evidence=ISS] [GO:0006468 "protein phosphorylation" evidence=ISS]
            InterPro:IPR001932 Pfam:PF00481 SMART:SM00331 SMART:SM00332
            GO:GO:0004722 GO:GO:0006468 Gene3D:3.60.40.10 SUPFAM:SSF81606
            KO:K01090 EMBL:AL844509 InterPro:IPR015655 PANTHER:PTHR13832
            GO:GO:0008287 RefSeq:XP_001349820.1 ProteinModelPortal:Q8IEM2
            EnsemblProtists:MAL13P1.44:mRNA GeneID:813933 KEGG:pfa:MAL13P1.44
            EuPathDB:PlasmoDB:PF3D7_1309200 ProtClustDB:CLSZ2432578
            Uniprot:Q8IEM2
        Length = 827

 Score = 138 (53.6 bits), Expect = 2.1e-05, P = 2.1e-05
 Identities = 69/278 (24%), Positives = 119/278 (42%)

Query:   430 THEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYNIPRLE 488
             TH    KY +       E N + E   EN   EY   ++ NTN +   G  + +N   L+
Sbjct:    27 THSQKNKYRDAINKYAQENNSRGE--CENYCDEYYSRRSNNTNIKLNRGMKYSHNNNGLK 84

Query:   489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRG 546
              + + N  + N S+D +  N  DGI + ++   N  +  N    N++  ++ + SA  + 
Sbjct:    85 KNDHFNCNNSNISSDENENNMNDGISINNIKQNNLDNVNNVDYDNLNIKEKKEESAFDKW 144

Query:   547 KWKLVKENSINGNGTSENRSNDN-SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
             K K  K+NS   +  ++N ++D+ +Y+NE    D  +  + N  +  N I  N  +    
Sbjct:   145 KKKKKKKNSDQFSELAKNNNSDHVNYKNEKREYDNNNNNNNNNNNNNNNIFSN--NNCNN 202

Query:   606 SALTXXXXXXXXXXXXMRYQVDLTGGPDQ-VY-LSGLSDREWLALAMRKLRDAASIQCGP 663
             S++                + DL  G ++ V+ L GL+         RK  D    +   
Sbjct:   203 SSIIYDNNVFSDNYKYYNDKCDLCNGQEKCVHRLGGLNCTHDEDDKTRKCTDENINKKLL 262

Query:   664 VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINH 701
             +K    E  I   + DI ND  E NN+ + +E   IN+
Sbjct:   263 IKND--EDSIDYSVDDI-NDEYENNNIIN-NESHIINN 296


>UNIPROTKB|Q8IEM2 [details] [associations]
            symbol:MAL13P1.44 "Protein phosphatase 2c-like protein,
            putative" species:36329 "Plasmodium falciparum 3D7" [GO:0004722
            "protein serine/threonine phosphatase activity" evidence=ISS]
            [GO:0006468 "protein phosphorylation" evidence=ISS] [GO:0008287
            "protein serine/threonine phosphatase complex" evidence=ISS]
            InterPro:IPR001932 Pfam:PF00481 SMART:SM00331 SMART:SM00332
            GO:GO:0004722 GO:GO:0006468 Gene3D:3.60.40.10 SUPFAM:SSF81606
            KO:K01090 EMBL:AL844509 InterPro:IPR015655 PANTHER:PTHR13832
            GO:GO:0008287 RefSeq:XP_001349820.1 ProteinModelPortal:Q8IEM2
            EnsemblProtists:MAL13P1.44:mRNA GeneID:813933 KEGG:pfa:MAL13P1.44
            EuPathDB:PlasmoDB:PF3D7_1309200 ProtClustDB:CLSZ2432578
            Uniprot:Q8IEM2
        Length = 827

 Score = 138 (53.6 bits), Expect = 2.1e-05, P = 2.1e-05
 Identities = 69/278 (24%), Positives = 119/278 (42%)

Query:   430 THEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYNIPRLE 488
             TH    KY +       E N + E   EN   EY   ++ NTN +   G  + +N   L+
Sbjct:    27 THSQKNKYRDAINKYAQENNSRGE--CENYCDEYYSRRSNNTNIKLNRGMKYSHNNNGLK 84

Query:   489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRG 546
              + + N  + N S+D +  N  DGI + ++   N  +  N    N++  ++ + SA  + 
Sbjct:    85 KNDHFNCNNSNISSDENENNMNDGISINNIKQNNLDNVNNVDYDNLNIKEKKEESAFDKW 144

Query:   547 KWKLVKENSINGNGTSENRSNDN-SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
             K K  K+NS   +  ++N ++D+ +Y+NE    D  +  + N  +  N I  N  +    
Sbjct:   145 KKKKKKKNSDQFSELAKNNNSDHVNYKNEKREYDNNNNNNNNNNNNNNNIFSN--NNCNN 202

Query:   606 SALTXXXXXXXXXXXXMRYQVDLTGGPDQ-VY-LSGLSDREWLALAMRKLRDAASIQCGP 663
             S++                + DL  G ++ V+ L GL+         RK  D    +   
Sbjct:   203 SSIIYDNNVFSDNYKYYNDKCDLCNGQEKCVHRLGGLNCTHDEDDKTRKCTDENINKKLL 262

Query:   664 VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINH 701
             +K    E  I   + DI ND  E NN+ + +E   IN+
Sbjct:   263 IKND--EDSIDYSVDDI-NDEYENNNIIN-NESHIINN 296


>DICTYBASE|DDB_G0279085 [details] [associations]
            symbol:cycA "cyclin" species:44689 "Dictyostelium
            discoideum" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004367
            Pfam:PF02984 dictyBase:DDB_G0279085 GO:GO:0005634
            GenomeReviews:CM000152_GR EMBL:AAFI02000027 Gene3D:1.10.472.10
            InterPro:IPR013763 InterPro:IPR006671 Pfam:PF00134 SMART:SM00385
            SUPFAM:SSF47954 eggNOG:COG5024 PROSITE:PS00292
            RefSeq:XP_001134569.1 ProteinModelPortal:Q1ZXI1
            EnsemblProtists:DDB0231774 GeneID:8621862 KEGG:ddi:DDB_G0279085
            InParanoid:Q1ZXI1 OMA:ACAFFIA Uniprot:Q1ZXI1
        Length = 588

 Score = 136 (52.9 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 46/201 (22%), Positives = 83/201 (41%)

Query:   378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
             +A N S+  N  N+   NI     N+I    N  +  N+    N+N    N  +  N K 
Sbjct:   108 TATNNSNNNNNNNNN-NNINNNNNNNINIISNNNNNNNNNNNNNNNNNNNNNNNNNNNKL 166

Query:   438 ENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
             +++  NG    E  P   N   N   + +   N+    + +N  +E   P   N+ N N 
Sbjct:   167 KSQTVNGGIKTENLPSKNNNDNNSNSDDSNNSNKTNQTQQDNSNNEIAPPTKPNNNNNNN 226

Query:   496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
              + N +N+N+  N  +  +    L+ NE ++ N I +N ++    +            N+
Sbjct:   227 NNNNNNNNNNNNNNNNNNN----LTENENNELNNIKNNNNNNNNNN----------NNNN 272

Query:   556 INGNGTSENRSNDNSYQNEID 576
              N N  + N +N+N   N ++
Sbjct:   273 NNNNNNNNNNNNNNKENNSLE 293

 Score = 123 (48.4 bits), Expect = 0.00056, P = 0.00056
 Identities = 49/201 (24%), Positives = 82/201 (40%)

Query:   412 HEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN-PK-YENRYENGTHEYNGPKNE 469
             + YN+    N+N  Y N      P   N+  +     N P  + N   N ++  N   N 
Sbjct:    63 NNYNNNNNNNNNNNYNNKNLMAKPIQSNKNNSIITASNIPSTFNNTATNNSNNNNN--NN 120

Query:   470 NTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWS------VLSRNE 523
             N N    N  +  NI    N+ N N  + N +N+N+  N  +   + S      + + N 
Sbjct:   121 NNNNINNNNNNNINIISNNNNNNNNNNNNNNNNNNNNNNNNNNNKLKSQTVNGGIKTENL 180

Query:   524 PSKRNTILHNIDDEWQISALTRGKWKLVKENSI------NGNGTSENRSNDNSYQNEIDG 577
             PSK N   ++  D+   S  T    +    N I      N N  + N +N+N+  N  + 
Sbjct:   181 PSKNNNDNNSNSDDSNNSNKTNQTQQDNSNNEIAPPTKPNNNNNNNNNNNNNNNNNNNNN 240

Query:   578 IDVWSVLSRNEPSKRNTILHN 598
              +  + L+ NE ++ N I +N
Sbjct:   241 NNN-NNLTENENNELNNIKNN 260


>DICTYBASE|DDB_G0291197 [details] [associations]
            symbol:hbx3 "putative homeobox transcription factor"
            species:44689 "Dictyostelium discoideum" [GO:0043565
            "sequence-specific DNA binding" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0007275 "multicellular organismal development" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001356
            InterPro:IPR008422 InterPro:IPR009057 Pfam:PF05920 PROSITE:PS00027
            PROSITE:PS50071 SMART:SM00389 dictyBase:DDB_G0291197 GO:GO:0007275
            GO:GO:0005634 GO:GO:0043565 GO:GO:0003700 GO:GO:0006351
            GenomeReviews:CM000154_GR Gene3D:1.10.10.60 SUPFAM:SSF46689
            EMBL:AAFI02000175 eggNOG:NOG248144 RefSeq:XP_635379.1 HSSP:P40424
            ProteinModelPortal:Q54F11 EnsemblProtists:DDB0220480 GeneID:8628027
            KEGG:ddi:DDB_G0291197 InParanoid:Q54F11 Uniprot:Q54F11
        Length = 667

 Score = 136 (52.9 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 54/215 (25%), Positives = 96/215 (44%)

Query:   374 PTLLSAANK--SDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
             P L+S+     SD+ +  NS++ +  P  EN +L      + Y + +I   N  ++N + 
Sbjct:    89 PLLISSQTSYPSDLSS--NSSISHS-P-IENQLLDNNLDINNYLN-KINIFNNHFQN-SD 142

Query:   432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNG----PKNENTNPRYENGTHEYNIPRL 487
               N  + N++EN  +  N    N  EN ++ YN     P N N N    N  +  N    
Sbjct:   143 LINTTFFNQFENNNYINN---NNNKENNSYFYNNNVNIPNNNNLNINNNNNNNNNNNNNN 199

Query:   488 ENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRG 546
              N+ N N  + N +N+N+  N  +  +  +V +  N P+  N  L N+ +      LT  
Sbjct:   200 NNNNNNNNNNNNNNNNNNNNNNNNNNNKNTVYNNVNIPNNNNFNL-NLSNNNNNLNLTNN 258

Query:   547 KWKLVKENSINGNGTS-ENRSNDNSYQNEIDGIDV 580
                   +NS+N N  +  N +N+N++   +   +V
Sbjct:   259 N---NNKNSVNNNNVNISNNNNNNNFNVNLSNNNV 290


>UNIPROTKB|Q5LVA2 [details] [associations]
            symbol:SPO0800 "Choline sulfatase, putative" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
            process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
            evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
            InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
            ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
            PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
            Uniprot:Q5LVA2
        Length = 482

 Score = 102 (41.0 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 23/66 (34%), Positives = 39/66 (59%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+++ I++D+   + +G  G   + TPN+DALA  G + +  YT   +C P+R+A+ TG 
Sbjct:     5 PNLLVIVSDEHRKDAMGCAGHPIVKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGD 64

Query:   119 HPIHTG 124
                 TG
Sbjct:    65 WIHRTG 70

 Score = 63 (27.2 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 14/52 (26%), Positives = 29/52 (55%)

Query:   264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             D Y +  +  E   ++ +  +   +D+ VG+V+ ALE      N+++++VSD
Sbjct:   236 DAYFDAQKMRE--AKAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSD 285

 Score = 57 (25.1 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 17/45 (37%), Positives = 23/45 (51%)

Query:   674 APCLFDIKNDPCEKNNLADRS-EDQRINHYTTE-VGRFNQIAYPD 716
             AP LFD++ DP E  +LA R+ ED  +     E   R   I  P+
Sbjct:   397 APQLFDLERDPQELTDLAPRAAEDPDMRALLAEGEHRLRAICNPE 441

 Score = 54 (24.1 bits), Expect = 5.5e-05, Sum P(4) = 5.5e-05
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   780 APCLFDIKNDPCEKNNLADRS 800
             AP LFD++ DP E  +LA R+
Sbjct:   397 APQLFDLERDPQELTDLAPRA 417

 Score = 39 (18.8 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 14/47 (29%), Positives = 21/47 (44%)

Query:   183 GYW---TGHQDYFDHSAEEMKMWGLDMR---RDLEPAWDLHGKYSTD 223
             G W   TGH D     A + + W  D+R   R++     LH + + D
Sbjct:    63 GDWIHRTGHWDSATPYAGQPRSWMHDLRDAGREVVSIGKLHFRATED 109


>TIGR_CMR|SPO_0800 [details] [associations]
            symbol:SPO_0800 "choline sulfatase, putative"
            species:246200 "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur
            compound metabolic process" evidence=ISS] [GO:0047753
            "choline-sulfatase activity" evidence=ISS] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
            GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
            GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
            ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
            PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
            Uniprot:Q5LVA2
        Length = 482

 Score = 102 (41.0 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 23/66 (34%), Positives = 39/66 (59%)

Query:    60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
             P+++ I++D+   + +G  G   + TPN+DALA  G + +  YT   +C P+R+A+ TG 
Sbjct:     5 PNLLVIVSDEHRKDAMGCAGHPIVKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGD 64

Query:   119 HPIHTG 124
                 TG
Sbjct:    65 WIHRTG 70

 Score = 63 (27.2 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 14/52 (26%), Positives = 29/52 (55%)

Query:   264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             D Y +  +  E   ++ +  +   +D+ VG+V+ ALE      N+++++VSD
Sbjct:   236 DAYFDAQKMRE--AKAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSD 285

 Score = 57 (25.1 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 17/45 (37%), Positives = 23/45 (51%)

Query:   674 APCLFDIKNDPCEKNNLADRS-EDQRINHYTTE-VGRFNQIAYPD 716
             AP LFD++ DP E  +LA R+ ED  +     E   R   I  P+
Sbjct:   397 APQLFDLERDPQELTDLAPRAAEDPDMRALLAEGEHRLRAICNPE 441

 Score = 54 (24.1 bits), Expect = 5.5e-05, Sum P(4) = 5.5e-05
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query:   780 APCLFDIKNDPCEKNNLADRS 800
             AP LFD++ DP E  +LA R+
Sbjct:   397 APQLFDLERDPQELTDLAPRA 417

 Score = 39 (18.8 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 14/47 (29%), Positives = 21/47 (44%)

Query:   183 GYW---TGHQDYFDHSAEEMKMWGLDMR---RDLEPAWDLHGKYSTD 223
             G W   TGH D     A + + W  D+R   R++     LH + + D
Sbjct:    63 GDWIHRTGHWDSATPYAGQPRSWMHDLRDAGREVVSIGKLHFRATED 109


>FB|FBgn0035445 [details] [associations]
            symbol:CG12014 species:7227 "Drosophila melanogaster"
            [GO:0004423 "iduronate-2-sulfatase activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=IEA] InterPro:IPR000917
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:AE014296
            Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
            PROSITE:PS00523 HSSP:P15289 KO:K01136 GO:GO:0004423
            GeneTree:ENSGT00640000091539 RefSeq:NP_647814.1 UniGene:Dm.15756
            ProteinModelPortal:Q9VZP8 STRING:Q9VZP8 PRIDE:Q9VZP8
            EnsemblMetazoa:FBtr0073077 GeneID:38423 KEGG:dme:Dmel_CG12014
            UCSC:CG12014-RA FlyBase:FBgn0035445 InParanoid:Q9VZP8 OMA:ERVIPAY
            OrthoDB:EOG45DV4P PhylomeDB:Q9VZP8 GenomeRNAi:38423 NextBio:808590
            ArrayExpress:Q9VZP8 Bgee:Q9VZP8 Uniprot:Q9VZP8
        Length = 512

 Score = 134 (52.2 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 42/127 (33%), Positives = 61/127 (48%)

Query:    42 LAFTLSM-VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
             L  +L M V +D  A    P+++ ++ DDL    +G +G     TP +D  A    I   
Sbjct:     6 LLLSLMMPVLLDAAAPPRRPNVVMVIFDDLR-PVIGAYGDTLASTPYLDNFARGSHIFTR 64

Query:   101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
              Y+ Q LC PSR++++TG+ P    +     Y     G   +   LPQY KE GY T   
Sbjct:    65 VYSQQSLCAPSRNSLLTGRRPDTLHLYDFYSYWRTFTG---NFTTLPQYFKEHGYYTYSC 121

Query:   160 GK-WHLG 165
             GK +H G
Sbjct:   122 GKVFHPG 128


>DICTYBASE|DDB_G0268506 [details] [associations]
            symbol:DDB_G0268506 "putative histone-like
            transcription factor" species:44689 "Dictyostelium discoideum"
            [GO:0046982 "protein heterodimerization activity" evidence=IEA]
            [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR003958
            InterPro:IPR009072 Pfam:PF00808 dictyBase:DDB_G0268506
            GO:GO:0005634 GO:GO:0043565 EMBL:AAFI02000003 Gene3D:1.10.20.10
            SUPFAM:SSF47113 eggNOG:COG5208 ProtClustDB:CLSZ2846877
            RefSeq:XP_647243.3 ProteinModelPortal:Q55GE1
            EnsemblProtists:DDB0304567 GeneID:8616048 KEGG:ddi:DDB_G0268506
            InParanoid:Q55GE1 OMA:DENEEDQ Uniprot:Q55GE1
        Length = 1120

 Score = 138 (53.6 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 34/169 (20%), Positives = 67/169 (39%)

Query:   405 LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN 464
             + ++ G H +N  + +N +  Y +    Y+    N   N  +  N    N   N  +  N
Sbjct:   856 INHQLGMHHHNPHQNQNQHPMYSHQFQNYSQVAFNNNNNNNNNNNNNNNNNNNNNNNNNN 915

Query:   465 GPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
                N N N    N  +  N     N+ N N ++ N +N+N+  N  +  +  +  + N  
Sbjct:   916 NNNNNNNNNNSNNSNNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNN--NSNNNNNS 973

Query:   525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
             +  N   +N  + +  +      +     N+ N N  + N +N+N+  N
Sbjct:   974 NNSNNNNNNNYNNYNGNNNNYNNYNSSSNNNSNNNNNNNNNNNNNNNNN 1022

 Score = 133 (51.9 bits), Expect = 0.00011, P = 0.00011
 Identities = 46/200 (23%), Positives = 75/200 (37%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N ++  N  N+   N      N+     N ++  N+    NSN    N  +  N    N 
Sbjct:   906 NNNNNNNNNNNNNNNNNNNNSNNSNNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNN 965

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNE--NTNPRYENGTHEYNIPRLENSINGNGTSE 498
               N  +  N    N   N  + YNG  N   N N    N ++  N     N+ N N  + 
Sbjct:   966 NSNNNNNSNNSNNNN-NNNYNNYNGNNNNYNNYNSSSNNNSNNNNNNNNNNNNNNNNNNN 1024

Query:   499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
             N +N+N+  N  +G + +  ++  +P   N +         I+            NS N 
Sbjct:  1025 NNNNNNNSNNNNNGNNNFENINPFQP--HNHMQSQYYYNQSINQYQNQNHNNNNNNSNNN 1082

Query:   559 NGTSENRSN--DNSYQNEID 576
             N  ++N +N     Y+NE D
Sbjct:  1083 NSNNQNSNNIYTRQYENEED 1102

 Score = 129 (50.5 bits), Expect = 0.00029, P = 0.00029
 Identities = 39/181 (21%), Positives = 70/181 (38%)

Query:   394 ENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYE 453
             +N  P Y +    Y       N+    N+N    N  +  N    N   N  +  N    
Sbjct:   871 QNQHPMYSHQFQNYSQVAFNNNNNN-NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNS 929

Query:   454 NRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ-NEIDG 512
             N   N ++  N   N N+N    N  +  N     N+ N N  S N +N+N+   N  +G
Sbjct:   930 NNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNNNSNNNNNSNNSNNNNNNNYNNYNG 989

Query:   513 IDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQ 572
              +  +  + N  S  N+  +N ++    +            N+ N N ++ N + +N+++
Sbjct:   990 NNN-NYNNYNSSSNNNSNNNNNNNNNNNNNNNNNN-----NNNNNNNNSNNNNNGNNNFE 1043

Query:   573 N 573
             N
Sbjct:  1044 N 1044


>DICTYBASE|DDB_G0278995 [details] [associations]
            symbol:DDB_G0278995 species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR015880
            SMART:SM00355 dictyBase:DDB_G0278995 EMBL:AAFI02000026
            GO:GO:0008270 GO:GO:0005622 RefSeq:XP_641895.1
            EnsemblProtists:DDB0215287 GeneID:8621821 KEGG:ddi:DDB_G0278995
            OMA:RRPERYQ Uniprot:Q54XF5
        Length = 1055

 Score = 143 (55.4 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
 Identities = 48/193 (24%), Positives = 75/193 (38%)

Query:   381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
             N+   P Y NS   N+     N++    N  +  N+     +N    N  +  N KY N 
Sbjct:   838 NQQSSPQYYNSLNMNV----NNNVNGNNNNNNNNNNNNNNINNNINNNNNNNVNSKYNNN 893

Query:   441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
               N  +  N    N   N  +  N   N N N    N  + YN     NS N N  + N 
Sbjct:   894 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN-SNSNNNYNN---NNSNNNNNNNNNN 949

Query:   501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
             +N+N+  N  +  +  +  + N P   N+   N+      S   R +      N+IN + 
Sbjct:   950 NNNNNNNNNNNNNNNNNNNNSNFPGN-NSNYCNLSVNNSTSPFNRPQTPPKPINNINISN 1008

Query:   561 TSENRSNDNSYQN 573
              + N SN+N+  N
Sbjct:  1009 NNNN-SNNNNINN 1020

 Score = 45 (20.9 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
 Identities = 17/51 (33%), Positives = 23/51 (45%)

Query:   245 YLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
             YL H   H  N +   E L   D Y N +    DF   +FA    +LD++V
Sbjct:   271 YLFHLKIHENNNHCLLENLLQNDGYSNQNN---DFFSGEFATESGQLDQTV 318


>DICTYBASE|DDB_G0271832 [details] [associations]
            symbol:DDB_G0271832 "Zinc finger CCHC
            domain-containing protein 7" species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR001878
            PROSITE:PS50158 SMART:SM00343 dictyBase:DDB_G0271832 GO:GO:0008270
            GO:GO:0003676 EMBL:AAFI02000006 Gene3D:4.10.60.10 SUPFAM:SSF57756
            RefSeq:XP_645525.1 ProteinModelPortal:Q55AJ7
            EnsemblProtists:DDB0216923 GeneID:8618154 KEGG:ddi:DDB_G0271832
            eggNOG:NOG260401 InParanoid:Q55AJ7 Uniprot:Q55AJ7
        Length = 772

 Score = 136 (52.9 bits), Expect = 3.2e-05, P = 3.2e-05
 Identities = 54/207 (26%), Positives = 84/207 (40%)

Query:   387 NYVNSTVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
             N  NST  NI  R++   N   RY N  + YN+    NS   Y N  H+ N  +  +Y++
Sbjct:   473 NRYNST--NINNRFDGKYNKNNRYNNNNNNYNN---NNSYNDYSNYNHKNNKDF-GKYQD 526

Query:   444 GTHEYNPKYENRY------------ENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
             G+ +YN   +N Y            +  +H  N   N N N    N  +  N    E+S 
Sbjct:   527 GSDDYNDDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNNKNNSNNSNKESSE 586

Query:   492 NGNGTSENRSNDNSYQN---EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
               N   + + N    Q    E D  D        +  K N I   +D + ++      K 
Sbjct:   587 EKNDKKKKKKNKKGNQEKEKEKDKKDKNHKTRERDREKDNDIDSMVDLD-KVKNNNNNKN 645

Query:   549 KLVKENSINGNGTSENRSNDNSYQNEI 575
             K   +N+ N N  + N +N+N+  N+I
Sbjct:   646 KNNNKNN-NNNNNNNNNNNNNNNNNKI 671

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 48/209 (22%), Positives = 88/209 (42%)

Query:   399 RYENSILRYENGTHEYNSPRIEN-------SNTRYENGTHEYNPKYE-NRYENGTHEYNP 450
             RY+ +  R+   ++ YNS  I N        N RY N  + YN     N Y N  H+ N 
Sbjct:   461 RYDRNE-RFNYNSNRYNSTNINNRFDGKYNKNNRYNNNNNNYNNNNSYNDYSNYNHKNNK 519

Query:   451 KYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEI 510
              +  +Y++G+ +YN   +++ N      ++     + + S N N  + N  N++S  N+ 
Sbjct:   520 DF-GKYQDGSDDYN---DDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNNKN 575

Query:   511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNS 570
             +  +     S  E S+        +D+ +     +G  +  KE     +   + R  D  
Sbjct:   576 NSNN-----SNKESSEEK------NDKKKKKKNKKGNQEKEKEKD-KKDKNHKTRERDRE 623

Query:   571 YQNEIDG-IDVWSVLSRNEPSKRNTILHN 598
               N+ID  +D+  V + N    +N   +N
Sbjct:   624 KDNDIDSMVDLDKVKNNNNNKNKNNNKNN 652

 Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
 Identities = 43/186 (23%), Positives = 88/186 (47%)

Query:   421 NSNTRYE-NGTHEYNPKYE--NRYENGTHEYNPKYENRYENGTHEYNGPKNE-NTNPRYE 476
             N+N RY+ N  ++ N +Y+  +RY+    +  PK +  + N   +YN   +  + N R+ 
Sbjct:   411 NNNNRYDRNDRYDRNDRYDRYDRYDRYDKDGFPK-DIDHSNNNGQYNQDYHRYDRNERFN 469

Query:   477 NGTHEYNIPRLENSINGNGTSENR--SNDNSYQNEIDGIDVWSVLSRNEPS----KRNTI 530
               ++ YN   + N  +G     NR  +N+N+Y N     D  +   +N       +  + 
Sbjct:   470 YNSNRYNSTNINNRFDGKYNKNNRYNNNNNNYNNNNSYNDYSNYNHKNNKDFGKYQDGSD 529

Query:   531 LHNIDDE--WQI--SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSR 586
              +N DD+  +++  S   + K +    N+ N N  ++N S++N+ +N  +  +  S   +
Sbjct:   530 DYNDDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNN-KNNSNNSNKESSEEK 588

Query:   587 NEPSKR 592
             N+  K+
Sbjct:   589 NDKKKK 594


>DICTYBASE|DDB_G0289337 [details] [associations]
            symbol:DDB_G0289337 species:44689 "Dictyostelium
            discoideum" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            dictyBase:DDB_G0289337 GO:GO:0000166 Gene3D:3.30.70.330
            GO:GO:0003676 EMBL:AAFI02000139 eggNOG:NOG145324 RefSeq:XP_636266.1
            ProteinModelPortal:Q54HN5 EnsemblProtists:DDB0188369 GeneID:8627085
            KEGG:ddi:DDB_G0289337 InParanoid:Q54HN5 OMA:MESINIS Uniprot:Q54HN5
        Length = 1528

 Score = 139 (54.0 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 48/193 (24%), Positives = 80/193 (41%)

Query:   371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGT 430
             D  PT+     K  I N  ++  EN   + EN     +N   + N    EN N       
Sbjct:  1045 DRSPTIKKNKEKEIIKNNHDNDNENE-NKNENE-KENDNQNEKENENENENKNKNENKNE 1102

Query:   431 HEYNPKYENRYENGTHEYNP-KYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLEN 489
             +E   + EN  EN     N  + EN+ EN   + N  +N+N N       +E    +  N
Sbjct:  1103 NEIKNENENENENENENENENENENKNENENEKENENENKNKNENVNENKNEQEEEKENN 1162

Query:   490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
             + N N  + N +N+N+  N  +       +S  + + +N +L  I+++ +   +      
Sbjct:  1163 NNNNNNNNNNNNNNNNNNNNNNNNRKQKNISEQKDNPKNELLIGIENKEKKIIVNSNLEN 1222

Query:   550 LVKENSINGNGTS 562
                ENSI GN +S
Sbjct:  1223 DQDENSIVGNLSS 1235

 Score = 127 (49.8 bits), Expect = 0.00068, P = 0.00068
 Identities = 61/228 (26%), Positives = 101/228 (44%)

Query:   381 NKSDIPNYVNSTVENI--IPRYENS-ILRYENGT--HEYNSPRIENSNTRYENGTHEYNP 435
             N ++  N  NST  N   +P  E+S  L   N T    + S  I+ S T  +N   E   
Sbjct:  1001 NNNNNNNNNNSTNLNQSKVPTNESSSTLTASNDTIIKNFRSFEIDRSPTIKKNKEKEI-- 1058

Query:   436 KYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
               +N ++N     N K EN  EN     N  +NEN N   EN     N  + EN I    
Sbjct:  1059 -IKNNHDNDNENEN-KNENEKENDNQ--NEKENENEN---ENKNKNEN--KNENEIKNEN 1109

Query:   496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
              +EN  N+N  +NE +  +     + NE  K N   +   +E  ++   + + +  KEN+
Sbjct:  1110 ENENE-NENENENENENENK----NENENEKENENENKNKNE-NVNE-NKNEQEEEKENN 1162

Query:   556 INGNGTSENRSNDNSYQNEIDGID-VWSVLSRNEPSKRNTILHNIDDE 602
              N N  + N +N+N+  N  +  +     +S  + + +N +L  I+++
Sbjct:  1163 NNNNNNNNNNNNNNNNNNNNNNNNRKQKNISEQKDNPKNELLIGIENK 1210


>DICTYBASE|DDB_G0272108 [details] [associations]
            symbol:DDB_G0272108 "RNA-binding region RNP-1
            domain-containing protein" species:44689 "Dictyostelium discoideum"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
            SMART:SM00360 dictyBase:DDB_G0272108 GO:GO:0000166
            Gene3D:3.30.70.330 GenomeReviews:CM000151_GR GO:GO:0003676
            EMBL:AAFI02000008 RefSeq:XP_645138.2 ProteinModelPortal:Q55A46
            EnsemblProtists:DDB0220129 GeneID:8618308 KEGG:ddi:DDB_G0272108
            InParanoid:Q55A46 Uniprot:Q55A46
        Length = 469

 Score = 126 (49.4 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
 Identities = 34/111 (30%), Positives = 50/111 (45%)

Query:   409 NGTHEYNSPRIENSNT-RYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
             NG       R  + NT  +E+G +E     +NR  N  + YN    N Y+NG    NG  
Sbjct:    97 NGQERDGIKRFRSDNTTNFEDGEYEEQVMNDNRNNNNNNNYNNS-NNNYKNGNENGNGNG 155

Query:   468 NENTNPRYENGTHE---YNIPRLENSING-NGTSENRSNDNSYQNEIDGID 514
             N N +P      H+   +N    EN+ N  N  + N +N+N+  N  +G D
Sbjct:   156 NGNGSPYGMVERHKPPPFNYENGENNDNKYNNNNNNNNNNNNNNNNNNGFD 206

 Score = 53 (23.7 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
 Identities = 17/52 (32%), Positives = 25/52 (48%)

Query:   554 NSINGNGTSE--NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
             N+IN N TS   N  N N+  N I      S+ S +E    N+ L  +D ++
Sbjct:   374 NNINNNNTSSSYNNYNSNNVNNSIQFSPTTSI-SNSETISSNSNLP-VDQDY 423

 Score = 41 (19.5 bits), Expect = 0.00061, Sum P(2) = 0.00061
 Identities = 10/33 (30%), Positives = 18/33 (54%)

Query:   564 NRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTI 595
             N +N+N+Y N I+  +  S  +  N  +  N+I
Sbjct:   365 NNNNNNNYNNNINNNNTSSSYNNYNSNNVNNSI 397


>UNIPROTKB|H7C3P4 [details] [associations]
            symbol:GNS "Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo
            disease IIID), isoform CRA_b" species:9606 "Homo sapiens"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
            "N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
            [GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
            InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
            InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
            PIRSF:PIRSF036666 EMBL:CH471054 GO:GO:0005764 Gene3D:3.40.720.10
            SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
            GO:GO:0030203 GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:AC025262
            UniGene:Hs.334534 HGNC:HGNC:4422 ChiTaRS:GNS SMR:H7C3P4
            Ensembl:ENST00000418919 Uniprot:H7C3P4
        Length = 496

 Score = 128 (50.1 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 53/182 (29%), Positives = 78/182 (42%)

Query:    82 QIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGG 137
             Q P     AL    G+   + Y    LC PSR++I+TGK+P +  + +N L G C  +  
Sbjct:     8 QTPLKKTKALIGEMGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSW 67

Query:   138 LPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDY 191
               + E    P  L+ + GY+T   GK     Y  EY  P   G E   LG  YW   +  
Sbjct:    68 QKIQEPNTFPAILRSMCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKN 122

Query:   192 FDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAAT 251
               +    + + G   +     + D    Y TDV    ++D +   S  EP F+ +A  A 
Sbjct:   123 SKYYNYTLSINGKARKHGENYSVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAP 178

Query:   252 HS 253
             HS
Sbjct:   179 HS 180

 Score = 51 (23.0 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
             R ++  +L  +D+ V K+V+ LE    L+N+ I + SD
Sbjct:   234 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 270


>DICTYBASE|DDB_G0284187 [details] [associations]
            symbol:Dd5P2 "inositol 5-phosphatase" species:44689
            "Dictyostelium discoideum" [GO:0046856 "phosphatidylinositol
            dephosphorylation" evidence=IDA] [GO:0046855 "inositol phosphate
            dephosphorylation" evidence=IDA] [GO:0034485
            "phosphatidylinositol-3,4,5-trisphosphate 5-phosphatase activity"
            evidence=IDA] [GO:0004445 "inositol-polyphosphate 5-phosphatase
            activity" evidence=IDA] [GO:0004439
            "phosphatidylinositol-4,5-bisphosphate 5-phosphatase activity"
            evidence=IDA] [GO:0046854 "phosphatidylinositol phosphorylation"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000300 SMART:SM00128 dictyBase:DDB_G0284187
            INTERPRO:IPR000408 Pfam:PF00415 GenomeReviews:CM000153_GR
            Gene3D:2.130.10.30 InterPro:IPR009091 SUPFAM:SSF50985
            PRINTS:PR00633 PROSITE:PS00626 PROSITE:PS50012 InterPro:IPR005135
            SUPFAM:SSF56219 EMBL:AAFI02000064 GO:GO:0046854 GO:GO:0046855
            GO:GO:0004439 GO:GO:0046856 GO:GO:0004445 eggNOG:COG5411
            GO:GO:0034485 RefSeq:XP_638694.1 ProteinModelPortal:Q54PV1
            EnsemblProtists:DDB0191414 GeneID:8624525 KEGG:ddi:DDB_G0284187
            InParanoid:Q54PV1 OMA:SHEKMER Uniprot:Q54PV1
        Length = 1800

 Score = 139 (54.0 bits), Expect = 4.2e-05, P = 4.2e-05
 Identities = 53/226 (23%), Positives = 92/226 (40%)

Query:   378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
             S+++ ++  N +   + +I PR   S    +N   E     +ENS     N  +  N   
Sbjct:  1273 SSSSNNNSTNNLGDYISSISPRAITSTTLTKNPKQEIER-ELENS-VNNSNNNNSINNNS 1330

Query:   438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
              N   N T+  N    N   N T+  N   N N N    N  +  N     N+ N N  +
Sbjct:  1331 NNNNNNNTNNNNNTNNN---NNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1387

Query:   498 ENRSNDNSYQN-EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
              N +N+NS +N + +   + S +  N    +N I+ NI +  Q++     K     E  +
Sbjct:  1388 NNNNNNNSDKNSDSEEASIGSGILGNIDDIQN-IIGNIKNGDQVNKNLNHKKSNSVEVVV 1446

Query:   557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
               +   EN SND  Y      +D ++  + N  +  N   +N +++
Sbjct:  1447 VEHHDEENCSNDIFYIEPFTIVDQYTNNNNNNNNNNNNNNNNNNND 1492

 Score = 129 (50.5 bits), Expect = 0.00050, P = 0.00050
 Identities = 51/212 (24%), Positives = 82/212 (38%)

Query:   375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
             T L+   K +I   + ++V N      NSI    N  +  N+    N+NT   N T+  N
Sbjct:  1299 TTLTKNPKQEIERELENSVNN--SNNNNSINNNSNNNNNNNTNN--NNNTNNNNNTNNNN 1354

Query:   435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-NGTHEY---------NI 484
                 N   N  +  N    N   N  +  N   N N N   + N   E          NI
Sbjct:  1355 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSDKNSDSEEASIGSGILGNI 1414

Query:   485 PRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 544
               ++N I GN  + ++ N N    + + ++V  V   +E +  N I + I+    +   T
Sbjct:  1415 DDIQNII-GNIKNGDQVNKNLNHKKSNSVEVVVVEHHDEENCSNDIFY-IEPFTIVDQYT 1472

Query:   545 RGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
                      N+ N N  + + +NDN+  N  D
Sbjct:  1473 NNNNNNNNNNNNNNNNNNNDNNNDNNNDNNND 1504

WARNING:  HSPs involving 73 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.135   0.416    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      905       822   0.00079  122 3  11 22  0.44    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  323
  No. of states in DFA:  629 (67 KB)
  Total size of DFA:  478 KB (2218 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  79.72u 0.10s 79.82t   Elapsed:  00:00:38
  Total cpu time:  79.82u 0.10s 79.92t   Elapsed:  00:00:38
  Start:  Thu Aug 15 12:35:45 2013   End:  Thu Aug 15 12:36:23 2013
WARNINGS ISSUED:  2

Back to top