Your job contains 1 sequence.
>psy1088
MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPP
HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHP
IHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFES
HLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE
PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALE
QRRMLSNSIIVFVSDNGGAAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPLLESRG
IVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIE
NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH
EYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI
SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID
DEWQISALTKGKWKLVKVVKVMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQ
CGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYPDKEEE
EEKKKKKKKKKKKKKKKKKKKKKKKKKKYSNEEEGMRKLRDAASIQCGPVKEVPCEPQIA
PCLFDIKNDPCEKNNLADRSEVQRINHYTTEVGYLDPKQRFNQIAYLDKEKKKKKKKKKK
KKKKKKKKKKKKMMKKGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIF
GDDLK
The BLAST search returned 5 gene products which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= psy1088
(905 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
FB|FBgn0033763 - symbol:CG8646 species:7227 "Drosophila m... 1091 5.1e-128 3
FB|FBgn0052191 - symbol:CG32191 species:7227 "Drosophila ... 796 4.3e-103 4
FB|FBgn0036768 - symbol:CG7402 species:7227 "Drosophila m... 880 9.1e-101 3
FB|FBgn0036765 - symbol:CG7408 species:7227 "Drosophila m... 768 8.5e-99 4
UNIPROTKB|P15848 - symbol:ARSB "Arylsulfatase B" species:... 825 2.8e-96 4
UNIPROTKB|A6QLZ3 - symbol:ARSB "Uncharacterized protein" ... 843 1.8e-94 3
UNIPROTKB|Q32KI4 - symbol:arsb "Arylsulfatase B" species:... 829 2.3e-92 3
RGD|2158 - symbol:Arsb "arylsulfatase B" species:10116 "R... 824 2.0e-91 3
MGI|MGI:88075 - symbol:Arsb "arylsulfatase B" species:100... 823 2.0e-91 3
UNIPROTKB|F1P099 - symbol:ARSB "Uncharacterized protein" ... 779 2.0e-90 4
UNIPROTKB|E1BKH3 - symbol:ARSJ "Uncharacterized protein" ... 764 1.1e-89 4
UNIPROTKB|Q5FYB0 - symbol:ARSJ "Arylsulfatase J" species:... 766 2.2e-89 4
UNIPROTKB|Q32KH6 - symbol:arsj "Uncharacterized protein" ... 761 3.6e-89 4
RGD|1310242 - symbol:Arsi "arylsulfatase family, member I... 743 2.5e-88 4
UNIPROTKB|E1BIN3 - symbol:ARSI "Uncharacterized protein" ... 744 8.3e-88 4
UNIPROTKB|Q5FYB1 - symbol:ARSI "Arylsulfatase I" species:... 738 1.7e-87 4
UNIPROTKB|Q32KH7 - symbol:ARSI "Arylsulfatase I" species:... 738 3.5e-87 4
UNIPROTKB|F1RL69 - symbol:LOC100517463 "Uncharacterized p... 723 1.3e-85 4
MGI|MGI:2443513 - symbol:Arsj "arylsulfatase J" species:1... 758 1.6e-85 3
RGD|1307640 - symbol:Arsj "arylsulfatase family, member J... 757 2.0e-85 3
UNIPROTKB|F1P095 - symbol:ARSB "Uncharacterized protein" ... 779 4.5e-84 2
UNIPROTKB|F1NQP9 - symbol:ARSI "Uncharacterized protein" ... 747 4.6e-84 3
MGI|MGI:2670959 - symbol:Arsi "arylsulfatase i" species:1... 743 9.5e-84 3
UNIPROTKB|F1NT29 - symbol:ARSB "Uncharacterized protein" ... 779 1.9e-83 2
UNIPROTKB|F1P098 - symbol:ARSB "Uncharacterized protein" ... 779 2.5e-79 2
UNIPROTKB|F6PKT4 - symbol:ARSJ "Uncharacterized protein" ... 577 1.1e-69 4
UNIPROTKB|F1S147 - symbol:ARSJ "Uncharacterized protein" ... 575 1.2e-69 4
UNIPROTKB|F1NH07 - symbol:ARSJ "Uncharacterized protein" ... 560 1.1e-66 3
WB|WBGene00006310 - symbol:sul-3 species:6239 "Caenorhabd... 544 3.1e-60 3
UNIPROTKB|P34059 - symbol:GALNS "N-acetylgalactosamine-6-... 440 2.1e-42 2
MGI|MGI:1355303 - symbol:Galns "galactosamine (N-acetyl)-... 433 2.0e-41 2
UNIPROTKB|F1S2F1 - symbol:F1S2F1 "Uncharacterized protein... 435 4.1e-40 1
UNIPROTKB|F1PHF0 - symbol:GALNS "N-acetylgalactosamine-6-... 428 2.4e-39 1
UNIPROTKB|Q32KH5 - symbol:GALNS "N-acetylgalactosamine-6-... 428 2.4e-39 1
UNIPROTKB|F1NW57 - symbol:GALNS "Uncharacterized protein"... 409 5.2e-39 2
RGD|1565391 - symbol:Galns "galactosamine (N-acetyl)-6-su... 423 8.2e-39 1
UNIPROTKB|Q8WNQ7 - symbol:GALNS "N-acetylgalactosamine-6-... 422 1.1e-38 1
UNIPROTKB|F1MU84 - symbol:GALNS "Uncharacterized protein"... 416 4.7e-38 1
ZFIN|ZDB-GENE-070112-1152 - symbol:galns "galactosamine (... 391 9.8e-37 2
UNIPROTKB|F1RL71 - symbol:F1RL71 "Uncharacterized protein... 205 3.5e-34 5
UNIPROTKB|Q08DD1 - symbol:ARSA "Arylsulfatase A" species:... 358 2.2e-33 3
UNIPROTKB|F1S6M1 - symbol:GALNS "N-acetylgalactosamine-6-... 363 2.5e-32 1
ZFIN|ZDB-GENE-050320-118 - symbol:arsa "arylsulfatase A" ... 345 9.9e-32 3
UNIPROTKB|Q32KK2 - symbol:Arsa "Arylsulfatase A" species:... 339 1.1e-30 3
RGD|1310381 - symbol:Arsa "arylsulfatase A" species:10116... 339 2.8e-30 3
UNIPROTKB|P15289 - symbol:ARSA "Arylsulfatase A" species:... 349 8.4e-30 2
UNIPROTKB|F6PKZ1 - symbol:ARSA "Uncharacterized protein" ... 344 2.2e-29 2
MGI|MGI:88077 - symbol:Arsa "arylsulfatase A" species:100... 335 6.4e-28 2
RGD|1304917 - symbol:Arse "arylsulfatase E (chondrodyspla... 263 2.2e-26 3
UNIPROTKB|F1Q1V3 - symbol:STS "Uncharacterized protein" s... 246 2.3e-26 4
UNIPROTKB|F1MFZ8 - symbol:STS "Uncharacterized protein" s... 252 4.1e-26 3
UNIPROTKB|F1Q1V2 - symbol:STS "Uncharacterized protein" s... 246 4.1e-26 4
UNIPROTKB|P08842 - symbol:STS "Steryl-sulfatase" species:... 248 5.8e-26 4
UNIPROTKB|P25549 - symbol:aslA "arylsulfatase" species:83... 319 1.6e-25 1
UNIPROTKB|F1NWF7 - symbol:ARSA "Uncharacterized protein" ... 308 1.6e-25 2
UNIPROTKB|F5H325 - symbol:GALNS "N-acetylgalactosamine-6-... 285 2.1e-25 3
UNIPROTKB|F1NGC8 - symbol:STS "Uncharacterized protein" s... 239 3.2e-25 3
UNIPROTKB|F6PN86 - symbol:ARSF "Uncharacterized protein" ... 236 4.3e-25 3
ZFIN|ZDB-GENE-030717-5 - symbol:sts "steroid sulfatase (m... 236 4.4e-25 3
UNIPROTKB|Q482D2 - symbol:CPS_2368 "Putative N-acetylgluc... 275 5.2e-25 3
TIGR_CMR|CPS_2368 - symbol:CPS_2368 "putative N-acetylglu... 275 5.2e-25 3
UNIPROTKB|I3LBW8 - symbol:STS "Uncharacterized protein" s... 242 7.1e-25 3
UNIPROTKB|K7GLQ3 - symbol:STS "Uncharacterized protein" s... 242 7.2e-25 3
RGD|3783 - symbol:Sts "steroid sulfatase (microsomal), is... 251 9.0e-25 3
TIGR_CMR|SPO_3286 - symbol:SPO_3286 "arylsulfatase" speci... 254 1.1e-24 4
UNIPROTKB|F1NFQ0 - symbol:ARSH "Uncharacterized protein" ... 259 1.8e-24 3
UNIPROTKB|G3N2T7 - symbol:ARSH "Uncharacterized protein" ... 229 4.0e-24 3
UNIPROTKB|F1NFQ1 - symbol:ARSH "Uncharacterized protein" ... 259 5.0e-24 3
UNIPROTKB|P54793 - symbol:ARSF "Arylsulfatase F" species:... 233 1.1e-23 3
UNIPROTKB|F5GYY5 - symbol:ARSE "Arylsulfatase E" species:... 260 1.9e-23 3
MGI|MGI:98438 - symbol:Sts "steroid sulfatase" species:10... 243 3.1e-23 3
TIGR_CMR|CPS_2364 - symbol:CPS_2364 "sulfatase family pro... 295 3.3e-23 2
UNIPROTKB|P51690 - symbol:ARSE "Arylsulfatase E" species:... 254 6.9e-23 3
UNIPROTKB|Q32KH8 - symbol:ARSH "Arylsulfatase H" species:... 232 1.1e-22 3
POMBASE|SPBPB10D8.02c - symbol:SPBPB10D8.02c "arylsulfata... 233 1.6e-22 3
TIGR_CMR|CPS_0660 - symbol:CPS_0660 "sulfatase family pro... 293 1.6e-22 2
RGD|1306571 - symbol:Arsg "arylsulfatase G" species:10116... 264 1.8e-22 3
UNIPROTKB|Q5FYA8 - symbol:ARSH "Arylsulfatase H" species:... 230 1.8e-22 3
UNIPROTKB|F1PY85 - symbol:ARSH "Arylsulfatase H" species:... 232 2.2e-22 3
UNIPROTKB|Q32KI1 - symbol:arse "Uncharacterized protein" ... 244 3.5e-22 3
MGI|MGI:1921258 - symbol:Arsg "arylsulfatase G" species:1... 257 3.8e-22 2
UNIPROTKB|G5E629 - symbol:ARSE "Uncharacterized protein" ... 243 9.0e-22 3
UNIPROTKB|E1BU03 - symbol:ARSG "Uncharacterized protein" ... 283 1.2e-21 1
UNIPROTKB|Q96EG1 - symbol:ARSG "Arylsulfatase G" species:... 252 2.3e-21 2
UNIPROTKB|F1PYB4 - symbol:ARSD "Uncharacterized protein" ... 231 3.5e-21 3
ZFIN|ZDB-GENE-081104-120 - symbol:arsh "arylsulfatase H" ... 238 6.8e-21 3
UNIPROTKB|P51689 - symbol:ARSD "Arylsulfatase D" species:... 230 7.1e-21 3
CGD|CAL0006319 - symbol:orf19.1608 species:5476 "Candida ... 238 1.0e-20 4
UNIPROTKB|E1BYN0 - symbol:ARSD "Uncharacterized protein" ... 243 1.1e-20 3
TIGR_CMR|CPS_2985 - symbol:CPS_2985 "sulfatase family pro... 281 1.2e-20 2
UNIPROTKB|C9J5G7 - symbol:ARSE "Arylsulfatase E" species:... 254 1.3e-20 1
ZFIN|ZDB-GENE-060503-154 - symbol:arsg "arylsulfatase G" ... 274 2.4e-20 3
TIGR_CMR|CPS_2983 - symbol:CPS_2983 "putative arylsulfata... 270 3.9e-20 1
UNIPROTKB|F1N665 - symbol:ARSG "Uncharacterized protein" ... 249 4.4e-20 1
UNIPROTKB|F1NFL4 - symbol:F1NFL4 "Uncharacterized protein... 258 1.1e-19 1
UNIPROTKB|Q32KH9 - symbol:ARSG "Arylsulfatase G" species:... 266 1.2e-19 1
TIGR_CMR|CPS_2984 - symbol:CPS_2984 "sulfatase family pro... 250 2.0e-19 2
UNIPROTKB|P77318 - symbol:ydeN "putative sulfatase" speci... 257 2.8e-19 2
UNIPROTKB|F1RV22 - symbol:ARSG "Uncharacterized protein" ... 260 5.0e-19 1
UNIPROTKB|F1PYB3 - symbol:ARSE "Uncharacterized protein" ... 239 5.2e-19 1
WARNING: Descriptions of 223 database sequences were not reported due to the
limiting value of parameter V = 100.
>FB|FBgn0033763 [details] [associations]
symbol:CG8646 species:7227 "Drosophila melanogaster"
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:AE013599 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
KO:K01135 GO:GO:0003943 GeneTree:ENSGT00560000077076 EMBL:AY071072
RefSeq:NP_610807.3 UniGene:Dm.6132 HSSP:P15848 SMR:Q8SZ72
STRING:Q8SZ72 EnsemblMetazoa:FBtr0301237 GeneID:36394
KEGG:dme:Dmel_CG8646 FlyBase:FBgn0033763 InParanoid:Q8SZ72
OMA:FRGSAQI OrthoDB:EOG4W6MBG GenomeRNAi:36394 NextBio:798315
Uniprot:Q8SZ72
Length = 562
Score = 1091 (389.1 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
Identities = 202/326 (61%), Positives = 243/326 (74%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
S P+IIFILADDLG+NDVGFHG +IPTPNIDALAYSGIIL YY +CTPSRSA+M
Sbjct: 22 SPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALM 81
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+PIHTGMQH VLY E GLPL EKILPQYL ELGY + I GKWHLG +K +YTP +
Sbjct: 82 TGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLY 141
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
RGF SH+G+W+GHQDY DH+A E WGLDMR + A+DLHG Y+TDV T +V +I N
Sbjct: 142 RGFSSHVGFWSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIAN 201
Query: 236 HS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGK 294
H+ T PLFLY+AHAA HS+NPY PL PD+ + HI ++KR KFAA++ K+D SVG+
Sbjct: 202 HNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQ 261
Query: 295 VVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSP 354
+V+ L + ML NSII+F SD SN+PL+GVKNTLWEGGVR AGL+WSP
Sbjct: 262 IVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSP 321
Query: 355 LLESRGIVAEQYVHVSDWLPTLLSAA 380
LL+ V+ Q +H+ DWLPTLL AA
Sbjct: 322 LLKKSQRVSNQTMHIIDWLPTLLEAA 347
Score = 137 (53.3 bits), Expect = 3.2e-20, Sum P(3) = 3.2e-20
Identities = 51/184 (27%), Positives = 83/184 (45%)
Query: 383 SDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY--ENG--THEYNPKYE 438
S IPNY ++ + +NS+ + + + N +ENS + +NG +N +
Sbjct: 238 SHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNM--LENSIIIFSSDNGGPAQGFNLNFA 295
Query: 439 NRYE-NGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYN-IPRLENSINGNG 495
+ Y G N +E ++ P + + R N T H + +P L + G
Sbjct: 296 SNYPLKGVK--NTLWEGGVRAAGLMWS-PLLKKSQ-RVSNQTMHIIDWLPTLLEAAGGQP 351
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
N S +IDG +W L +++ S R +LHNIDD W +AL+ G WKLVK +
Sbjct: 352 ALSNLSK------QIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTN 405
Query: 556 INGN 559
G+
Sbjct: 406 YRGS 409
Score = 122 (48.0 bits), Expect = 3.2e-20, Sum P(3) = 3.2e-20
Identities = 24/34 (70%), Positives = 25/34 (73%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRI 34
MQH VLY E GLPL EKILPQYL ELGY + I
Sbjct: 91 MQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHI 124
Score = 112 (44.5 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
Identities = 20/45 (44%), Positives = 29/45 (64%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGD 902
YP+V++ + EL N TAV P NKP D DP+ +++ W+ FGD
Sbjct: 499 YPEVVNALMTELERFNATAVPPSNKPADPRADPRFWNYTWTNFGD 543
Score = 112 (44.5 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
Identities = 27/69 (39%), Positives = 40/69 (57%)
Query: 651 RKLRDAASIQC-GPVKE-VPCEPQI--APCLFDIKNDPCEKNNLADRSEDQRINHYTTEV 706
+++R AA++ C G + C APCLF I++DPCE+ NLA + + +N TE+
Sbjct: 452 QRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLA-KQYPEVVNALMTEL 510
Query: 707 GRFNQIAYP 715
RFN A P
Sbjct: 511 ERFNATAVP 519
Score = 90 (36.7 bits), Expect = 1.0e-125, Sum P(3) = 1.0e-125
Identities = 28/74 (37%), Positives = 41/74 (55%)
Query: 757 RKLRDAASIQC-GPVKE-VPCEPQI--APCLFDIKNDPCEKNNLADR-SEVQRINHYTTE 811
+++R AA++ C G + C APCLF I++DPCE+ NLA + EV +N TE
Sbjct: 452 QRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEV--VNALMTE 509
Query: 812 VGYLDPKQRFNQIA 825
+ +RFN A
Sbjct: 510 L------ERFNATA 517
Score = 89 (36.4 bits), Expect = 5.1e-128, Sum P(3) = 5.1e-128
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 569 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 609
++ +IDG +W L +++ S R +LHNIDD W +AL+
Sbjct: 354 SNLSKQIDGQSIWRALVQDKASPRLNVLHNIDDIWGSAALS 394
Score = 37 (18.1 bits), Expect = 3.0e-112, Sum P(2) = 3.0e-112
Identities = 12/41 (29%), Positives = 18/41 (43%)
Query: 362 VAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
+A+QY V + L T L N + +P PR+ N
Sbjct: 495 LAKQYPEVVNALMTELERFNATAVPPSNKPADPRADPRFWN 535
>FB|FBgn0052191 [details] [associations]
symbol:CG32191 species:7227 "Drosophila melanogaster"
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0003943 HSSP:P08842
RefSeq:NP_730304.2 UniGene:Dm.15184 ProteinModelPortal:Q8IQS4
SMR:Q8IQS4 MINT:MINT-943884 PRIDE:Q8IQS4 GeneID:317903
KEGG:dme:Dmel_CG32191 UCSC:CG32191-RA FlyBase:FBgn0052191
InParanoid:Q8IQS4 OrthoDB:EOG43FFBZ PhylomeDB:Q8IQS4
GenomeRNAi:317903 NextBio:844132 Bgee:Q8IQS4 Uniprot:Q8IQS4
Length = 554
Score = 796 (285.3 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
Identities = 162/345 (46%), Positives = 219/345 (63%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L L V D A++ P+II I+ADD+G++DV F G + TPNIDALAY G +L
Sbjct: 9 LLLCLQRVKSDESAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRL 68
Query: 102 YTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
Y +CTPSR A+++G++PIHTG QH V+ E L L+ ++P+ KE GY T +VGK
Sbjct: 69 YAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGK 128
Query: 162 WHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKM----WGLDMRRDLEPAWDLH 217
WHLGF + EYTPT RGF+ H GYW + DYF ++ M + G D RR++E
Sbjct: 129 WHLGFSRPEYTPTRRGFDYHFGYWGAYIDYFQRRSK-MPVANYSLGYDFRRNMELECRDR 187
Query: 218 GKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDF 276
G Y TD+ TAEA +I +H+ E PLFL L+H A H+AN +PLQAP+ + +I+D
Sbjct: 188 GVYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDP 247
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV 336
R K+AA++ KLD+SVG+++ AL L NSI++F SD SN+PLRG
Sbjct: 248 NRRKYAAMISKLDQSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQ 307
Query: 337 KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
KNT WEGGVR AG IWS L++RG + Q ++V+DWLPTL AA+
Sbjct: 308 KNTPWEGGVRVAGAIWSSGLQARGSIFRQPLYVADWLPTLSRAAD 352
Score = 117 (46.2 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
Identities = 33/92 (35%), Positives = 51/92 (55%)
Query: 509 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
++DGID+W LS + P ILH +DD W++SAL G+WK V NGT+ +
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSALQMGQWKYV-------NGTTASGR 412
Query: 567 NDN--SYQNEIDGIDV----WSVLSRNEPSKR 592
D+ +Y+ E+D +D ++V RN + R
Sbjct: 413 YDSVLTYR-ELDDLDPRDSRYAVTVRNSATSR 443
Score = 105 (42.0 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
Identities = 25/76 (32%), Positives = 38/76 (50%)
Query: 623 RYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKN 682
RY V + LS R + R A+++CG ++ C P + CL+DI +
Sbjct: 431 RYAVTVRNSATSRALSRYDLRRLTQQRISLTRRLAAVRCGDLQR-SCNPLLEECLYDILS 489
Query: 683 DPCEKNNL--ADRSED 696
DPCE+NNL ++R D
Sbjct: 490 DPCEQNNLVYSERHSD 505
Score = 100 (40.3 bits), Expect = 1.4e-102, Sum P(4) = 1.4e-102
Identities = 17/37 (45%), Positives = 26/37 (70%)
Query: 760 RDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
R A+++CG ++ C P + CL+DI +DPCE+NNL
Sbjct: 462 RRLAAVRCGDLQR-SCNPLLEECLYDILSDPCEQNNL 497
Score = 82 (33.9 bits), Expect = 2.0e-99, Sum P(4) = 2.0e-99
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 574 EIDGIDVWSVLS--RNEPSKRNTILHNIDDEWQISAL 608
++DGID+W LS + P ILH +DD W++SAL
Sbjct: 360 KLDGIDLWPELSGSADAPHVPREILHILDDVWRLSAL 396
Score = 58 (25.5 bits), Expect = 1.5e-11, Sum P(4) = 1.5e-11
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 2 QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
QH V+ E L L+ ++P+ KE GY T ++
Sbjct: 93 QHFVISNEEPWALTLNATLMPEIFKEAGYSTNLV 126
Score = 47 (21.6 bits), Expect = 4.3e-103, Sum P(4) = 4.3e-103
Identities = 11/43 (25%), Positives = 19/43 (44%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIF 900
+ DVL+ + + + + +A P N+ DP AW F
Sbjct: 503 HSDVLTALRRRVQELRASASRPGNRASMPEADPTLHTCAWESF 545
>FB|FBgn0036768 [details] [associations]
symbol:CG7402 species:7227 "Drosophila melanogaster"
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:AE014296 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 KO:K01135 GO:GO:0003943
GeneTree:ENSGT00560000077076 HSSP:P15848 RefSeq:NP_649023.1
UniGene:Dm.13635 ProteinModelPortal:Q9VVM4 STRING:Q9VVM4
PRIDE:Q9VVM4 EnsemblMetazoa:FBtr0075143 GeneID:39994
KEGG:dme:Dmel_CG7402 UCSC:CG7402-RA FlyBase:FBgn0036768
InParanoid:Q9VVM4 OMA:LYWAGPG PhylomeDB:Q9VVM4 GenomeRNAi:39994
NextBio:816457 ArrayExpress:Q9VVM4 Bgee:Q9VVM4 Uniprot:Q9VVM4
Length = 579
Score = 880 (314.8 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
Identities = 180/412 (43%), Positives = 252/412 (61%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S P+I+ IL DD+G NDV FHG +QI TPNIDALAY+GI+L +Y LCTPSR+ ++T
Sbjct: 25 STKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLT 84
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
GK+PIHTGMQH V+ E GLP E+++P+ ++ GY T +VGKWHLGF++K+ TPT R
Sbjct: 85 GKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMR 144
Query: 177 GFESHLGYWTGHQDYFDHSAEEMKM---WGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
GF+ H GY+ G+ DY+DH + GLD RRDLEP + +G Y+T+ FT+EA II
Sbjct: 145 GFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRII 204
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
H +PLF+ L+H A H+ N P+QAP+ + HI D KR +A ++ LD+SV
Sbjct: 205 EQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVA 264
Query: 294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
+ + AL+ ML+NSII+ SD SN+P RG K + WEGG+R AG +WS
Sbjct: 265 QTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWS 324
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANKS---DIP-NYVN---STVENIIPRYENSILR 406
PLL+ RG V+ Q +H DWLPTL AA S D+P + +N N P+ +++
Sbjct: 325 PLLKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPK-PRTMIH 383
Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY-ENGTHEYNPKYENRYE 457
+ Y+S +Y NG+ + +Y+ E T+E +P E+ YE
Sbjct: 384 VLDEVFGYSS--YMRDTLKYVNGS-SFKGRYDQWLGELETNEDDPLGES-YE 431
Score = 103 (41.3 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
Identities = 22/58 (37%), Positives = 34/58 (58%)
Query: 753 EEGMRKLRDAASIQCGPVK-EVP------CEPQIAPCLFDIKNDPCEKNNLADRSEVQ 803
++ +R++R A+ C P++ + P CEP APC FD+ DPCE+ NLA +Q
Sbjct: 450 KDRIRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ 507
Score = 103 (41.3 bits), Expect = 6.0e-97, Sum P(2) = 6.0e-97
Identities = 25/73 (34%), Positives = 38/73 (52%)
Query: 650 MRKLRDAASIQCGPVK-EVP------CEPQIAPCLFDIKNDPCEKNNLADRSEDQRINHY 702
+R++R A+ C P++ + P CEP APC FD+ DPCE+ NLA Q +
Sbjct: 453 IRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDPCERYNLAQMYPLQ-LQQL 511
Query: 703 TTEVGRFNQIAYP 715
E+ + + A P
Sbjct: 512 ADELEQIRKTAIP 524
Score = 102 (41.0 bits), Expect = 2.5e-11, Sum P(4) = 2.5e-11
Identities = 28/90 (31%), Positives = 44/90 (48%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN-----GTSEN 564
+DGI++W +LS NE K T++H +D+ + S+ R K V +S G G E
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELET 420
Query: 565 RSND---NSYQNEIDGIDVWSVLSRNEPSK 591
+D SY+ + DV S+L +K
Sbjct: 421 NEDDPLGESYEQHVLASDVQSLLGNRGLTK 450
Score = 73 (30.8 bits), Expect = 1.8e-08, Sum P(4) = 1.8e-08
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA 607
+DGI++W +LS NE K T++H +D+ + S+
Sbjct: 361 LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSS 393
Score = 72 (30.4 bits), Expect = 2.5e-11, Sum P(4) = 2.5e-11
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
MQH V+ E GLP E+++P+ ++ GY T ++
Sbjct: 93 MQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLV 127
Score = 50 (22.7 bits), Expect = 9.1e-101, Sum P(3) = 9.1e-101
Identities = 13/41 (31%), Positives = 18/41 (43%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPF-DKGGDPKNFDHAW 897
YP L Q+ EL I +TA+ P D +P + W
Sbjct: 504 YPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHNGNW 544
>FB|FBgn0036765 [details] [associations]
symbol:CG7408 species:7227 "Drosophila melanogaster"
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=ISS] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0042742 "defense response to bacterium" evidence=IMP]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:AE014296 GO:GO:0042742 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0003943
GeneTree:ENSGT00560000077076 HSSP:P15289 FlyBase:FBgn0036765
RefSeq:NP_001163462.1 RefSeq:NP_001163463.1 RefSeq:NP_649020.1
UniGene:Dm.13634 EnsemblMetazoa:FBtr0075142
EnsemblMetazoa:FBtr0300281 EnsemblMetazoa:FBtr0300282 GeneID:39991
KEGG:dme:Dmel_CG7408 UCSC:CG7408-RB InParanoid:Q9VVM1 OMA:TRENERD
GenomeRNAi:39991 NextBio:816442 Uniprot:Q9VVM1
Length = 585
Score = 768 (275.4 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
Identities = 154/339 (45%), Positives = 216/339 (63%)
Query: 53 LVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRS 112
+VA+S P+II I+ADDLG++DV F G + TPNIDALAYSG+IL N Y +CTPSR+
Sbjct: 28 IVATSDKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRA 87
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
A++TGK+PI+TGMQH V+ + GLPL+E + + +E GYRT ++GKWHLG ++ +T
Sbjct: 88 ALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFT 147
Query: 173 PTFRGFESHLGYWTGHQDYFDHSAEEMKMW--GLDMRRDLEPAWDLHGKYSTDVFTAEAV 230
PT RGF+ HLGY + DY+ S E+ G D R L+ D G Y TD+ T AV
Sbjct: 148 PTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAV 207
Query: 231 DIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
I +H ++ +PLFL L H A H+AN +P+QAP ++ +I + +AA++ +
Sbjct: 208 KEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSR 267
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRG 347
LD+SVG V++AL ++ ML NSII+F+SD SN+PLRG KN+ WEG +R
Sbjct: 268 LDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRS 327
Query: 348 AGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ IWS E G V +Q +++ D LPTL +AA S P
Sbjct: 328 SAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 366
Score = 114 (45.2 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
Identities = 23/46 (50%), Positives = 30/46 (65%)
Query: 753 EEGMRKLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNL 796
E + +LRD + I+C P V PC P PCLFDI+ DPCE++NL
Sbjct: 461 ERNISELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNL 506
Score = 111 (44.1 bits), Expect = 1.8e-98, Sum P(4) = 1.8e-98
Identities = 26/67 (38%), Positives = 39/67 (58%)
Query: 652 KLRDAASIQC-GPVKEV-PCEPQIAPCLFDIKNDPCEKNNLADRSEDQRIN-HYTTEVGR 708
+LRD + I+C P V PC P PCLFDI+ DPCE++NL ++ I + + +
Sbjct: 466 ELRDQSRIECPDPATGVKPCLPLEGPCLFDIEADPCERSNLYAEYQNSTIFLDLWSRIQQ 525
Query: 709 FNQIAYP 715
F + A+P
Sbjct: 526 FAKQAHP 532
Score = 90 (36.7 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
Identities = 28/94 (29%), Positives = 43/94 (45%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISAL--TRGKWKLVKENSING--NG----- 560
+DG+++WS L S I+H ID++ L TRGKWK++ + G +G
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDEDVAEPHLSYTRGKWKVISGTTNQGLYDGWLGHR 429
Query: 561 -TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
TSE Y+ + VW L + +RN
Sbjct: 430 ETSEVDPRAVEYEELVRNTSVWLQLQQVSFGERN 463
Score = 73 (30.8 bits), Expect = 1.3e-11, Sum P(4) = 1.3e-11
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
MQH V+ + GLPL+E + + +E GYRT ++
Sbjct: 100 MQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLL 134
Score = 57 (25.1 bits), Expect = 2.4e-95, Sum P(4) = 2.4e-95
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDE 602
+DG+++WS L S I+H ID++
Sbjct: 370 LDGLNLWSALKYGYESVEREIVHVIDED 397
Score = 52 (23.4 bits), Expect = 8.5e-99, Sum P(4) = 8.5e-99
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 874 RTAVAPINKPFDKGGDPKNFDHAWSIFGDD 903
+ A P NKP D DP+ + + W+ + D+
Sbjct: 528 KQAHPPNNKPGDPNCDPRFYHNEWTWWQDE 557
>UNIPROTKB|P15848 [details] [associations]
symbol:ARSB "Arylsulfatase B" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IEA] [GO:0005739 "mitochondrion"
evidence=IEA] [GO:0005791 "rough endoplasmic reticulum"
evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
[GO:0006914 "autophagy" evidence=IEA] [GO:0007417 "central nervous
system development" evidence=IEA] [GO:0007584 "response to
nutrient" evidence=IEA] [GO:0009268 "response to pH" evidence=IEA]
[GO:0043627 "response to estrogen stimulus" evidence=IEA]
[GO:0051597 "response to methylmercury" evidence=IEA] [GO:0005764
"lysosome" evidence=TAS] [GO:0007041 "lysosomal transport"
evidence=TAS] [GO:0007040 "lysosome organization" evidence=TAS]
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
evidence=TAS] [GO:0030204 "chondroitin sulfate metabolic process"
evidence=TAS] [GO:0030207 "chondroitin sulfate catabolic process"
evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
[GO:0043687 "post-translational protein modification" evidence=TAS]
[GO:0044267 "cellular protein metabolic process" evidence=TAS]
[GO:0044281 "small molecule metabolic process" evidence=TAS]
Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005739
GO:GO:0005794 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
GO:GO:0005791 GO:GO:0006914 GO:GO:0006644 GO:GO:0007584
GO:GO:0007417 GO:GO:0007040 GO:GO:0009268 GO:GO:0005788
EMBL:CH471084 GO:GO:0043627 GO:GO:0043687 GO:GO:0043202
GO:GO:0007041 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
MIM:272200 GO:GO:0004065 GO:GO:0006687 EMBL:J05225 EMBL:M32373
EMBL:X72735 EMBL:X72736 EMBL:X72737 EMBL:X72738 EMBL:X72739
EMBL:X72740 EMBL:X72741 EMBL:X72742 EMBL:AK314903 EMBL:AC020937
EMBL:AC025755 EMBL:AC099485 EMBL:AC114963 EMBL:BC029051 EMBL:S57777
IPI:IPI00306576 IPI:IPI00413690 PIR:S35990 RefSeq:NP_000037.2
RefSeq:NP_942002.1 UniGene:Hs.149103 UniGene:Hs.604199 PDB:1FSU
PDBsum:1FSU ProteinModelPortal:P15848 SMR:P15848 IntAct:P15848
STRING:P15848 PhosphoSite:P15848 DMDM:114223 PaxDb:P15848
PRIDE:P15848 Ensembl:ENST00000264914 Ensembl:ENST00000396151
Ensembl:ENST00000565165 GeneID:411 KEGG:hsa:411 UCSC:uc003kfq.3
CTD:411 GeneCards:GC05M078108 HGNC:HGNC:714 HPA:HPA037770
HPA:HPA037771 MIM:253200 MIM:611542 neXtProt:NX_P15848
Orphanet:276212 Orphanet:276223 PharmGKB:PA25006
HOGENOM:HOG000135354 HOVERGEN:HBG004282 InParanoid:P15848 KO:K01135
OMA:WLFDIDR OrthoDB:EOG4DV5M0 PhylomeDB:P15848
BioCyc:MetaCyc:HS03665-MONOMER BRENDA:3.1.6.12 ChEMBL:CHEMBL2399
EvolutionaryTrace:P15848 GenomeRNAi:411 NextBio:1737
ArrayExpress:P15848 Bgee:P15848 CleanEx:HS_ARSB
Genevestigator:P15848 GermOnline:ENSG00000113273 GO:GO:0003943
GO:GO:0030207 Uniprot:P15848
Length = 533
Score = 825 (295.5 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
Identities = 162/355 (45%), Positives = 225/355 (63%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVA-SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R++ VLPL L + A +S PPH++F+LADDLGWNDVGFHG +I TP++DAL
Sbjct: 17 RLLLPVVLPLLLLLLLAPPGSGAGASRPPHLVFLLADDLGWNDVGFHG-SRIRTPHLDAL 75
Query: 92 AYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
A G++L NYYT LCTPSRS ++TG++ I TG+QH +++ C+ +PL EK+LPQ LKE
Sbjct: 76 AAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKE 135
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLD 205
GY T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A + LD
Sbjct: 136 AGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALD 195
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
R E A YST++FT A+ +I NH ++PLFLYLA + H EPLQ P+
Sbjct: 196 FRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEE 250
Query: 266 YLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXX 325
YL + I+D R +A ++ +DE+VG V AL+ + +N++ +F +D
Sbjct: 251 YLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGG- 309
Query: 326 XXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+NWPLRG K +LWEGGVRG G + SPLL+ +G+ + +H+SDWLPTL+ A
Sbjct: 310 ---NNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA 361
Score = 97 (39.2 bits), Expect = 4.6e-06, Sum P(4) = 4.6e-06
Identities = 16/35 (45%), Positives = 25/35 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMV 143
Score = 84 (34.6 bits), Expect = 1.1e-92, Sum P(3) = 1.1e-92
Identities = 18/47 (38%), Positives = 24/47 (51%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW-QISALTRGKWKLVKENS 555
+DG DVW +S PS R +LHNID + S R K++S
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDS 417
Score = 82 (33.9 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 371 LDGFDVWKTISEGSPSPRIELLHNID 396
Score = 47 (21.6 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 770 VKEVPCE--PQIAPCLFDIKNDPCEKNNLA 797
V E+P P LFDI DP E+++L+
Sbjct: 459 VSEIPSSDPPTKTLWLFDIDRDPEERHDLS 488
Score = 47 (21.6 bits), Expect = 1.1e-92, Sum P(3) = 1.1e-92
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 664 VKEVPCE--PQIAPCLFDIKNDPCEKNNLA 691
V E+P P LFDI DP E+++L+
Sbjct: 459 VSEIPSSDPPTKTLWLFDIDRDPEERHDLS 488
Score = 46 (21.3 bits), Expect = 2.8e-96, Sum P(4) = 2.8e-96
Identities = 6/24 (25%), Positives = 14/24 (58%)
Query: 680 IKNDPCEKNNLADRSEDQRINHYT 703
+ + PC +N++A +D + Y+
Sbjct: 400 VDSSPCPRNSMAPAKDDSSLPEYS 423
Score = 38 (18.4 bits), Expect = 1.6e-91, Sum P(3) = 1.6e-91
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 786 IKNDPCEKNNLA---DRSEVQRINHYTTEV 812
+ + PC +N++A D S + + + T V
Sbjct: 400 VDSSPCPRNSMAPAKDDSSLPEYSAFNTSV 429
Score = 38 (18.4 bits), Expect = 2.8e-84, Sum P(2) = 2.8e-84
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
E R ++ +Y H+ L + L +K +P Y
Sbjct: 482 EERHDLSREYPHIVTKLLSRLQFYHKHSVPVY 513
>UNIPROTKB|A6QLZ3 [details] [associations]
symbol:ARSB "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282 KO:K01135
OMA:WLFDIDR OrthoDB:EOG4DV5M0 GeneTree:ENSGT00560000077076
EMBL:DAAA02027809 EMBL:DAAA02027810 EMBL:DAAA02027811
EMBL:DAAA02027812 EMBL:DAAA02027813 EMBL:DAAA02027814
EMBL:DAAA02027815 EMBL:DAAA02027816 EMBL:DAAA02027817 EMBL:BC148139
IPI:IPI00710068 RefSeq:NP_001094645.1 UniGene:Bt.35850 SMR:A6QLZ3
STRING:A6QLZ3 Ensembl:ENSBTAT00000010988 GeneID:538401
KEGG:bta:538401 InParanoid:A6QLZ3 NextBio:20877344 Uniprot:A6QLZ3
Length = 533
Score = 843 (301.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 166/356 (46%), Positives = 224/356 (62%)
Query: 38 AVLPLAFTLSMVFVDLVAS----SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
A+LPL L ++ + + S S PPH++F+LADDLGWNDVGFHG I TP +DALA
Sbjct: 19 AILPLGLLLLLLLLPPLGSGAGASRPPHLVFVLADDLGWNDVGFHG-SAIRTPRLDALAA 77
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G++L NYYT LCTPSRS ++TG++ IHTG+QH ++ C+ +PL EK+LPQ LKE G
Sbjct: 78 GGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAG 137
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMR 207
Y T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A + LD R
Sbjct: 138 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFR 197
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
E A YST+VFT A +I NH ++PLFLYLA + H EPLQ P+ YL
Sbjct: 198 DGEEVATGYKNMYSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYL 252
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
+ I+D R +A + +DE+VG V ALE+R + +N++ +F +D
Sbjct: 253 KPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERRGLWNNTVFIFSTDNGGQTLAGG--- 309
Query: 328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+NWPLRG K +LWEGGVRG G + SPLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 310 -NNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGS 364
Score = 95 (38.5 bits), Expect = 3.1e-05, Sum P(3) = 3.1e-05
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 109 LQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMV 143
Score = 88 (36.0 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGK-WKLVKENS 555
+DG DVW+ +S PS R +LHNID + +A G L K+ S
Sbjct: 371 LDGFDVWNTISEGSPSPRMELLHNIDPNFVDTAPCPGNSMALAKDES 417
Score = 84 (34.6 bits), Expect = 4.8e-94, Sum P(3) = 4.8e-94
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW+ +S PS R +LHNID
Sbjct: 371 LDGFDVWNTISEGSPSPRMELLHNID 396
Score = 42 (19.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 677 LFDIKNDPCEKNNLA 691
LFDI DP E+++L+
Sbjct: 474 LFDIDQDPEERHDLS 488
Score = 42 (19.8 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 783 LFDIKNDPCEKNNLA 797
LFDI DP E+++L+
Sbjct: 474 LFDIDQDPEERHDLS 488
Score = 37 (18.1 bits), Expect = 4.4e-86, Sum P(2) = 4.4e-86
Identities = 9/32 (28%), Positives = 15/32 (46%)
Query: 357 ESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
E R ++ +Y H+ L + L K +P Y
Sbjct: 482 EERHDLSREYPHIVKKLLSRLQFYQKHSVPVY 513
>UNIPROTKB|Q32KI4 [details] [associations]
symbol:arsb "Arylsulfatase B" species:9615 "Canis lupus
familiaris" [GO:0004065 "arylsulfatase activity" evidence=IEA]
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
KO:K01135 OMA:WLFDIDR OrthoDB:EOG4DV5M0 GO:GO:0003943
GeneTree:ENSGT00560000077076 EMBL:AAEX03002118 EMBL:AAEX03002119
EMBL:BN000753 RefSeq:NP_001041598.1 UniGene:Cfa.39080 SMR:Q32KI4
STRING:Q32KI4 Ensembl:ENSCAFT00000014585 GeneID:610364
KEGG:cfa:610364 InParanoid:Q32KI4 NextBio:20895924 Uniprot:Q32KI4
Length = 535
Score = 829 (296.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
Identities = 158/334 (47%), Positives = 218/334 (65%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++GPPH++F+LADDLGW+DVGFHG +I TP++DALA +G++L NYYT LCTPSRS ++
Sbjct: 43 AAGPPHLVFVLADDLGWHDVGFHG-SRIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLL 101
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TG++ IHTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT
Sbjct: 102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 176 RGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEA 229
RGF+++ GY G +DY+ H A + LD R E A YST++FT A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERA 221
Query: 230 VDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLD 289
+I NH ++PLFLYLA + H EPLQ P+ YL + I D R +A ++ +D
Sbjct: 222 TALISNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMD 276
Query: 290 ESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAG 349
E+VG V AL+ + +N++ VF +D +NWPLRG K +LWEGGVRG G
Sbjct: 277 EAVGNVTAALKSHGLWNNTVFVFSTDNGGQTLAGG----NNWPLRGRKWSLWEGGVRGVG 332
Query: 350 LIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+ SPLL+ +G+ + + VH+SDWLPTL+ A S
Sbjct: 333 FVASPLLKRKGVKSRELVHISDWLPTLVGLAGGS 366
Score = 97 (39.2 bits), Expect = 7.4e-05, Sum P(3) = 7.4e-05
Identities = 16/35 (45%), Positives = 25/35 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH +++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 111 LQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMV 145
Score = 82 (33.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 373 LDGFDVWRTISEGSPSPRMELLHNID 398
Score = 82 (33.9 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 373 LDGFDVWRTISEGSPSPRMELLHNID 398
Score = 42 (19.8 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 677 LFDIKNDPCEKNNLA 691
LFDI DP E+++L+
Sbjct: 476 LFDIDQDPEERHDLS 490
Score = 42 (19.8 bits), Expect = 2.3e-92, Sum P(3) = 2.3e-92
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 783 LFDIKNDPCEKNNLA 797
LFDI DP E+++L+
Sbjct: 476 LFDIDQDPEERHDLS 490
>RGD|2158 [details] [associations]
symbol:Arsb "arylsulfatase B" species:10116 "Rattus norvegicus"
[GO:0003943 "N-acetylgalactosamine-4-sulfatase activity"
evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO;TAS]
[GO:0005739 "mitochondrion" evidence=IDA] [GO:0005764 "lysosome"
evidence=IDA] [GO:0005791 "rough endoplasmic reticulum" evidence=IDA]
[GO:0005794 "Golgi apparatus" evidence=IDA] [GO:0006914 "autophagy"
evidence=IDA] [GO:0007417 "central nervous system development"
evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
[GO:0008152 "metabolic process" evidence=ISO] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IDA] [GO:0009268 "response to pH"
evidence=IDA] [GO:0043627 "response to estrogen stimulus"
evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0051597 "response to methylmercury" evidence=IDA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
RGD:2158 GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764 GO:GO:0009268
GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0051597
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0004065 CTD:411 HOGENOM:HOG000135354 HOVERGEN:HBG004282
KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
GeneTree:ENSGT00560000077076 EMBL:AABR03012149 EMBL:AABR03015281
EMBL:AABR03016930 EMBL:AABR03021723 EMBL:D49434 EMBL:BN000736
IPI:IPI00198405 PIR:I54210 RefSeq:NP_254278.1 UniGene:Rn.94004
ProteinModelPortal:P50430 SMR:P50430 IntAct:P50430 STRING:P50430
PRIDE:P50430 Ensembl:ENSRNOT00000014860 GeneID:25227 KEGG:rno:25227
UCSC:RGD:2158 InParanoid:P50430 OMA:ALMTARY NextBio:605779
ArrayExpress:P50430 Genevestigator:P50430
GermOnline:ENSRNOG00000011150 Uniprot:P50430
Length = 528
Score = 824 (295.1 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 158/350 (45%), Positives = 227/350 (64%)
Query: 40 LPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
LPL L + ++ PPH++F+LADDLGWND+GFHG I TP++DALA G++L
Sbjct: 20 LPLLLLLLLWPARASDAAPPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAAGGVVLD 78
Query: 100 NYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
NYY LCTPSRS ++TG++ IH G+QH ++ C+ +PL EK+LPQ LK+ GY T +V
Sbjct: 79 NYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMV 138
Query: 160 GKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMK--MWGLDMRRDLEPA 213
GKWHLG Y+KE PT RGF+++ GY G +DY+ H A E + LD+R EPA
Sbjct: 139 GKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPA 198
Query: 214 WDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
+ YST++FT A +I NH ++PLFLYLA + H +PLQ P+ Y+ + I
Sbjct: 199 KEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFI 253
Query: 274 EDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPL 333
+D R +A ++ LDE+VG V +AL+ R + +N++++F +D +NWPL
Sbjct: 254 QDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQTRSGG----NNWPL 309
Query: 334 RGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
RG K TLWEGG+RGAG + SPLL+ +G+ + + +H++DWLPTL++ A S
Sbjct: 310 RGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGS 359
Score = 73 (30.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
+DG DVW +S PS R +L NID ++
Sbjct: 366 LDGFDVWETISEGSPSPRVELLLNIDPDF 394
Score = 73 (30.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
+DG DVW +S PS R +L NID ++
Sbjct: 366 LDGFDVWETISEGSPSPRVELLLNIDPDF 394
Score = 47 (21.6 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 13/54 (24%), Positives = 27/54 (50%)
Query: 664 VKEVPC--EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
+ EVP P LFDI DP E+++++ R + + + + +++ + P
Sbjct: 454 ISEVPSVDSPTKTLWLFDINRDPEERHDVS-REHPHIVQNLLSRLQYYHEHSVP 506
Score = 45 (20.9 bits), Expect = 3.3e-91, Sum P(3) = 3.3e-91
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 770 VKEVPC--EPQIAPCLFDIKNDPCEKNNLA 797
+ EVP P LFDI DP E+++++
Sbjct: 454 ISEVPSVDSPTKTLWLFDINRDPEERHDVS 483
>MGI|MGI:88075 [details] [associations]
symbol:Arsb "arylsulfatase B" species:10090 "Mus musculus"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0003943
"N-acetylgalactosamine-4-sulfatase activity" evidence=IEA]
[GO:0004065 "arylsulfatase activity" evidence=IDA] [GO:0005739
"mitochondrion" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
[GO:0005791 "rough endoplasmic reticulum" evidence=ISO] [GO:0005794
"Golgi apparatus" evidence=ISO] [GO:0006914 "autophagy"
evidence=ISO] [GO:0007417 "central nervous system development"
evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
[GO:0008152 "metabolic process" evidence=IDA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
pH" evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0043627 "response to estrogen stimulus" evidence=ISO]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0051597 "response
to methylmercury" evidence=ISO] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88075
GO:GO:0005739 GO:GO:0005794 GO:GO:0046872 GO:GO:0005791
GO:GO:0006914 GO:GO:0007584 GO:GO:0007417 GO:GO:0005764
GO:GO:0009268 GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0051597 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 CTD:411 HOGENOM:HOG000135354
HOVERGEN:HBG004282 KO:K01135 OrthoDB:EOG4DV5M0 GO:GO:0003943
EMBL:AK083309 EMBL:AK154098 EMBL:AK158312 EMBL:AC131739
EMBL:AC136976 EMBL:M82877 EMBL:X92096 EMBL:BN000746 IPI:IPI00406459
IPI:IPI00652358 RefSeq:NP_033842.3 UniGene:Mm.300178
UniGene:Mm.472255 ProteinModelPortal:P50429 SMR:P50429
STRING:P50429 PhosphoSite:P50429 PaxDb:P50429 PRIDE:P50429
DNASU:11881 Ensembl:ENSMUST00000091403 GeneID:11881 KEGG:mmu:11881
UCSC:uc007rlo.1 UCSC:uc011zcv.1 GeneTree:ENSGT00560000077076
InParanoid:P50429 SABIO-RK:P50429 NextBio:279911 Bgee:P50429
CleanEx:MM_ARSB Genevestigator:P50429 GermOnline:ENSMUSG00000042093
Uniprot:P50429
Length = 534
Score = 823 (294.8 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 159/356 (44%), Positives = 229/356 (64%)
Query: 40 LPLAFTLSMVFVDLVA---SSG---PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY 93
LPL L + + L++ +SG PPH++F+LADDLGWND+GFHG I TP++DALA
Sbjct: 20 LPLLLLLLQLLLLLLSPARASGATQPPHVVFVLADDLGWNDLGFHG-SVIRTPHLDALAA 78
Query: 94 SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G++L NYY LCTPSRS ++TG++ IH G+QH ++ C+ +PL EK+LPQ LKE G
Sbjct: 79 GGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAG 138
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMK--MWGLDMR 207
Y T +VGKWHLG Y+KE PT RGF+++ GY G +DY+ H A E + LD+R
Sbjct: 139 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLR 198
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
EPA + + YST++FT A +I NH ++PLFLYLA + H +PLQ P+ Y+
Sbjct: 199 DGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 253
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
+ I+D R +A ++ +DE+VG V +AL+ + +N++ +F +D
Sbjct: 254 EPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQTRSGG--- 310
Query: 328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+NWPLRG K TLWEGG+RG G + SPLL+ +G+ + + +H++DWLPTL+ A S
Sbjct: 311 -NNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGS 365
Score = 90 (36.7 bits), Expect = 0.00079, Sum P(3) = 0.00079
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH ++ C+ +PL EK+LPQ LKE GY T ++
Sbjct: 110 LQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMV 144
Score = 77 (32.2 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 538
+DG ++W +S PS R +LHNID ++
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400
Score = 77 (32.2 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
+DG ++W +S PS R +LHNID ++
Sbjct: 372 LDGFNMWKTISEGHPSPRVELLHNIDQDF 400
Score = 44 (20.5 bits), Expect = 2.0e-91, Sum P(3) = 2.0e-91
Identities = 13/54 (24%), Positives = 27/54 (50%)
Query: 664 VKEVPC--EPQIAPCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
V E+P P LFDI DP E+++++ R + + + + +++ + P
Sbjct: 460 VSEIPPVGPPTKTLWLFDINQDPEERHDVS-REHPHIVQNLLSRLQYYHEHSVP 512
Score = 42 (19.8 bits), Expect = 3.3e-91, Sum P(3) = 3.3e-91
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 770 VKEVPC--EPQIAPCLFDIKNDPCEKNNLA 797
V E+P P LFDI DP E+++++
Sbjct: 460 VSEIPPVGPPTKTLWLFDINQDPEERHDVS 489
>UNIPROTKB|F1P099 [details] [associations]
symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0004065 "arylsulfatase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00149 GO:GO:0004065 OMA:WLFDIDR
GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
EMBL:AADN02046150 IPI:IPI00822500 Ensembl:ENSGALT00000038612
ArrayExpress:F1P099 Uniprot:F1P099
Length = 527
Score = 779 (279.3 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
Identities = 149/332 (44%), Positives = 213/332 (64%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A+ PPH++ +LADDLGW DVG+HG I TP +DAL G+ LK Y T LCTPSR +
Sbjct: 34 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 91
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G + IHTG+QH +++ C+ LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 92 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 151
Query: 175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A+ + LD R E A YST++FT
Sbjct: 152 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 211
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A+D+I NH T++PLFLYLA + H EPL+ Y+ + I+D KR ++A ++ +
Sbjct: 212 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 266
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
DE+VG + +AL++ + +N+++VF +D +NWPLRG K TLWEGGVRG
Sbjct: 267 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 322
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 323 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 354
Score = 92 (37.4 bits), Expect = 0.00013, Sum P(4) = 0.00013
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH +++ C+ LPL EK+LP+ LK+ GY T ++
Sbjct: 102 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 136
Score = 82 (33.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 364 LDGFDVWKTISEGRPSPRVELLHNID 389
Score = 82 (33.9 bits), Expect = 2.0e-87, Sum P(3) = 2.0e-87
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 364 LDGFDVWKTISEGRPSPRVELLHNID 389
Score = 45 (20.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 677 LFDIKNDPCEKNNLADR 693
LFDI +DP EK L+++
Sbjct: 468 LFDIVHDPEEKYELSEK 484
Score = 45 (20.9 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 783 LFDIKNDPCEKNNLADR 799
LFDI +DP EK L+++
Sbjct: 468 LFDIVHDPEEKYELSEK 484
Score = 38 (18.4 bits), Expect = 2.0e-90, Sum P(4) = 2.0e-90
Identities = 6/11 (54%), Positives = 9/11 (81%)
Query: 541 SALTRGKWKLV 551
+A+ GKWKL+
Sbjct: 425 AAIRHGKWKLL 435
>UNIPROTKB|E1BKH3 [details] [associations]
symbol:ARSJ "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
EMBL:DAAA02016458 EMBL:DAAA02016459 EMBL:DAAA02016460
IPI:IPI00825946 RefSeq:XP_002688145.1 RefSeq:XP_611819.3
UniGene:Bt.87496 ProteinModelPortal:E1BKH3
Ensembl:ENSBTAT00000023672 GeneID:540514 KEGG:bta:540514
NextBio:20878676 Uniprot:E1BKH3
Length = 599
Score = 764 (274.0 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
Identities = 150/329 (45%), Positives = 205/329 (62%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 71 VTALSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 129
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 130 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 189
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 190 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 249
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 250 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 304
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SD SNWPLRG K T WEGG+R G +
Sbjct: 305 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAIGFV 360
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 361 HSPLLKNKGTVCKELVHITDWYPTLISLA 389
Score = 75 (31.5 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG DVW +S S R ILHNID + + G W
Sbjct: 398 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 435
Score = 73 (30.8 bits), Expect = 1.8e-89, Sum P(4) = 1.8e-89
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG DVW +S S R ILHNID
Sbjct: 398 QLDGYDVWETISEGLRSPRVDILHNID 424
Score = 59 (25.8 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ Q+ + L+ N+TAV P D +P+ W
Sbjct: 513 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 552
Score = 39 (18.8 bits), Expect = 1.1e-89, Sum P(4) = 1.1e-89
Identities = 8/17 (47%), Positives = 13/17 (76%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L++R
Sbjct: 496 LFNITADPYERVDLSNR 512
>UNIPROTKB|Q5FYB0 [details] [associations]
symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
"extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0044281 "small molecule metabolic process"
evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
HOVERGEN:HBG004282 KO:K12375 EMBL:AY875938 EMBL:AM049401
EMBL:AY358647 EMBL:AC104779 EMBL:BC089445 EMBL:BC132879
EMBL:BC132881 EMBL:BC144265 IPI:IPI00413865 RefSeq:NP_078866.3
UniGene:Hs.22895 UniGene:Hs.700496 UniGene:Hs.712042
ProteinModelPortal:Q5FYB0 SMR:Q5FYB0 STRING:Q5FYB0
PhosphoSite:Q5FYB0 DMDM:74722580 PRIDE:Q5FYB0
Ensembl:ENST00000315366 Ensembl:ENST00000541197 GeneID:79642
KEGG:hsa:79642 UCSC:uc003ibq.1 CTD:79642 GeneCards:GC04M114821
HGNC:HGNC:26286 HPA:HPA036482 MIM:610010 neXtProt:NX_Q5FYB0
PharmGKB:PA143485310 InParanoid:Q5FYB0 OMA:AAGYGIW
OrthoDB:EOG45HRX5 ChiTaRS:ARSJ GenomeRNAi:79642 NextBio:68769
ArrayExpress:Q5FYB0 Bgee:Q5FYB0 CleanEx:HS_ARSJ
Genevestigator:Q5FYB0 Uniprot:Q5FYB0
Length = 599
Score = 766 (274.7 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
Identities = 150/327 (45%), Positives = 206/327 (62%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 72 STSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 130
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT
Sbjct: 131 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTR 190
Query: 176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
RGF++ G G DY+ H + M G D+ + AWD +G YST ++T I+
Sbjct: 191 RGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQIL 250
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 251 ASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIN 305
Query: 294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
V AL+ +NSII++ SD SNWPLRG K T WEGG+R G + S
Sbjct: 306 NVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVHS 361
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
PLL+++G V ++ VH++DW PTL+S A
Sbjct: 362 PLLKNKGTVCKELVHITDWYPTLISLA 388
Score = 74 (31.1 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG D+W +S S R ILHNID + + G W
Sbjct: 397 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 434
Score = 72 (30.4 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG D+W +S S R ILHNID
Sbjct: 397 QLDGYDIWETISEGLRSPRVDILHNID 423
Score = 55 (24.4 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
Identities = 11/40 (27%), Positives = 20/40 (50%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ ++ + L+ N+TAV P D +P+ W
Sbjct: 512 YPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 551
Score = 39 (18.8 bits), Expect = 2.2e-89, Sum P(4) = 2.2e-89
Identities = 8/17 (47%), Positives = 13/17 (76%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L++R
Sbjct: 495 LFNITADPYERVDLSNR 511
>UNIPROTKB|Q32KH6 [details] [associations]
symbol:arsj "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 HOGENOM:HOG000135354 HOVERGEN:HBG004282
GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642 OMA:AAGYGIW
OrthoDB:EOG45HRX5 EMBL:AAEX03016834 EMBL:BN000761
RefSeq:NP_001041581.1 UniGene:Cfa.28600 SMR:Q32KH6
Ensembl:ENSCAFT00000048607 GeneID:487909 KEGG:cfa:487909
InParanoid:Q32KH6 NextBio:20861390 Uniprot:Q32KH6
Length = 598
Score = 761 (272.9 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
Identities = 149/327 (45%), Positives = 205/327 (62%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
++ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 70 ATSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 128
Query: 116 TGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT
Sbjct: 129 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTK 188
Query: 176 RGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDII 233
RGF++ G G DY+ H + M G D+ + AWD +G YST ++T I+
Sbjct: 189 RGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQIL 248
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 249 ASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIN 303
Query: 294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
V AL+ +NSII++ SD SNWPLRG K T WEGG+R G + S
Sbjct: 304 NVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVHS 359
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA 380
PLL+++G V ++ VH++DW PTL+S A
Sbjct: 360 PLLKNKGTVCKELVHITDWYPTLISLA 386
Score = 75 (31.5 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG DVW +S S R ILHNID + + G W
Sbjct: 395 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432
Score = 73 (30.8 bits), Expect = 5.9e-89, Sum P(4) = 5.9e-89
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG DVW +S S R ILHNID
Sbjct: 395 QLDGYDVWETISEGLRSPRVDILHNID 421
Score = 59 (25.8 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ Q+ + L+ N+TAV P D +P+ W
Sbjct: 510 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549
Score = 37 (18.1 bits), Expect = 3.6e-89, Sum P(4) = 3.6e-89
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L+ R
Sbjct: 493 LFNITADPYERVDLSHR 509
>RGD|1310242 [details] [associations]
symbol:Arsi "arylsulfatase family, member I" species:10116
"Rattus norvegicus" [GO:0005576 "extracellular region"
evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1310242
GO:GO:0005783 GO:GO:0005576 GO:GO:0046872 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE
EMBL:AABR03109797 EMBL:BN000739 IPI:IPI00367540
RefSeq:NP_001041346.1 UniGene:Rn.202490 ProteinModelPortal:Q32KJ8
SMR:Q32KJ8 STRING:Q32KJ8 PhosphoSite:Q32KJ8
Ensembl:ENSRNOT00000030966 GeneID:307404 KEGG:rno:307404
UCSC:RGD:1310242 InParanoid:Q32KJ8 NextBio:657343
Genevestigator:Q32KJ8 Uniprot:Q32KJ8
Length = 573
Score = 743 (266.6 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
Identities = 147/323 (45%), Positives = 206/323 (63%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 105 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ + VH++DW PTL+ A
Sbjct: 336 KKRRTSRALVHITDWYPTLVGLA 358
Score = 77 (32.2 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 77 (32.2 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 57 (25.1 bits), Expect = 2.2e-84, Sum P(3) = 2.2e-84
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LAD+ D + + +N+ A P
Sbjct: 464 LFNISADPYEREDLADQRPDV-VRTLLARLADYNRTAIP 501
Score = 54 (24.1 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
Identities = 15/46 (32%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + LA+ NRTA+ P+ P +F+ AW + D
Sbjct: 482 PDVVRTLLARLADYNRTAI-PVRYPAANPRAHPDFNGGAWGPWASD 526
Score = 50 (22.7 bits), Expect = 2.5e-88, Sum P(4) = 2.5e-88
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 783 LFDIKNDPCEKNNLAD-RSEVQR 804
LF+I DP E+ +LAD R +V R
Sbjct: 464 LFNISADPYEREDLADQRPDVVR 486
>UNIPROTKB|E1BIN3 [details] [associations]
symbol:ARSI "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:DAAA02020627
IPI:IPI00695273 Ensembl:ENSBTAT00000017050 Uniprot:E1BIN3
Length = 572
Score = 744 (267.0 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
Identities = 147/323 (45%), Positives = 207/323 (64%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 47 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 105
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+ELGY T +VGKWHLGFY+KE PT RGF
Sbjct: 106 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGF 165
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 166 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHS 225
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
+PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 226 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 280
Query: 298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 281 ALKRHGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ + VH++DW PTL++ A
Sbjct: 337 RKRRTSRALVHITDWYPTLVALA 359
Score = 77 (32.2 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 369 LDGYDVWPAISEGRASPRTEILHNID 394
Score = 77 (32.2 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 369 LDGYDVWPAISEGRASPRTEILHNID 394
Score = 54 (24.1 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + L + NRTA+ P+ P + +F+ AW + D
Sbjct: 483 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 527
Score = 47 (21.6 bits), Expect = 2.0e-83, Sum P(3) = 2.0e-83
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 465 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 502
Score = 44 (20.5 bits), Expect = 8.3e-88, Sum P(4) = 8.3e-88
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 465 LFNISADPYEREDLAGQRPDVVR 487
>UNIPROTKB|Q5FYB1 [details] [associations]
symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
"extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0044281 "small molecule metabolic process"
evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 HOGENOM:HOG000135354
HOVERGEN:HBG004282 CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6
EMBL:AY875937 EMBL:AB448735 EMBL:AK122641 EMBL:BC129995
EMBL:BC129996 IPI:IPI00257076 IPI:IPI00915442 RefSeq:NP_001012301.1
UniGene:Hs.591252 ProteinModelPortal:Q5FYB1 SMR:Q5FYB1
STRING:Q5FYB1 PhosphoSite:Q5FYB1 DMDM:74722581 PRIDE:Q5FYB1
Ensembl:ENST00000328668 Ensembl:ENST00000515301 GeneID:340075
KEGG:hsa:340075 UCSC:uc003lrv.2 GeneCards:GC05M149657
HGNC:HGNC:32521 HPA:HPA038386 MIM:610009 neXtProt:NX_Q5FYB1
PharmGKB:PA143485309 InParanoid:Q5FYB1 OMA:YHGSDIE
GenomeRNAi:340075 NextBio:97681 ArrayExpress:Q5FYB1 Bgee:Q5FYB1
CleanEx:HS_ARSI Genevestigator:Q5FYB1 GermOnline:ENSG00000183876
Uniprot:Q5FYB1
Length = 569
Score = 738 (264.8 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
Identities = 146/323 (45%), Positives = 205/323 (63%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 46 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGR 104
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 105 YQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 164
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I+ +HS
Sbjct: 165 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ + +H++DW PTL+ A
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLA 358
Score = 77 (32.2 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 77 (32.2 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 57 (25.1 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
Identities = 15/46 (32%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + LA NRTA+ P+ P + +F+ AW + D
Sbjct: 482 PDVVRTLLARLAEYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 526
Score = 52 (23.4 bits), Expect = 2.5e-83, Sum P(3) = 2.5e-83
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 464 LFNISADPYEREDLAGQRPDV-VRTLLARLAEYNRTAIP 501
Score = 44 (20.5 bits), Expect = 1.7e-87, Sum P(4) = 1.7e-87
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 464 LFNISADPYEREDLAGQRPDVVR 486
>UNIPROTKB|Q32KH7 [details] [associations]
symbol:ARSI "Arylsulfatase I" species:9615 "Canis lupus
familiaris" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
[GO:0005576 "extracellular region" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005576
GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
HOGENOM:HOG000135354 HOVERGEN:HBG004282
GeneTree:ENSGT00560000077076 HSSP:P15289 EMBL:AAEX02012119
EMBL:BN000760 RefSeq:NP_001041583.1 UniGene:Cfa.39081
ProteinModelPortal:Q32KH7 Ensembl:ENSCAFT00000028793 GeneID:489186
KEGG:cfa:489186 CTD:340075 InParanoid:Q32KH7 KO:K12375
OrthoDB:EOG4DFPN6 NextBio:20862393 Uniprot:Q32KH7
Length = 573
Score = 738 (264.8 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
Identities = 146/323 (45%), Positives = 204/323 (63%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGK 118
PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++TG+
Sbjct: 47 PPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGR 105
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE PT RGF
Sbjct: 106 YQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGF 165
Query: 179 ESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHS 237
++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+ +HS
Sbjct: 166 DTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225
Query: 238 TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVE 297
PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280
Query: 298 ALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLE 357
AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G + SPLL+
Sbjct: 281 ALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336
Query: 358 SRGIVAEQYVHVSDWLPTLLSAA 380
+ + VH++DW PTL+ A
Sbjct: 337 RKRRTSRALVHITDWYPTLVGLA 359
Score = 77 (32.2 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 369 LDGYDVWPAISEGRASPRTEILHNID 394
Score = 77 (32.2 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 369 LDGYDVWPAISEGRASPRTEILHNID 394
Score = 54 (24.1 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + L + NRTA+ P+ P + +F+ AW + D
Sbjct: 483 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 527
Score = 47 (21.6 bits), Expect = 8.3e-83, Sum P(3) = 8.3e-83
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 465 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 502
Score = 44 (20.5 bits), Expect = 3.5e-87, Sum P(4) = 3.5e-87
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 465 LFNISADPYEREDLAGQRPDVVR 487
>UNIPROTKB|F1RL69 [details] [associations]
symbol:LOC100517463 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:FP102406
Ensembl:ENSSSCT00000015795 Uniprot:F1RL69
Length = 596
Score = 723 (259.6 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
Identities = 145/325 (44%), Positives = 203/325 (62%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++T
Sbjct: 69 SQQPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 127
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
G++ IHTG+QH+++ + LPL + LPQ L++LGY T +VGKWHLGFY+KE PT R
Sbjct: 128 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRR 187
Query: 177 GFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
GF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ I+
Sbjct: 188 GFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAG 247
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
HS PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V +
Sbjct: 248 HSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNI 302
Query: 296 VEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G + SPL
Sbjct: 303 TGALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVHSPL 358
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+ + +H++DW PTL+ A
Sbjct: 359 LKRTRRTSRALLHITDWYPTLVGLA 383
Score = 77 (32.2 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 393 LDGYDVWPAISEGRASPRTEILHNID 418
Score = 77 (32.2 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 393 LDGYDVWPAISEGRASPRTEILHNID 418
Score = 54 (24.1 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + L + NRTA+ P+ P + +F+ AW + D
Sbjct: 507 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 551
Score = 47 (21.6 bits), Expect = 3.1e-81, Sum P(3) = 3.1e-81
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 489 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 526
Score = 44 (20.5 bits), Expect = 1.3e-85, Sum P(4) = 1.3e-85
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 489 LFNISADPYEREDLAGQRPDVVR 511
>MGI|MGI:2443513 [details] [associations]
symbol:Arsj "arylsulfatase J" species:10090 "Mus musculus"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
"cellular_component" evidence=ND] [GO:0005576 "extracellular
region" evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2443513 GO:GO:0005576
GO:GO:0046872 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:AK034454 EMBL:AK046410
EMBL:AK052931 IPI:IPI00986759 RefSeq:NP_775627.1 UniGene:Mm.317021
ProteinModelPortal:Q8BM89 SMR:Q8BM89 STRING:Q8BM89
PhosphoSite:Q8BM89 PRIDE:Q8BM89 Ensembl:ENSMUST00000093976
GeneID:271970 KEGG:mmu:271970 InParanoid:Q8BM89 NextBio:393532
Bgee:Q8BM89 CleanEx:MM_ARSJ Genevestigator:Q8BM89
GermOnline:ENSMUSG00000046561 Uniprot:Q8BM89
Length = 598
Score = 758 (271.9 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
Identities = 149/328 (45%), Positives = 205/328 (62%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 69 AGTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ PT
Sbjct: 128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
RGF++ G G DY+ H + + G D+ + AWD +G YST ++T I
Sbjct: 188 KRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQI 247
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 248 LATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 302
Query: 293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
V AL++ +NSII++ SD SNWPLRG K T WEGG+R G +
Sbjct: 303 HNVTLALKRYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 358
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 359 SPLLKNKGTVCKELVHITDWYPTLISLA 386
Score = 74 (31.1 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG D+W +S S R ILHNID + + G W
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432
Score = 72 (30.4 bits), Expect = 2.5e-85, Sum P(3) = 2.5e-85
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG D+W +S S R ILHNID
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNID 421
Score = 56 (24.8 bits), Expect = 1.6e-85, Sum P(3) = 1.6e-85
Identities = 23/111 (20%), Positives = 41/111 (36%)
Query: 795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
N A +S + R+ H+ T GY D P Q F+ + + ++
Sbjct: 440 NTAIQSAI-RVQHWKLLTGNPGYSDWVPPQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498
Query: 847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ ++ + L+ N+TAV P D +P+ W
Sbjct: 499 DPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549
Score = 45 (20.9 bits), Expect = 2.2e-84, Sum P(3) = 2.2e-84
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +L+ R + + +FN+ A P
Sbjct: 493 LFNITADPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVP 530
Score = 38 (18.4 bits), Expect = 1.2e-83, Sum P(3) = 1.2e-83
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L+ R
Sbjct: 493 LFNITADPYERVDLSSR 509
>RGD|1307640 [details] [associations]
symbol:Arsj "arylsulfatase family, member J" species:10116
"Rattus norvegicus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 RGD:1307640 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 KO:K12375 CTD:79642
OMA:AAGYGIW OrthoDB:EOG45HRX5 EMBL:CH473952 EMBL:BN000740
IPI:IPI00777558 RefSeq:NP_001041352.1 UniGene:Rn.202364 SMR:Q32KJ7
STRING:Q32KJ7 Ensembl:ENSRNOT00000055633 GeneID:311013
KEGG:rno:311013 UCSC:RGD:1307640 InParanoid:Q32KJ7 NextBio:662880
Genevestigator:Q32KJ7 Uniprot:Q32KJ7
Length = 597
Score = 757 (271.5 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
Identities = 149/328 (45%), Positives = 206/328 (62%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 69 AVTSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TGK+ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+K+ PT
Sbjct: 128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
RGF++ G G DY+ H + + G D+ + AWD +G YST ++T I
Sbjct: 188 KRGFDTFFGSLLGSGDYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQI 247
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H +PLFLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 248 LASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 302
Query: 293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
V AL++ +NSII++ SD SNWPLRG K T WEGG+R G +
Sbjct: 303 HNVTLALKRYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 358
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 359 SPLLKNKGTVCKELVHITDWYPTLISLA 386
Score = 74 (31.1 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG D+W +S S R ILHNID + + G W
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 432
Score = 72 (30.4 bits), Expect = 3.2e-85, Sum P(3) = 3.2e-85
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG D+W +S S R ILHNID
Sbjct: 395 QLDGYDIWETISEGLRSPRVDILHNID 421
Score = 56 (24.8 bits), Expect = 2.0e-85, Sum P(3) = 2.0e-85
Identities = 23/111 (20%), Positives = 41/111 (36%)
Query: 795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
N A +S + R+ H+ T GY D P Q F+ + + ++
Sbjct: 440 NTAIQSAI-RVQHWKLLTGNPGYSDWVPPQAFSNLGPNRWHNERITLSTGKSIWLFNITA 498
Query: 847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ ++ + L+ N+TAV P D +P+ W
Sbjct: 499 DPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 549
Score = 45 (20.9 bits), Expect = 2.8e-84, Sum P(3) = 2.8e-84
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +L+ R + + +FN+ A P
Sbjct: 493 LFNITADPYERVDLSSRYPGI-VKKLLRRLSQFNKTAVP 530
Score = 38 (18.4 bits), Expect = 1.5e-83, Sum P(3) = 1.5e-83
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L+ R
Sbjct: 493 LFNITADPYERVDLSSR 509
>UNIPROTKB|F1P095 [details] [associations]
symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
EMBL:AADN02046150 IPI:IPI00820595 Ensembl:ENSGALT00000038618
ArrayExpress:F1P095 Uniprot:F1P095
Length = 407
Score = 779 (279.3 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
Identities = 149/332 (44%), Positives = 213/332 (64%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A+ PPH++ +LADDLGW DVG+HG I TP +DAL G+ LK Y T LCTPSR +
Sbjct: 35 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 92
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G + IHTG+QH +++ C+ LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 93 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 152
Query: 175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A+ + LD R E A YST++FT
Sbjct: 153 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 212
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A+D+I NH T++PLFLYLA + H EPL+ Y+ + I+D KR ++A ++ +
Sbjct: 213 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 267
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
DE+VG + +AL++ + +N+++VF +D +NWPLRG K TLWEGGVRG
Sbjct: 268 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 323
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 324 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 355
Score = 92 (37.4 bits), Expect = 0.00012, Sum P(2) = 0.00012
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH +++ C+ LPL EK+LP+ LK+ GY T ++
Sbjct: 103 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 137
Score = 82 (33.9 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 365 LDGFDVWKTISEGRPSPRVELLHNID 390
Score = 82 (33.9 bits), Expect = 4.5e-84, Sum P(2) = 4.5e-84
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 365 LDGFDVWKTISEGRPSPRVELLHNID 390
>UNIPROTKB|F1NQP9 [details] [associations]
symbol:ARSI "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:YHGSDIE EMBL:AADN02028629
IPI:IPI00587142 Ensembl:ENSGALT00000009011 Uniprot:F1NQP9
Length = 572
Score = 747 (268.0 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
Identities = 150/335 (44%), Positives = 212/335 (63%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 43 AFARPPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQL 101
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFYKKE PT
Sbjct: 102 ITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPT 161
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
RGF++ LG TG+ DY+ + + + + G D+ AWD GKYST ++ I+
Sbjct: 162 RRGFDTFLGSLTGNVDYYTYDNCDGPGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKIL 221
Query: 234 HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVG 293
+HS EP+F+Y+A A H+ PLQ+P Y+ +R + + R K+AA++ +DE+V
Sbjct: 222 ASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVK 276
Query: 294 KVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWS 353
+ AL++ NS+IVF +D SNWPLRG K T WEGGVRG G + S
Sbjct: 277 NITWALKKYGYYDNSVIVFSTDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGIGFVHS 332
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAA--NKSDIP 386
PL++ + + VH++DW PTL+S A N S++P
Sbjct: 333 PLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVP 367
Score = 76 (31.8 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
Identities = 20/56 (35%), Positives = 26/56 (46%)
Query: 545 RGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 600
R W LV S R N ++ +DG +VW +S + S R ILHNID
Sbjct: 340 RTSWALVHITDWYPTLVSLARGNLSNVPG-LDGYNVWPAISEGKESPRTEILHNID 394
Score = 73 (30.8 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG +VW +S + S R ILHNID
Sbjct: 369 LDGYNVWPAISEGKESPRTEILHNID 394
Score = 51 (23.0 bits), Expect = 4.6e-84, Sum P(3) = 4.6e-84
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +L+++ D + T + +N+ A P
Sbjct: 466 LFNITADPYERYDLSEQRPDV-VRALLTRLVHYNRTAIP 503
Score = 50 (22.7 bits), Expect = 5.8e-84, Sum P(3) = 5.8e-84
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AW 897
PDV+ + L + NRTA+ P+ P + +F+ AW
Sbjct: 484 PDVVRALLTRLVHYNRTAI-PVRYPAENPRAHPDFNGGAW 522
Score = 40 (19.1 bits), Expect = 6.6e-83, Sum P(3) = 6.6e-83
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 783 LFDIKNDPCEKNNLAD-RSEVQR 804
LF+I DP E+ +L++ R +V R
Sbjct: 466 LFNITADPYERYDLSEQRPDVVR 488
>MGI|MGI:2670959 [details] [associations]
symbol:Arsi "arylsulfatase i" species:10090 "Mus musculus"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0005575
"cellular_component" evidence=ND] [GO:0005576 "extracellular
region" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 MGI:MGI:2670959 GO:GO:0005783
GO:GO:0005576 EMBL:CH466528 GO:GO:0046872 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 HOGENOM:HOG000135354
HOVERGEN:HBG004282 GeneTree:ENSGT00560000077076 HSSP:P15289
CTD:340075 KO:K12375 OrthoDB:EOG4DFPN6 OMA:YHGSDIE EMBL:BC138970
EMBL:BC141169 EMBL:BN000748 IPI:IPI00462991 RefSeq:NP_001033588.1
UniGene:Mm.20147 ProteinModelPortal:Q32KI9 SMR:Q32KI9 STRING:Q32KI9
PRIDE:Q32KI9 Ensembl:ENSMUST00000040359 GeneID:545260
KEGG:mmu:545260 UCSC:uc008fbe.1 InParanoid:Q32KI9 NextBio:412424
Bgee:Q32KI9 Genevestigator:Q32KI9 Uniprot:Q32KI9
Length = 573
Score = 743 (266.6 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
Identities = 148/328 (45%), Positives = 207/328 (63%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
VA PPHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 41 VAPPQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQ 99
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
++TG++ IHTG+QH+++ + LPL + LPQ L+E GY T +VGKWHLGFY+KE P
Sbjct: 100 LLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLP 159
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDI 232
T RGF++ LG TG+ DY+ + + + + G D+ AW L G+YST ++ A I
Sbjct: 160 TRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHI 219
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H+ PLFLY+A A H+ PLQ+P YL +R + + R K+AA++ +DE+V
Sbjct: 220 LASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 274
Query: 293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
+ AL++ +NS+I+F SD SNWPLRG K T WEGGVRG G +
Sbjct: 275 RNITWALKRYGFYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKGTYWEGGVRGLGFVH 330
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+ + + VH++DW PTL+ A
Sbjct: 331 SPLLKKKRRTSRALVHITDWYPTLVGLA 358
Score = 77 (32.2 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 77 (32.2 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNID 393
Score = 51 (23.0 bits), Expect = 9.5e-84, Sum P(3) = 9.5e-84
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 859 PDVLSQMEKELANINRTAVAPINKP 883
PDV+ + LA+ NRTA+ P+ P
Sbjct: 482 PDVVRTLLARLADYNRTAI-PVRYP 505
Score = 50 (22.7 bits), Expect = 1.2e-83, Sum P(3) = 1.2e-83
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 464 LFNISADPYEREDLAGQRPDV-VRTLLARLADYNRTAIP 501
Score = 44 (20.5 bits), Expect = 5.1e-83, Sum P(3) = 5.1e-83
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 464 LFNISADPYEREDLAGQRPDVVR 486
>UNIPROTKB|F1NT29 [details] [associations]
symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
EMBL:AADN02046150 IPI:IPI00582830 Ensembl:ENSGALT00000007062
ArrayExpress:F1NT29 Uniprot:F1NT29
Length = 395
Score = 779 (279.3 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
Identities = 149/332 (44%), Positives = 213/332 (64%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A+ PPH++ +LADDLGW DVG+HG I TP +DAL G+ LK Y T LCTPSR +
Sbjct: 41 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 98
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G + IHTG+QH +++ C+ LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 99 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 158
Query: 175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A+ + LD R E A YST++FT
Sbjct: 159 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 218
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A+D+I NH T++PLFLYLA + H EPL+ Y+ + I+D KR ++A ++ +
Sbjct: 219 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 273
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
DE+VG + +AL++ + +N+++VF +D +NWPLRG K TLWEGGVRG
Sbjct: 274 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 329
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 330 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 361
Score = 92 (37.4 bits), Expect = 0.00045, Sum P(2) = 0.00045
Identities = 15/35 (42%), Positives = 25/35 (71%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH +++ C+ LPL EK+LP+ LK+ GY T ++
Sbjct: 109 LQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMV 143
Score = 76 (31.8 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNI 534
+DG DVW +S PS R +LHNI
Sbjct: 371 LDGFDVWKTISEGRPSPRVELLHNI 395
Score = 76 (31.8 bits), Expect = 1.9e-83, Sum P(2) = 1.9e-83
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNI 599
+DG DVW +S PS R +LHNI
Sbjct: 371 LDGFDVWKTISEGRPSPRVELLHNI 395
>UNIPROTKB|F1P098 [details] [associations]
symbol:ARSB "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:AADN02046142 EMBL:AADN02046143
EMBL:AADN02046144 EMBL:AADN02046145 EMBL:AADN02046146
EMBL:AADN02046147 EMBL:AADN02046148 EMBL:AADN02046149
EMBL:AADN02046150 IPI:IPI00820025 Ensembl:ENSGALT00000038614
ArrayExpress:F1P098 Uniprot:F1P098
Length = 388
Score = 779 (279.3 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
Identities = 149/332 (44%), Positives = 213/332 (64%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A+ PPH++ +LADDLGW DVG+HG I TP +DAL G+ LK Y T LCTPSR +
Sbjct: 41 AARPPPHLVLVLADDLGWGDVGWHG-SAIRTPRLDALGAGGVRLKGY-TQPLCTPSRPFL 98
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ G + IHTG+QH +++ C+ LPL EK+LP+ LK+ GY T +VGKWHLG Y+KE PT
Sbjct: 99 LFGGYYIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPT 158
Query: 175 FRGFESHLGYWTGHQDYFDHS------AEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE 228
RGF+++ GY G +DY+ H A+ + LD R E A YST++FT
Sbjct: 159 RRGFDTYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTER 218
Query: 229 AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKL 288
A+D+I NH T++PLFLYLA + H EPL+ Y+ + I+D KR ++A ++ +
Sbjct: 219 AIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLM 273
Query: 289 DESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGA 348
DE+VG + +AL++ + +N+++VF +D +NWPLRG K TLWEGGVRG
Sbjct: 274 DEAVGNLTDALKEYGLWNNTVLVFSTDNGGQTMAGG----NNWPLRGRKWTLWEGGVRGV 329
Query: 349 GLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G + SPLL+ +G+ + + +H+SDWLPTL+ A
Sbjct: 330 GFVASPLLKQKGVESHELIHISDWLPTLVHLA 361
Score = 37 (18.1 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
Identities = 5/10 (50%), Positives = 7/10 (70%)
Query: 510 IDGIDVWSVL 519
+DG DVW +
Sbjct: 371 LDGFDVWKTI 380
Score = 37 (18.1 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
Identities = 5/10 (50%), Positives = 7/10 (70%)
Query: 575 IDGIDVWSVL 584
+DG DVW +
Sbjct: 371 LDGFDVWKTI 380
>UNIPROTKB|F6PKT4 [details] [associations]
symbol:ARSJ "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
HOGENOM:HOG000135354 HOVERGEN:HBG004282
GeneTree:ENSGT00560000077076 OrthoDB:EOG45HRX5 EMBL:AAEX03016834
Ensembl:ENSCAFT00000019312 Uniprot:F6PKT4
Length = 489
Score = 577 (208.2 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
Identities = 114/269 (42%), Positives = 163/269 (60%)
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+++ ++ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE P
Sbjct: 1 LLSSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 60
Query: 174 TFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVD 231
T RGF++ G G DY+ H + M G D+ + AWD +G YST ++T
Sbjct: 61 TKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 120
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDES 291
I+ +H +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE+
Sbjct: 121 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEA 175
Query: 292 VGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI 351
+ V AL+ +NSII++ SD SNWPLRG K T WEGG+R G +
Sbjct: 176 INNVTLALKTYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFV 231
Query: 352 WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 232 HSPLLKNKGTVCKELVHITDWYPTLISLA 260
Score = 75 (31.5 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG DVW +S S R ILHNID + + G W
Sbjct: 269 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 306
Score = 73 (30.8 bits), Expect = 1.8e-69, Sum P(4) = 1.8e-69
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG DVW +S S R ILHNID
Sbjct: 269 QLDGYDVWETISEGLRSPRVDILHNID 295
Score = 59 (25.8 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP ++ Q+ + L+ N+TAV P D +P+ W
Sbjct: 384 YPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 423
Score = 37 (18.1 bits), Expect = 1.1e-69, Sum P(4) = 1.1e-69
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L+ R
Sbjct: 367 LFNITADPYERVDLSHR 383
>UNIPROTKB|F1S147 [details] [associations]
symbol:ARSJ "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:CU694917
Ensembl:ENSSSCT00000009989 Uniprot:F1S147
Length = 467
Score = 575 (207.5 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
Identities = 114/268 (42%), Positives = 160/268 (59%)
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT 174
+ ++ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY+KE PT
Sbjct: 1 LLSRYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPT 60
Query: 175 FRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDI 232
RGF++ G G DY+ H + M G D+ + AWD +G YST ++T I
Sbjct: 61 KRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQI 120
Query: 233 IHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
+ +H P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++
Sbjct: 121 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 175
Query: 293 GKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIW 352
V AL+ +NSII++ SD SNWPLRG K T WEGG+R G +
Sbjct: 176 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAGG----SNWPLRGSKGTYWEGGIRAVGFVH 231
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SPLL+++G V ++ VH++DW PTL+S A
Sbjct: 232 SPLLKNKGTVCKELVHITDWYPTLISLA 259
Score = 75 (31.5 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG DVW +S S R ILHNID + + G W
Sbjct: 268 QLDGYDVWETISEGLRSPRVDILHNIDPIY--TKAKNGSW 305
Score = 74 (31.1 bits), Expect = 0.00074, Sum P(4) = 0.00074
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH+++ + LPL LPQ LKE+GY T ++
Sbjct: 11 LQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMV 45
Score = 73 (30.8 bits), Expect = 1.9e-69, Sum P(4) = 1.9e-69
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG DVW +S S R ILHNID
Sbjct: 268 QLDGYDVWETISEGLRSPRVDILHNID 294
Score = 60 (26.2 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 858 YPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAW 897
YP V+ Q+ + L+ N+TAV P D +P+ W
Sbjct: 383 YPGVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 422
Score = 39 (18.8 bits), Expect = 1.2e-69, Sum P(4) = 1.2e-69
Identities = 8/17 (47%), Positives = 13/17 (76%)
Query: 783 LFDIKNDPCEKNNLADR 799
LF+I DP E+ +L++R
Sbjct: 366 LFNITADPYERVDLSNR 382
>UNIPROTKB|F1NH07 [details] [associations]
symbol:ARSJ "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:AAGYGIW EMBL:AADN02009321
IPI:IPI00574604 Ensembl:ENSGALT00000019613 Uniprot:F1NH07
Length = 472
Score = 560 (202.2 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
Identities = 111/265 (41%), Positives = 162/265 (61%)
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++ IHTG+QH+++ + LPL LPQ LKE+GY T +VGKWHLGFY++E PT RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60
Query: 178 FESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL-HGKYSTDVFTAEAVDIIHN 235
F++ G G DY+ H + + G D+ + AWD +G YST ++T + I+ +
Sbjct: 61 FDTFFGSLLGSGDYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILAS 120
Query: 236 HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKV 295
H+ +P+FLY+A+ A HS PLQAP Y +R I + R ++AA+L LDE++ V
Sbjct: 121 HNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNV 175
Query: 296 VEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPL 355
AL++ NSII++ SD SNWPLRG K T WEGG+R G + SPL
Sbjct: 176 TLALKKYGYYDNSIIIYSSDNGGQPMAGG----SNWPLRGSKGTYWEGGIRAVGFVHSPL 231
Query: 356 LESRGIVAEQYVHVSDWLPTLLSAA 380
L+++G V ++ VH++DW PTL++ A
Sbjct: 232 LKNKGSVCKELVHITDWFPTLITLA 256
Score = 79 (32.9 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
Identities = 31/120 (25%), Positives = 47/120 (39%)
Query: 795 NLADRSEVQRINHY---TTEVGYLD--PKQRFNQIA---YLDKEXXXXXXXXXXXXXXXX 846
N A +S + R+NH+ T GY D P Q F+ + + ++
Sbjct: 310 NTAIQSAI-RVNHWKLLTGNPGYSDWVPPQAFSNVGPNRWHNERVSWSAGKTVWLFNITA 368
Query: 847 XXXXXXXXXXGYPDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSI-FGDDLK 905
YPDV+ Q+ + L+ N+TAV P D +PK W F +D K
Sbjct: 369 DPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWGPWFKEDEK 428
Score = 77 (32.2 bits), Expect = 1.1e-66, Sum P(3) = 1.1e-66
Identities = 15/40 (37%), Positives = 21/40 (52%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
++DG D+W +S S R ILHNID + + G W
Sbjct: 265 QLDGYDIWETISEGRRSPRVDILHNIDPIY--TKAKNGSW 302
Score = 75 (31.5 bits), Expect = 1.7e-66, Sum P(3) = 1.7e-66
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILHNID 600
++DG D+W +S S R ILHNID
Sbjct: 265 QLDGYDIWETISEGRRSPRVDILHNID 291
Score = 72 (30.4 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 1 MQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIM 35
+QH+++ + LPL LPQ LKE+GY T ++
Sbjct: 8 LQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMV 42
Score = 49 (22.3 bits), Expect = 1.5e-63, Sum P(3) = 1.5e-63
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +L+ + D + + +FN+ A P
Sbjct: 363 LFNITADPYERVDLSAKYPDV-VKQLLRRLSQFNKTAVP 400
>WB|WBGene00006310 [details] [associations]
symbol:sul-3 species:6239 "Caenorhabditis elegans"
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:FO080947
UniGene:Cel.8880 GeneID:183778 KEGG:cel:CELE_C54D2.4 CTD:183778
RefSeq:NP_001041231.1 ProteinModelPortal:H2KZF6 SMR:H2KZF6
EnsemblMetazoa:C54D2.4a WormBase:C54D2.4a OMA:RGMMVSD
Uniprot:H2KZF6
Length = 488
Score = 544 (196.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 132/375 (35%), Positives = 203/375 (54%)
Query: 31 RTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDA 90
RT + F +L L + VD ++ P+++FI+ADDLG++DV + + TPN+
Sbjct: 3 RTTLPTFLLL-LLHNHGITGVDGQTATQKPNVLFIMADDLGFSDVDWKD-STLHTPNLRH 60
Query: 91 LAY--SGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQY 148
LA+ + +L N Y QLCTP+RSA MTG +P G Q+ V E G+P L +
Sbjct: 61 LAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSEN 120
Query: 149 LKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAE----EMK--MW 202
+++L Y T +VGKWHLG+ KKE+ PT RGF+ G++ YF+HSA+ E+K +
Sbjct: 121 MRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVK 180
Query: 203 GLDMRRDLE-----PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPY 257
GLD+ ++ P + +G YSTD+FT A+ ++ NH+ +P F++L++ A H P
Sbjct: 181 GLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHNNSKPFFMFLSYQAVH---P- 236
Query: 258 EPLQAPDHYLNIHRHIE-DF---KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
PLQ I + E F + +L +D ++G++VE L+ + N++IVF
Sbjct: 237 -PLQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIGRLVEYLKASNLYENTVIVFT 295
Query: 314 SDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWL 373
SD SN PLRG K+T+WEGG + + SP+ G + HV DW
Sbjct: 296 SDNGGTANFGA----SNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWH 351
Query: 374 PTLLSAANKSDIPNY 388
T+LS +I +Y
Sbjct: 352 ATILSITGL-EIDSY 365
Score = 67 (28.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 18/57 (31%), Positives = 28/57 (49%)
Query: 511 DGIDVWSVLSRNEPS-KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
DGI+ W L P +R ++NID+ SA+ G +KL+ N +NR+
Sbjct: 367 DGINQWEYLKTGRPKFRRFQFVYNIDNHG--SAIRDGDYKLIVGNVDRKMSKDKNRT 421
Score = 49 (22.3 bits), Expect = 2.4e-58, Sum P(3) = 2.4e-58
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 576 DGIDVWSVLSRNEPS-KRNTILHNIDD 601
DGI+ W L P +R ++NID+
Sbjct: 367 DGINQWEYLKTGRPKFRRFQFVYNIDN 393
Score = 47 (21.6 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 9/45 (20%), Positives = 22/45 (48%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDHAWSIFGDD 903
P ++ ++ +L + + + KP G P+ F+ ++S + D
Sbjct: 441 PKIVRRLLAKLDQLKKFLHKNVRKPLSLNGSPERFNGSYSSYWCD 485
Score = 37 (18.1 bits), Expect = 3.5e-59, Sum P(3) = 3.5e-59
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAY 714
LF I DP E ++A RS + + ++ + + +
Sbjct: 423 LFRITTDPTESKDIA-RSNPKIVRRLLAKLDQLKKFLH 459
>UNIPROTKB|P34059 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
activity" evidence=IEA] [GO:0003943
"N-acetylgalactosamine-4-sulfatase activity" evidence=TAS]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
[GO:0005975 "carbohydrate metabolic process" evidence=TAS]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=TAS]
[GO:0042339 "keratan sulfate metabolic process" evidence=TAS]
[GO:0042340 "keratan sulfate catabolic process" evidence=TAS]
[GO:0043202 "lysosomal lumen" evidence=TAS] [GO:0044281 "small
molecule metabolic process" evidence=TAS] Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
GO:GO:0043202 DrugBank:DB00070 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0003943
Orphanet:582 GO:GO:0042340 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
GO:GO:0043890 EMBL:D17629 EMBL:U06088 EMBL:U06078 EMBL:U06079
EMBL:U06080 EMBL:U06081 EMBL:U06082 EMBL:U06083 EMBL:U06084
EMBL:U06085 EMBL:U06086 EMBL:U06087 EMBL:BC050684 EMBL:BC056151
IPI:IPI00029605 PIR:JQ1299 RefSeq:NP_000503.1 UniGene:Hs.271383
PDB:4FDI PDB:4FDJ PDBsum:4FDI PDBsum:4FDJ ProteinModelPortal:P34059
SMR:P34059 STRING:P34059 PhosphoSite:P34059 DMDM:462148
PaxDb:P34059 PRIDE:P34059 DNASU:2588 Ensembl:ENST00000268695
GeneID:2588 KEGG:hsa:2588 UCSC:uc002fly.4 GeneCards:GC16M088880
H-InvDB:HIX0134371 HGNC:HGNC:4122 HPA:CAB026404 MIM:253000
MIM:612222 neXtProt:NX_P34059 PharmGKB:PA28535 InParanoid:P34059
OMA:GAISHAF PhylomeDB:P34059 BioCyc:MetaCyc:HS06790-MONOMER
BRENDA:3.1.6.4 ChiTaRS:Galns GenomeRNAi:2588 NextBio:10237
ArrayExpress:P34059 Bgee:P34059 CleanEx:HS_GALNS
Genevestigator:P34059 GermOnline:ENSG00000141012 Uniprot:P34059
Length = 522
Score = 440 (159.9 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
Identities = 113/353 (32%), Positives = 178/353 (50%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 13 LLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNF 72
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G H N E GG+P SE++LP+ LK+ G
Sbjct: 73 YSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAG 132
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + W + R
Sbjct: 133 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYY 191
Query: 210 LEPAWDLH-GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
E +L G+ + T ++ EA+D I + P FLY A ATH+ P+ A +L
Sbjct: 192 EEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA-----PVYASKPFL 246
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
+ R ++ + ++D+S+GK++E L+ + N+ + F SD
Sbjct: 247 GTSQ------RGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG 300
Query: 328 XSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R L W P + G V+ Q + D T L+ A
Sbjct: 301 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALA 353
Score = 41 (19.5 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 667 VPCEPQIAPCLFDIKN--DP-CEK 687
VP +PQ+ C + + N P CEK
Sbjct: 480 VPAQPQLNVCNWAVMNWAPPGCEK 503
Score = 41 (19.5 bits), Expect = 2.1e-42, Sum P(2) = 2.1e-42
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 773 VPCEPQIAPCLFDIKN--DP-CEK 793
VP +PQ+ C + + N P CEK
Sbjct: 480 VPAQPQLNVCNWAVMNWAPPGCEK 503
>MGI|MGI:1355303 [details] [associations]
symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
"metabolic process" evidence=ISO] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISO] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 MGI:MGI:1355303 GO:GO:0046872
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
OrthoDB:EOG480HWH GO:GO:0043890 BRENDA:3.1.6.4 EMBL:AF111346
EMBL:AF112242 EMBL:AF112230 EMBL:AF112231 EMBL:AF112233
EMBL:AF112232 EMBL:AF112234 EMBL:AF112235 EMBL:AF112236
EMBL:AF112237 EMBL:AF112238 EMBL:AF112239 EMBL:AF112240
EMBL:AF112241 EMBL:AK220245 EMBL:AK159592 EMBL:BC004002
IPI:IPI00310090 RefSeq:NP_001180574.1 RefSeq:NP_057931.3
UniGene:Mm.34702 ProteinModelPortal:Q571E4 SMR:Q571E4 STRING:Q571E4
PhosphoSite:Q571E4 PaxDb:Q571E4 PRIDE:Q571E4
Ensembl:ENSMUST00000015171 GeneID:50917 KEGG:mmu:50917
UCSC:uc012gmh.1 InParanoid:Q571E4 OMA:RKTGEAN NextBio:307919
Bgee:Q571E4 CleanEx:MM_GALNS Genevestigator:Q571E4 Uniprot:Q571E4
Length = 520
Score = 433 (157.5 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
Identities = 115/358 (32%), Positives = 178/358 (49%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
A L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++
Sbjct: 6 AAQQLLLVLSALGLLAAGAPQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGML 65
Query: 98 LKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCE-RGGLPLSEKILPQYL 149
++Y+ LC+PSR+A++TG+ PI G H N E GG+P SE +LP+ L
Sbjct: 66 FPSFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELL 125
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLD 205
K+ GY +IVGKWHLG ++ ++ P GF+ G H +D+ A+ + W +
Sbjct: 126 KKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMV 184
Query: 206 MRRDLE-PAWDLHGKYS-TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQA 262
R E P G+ + T ++T EA+D I H+ P FLY A ATH+ P+ A
Sbjct: 185 GRFYEEFPINRKTGEANLTQLYTQEALDFIQTQHARQSPFFLYWAIDATHA-----PVYA 239
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXX 322
+L R ++ + ++D+SVGK++ L+ + N+ + F SD
Sbjct: 240 SRQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALIS 293
Query: 323 XXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 294 APNEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLA 351
Score = 39 (18.8 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 675 PCLFDIKNDPCEKNNLADRSED-QRINHYTTEVGRFNQ 711
P +F + DP E+ L+ S++ Q TT+V + +Q
Sbjct: 437 PLIFHLGRDPGERFPLSFHSDEYQDALSRTTQVVQEHQ 474
>UNIPROTKB|F1S2F1 [details] [associations]
symbol:F1S2F1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:CU468550
Ensembl:ENSSSCT00000015408 Uniprot:F1S2F1
Length = 151
Score = 435 (158.2 bits), Expect = 4.1e-40, P = 4.1e-40
Identities = 74/127 (58%), Positives = 98/127 (77%)
Query: 62 IIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPI 121
++F+LADDLGWNDVGFHG +I TP++DALA G++L NYYT LCTPSRS ++TG++ I
Sbjct: 15 LVFVLADDLGWNDVGFHG-SEIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQI 73
Query: 122 HTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESH 181
HTG+QH +++ C+ +PL EK+LPQ LKE GY T +VGKWHLG Y+KE PT RGF+++
Sbjct: 74 HTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTY 133
Query: 182 LGYWTGH 188
G H
Sbjct: 134 FGNGNAH 140
>UNIPROTKB|F1PHF0 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9615 "Canis lupus familiaris" [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:RKTGEAN EMBL:AAEX03003965
Ensembl:ENSCAFT00000031604 Uniprot:F1PHF0
Length = 524
Score = 428 (155.7 bits), Expect = 2.4e-39, P = 2.4e-39
Identities = 113/354 (31%), Positives = 175/354 (49%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 14 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 73
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM----QH-NVLYGCER--GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G +H Y + GG+P E +LP+ LKE G
Sbjct: 74 YSANPLCSPSRAALLTGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAG 133
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + W + R
Sbjct: 134 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYY 192
Query: 210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
E +L G+ + T V+ EA+D I + P FLY A ATH+ P+ A +
Sbjct: 193 EEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----PVYASRPF 247
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L + R ++ + ++D SVGK++ L+ R+ N+ + F SD
Sbjct: 248 LGTSQ------RGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQ 301
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 302 GGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLA 355
>UNIPROTKB|Q32KH5 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9615 "Canis lupus familiaris" [GO:0005764 "lysosome"
evidence=IEA] [GO:0043890 "N-acetylgalactosamine-6-sulfatase
activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0046872 GO:GO:0005764
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 HSSP:P15289 EMBL:BN000762
RefSeq:NP_001041585.1 UniGene:Cfa.37704 ProteinModelPortal:Q32KH5
STRING:Q32KH5 PRIDE:Q32KH5 GeneID:489661 KEGG:cfa:489661 CTD:2588
InParanoid:Q32KH5 KO:K01132 OrthoDB:EOG480HWH NextBio:20862813
GO:GO:0043890 Uniprot:Q32KH5
Length = 522
Score = 428 (155.7 bits), Expect = 2.4e-39, P = 2.4e-39
Identities = 113/354 (31%), Positives = 175/354 (49%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGAAGAPQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGM----QH-NVLYGCER--GGLPLSEKILPQYLKELG 153
Y+ LC+PSR+A++TG+ PI G +H Y + GG+P E +LP+ LKE G
Sbjct: 72 YSANPLCSPSRAALLTGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + W + R
Sbjct: 132 YVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYY 190
Query: 210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
E +L G+ + T V+ EA+D I + P FLY A ATH+ P+ A +
Sbjct: 191 EEFPINLKTGEANLTQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----PVYASRPF 245
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L + R ++ + ++D SVGK++ L+ R+ N+ + F SD
Sbjct: 246 LGTSQ------RGRYGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQ 299
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 300 GGSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLA 353
>UNIPROTKB|F1NW57 [details] [associations]
symbol:GALNS "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00560000077076 OMA:DDQVGIL EMBL:AADN02054103
IPI:IPI00577734 Ensembl:ENSGALT00000010149 Uniprot:F1NW57
Length = 521
Score = 409 (149.0 bits), Expect = 5.2e-39, Sum P(2) = 5.2e-39
Identities = 103/337 (30%), Positives = 169/337 (50%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIM 115
+ PP+++ +L DD+GW D+G G TPN+D +A G++ ++Y LC+PSR+A++
Sbjct: 26 AAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPSRAALL 85
Query: 116 TGKHPIHTGMQHNVLYGCER-------GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYK 168
TG+ P+ G + GG+ SE +LP+ LK+ GY +I+GKWHLG ++
Sbjct: 86 TGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLG-HR 144
Query: 169 KEYTPTFRGFESHLGYWTGHQDYFDHSA----EEMKMWGLDMRRDLEPAWDLH-GKYS-T 222
++ P GF+ G H +D+ A + W + R + DL G+ + T
Sbjct: 145 PQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTGEANLT 204
Query: 223 DVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
++ EA+D I ++ +P FLY A ATH+ P+ A H+L + R ++
Sbjct: 205 QIYLQEALDFISKQQASQQPFFLYWAIDATHA-----PVYASKHFLGTSQ------RGRY 253
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLW 341
+ ++D+SVGK+++ L++ + N+ + F SD SN P K T +
Sbjct: 254 GDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTF 313
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
EGG+R + W P G V+ Q V D T LS
Sbjct: 314 EGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLS 350
Score = 41 (19.5 bits), Expect = 5.2e-39, Sum P(2) = 5.2e-39
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSED 696
E P LF + DP EK L+ S++
Sbjct: 433 EHSTLPLLFHLGRDPGEKYPLSFASDE 459
Score = 39 (18.8 bits), Expect = 8.4e-39, Sum P(2) = 8.4e-39
Identities = 10/26 (38%), Positives = 13/26 (50%)
Query: 776 EPQIAPCLFDIKNDPCEKNNLADRSE 801
E P LF + DP EK L+ S+
Sbjct: 433 EHSTLPLLFHLGRDPGEKYPLSFASD 458
>RGD|1565391 [details] [associations]
symbol:Galns "galactosamine (N-acetyl)-6-sulfate sulfatase"
species:10116 "Rattus norvegicus" [GO:0005575 "cellular_component"
evidence=ND] [GO:0005764 "lysosome" evidence=IEA] [GO:0008152
"metabolic process" evidence=RCA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA;ISO;RCA] [GO:0043890
"N-acetylgalactosamine-6-sulfatase activity" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1565391
GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GeneTree:ENSGT00560000077076 HSSP:P15289 CTD:2588 KO:K01132
OrthoDB:EOG480HWH GO:GO:0043890 EMBL:AC134009 EMBL:BN000741
IPI:IPI00359847 RefSeq:NP_001041316.1 UniGene:Rn.101398
ProteinModelPortal:Q32KJ6 STRING:Q32KJ6 PRIDE:Q32KJ6
Ensembl:ENSRNOT00000019528 GeneID:292073 KEGG:rno:292073
UCSC:RGD:1565391 InParanoid:Q32KJ6 NextBio:633705
Genevestigator:Q32KJ6 Uniprot:Q32KJ6
Length = 524
Score = 423 (154.0 bits), Expect = 8.2e-39, P = 8.2e-39
Identities = 112/357 (31%), Positives = 178/357 (49%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
+LP+ L ++ + PP+I+ +L DD+GW D+G +G TPN+D +A G++
Sbjct: 14 LLPVLSALGLL---AAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLF 70
Query: 99 KNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCE-RGGLPLSEKILPQYLK 150
++Y+ LC+PSR+A++TG+ PI G H N E GG+P SE +LP+ LK
Sbjct: 71 PSFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLK 130
Query: 151 ELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDM 206
+ GY +IVGKWHLG ++ ++ P GF+ G H +D+ + + W +
Sbjct: 131 KAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVG 189
Query: 207 RRDLEPAWDLH-GKYS-TDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAP 263
R E +L G+ + T ++ EA+D I H+ P FLY A ATH+ P+ A
Sbjct: 190 RFYEEFPINLKTGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----PVYAS 244
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXX 323
+L R ++ + ++D+SVGK++ L+ + N+ + F SD
Sbjct: 245 KQFLGTSL------RGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISA 298
Query: 324 XXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P + G V+ Q + D T LS A
Sbjct: 299 PKEGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLA 355
>UNIPROTKB|Q8WNQ7 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9823 "Sus scrofa" [GO:0005764 "lysosome" evidence=IEA]
[GO:0043890 "N-acetylgalactosamine-6-sulfatase activity"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 HSSP:P15289 CTD:2588 KO:K01132 OrthoDB:EOG480HWH
GO:GO:0043890 EMBL:AF322917 RefSeq:NP_999120.1 UniGene:Ssc.4371
ProteinModelPortal:Q8WNQ7 STRING:Q8WNQ7 GeneID:397000
KEGG:ssc:397000 ArrayExpress:Q8WNQ7 Uniprot:Q8WNQ7
Length = 522
Score = 422 (153.6 bits), Expect = 1.1e-38, P = 1.1e-38
Identities = 113/354 (31%), Positives = 175/354 (49%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELG 153
Y LC+PSR+A++TG+ PI TG H Y + GG+P E +LP+ LK G
Sbjct: 72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + W + R
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFY 190
Query: 210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
E +L G+ + T ++ EA+D I +T P FLY A ATH+ P+ A +
Sbjct: 191 EEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA-----PVYASRAF 245
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L + R ++ + ++D+SVG++V L ++ N+ + F SD
Sbjct: 246 LGTSQ------RGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQ 299
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P G V+ Q V D T LS A
Sbjct: 300 GGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLA 353
>UNIPROTKB|F1MU84 [details] [associations]
symbol:GALNS "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:DAAA02046255 IPI:IPI00703141
Ensembl:ENSBTAT00000006001 OMA:DDQVGIL Uniprot:F1MU84
Length = 527
Score = 416 (151.5 bits), Expect = 4.7e-38, P = 4.7e-38
Identities = 111/358 (31%), Positives = 172/358 (48%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ N+
Sbjct: 17 LLLVLSAAELGVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNF 76
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
YT LC+PSR+A++TG+ PI +G H N E GG+P SE +LP LK G
Sbjct: 77 YTANPLCSPSRAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAG 136
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA 213
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + + RD E
Sbjct: 137 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP----NIPVYRDQEMV 191
Query: 214 WDLHGKYS----------TDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQA 262
+ ++ T ++ EA++ I + P FLY A ATH+ P+ A
Sbjct: 192 GRFYEEFPINLKTGEANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA-----PIYA 246
Query: 263 PDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXX 322
+L + R ++ + +LD+SVG+++ L + N+ + F SD
Sbjct: 247 SKPFLGTSQ------RGRYGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALIS 300
Query: 323 XXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
SN P K T +EGG+R + W P G V+ Q + D T LS A
Sbjct: 301 APRQGGSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLA 358
>ZFIN|ZDB-GENE-070112-1152 [details] [associations]
symbol:galns "galactosamine (N-acetyl)-6-sulfate
sulfatase" species:7955 "Danio rerio" [GO:0008152 "metabolic
process" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 ZFIN:ZDB-GENE-070112-1152 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:CR376726 EMBL:BX248306
EMBL:CR388041 IPI:IPI01023807 ProteinModelPortal:F8W261
Ensembl:ENSDART00000149478 ArrayExpress:F8W261 Bgee:F8W261
Uniprot:F8W261
Length = 514
Score = 391 (142.7 bits), Expect = 9.8e-37, Sum P(2) = 9.8e-37
Identities = 105/343 (30%), Positives = 167/343 (48%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
+SG P+II +L DD+GW D+G G TP +D +A G++ N+YT LC+PSR+A+
Sbjct: 18 TSGSPNIIIMLMDDMGWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAAL 77
Query: 115 MTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKWHLGFY 167
+TG+ P+ G H Y + GG+ E +LP+ LK Y ++IVGKWHLG +
Sbjct: 78 LTGRLPVRNGFYTTNAHARNAYTPQEIVGGISADEILLPELLKNKHYVSKIVGKWHLG-H 136
Query: 168 KKEYTPTFRGFESHLGYWTGH-QDYFDHSAEEMKMWG-LDMRRDLEPAWDLHGKYS---- 221
+ +Y P GF+ G H Y D S + ++ +M+ ++++ K
Sbjct: 137 RTQYLPLKHGFDEWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNL 196
Query: 222 TDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
T ++ E +D I + P FLY A ATH+ P+ A +L +R +
Sbjct: 197 TQLYLKEGLDFISQQAMAQRPFFLYWAPDATHA-----PVYASKPFLG------KSQRGR 245
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
+ + +LD+S+G+++ L + +++++ F SD SN P K T
Sbjct: 246 YGDAVMELDDSIGQILAHLVSLGIQNDTLVFFTSDNGAALMSGPLQSGSNAPFLCGKETT 305
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+EGG+R + W P G V+ Q V D T LS A S
Sbjct: 306 FEGGMREPAMAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVS 348
Score = 38 (18.4 bits), Expect = 9.8e-37, Sum P(2) = 9.8e-37
Identities = 11/48 (22%), Positives = 22/48 (45%)
Query: 670 EPQIAPCLFDIKNDPCEKNNLADRSEDQR--INHYTTEVGRFNQIAYP 715
E + P +F + DP E+ L+ + ++ R T V + ++ P
Sbjct: 426 EHTMQPLIFHLGRDPGERYPLSVQCKEYRDVFRRVTAVVEQHQKLLIP 473
>UNIPROTKB|F1RL71 [details] [associations]
symbol:F1RL71 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:CU914366
Ensembl:ENSSSCT00000015793 Uniprot:F1RL71
Length = 561
Score = 205 (77.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 38/69 (55%), Positives = 49/69 (71%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S PHIIFIL DD G++DVG+HG D I TP +D LA G+ L+NYY +CTPSRS ++T
Sbjct: 44 SQQPHIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102
Query: 117 GKHPIHTGM 125
G H + G+
Sbjct: 103 GSHSLDRGL 111
Score = 199 (75.1 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 42/102 (41%), Positives = 60/102 (58%)
Query: 279 SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKN 338
+K+AA++ +DE+V + AL+ +NS+I+F SD SNWPLRG K
Sbjct: 252 AKYAAMVTCMDEAVRNITGALKYG-FYNNSVIIFSSDNGGQTFSGG----SNWPLRGRKG 306
Query: 339 TLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T WEGGVRG G + SPLL+ + +H++DW PTL+ A
Sbjct: 307 TYWEGGVRGLGFVHSPLLKRTRRTSRALLHITDWYPTLVGLA 348
Score = 77 (32.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S S R ILHNID
Sbjct: 358 LDGYDVWPAISEGRASPRTEILHNID 383
Score = 77 (32.2 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S S R ILHNID
Sbjct: 358 LDGYDVWPAISEGRASPRTEILHNID 383
Score = 54 (24.1 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 859 PDVLSQMEKELANINRTAVAPINKPFDKGGDPKNFDH-AWSIFGDD 903
PDV+ + L + NRTA+ P+ P + +F+ AW + D
Sbjct: 472 PDVVRALLARLVDYNRTAI-PVRYPAENPRAHPDFNGGAWGPWASD 516
Score = 47 (21.6 bits), Expect = 8.2e-33, Sum P(4) = 8.2e-33
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTEVGRFNQIAYP 715
LF+I DP E+ +LA + D + + +N+ A P
Sbjct: 454 LFNISADPYEREDLAGQRPDV-VRALLARLVDYNRTAIP 491
Score = 44 (20.5 bits), Expect = 3.5e-34, Sum P(5) = 3.5e-34
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 783 LFDIKNDPCEKNNLA-DRSEVQR 804
LF+I DP E+ +LA R +V R
Sbjct: 454 LFNISADPYEREDLAGQRPDVVR 476
Score = 39 (18.8 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 18/75 (24%), Positives = 33/75 (44%)
Query: 76 GFHGLDQ-IPTPNIDALAYSGIILKNYYTVQLCTPSRSAI---MTGKHPIHTGMQHNVLY 131
G H LD+ +P L+ S I+ Q +S +TG + G+ ++ +
Sbjct: 103 GSHSLDRGLPRLQPRELSPSCILTTKTALSQRTRNRKSPAGTRLTGVRDLGPGLTRSLPW 162
Query: 132 GCERGGL--PLSEKI 144
G RGG+ P +++
Sbjct: 163 GRGRGGVLTPCGDEV 177
>UNIPROTKB|Q08DD1 [details] [associations]
symbol:ARSA "Arylsulfatase A" species:9913 "Bos taurus"
[GO:0005509 "calcium ion binding" evidence=ISS] [GO:0005764
"lysosome" evidence=IEA] [GO:0016021 "integral to membrane"
evidence=IEA] [GO:0007339 "binding of sperm to zona pellucida"
evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
[GO:0004098 "cerebroside-sulfatase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005886 GO:GO:0005509 GO:GO:0005764
GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649 EMBL:BC123816
IPI:IPI00713745 RefSeq:NP_001068673.1 UniGene:Bt.1076
ProteinModelPortal:Q08DD1 SMR:Q08DD1 STRING:Q08DD1 PRIDE:Q08DD1
Ensembl:ENSBTAT00000021364 GeneID:505514 KEGG:bta:505514 CTD:410
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InParanoid:Q08DD1 KO:K01134 OMA:FGPSQMA
OrthoDB:EOG4MKNG4 NextBio:20867174 GO:GO:0004098 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 Uniprot:Q08DD1
Length = 507
Score = 358 (131.1 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
Identities = 115/363 (31%), Positives = 169/363 (46%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
+TL++ +A++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 5 WTLTLALAAGLAAASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV 64
Query: 104 -VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
V LCTPSR+A++TG+ P+ G+ VL RGGLPL E L + L GY T I GKW
Sbjct: 65 PVSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLDEVTLAEVLAAQGYLTGIAGKW 124
Query: 163 HLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSA--EEMKMWGL-------D 205
HLG + + P GF LG H F + E + GL +
Sbjct: 125 HLGVGPEGAFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATPCEGICDQGLVPIPLLAN 184
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPD 264
+ + +P W L G + + A A D++ + P FLY A TH + P
Sbjct: 185 LSVEAQPPW-LPGLEAR--YVAFARDLMTDAQHQGRPFFLYYASHHTHYPQ-FSGQSFPG 240
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
H R F L +LD +VG ++ A+ +L +++ F +D
Sbjct: 241 HS----------GRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVFFTADNGPETMRMS 290
Query: 325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
S LR K T +EGGVR L + P + G+ E + D LPTL + A +
Sbjct: 291 HGGCSGL-LRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-AQ 347
Query: 385 IPN 387
+PN
Sbjct: 348 LPN 350
Score = 53 (23.7 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 675 PCLFDIKNDPCEKNNLADRSED 696
P LFD+ DP E NL D ++
Sbjct: 426 PLLFDLSEDPGENYNLLDSVDE 447
Score = 52 (23.4 bits), Expect = 2.8e-33, Sum P(3) = 2.8e-33
Identities = 10/18 (55%), Positives = 11/18 (61%)
Query: 781 PCLFDIKNDPCEKNNLAD 798
P LFD+ DP E NL D
Sbjct: 426 PLLFDLSEDPGENYNLLD 443
Score = 44 (20.5 bits), Expect = 2.2e-33, Sum P(3) = 2.2e-33
Identities = 15/59 (25%), Positives = 31/59 (52%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHN--IDDEWQ-ISALTRGKWK--LVKENSINGNGTSE 563
+DG+D+ +L S R+T+ DE + + A+ GK+K + S++ + T++
Sbjct: 353 LDGVDLSPLLLGTGKSPRHTLFFYSAYPDEVRGVFAVRSGKYKAHFFTQGSVHSDTTAD 411
>UNIPROTKB|F1S6M1 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9823 "Sus scrofa" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:FP102571
Ensembl:ENSSSCT00000002935 OMA:HISAGQX ArrayExpress:F1S6M1
Uniprot:F1S6M1
Length = 305
Score = 363 (132.8 bits), Expect = 2.5e-32, P = 2.5e-32
Identities = 94/289 (32%), Positives = 151/289 (52%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L LS + + + PP+I+ +L DD+GW D+G +G TPN+D +A G++ ++
Sbjct: 12 LLLVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSF 71
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQ----H--NVLYGCER-GGLPLSEKILPQYLKELG 153
Y LC+PSR+A++TG+ PI TG H N E GG+P E +LP+ LK G
Sbjct: 72 YAANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAG 131
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRD 209
Y ++IVGKWHLG ++ ++ P GF+ G H +D+ A + W + R
Sbjct: 132 YASKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFY 190
Query: 210 LEPAWDLH-GKYS-TDVFTAEAVDII-HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHY 266
E +L G+ + T ++ EA+D I +T P FLY A ATH+ P+ A +
Sbjct: 191 EEFPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA-----PVYASRAF 245
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
L + R ++ + ++D+SVG++V L ++ N+ + F SD
Sbjct: 246 LGTSQ------RGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSD 288
>ZFIN|ZDB-GENE-050320-118 [details] [associations]
symbol:arsa "arylsulfatase A" species:7955 "Danio
rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
"sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-050320-118
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 OrthoDB:EOG4MKNG4 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:CR936412
IPI:IPI00488891 UniGene:Dr.91521 SMR:A5WV48
Ensembl:ENSDART00000140193 Uniprot:A5WV48
Length = 503
Score = 345 (126.5 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
Identities = 112/364 (30%), Positives = 164/364 (45%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQ 105
+++ V +S PP+ + + ADDLG+ D+G G TPN+D LA +G+ ++Y T
Sbjct: 12 ALIAAHCVGAS-PPNFVLLFADDLGYGDLGCFGHPCSLTPNLDRLAANGLRFTDFYVTSP 70
Query: 106 LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+C+PSR+A++TG++ +G+ VLY RGGLPL+E + + LK GY T IVGKWHLG
Sbjct: 71 VCSPSRAALLTGRYQTRSGIYPGVLYPGSRGGLPLNETTIAEVLKTQGYSTAIVGKWHLG 130
Query: 166 F-YKKEYTPTFRGFESHLGYWTGHQD----YFDHSAEEMKMWGL-DMRRDLEPAW--DLH 217
Y PT GF+S+LG H ++K +GL D P ++
Sbjct: 131 VGLNGTYLPTRHGFDSYLGIPYSHDQGPCQNLSCFPPDVKCFGLCDQGVVTVPLLFNEII 190
Query: 218 GKYSTDVFTAE------AVDIIHNHSTDE-PLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
+ D E A I + D P FLY TH Y P A Y
Sbjct: 191 KQQPADFLQLEKAYGEFASQFISDSVKDNRPFFLYYPSHHTH----Y-PQYAGADYAG-- 243
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
R F L + D +VGK+++ LE+ +++N++I F D +
Sbjct: 244 ----KSPRGPFGDALMEFDGTVGKILQTLEETGVINNTLIFFTGDNGPELMRKSRGGNAG 299
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVN 390
G K T +EGG+R + P G V D LPT A + +P
Sbjct: 300 LMKCG-KGTTYEGGMREPAIAHWPGFIKPG-VTRALASSLDILPTFAKLAG-APLPEVQL 356
Query: 391 STVE 394
VE
Sbjct: 357 DGVE 360
Score = 56 (24.8 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
Identities = 14/62 (22%), Positives = 27/62 (43%)
Query: 509 EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
++DG+++ +L PSKR T+ + D + +W+ K + + D
Sbjct: 355 QLDGVEMTDILFNLGPSKRQTMFYYPTDPSVKYGVFAVRWENFKAHYYTRGAAHSESTPD 414
Query: 569 NS 570
NS
Sbjct: 415 NS 416
Score = 49 (22.3 bits), Expect = 5.3e-31, Sum P(3) = 5.3e-31
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILH 597
++DG+++ +L PSKR T+ +
Sbjct: 355 QLDGVEMTDILFNLGPSKRQTMFY 378
Score = 45 (20.9 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
Identities = 8/16 (50%), Positives = 11/16 (68%)
Query: 675 PCLFDIKNDPCEKNNL 690
P LF+++ DP E NL
Sbjct: 429 PLLFNLETDPSENYNL 444
Score = 45 (20.9 bits), Expect = 9.9e-32, Sum P(3) = 9.9e-32
Identities = 8/16 (50%), Positives = 11/16 (68%)
Query: 781 PCLFDIKNDPCEKNNL 796
P LF+++ DP E NL
Sbjct: 429 PLLFNLETDPSENYNL 444
>UNIPROTKB|Q32KK2 [details] [associations]
symbol:Arsa "Arylsulfatase A" species:10116 "Rattus
norvegicus" [GO:0004098 "cerebroside-sulfatase activity"
evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 RGD:1310381 GO:GO:0005886
GO:GO:0005509 GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
CTD:410 eggNOG:COG3119 GeneTree:ENSGT00560000076940
HOGENOM:HOG000135352 HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA
OrthoDB:EOG4MKNG4 GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 EMBL:CH474027 EMBL:BN000735 IPI:IPI00361483
RefSeq:NP_001030105.2 UniGene:Rn.23323 SMR:Q32KK2 IntAct:Q32KK2
STRING:Q32KK2 Ensembl:ENSRNOT00000017783 GeneID:315222
KEGG:rno:315222 InParanoid:Q32KK2 NextBio:668936
Genevestigator:Q32KK2 Uniprot:Q32KK2
Length = 507
Score = 339 (124.4 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
Identities = 109/363 (30%), Positives = 166/363 (45%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
TL + ++++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 6 TLVLALAAGLSTASPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ +GM VL +GGLPL E L + L GY T + GKWH
Sbjct: 66 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
LG + + P +GF LG H Q+ + G D +
Sbjct: 126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDITCSGGCDQGLVPIPLLANL 185
Query: 207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
+ +P W L +Y + F+ + + P FLY A TH Y
Sbjct: 186 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 237
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
+ R F L +LD +VG ++ A+ +L ++++F +D
Sbjct: 238 F-------TKRSGRGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNGPELMRMS 290
Query: 325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
S LR K T +EGGVR L++ P + G+ E + D LPTL +A +
Sbjct: 291 DGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 347
Query: 385 IPN 387
+PN
Sbjct: 348 LPN 350
Score = 56 (24.8 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 675 PCLFDIKNDPCEKNNLADRSED 696
P L+D+ DP E NL D +E+
Sbjct: 426 PLLYDLSKDPGENYNLLDSTEE 447
Score = 54 (24.1 bits), Expect = 1.7e-30, Sum P(3) = 1.7e-30
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E NL D +E
Sbjct: 426 PLLYDLSKDPGENYNLLDSTE 446
Score = 45 (20.9 bits), Expect = 1.1e-30, Sum P(3) = 1.1e-30
Identities = 13/43 (30%), Positives = 22/43 (51%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHN--IDDEWQ-ISALTRGKWK 549
+DG+D+ +L S RN++ DE + A+ GK+K
Sbjct: 353 LDGVDISPLLLGTGKSPRNSVFFYPPFPDEIHGVFAVRNGKYK 395
Score = 38 (18.4 bits), Expect = 5.6e-30, Sum P(3) = 5.6e-30
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTI 595
+DG+D+ +L S RN++
Sbjct: 353 LDGVDISPLLLGTGKSPRNSV 373
>RGD|1310381 [details] [associations]
symbol:Arsa "arylsulfatase A" species:10116 "Rattus norvegicus"
[GO:0001669 "acrosomal vesicle" evidence=IDA] [GO:0004065
"arylsulfatase activity" evidence=IDA] [GO:0005509 "calcium ion
binding" evidence=ISO] [GO:0005615 "extracellular space"
evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005764
"lysosome" evidence=IDA] [GO:0005768 "endosome" evidence=IDA]
[GO:0005886 "plasma membrane" evidence=ISO] [GO:0006914 "autophagy"
evidence=IDA] [GO:0007339 "binding of sperm to zona pellucida"
evidence=ISO] [GO:0007417 "central nervous system development"
evidence=IDA] [GO:0007584 "response to nutrient" evidence=IDA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
[GO:0009268 "response to pH" evidence=IDA] [GO:0016021 "integral to
membrane" evidence=ISO] [GO:0031232 "extrinsic to external side of
plasma membrane" evidence=IDA] [GO:0043627 "response to estrogen
stimulus" evidence=IDA] [GO:0045471 "response to ethanol"
evidence=IDA] [GO:0051597 "response to methylmercury" evidence=IDA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 RGD:1310381 GO:GO:0005615 GO:GO:0045471 GO:GO:0005768
GO:GO:0001669 GO:GO:0006914 GO:GO:0007584 GO:GO:0005509
GO:GO:0007417 GO:GO:0005764 GO:GO:0009268 GO:GO:0007339
GO:GO:0043627 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0031232
GO:GO:0051597 HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 IPI:IPI00361483 UniGene:Rn.23323
EMBL:BC105852 ProteinModelPortal:Q3KR80 SMR:Q3KR80 IntAct:Q3KR80
STRING:Q3KR80 ArrayExpress:Q3KR80 Genevestigator:Q3KR80
Uniprot:Q3KR80
Length = 497
Score = 339 (124.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
Identities = 109/363 (30%), Positives = 166/363 (45%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
TL + ++++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 6 TLVLALAAGLSTASPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVP 65
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ +GM VL +GGLPL E L + L GY T + GKWH
Sbjct: 66 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 125
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
LG + + P +GF LG H Q+ + G D +
Sbjct: 126 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDITCSGGCDQGLVPIPLLANL 185
Query: 207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
+ +P W L +Y + F+ + + P FLY A TH Y
Sbjct: 186 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 237
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
+ R F L +LD +VG ++ A+ +L ++++F +D
Sbjct: 238 F-------TKRSGRGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNGPELMRMS 290
Query: 325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
S LR K T +EGGVR L++ P + G+ E + D LPTL +A +
Sbjct: 291 DGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 347
Query: 385 IPN 387
+PN
Sbjct: 348 LPN 350
Score = 56 (24.8 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 675 PCLFDIKNDPCEKNNLADRSED 696
P L+D+ DP E NL D +E+
Sbjct: 416 PLLYDLSKDPGENYNLLDSTEE 437
Score = 54 (24.1 bits), Expect = 4.5e-30, Sum P(3) = 4.5e-30
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E NL D +E
Sbjct: 416 PLLYDLSKDPGENYNLLDSTE 436
Score = 38 (18.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 510 IDGIDVWSVLSRNEPSKRNTI 530
+DG+D+ +L S RN++
Sbjct: 353 LDGVDISPLLLGTGKSPRNSV 373
Score = 38 (18.4 bits), Expect = 2.8e-30, Sum P(3) = 2.8e-30
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 575 IDGIDVWSVLSRNEPSKRNTI 595
+DG+D+ +L S RN++
Sbjct: 353 LDGVDISPLLLGTGKSPRNSV 373
>UNIPROTKB|P15289 [details] [associations]
symbol:ARSA "Arylsulfatase A" species:9606 "Homo sapiens"
[GO:0005509 "calcium ion binding" evidence=IDA] [GO:0004065
"arylsulfatase activity" evidence=TAS] [GO:0005764 "lysosome"
evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IDA] [GO:0004098 "cerebroside-sulfatase activity"
evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
[GO:0043687 "post-translational protein modification" evidence=TAS]
[GO:0044267 "cellular protein metabolic process" evidence=TAS]
[GO:0044281 "small molecule metabolic process" evidence=TAS]
Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886
GO:GO:0044281 GO:GO:0006644 GO:GO:0005509 GO:GO:0005788
GO:GO:0007339 GO:GO:0043687 GO:GO:0043202 Gene3D:3.40.720.10
SUPFAM:SSF53649 CTD:410 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 KO:K01134 OrthoDB:EOG4MKNG4 GO:GO:0004098
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 EMBL:X52151
EMBL:X52150 EMBL:AB448736 EMBL:CR456383 EMBL:AK315011 EMBL:AY271820
EMBL:U62317 EMBL:BC014210 IPI:IPI00744184 PIR:S11031
RefSeq:NP_000478.3 RefSeq:NP_001078894.2 RefSeq:NP_001078895.2
RefSeq:NP_001078896.2 RefSeq:NP_001078897.1 UniGene:Hs.731715
UniGene:Hs.88251 PDB:1AUK PDB:1E1Z PDB:1E2S PDB:1E33 PDB:1E3C
PDB:1N2K PDB:1N2L PDB:2AIJ PDB:2AIK PDBsum:1AUK PDBsum:1E1Z
PDBsum:1E2S PDBsum:1E33 PDBsum:1E3C PDBsum:1N2K PDBsum:1N2L
PDBsum:2AIJ PDBsum:2AIK ProteinModelPortal:P15289 SMR:P15289
IntAct:P15289 STRING:P15289 GlycoSuiteDB:P15289 PaxDb:P15289
PRIDE:P15289 DNASU:410 Ensembl:ENST00000547307
Ensembl:ENST00000547805 GeneID:410 KEGG:hsa:410 UCSC:uc003bna.4
GeneCards:GC22M051063 HGNC:HGNC:713 HPA:CAB025183 HPA:HPA005554
MIM:250100 MIM:272200 MIM:607574 neXtProt:NX_P15289 Orphanet:512
Orphanet:751 PharmGKB:PA25005 InParanoid:P15289 PhylomeDB:P15289
BRENDA:3.1.6.8 ChEMBL:CHEMBL2193 DrugBank:DB01141
EvolutionaryTrace:P15289 GenomeRNAi:410 NextBio:1725
PMAP-CutDB:P15289 ArrayExpress:P15289 Bgee:P15289 CleanEx:HS_ARSA
Genevestigator:P15289 GermOnline:ENSG00000100299 GO:GO:0004065
GO:GO:0006687 Uniprot:P15289
Length = 507
Score = 349 (127.9 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
Identities = 111/353 (31%), Positives = 164/353 (46%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRS 112
+A + PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y V LCTPSR+
Sbjct: 15 LAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRA 74
Query: 113 AIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EY 171
A++TG+ P+ GM VL RGGLPL E + + L GY T + GKWHLG + +
Sbjct: 75 ALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAF 134
Query: 172 TPTFRGFESHLGYWTGHQD-------YFDHSAE-----EMKMWGLDMRRDL----EPAWD 215
P +GF LG H F + + + + + +L +P W
Sbjct: 135 LPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPW- 193
Query: 216 LHGKYSTDVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIE 274
L G + + A A D++ + D P FLY A TH Y E
Sbjct: 194 LPGLEAR--YMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSGQSF-------AE 240
Query: 275 DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR 334
R F L +LD +VG ++ A+ +L ++++F +D S LR
Sbjct: 241 RSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGL-LR 299
Query: 335 GVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
K T +EGGVR L + P + G+ E + D LPTL + A + +PN
Sbjct: 300 CGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAALAG-APLPN 350
Score = 44 (20.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
Identities = 8/16 (50%), Positives = 10/16 (62%)
Query: 675 PCLFDIKNDPCEKNNL 690
P L+D+ DP E NL
Sbjct: 426 PLLYDLSKDPGENYNL 441
Score = 44 (20.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
Identities = 8/16 (50%), Positives = 10/16 (62%)
Query: 781 PCLFDIKNDPCEKNNL 796
P L+D+ DP E NL
Sbjct: 426 PLLYDLSKDPGENYNL 441
>UNIPROTKB|F6PKZ1 [details] [associations]
symbol:ARSA "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 OMA:FGPSQMA OrthoDB:EOG4MKNG4 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03007117
Ensembl:ENSCAFT00000000876 Uniprot:F6PKZ1
Length = 508
Score = 344 (126.2 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
Identities = 117/369 (31%), Positives = 173/369 (46%)
Query: 38 AVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
A+ PL + L++ +A++GPP+I+ I ADDLG+ D+G +G TPN+D LA G+
Sbjct: 1 AMGPL-WALALASAVGLAAAGPPNIVLIFADDLGYGDLGCYGHPSSATPNLDQLAAGGLR 59
Query: 98 LKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
++Y LCTPSR+A++TG+ P+ G+ VL RGGLPL E L + L GY T
Sbjct: 60 FTDFYVPTSLCTPSRAALLTGRLPVRMGLYPGVLEPGSRGGLPLEEVTLAEVLAARGYLT 119
Query: 157 RIVGKWHLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSAE-----EMKMWG 203
I GKWHLG + P +GF LG H F S + +
Sbjct: 120 GIAGKWHLGVGPDGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVP 179
Query: 204 LDMRRDL----EPAWDLHGKYSTDVFTAEAVDIIHNHSTDE-PLFLYLAHAATHSANPYE 258
+ + +L +P W L G + + A A D++ + P FLY A TH Y
Sbjct: 180 IPLLANLSVEAQPPW-LPGLEAR--YVAFARDLMADAQRQGLPFFLYYASHHTH----Y- 231
Query: 259 PLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
P Q + H R F L +LD +VG ++ A+ +L ++++F +D
Sbjct: 232 P-QFGGQSFSGHSG-----RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNGP 285
Query: 319 XXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLS 378
S LR K T ++GGVR L + P + G+ E + D LPTL S
Sbjct: 286 ETMRMSHGGCSGL-LRCGKGTTFDGGVREPALAFWPGHIAPGVTHELASSL-DLLPTLAS 343
Query: 379 AANKSDIPN 387
+ +PN
Sbjct: 344 LTG-APLPN 351
Score = 47 (21.6 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 675 PCLFDIKNDPCEKNNL 690
P LFD+ DP E NL
Sbjct: 427 PLLFDLSEDPGENYNL 442
Score = 47 (21.6 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 781 PCLFDIKNDPCEKNNL 796
P LFD+ DP E NL
Sbjct: 427 PLLFDLSEDPGENYNL 442
>MGI|MGI:88077 [details] [associations]
symbol:Arsa "arylsulfatase A" species:10090 "Mus musculus"
[GO:0001669 "acrosomal vesicle" evidence=ISO] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=ISO] [GO:0004098 "cerebroside-sulfatase
activity" evidence=IEA] [GO:0005509 "calcium ion binding"
evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
[GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764 "lysosome"
evidence=ISO] [GO:0005768 "endosome" evidence=ISO] [GO:0005886
"plasma membrane" evidence=IDA] [GO:0006914 "autophagy"
evidence=ISO] [GO:0007339 "binding of sperm to zona pellucida"
evidence=IMP] [GO:0007417 "central nervous system development"
evidence=ISO] [GO:0007584 "response to nutrient" evidence=ISO]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=ISO] [GO:0009268 "response to
pH" evidence=ISO] [GO:0016021 "integral to membrane" evidence=IDA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0031232
"extrinsic to external side of plasma membrane" evidence=ISO]
[GO:0043627 "response to estrogen stimulus" evidence=ISO]
[GO:0045471 "response to ethanol" evidence=ISO] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0051597 "response to methylmercury"
evidence=ISO] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 MGI:MGI:88077 GO:GO:0016021
GO:GO:0005886 GO:GO:0005509 GO:GO:0005764 GO:GO:0007339
EMBL:CH466550 Gene3D:3.40.720.10 SUPFAM:SSF53649 CTD:410
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 KO:K01134 OMA:FGPSQMA OrthoDB:EOG4MKNG4
GO:GO:0004098 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
EMBL:X73230 EMBL:X73231 EMBL:AK004540 EMBL:AK132501 EMBL:BC011284
EMBL:BC098075 EMBL:M82876 IPI:IPI00118039 PIR:A54190
RefSeq:NP_033843.2 UniGene:Mm.620 ProteinModelPortal:P50428
SMR:P50428 IntAct:P50428 STRING:P50428 PaxDb:P50428 PRIDE:P50428
Ensembl:ENSMUST00000165199 GeneID:11883 KEGG:mmu:11883
InParanoid:Q9DC66 SABIO-RK:P50428 NextBio:279915 Bgee:P50428
CleanEx:MM_ARSA Genevestigator:P50428 GermOnline:ENSMUSG00000022620
GO:GO:0008484 Uniprot:P50428
Length = 506
Score = 335 (123.0 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
Identities = 108/363 (29%), Positives = 165/363 (45%)
Query: 45 TLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT- 103
TL + ++++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 5 TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRFTDFYVP 64
Query: 104 VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
V LCTPSR+A++TG+ P+ +GM VL +GGLPL E L + L GY T + GKWH
Sbjct: 65 VSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWH 124
Query: 164 LGFYKK-EYTPTFRGFESHLGYWTGH-----QDYFDHSAEEMKMWGLD-----------M 206
LG + + P +GF LG H Q+ + G D +
Sbjct: 125 LGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCDQGLVPIPLLANL 184
Query: 207 RRDLEPAW--DLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
+ +P W L +Y + F+ + + P FLY A TH Y
Sbjct: 185 TVEAQPPWLPGLEARYVS--FSRDLMADAQRQG--RPFFLYYASHHTH----YPQFSGQS 236
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX 324
+ R F L +LD +VG ++ + +L ++++F +D
Sbjct: 237 F-------TKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMS 289
Query: 325 XXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSD 384
S LR K T +EGGVR L++ P + G+ E + D LPTL +A +
Sbjct: 290 NGGCSGL-LRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSL-DLLPTL-AALTGAP 346
Query: 385 IPN 387
+PN
Sbjct: 347 LPN 349
Score = 44 (20.5 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P L+D+ DP E N+ + E
Sbjct: 425 PLLYDLSQDPGENYNVLESIE 445
Score = 44 (20.5 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E N+ + E
Sbjct: 425 PLLYDLSQDPGENYNVLESIE 445
>RGD|1304917 [details] [associations]
symbol:Arse "arylsulfatase E (chondrodysplasia punctata 1)"
species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0004065 "arylsulfatase activity" evidence=IEA]
[GO:0005575 "cellular_component" evidence=ND] [GO:0008150
"biological_process" evidence=ND] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1304917
Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 KO:K12374 CTD:415
OMA:CHIVALA EMBL:BN000737 IPI:IPI00367421 RefSeq:NP_001041350.1
UniGene:Rn.79118 STRING:Q32KK0 Ensembl:ENSRNOT00000033080
GeneID:310326 KEGG:rno:310326 UCSC:RGD:1304917 InParanoid:Q32KK0
NextBio:661844 Genevestigator:Q32KK0 Uniprot:Q32KK0
Length = 611
Score = 263 (97.6 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
Identities = 69/182 (37%), Positives = 93/182 (51%)
Query: 38 AVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
A L L + +VD + S P P+ + I+ADDLG D+G +G I TPNID LA G+
Sbjct: 12 ATLLCIVLLGLQYVDALRSPPPRPNFLIIMADDLGIGDLGCYGNTSIRTPNIDRLAEDGV 71
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLK 150
L Y + +CTPSR+A +TG++PI +GM H VL + GGLP E + L+
Sbjct: 72 RLTQYLAAESVCTPSRAAFLTGRYPIRSGMTSGNGHRVLQWAAGAGGLPPKEITFARILQ 131
Query: 151 ELGYRTRIVGKWHLGFYKKEYT-----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLD 205
GY T +VGKWHLG + + P GF LG G + K GL+
Sbjct: 132 GQGYVTGLVGKWHLGLSCRTVSDLCHHPLNHGFHHFLGLPLGMMGDCAGAEPSEKRAGLE 191
Query: 206 MR 207
R
Sbjct: 192 RR 193
Score = 119 (46.9 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
Identities = 42/160 (26%), Positives = 68/160 (42%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T + EA D + H P L+L+ TH+ PL + H ++
Sbjct: 274 TPLLLREAKDFLRRHR-HAPFLLFLSLLHTHT-----PLVTSPEFRGRSAH------GRY 321
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX---XXXXXXXXXXSNWPLRGVKN 338
+ ++D VG+++E LE + ++++ F SD SN RG K
Sbjct: 322 GDNVEEMDWVVGQILEVLEHEGLTDSTLVHFTSDNGAWLEAQAGGEQLGGSNGVFRGGKG 381
Query: 339 TL-WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLL 377
WEGG+R G+ P + RG V +Q V + D PT++
Sbjct: 382 MGGWEGGIRVPGVFRWPGVLPRGRVLDQPVSLMDVFPTVV 421
Score = 40 (19.1 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 762 AASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEVQRINHYTTEV 812
AA++ C V +V E P LF++ +DP E L +++ + T +
Sbjct: 500 AAAV-CPCVGKV--EEHDPPLLFELTSDPGEVRPLRAPAKMSEAPNLTAAI 547
Score = 38 (18.4 bits), Expect = 3.6e-26, Sum P(3) = 3.6e-26
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 656 AASIQCGPVKEVPCEPQIAPCLFDIKNDPCE 686
AA++ C V +V E P LF++ +DP E
Sbjct: 500 AAAV-CPCVGKV--EEHDPPLLFELTSDPGE 527
>UNIPROTKB|F1Q1V3 [details] [associations]
symbol:STS "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026120 EMBL:AAEX03026121
Ensembl:ENSCAFT00000017942 Uniprot:F1Q1V3
Length = 594
Score = 246 (91.7 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
Identities = 52/131 (39%), Positives = 73/131 (55%)
Query: 42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L L ++ + + P P+ + ++ADDLG D G +G + TPNID LA G+ L
Sbjct: 22 LRLLLLLLLCEAQGHAAPRPNFVLLMADDLGIGDPGCYGNTTLRTPNIDRLAAEGVKLTQ 81
Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGY 154
+ LCTPSR+A MTG++PI +GM ++ GGLP SE + LK GY
Sbjct: 82 HLAASPLCTPSRAAFMTGRYPIRSGMASQSFIGVFIFSASSGGLPTSEITFAKLLKNQGY 141
Query: 155 RTRIVGKWHLG 165
T ++GKWHLG
Sbjct: 142 STALIGKWHLG 152
Score = 125 (49.1 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
Identities = 42/170 (24%), Positives = 77/170 (45%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T TA+A I ++ P L L++ H+A +PD + +H +
Sbjct: 275 TQRLTADAAQFIRRNA-GTPFLLLLSYLHVHTAL----FSSPD-FAGHSQH------GAY 322
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
+LD SVG+++ L++ ++ +N+++ F SD SN +G K
Sbjct: 323 GDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAHVEEVTTKGEVHGGSNGIYKGGK 382
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R G++ W ++++ G+V ++ D PT+ A S +P
Sbjct: 383 ANNWEGGIRIPGILRWPGVIQA-GLVIDEPTSNMDIFPTVAKLAG-SPLP 430
Score = 56 (24.8 bits), Expect = 4.5e-06, Sum P(4) = 4.5e-06
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
++ GGLP SE + LK GY T ++
Sbjct: 117 IFSASSGGLPTSEITFAKLLKNQGYSTALI 146
Score = 48 (22.0 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E+ L+ +E
Sbjct: 514 PLLFDVAKDPGERTPLSPATE 534
Score = 48 (22.0 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E+ L+ +E
Sbjct: 514 PLLFDVAKDPGERTPLSPATE 534
Score = 43 (20.2 bits), Expect = 2.3e-26, Sum P(4) = 2.3e-26
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
H+ P + + ++ HE+ Y N Y N + P+N +
Sbjct: 438 HDLMPLLQGKTQHSDHEFLFHYCNFYLNAVRWH--PRNSTS 476
>UNIPROTKB|F1MFZ8 [details] [associations]
symbol:STS "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:DAAA02075641
EMBL:DAAA02075642 EMBL:DAAA02075643 EMBL:DAAA02075644
EMBL:DAAA02075645 IPI:IPI00693675 UniGene:Bt.63535
Ensembl:ENSBTAT00000027703 Uniprot:F1MFZ8
Length = 578
Score = 252 (93.8 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
Identities = 55/135 (40%), Positives = 78/135 (57%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
++ P+ + ++ADDLG D G +G + TPNID LA G+ L + LCTPSR+A
Sbjct: 18 AASKPNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLARGGVKLTQHLAASPLCTPSRAAF 77
Query: 115 MTGKHPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
MTG++P+ +GM Q V L+ GGLP SE + LK+ GY T ++GKWHLG
Sbjct: 78 MTGRYPVRSGMASQSQVGVFLFSASSGGLPPSEITFAKLLKDQGYSTALIGKWHLGISCH 137
Query: 170 E-----YTPTFRGFE 179
+ + PT GF+
Sbjct: 138 DPGDFCHHPTSHGFD 152
Score = 119 (46.9 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
Identities = 69/324 (21%), Positives = 129/324 (39%)
Query: 144 ILPQYLKELGYRTRIVGKWHLGFYKKE---YTPTFRGFESHLGYWTGHQDYFDHSAEEMK 200
+LP L L T +V KW LG ++ + F LG G YF +
Sbjct: 182 LLPMQLIALALLTLVVLKW-LGLFRAPPCAFLFLFLLATLLLGLLLGFLHYF----RPLN 236
Query: 201 MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPL 260
+ L RD+ + T TA+A + ++ + P L L+ H+A L
Sbjct: 237 CF-LMRNRDITQQ-PMSYDNLTQRLTADAAHFLRRNA-ETPFLLVLSFLHMHTA-----L 288
Query: 261 QAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXX 320
+ + +H + ++D SVG++++ L + ++ +N+++ F SD
Sbjct: 289 FSSKDFAGKSQH------GSYGDAAEEMDWSVGQILDVLHELKLANNTLVYFSSDQGAHV 342
Query: 321 XXXXXXXX----SNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPT 375
SN +G K WEGG+R G++ W ++++ G+ ++ D PT
Sbjct: 343 EEVTVKGEVQGGSNGIYKGGKANNWEGGIRVPGIVRWPGVIQA-GLEIDEPTSNMDIFPT 401
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNP 435
+ A S +P +++P + +R + HE+ N+ Y N + P
Sbjct: 402 VAKLAG-SPLPQDRVIDGRDLMPLLQ---MRTQRSEHEF---LFHYCNS-YLNAVRWHPP 453
Query: 436 KYENRYENGTHEYNPKYENRYENG 459
+ ++ + PK+ NG
Sbjct: 454 NSTSIWK--AFFFTPKFSPEGANG 475
Score = 58 (25.5 bits), Expect = 2.9e-05, Sum P(3) = 2.9e-05
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
L+ GGLP SE + LK+ GY T ++
Sbjct: 98 LFSASSGGLPPSEITFAKLLKDQGYSTALI 127
Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LF+I DP E+N L E
Sbjct: 495 PLLFEISRDPRERNPLTPTLE 515
Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(3) = 4.1e-26
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LF+I DP E+N L E
Sbjct: 495 PLLFEISRDPRERNPLTPTLE 515
Score = 44 (20.5 bits), Expect = 1.0e-25, Sum P(3) = 1.0e-25
Identities = 20/79 (25%), Positives = 31/79 (39%)
Query: 584 LSRNEPSKRNTILHNIDDE-WQISALTXXXXXXXXXXXXMRYQVDLTGGPDQVYLSGLSD 642
+SR +P +RN + ++ W+I R+ L P+Q+ L L
Sbjct: 500 ISR-DPRERNPLTPTLEPRFWEI--------LEAMQEAAARHARTLQDVPNQLSLGNLMW 550
Query: 643 REWLALAMRKLRDAASIQC 661
+ WL L L S QC
Sbjct: 551 KPWLQLCCSSL--GLSCQC 567
>UNIPROTKB|F1Q1V2 [details] [associations]
symbol:STS "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:AAEX03026120
EMBL:AAEX03026121 Ensembl:ENSCAFT00000017943 Uniprot:F1Q1V2
Length = 637
Score = 246 (91.7 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
Identities = 52/131 (39%), Positives = 73/131 (55%)
Query: 42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L L ++ + + P P+ + ++ADDLG D G +G + TPNID LA G+ L
Sbjct: 3 LRLLLLLLLCEAQGHAAPRPNFVLLMADDLGIGDPGCYGNTTLRTPNIDRLAAEGVKLTQ 62
Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGY 154
+ LCTPSR+A MTG++PI +GM ++ GGLP SE + LK GY
Sbjct: 63 HLAASPLCTPSRAAFMTGRYPIRSGMASQSFIGVFIFSASSGGLPTSEITFAKLLKNQGY 122
Query: 155 RTRIVGKWHLG 165
T ++GKWHLG
Sbjct: 123 STALIGKWHLG 133
Score = 125 (49.1 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
Identities = 42/170 (24%), Positives = 77/170 (45%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T TA+A I ++ P L L++ H+A +PD + +H +
Sbjct: 256 TQRLTADAAQFIRRNA-GTPFLLLLSYLHVHTAL----FSSPD-FAGHSQH------GAY 303
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
+LD SVG+++ L++ ++ +N+++ F SD SN +G K
Sbjct: 304 GDAAEELDWSVGQILNVLDELKLANNTLVYFTSDQGAHVEEVTTKGEVHGGSNGIYKGGK 363
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R G++ W ++++ G+V ++ D PT+ A S +P
Sbjct: 364 ANNWEGGIRIPGILRWPGVIQA-GLVIDEPTSNMDIFPTVAKLAG-SPLP 411
Score = 56 (24.8 bits), Expect = 6.5e-06, Sum P(4) = 6.5e-06
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
++ GGLP SE + LK GY T ++
Sbjct: 98 IFSASSGGLPTSEITFAKLLKNQGYSTALI 127
Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E+ L+ +E
Sbjct: 495 PLLFDVAKDPGERTPLSPATE 515
Score = 48 (22.0 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E+ L+ +E
Sbjct: 495 PLLFDVAKDPGERTPLSPATE 515
Score = 43 (20.2 bits), Expect = 4.1e-26, Sum P(4) = 4.1e-26
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
H+ P + + ++ HE+ Y N Y N + P+N +
Sbjct: 419 HDLMPLLQGKTQHSDHEFLFHYCNFYLNAVRWH--PRNSTS 457
>UNIPROTKB|P08842 [details] [associations]
symbol:STS "Steryl-sulfatase" species:9606 "Homo sapiens"
[GO:0007565 "female pregnancy" evidence=IEA] [GO:0016021 "integral
to membrane" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0016020 "membrane" evidence=TAS] [GO:0005764
"lysosome" evidence=TAS] [GO:0005768 "endosome" evidence=TAS]
[GO:0005783 "endoplasmic reticulum" evidence=TAS] [GO:0043231
"intracellular membrane-bounded organelle" evidence=TAS]
[GO:0005794 "Golgi apparatus" evidence=TAS] [GO:0005886 "plasma
membrane" evidence=TAS] [GO:0006706 "steroid catabolic process"
evidence=TAS] [GO:0008544 "epidermis development" evidence=TAS]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
[GO:0004773 "steryl-sulfatase activity" evidence=TAS] [GO:0005788
"endoplasmic reticulum lumen" evidence=TAS] [GO:0005789
"endoplasmic reticulum membrane" evidence=TAS] [GO:0006644
"phospholipid metabolic process" evidence=TAS] [GO:0006665
"sphingolipid metabolic process" evidence=TAS] [GO:0006687
"glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
"post-translational protein modification" evidence=TAS] [GO:0044267
"cellular protein metabolic process" evidence=TAS] [GO:0044281
"small molecule metabolic process" evidence=TAS]
Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0016021
GO:GO:0005886 GO:GO:0005794 GO:GO:0005635 GO:GO:0043588
GO:GO:0044281 GO:GO:0005789 GO:GO:0046872 GO:GO:0006706
GO:GO:0008284 GO:GO:0005768 GO:GO:0043434 GO:GO:0006644
GO:GO:0007565 GO:GO:0005764 GO:GO:0009268 GO:GO:0007611
GO:GO:0005788 GO:GO:0043627 GO:GO:0043687 GO:GO:0008544
DrugBank:DB00655 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
HOGENOM:HOG000135352 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0006687 OrthoDB:EOG4V4379
EMBL:J04964 EMBL:M16505 EMBL:AK314034 EMBL:BC075030 EMBL:M23945
EMBL:M23556 IPI:IPI00307433 PIR:A32641 RefSeq:NP_000342.2
UniGene:Hs.522578 UniGene:Hs.700558 UniGene:Hs.700559
UniGene:Hs.740067 PDB:1P49 PDBsum:1P49 ProteinModelPortal:P08842
SMR:P08842 MINT:MINT-1177440 STRING:P08842 PhosphoSite:P08842
DMDM:135006 PaxDb:P08842 PRIDE:P08842 Ensembl:ENST00000217961
GeneID:412 KEGG:hsa:412 UCSC:uc004cry.4 CTD:412
GeneCards:GC0XP007147 HGNC:HGNC:11425 HPA:HPA002904 MIM:300747
MIM:308100 neXtProt:NX_P08842 Orphanet:461 PharmGKB:PA36225
InParanoid:P08842 KO:K01131 OMA:GLSCQCD PhylomeDB:P08842
BindingDB:P08842 ChEMBL:CHEMBL3559 EvolutionaryTrace:P08842
GenomeRNAi:412 NextBio:1743 Bgee:P08842 CleanEx:HS_STS
Genevestigator:P08842 GermOnline:ENSG00000101846 GO:GO:0004773
Uniprot:P08842
Length = 583
Score = 248 (92.4 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
Identities = 57/135 (42%), Positives = 76/135 (56%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II ++ADDLG D G +G I TPNID LA G+ L + LCTPSR+A MTG+
Sbjct: 27 PNIILVMADDLGIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLCTPSRAAFMTGR 86
Query: 119 HPIHTGM----QHNV-LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KKE 170
+P+ +GM + V L+ GGLP E + LK+ GY T ++GKWHLG K +
Sbjct: 87 YPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTD 146
Query: 171 YT--PTFRGFESHLG 183
+ P GF G
Sbjct: 147 FCHHPLHHGFNYFYG 161
Score = 112 (44.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
Identities = 45/197 (22%), Positives = 82/197 (41%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T T EA I + T+ P L L++ H+A L + + +H +
Sbjct: 261 TQRLTVEAAQFIQRN-TETPFLLVLSYLHVHTA-----LFSSKDFAGKSQH------GVY 308
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
+ ++D SVG+++ L++ R+ ++++I F SD SN +G K
Sbjct: 309 GDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGK 368
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENII 397
WEGG+R G++ P + G ++ D PT+ A + +P +++
Sbjct: 369 ANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAG-APLPEDRIIDGRDLM 427
Query: 398 PRYENSILRYENGTHEY 414
P E R + HE+
Sbjct: 428 PLLEGKSQRSD---HEF 441
Score = 58 (25.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFDI DP E+N L SE
Sbjct: 500 PLLFDISKDPRERNPLTPASE 520
Score = 58 (25.5 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFDI DP E+N L SE
Sbjct: 500 PLLFDISKDPRERNPLTPASE 520
Score = 54 (24.1 bits), Expect = 3.9e-05, Sum P(4) = 3.9e-05
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
L+ GGLP E + LK+ GY T ++
Sbjct: 103 LFTASSGGLPTDEITFAKLLKDQGYSTALI 132
Score = 39 (18.8 bits), Expect = 5.8e-26, Sum P(4) = 5.8e-26
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
P E + + HE+ Y N Y N + P+N +
Sbjct: 428 PLLEGKSQRSDHEFLFHYCNAYLNAVRWH--PQNSTS 462
>UNIPROTKB|P25549 [details] [associations]
symbol:aslA "arylsulfatase" species:83333 "Escherichia coli
K-12" [GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0042597 "periplasmic space" evidence=IEA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:U00096
EMBL:AP009048 GenomeReviews:AP009048_GR GenomeReviews:U00096_GR
GO:GO:0046872 GO:GO:0042597 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 OMA:FGPSQMA InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 EMBL:M90498 EMBL:M87049 PIR:S30691
RefSeq:NP_418245.1 RefSeq:YP_491641.1 ProteinModelPortal:P25549
SMR:P25549 IntAct:P25549 EnsemblBacteria:EBESCT00000000559
EnsemblBacteria:EBESCT00000017339 GeneID:12933611 GeneID:949015
KEGG:ecj:Y75_p3377 KEGG:eco:b3801 PATRIC:32123099 EchoBASE:EB0087
EcoGene:EG10089 HOGENOM:HOG000126460 KO:K01130
ProtClustDB:CLSK880785 BioCyc:EcoCyc:ARYLSULFAT-MONOMER
BioCyc:ECOL316407:JW3773-MONOMER BioCyc:MetaCyc:ARYLSULFAT-MONOMER
Genevestigator:P25549 Uniprot:P25549
Length = 551
Score = 319 (117.4 bits), Expect = 1.6e-25, P = 1.6e-25
Identities = 109/370 (29%), Positives = 173/370 (46%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI---PTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
P+++ L DD+GW DVGF+G PTP+IDA+A G+IL + Y+ +P+R+ I+T
Sbjct: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145
Query: 117 GKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPT-- 174
G++ IH G+ +YG + GGL LPQ L + GY T+ +GKWH+G KE P
Sbjct: 146 GQYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202
Query: 175 ----FRGFESHLGYWTGHQDYF---------DHSAEEMKM--WGLD----MRRDLEPAW- 214
FRGF S +T +D D S E +K + D +R + A
Sbjct: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS-EYIKQLPFSKDDVHAVRGGEQQAIA 261
Query: 215 DLHGKYSTDV---FTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
D+ KY D+ + V + + +D+P FLY H D+Y N
Sbjct: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----------DNYPNAK 311
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
R+ + + ++++ + + LE+ L N++IVF SD
Sbjct: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369
Query: 331 WPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYV 389
P RG K + WEGGVR + W +++ R ++ V ++D PT L D+ +
Sbjct: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTAL------DLAGHP 420
Query: 390 NSTVENIIPR 399
+ V N++P+
Sbjct: 421 GAKVANLVPK 430
>UNIPROTKB|F1NWF7 [details] [associations]
symbol:ARSA "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0005886 "plasma membrane" evidence=IEA] [GO:0007339 "binding of
sperm to zona pellucida" evidence=IEA] [GO:0016021 "integral to
membrane" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:GFDENTI
EMBL:AADN02075680 EMBL:AADN02075681 IPI:IPI00584710
Ensembl:ENSGALT00000015860 Uniprot:F1NWF7
Length = 493
Score = 308 (113.5 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
Identities = 108/377 (28%), Positives = 170/377 (45%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCT-PSRSA 113
A+ GPP + +LADDLG+ D+G +G TPN+ LA + + C P R+A
Sbjct: 15 AAGGPPSFVLLLADDLGFGDLGSYGHPSSATPNLSCLARAA-------PYECCPYPCRAA 67
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK-EYT 172
++TG+ + +G+ V Y RGGLPLSE + + LK GY T IVGKWHLG + +
Sbjct: 68 LLTGRFQMRSGIYPGVFYPGSRGGLPLSEVTIAEVLKAKGYATAIVGKWHLGLGARGSFL 127
Query: 173 PTFRGFESHLGYWTGHQD----YFDHSAEEMKMWGLDMRRDLEPA---WDLHGKYSTDVF 225
P +GF+ LG H ++K +G + L P W+ V
Sbjct: 128 PIHQGFDHFLGVPYSHDQGPCQNLTCFPPDIKCFGT-CDQGLVPVPLFWN-QSIVQQPVS 185
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH--YLNI--HRHIEDFKRSKF 281
+ V + + + D ++A A P+ A H Y + +R F
Sbjct: 186 FPDLVPLYNKFARD-----FIADCARRGV-PFLLYYASHHTHYPQFASQEYAGRSQRGPF 239
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLW 341
L + D SVG++++AL++ + + + + F SD S L+ K T +
Sbjct: 240 GDALSEFDGSVGQLLQALQENGLENTTFVFFTSDNGPSTMRMARGGSSGL-LKCGKGTTY 298
Query: 342 EGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYE 401
EGG+R + + P + G+ E D LPTL + A + +PN V+ + Y+
Sbjct: 299 EGGMREPAVAYWPGRIAPGVTHE-LASTLDILPTLTALAGAA-LPN-VS------LDGYD 349
Query: 402 NSILRYENGTHEYNSPR 418
S L +E+G SPR
Sbjct: 350 LSPLLFESG----KSPR 362
Score = 52 (23.4 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 675 PCLFDIKNDPCEKNNL 690
P LFD+++DP E NL
Sbjct: 418 PLLFDLESDPAENYNL 433
Score = 52 (23.4 bits), Expect = 1.6e-25, Sum P(2) = 1.6e-25
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 781 PCLFDIKNDPCEKNNL 796
P LFD+++DP E NL
Sbjct: 418 PLLFDLESDPAENYNL 433
>UNIPROTKB|F5H325 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
HGNC:HGNC:4122 ChiTaRS:Galns IPI:IPI00978346
ProteinModelPortal:F5H325 SMR:F5H325 Ensembl:ENST00000542788
ArrayExpress:F5H325 Bgee:F5H325 Uniprot:F5H325
Length = 447
Score = 285 (105.4 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
Identities = 77/251 (30%), Positives = 123/251 (49%)
Query: 136 GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHS 195
GG+P SE++LP+ LK+ GY ++IVGKWHLG ++ ++ P GF+ G H +D+
Sbjct: 40 GGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNK 98
Query: 196 AEE----MKMWGLDMRRDLEPAWDLH-GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHA 249
A + W + R E +L G+ + T ++ EA+D I + P FLY A
Sbjct: 99 ARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVD 158
Query: 250 ATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
ATH+ P+ A +L + R ++ + ++D+S+GK++E L+ + N+
Sbjct: 159 ATHA-----PVYASKPFLGTSQ------RGRYGDAVREIDDSIGKILELLQDLHVADNTF 207
Query: 310 IVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
+ F SD SN P K T +EGG+R L W P + G V+ Q +
Sbjct: 208 VFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSI 267
Query: 370 SDWLPTLLSAA 380
D T L+ A
Sbjct: 268 MDLFTTSLALA 278
Score = 68 (29.0 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 12 GGLPLSEKILPQYLKELGYRTRIM 35
GG+P SE++LP+ LK+ GY ++I+
Sbjct: 40 GGIPDSEQLLPELLKKAGYVSKIV 63
Score = 41 (19.5 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 667 VPCEPQIAPCLFDIKN--DP-CEK 687
VP +PQ+ C + + N P CEK
Sbjct: 405 VPAQPQLNVCNWAVMNWAPPGCEK 428
Score = 41 (19.5 bits), Expect = 2.1e-25, Sum P(3) = 2.1e-25
Identities = 10/24 (41%), Positives = 14/24 (58%)
Query: 773 VPCEPQIAPCLFDIKN--DP-CEK 793
VP +PQ+ C + + N P CEK
Sbjct: 405 VPAQPQLNVCNWAVMNWAPPGCEK 428
>UNIPROTKB|F1NGC8 [details] [associations]
symbol:STS "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017431 EMBL:AADN02017432
EMBL:AADN02017433 IPI:IPI00584657 Ensembl:ENSGALT00000026830
OMA:HTAMFAS Uniprot:F1NGC8
Length = 471
Score = 239 (89.2 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ ++ADDLG D+G +G + TP+ID LA G+ L + LCTPSR+A +TG+
Sbjct: 38 PNVVLLIADDLGIGDLGCYGNRTLRTPHIDRLAKEGVTLTQHIAASPLCTPSRAAFLTGR 97
Query: 119 HPIHTGMQ--HNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+PI +GM V L+ GGLP E + LK+ GY T ++GKWHLG
Sbjct: 98 YPIRSGMAAFSRVGVFLFSASSGGLPSEEITFSKLLKQRGYATALIGKWHLG 149
Score = 127 (49.8 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
Identities = 49/199 (24%), Positives = 86/199 (43%)
Query: 222 TDVFTAEAVDIIH-NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
T T EAV I NH+ P L L++ H+A L A + RH
Sbjct: 272 TQRLTTEAVRFIERNHNA--PFLLVLSYLHVHTA-----LYASKMFRGKSRH------GL 318
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGV 336
+ + ++D SVG++++ LE + + S++ F SD N +G
Sbjct: 319 YGDAVEEMDWSVGQILDVLENYNLSNRSLVYFSSDQGAHIEEISSSGEVHGGCNGIYKGG 378
Query: 337 KNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
K+T WEGG+R GL+ W ++ + G + D PT++ A + +P +
Sbjct: 379 KSTNWEGGIRVPGLLRWPGVIHA-GTYIDDPTSNMDIFPTIVKLAG-AQLPYDRIIDGHD 436
Query: 396 IIPRYENSILRYENGTHEY 414
++P + ++R + HE+
Sbjct: 437 LMPLLQGKVIRSK---HEF 452
Score = 55 (24.4 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
L+ GGLP E + LK+ GY T ++
Sbjct: 114 LFSASSGGLPSEEITFSKLLKQRGYATALI 143
Score = 39 (18.8 bits), Expect = 3.2e-25, Sum P(3) = 3.2e-25
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 431 HEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
H+ P + + HE+ Y N Y N + P+N
Sbjct: 435 HDLMPLLQGKVIRSKHEFLFHYCNAYLNAVRWH--PRN 470
>UNIPROTKB|F6PN86 [details] [associations]
symbol:ARSF "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OrthoDB:EOG4V4379 OMA:LKPCCGV
EMBL:AAEX03026108 Ensembl:ENSCAFT00000017756 Uniprot:F6PN86
Length = 584
Score = 236 (88.1 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
Identities = 53/136 (38%), Positives = 76/136 (55%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ DDLG D+G G D I TPNID LA G+ L ++ +CTPSR+A +TG+
Sbjct: 30 PNIVLMMVDDLGIGDLGCFGNDTIRTPNIDRLAREGVQLNHHIAAASMCTPSRAAFLTGR 89
Query: 119 HPIHTGMQHN------VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF-----Y 167
+PI +GM N + G GLP +E LK+ GY T ++GKWH G Y
Sbjct: 90 YPIRSGMVSNAVDRVIITLGAP-AGLPHNETTFAALLKKQGYSTALIGKWHQGLNCQSRY 148
Query: 168 KKEYTPTFRGFESHLG 183
+ + P GF+ + G
Sbjct: 149 DQCHHPYHYGFDYYYG 164
Score = 129 (50.5 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
Identities = 40/143 (27%), Positives = 66/143 (46%)
Query: 277 KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--L 333
K + + ++D VGK+++A++ + + +++ F SD W
Sbjct: 307 KHGLYGDNVQEMDSMVGKILDAIDNFHLKNRTLVYFTSDHGGHLESRVGHSQRGGWNGIY 366
Query: 334 RGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
RG K WEGG+R GLI WS L + G V E+ + D PTL +A + S +P
Sbjct: 367 RGGKGMAGWEGGIRVPGLIRWSGRLPA-GKVIEEPTSLMDIFPTL-AAVSGSSVPQDRVI 424
Query: 392 TVENIIPRYENSILRYENGTHEY 414
N++P + + R E HE+
Sbjct: 425 DGRNLMPLLQGEVQRSE---HEF 444
Score = 46 (21.3 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E L +E
Sbjct: 503 PLLFDLTRDPSESTPLTQDTE 523
Score = 46 (21.3 bits), Expect = 4.3e-25, Sum P(3) = 4.3e-25
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E L +E
Sbjct: 503 PLLFDLTRDPSESTPLTQDTE 523
Score = 42 (19.8 bits), Expect = 0.00016, Sum P(3) = 0.00016
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 13 GLPLSEKILPQYLKELGYRTRIM 35
GLP +E LK+ GY T ++
Sbjct: 113 GLPHNETTFAALLKKQGYSTALI 135
Score = 38 (18.4 bits), Expect = 2.7e-15, Sum P(2) = 2.7e-15
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 389 VNSTVENIIPRYENSILRYENGTHE--YNSP 417
V TV N + + SIL + E Y+SP
Sbjct: 529 VIQTVANAVKEHRKSILPVQQQLSELNYDSP 559
>ZFIN|ZDB-GENE-030717-5 [details] [associations]
symbol:sts "steroid sulfatase (microsomal),
arylsulfatase C, isozyme S" species:7955 "Danio rerio" [GO:0003824
"catalytic activity" evidence=IEA] [GO:0008152 "metabolic process"
evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030717-5
Gene3D:3.40.720.10 SUPFAM:SSF53649 GeneTree:ENSGT00560000076940
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:CT990606
EMBL:BX901898 IPI:IPI00963580 Ensembl:ENSDART00000075252
ArrayExpress:F1Q8F9 Bgee:F1Q8F9 Uniprot:F1Q8F9
Length = 587
Score = 236 (88.1 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
Identities = 62/164 (37%), Positives = 86/164 (52%)
Query: 36 AFAVLPLAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+F +P F L + D A SG P+ +F++ DDLG D+G +G + TPNID LA
Sbjct: 11 SFQWIPCTFCLLLYTAD--AGSGTKPNFVFMMVDDLGIGDLGCYGNTTLRTPNIDRLALE 68
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHT---GMQ-HN----VLYGCERGGLPLSEKIL 145
G+ L + LCTPSR+A +TG++PI + GM H L+ GGLP E
Sbjct: 69 GVKLTQHIAAAPLCTPSRAAFLTGRYPIRSDAKGMAAHGHMGVFLFSASSGGLPQEEITF 128
Query: 146 PQYLKELGYRTR-IVGKWHLGFYKKE-----YTPTFRGFESHLG 183
+ +K GY T IVGKWHLG ++ + P GF+ G
Sbjct: 129 AKAVKVQGYSTAVIVGKWHLGLNCEDSSDHCHHPNSHGFDYFYG 172
Score = 126 (49.4 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
Identities = 44/193 (22%), Positives = 88/193 (45%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T T+EA++ + +S + P L+ + H+ PL + +H +
Sbjct: 269 TQRMTSEAIEFLERNS-ETPFLLFFSFIQVHTGVFASPL-----FRGRSQH------GLY 316
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV----K 337
+ ++D SVG++++ LE+ + N+++ SD + G+ K
Sbjct: 317 GDAVMEVDWSVGQIMQTLERLNLKDNTLVYMTSDQGPHLEEISVHGEMHGGYSGIYKAGK 376
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENI 396
+T WEGG+R G++ W +L + I+ E ++ D PT+L+ A S IP+ ++
Sbjct: 377 STNWEGGIRIPGILSWPGVLPAGNIIDEPTSNM-DIFPTVLNLAGAS-IPDDRVIDGHDL 434
Query: 397 IPRYENSILRYEN 409
+P + + R E+
Sbjct: 435 LPLLQGQVKRSEH 447
Score = 49 (22.3 bits), Expect = 4.4e-25, Sum P(3) = 4.4e-25
Identities = 13/34 (38%), Positives = 17/34 (50%)
Query: 675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
P L+D+ DP E L+ +E Q H EV R
Sbjct: 508 PLLYDLSKDPTESTPLSPDTEPQF--HSVLEVIR 539
Score = 48 (22.0 bits), Expect = 4.4e-05, Sum P(3) = 4.4e-05
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
L+ GGLP E + +K GY T ++
Sbjct: 113 LFSASSGGLPQEEITFAKAVKVQGYSTAVI 142
Score = 47 (21.6 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 781 PCLFDIKNDPCEKNNLADRSEVQ 803
P L+D+ DP E L+ +E Q
Sbjct: 508 PLLYDLSKDPTESTPLSPDTEPQ 530
>UNIPROTKB|Q482D2 [details] [associations]
symbol:CPS_2368 "Putative N-acetylglucosamine-6-sulfatase"
species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
"metabolic process" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
Uniprot:Q482D2
Length = 537
Score = 275 (101.9 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 77/230 (33%), Positives = 120/230 (52%)
Query: 40 LPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L L F++S + + + ++I+IL DD +++VGF +I TPN+D LA G+
Sbjct: 15 LSLCFSVSSLSATVNKTVKQKKNVIYILTDDQRYDEVGFLN-PRIDTPNMDKLAAGGVYF 73
Query: 99 KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYR 155
KN + T LC+PSR+ I+TG++ HN +G P E + P YL+E+GY
Sbjct: 74 KNAFVTTALCSPSRATILTGQY------MHN--HGVVDNNNPAKESSVYFPSYLQEVGYE 125
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T GKWH+G + P GF+ L + G Y+ ++ + +++ + D
Sbjct: 126 TSFFGKWHMGGHGDSPQP---GFDHWLSF-AGQGHYYPKKDKKGRTNKININGERV---D 178
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
G Y TD T AVD + +D+P F+YL+H A HS N ++P AP H
Sbjct: 179 QKG-YITDELTDYAVDWLDKRDSDKPFFMYLSHKAVHS-N-FDP--APRH 223
Score = 69 (29.3 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 31/144 (21%), Positives = 64/144 (44%)
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
++++KR A L +D+S+G+V++ L+ + +++I++ + D
Sbjct: 272 VQEYKRQYHRA-LSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID----- 325
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
K +E +R L ++P G V ++ V D PT+L A P + +
Sbjct: 326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAGAKK-PAHFDG- 379
Query: 393 VENIIPRYENSILRY--ENGTHEY 414
++ +P +N + EN +EY
Sbjct: 380 -DSWLPLAKNKEVNQWRENFLYEY 402
Score = 59 (25.8 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 677 LFDIKNDPCEKNNLADRSEDQ 697
L+D+KNDP E NNL + + Q
Sbjct: 436 LYDLKNDPKEMNNLINTPKHQ 456
Score = 57 (25.1 bits), Expect = 8.3e-25, Sum P(3) = 8.3e-25
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 783 LFDIKNDPCEKNNLADRSEVQ 803
L+D+KNDP E NNL + + Q
Sbjct: 436 LYDLKNDPKEMNNLINTPKHQ 456
>TIGR_CMR|CPS_2368 [details] [associations]
symbol:CPS_2368 "putative N-acetylglucosamine-6-sulfatase"
species:167879 "Colwellia psychrerythraea 34H" [GO:0008152
"metabolic process" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:CP000083 GenomeReviews:CP000083_GR
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008449 RefSeq:YP_269086.1
ProteinModelPortal:Q482D2 STRING:Q482D2 GeneID:3522371
KEGG:cps:CPS_2368 PATRIC:21467821 HOGENOM:HOG000024136 OMA:SHKAVHS
ProtClustDB:CLSK824923 BioCyc:CPSY167879:GI48-2431-MONOMER
Uniprot:Q482D2
Length = 537
Score = 275 (101.9 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 77/230 (33%), Positives = 120/230 (52%)
Query: 40 LPLAFTLSMVFVDLVAS-SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL 98
L L F++S + + + ++I+IL DD +++VGF +I TPN+D LA G+
Sbjct: 15 LSLCFSVSSLSATVNKTVKQKKNVIYILTDDQRYDEVGFLN-PRIDTPNMDKLAAGGVYF 73
Query: 99 KN-YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKIL--PQYLKELGYR 155
KN + T LC+PSR+ I+TG++ HN +G P E + P YL+E+GY
Sbjct: 74 KNAFVTTALCSPSRATILTGQY------MHN--HGVVDNNNPAKESSVYFPSYLQEVGYE 125
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T GKWH+G + P GF+ L + G Y+ ++ + +++ + D
Sbjct: 126 TSFFGKWHMGGHGDSPQP---GFDHWLSF-AGQGHYYPKKDKKGRTNKININGERV---D 178
Query: 216 LHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH 265
G Y TD T AVD + +D+P F+YL+H A HS N ++P AP H
Sbjct: 179 QKG-YITDELTDYAVDWLDKRDSDKPFFMYLSHKAVHS-N-FDP--APRH 223
Score = 69 (29.3 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 31/144 (21%), Positives = 64/144 (44%)
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
++++KR A L +D+S+G+V++ L+ + +++I++ + D
Sbjct: 272 VQEYKRQYHRA-LSAVDDSLGRVLKWLKDNNLENDTIVMLMGDNGFMFGEHGLID----- 325
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNST 392
K +E +R L ++P G V ++ V D PT+L A P + +
Sbjct: 326 ----KRNAYEESMRVPLLAYAPGYFKPGTVVDEMVANLDIAPTILEIAGAKK-PAHFDG- 379
Query: 393 VENIIPRYENSILRY--ENGTHEY 414
++ +P +N + EN +EY
Sbjct: 380 -DSWLPLAKNKEVNQWRENFLYEY 402
Score = 59 (25.8 bits), Expect = 5.2e-25, Sum P(3) = 5.2e-25
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 677 LFDIKNDPCEKNNLADRSEDQ 697
L+D+KNDP E NNL + + Q
Sbjct: 436 LYDLKNDPKEMNNLINTPKHQ 456
Score = 57 (25.1 bits), Expect = 8.3e-25, Sum P(3) = 8.3e-25
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 783 LFDIKNDPCEKNNLADRSEVQ 803
L+D+KNDP E NNL + + Q
Sbjct: 436 LYDLKNDPKEMNNLINTPKHQ 456
>UNIPROTKB|I3LBW8 [details] [associations]
symbol:STS "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:GLSCQCD EMBL:FP102981
EMBL:FP339595 Ensembl:ENSSSCT00000032160 Uniprot:I3LBW8
Length = 579
Score = 242 (90.2 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
Identities = 50/112 (44%), Positives = 69/112 (61%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ + ++ADDLG D G +G + TPNID LA G+ L + LCTPSR+A +TG+
Sbjct: 23 PNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGR 82
Query: 119 HPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+PI +GM Q+ V ++ GGLP SE + LK GY T ++GKWHLG
Sbjct: 83 YPIRSGMAAQNQVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLG 134
Score = 116 (45.9 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
Identities = 40/164 (24%), Positives = 75/164 (45%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T TA+AV I ++ + P L L+ H+A L + + +H +
Sbjct: 257 TQRLTADAVRFIQRNA-ESPFLLVLSFLHVHTA-----LFSSKIFAGKSKH------GAY 304
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
++D SVG++++ L++ ++ +N++I F SD SN +G K
Sbjct: 305 GDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHGGSNGIYKGGK 364
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T WEGG+R G++ W ++++ G+ + D PT+ + A
Sbjct: 365 ATNWEGGIRVPGILRWPGVIQA-GLELDAPTSNMDLFPTVANLA 407
Score = 54 (24.1 bits), Expect = 9.5e-05, Sum P(3) = 9.5e-05
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
++ GGLP SE + LK GY T ++
Sbjct: 99 IFSASSGGLPPSEITFAKLLKSQGYTTALI 128
Score = 50 (22.7 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFDI DP E + L SE
Sbjct: 496 PLLFDISQDPRETDPLTPTSE 516
Score = 50 (22.7 bits), Expect = 7.1e-25, Sum P(3) = 7.1e-25
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFDI DP E + L SE
Sbjct: 496 PLLFDISQDPRETDPLTPTSE 516
>UNIPROTKB|K7GLQ3 [details] [associations]
symbol:STS "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 EMBL:FP102981 EMBL:FP339595
Ensembl:ENSSSCT00000035627 Uniprot:K7GLQ3
Length = 580
Score = 242 (90.2 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
Identities = 50/112 (44%), Positives = 69/112 (61%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ + ++ADDLG D G +G + TPNID LA G+ L + LCTPSR+A +TG+
Sbjct: 24 PNFVLLMADDLGIGDPGCYGNKTLRTPNIDRLAGGGVKLTQHLAAAPLCTPSRAAFLTGR 83
Query: 119 HPIHTGM--QHNV---LYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+PI +GM Q+ V ++ GGLP SE + LK GY T ++GKWHLG
Sbjct: 84 YPIRSGMAAQNQVGVFIFSASSGGLPPSEITFAKLLKSQGYTTALIGKWHLG 135
Score = 116 (45.9 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
Identities = 40/164 (24%), Positives = 75/164 (45%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T TA+AV I ++ + P L L+ H+A L + + +H +
Sbjct: 258 TQRLTADAVRFIQRNA-ESPFLLVLSFLHVHTA-----LFSSKIFAGKSKH------GAY 305
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX----SNWPLRGVK 337
++D SVG++++ L++ ++ +N++I F SD SN +G K
Sbjct: 306 GDATEEMDWSVGQILDVLDELKLANNTLIYFSSDQGAHVEEVTVKGEVHGGSNGIYKGGK 365
Query: 338 NTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
T WEGG+R G++ W ++++ G+ + D PT+ + A
Sbjct: 366 ATNWEGGIRVPGILRWPGVIQA-GLELDAPTSNMDLFPTVANLA 408
Score = 54 (24.1 bits), Expect = 9.5e-05, Sum P(3) = 9.5e-05
Identities = 11/30 (36%), Positives = 16/30 (53%)
Query: 6 LYGCERGGLPLSEKILPQYLKELGYRTRIM 35
++ GGLP SE + LK GY T ++
Sbjct: 100 IFSASSGGLPPSEITFAKLLKSQGYTTALI 129
Score = 50 (22.7 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFDI DP E + L SE
Sbjct: 497 PLLFDISQDPRETDPLTPTSE 517
Score = 50 (22.7 bits), Expect = 7.2e-25, Sum P(3) = 7.2e-25
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFDI DP E + L SE
Sbjct: 497 PLLFDISQDPRETDPLTPTSE 517
>RGD|3783 [details] [associations]
symbol:Sts "steroid sulfatase (microsomal), isozyme S"
species:10116 "Rattus norvegicus" [GO:0004773 "steryl-sulfatase
activity" evidence=IDA] [GO:0005635 "nuclear envelope" evidence=IDA]
[GO:0005789 "endoplasmic reticulum membrane" evidence=IDA]
[GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning or
memory" evidence=IMP] [GO:0008202 "steroid metabolic process"
evidence=IEA] [GO:0008284 "positive regulation of cell proliferation"
evidence=IMP] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA;ISO] [GO:0009268 "response to pH" evidence=IDA]
[GO:0014070 "response to organic cyclic compound" evidence=IDA]
[GO:0016021 "integral to membrane" evidence=IDA] [GO:0043231
"intracellular membrane-bounded organelle" evidence=IDA] [GO:0043434
"response to peptide hormone stimulus" evidence=IDA] [GO:0043588
"skin development" evidence=IEP] [GO:0043627 "response to estrogen
stimulus" evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
RGD:3783 GO:GO:0016021 GO:GO:0005635 GO:GO:0043588 GO:GO:0005789
GO:GO:0008202 GO:GO:0046872 GO:GO:0008284 GO:GO:0043434 GO:GO:0007565
GO:GO:0009268 GO:GO:0007611 GO:GO:0043627 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
OrthoDB:EOG4V4379 CTD:412 KO:K01131 GO:GO:0004773 EMBL:U37138
IPI:IPI00210494 PIR:S05414 RefSeq:NP_036793.1 UniGene:Rn.6312
ProteinModelPortal:P15589 SMR:P15589 STRING:P15589 PRIDE:P15589
GeneID:24800 KEGG:rno:24800 InParanoid:P15589 BindingDB:P15589
ChEMBL:CHEMBL3531 NextBio:604458 Genevestigator:P15589
GermOnline:ENSRNOG00000032487 Uniprot:P15589
Length = 577
Score = 251 (93.4 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
Identities = 61/154 (39%), Positives = 84/154 (54%)
Query: 42 LAFTLSMVFVDLVASSGP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
LA LS + A GP P+ + I+ADDLG D+G +G + TP+ID LA G+ L
Sbjct: 7 LALLLSQLNFLCAARPGPGPNFLLIMADDLGIGDLGCYGNRTLRTPHIDRLALEGVKLTQ 66
Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGM-QHN----VLYGCERGGLPLSEKILPQYLKELGY 154
+ LCTPSR+A +TG++P+ +GM H L+ GGLP +E + LK GY
Sbjct: 67 HLAAAPLCTPSRAAFLTGRYPVRSGMASHGRLGVFLFSASSGGLPPNEVTFAKLLKGQGY 126
Query: 155 RTRIVGKWHLGFYKKE-----YTPTFRGFESHLG 183
T +VGKWHLG + + P GF+ LG
Sbjct: 127 TTGLVGKWHLGLSCQAASDFCHHPGRHGFDRFLG 160
Score = 105 (42.0 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
Identities = 41/169 (24%), Positives = 72/169 (42%)
Query: 222 TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKF 281
T +EA D + + D P L+L+ H+A+ P A ++H +
Sbjct: 260 TQRLASEAGDFLRRNR-DTPFLLFLSFMHVHTAHFANPEFAGQ---SLH--------GAY 307
Query: 282 AAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXX----XXXXSNWPLRGVK 337
+ ++D +VG+V+ L++ + +N+++ SD SN RG K
Sbjct: 308 GDAVEEMDWAVGQVLATLDKLGLANNTLVYLTSDHGAHVEELGPNGERHGGSNGIYRGGK 367
Query: 338 NTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R GL+ P + G E+ D PT+ A +++P
Sbjct: 368 ANTWEGGIRVPGLVRWPGVIVPGQEVEEPTSNMDVFPTVARLAG-AELP 415
Score = 50 (22.7 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFDI DP E++ L +E
Sbjct: 499 PLLFDIARDPRERHPLTPETE 519
Score = 50 (22.7 bits), Expect = 9.0e-25, Sum P(3) = 9.0e-25
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFDI DP E++ L +E
Sbjct: 499 PLLFDIARDPRERHPLTPETE 519
Score = 39 (18.8 bits), Expect = 1.2e-23, Sum P(3) = 1.2e-23
Identities = 8/34 (23%), Positives = 17/34 (50%)
Query: 628 LTGGPDQVYLSGLSDREWLALAMRKLRDAASIQC 661
L P+Q+ +S ++ + WL L + + +C
Sbjct: 540 LEEAPNQLSMSNVAWKPWLQLCLPSKPHPLACRC 573
>TIGR_CMR|SPO_3286 [details] [associations]
symbol:SPO_3286 "arylsulfatase" species:246200 "Ruegeria
pomeroyi DSS-3" [GO:0004065 "arylsulfatase activity" evidence=ISS]
[GO:0006790 "sulfur compound metabolic process" evidence=ISS]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:CP000031 GenomeReviews:CP000031_GR
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00149 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
RefSeq:YP_168482.1 ProteinModelPortal:Q5LNC6 GeneID:3193868
KEGG:sil:SPO3286 PATRIC:23380015 Uniprot:Q5LNC6
Length = 535
Score = 254 (94.5 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
Identities = 76/213 (35%), Positives = 105/213 (49%)
Query: 57 SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMT 116
S P+II ILADDLG+ D+G G +I TPNID LA G +L Y C P+R++++T
Sbjct: 2 SRKPNIILILADDLGFADLGCTG-SEIRTPNIDGLARDGALLTAMYNCARCCPTRASLLT 60
Query: 117 GKHPIHTGMQH-NVLYGCE--RGGLPLSEKILPQYLKELGYRTRIVGKWHLG--FYKKEY 171
G +P + G+ H G RG L + ++L+ GYRT + GKWH+G F +E
Sbjct: 61 GLYPHNAGIGHMGADLGTPAYRGFLRNDCATIAEHLRAAGYRTCMSGKWHVGGDFMAREV 120
Query: 172 -----------TPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY 220
TP RGF+ G G +F M D R + P D Y
Sbjct: 121 DSWRVGDVDHPTPRQRGFDRFYGIVDGVTHFFSPHY----MLEDDTRVETFPD-DF---Y 172
Query: 221 STDVFTAEAVDIIHNH-STDEPLFLYLAHAATH 252
TD T +A+ ++ ++P FLYLAH A H
Sbjct: 173 FTDAITDKAIGMVEEAVEMEQPFFLYLAHTAPH 205
Score = 85 (35.0 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
Identities = 15/46 (32%), Positives = 32/46 (69%)
Query: 270 HRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
H+ E K + +AA++ ++D+S+G ++ AL++ N++I+F+SD
Sbjct: 264 HKDWEARKMATYAAMVDRMDQSIGTLLAALKRMGQFDNTLILFLSD 309
Score = 63 (27.2 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
Identities = 19/57 (33%), Positives = 26/57 (45%)
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
SN P R K+ + EGG+ + P + + HV D LPT+L AA I
Sbjct: 363 SNAPFRKFKHYVHEGGISTPLIAHWPGRIAAPVPLHAACHVVDILPTILEAAGAPPI 419
Score = 38 (18.4 bits), Expect = 1.1e-24, Sum P(4) = 1.1e-24
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 677 LFDIKNDPCEKNNLADRSEDQRINHYTTE 705
L+DI+ D E N+L R E R E
Sbjct: 477 LYDIEADRTELNDLI-RGEPDRAKALVAE 504
>UNIPROTKB|F1NFQ0 [details] [associations]
symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00587215
Ensembl:ENSGALT00000026860 OMA:GHYKAVF ArrayExpress:F1NFQ0
Uniprot:F1NFQ0
Length = 590
Score = 259 (96.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
Identities = 57/135 (42%), Positives = 82/135 (60%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ + +LADDLG DVG +G D I TPNID LA G+ L + T LCTPSR+A++TG+
Sbjct: 35 PNFVLLLADDLGIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGR 94
Query: 119 HPIHTGMQ--HN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF---YKKE 170
+PI +GM +N + + GGLP +E + L++ GY T ++GKWHLG ++ +
Sbjct: 95 YPIRSGMDAVNNYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRND 154
Query: 171 YT--PTFRGFESHLG 183
+ P GFE G
Sbjct: 155 HCHHPLNHGFEYFYG 169
Score = 101 (40.6 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
Identities = 42/199 (21%), Positives = 86/199 (43%)
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+T E++ I + +P L+L+ +H+ PL + +L H
Sbjct: 268 TTSFILRESISFIERNK-HKPFLLFLSFLHSHT-----PLLTTEKFLGKSGH------GL 315
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX-XXXXXXXXXXSNWP--LRGVK 337
+ + ++D VG+V++A++++ + N+++ F SD W RG K
Sbjct: 316 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGGWLERQEGKRQLGGWNGIYRGGK 375
Query: 338 NTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVEN 395
WEGG+R G+ W +L + G V + + D PT++ A +P +
Sbjct: 376 AMGGWEGGIRVPGIFRWPGVLPA-GKVINEPTSLMDIYPTVVHLAG-GVVPQDRVIDGRD 433
Query: 396 IIPRYENSILRYENGTHEY 414
++P + ++ E+ H++
Sbjct: 434 LMPLLQGTV---EHSEHKF 449
Score = 43 (20.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P L+D+ DP E L+ +E
Sbjct: 508 PLLYDLSRDPSESQPLSADTE 528
Score = 43 (20.2 bits), Expect = 1.8e-24, Sum P(3) = 1.8e-24
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E L+ +E
Sbjct: 508 PLLYDLSRDPSESQPLSADTE 528
>UNIPROTKB|G3N2T7 [details] [associations]
symbol:ARSH "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:DAAA02075309
EMBL:DAAA02075310 EMBL:DAAA02075311 Ensembl:ENSBTAT00000063647
Uniprot:G3N2T7
Length = 557
Score = 229 (85.7 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
Identities = 50/136 (36%), Positives = 78/136 (57%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 2 PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 61
Query: 119 HPIHTGM------QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY--KKE 170
+P+ +GM +V++ GGLP +E + L+ GYRT ++GKWH G ++
Sbjct: 62 YPVRSGMASSSNLNRDVVWLGGSGGLPPNETTFAKLLQHRGYRTGLIGKWHQGLSCASRD 121
Query: 171 ---YTPTFRGFESHLG 183
Y P GF+ G
Sbjct: 122 DHCYHPLNHGFDYFYG 137
Score = 124 (48.7 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
Identities = 46/163 (28%), Positives = 77/163 (47%)
Query: 266 YLNIHRHI---EDFK-RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
+L++H + E F SKF + ++D VGKV+EAL++ R+ +++++ F SD
Sbjct: 262 FLHVHTPLVTKEKFVGHSKFGLYGDNVEEMDWMVGKVLEALDRERLANHTLVYFTSDNGG 321
Query: 319 XXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
W RG + WEGG+R G+ W +LE+ G V ++ + D
Sbjct: 322 RLEAQDRSGQLGGWNGRYRGGRGMAGWEGGIRVPGIFRWPTVLEA-GKVIDEPTSLMDIF 380
Query: 374 PTLLSAANKSDIPNYVNSTVE--NIIPRYENSILRYENGTHEY 414
PTL IP + ++ N++P E + R E HE+
Sbjct: 381 PTLSYIGG--GIPP-LGRVIDGRNLMPLLEGRVSRSE---HEF 417
Score = 50 (22.7 bits), Expect = 4.6e-05, Sum P(3) = 4.6e-05
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 12 GGLPLSEKILPQYLKELGYRTRIM 35
GGLP +E + L+ GYRT ++
Sbjct: 85 GGLPPNETTFAKLLQHRGYRTGLI 108
Score = 48 (22.0 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
Identities = 12/32 (37%), Positives = 14/32 (43%)
Query: 668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
PC + P LFDI DP E L +E
Sbjct: 464 PCSGDVTYHDPPLLFDISRDPSESRPLNPDNE 495
Score = 48 (22.0 bits), Expect = 4.0e-24, Sum P(3) = 4.0e-24
Identities = 12/32 (37%), Positives = 14/32 (43%)
Query: 774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
PC + P LFDI DP E L +E
Sbjct: 464 PCSGDVTYHDPPLLFDISRDPSESRPLNPDNE 495
>UNIPROTKB|F1NFQ1 [details] [associations]
symbol:ARSH "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AADN02017595 IPI:IPI00600266
Ensembl:ENSGALT00000026858 ArrayExpress:F1NFQ1 Uniprot:F1NFQ1
Length = 579
Score = 259 (96.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
Identities = 57/135 (42%), Positives = 82/135 (60%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ + +LADDLG DVG +G D I TPNID LA G+ L + T LCTPSR+A++TG+
Sbjct: 22 PNFVLLLADDLGIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAALLTGR 81
Query: 119 HPIHTGMQ--HN---VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGF---YKKE 170
+PI +GM +N + + GGLP +E + L++ GY T ++GKWHLG ++ +
Sbjct: 82 YPIRSGMDAVNNYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRND 141
Query: 171 YT--PTFRGFESHLG 183
+ P GFE G
Sbjct: 142 HCHHPLNHGFEYFYG 156
Score = 96 (38.9 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
Identities = 41/201 (20%), Positives = 87/201 (43%)
Query: 221 STDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+T E++ I + +P L+L+ +H+ PL + +L H
Sbjct: 255 TTSFILRESISFIERNK-HKPFLLFLSFLHSHT-----PLLTTEKFLGKSGH------GL 302
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX-XXXXXXXXXXSNWP----LRG 335
+ + ++D VG+V++A++++ + N+++ F SD W ++G
Sbjct: 303 YGDNVEEMDWMVGQVLDAIDKKGLKKNTLVYFASDHGGWLERQEGKRQLGGWNGIYRVKG 362
Query: 336 VKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTV 393
K WEGG+R G+ W +L + G V + + D PT++ A +P
Sbjct: 363 GKAMGGWEGGIRVPGIFRWPGVLPA-GKVINEPTSLMDIYPTVVHLAG-GVVPQDRVIDG 420
Query: 394 ENIIPRYENSILRYENGTHEY 414
+++P + ++ E+ H++
Sbjct: 421 RDLMPLLQGTV---EHSEHKF 438
Score = 43 (20.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P L+D+ DP E L+ +E
Sbjct: 497 PLLYDLSRDPSESQPLSADTE 517
Score = 43 (20.2 bits), Expect = 5.0e-24, Sum P(3) = 5.0e-24
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E L+ +E
Sbjct: 497 PLLYDLSRDPSESQPLSADTE 517
>UNIPROTKB|P54793 [details] [associations]
symbol:ARSF "Arylsulfatase F" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005576
"extracellular region" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0044281 "small molecule metabolic process"
evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005576 GO:GO:0044281 GO:GO:0046872
GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
OrthoDB:EOG4V4379 EMBL:X97868 EMBL:AC112653 EMBL:BC022389
IPI:IPI00008405 PIR:A56217 RefSeq:NP_001188467.1
RefSeq:NP_001188468.1 RefSeq:NP_004033.2 UniGene:Hs.101674
ProteinModelPortal:P54793 SMR:P54793 IntAct:P54793 STRING:P54793
PhosphoSite:P54793 DMDM:259016386 PaxDb:P54793 PRIDE:P54793
Ensembl:ENST00000359361 Ensembl:ENST00000381127
Ensembl:ENST00000537104 GeneID:416 KEGG:hsa:416 UCSC:uc004cre.2
CTD:416 GeneCards:GC0XP002978 H-InvDB:HIX0016636 HGNC:HGNC:721
HPA:HPA000549 MIM:300003 neXtProt:NX_P54793 PharmGKB:PA25012
InParanoid:P54793 OMA:LKPCCGV PhylomeDB:P54793 GenomeRNAi:416
NextBio:1759 Bgee:P54793 CleanEx:HS_ARSF Genevestigator:P54793
GermOnline:ENSG00000062096 Uniprot:P54793
Length = 590
Score = 233 (87.1 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
Identities = 49/112 (43%), Positives = 69/112 (61%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ I+ DDLG D+G +G D + TP+ID LA G+ L + + LC+PSRSA +TG+
Sbjct: 30 PNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGR 89
Query: 119 HPIHTGM----QHNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+PI +GM V+ GLPL+E L LK+ GY T ++GKWH G
Sbjct: 90 YPIRSGMVSSGNRRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQG 141
Score = 118 (46.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
Identities = 42/195 (21%), Positives = 82/195 (42%)
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ EA+ + HS E L+ + H+ PL D + +H +
Sbjct: 266 IMVKEAISFLERHSK-ETFLLFFSFLHVHT-----PLPTTDDFTGTSKH------GLYGD 313
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL 340
+ ++D VGK+++A++ + +N+++ F SD W +G K
Sbjct: 314 NVEEMDSMVGKILDAIDDFGLRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMG 373
Query: 341 -WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
WEGG+R G++ P G + ++ + D LPT+ S + S +P +++P
Sbjct: 374 GWEGGIRVPGIVRWPGKVPAGRLIKEPTSLMDILPTVASVSGGS-LPQDRVIDGRDLMPL 432
Query: 400 YENSILRYENGTHEY 414
+ ++ E HE+
Sbjct: 433 LQGNVRHSE---HEF 444
Score = 52 (23.4 bits), Expect = 0.00020, Sum P(3) = 0.00020
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 13 GLPLSEKILPQYLKELGYRTRIM 35
GLPL+E L LK+ GY T ++
Sbjct: 113 GLPLNETTLAALLKKQGYSTGLI 135
Score = 47 (21.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E L +E
Sbjct: 504 PLLFDLSRDPSESTPLTPATE 524
Score = 47 (21.6 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E L +E
Sbjct: 504 PLLFDLSRDPSESTPLTPATE 524
>UNIPROTKB|F5GYY5 [details] [associations]
symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
HGNC:HGNC:719 OMA:PVINRCA IPI:IPI00020005 ProteinModelPortal:F5GYY5
SMR:F5GYY5 Ensembl:ENST00000545496 UCSC:uc011mhh.2
ArrayExpress:F5GYY5 Bgee:F5GYY5 Uniprot:F5GYY5
Length = 614
Score = 260 (96.6 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
Identities = 71/197 (36%), Positives = 107/197 (54%)
Query: 5 VLYGCERGGLPLSEKILPQYLKE--LGYRTRI-MAF-AVLP--LAFTLSMV-FVDLVASS 57
V+ C G L L +LPQ E + + +I + F + LP LA LS+ S+
Sbjct: 4 VINRCAPGSLDL---MLPQAASEGIVFHSLQISLCFRSWLPAMLAVLLSLAPSASSDISA 60
Query: 58 GPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMT 116
P+I+ ++ADDLG D+G +G + + TPNID LA G+ L + + LCTPSR+A +T
Sbjct: 61 SRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLT 120
Query: 117 GKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKE- 170
G++P+ +GM ++ Y + GGLP +E + LKE GY T ++GKWHLG +
Sbjct: 121 GRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESA 180
Query: 171 ----YTPTFRGFESHLG 183
+ P GF+ G
Sbjct: 181 SDHCHHPLHHGFDHFYG 197
Score = 85 (35.0 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
Identities = 25/107 (23%), Positives = 50/107 (46%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
+ ++D VG++++ L+ + ++++I F SD W +G K
Sbjct: 348 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 407
Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R G+ W +L + ++ E + D PT++ A ++P
Sbjct: 408 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 452
Score = 49 (22.3 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E + L SE
Sbjct: 536 PLLFDLSRDPSETHILTPASE 556
Score = 49 (22.3 bits), Expect = 1.9e-23, Sum P(3) = 1.9e-23
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E + L SE
Sbjct: 536 PLLFDLSRDPSETHILTPASE 556
>MGI|MGI:98438 [details] [associations]
symbol:Sts "steroid sulfatase" species:10090 "Mus musculus"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0004773
"steryl-sulfatase activity" evidence=ISO] [GO:0005635 "nuclear
envelope" evidence=ISO] [GO:0005783 "endoplasmic reticulum"
evidence=IEA] [GO:0005789 "endoplasmic reticulum membrane"
evidence=ISO] [GO:0006629 "lipid metabolic process" evidence=IEA]
[GO:0007565 "female pregnancy" evidence=IEA] [GO:0007611 "learning
or memory" evidence=ISO] [GO:0008152 "metabolic process"
evidence=ISO] [GO:0008202 "steroid metabolic process" evidence=IEA]
[GO:0008284 "positive regulation of cell proliferation"
evidence=ISO] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISO] [GO:0009268 "response to pH" evidence=ISO]
[GO:0014070 "response to organic cyclic compound" evidence=ISO]
[GO:0016020 "membrane" evidence=IEA] [GO:0016021 "integral to
membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0043231 "intracellular membrane-bounded
organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
stimulus" evidence=ISO] [GO:0043627 "response to estrogen stimulus"
evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 MGI:MGI:98438 GO:GO:0016021 GO:GO:0005789
GO:GO:0008202 GO:GO:0046872 GO:GO:0007565 Gene3D:3.40.720.10
SUPFAM:SSF53649 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 CTD:412 KO:K01131
GO:GO:0004773 EMBL:U37545 IPI:IPI00118038 RefSeq:NP_033319.1
UniGene:Mm.423011 ProteinModelPortal:P50427 SMR:P50427
PhosphoSite:P50427 PRIDE:P50427 GeneID:20905 KEGG:mmu:20905
NextBio:299773 CleanEx:MM_STS Genevestigator:P50427 Uniprot:P50427
Length = 624
Score = 243 (90.6 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
Identities = 55/136 (40%), Positives = 77/136 (56%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTG 117
PP+ + I+ADDLG D+G +G + TP++D LA G+ L + LCTPSR+A +TG
Sbjct: 34 PPNFLLIMADDLGIGDLGCYGNKTLRTPHLDRLAREGVKLTQHLAAAPLCTPSRAAFLTG 93
Query: 118 KHPIHTGMQ-HN----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT 172
++P +GM H L+ GGLP SE + + LK GY T ++GKWHLG + T
Sbjct: 94 RYPPRSGMAAHGRVGVYLFTASSGGLPPSEVTMARLLKGRGYATALIGKWHLGLSCRGAT 153
Query: 173 -----PTFRGFESHLG 183
P GF+ LG
Sbjct: 154 DFCHHPLRHGFDRFLG 169
Score = 102 (41.0 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
Identities = 44/161 (27%), Positives = 69/161 (42%)
Query: 266 YLNIHR-HIED--FK-RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
+L++H H D F RS A + ++D VG+V+ AL++ + +++ F SD
Sbjct: 294 FLHVHTAHFADPGFAGRSLHGAYGDSVEEMDWGVGRVLAALDELGLARETLVYFTSDHGA 353
Query: 319 XXXXXX----XXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
SN RG K WEGGVR L+ W L +VAE + D
Sbjct: 354 HVEELGPRGERMGGSNGVFRGGKGNNWEGGVRVPCLVRWPRELSPGRVVAEP-TSLMDVF 412
Query: 374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
PT+ A +++P +++P R E HE+
Sbjct: 413 PTVARLAG-AELPGDRVIDGRDLMPLLRGDAQRSE---HEF 449
Score = 49 (22.3 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 662 GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 695
GP +P P LFD+ DP E+ L +E
Sbjct: 497 GPAHVTAHDP---PLLFDLTRDPGERRPLTPEAE 527
Score = 49 (22.3 bits), Expect = 3.1e-23, Sum P(3) = 3.1e-23
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 768 GPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 801
GP +P P LFD+ DP E+ L +E
Sbjct: 497 GPAHVTAHDP---PLLFDLTRDPGERRPLTPEAE 527
>TIGR_CMR|CPS_2364 [details] [associations]
symbol:CPS_2364 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 HOGENOM:HOG000135352 InterPro:IPR024607
PROSITE:PS00149 GO:GO:0008484 RefSeq:YP_269082.1
ProteinModelPortal:Q482D6 STRING:Q482D6 GeneID:3521400
KEGG:cps:CPS_2364 PATRIC:21467813 OMA:MEIAVIN
BioCyc:CPSY167879:GI48-2427-MONOMER Uniprot:Q482D6
Length = 492
Score = 295 (108.9 bits), Expect = 3.3e-23, Sum P(2) = 3.3e-23
Identities = 91/305 (29%), Positives = 143/305 (46%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
+ F+ L L FT S S+ P+++ +L DD G D+ +G + TPNID LA
Sbjct: 7 LLFSGLSL-FTCSQAVATPDKSTSKPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAAD 65
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELG 153
G+ N Y C PSR AI +G +P G+ G + LPLS ++LKE G
Sbjct: 66 GMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQGERVG--KHHLPLSAVTFGEHLKEAG 123
Query: 154 YRTRIVGKWHLGFYKKEYTPTFRGFESHL--GYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
Y+T +GKWHLG K+ PT +GF+S + G+W Y+ +M G + +
Sbjct: 124 YQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYY-FPYTKMSKSGKN--KGFA 178
Query: 212 PAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
+Y TD T EA+ I D+P L LAH A H+ +P + + +
Sbjct: 179 KVEGSEEEYLTDRLTDEALTFIEQKK-DQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKK 237
Query: 272 H-------------IED---FKRS-----KFAAILHKLDESVGKVVEALEQRRMLSNSII 310
I+D + ++ +AA++ +D SVG++ + L++ + N+II
Sbjct: 238 LGIANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTII 297
Query: 311 VFVSD 315
+ SD
Sbjct: 298 ILTSD 302
Score = 45 (20.9 bits), Expect = 3.3e-23, Sum P(2) = 3.3e-23
Identities = 17/71 (23%), Positives = 30/71 (42%)
Query: 511 DGIDVWSVLSRNEPSKRNTILHNI------DDEWQISALTRGKWKLVKENSINGNGTSEN 564
DG+ + L+ +E ++ H+ + SA+ G+WKL+ S G N
Sbjct: 382 DGVSYLAALNSDETPRKAMFWHSPAARPSKTGDTNSSAIIEGEWKLLDFWS-TGKVELYN 440
Query: 565 RSNDNSYQNEI 575
+D S N +
Sbjct: 441 LKDDKSEANNL 451
Score = 44 (20.5 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 677 LFDIKNDPCEKNNLA 691
L+++K+D E NNLA
Sbjct: 438 LYNLKDDKSEANNLA 452
Score = 44 (20.5 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
Identities = 8/15 (53%), Positives = 12/15 (80%)
Query: 783 LFDIKNDPCEKNNLA 797
L+++K+D E NNLA
Sbjct: 438 LYNLKDDKSEANNLA 452
Score = 42 (19.8 bits), Expect = 6.7e-23, Sum P(2) = 6.7e-23
Identities = 17/72 (23%), Positives = 33/72 (45%)
Query: 534 IDDEWQISAL-TRGKWKL--VKENSINGNGTSENRSNDNSYQ-----NEIDGIDVWSVLS 585
I+ EW++ + GK +L +K++ N ++ + N D ID +V
Sbjct: 421 IEGEWKLLDFWSTGKVELYNLKDDKSEANNLAKLMPEKTAEMLAKLTNWKDDIDAHTVKK 480
Query: 586 RNEPSKRNTILH 597
+N+ SK+ + H
Sbjct: 481 KNKKSKKKSKSH 492
Score = 39 (18.8 bits), Expect = 1.4e-22, Sum P(2) = 1.4e-22
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 511 DGIDVWSVLSRNEPSKRNTILH 532
D ID +V +N+ SK+ + H
Sbjct: 471 DDIDAHTVKKKNKKSKKKSKSH 492
>UNIPROTKB|P51690 [details] [associations]
symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005795 "Golgi
stack" evidence=IEA] [GO:0001501 "skeletal system development"
evidence=TAS] [GO:0004065 "arylsulfatase activity" evidence=TAS]
[GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
"phospholipid metabolic process" evidence=TAS] [GO:0006665
"sphingolipid metabolic process" evidence=TAS] [GO:0006687
"glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
"post-translational protein modification" evidence=TAS] [GO:0044267
"cellular protein metabolic process" evidence=TAS] [GO:0044281
"small molecule metabolic process" evidence=TAS]
Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0044281
GO:GO:0046872 GO:GO:0006644 GO:GO:0005795 GO:GO:0005788
GO:GO:0001501 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 HOVERGEN:HBG004283 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687
KO:K12374 OrthoDB:EOG4V4379 EMBL:X83573 EMBL:AK223183 EMBL:AK223199
IPI:IPI01014058 PIR:I37187 RefSeq:NP_000038.2 UniGene:Hs.386975
ProteinModelPortal:P51690 SMR:P51690 IntAct:P51690
MINT:MINT-1382153 STRING:P51690 PhosphoSite:P51690 DMDM:77416850
PaxDb:P51690 PRIDE:P51690 DNASU:415 Ensembl:ENST00000381134
GeneID:415 KEGG:hsa:415 UCSC:uc004crc.4 CTD:415
GeneCards:GC0XM002846 HGNC:HGNC:719 MIM:300180 MIM:302950
neXtProt:NX_P51690 Orphanet:79345 PharmGKB:PA25010
InParanoid:P51690 GenomeRNAi:415 NextBio:1755 ArrayExpress:P51690
Bgee:P51690 CleanEx:HS_ARSE Genevestigator:P51690
GermOnline:ENSG00000157399 Uniprot:P51690
Length = 589
Score = 254 (94.5 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
Identities = 54/139 (38%), Positives = 82/139 (58%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
S+ P+I+ ++ADDLG D+G +G + + TPNID LA G+ L + + LCTPSR+A
Sbjct: 34 SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93
Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+TG++P+ +GM ++ Y + GGLP +E + LKE GY T ++GKWHLG +
Sbjct: 94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153
Query: 170 E-----YTPTFRGFESHLG 183
+ P GF+ G
Sbjct: 154 SASDHCHHPLHHGFDHFYG 172
Score = 85 (35.0 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
Identities = 25/107 (23%), Positives = 50/107 (46%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
+ ++D VG++++ L+ + ++++I F SD W +G K
Sbjct: 323 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 382
Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R G+ W +L + ++ E + D PT++ A ++P
Sbjct: 383 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 427
Score = 49 (22.3 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E + L SE
Sbjct: 511 PLLFDLSRDPSETHILTPASE 531
Score = 49 (22.3 bits), Expect = 6.9e-23, Sum P(3) = 6.9e-23
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E + L SE
Sbjct: 511 PLLFDLSRDPSETHILTPASE 531
>UNIPROTKB|Q32KH8 [details] [associations]
symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
familiaris" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0008484
"sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0016021 GO:GO:0046872 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 KO:K12374 OrthoDB:EOG4V4379
EMBL:AAEX01047377 EMBL:BN000759 RefSeq:NP_001041588.1
UniGene:Cfa.39079 HSSP:P15289 ProteinModelPortal:Q32KH8 SMR:Q32KH8
GeneID:491720 KEGG:cfa:491720 CTD:347527 InParanoid:Q32KH8
NextBio:20864464 Uniprot:Q32KH8
Length = 562
Score = 232 (86.7 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
Identities = 52/136 (38%), Positives = 78/136 (57%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 7 PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66
Query: 119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KK 169
+PI +GM +N+ G GGLP +E + L+ GYRT ++GKWH G +
Sbjct: 67 YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126
Query: 170 E--YTPTFRGFESHLG 183
+ Y P GF+ G
Sbjct: 127 DHCYHPLNHGFDYFYG 142
Score = 109 (43.4 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
Identities = 47/192 (24%), Positives = 82/192 (42%)
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+ I + P L+++ H+ PL D ++ H K + + +
Sbjct: 248 EALAFIDRYKRG-PFLLFVSFLHVHT-----PLITKDKFVG-HS-----KYGLYGDNVEE 295
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX---SNWPLRGVKNTL-WEG 343
+D VGK++E L+Q R+ +++++ F SD SN +G + WEG
Sbjct: 296 MDWMVGKILETLDQERLTNHTLVYFTSDNGGRLEVQEGEVQLGGSNGIYKGGQGMGGWEG 355
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
G+R G+ W +L++ G V + + D PTL S +P N++P E
Sbjct: 356 GIRVPGIFRWPTVLQA-GKVINEPTSLMDIYPTL-SYIGGGMLPQDRVIDGRNLMPLLEG 413
Query: 403 SILRYENGTHEY 414
R + HE+
Sbjct: 414 ---RVSHSDHEF 422
Score = 46 (21.3 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
PC + P LFD+ DP E L +E
Sbjct: 469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500
Score = 46 (21.3 bits), Expect = 1.1e-22, Sum P(3) = 1.1e-22
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
PC + P LFD+ DP E L +E
Sbjct: 469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500
>POMBASE|SPBPB10D8.02c [details] [associations]
symbol:SPBPB10D8.02c "arylsulfatase (predicted)"
species:4896 "Schizosaccharomyces pombe" [GO:0004065 "arylsulfatase
activity" evidence=ISS] [GO:0005634 "nucleus" evidence=IDA]
[GO:0005829 "cytosol" evidence=IDA] [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PomBase:SPBPB10D8.02c GO:GO:0005829
GO:GO:0005634 GO:GO:0046872 EMBL:CU329671 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006790 KO:K01130
RefSeq:NP_595046.1 HSSP:P51691 ProteinModelPortal:Q9C0V7 SMR:Q9C0V7
STRING:Q9C0V7 EnsemblFungi:SPBPB10D8.02c.1 GeneID:2541396
KEGG:spo:SPBPB10D8.02c HOGENOM:HOG000135353 OMA:IEWTNIS
OrthoDB:EOG4DJP4T NextBio:20802503 Uniprot:Q9C0V7
Length = 554
Score = 233 (87.1 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
Identities = 70/232 (30%), Positives = 113/232 (48%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A S P+ + I+ADDLGW+DV G +I TPNI+ LA G+ L N++T C+P+RS +
Sbjct: 7 AESKKPNFLVIVADDLGWSDVSPFG-SEIHTPNIERLAKEGVRLTNFHTASACSPTRSML 65
Query: 115 MTG--KHPIHTGMQHNVLYGCER--GGLP-----LSEKI--LPQYLKELGYRTRIVGKWH 163
++G H G + + GG P L++++ LP+ L+E GY T + GKWH
Sbjct: 66 LSGTDNHIAGLGQMAETVRRFSKVWGGKPGYEGYLNDRVAALPEILQEAGYYTTMSGKWH 125
Query: 164 LGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPA-WD-LHGKYS 221
LG Y P+ RGF+ G ++F + + + L D + K
Sbjct: 126 LGLTPDRY-PSKRGFKESFALLPGGGNHFAYEPGTRENPAVPFLPPLYTHNHDPVDHKSL 184
Query: 222 TDVFTAE--AVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
+ +++ A +I E + A+ + P+ PLQ+P Y+N +R
Sbjct: 185 KNFYSSNYFAEKLIDQLKNREKSQSFFAYLPFTA--PHWPLQSPKEYINKYR 234
Score = 84 (34.6 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
Identities = 25/79 (31%), Positives = 40/79 (50%)
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
P R K + EGG+R +I P L I+++++V V D LPT+L A P +
Sbjct: 377 PSRLSKGFITEGGIRCPAIIRYPPLIKPDIISDEFVTVMDILPTILELAEVPH-PGHKFQ 435
Query: 392 TVENIIPRYENSILRYENG 410
+ +IPR + I + +G
Sbjct: 436 GRDVVIPRGKPWIDHFVHG 454
Score = 68 (29.0 bits), Expect = 1.6e-22, Sum P(3) = 1.6e-22
Identities = 12/35 (34%), Positives = 25/35 (71%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+AA++ LD ++G+V++ L+ L N+ ++F+SD
Sbjct: 293 YAAMVELLDLNIGRVIDYLKTIGELDNTFVIFMSD 327
Score = 37 (18.1 bits), Expect = 5.9e-15, Sum P(2) = 5.9e-15
Identities = 7/13 (53%), Positives = 7/13 (53%)
Query: 432 EYNPKYENRYENG 444
EY KY RY G
Sbjct: 228 EYINKYRGRYSEG 240
Score = 37 (18.1 bits), Expect = 5.9e-15, Sum P(2) = 5.9e-15
Identities = 7/13 (53%), Positives = 7/13 (53%)
Query: 447 EYNPKYENRYENG 459
EY KY RY G
Sbjct: 228 EYINKYRGRYSEG 240
>TIGR_CMR|CPS_0660 [details] [associations]
symbol:CPS_0660 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISS] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
RefSeq:YP_267410.1 ProteinModelPortal:Q488V4 STRING:Q488V4
GeneID:3519819 KEGG:cps:CPS_0660 PATRIC:21464645 OMA:NISAYTH
BioCyc:CPSY167879:GI48-747-MONOMER Uniprot:Q488V4
Length = 525
Score = 293 (108.2 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
Identities = 100/347 (28%), Positives = 159/347 (45%)
Query: 60 PHIIFILADDLGWNDVGF--HGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
P+I+ I DD+G +++ HG+ T NID +A G++ +YY CT R+A +TG
Sbjct: 39 PNILAIWGDDIGQSNISAYTHGMMGYKTTNIDRIAKEGVLFTDYYGENSCTAGRAAFITG 98
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
++P+ TG+ L G ++G L + + + LK+ GY T GK HLG K E+ PT G
Sbjct: 99 QYPVRTGLTKVGLPGSDKG-LRAEDVTIAELLKDRGYVTGQFGKNHLGD-KDEFLPTNHG 156
Query: 178 FESHLG--YWTG------HQDYFDHSAEEMKMWG----LDMRRD--LEPAWDLHGK-YST 222
F+ LG Y H DY A + K +G + D +E + L K T
Sbjct: 157 FDEFLGNLYHLNAEEEPEHPDYPKDQAYK-KRFGPRGVIHSFADGKIEDSGPLTKKRMET 215
Query: 223 --DVFTAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRS 279
D F A I H ++P F++ H + L+ L+ I
Sbjct: 216 IDDEFLAATTKFIDKAHKNNKPFFVWFNATRMHI---WTHLKEESKGLSKRGGI------ 266
Query: 280 KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNT 339
+ + + D VG +++ L++ + N+I+++ +D P +G KNT
Sbjct: 267 -YGDGMMEHDYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPDGGTI--PFKGEKNT 323
Query: 340 LWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
WEGG R ++ W + + E H+ DW PTLL+AA +DI
Sbjct: 324 TWEGGFRVPAMVRWPGKITAGDAKIEMVSHM-DWAPTLLAAAGVTDI 369
Score = 43 (20.2 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
Identities = 19/67 (28%), Positives = 29/67 (43%)
Query: 628 LTGGPDQV----YLSGLSDREWLALAMRKLRDAASIQ-CGPVKE--VPCEPQIAPCLFDI 680
LTG D+ YL + A+ ++ SIQ C + P P AP L ++
Sbjct: 397 LTGATDEAPRPSYLYFTDGGDLSAVRFGDMKLQYSIQECEGLNVWICPLTPLRAPLLTNL 456
Query: 681 KNDPCEK 687
+ DP E+
Sbjct: 457 RQDPYER 463
Score = 42 (19.8 bits), Expect = 2.1e-22, Sum P(2) = 2.1e-22
Identities = 8/20 (40%), Positives = 12/20 (60%)
Query: 774 PCEPQIAPCLFDIKNDPCEK 793
P P AP L +++ DP E+
Sbjct: 444 PLTPLRAPLLTNLRQDPYER 463
>RGD|1306571 [details] [associations]
symbol:Arsg "arylsulfatase G" species:10116 "Rattus norvegicus"
[GO:0004065 "arylsulfatase activity" evidence=ISO;ISS] [GO:0005615
"extracellular space" evidence=IEA;ISO] [GO:0005764 "lysosome"
evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
evidence=IEA;ISO] [GO:0006790 "sulfur compound metabolic process"
evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 RGD:1306571 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ
GO:GO:0006790 EMBL:AABR03073953 EMBL:AABR03074766 EMBL:AABR03075952
EMBL:AABR03076519 EMBL:AABR03076696 EMBL:BN000738 IPI:IPI00361303
RefSeq:NP_001041342.1 UniGene:Rn.221856 ProteinModelPortal:Q32KJ9
PRIDE:Q32KJ9 Ensembl:ENSRNOT00000005257 GeneID:303631
KEGG:rno:303631 InParanoid:Q32KJ9 OMA:WHYPHYS NextBio:651782
Genevestigator:Q32KJ9 GermOnline:ENSRNOG00000003931 Uniprot:Q32KJ9
Length = 526
Score = 264 (98.0 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 54/139 (38%), Positives = 84/139 (60%)
Query: 50 FVDLVASS---GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV- 104
FVD S P P+I+ ILADD+GW D+G + + T N+D +A G+ +++
Sbjct: 22 FVDFSISGETRAPRPNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAA 81
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
C+PSR++++TG+ + G+ HN GGLPL+E L + L++ GY T ++GKWHL
Sbjct: 82 STCSPSRASLLTGRLGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTAMIGKWHL 140
Query: 165 GFYKKEYTPTFRGFESHLG 183
G + Y P+FRGF+ + G
Sbjct: 141 GHHGS-YHPSFRGFDYYFG 158
Score = 72 (30.4 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 43/177 (24%), Positives = 67/177 (37%)
Query: 225 FTAEAVDIIHNHSTD-EPLFLYLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+ AV+ I ST P LY+ A H S P PL P ++R
Sbjct: 223 YAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--PLANPQSQ-RLYR--------- 270
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGV---- 336
A L ++D VG++ + ++ N+++ F D S P G+
Sbjct: 271 --ASLQEMDSLVGQIKDKVDHVAK-ENTLLWFAGDNGPWAQKCELAG-SMGPFSGLWQTH 326
Query: 337 ------KNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
K T WEGG R L + P + + + + D PT+++ A S PN
Sbjct: 327 QGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSLLDIFPTVIALAGASLPPN 383
Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 774 PCEPQIAPCLFDIKNDPCEKNNLADRS-EVQRINHYTTEV 812
P + ++P +F++++D E + L S E Q + T V
Sbjct: 446 PEQHHVSPLIFNLEDDAAESSPLQKGSPEYQELLPKVTRV 485
Score = 41 (19.5 bits), Expect = 2.9e-22, Sum P(3) = 2.9e-22
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 668 PCEPQIAPCLFDIKNDPCEKNNLADRS-EDQRINHYTTEV 706
P + ++P +F++++D E + L S E Q + T V
Sbjct: 446 PEQHHVSPLIFNLEDDAAESSPLQKGSPEYQELLPKVTRV 485
Score = 39 (18.8 bits), Expect = 4.7e-22, Sum P(3) = 4.7e-22
Identities = 26/130 (20%), Positives = 48/130 (36%)
Query: 574 EIDGIDVWSVLSRNEPSKRNTILH-NIDDEWQISALTXXXXXXXXXXXXMRYQVDLTGG- 631
+ DG+DV VL + + H N + AL GG
Sbjct: 385 KFDGVDVSEVLFGKSQTGHRVLFHPNSGAAGEYGALQTVRLDRYKAFYITGGAKACDGGV 444
Query: 632 -PDQVYLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
P+Q ++S L + +++ +Q G + P++ L D+ D + N+
Sbjct: 445 GPEQHHVSPL-----IFNLEDDAAESSPLQKGSPEYQELLPKVTRVLADVLQDIADDNSS 499
Query: 691 -ADRSEDQRI 699
AD ++D +
Sbjct: 500 QADYTQDPSV 509
>UNIPROTKB|Q5FYA8 [details] [associations]
symbol:ARSH "Arylsulfatase H" species:9606 "Homo sapiens"
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0004065 "arylsulfatase activity"
evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0044281 "small molecule metabolic process"
evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0016021 GO:GO:0044281 GO:GO:0046872
GO:GO:0006644 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 KO:K12374
OrthoDB:EOG4V4379 CTD:347527 EMBL:AY875940 IPI:IPI00233062
RefSeq:NP_001011719.1 UniGene:Hs.351533 HSSP:P08842
ProteinModelPortal:Q5FYA8 SMR:Q5FYA8 STRING:Q5FYA8 DMDM:74722579
PRIDE:Q5FYA8 DNASU:347527 Ensembl:ENST00000381130 GeneID:347527
KEGG:hsa:347527 UCSC:uc011mhj.2 GeneCards:GC0XP002919
HGNC:HGNC:32488 HPA:HPA050011 MIM:300586 neXtProt:NX_Q5FYA8
PharmGKB:PA143485308 InParanoid:Q5FYA8 OMA:ATVWKVH
GenomeRNAi:347527 NextBio:99177 Bgee:Q5FYA8 CleanEx:HS_ARSH
Genevestigator:Q5FYA8 Uniprot:Q5FYA8
Length = 562
Score = 230 (86.0 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 48/115 (41%), Positives = 69/115 (60%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 7 PNIVLLMADDLGVGDLCCYGNNSVSTPNIDRLASEGVRLTQHLAAASMCTPSRAAFLTGR 66
Query: 119 HPIHTGMQHNVLYGCER--------GGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+PI +GM Y R GGLP +E + L+ GYRT ++GKWHLG
Sbjct: 67 YPIRSGMVS--AYNLNRAFTWLGGSGGLPTNETTFAKLLQHRGYRTGLIGKWHLG 119
Score = 112 (44.5 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 37/131 (28%), Positives = 62/131 (47%)
Query: 258 EPLQAPDHYLNIHRHIEDFK----RSKFAAI---LHKLDESVGKVVEALEQRRMLSNSII 310
EP +L++H + K RSK+ + ++D VGK+++AL+Q R+ +++++
Sbjct: 259 EPFLLFFSFLHVHTPLISKKKFVGRSKYGRYGDNVEEMDWMVGKILDALDQERLANHTLV 318
Query: 311 VFVSDXXXXXX-XXXXXXXSNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQ 365
F SD W +G K WEGG+R G+ W +LE+ G V +
Sbjct: 319 YFTSDNGGHLEPLDGAVQLGGWNGIYKGGKGMGGWEGGIRVPGIFRWPSVLEA-GRVINE 377
Query: 366 YVHVSDWLPTL 376
+ D PTL
Sbjct: 378 PTSLMDIYPTL 388
Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 8/12 (66%), Positives = 8/12 (66%)
Query: 675 PCLFDIKNDPCE 686
P LFDI DP E
Sbjct: 480 PLLFDISRDPSE 491
Score = 43 (20.2 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
Identities = 8/12 (66%), Positives = 8/12 (66%)
Query: 781 PCLFDIKNDPCE 792
P LFDI DP E
Sbjct: 480 PLLFDISRDPSE 491
>UNIPROTKB|F1PY85 [details] [associations]
symbol:ARSH "Arylsulfatase H" species:9615 "Canis lupus
familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:ATVWKVH EMBL:AAEX03026108
Ensembl:ENSCAFT00000017754 Uniprot:F1PY85
Length = 562
Score = 232 (86.7 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
Identities = 52/136 (38%), Positives = 78/136 (57%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ADDLG D+ +G + + TPNID LA G+ L + +CTPSR+A +TG+
Sbjct: 7 PNIVLLMADDLGVGDLCCYGNNTVSTPNIDRLASEGVRLTQHLAAASVCTPSRAAFLTGR 66
Query: 119 HPIHTGMQ--HNVLYGCE----RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFY---KK 169
+PI +GM +N+ G GGLP +E + L+ GYRT ++GKWH G +
Sbjct: 67 YPIRSGMASPYNLNRGLTWLGGSGGLPTNETTFAKLLQHYGYRTGLIGKWHQGLSCASRN 126
Query: 170 E--YTPTFRGFESHLG 183
+ Y P GF+ G
Sbjct: 127 DHCYHPLNHGFDYFYG 142
Score = 106 (42.4 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
Identities = 46/192 (23%), Positives = 82/192 (42%)
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+ I + P L+++ H+ PL D ++ H K + + +
Sbjct: 248 EALAFIDRYKRG-PFLLFVSFLHVHT-----PLITKDKFVG-HS-----KYGLYGDNVEE 295
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX---SNWPLRGVKNTL-WEG 343
+D VG+++E L+Q R+ +++++ F SD SN +G + WEG
Sbjct: 296 MDWMVGRILETLDQERLTNHTLVYFTSDNGGRLEVQEGEVQLGGSNGIYKGGQGMGGWEG 355
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
G+R G+ W +L++ G V + + D PTL S +P N++P E
Sbjct: 356 GIRVPGIFRWPTVLQA-GKVINEPTSLMDIYPTL-SYIGGGMLPQDRVIDGRNLMPLLEG 413
Query: 403 SILRYENGTHEY 414
R + HE+
Sbjct: 414 ---RVSHSDHEF 422
Score = 46 (21.3 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 668 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 695
PC + P LFD+ DP E L +E
Sbjct: 469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500
Score = 46 (21.3 bits), Expect = 2.2e-22, Sum P(3) = 2.2e-22
Identities = 11/32 (34%), Positives = 14/32 (43%)
Query: 774 PCEPQIA----PCLFDIKNDPCEKNNLADRSE 801
PC + P LFD+ DP E L +E
Sbjct: 469 PCSGDVTYHDPPLLFDVSRDPSETRPLNPDNE 500
>UNIPROTKB|Q32KI1 [details] [associations]
symbol:arse "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0004065 "arylsulfatase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 KO:K12374 OrthoDB:EOG4V4379 CTD:415
EMBL:AAEX03026107 OMA:VCFQIMA EMBL:BN000756 RefSeq:NP_001041587.1
UniGene:Cfa.28960 SMR:Q32KI1 STRING:Q32KI1
Ensembl:ENSCAFT00000045735 GeneID:491719 KEGG:cfa:491719
InParanoid:Q32KI1 NextBio:20864462 Uniprot:Q32KI1
Length = 585
Score = 244 (91.0 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
Identities = 49/116 (42%), Positives = 73/116 (62%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
S P+I+ ++ADD G D+G +G + I TPNID LA G++L + +CTPSR+A
Sbjct: 30 SGSRPNILLLMADDFGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAF 89
Query: 115 MTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+TG++P+ +GM + VL + GGLP +E + LK+ GY T ++GKWHLG
Sbjct: 90 LTGRYPLRSGMVSSNGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLG 145
Score = 88 (36.0 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
Identities = 34/145 (23%), Positives = 65/145 (44%)
Query: 255 NPYEPLQAPDHYLNIHRHI---EDFKRSKFAAILH-----KLDESVGKVVEALEQRRMLS 306
N + P +L++H + E F R K A L+ ++D VG++++ L+ + +
Sbjct: 282 NKHRPFLLFVSFLHVHTPLITTEKF-RGKSAHGLYGDNTEEMDWMVGQILDTLDMEGLTN 340
Query: 307 NSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGI 361
++++ F SD W +G K WEGG+R G+ W +L++ G
Sbjct: 341 STLVYFTSDHGGSLEAQLGKEQYGGWNGIYKGGKGMGGWEGGIRVPGIFRWPGVLQA-GR 399
Query: 362 VAEQYVHVSDWLPTLLSAANKSDIP 386
V + + D PT++ ++P
Sbjct: 400 VIHEPTSLMDVFPTVVQLGG-GEVP 423
Score = 50 (22.7 bits), Expect = 3.5e-22, Sum P(3) = 3.5e-22
Identities = 16/46 (34%), Positives = 20/46 (43%)
Query: 668 PCE-PQIA----PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
PC Q+A P LFD+ DP E + L +E H V R
Sbjct: 495 PCSGDQVAHHDPPLLFDLSRDPSEAHALTPDTEPS-FYHVMDTVAR 539
Score = 49 (22.3 bits), Expect = 4.5e-22, Sum P(3) = 4.5e-22
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 774 PCE-PQIA----PCLFDIKNDPCEKNNLADRSE 801
PC Q+A P LFD+ DP E + L +E
Sbjct: 495 PCSGDQVAHHDPPLLFDLSRDPSEAHALTPDTE 527
>MGI|MGI:1921258 [details] [associations]
symbol:Arsg "arylsulfatase G" species:10090 "Mus musculus"
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=ISO] [GO:0005615 "extracellular
space" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
[GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0006790
"sulfur compound metabolic process" evidence=ISO] [GO:0008152
"metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 MGI:MGI:1921258 GO:GO:0005783 GO:GO:0005615
GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GeneTree:ENSGT00560000076940 HOVERGEN:HBG004283
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
CTD:22901 KO:K12381 OrthoDB:EOG4J9MZJ GO:GO:0006790 EMBL:AK018132
EMBL:AK158726 EMBL:AL645791 EMBL:BC022158 EMBL:BC039629
EMBL:BC084731 EMBL:AK173082 EMBL:BN000747 IPI:IPI00135805
IPI:IPI00648999 RefSeq:NP_001159649.1 RefSeq:NP_082986.3
UniGene:Mm.482224 ProteinModelPortal:Q3TYD4 SMR:Q3TYD4
STRING:Q3TYD4 PaxDb:Q3TYD4 PRIDE:Q3TYD4 Ensembl:ENSMUST00000020928
Ensembl:ENSMUST00000106696 Ensembl:ENSMUST00000106697 GeneID:74008
KEGG:mmu:74008 UCSC:uc007mcn.1 UCSC:uc007mcp.2 InParanoid:B1AT67
OMA:GNTFLWF NextBio:339520 Bgee:Q3TYD4 CleanEx:MM_ARSG
Genevestigator:Q3TYD4 GermOnline:ENSMUSG00000020604 Uniprot:Q3TYD4
Length = 525
Score = 257 (95.5 bits), Expect = 3.8e-22, Sum P(2) = 3.8e-22
Identities = 48/125 (38%), Positives = 78/125 (62%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNIVIILADDMGWGDLGANWAETKDTTNLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLP++E L + L++ GY T ++GKWHLG + Y P FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPVNETTLAEVLRQEGYVTAMIGKWHLGHHGS-YHPNFRGF 153
Query: 179 ESHLG 183
+ + G
Sbjct: 154 DYYFG 158
Score = 79 (32.9 bits), Expect = 3.8e-22, Sum P(2) = 3.8e-22
Identities = 42/176 (23%), Positives = 67/176 (38%)
Query: 225 FTAEAVDIIHNHSTD-EPLFLYLAHAATH---SANPYEPLQAPDHYLNIHRHIEDFKRSK 280
+ AV+ I ST P LY+ A H S P PL P ++S
Sbjct: 223 YAERAVEFIEQASTSGRPFLLYVGLAHMHVPLSVTP--PLAHPQ------------RQSL 268
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN-----WPL-R 334
+ A L ++D VG++ + ++ N+++ F D W +
Sbjct: 269 YRASLREMDSLVGQIKDKVDHVAR-ENTLLWFTGDNGPWAQKCELAGSVGPFFGLWQTHQ 327
Query: 335 G---VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
G K T WEGG R L + P + + + + D PT+++ A S PN
Sbjct: 328 GGSPTKQTTWEGGHRVPALAYWPGRVPANVTSTALLSLLDIFPTVIALAGASLPPN 383
>UNIPROTKB|G5E629 [details] [associations]
symbol:ARSE "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:DAAA02075311 EMBL:DAAA02075312
EMBL:DAAA02075313 UniGene:Bt.6471 Ensembl:ENSBTAT00000050377
OMA:VCFQIMA Uniprot:G5E629
Length = 583
Score = 243 (90.6 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
Identities = 54/115 (46%), Positives = 72/115 (62%)
Query: 58 GP-PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIM 115
GP P+I+ ++ADDLG DVG +G I TPNID LA G+ L + LCTPSR+A +
Sbjct: 31 GPRPNILLLMADDLGIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFL 90
Query: 116 TGKHPIHTGMQHN----VL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
TG++P+ +GM + VL + GGLP SE + LK GY T ++GKWHLG
Sbjct: 91 TGRYPLRSGMVSSQGLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLG 145
Score = 89 (36.4 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
Identities = 32/133 (24%), Positives = 61/133 (45%)
Query: 266 YLNIHRHI---EDFK-RSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXX 318
+L++H + E+F+ RS + ++D VG+++E L+ + +++++ F SD
Sbjct: 294 FLHVHTPLVTTENFRGRSPHGLYGDNTEEMDWMVGQILETLDTEGLTNSTLVYFTSDHGG 353
Query: 319 XXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
W +G K WEGG+R G+ W +L + G V + + D
Sbjct: 354 SLEARFGNNQYGGWNGIYKGGKGMAGWEGGIRVPGIFRWPGVLPA-GRVIHEPTSLMDIF 412
Query: 374 PTLLSAANKSDIP 386
PT++ A +P
Sbjct: 413 PTVVHLAG-GQVP 424
Score = 46 (21.3 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E + L +E
Sbjct: 505 PLLFDLSRDPSEAHALTPDTE 525
Score = 46 (21.3 bits), Expect = 9.0e-22, Sum P(3) = 9.0e-22
Identities = 9/21 (42%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E + L +E
Sbjct: 505 PLLFDLSRDPSEAHALTPDTE 525
>UNIPROTKB|E1BU03 [details] [associations]
symbol:ARSG "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
[GO:0005764 "lysosome" evidence=IEA] [GO:0005783 "endoplasmic
reticulum" evidence=IEA] [GO:0006790 "sulfur compound metabolic
process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:AADN02030038
EMBL:AADN02030039 EMBL:AADN02030040 IPI:IPI00574852
ProteinModelPortal:E1BU03 Ensembl:ENSGALT00000006665 OMA:SDEYIYW
Uniprot:E1BU03
Length = 505
Score = 283 (104.7 bits), Expect = 1.2e-21, P = 1.2e-21
Identities = 103/365 (28%), Positives = 152/365 (41%)
Query: 41 PLAFTLSMVFVDLVAS---SGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGII 97
P + L V V L S G P+ I ILADDLGW D+G + + TP++D LA G
Sbjct: 6 PWSVLLLAVLVGLCTSPVAQGKPNFIVILADDLGWGDLGANWAETKETPHLDELAAEGTR 65
Query: 98 LKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
++++ C+PSR++++TG+ + G+ HN GGLPL+E L + L+ GY T
Sbjct: 66 FVDFHSAASTCSPSRASLLTGRLGVRNGVTHNFAISSV-GGLPLNETTLAEVLRAAGYST 124
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDL 216
+GKWHLG + + P FRGF+ + G H D + + + P
Sbjct: 125 AAIGKWHLGHHGHHH-PIFRGFDYYFGIPYSH----DMGCTDTPGYNVPPC----PPCPQ 175
Query: 217 HGKYSTDVFTA--EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNI-HRH 272
HG + DV E + II L AA P YL + H H
Sbjct: 176 HGAATRDVALPLFENLTIIQQPVDLSSLVEQYMEAAARFIQQARDSSRPFFLYLALAHMH 235
Query: 273 IE-----DFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXX--XXXXXXXX 325
+ R + A L ++D VG V + L ++++ F D
Sbjct: 236 VPLQIAPPPDRGIYGAALREMDALVGHV-KHLADSCGKGSTLLWFTGDNGPWMQKSPTQG 294
Query: 326 XXXSNWPLRG---VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANK 382
+ L G K T WEGG R L + P + + D PTL++ A
Sbjct: 295 TLSALLSLAGGSPAKQTTWEGGHRVPALAYWPGHVPAKRSSHAMLSTLDVFPTLVALAGA 354
Query: 383 SDIPN 387
+ PN
Sbjct: 355 TLPPN 359
>UNIPROTKB|Q96EG1 [details] [associations]
symbol:ARSG "Arylsulfatase G" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IDA;TAS] [GO:0005764 "lysosome"
evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
[GO:0006790 "sulfur compound metabolic process" evidence=IDA]
[GO:0005783 "endoplasmic reticulum" evidence=IDA] [GO:0005788
"endoplasmic reticulum lumen" evidence=TAS] [GO:0006644
"phospholipid metabolic process" evidence=TAS] [GO:0006665
"sphingolipid metabolic process" evidence=TAS] [GO:0006687
"glycosphingolipid metabolic process" evidence=TAS] [GO:0043687
"post-translational protein modification" evidence=TAS] [GO:0044267
"cellular protein metabolic process" evidence=TAS] [GO:0044281
"small molecule metabolic process" evidence=TAS]
Reactome:REACT_17015 Reactome:REACT_111217 InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005615
GO:GO:0044281 GO:GO:0046872 GO:GO:0006644 GO:GO:0005764
GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 HOGENOM:HOG000135352 HOVERGEN:HBG004283
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
GO:GO:0006687 CTD:22901 KO:K12381 OMA:LPQDRHF OrthoDB:EOG4J9MZJ
GO:GO:0006790 EMBL:AB023218 EMBL:AY358380 EMBL:BC012375
IPI:IPI00402293 RefSeq:NP_001254656.1 RefSeq:NP_055775.2
UniGene:Hs.437249 ProteinModelPortal:Q96EG1 SMR:Q96EG1
STRING:Q96EG1 DMDM:74731559 PaxDb:Q96EG1 PeptideAtlas:Q96EG1
PRIDE:Q96EG1 Ensembl:ENST00000448504 Ensembl:ENST00000570630
GeneID:22901 KEGG:hsa:22901 UCSC:uc002jhc.2 GeneCards:GC17P066255
HGNC:HGNC:24102 HPA:HPA023245 HPA:HPA023285 MIM:610008
neXtProt:NX_Q96EG1 PharmGKB:PA143485307 InParanoid:Q96EG1
PhylomeDB:Q96EG1 SABIO-RK:Q96EG1 GenomeRNAi:22901 NextBio:43535
ArrayExpress:Q96EG1 Bgee:Q96EG1 CleanEx:HS_ARSG
Genevestigator:Q96EG1 GermOnline:ENSG00000141337 Uniprot:Q96EG1
Length = 525
Score = 252 (93.8 bits), Expect = 2.3e-21, Sum P(2) = 2.3e-21
Identities = 49/130 (37%), Positives = 77/130 (59%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ N GGLPL+E L + L++ GY T I+GKWHLG + Y P FRGF
Sbjct: 96 LGLRNGVTRNFAV-TSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGS-YHPNFRGF 153
Query: 179 ESHLGYWTGH 188
+ + G H
Sbjct: 154 DYYFGIPYSH 163
Score = 77 (32.2 bits), Expect = 2.3e-21, Sum P(2) = 2.3e-21
Identities = 41/169 (24%), Positives = 62/169 (36%)
Query: 225 FTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAA 283
+ +A I ST P LY+A A H P L A RS + A
Sbjct: 223 YAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPR-----------GRSLYGA 271
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN-----WPLR--G- 335
L ++D VG++ + ++ + N+ + F D W R G
Sbjct: 272 GLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNGPWAQKCELAGSVGPFTGFWQTRQGGS 330
Query: 336 -VKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
K T WEGG R L + P + + + V D PT+++ A S
Sbjct: 331 PAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQAS 379
>UNIPROTKB|F1PYB4 [details] [associations]
symbol:ARSD "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:RSWIPSG EMBL:AAEX03026107
EMBL:AAEX03026106 Ensembl:ENSCAFT00000017716 Uniprot:F1PYB4
Length = 597
Score = 231 (86.4 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
Identities = 48/117 (41%), Positives = 72/117 (61%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
A++ P+I+ I+ADDLG D+G +G + TPNID LA G+ L + LCTPSRS+
Sbjct: 39 ANAFKPNILLIMADDLGIGDLGCYGNSTLRTPNIDRLAEEGVRLTQHLAAAPLCTPSRSS 98
Query: 114 IMTGKHPIHTGMQ-HN----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+TG+H +GM+ H+ + + GGLP +E + L++ GY T ++GKWH G
Sbjct: 99 FLTGRHSFRSGMEAHDGYRALQWNGASGGLPENETTFARILQQQGYATGLIGKWHQG 155
Score = 96 (38.9 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
Identities = 49/224 (21%), Positives = 88/224 (39%)
Query: 199 MKMWGLDMRRD---LEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSAN 255
++ W + R+ E DL + +T EAV I + +P L+L+ H
Sbjct: 254 VRRWNCILMRNHDVTEQPMDL--ERTTSHMLREAVSYIERNK-HQPFLLFLSLLHVHI-- 308
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PL +L +H + + ++D VG+V+ A+E+ + + + F SD
Sbjct: 309 ---PLVTTKQFLGKSQH------GLYGDNVEEMDWLVGEVLNAIEENGLKNTTFTYFTSD 359
Query: 316 XXXXXXXXXXXXX-SNWP--LRGVKNTL-WEGGVRGAGLI-WSPLLESRGIVAEQYVHVS 370
W RG K WEGG+R G+ W +L + G V + +
Sbjct: 360 HGGHLEARDERGQLGGWNGIFRGGKGMGGWEGGIRVPGIFRWPGVLPA-GRVIHEPTSLM 418
Query: 371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEY 414
D PT++ ++P +++P + E+ HE+
Sbjct: 419 DVFPTVVQLGG-GEVPQDRVIDGRSLVPLLRGAA---EHSAHEF 458
Score = 47 (21.6 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVG 707
P LF++ DP E L+ SE N +VG
Sbjct: 517 PLLFELSRDPSEARPLSPDSEPL-YNMVVAQVG 548
Score = 47 (21.6 bits), Expect = 3.5e-21, Sum P(3) = 3.5e-21
Identities = 12/33 (36%), Positives = 16/33 (48%)
Query: 781 PCLFDIKNDPCEKNNLADRSEVQRINHYTTEVG 813
P LF++ DP E L+ SE N +VG
Sbjct: 517 PLLFELSRDPSEARPLSPDSE-PLYNMVVAQVG 548
>ZFIN|ZDB-GENE-081104-120 [details] [associations]
symbol:arsh "arylsulfatase H" species:7955 "Danio
rerio" [GO:0008152 "metabolic process" evidence=IEA] [GO:0008484
"sulfuric ester hydrolase activity" evidence=IEA] [GO:0003824
"catalytic activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
ZFIN:ZDB-GENE-081104-120 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 KO:K12374 EMBL:CR407703 EMBL:FP236869
IPI:IPI00506361 RefSeq:XP_003199313.1 Ensembl:ENSDART00000032992
GeneID:100332997 KEGG:dre:100332997 Uniprot:F8VNP0
Length = 583
Score = 238 (88.8 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
Identities = 49/112 (43%), Positives = 69/112 (61%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ + ++ DDLG D+G +G I TPNID LA G+ L ++ + LCTPSR+A MTG+
Sbjct: 34 PNFVLMMVDDLGIGDIGCYGNTTIRTPNIDRLASDGVKLTHHLSAAPLCTPSRTAFMTGR 93
Query: 119 HPIHTGMQHN-----VLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+P+ GM +L+ GGLP +E + L++ GY T IVGKWHLG
Sbjct: 94 YPLRAGMGSTGRVQVILFLAGSGGLPPNETTFAKLLQKQGYTTGIVGKWHLG 145
Score = 92 (37.4 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
Identities = 43/189 (22%), Positives = 78/189 (41%)
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA + H D P L+++ H+ P+ + + RH + + +
Sbjct: 274 EAEQFMERHR-DGPFLLFVSFPQVHT-----PMLVTEGFAGKSRH------GLYGDNVEE 321
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL-WEGGVR 346
+D VG+VV+ +++ + +++ F SD N RG K W+GG+R
Sbjct: 322 VDWMVGRVVDTIDRLGLTEKTLLYFTSDHGGGIEEGPRGGW-NGIYRGGKAMGGWDGGIR 380
Query: 347 GAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSIL 405
G+ W L + VAE + D PT++ A ++P +++P E S
Sbjct: 381 VPGIFRWPGRLAAGREVAEP-TSLMDVFPTVVKLAG-GELPKDRLLDGHDLMPLLEGSSS 438
Query: 406 RYENGTHEY 414
R + HE+
Sbjct: 439 RSQ---HEF 444
Score = 40 (19.1 bits), Expect = 6.8e-21, Sum P(3) = 6.8e-21
Identities = 11/34 (32%), Positives = 17/34 (50%)
Query: 675 PCLFDIKNDPCEKNNLADRSEDQRINHYTTEVGR 708
P +F I +DP E L +++ D R+ V R
Sbjct: 503 PLVFLISSDPSESVPLTEQT-DPRVPEVLQRVQR 535
>UNIPROTKB|P51689 [details] [associations]
symbol:ARSD "Arylsulfatase D" species:9606 "Homo sapiens"
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005764
"lysosome" evidence=IEA] [GO:0004065 "arylsulfatase activity"
evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0006644 "phospholipid metabolic process"
evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0044281 "small molecule metabolic process"
evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0044281 GO:GO:0046872 GO:GO:0006644
GO:GO:0005764 GO:GO:0005788 GO:GO:0043687 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0006687 EMBL:X83572
EMBL:AF160499 EMBL:AC005295 EMBL:BC020229 IPI:IPI00019989
IPI:IPI00028695 IPI:IPI00914575 PIR:I37186 RefSeq:NP_001660.2
UniGene:Hs.528631 ProteinModelPortal:P51689 SMR:P51689
STRING:P51689 DMDM:212276422 PaxDb:P51689 PRIDE:P51689 DNASU:414
Ensembl:ENST00000381154 GeneID:414 KEGG:hsa:414 UCSC:uc004cqy.3
CTD:414 GeneCards:GC0XM002818 HGNC:HGNC:717 HPA:HPA004694
MIM:300002 neXtProt:NX_P51689 PharmGKB:PA25008 InParanoid:P51689
KO:K12374 OMA:RSWIPSG OrthoDB:EOG4V4379 ChiTaRS:ARSD GenomeRNAi:414
NextBio:1749 Bgee:P51689 CleanEx:HS_ARSD Genevestigator:P51689
GermOnline:ENSG00000006756 Uniprot:P51689
Length = 593
Score = 230 (86.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
Identities = 48/117 (41%), Positives = 71/117 (60%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
A++ P+I+ I+ADDLG D+G +G + + TPNID LA G+ L + LCTPSR+A
Sbjct: 36 ANAFKPNILLIMADDLGTGDLGCYGNNTLRTPNIDQLAEEGVRLTQHLAAAPLCTPSRAA 95
Query: 114 IMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+TG+H +GM + L + GGLP +E + L++ GY T ++GKWH G
Sbjct: 96 FLTGRHSFRSGMDASNGYRALQWNAGSGGLPENETTFARILQQHGYATGLIGKWHQG 152
Score = 93 (37.8 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
Identities = 42/192 (21%), Positives = 75/192 (39%)
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EAV I H P L+L+ H PL +L +H + + +
Sbjct: 281 EAVSYIERHKHG-PFLLFLSLLHVHI-----PLVTTSAFLGKSQH------GLYGDNVEE 328
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEG 343
+D +GKV+ A+E + +++ F SD W +G K WEG
Sbjct: 329 MDWLIGKVLNAIEDNGLKNSTFTYFTSDHGGHLEARDGHSQLGGWNGIYKGGKGMGGWEG 388
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
G+R G+ W +L + ++ E + D PT++ ++P +++P +
Sbjct: 389 GIRVPGIFHWPGVLPAGRVIGEP-TSLMDVFPTVVQLVG-GEVPQDRVIDGHSLVPLLQG 446
Query: 403 SILRYENGTHEY 414
+ R HE+
Sbjct: 447 AEAR---SAHEF 455
Score = 48 (22.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E L SE
Sbjct: 514 PLLFDLSRDPSEARPLTPDSE 534
Score = 48 (22.0 bits), Expect = 7.1e-21, Sum P(3) = 7.1e-21
Identities = 10/21 (47%), Positives = 11/21 (52%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E L SE
Sbjct: 514 PLLFDLSRDPSEARPLTPDSE 534
>CGD|CAL0006319 [details] [associations]
symbol:orf19.1608 species:5476 "Candida albicans" [GO:0005634
"nucleus" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 CGD:CAL0006319 EMBL:AACQ01000014 EMBL:AACQ01000013
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 KO:K01130
RefSeq:XP_721567.1 RefSeq:XP_721687.1 ProteinModelPortal:Q5AJI4
GeneID:3636617 GeneID:3636713 KEGG:cal:CaO19.1608
KEGG:cal:CaO19.9176 Uniprot:Q5AJI4
Length = 588
Score = 238 (88.8 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
Identities = 76/245 (31%), Positives = 122/245 (49%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRS 112
+SS P+ + I+ADDLG+ D+ G +I TPN++ LA +G+ L +++T C+P+RS
Sbjct: 6 SSSKQPNFLIIVADDLGFTDLSPFG-GEINTPNLNKLATGANGVRLTDFHTASACSPTRS 64
Query: 113 AIMTG--KHPIHTGM------QHNVLYGCERG--GLPLSEKI--LPQYLKELGYRTRIVG 160
+++G H G +H + + G G L++K+ LP+ L++ GY T I G
Sbjct: 65 MLLSGTDNHIAGLGQMAEFAQRHPEKFNNQPGYEGY-LNDKVVALPEILQDNGYHTFISG 123
Query: 161 KWHLGFYKKEYTPTFRGFESHLGYWTG---HQDYFDHSAEEMKMWGL------DMRRDLE 211
KWHLG KK Y P RGF G H Y ++ ++ L D + L+
Sbjct: 124 KWHLGL-KKPYWPNKRGFNKSFTLLPGAGNHYKYITRDSQGNQIPFLPAIYVEDDKELLQ 182
Query: 212 PAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
P +L YST+ FT +A++ I +P F + + A P+ P QAP + +
Sbjct: 183 PEIELPDDFYSTNYFTDKAIEFIKETPQGKPFFGMITYTA-----PHWPYQAPQDKIAKY 237
Query: 271 RHIED 275
+ D
Sbjct: 238 NGVYD 242
Score = 72 (30.4 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
Identities = 13/35 (37%), Positives = 26/35 (74%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+AA++ LDE++G++++ L L+N+ I+F+SD
Sbjct: 296 YAAMVEILDENIGRLIDHLNSIDELNNTFILFMSD 330
Score = 53 (23.7 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
Identities = 16/60 (26%), Positives = 29/60 (48%)
Query: 360 GIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR---YENSILRYENGTHEYNS 416
G + +++ V D LPT+L AN S P + + PR + N ++ + H+ N+
Sbjct: 428 GKILKEFTTVMDILPTILELANVSH-PGETYKGRQVVKPRGKSWVNYLINKTDQVHDENT 486
Score = 44 (20.5 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 783 LFDIKNDPCEKNNLADRS-EVQRI 805
LF+I DP E N+L++ S E Q I
Sbjct: 519 LFNIIEDPGEINDLSESSSEYQTI 542
Score = 44 (20.5 bits), Expect = 1.0e-20, Sum P(4) = 1.0e-20
Identities = 14/52 (26%), Positives = 24/52 (46%)
Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
S+ TIL+ + D W + A G +L + + E SN+ Y+ +D
Sbjct: 537 SEYQTILNELLDHWAVYAAETGLIELGSD--LFEKEKIEGESNEVVYRTILD 586
Score = 43 (20.2 bits), Expect = 1.3e-20, Sum P(4) = 1.3e-20
Identities = 9/20 (45%), Positives = 14/20 (70%)
Query: 677 LFDIKNDPCEKNNLADRSED 696
LF+I DP E N+L++ S +
Sbjct: 519 LFNIIEDPGEINDLSESSSE 538
Score = 41 (19.5 bits), Expect = 4.7e-16, Sum P(3) = 4.7e-16
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 463 YNGPKNE--NTNPRYENGTHEYNIPRLENSIN 492
Y P+++ N Y+NG E RL+++ N
Sbjct: 227 YQAPQDKIAKYNGVYDNGPEELRQKRLQSAKN 258
Score = 39 (18.8 bits), Expect = 7.5e-16, Sum P(3) = 7.5e-16
Identities = 9/37 (24%), Positives = 20/37 (54%)
Query: 386 PNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENS 422
P++ ++ I +Y N + Y+NG E R++++
Sbjct: 223 PHWPYQAPQDKIAKY-NGV--YDNGPEELRQKRLQSA 256
Score = 37 (18.1 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
Identities = 7/16 (43%), Positives = 8/16 (50%)
Query: 436 KYENRYENGTHEYNPK 451
KY Y+NG E K
Sbjct: 236 KYNGVYDNGPEELRQK 251
>UNIPROTKB|E1BYN0 [details] [associations]
symbol:ARSD "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 CTD:414 KO:K12374 OMA:RSWIPSG
EMBL:AADN02017596 IPI:IPI00570897 RefSeq:XP_416855.2
ProteinModelPortal:E1BYN0 Ensembl:ENSGALT00000026880 GeneID:418658
KEGG:gga:418658 NextBio:20821812 Uniprot:E1BYN0
Length = 596
Score = 243 (90.6 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
Identities = 56/146 (38%), Positives = 81/146 (55%)
Query: 49 VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LC 107
+F A P+I+ LADDLG DVG +G + I TPNID LA G+ L + LC
Sbjct: 31 IFGFSTAVDSKPNILLFLADDLGIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLC 90
Query: 108 TPSRSAIMTGKHPIHTGM----QHNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
TPSR+A +TG++PI +GM ++ L + GGLP +E + L++ GY T ++GKW
Sbjct: 91 TPSRAAFLTGRYPIRSGMASSNRYRALQWNAGSGGLPANETTFARLLQQQGYTTGLIGKW 150
Query: 163 HLGFYKKEYT-----PTFRGFESHLG 183
H G + ++ P GF+ G
Sbjct: 151 HQGVNCESFSDHCHHPLNHGFDYFYG 176
Score = 82 (33.9 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
Identities = 35/158 (22%), Positives = 65/158 (41%)
Query: 228 EAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHK 287
EA+ I + + P L+++ H+ PL +L H + + +
Sbjct: 282 EAISFI-KRNRNGPFLLFVSFLHVHT-----PLFTTVKFLGKSHH------GLYGDNVEE 329
Query: 288 LDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL-WEG 343
+D VGK+++ L++ + +++ F SD W +G K WEG
Sbjct: 330 MDWMVGKILDLLDKEGLKNHTFTYFASDHGGHLEAQDGSAQMGGWNGIYKGGKGMGGWEG 389
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
G+R G+ W +L + G V + + D PT++ A
Sbjct: 390 GIRVPGVFRWPGVLPA-GTVINEPTSLMDIFPTVVHLA 426
Score = 43 (20.2 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P L+D+ DP E L+ +E
Sbjct: 515 PLLYDLSRDPSESQPLSADTE 535
Score = 43 (20.2 bits), Expect = 1.1e-20, Sum P(3) = 1.1e-20
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P L+D+ DP E L+ +E
Sbjct: 515 PLLYDLSRDPSESQPLSADTE 535
>TIGR_CMR|CPS_2985 [details] [associations]
symbol:CPS_2985 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISS] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
RefSeq:YP_269685.1 ProteinModelPortal:Q47ZT4 STRING:Q47ZT4
GeneID:3523028 KEGG:cps:CPS_2985 PATRIC:21468987 OMA:RNEFLPT
BioCyc:CPSY167879:GI48-3034-MONOMER Uniprot:Q47ZT4
Length = 502
Score = 281 (104.0 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
Identities = 97/359 (27%), Positives = 165/359 (45%)
Query: 39 VLPLAFTLSMVFVDLVASSGPPHIIFILADDLG-WNDVGFH-GLDQIPTPNIDALAYSGI 96
VL L+ + + P+I+ I DD+G +N ++ G+ TPNID +A GI
Sbjct: 11 VLGLSLIAASSAAMATTDTAKPNILAIWGDDIGPFNISAYNRGIMGYKTPNIDRIANEGI 70
Query: 97 ILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRT 156
I + Y Q CT R+ +TG+HP+ TG+ L G + G L + + + LK GY T
Sbjct: 71 IFTDSYGDQSCTAGRAGFITGQHPMRTGLTKVGLPGAKEG-LNKKDPTIAELLKPHGYMT 129
Query: 157 RIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAW 214
GK HLG + E+ PT GF+ G Y +D +H + K R P
Sbjct: 130 GQFGKNHLGD-QDEHLPTNHGFDEFFGNLYHLNAEDEPEHP-DYPKDPAFKKR--FGPRG 185
Query: 215 DLH----GKYS-TDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEPLQAPDHYLN 268
+H GK + T T + ++ I L F+ AHAA + +
Sbjct: 186 AIHSFADGKITDTGPVTKKRMETIDEEFLGAALKFIDKAHAAKKPFFVWFNSTRMHVWTR 245
Query: 269 IHRHIEDFK-RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
+ + + +A + + D VG++++ +++ + N+II++ +D
Sbjct: 246 LKPESDGVTGQGLYADGMVEHDGHVGQLLDKIDKLGIAENTIIMYTTDNGAELALWPDGG 305
Query: 328 XSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
+ P RG KNT WEGG R ++ W+ ++ V+ + + + DW+PT+L+ A ++I
Sbjct: 306 YT--PFRGEKNTNWEGGYRVPMMVKWAGKIKPNQ-VSNEMISLIDWMPTILAVAGDTNI 361
Score = 37 (18.1 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
Identities = 6/14 (42%), Positives = 11/14 (78%)
Query: 674 APCLFDIKNDPCEK 687
AP +F+++ DP E+
Sbjct: 442 APKIFNLRMDPYER 455
Score = 37 (18.1 bits), Expect = 1.2e-20, Sum P(2) = 1.2e-20
Identities = 6/14 (42%), Positives = 11/14 (78%)
Query: 780 APCLFDIKNDPCEK 793
AP +F+++ DP E+
Sbjct: 442 APKIFNLRMDPYER 455
>UNIPROTKB|C9J5G7 [details] [associations]
symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
HGNC:HGNC:719 IPI:IPI00640709 ProteinModelPortal:C9J5G7 SMR:C9J5G7
STRING:C9J5G7 Ensembl:ENST00000438544 HOGENOM:HOG000213821
ArrayExpress:C9J5G7 Bgee:C9J5G7 Uniprot:C9J5G7
Length = 178
Score = 254 (94.5 bits), Expect = 1.3e-20, P = 1.3e-20
Identities = 54/139 (38%), Positives = 82/139 (58%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAI 114
S+ P+I+ ++ADDLG D+G +G + + TPNID LA G+ L + + LCTPSR+A
Sbjct: 34 SASRPNILLLMADDLGIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAF 93
Query: 115 MTGKHPIHTGMQHNVLYGCER-----GGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+TG++P+ +GM ++ Y + GGLP +E + LKE GY T ++GKWHLG +
Sbjct: 94 LTGRYPVRSGMVSSIGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCE 153
Query: 170 E-----YTPTFRGFESHLG 183
+ P GF+ G
Sbjct: 154 SASDHCHHPLHHGFDHFYG 172
>ZFIN|ZDB-GENE-060503-154 [details] [associations]
symbol:arsg "arylsulfatase G" species:7955 "Danio
rerio" [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
ZFIN:ZDB-GENE-060503-154 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:CR926135 EMBL:CABZ01038699
EMBL:CABZ01038700 IPI:IPI00502628 Ensembl:ENSDART00000091423
Bgee:F1QQI9 Uniprot:F1QQI9
Length = 526
Score = 274 (101.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
Identities = 87/283 (30%), Positives = 137/283 (48%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
P+ I ILADD+GW D+ + D PTP +D+L G ++++ C+PSR++I+TG
Sbjct: 35 PNFIIILADDIGWGDLWLNRPDNSTPTPWLDSLMLKGKRFTDFHSPASTCSPSRASILTG 94
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+H + G+ HN G GGLPL+E Q L + GY T ++GKWHLG + Y+P RG
Sbjct: 95 RHGLRNGVTHNFAVGSV-GGLPLNETTFAQLLHDEGYYTAMIGKWHLG-HNGSYSPVHRG 152
Query: 178 FESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTD-----VFTAEAVDI 232
F+ +LG + D + GLD+ P +H +YS + +T + +
Sbjct: 153 FDYYLGIPYSN----DMGCTDKP--GLDLPC-CPPC--VHSQYSINKKHEGCYTKVGLPL 203
Query: 233 IHNHST-DEPLFLY-----LAHAA-----THSANPYEPLQAPDHYLNI-HRHIEDFKRS- 279
N ++PL + A AA T S +P Y+ + H H+ F +
Sbjct: 204 FENEKIIEQPLDTWSLKDRYATAAVQQIFTASVTKKQPFLL---YVALAHMHVPLFHNTF 260
Query: 280 -------KFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+ A L +D VG +++AL + L N++I F D
Sbjct: 261 LNVTTEDPYTASLSDMDSLVGNIMQALITEQ-LENTLIWFTGD 302
Score = 42 (19.8 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 658 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQR 698
S+ C + P + P +FD+ D E+ L S++ R
Sbjct: 437 SVACDG-ESGPQQHHDPPLIFDLSQDEAEETPLDPESKEFR 476
Score = 41 (19.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
Identities = 7/22 (31%), Positives = 13/22 (59%)
Query: 511 DGIDVWSVLSRNEPSKRNTILH 532
DGID+ VL + + +++H
Sbjct: 387 DGIDITDVLLNDSETGHESLMH 408
Score = 41 (19.5 bits), Expect = 2.4e-20, Sum P(3) = 2.4e-20
Identities = 7/22 (31%), Positives = 13/22 (59%)
Query: 576 DGIDVWSVLSRNEPSKRNTILH 597
DGID+ VL + + +++H
Sbjct: 387 DGIDITDVLLNDSETGHESLMH 408
Score = 38 (18.4 bits), Expect = 6.1e-20, Sum P(3) = 6.1e-20
Identities = 10/38 (26%), Positives = 17/38 (44%)
Query: 764 SIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSE 801
S+ C + P + P +FD+ D E+ L S+
Sbjct: 437 SVACDG-ESGPQQHHDPPLIFDLSQDEAEETPLDPESK 473
>TIGR_CMR|CPS_2983 [details] [associations]
symbol:CPS_2983 "putative arylsulfatase" species:167879
"Colwellia psychrerythraea 34H" [GO:0004065 "arylsulfatase
activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0008484 KO:K01130 OMA:DDQVGIL
HOGENOM:HOG000135355 RefSeq:YP_269683.1 ProteinModelPortal:Q47ZT6
STRING:Q47ZT6 GeneID:3520535 KEGG:cps:CPS_2983 PATRIC:21468983
BioCyc:CPSY167879:GI48-3032-MONOMER Uniprot:Q47ZT6
Length = 522
Score = 270 (100.1 bits), Expect = 3.9e-20, P = 3.9e-20
Identities = 100/381 (26%), Positives = 167/381 (43%)
Query: 27 ELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFH--GLDQIP 84
E+ R + +A + LA S + P+++ I DD+G+ ++ + G+
Sbjct: 2 EMNNRLKKLALGIGVLAIATSAA---ATTNKAKPNVLAIWGDDIGYYNISAYNQGMMGYQ 58
Query: 85 TPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKI 144
TPNID +A G + ++Y Q CT R++ + G+ P TG+ + G G +P
Sbjct: 59 TPNIDRIADEGALFTHHYAQQSCTAGRASFILGQEPFRTGLLTIGMPGSTHG-IPDWTPT 117
Query: 145 LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLG--YWTGHQD-----YFDHSAE 197
+ LKE GY T GK HLG K + PT GF+ G Y ++ Y+ E
Sbjct: 118 IADLLKEKGYMTAQFGKNHLGDQDK-HLPTNHGFDEFFGNLYHLNAEEEPETYYYPKDKE 176
Query: 198 EMKMWGLDMRRDLEPAWDLHGKY-STDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSAN 255
K +G R + D GK +T T + ++ L F+ AH A
Sbjct: 177 FHKKYG--PRGVIHSFAD--GKIENTGSMTRKRMETADGEFLAGTLKFIDKAHKAK---K 229
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILH-----KLDESVGKVVEALEQRRMLSNSII 310
P+ + +++ +++ R K L + D+ VG +++ L+ ++ N+I+
Sbjct: 230 PFFIWHSSTR-MHVWTRLQEKYRGKSGVSLTADGMLEHDDQVGILLDKLDDLKIADNTIV 288
Query: 311 VFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHV 369
++ +D S P RG K T EGG+R L+ W +++ H
Sbjct: 289 IYSTDNGAEKFTWPDGGTS--PFRGEKGTTTEGGMRVPQLVRWPGTIKAGSKFNNMMSH- 345
Query: 370 SDWLPTLLSAANKSDIPNYVN 390
DW+PTLL+AA + PN VN
Sbjct: 346 EDWMPTLLAAAGE---PNIVN 363
>UNIPROTKB|F1N665 [details] [associations]
symbol:ARSG "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0006790 "sulfur compound metabolic process"
evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
[GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 GO:GO:0006790 EMBL:DAAA02049729
EMBL:DAAA02049730 EMBL:DAAA02049731 EMBL:DAAA02049732
EMBL:DAAA02049733 IPI:IPI00867152 UniGene:Bt.103824
ProteinModelPortal:F1N665 Ensembl:ENSBTAT00000014061 OMA:GHARNAF
Uniprot:F1N665
Length = 328
Score = 249 (92.7 bits), Expect = 4.4e-20, P = 4.4e-20
Identities = 49/130 (37%), Positives = 75/130 (57%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + T N+D +A G +++ C+PSR+A++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAGTKDTANLDRMAAEGTRFVDFHAAASTCSPSRAALLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLPL+E L + L+ GY T ++GKWHLG + + P FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLRGAGYVTGMIGKWHLGHHGSHH-PNFRGF 153
Query: 179 ESHLGYWTGH 188
+ + G H
Sbjct: 154 DYYFGVPYSH 163
>UNIPROTKB|F1NFL4 [details] [associations]
symbol:F1NFL4 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 OMA:RWNDWKA EMBL:AADN02017596
IPI:IPI00586912 Ensembl:ENSGALT00000026882 Uniprot:F1NFL4
Length = 374
Score = 258 (95.9 bits), Expect = 1.1e-19, P = 1.1e-19
Identities = 97/347 (27%), Positives = 160/347 (46%)
Query: 60 PHIIFILADDLGWND--VGFHGLDQI---PTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
P+ + ILADDLG D + H +D I TP+ID LA G+ L + +CTPSR+A
Sbjct: 2 PNFLLILADDLGIGDTSIKMH-IDMIFLFRTPHIDGLAKEGVRLTQHIAAAAVCTPSRAA 60
Query: 114 IMTGKHPIHTGMQHNVLY--GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEY 171
+TG++PI + + +L+ GC GGLP +E + L + GY T +VGKWHLG K +
Sbjct: 61 FLTGRYPIRS--ERRILFWNGCS-GGLPPNETTFARVLHQQGYSTALVGKWHLGVNCKSH 117
Query: 172 T-----PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKY-STDVF 225
P GFE Y+ G + + G D + + D Y S F
Sbjct: 118 RDHCHHPLNHGFE----YFYGMSFTILNECQ-----GTDDPELAKSSQD--NLYCSAYAF 166
Query: 226 TAEAVDII----HNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI---EDFK- 277
+ +I N+S + L+ L + N + P L++H + ++F
Sbjct: 167 VWKTYPLILSKMENNSMCDHLWSPLVSFSGKVRNKHRPFLLFLSLLHVHTPLITTKEFLG 226
Query: 278 RSK---FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR 334
RS+ + + ++D VG++++ +++ + + + I F SD S +
Sbjct: 227 RSRHGLYGDNVEEMDWMVGRLLDVIDKEGLKNTTFIYFASDHKENLTNCPNVYTSKFSSE 286
Query: 335 GVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAA 380
+ WEGG+R G++ W L + GIV + + D PT++ A
Sbjct: 287 IMGG--WEGGIRVPGIVRWPGALPA-GIVISEPTSIMDIFPTVVHLA 330
>UNIPROTKB|Q32KH9 [details] [associations]
symbol:ARSG "Arylsulfatase G" species:9615 "Canis lupus
familiaris" [GO:0005764 "lysosome" evidence=ISS] [GO:0004065
"arylsulfatase activity" evidence=ISS] [GO:0006790 "sulfur compound
metabolic process" evidence=IEA] [GO:0005783 "endoplasmic
reticulum" evidence=IEA] [GO:0005615 "extracellular space"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005783 GO:GO:0005615 GO:GO:0046872
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
HOVERGEN:HBG004283 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 EMBL:AAEX02034846 EMBL:BN000758
RefSeq:NP_001041563.1 UniGene:Cfa.37363 ProteinModelPortal:Q32KH9
STRING:Q32KH9 Ensembl:ENSCAFT00000017623 GeneID:480460
KEGG:cfa:480460 CTD:22901 InParanoid:Q32KH9 KO:K12381 OMA:LPQDRHF
OrthoDB:EOG4J9MZJ NextBio:20855470 GO:GO:0006790 Uniprot:Q32KH9
Length = 535
Score = 266 (98.7 bits), Expect = 1.2e-19, P = 1.2e-19
Identities = 78/272 (28%), Positives = 129/272 (47%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + + T N+D +A G+ +++ C+PSR++++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAETKDTANLDKMAAEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLPL+E L + L++ GY T ++GKWHLG + Y P FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQQAGYVTGMIGKWHLGHHGP-YHPNFRGF 153
Query: 179 ESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTA--EAVDIIHN 235
+ + G H D R D P+ L TDV E ++I+
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNHPPCPACPRGD-RPSRSLERDCYTDVALPLYENLNIVEQ 212
Query: 236 --------HSTDEPLFLYLAHAATHSANPYEPLQAPDH-YLNIHR-HIEDFKRSK--FAA 283
H E ++ HA+ S P+ H ++ I R + R + + A
Sbjct: 213 PVNLSSLAHKYAEKAIQFIQHASA-SGRPFLLYMGLAHMHVPISRTQLSAVLRGRRPYGA 271
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
L ++D VG++ + ++ R N+ + F D
Sbjct: 272 GLREMDSLVGQIKDKVD-RTAKENTFLWFTGD 302
>TIGR_CMR|CPS_2984 [details] [associations]
symbol:CPS_2984 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISS] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0008484 KO:K01130 HOGENOM:HOG000135355
RefSeq:YP_269684.1 ProteinModelPortal:Q47ZT5 STRING:Q47ZT5
GeneID:3520029 KEGG:cps:CPS_2984 PATRIC:21468985 OMA:NGPHANT
BioCyc:CPSY167879:GI48-3033-MONOMER Uniprot:Q47ZT5
Length = 512
Score = 250 (93.1 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
Identities = 89/359 (24%), Positives = 158/359 (44%)
Query: 47 SMVFVDLVASSGPPHIIFILADDLGWNDVGF--HGLDQIPTPNIDALAYSGIILKNYYTV 104
S++ ++ P+I+F DD+G ++ HG+ TPNID +A G++ +YY
Sbjct: 14 SLIATASATAAEKPNILFFWGDDIGRTNISAYSHGIMGFKTPNIDRIAKEGMMFTDYYAD 73
Query: 105 QLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHL 164
Q CT RS +TG+ + TGM L G + G + + + + LK GY T GK HL
Sbjct: 74 QSCTAGRSTFITGQSGLRTGMTKVGLPGAKEG-IQDRDITIAEMLKAKGYTTGQFGKNHL 132
Query: 165 GFYKKEYTPTFRGFESHLG--YWTGHQ------DYFDHSAEEMKMW--GL-----DMR-R 208
G K E+ P+ GF+ G Y + DY A + K G+ D +
Sbjct: 133 GD-KDEHLPSNHGFDEFFGNLYHLNAEEEPEDPDYPKDPAFKKKFGPRGVIHSYADGKIE 191
Query: 209 DLEPAWDLHGKYSTDVFTAEAVDIIHNH-STDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
D P + + D F A A+ + +P F+++ A H P
Sbjct: 192 DTGPLTKKRMETADDEFVAAAMKFVDKAVKAKKPFFVWVNTAGMHFRTHINP-------- 243
Query: 268 NIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXX 327
+H+ + + ++ D VG +++ L++ ++ ++I+++ +D
Sbjct: 244 ---KHVGLSGQGFYNDVMVAHDNHVGMMLDQLDKLKVTDSTIVMYSTDNGVHYNTWPDAG 300
Query: 328 XSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
+ P G KN+ EG R ++ W +++ + E H+ DW+PTL +AA + +
Sbjct: 301 IT--PFDGEKNSEKEGAYRVPMMVRWPGKIKAGEVSNEMMAHL-DWMPTLAAAAGDTKL 356
Score = 60 (26.2 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
Identities = 19/59 (32%), Positives = 31/59 (52%)
Query: 498 ENRSNDNSYQNEIDGIDVWSVLS-RNEPSKRNTILHNIDDEWQISALTRGKWKLV-KEN 554
+ R + + +DG ++ L+ + E S RN I H ++DE A+ G WK+V EN
Sbjct: 364 KRRFGNKQSKIHLDGYNMLPHLTGKTEKSPRN-IYHYLNDEGFPVAIRIGDWKMVYAEN 421
>UNIPROTKB|P77318 [details] [associations]
symbol:ydeN "putative sulfatase" species:83333 "Escherichia
coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
HOGENOM:HOG000135352 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 PIR:E64903 RefSeq:NP_416015.2
RefSeq:YP_489763.1 ProteinModelPortal:P77318 SMR:P77318
DIP:DIP-11682N IntAct:P77318 PhosSite:P0810453 PRIDE:P77318
EnsemblBacteria:EBESCT00000001979 EnsemblBacteria:EBESCT00000015602
GeneID:12931856 GeneID:945957 KEGG:ecj:Y75_p1474 KEGG:eco:b1498
PATRIC:32118290 EchoBASE:EB3557 EcoGene:EG13796 KO:K01138
OMA:PVINRCA ProtClustDB:CLSK880035 BioCyc:EcoCyc:G6788-MONOMER
BioCyc:ECOL316407:JW5243-MONOMER Genevestigator:P77318
Uniprot:P77318
Length = 560
Score = 257 (95.5 bits), Expect = 2.8e-19, Sum P(2) = 2.8e-19
Identities = 105/390 (26%), Positives = 166/390 (42%)
Query: 82 QIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPL 140
Q TP + +L G+ N Y + PSR+AIMTG+ P G+ N + G+PL
Sbjct: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPL 162
Query: 141 SEKILPQYLKELGYRTRIVGKWHLGFYK----------KEYTPTFRGFESHLGYWTGHQ- 189
+E LP+ + GY T VGKWHL ++Y F F + W
Sbjct: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE--EWQPQNR 220
Query: 190 --DYFD--HSAEEM--KMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHST-DEPL 242
DYF H+A L R+ PA G Y +D T EA+ ++ T D+P
Sbjct: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPA---KG-YISDQLTDEAIGVVDRAKTLDQPF 276
Query: 243 FLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
LYLA+ A H N P APD Y + +A++ + +D+ V +++E L++
Sbjct: 277 MLYLAYNAPHLPND-NP--APDQYQKQFNTGSQTADNYYASV-YSVDQGVKRILEQLKKN 332
Query: 303 RMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
N+II+F SD N +G K+ + GG +W G
Sbjct: 333 GQYDNTIILFTSDNGAVIDGPLPL---NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389
Query: 363 AEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS-----------ILRYENGT 411
++ + D+ PT L AA+ S IP + +++P ++ I Y +
Sbjct: 390 -DKLISAMDFYPTALDAADIS-IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447
Query: 412 HEYNSPRIENSN--TRYENGTHEYNPKYEN 439
E N P +N + R+++ + +NP E+
Sbjct: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTED 477
Score = 53 (23.7 bits), Expect = 2.8e-19, Sum P(2) = 2.8e-19
Identities = 18/64 (28%), Positives = 29/64 (45%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFH--GLDQIPTPNIDAL-AYSGIILKNYYTVQLCTPSR 111
++ G P+II + DDLG+ + F D N + + Y I K Q TP+
Sbjct: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112
Query: 112 SAIM 115
++M
Sbjct: 113 LSLM 116
>UNIPROTKB|F1RV22 [details] [associations]
symbol:ARSG "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0006790 "sulfur compound metabolic process"
evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
[GO:0005764 "lysosome" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0005615
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 KO:K12381 OMA:LPQDRHF GO:GO:0006790
EMBL:FP085458 EMBL:FP085465 EMBL:FP067366 RefSeq:XP_003131311.1
UniGene:Ssc.62110 Ensembl:ENSSSCT00000018790 GeneID:100521576
KEGG:ssc:100521576 Uniprot:F1RV22
Length = 525
Score = 260 (96.6 bits), Expect = 5.0e-19, P = 5.0e-19
Identities = 78/275 (28%), Positives = 134/275 (48%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+ + ILADD+GW D+G + + T N+D LA G+ +++ C+PSR++++TG+
Sbjct: 36 PNFVIILADDMGWGDLGANWAETKDTANLDKLAAEGMRFVDFHAAASTCSPSRASLLTGR 95
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGF 178
+ G+ HN GGLPL+E L + L+ GY T ++GKWHLG + + P FRGF
Sbjct: 96 LGLRNGVTHNFAV-TSVGGLPLNETTLAEVLQRAGYITGMIGKWHLGHHGS-FHPNFRGF 153
Query: 179 ESHLGYWTGHQ-DYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN-H 236
+ + G H D RR +P+ +L +DV A+ + N +
Sbjct: 154 DYYFGIPYSHDMGCTDTPGYNYPPCPACPRRH-QPSRNLERDCYSDV----ALPLYENLN 208
Query: 237 STDEPLFLY-LAHAATHSANPY-EPLQAPDH----YLNI-HRHIE---------DFKRSK 280
++P+ L LA A + + +A Y+ + H H+ + R
Sbjct: 209 IVEQPVNLSGLARKYAEKATQFIQQARASGRPFLLYVGLAHMHVPLSRPQRSAGPWDRRP 268
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+AA L ++D VG++ + ++ R +N+ + F D
Sbjct: 269 YAAGLREMDRLVGQIKDKVD-RTAKNNTFLWFTGD 302
>UNIPROTKB|F1PYB3 [details] [associations]
symbol:ARSE "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AAEX03026107
Ensembl:ENSCAFT00000017722 Uniprot:F1PYB3
Length = 253
Score = 239 (89.2 bits), Expect = 5.2e-19, P = 5.2e-19
Identities = 47/113 (41%), Positives = 72/113 (63%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGK 118
P+I+ ++ADD G D+G +G + I TPNID LA G++L + +CTPSR+A +TG+
Sbjct: 6 PNILLLMADDFGIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGR 65
Query: 119 HPIHTGMQ-----HNVL-YGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
+P+ +G+ + VL + GGLP +E + LK+ GY T ++GKWHLG
Sbjct: 66 YPLRSGLSSLINGYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLG 118
>TIGR_CMR|CPS_3032 [details] [associations]
symbol:CPS_3032 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISS] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0008484 KO:K01130 RefSeq:YP_269731.1
ProteinModelPortal:Q47ZN9 STRING:Q47ZN9 GeneID:3518391
KEGG:cps:CPS_3032 PATRIC:21469075 HOGENOM:HOG000135355 OMA:RWNDWKA
BioCyc:CPSY167879:GI48-3081-MONOMER Uniprot:Q47ZN9
Length = 522
Score = 258 (95.9 bits), Expect = 8.2e-19, P = 8.2e-19
Identities = 97/366 (26%), Positives = 161/366 (43%)
Query: 32 TRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLG-WNDVGF-HGLDQIPTPNID 89
T+ FA+ T S + +S P+I+ I DD+G +N + HG+ TPNID
Sbjct: 5 TKFTQFAIALGMLTASATALATTDTS-KPNILAIWGDDIGIYNISAYNHGMMGYQTPNID 63
Query: 90 ALAYSGIILKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYL 149
+A G + + Y Q CT RSA + G+ P TG+ + G G +P +
Sbjct: 64 RIANEGALFTDQYAQQSCTAGRSAFILGQEPFRTGLLTIGMPGSTHG-IPDWAPTIGDVA 122
Query: 150 KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGL--DMR 207
K+ GY T GK HLG K + PT GF+ G Y ++ EE + + D R
Sbjct: 123 KDNGYMTAQFGKNHLGDQDK-HLPTKHGFDEFFGNL-----YHLNAEEEPETYYYPKDPR 176
Query: 208 --RDLEPAWDLH----GKYS-TDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEP 259
+ P LH G+ T T + ++ L F+ AH A P+
Sbjct: 177 FKKKFGPRGVLHTFADGRMEDTGALTRKRMETADEEFLGATLKFIDKAHKAD---KPFF- 232
Query: 260 LQAPDHYLNIHRHIEDFKRSK-----FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVS 314
+ +++H +++ + K +A + + DE VG +++ L+ ++ N+I+++ +
Sbjct: 233 IWYNSTRMHVHTRLQEKWQGKSGISIYADGMLEHDEHVGVLLDKLDDLKIADNTIVIYTT 292
Query: 315 DXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWL 373
D N P G K T +EGG+R L+ W ++ + H+ DW+
Sbjct: 293 DNGAETFTWPDG--GNTPFHGEKGTTYEGGMRVPQLVRWPGTIKPGSKMNSMMSHI-DWM 349
Query: 374 PTLLSA 379
PTL +A
Sbjct: 350 PTLAAA 355
>UNIPROTKB|I3LCI6 [details] [associations]
symbol:I3LCI6 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0008484 GeneTree:ENSGT00560000077076 EMBL:CU469102
EMBL:AEMK01103856 EMBL:AEMK01167009 Ensembl:ENSSSCT00000031398
Uniprot:I3LCI6
Length = 121
Score = 176 (67.0 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
Identities = 30/55 (54%), Positives = 40/55 (72%)
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKS 383
+NWPLRG K +LWEGGVRG G + PLL+ +G+ + +H+SDWLPTL+ A S
Sbjct: 19 NNWPLRGRKWSLWEGGVRGVGFVAGPLLKRKGVKNRELIHISDWLPTLVKLAGGS 73
Score = 83 (34.3 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNID 535
+DG DVW +S PS R +LHNID
Sbjct: 80 LDGFDVWKTISEGSPSPRMELLHNID 105
Score = 83 (34.3 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 575 IDGIDVWSVLSRNEPSKRNTILHNID 600
+DG DVW +S PS R +LHNID
Sbjct: 80 LDGFDVWKTISEGSPSPRMELLHNID 105
>ASPGD|ASPL0000001694 [details] [associations]
symbol:AN6847 species:162425 "Emericella nidulans"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:BN001301 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484 OMA:IEWTNIS
ProteinModelPortal:C8V2I8 EnsemblFungi:CADANIAT00007645
Uniprot:C8V2I8
Length = 616
Score = 144 (55.7 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
Identities = 26/58 (44%), Positives = 41/58 (70%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTG 117
P+ + I+ADDLG++D+G +G +I TPNID LA G+ +++ C+P+R+ IMTG
Sbjct: 7 PNFLVIVADDLGFSDIGCYG-SEIRTPNIDKLAQKGVRFTDFHAAAACSPTRAMIMTG 63
Score = 138 (53.6 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
Identities = 57/173 (32%), Positives = 72/173 (41%)
Query: 140 LSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRGFE---SHLGYWTGHQDYFDH 194
L+E++ LP+ L++ GY T + GKWHLG E +P RGF+ +HL + H Y
Sbjct: 106 LNERVVALPEILRDAGYHTLMSGKWHLGL-TPERSPYKRGFDRSLAHLPACSNHYAYEPQ 164
Query: 195 SAE--------EMKMWGLDMR-----RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE- 240
+ E L M R L W Y D VD N DE
Sbjct: 165 LRDQDETPTFLEASYIALHMEDDKYVRSLPEGWYSSNGYG-DKMREYLVDWHKNKKEDED 223
Query: 241 -PLFLYLAHAATHSANPYEPLQAP----DHYLNIHRHIEDFKRSKFAAILHKL 288
P F YL A P+ PLQAP DHY ++ D R K A L KL
Sbjct: 224 KPFFAYLPFTA-----PHWPLQAPREYIDHYRGVYDDGPDALRLKRLASLKKL 271
Score = 68 (29.0 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
FA ++ +D +VGK+V+ L+ L N+ + F+SD
Sbjct: 311 FAGMVECIDANVGKIVDYLDSIGELDNTFVCFMSD 345
Score = 42 (19.8 bits), Expect = 4.8e-18, Sum P(4) = 4.8e-18
Identities = 11/39 (28%), Positives = 22/39 (56%)
Query: 783 LFDIKNDPCEKNNLADR--SEVQRI----NHYTTEVGYL 815
L+++ DP E N+LA++ +Q++ + Y E G +
Sbjct: 523 LYNLVEDPGEINDLAEKYPERLQKLLKLWDQYVLETGVI 561
Score = 39 (18.8 bits), Expect = 9.5e-18, Sum P(4) = 9.5e-18
Identities = 7/17 (41%), Positives = 13/17 (76%)
Query: 677 LFDIKNDPCEKNNLADR 693
L+++ DP E N+LA++
Sbjct: 523 LYNLVEDPGEINDLAEK 539
>TIGR_CMR|CPS_2381 [details] [associations]
symbol:CPS_2381 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 HOGENOM:HOG000014304 RefSeq:YP_269099.1
ProteinModelPortal:Q482B9 STRING:Q482B9 GeneID:3523329
KEGG:cps:CPS_2381 PATRIC:21467845 OMA:VAPKKYF
ProtClustDB:CLSK494238 BioCyc:CPSY167879:GI48-2444-MONOMER
Uniprot:Q482B9
Length = 511
Score = 175 (66.7 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
Identities = 53/132 (40%), Positives = 70/132 (53%)
Query: 42 LAFTLSMV-FVDLVASSG-PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL- 98
LAF ++ V L SS +++FI DDL ND+G +G + +PNIDALA GI
Sbjct: 21 LAFVVNSVGAAQLKKSSTLSMNVLFITIDDLN-NDLGAYGHHLVKSPNIDALAKKGIRFD 79
Query: 99 KNYYTVQLCTPSRSAIMTGKHPIHTGM----QHNVLYGCERGGLPLSEKILPQYLKELGY 154
K Y +CTPSRS+ MTG +P TG+ H + R +P LPQ K GY
Sbjct: 80 KAYSQSPMCTPSRSSFMTGLYPDQTGIIAHGSHTQMTAHFREHIP-KVTTLPQLFKNNGY 138
Query: 155 RTRIVGK-WHLG 165
+ VGK +H G
Sbjct: 139 FSGRVGKIYHQG 150
Score = 109 (43.4 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
Identities = 36/146 (24%), Positives = 67/146 (45%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
+AA+ + +D VG+V++AL+Q+ + N+I+VF+SD W K +L
Sbjct: 305 YAAVSY-VDAQVGRVLDALKQQDLSDNTIVVFLSDHGYELGQHGL-----WQ----KGSL 354
Query: 341 WEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRY 400
+EG R +I++P ++ G V V + D PTL P Y+ +++ P
Sbjct: 355 FEGSARAPLIIYAPNVKDNGRVVTSPVELVDIYPTLAKLTGLV-APEYLAG--KDLTPAL 411
Query: 401 ENSILRYENGTHEYNSPRIENSNTRY 426
+ + G + R + N ++
Sbjct: 412 NDVDFQVRKGAYSAILNRNKGDNNQF 437
Score = 59 (25.8 bits), Expect = 7.3e-18, Sum P(3) = 7.3e-18
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 783 LFDIKNDPCEKNNLADRSEVQRI 805
L+D KNDP E NLAD+ ++ +
Sbjct: 466 LYDHKNDPQELKNLADKVSLESV 488
Score = 56 (24.8 bits), Expect = 1.5e-17, Sum P(3) = 1.5e-17
Identities = 11/17 (64%), Positives = 13/17 (76%)
Query: 677 LFDIKNDPCEKNNLADR 693
L+D KNDP E NLAD+
Sbjct: 466 LYDHKNDPQELKNLADK 482
>UNIPROTKB|I3LM95 [details] [associations]
symbol:ARSD "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000027390
OMA:INIGGHE Uniprot:I3LM95
Length = 580
Score = 196 (74.1 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
Identities = 46/135 (34%), Positives = 72/135 (53%)
Query: 40 LPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
LP T+ ++ + +GP P+I+ I+ADDLG D+G +G D + P + +G
Sbjct: 45 LPGLLTVCLLLPTCASKAGPAFKPNILLIMADDLGIGDLGCYGNDTLRYPGLGLRVGAGT 104
Query: 97 ILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQ----HNVL-YGCERGGLPLSEKILPQYLK 150
L +CTPSR+A +TG+H + +G + VL + GGLP +E + L+
Sbjct: 105 RLSAXLAAAPVCTPSRAAFLTGRHALRSGRWKGDGYRVLRWNGGSGGLPQNETTFARILQ 164
Query: 151 ELGYRTRIVGKWHLG 165
GY T ++GKWH G
Sbjct: 165 RQGYATGLIGKWHQG 179
Score = 86 (35.3 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
Identities = 30/135 (22%), Positives = 57/135 (42%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
+ ++D VG ++ A+E+ + + ++ F SD W RG K
Sbjct: 312 VEEMDGLVGDILNAIEEHGLKNTTLTYFTSDHGGHLEAIDGHVQLGGWNGIYRGGKGMGG 371
Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
WEGG+R G+ W +L + G V ++ + D PT++ +P +++P
Sbjct: 372 WEGGIRVPGIFRWPGVLPA-GRVIQEPTSLMDVFPTVVQLGG-GQVPQDRVIDGRSLVPL 429
Query: 400 YENSILRYENGTHEY 414
+ E+ HE+
Sbjct: 430 LQGET---EHSAHEF 441
Score = 52 (23.4 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E LA SE
Sbjct: 500 PLLFDLSGDPSEAQPLAPGSE 520
Score = 52 (23.4 bits), Expect = 7.9e-17, Sum P(3) = 7.9e-17
Identities = 11/21 (52%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E LA SE
Sbjct: 500 PLLFDLSGDPSEAQPLAPGSE 520
>UNIPROTKB|F5H324 [details] [associations]
symbol:ARSE "Arylsulfatase E" species:9606 "Homo sapiens"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484 EMBL:AC005295
HGNC:HGNC:719 IPI:IPI01015579 ProteinModelPortal:F5H324 SMR:F5H324
Ensembl:ENST00000540563 UCSC:uc011mhi.2 ArrayExpress:F5H324
Bgee:F5H324 Uniprot:F5H324
Length = 544
Score = 193 (73.0 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
Identities = 43/110 (39%), Positives = 62/110 (56%)
Query: 85 TPNIDALAYSGIILKNYYTV-QLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-----GGL 138
TPNID LA G+ L + + LCTPSR+A +TG++P+ +GM ++ Y + GGL
Sbjct: 18 TPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGASGGL 77
Query: 139 PLSEKILPQYLKELGYRTRIVGKWHLGFYKKE-----YTPTFRGFESHLG 183
P +E + LKE GY T ++GKWHLG + + P GF+ G
Sbjct: 78 PTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFDHFYG 127
Score = 85 (35.0 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
Identities = 25/107 (23%), Positives = 50/107 (46%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX-SNWP--LRGVKNTL- 340
+ ++D VG++++ L+ + ++++I F SD W +G K
Sbjct: 278 VEEMDWMVGRILDTLDVEGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGG 337
Query: 341 WEGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
WEGG+R G+ W +L + ++ E + D PT++ A ++P
Sbjct: 338 WEGGIRVPGIFRWPGVLPAGRVIGEP-TSLMDVFPTVVRLAG-GEVP 382
Score = 49 (22.3 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 675 PCLFDIKNDPCEKNNLADRSE 695
P LFD+ DP E + L SE
Sbjct: 466 PLLFDLSRDPSETHILTPASE 486
Score = 49 (22.3 bits), Expect = 3.1e-16, Sum P(3) = 3.1e-16
Identities = 10/21 (47%), Positives = 12/21 (57%)
Query: 781 PCLFDIKNDPCEKNNLADRSE 801
P LFD+ DP E + L SE
Sbjct: 466 PLLFDLSRDPSETHILTPASE 486
>UNIPROTKB|I3LUP9 [details] [associations]
symbol:ARSA "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016021 "integral to membrane" evidence=IEA]
[GO:0007339 "binding of sperm to zona pellucida" evidence=IEA]
[GO:0005886 "plasma membrane" evidence=IEA] [GO:0005509 "calcium
ion binding" evidence=IEA] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005509
GO:GO:0007339 Gene3D:3.40.720.10 SUPFAM:SSF53649
GeneTree:ENSGT00560000076940 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 Ensembl:ENSSSCT00000031954
OMA:GFDENTI Uniprot:I3LUP9
Length = 486
Score = 231 (86.4 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
Identities = 86/291 (29%), Positives = 134/291 (46%)
Query: 44 FTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT 103
+ L++ +A++ PP+I+ I ADDLG+ D+G +G TPN+D LA G+ ++Y
Sbjct: 5 WALTLALASGLAATSPPNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYV 64
Query: 104 -VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKW 162
V LCTPSR+A++TG+ P+ G LY P +E + + GY T + GKW
Sbjct: 65 PVSLCTPSRAALLTGRLPVRMG-----LY-------PGAEVLAAR-----GYLTGMAGKW 107
Query: 163 HLGFYKK-EYTPTFRGFESHLGYWTGHQD-------YFDHSA--EEMKMWGL-------D 205
HLG + + P GF LG H F S + GL +
Sbjct: 108 HLGVGPEGAFLPPHXGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSCDQGLVPVPLLAN 167
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTD-EPLFLYLAHAATHSANPYEPLQAPD 264
+ + +P W L G + + A A D++ + P FLY A TH Y P Q
Sbjct: 168 LSVEAQPPW-LPGLEAR--YVAFARDLMADAQRQGRPFFLYYASHHTH----Y-P-QFSG 218
Query: 265 HYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+ H R F L +LD +VG ++ A+ +L ++++F +D
Sbjct: 219 QSFSGHSG-----RGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTAD 264
Score = 47 (21.6 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 675 PCLFDIKNDPCEKNNL 690
P LFD+ DP E NL
Sbjct: 405 PLLFDLSEDPGENYNL 420
Score = 47 (21.6 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
Identities = 9/16 (56%), Positives = 10/16 (62%)
Query: 781 PCLFDIKNDPCEKNNL 796
P LFD+ DP E NL
Sbjct: 405 PLLFDLSEDPGENYNL 420
>UNIPROTKB|P95059 [details] [associations]
symbol:atsA "POSSIBLE ARYLSULFATASE ATSA (ARYL-SULFATE
SULPHOHYDROLASE) (ARYLSULPHATASE)" species:1773 "Mycobacterium
tuberculosis" [GO:0005886 "plasma membrane" evidence=IDA]
[GO:0010033 "response to organic substance" evidence=IEP]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005886 GenomeReviews:AL123456_GR GO:GO:0010033
EMBL:BX842574 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065 HSSP:P15289 KO:K01130
HOGENOM:HOG000042725 EMBL:AL123456 PIR:B70643 RefSeq:NP_215225.1
RefSeq:YP_006514055.1 ProteinModelPortal:P95059 SMR:P95059
PRIDE:P95059 EnsemblBacteria:EBMYCT00000001675 GeneID:13318600
GeneID:888394 KEGG:mtu:Rv0711 KEGG:mtv:RVBD_0711 PATRIC:18150088
TubercuList:Rv0711 OMA:FAGFLEH ProtClustDB:CLSK790691
Uniprot:P95059
Length = 787
Score = 196 (74.1 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
Identities = 66/223 (29%), Positives = 104/223 (46%)
Query: 54 VASSGPPHIIFILADDLG---WNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPS 110
VA P+I++++ DD+G W+ G GL ++P + +A G+ L ++T LC+P+
Sbjct: 31 VAPEHSPNILYLVWDDVGIATWDCFG--GLVEMPA--MTRVAERGVRLSQFHTTALCSPT 86
Query: 111 RSAIMTGKHPIHTGMQ--HNVLYG---CERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
R++++TG++ GM G C G +P +LP+ L E GY T VGKWHL
Sbjct: 87 RASLLTGRNATTVGMATIEEFTDGFPNCN-GRIPADTALLPEVLAEHGYNTYCVGKWHLT 145
Query: 166 FYK-------KEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRD---LEPAWD 215
+ K + PT RGFE G+ G D W D+ D + P
Sbjct: 146 PLEESNMASTKRHWPTSRGFERFYGFLGGETD----------QWYPDLVYDNHPVSPPGT 195
Query: 216 LHGKY--STDVFTAEAVDIIHNHST---DEPLFLYLAHAATHS 253
G Y S D+ + ++ I + D+P F Y+ A H+
Sbjct: 196 PEGGYHLSKDI-ADKTIEFIRDAKVIAPDKPWFSYVCPGAGHA 237
Score = 73 (30.8 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
Identities = 14/35 (40%), Positives = 22/35 (62%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
FA L D +G++++ LE+ L N+IIV +SD
Sbjct: 324 FAGFLSYTDAQIGRILDYLEESGQLDNTIIVVISD 358
Score = 62 (26.9 bits), Expect = 6.9e-16, Sum P(3) = 6.9e-16
Identities = 13/36 (36%), Positives = 21/36 (58%)
Query: 342 EGGVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTL 376
EGG+ +I W + + G + + YV+VSD PT+
Sbjct: 425 EGGIADPAIISWPNGIAAHGEIRDNYVNVSDITPTV 460
Score = 37 (18.1 bits), Expect = 2.4e-13, Sum P(3) = 2.4e-13
Identities = 8/19 (42%), Positives = 11/19 (57%)
Query: 877 VAPINKPFDKGGDPKNFDH 895
VA K FD G P+ ++H
Sbjct: 384 VAESMKLFDHLGGPQTYNH 402
>UNIPROTKB|F1S048 [details] [associations]
symbol:F1S048 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00560000077076 EMBL:FP104542
Ensembl:ENSSSCT00000018625 OMA:MAPRDFA Uniprot:F1S048
Length = 142
Score = 205 (77.2 bits), Expect = 2.3e-15, P = 2.3e-15
Identities = 39/72 (54%), Positives = 52/72 (72%)
Query: 54 VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSA 113
V +S PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS
Sbjct: 74 VTASSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 132
Query: 114 IMTGKHPIHTGM 125
+TGK+ H+G+
Sbjct: 133 FITGKY--HSGI 142
>UNIPROTKB|D6RGC1 [details] [associations]
symbol:ARSJ "Arylsulfatase J" species:9606 "Homo sapiens"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0008484 EMBL:AC104779 HGNC:HGNC:26286
ChiTaRS:ARSJ IPI:IPI00966139 ProteinModelPortal:D6RGC1 SMR:D6RGC1
Ensembl:ENST00000509829 HOGENOM:HOG000172533 ArrayExpress:D6RGC1
Bgee:D6RGC1 Uniprot:D6RGC1
Length = 133
Score = 194 (73.4 bits), Expect = 3.4e-14, P = 3.4e-14
Identities = 36/63 (57%), Positives = 46/63 (73%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIM 115
S+ PH+IFILADD G+ DVG+HG +I TP +D LA G+ L+NYY +CTPSRS +
Sbjct: 72 STSQPHLIFILADDQGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 130
Query: 116 TGK 118
TGK
Sbjct: 131 TGK 133
>UNIPROTKB|D6RDH0 [details] [associations]
symbol:ARSI "Arylsulfatase I" species:9606 "Homo sapiens"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484
HOGENOM:HOG000135354 HGNC:HGNC:32521 EMBL:AC011372 IPI:IPI00967848
ProteinModelPortal:D6RDH0 SMR:D6RDH0 Ensembl:ENST00000509146
ArrayExpress:D6RDH0 Bgee:D6RDH0 Uniprot:D6RDH0
Length = 86
Score = 191 (72.3 bits), Expect = 7.2e-14, P = 7.2e-14
Identities = 37/86 (43%), Positives = 53/86 (61%)
Query: 158 IVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDH-SAEEMKMWGLDMRRDLEPAWDL 216
+VGKWHLGFY+KE PT RGF++ LG TG+ DY+ + + + + G D+ AW L
Sbjct: 1 MVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPGVCGFDLHEGENVAWGL 60
Query: 217 HGKYSTDVFTAEAVDIIHNHSTDEPL 242
G+YST ++ A I+ +HS PL
Sbjct: 61 SGQYSTMLYAQRASHILASHSPQRPL 86
>UNIPROTKB|H3BP66 [details] [associations]
symbol:GALNS "N-acetylgalactosamine-6-sulfatase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
EMBL:AC092384 InterPro:IPR024607 PROSITE:PS00149 GO:GO:0008484
HGNC:HGNC:4122 ChiTaRS:Galns Ensembl:ENST00000562831 Bgee:H3BP66
Uniprot:H3BP66
Length = 170
Score = 188 (71.2 bits), Expect = 1.5e-13, P = 1.5e-13
Identities = 53/157 (33%), Positives = 83/157 (52%)
Query: 110 SRSAIMTGKHPIHTGMQ----H-NVLYGCER--GGLPLSEKILPQYLKELGYRTRIVGKW 162
+R+A++TG+ PI G H Y + GG+P SE++LP+ LK+ GY ++IVGKW
Sbjct: 10 ARAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKW 69
Query: 163 HLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEE----MKMWGLDMRRDLEPAWDLH- 217
HLG ++ ++ P GF+ G H +D+ A + W + R E +L
Sbjct: 70 HLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKT 128
Query: 218 GKYS-TDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
G+ + T ++ EA+D I + P FLY A ATH+
Sbjct: 129 GEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA 165
>UNIPROTKB|Q2KEF7 [details] [associations]
symbol:MGCH7_ch7g1079 "Putative uncharacterized protein"
species:242507 "Magnaporthe oryzae 70-15" [GO:0005575
"cellular_component" evidence=ND] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0008484 EMBL:CM000230
ProteinModelPortal:Q2KEF7 SMR:Q2KEF7 Uniprot:Q2KEF7
Length = 480
Score = 216 (81.1 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
Identities = 88/284 (30%), Positives = 133/284 (46%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAY--SGIILKNYYTVQLCTPSRSAIMTG 117
P + I+ADDLG++D G +I TPN+ L +G +L N++T C+P+RS + +G
Sbjct: 9 PKFLIIVADDLGYSDTSPFG-GEINTPNLARLVSDGNGRLLTNFHTASACSPTRSMLFSG 67
Query: 118 --KHPIHTG-MQHNV-----LY----GCERGGLPLSEKILPQYLKELGYRTRIVGKWHLG 165
H G M N+ LY G E G L L + ++ GY+T + GKWHLG
Sbjct: 68 TDNHIAGLGQMAENMRAHADLYRDKPGYE-GYLNFRVAALSEVFQDAGYQTLMTGKWHLG 126
Query: 166 FYKKEYTPTFRGFE-SHLGYWTG-HQDY-FDHSAEE-----------MKMWGLDMRRDLE 211
+E +P RGFE SH+ + +G H Y F+ E+ K W ++ R L+
Sbjct: 127 L-TRETSPHARGFERSHV-FLSGCHNHYNFEPQLEDPAHGLGDVISQAKFW-MEDDRFLD 183
Query: 212 PAWDLHGK-YSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNI 269
DL YS+ + + + + +D P F YL A P+ PLQAP +
Sbjct: 184 RTKDLPKDFYSSTFYGNKMAQYLRERAGSDRPFFAYLPFTA-----PHWPLQAPADLVAK 238
Query: 270 HRHIEDFKRSKFAAI-LHKLDESVGKVVEALEQRRMLSNSIIVF 312
++ + D S A L +L E +G V E M+ I V+
Sbjct: 239 YKGVYDDGPSALRARRLERLVE-LGIVKAGTEPAPMVGRKIRVW 281
Score = 38 (18.4 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
Identities = 18/54 (33%), Positives = 22/54 (40%)
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDI 385
P RG K + GG+R ++ P RG Q S PT S A DI
Sbjct: 388 PSRGFKTWITGGGIRCPCIVRYPG-SGRG--QAQSREKSQPTPTTDSFATVMDI 438
>WB|WBGene00006309 [details] [associations]
symbol:sul-2 species:6239 "Caenorhabditis elegans"
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
GeneTree:ENSGT00560000076940 HOGENOM:HOG000135352
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
HSSP:P15289 EMBL:FO080993 PIR:T29618 RefSeq:NP_505102.1
ProteinModelPortal:Q18924 SMR:Q18924 PaxDb:Q18924
EnsemblMetazoa:D1014.1 GeneID:179194 KEGG:cel:CELE_D1014.1
UCSC:D1014.1 CTD:179194 WormBase:D1014.1 InParanoid:Q18924
OMA:HITHHEP NextBio:904322 Uniprot:Q18924
Length = 452
Score = 208 (78.3 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
Identities = 46/128 (35%), Positives = 69/128 (53%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ ++ DDLG+ D+ +G +D +A G Y+ +C+PSR+ +TG+
Sbjct: 33 PNIVILMIDDLGYGDIASYGHPTQEYTQVDRMAAEGTRFTQAYSADSMCSPSRAGFITGR 92
Query: 119 HPIHTGMQ--HNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYT---- 172
PI G+ V + GGLP SE + + L+E GY T +VGKWHLG + T
Sbjct: 93 LPIRLGIVGGRRVFVPYDIGGLPKSETTMAEMLQEAGYATGMVGKWHLGINENNATDGAH 152
Query: 173 -PTFRGFE 179
P+ RGFE
Sbjct: 153 LPSKRGFE 160
Score = 42 (19.8 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 675 PCLFDIKNDPCEKNNLADRSEDQRI 699
P +FD+ DP E+ L + + Q I
Sbjct: 353 PLVFDLIRDPYEQYPLQNTVKSQEI 377
Score = 40 (19.1 bits), Expect = 6.7e-13, Sum P(2) = 6.7e-13
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 781 PCLFDIKNDPCEKNNLADRSEVQRI 805
P +FD+ DP E+ L + + Q I
Sbjct: 353 PLVFDLIRDPYEQYPLQNTVKSQEI 377
>FB|FBgn0038660 [details] [associations]
symbol:CG14291 species:7227 "Drosophila melanogaster"
[GO:0016250 "N-sulfoglucosamine sulfohydrolase activity"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 EMBL:AE014297 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
HSSP:P15289 KO:K01565 OMA:RDPHETQ GO:GO:0016250 EMBL:AY071569
RefSeq:NP_650760.1 UniGene:Dm.5859 SMR:Q9VE24 STRING:Q9VE24
EnsemblMetazoa:FBtr0083724 GeneID:42266 KEGG:dme:Dmel_CG14291
UCSC:CG14291-RA FlyBase:FBgn0038660 GeneTree:ENSGT00390000013080
InParanoid:Q9VE24 OrthoDB:EOG49ZW4K GenomeRNAi:42266 NextBio:827964
Uniprot:Q9VE24
Length = 524
Score = 148 (57.2 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 39/115 (33%), Positives = 66/115 (57%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYYT-VQLCTPSRSA 113
S+GP +++ +LADD G+ + L++ TPN+DALA G++ N +T V C+PSRS
Sbjct: 17 SAGPQNVLLLLADDAGFESGAY--LNKFCQTPNLDALAKRGLLFNNAFTSVSSCSPSRSQ 74
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKEL-GYR--TRIVGKWHLG 165
++TG+ +GM + + G + LP +++ G R + I+GK H+G
Sbjct: 75 LLTGQAGHSSGM-YGLHQGVHNFNVLPDTGSLPNLIRDQSGGRILSGIIGKKHVG 128
Score = 83 (34.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 38/149 (25%), Positives = 67/149 (44%)
Query: 275 DFKRSKFAA---ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNW 331
D R + AA + +LD+ VG +++ LE + +++++ SD +
Sbjct: 233 DVVRQELAAQYMTISRLDQGVGLMLKELEAAGVADQTLVIYTSD-------------NGP 279
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQ-YVHVSDWLPTLLSAANKSDIPNYVN 390
P G + L+E G+R +I SP E R A V + D P+++ A + PN
Sbjct: 280 PFPGGRTNLYEHGIRSPLIISSPNKEDRHHEATAAMVSLLDIYPSVMDAL-QIPRPNDTK 338
Query: 391 STVENIIP--RYENSILRYEN--GTHEYN 415
+I+P R E I ++ G+H Y+
Sbjct: 339 IVGRSILPVLREEPPIKESDSVFGSHSYH 367
Score = 62 (26.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 677 LFDIKNDPCEKNNLADRSE 695
L+DIK DP E+ NLAD+++
Sbjct: 437 LYDIKTDPLERFNLADKAK 455
Score = 62 (26.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 783 LFDIKNDPCEKNNLADRSE 801
L+DIK DP E+ NLAD+++
Sbjct: 437 LYDIKTDPLERFNLADKAK 455
>UNIPROTKB|P31447 [details] [associations]
symbol:yidJ "putative sulfatase" species:83333 "Escherichia
coli K-12" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:U00096 EMBL:AP009048
GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0046872
EMBL:L10328 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
OMA:RKDWENT KO:K01138 PIR:G65169 RefSeq:NP_418134.1
RefSeq:YP_491756.1 ProteinModelPortal:P31447 SMR:P31447
PRIDE:P31447 EnsemblBacteria:EBESCT00000001975
EnsemblBacteria:EBESCT00000016174 GeneID:12932459 GeneID:948188
KEGG:ecj:Y75_p3496 KEGG:eco:b3678 PATRIC:32122847 EchoBASE:EB1656
EcoGene:EG11705 HOGENOM:HOG000126316 ProtClustDB:CLSK880765
BioCyc:EcoCyc:EG11705-MONOMER BioCyc:ECOL316407:JW3654-MONOMER
Genevestigator:P31447 Uniprot:P31447
Length = 497
Score = 190 (71.9 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 62/218 (28%), Positives = 96/218 (44%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+ +F++ D N VG + + T NID+LA GI + YT +CTP+R+ + TG
Sbjct: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR-G 177
+ +G N + G +S + +Y K+ GY T +GKWHL + +Y T
Sbjct: 64 YANQSGPWTNNV----APGKNIST--MGRYFKDAGYHTCYIGKWHLDGH--DYFGTGECP 115
Query: 178 FESHLGYWTGHQDYFDHSAE-EMKMWGLDMRRDLEPAWDLHGKYSTDVFT------AEAV 230
E YW +Y E E+ +W R L DL + + FT AV
Sbjct: 116 PEWDADYWFDGANYLSELTEKEISLW----RNGLNSVEDLQANHIDETFTWAHRISNRAV 171
Query: 231 DIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYL 267
D + + DEP + +++ P+ P P YL
Sbjct: 172 DFLQQPARADEPFLMVVSYD-----EPHHPFTCPVEYL 204
Score = 49 (22.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 288 LDESVGKVVEAL--EQRRMLSNSIIVFVSD 315
+D+ +G+V+ AL EQR N+ +++ SD
Sbjct: 258 VDDQIGRVINALTPEQRE---NTWVIYTSD 284
Score = 49 (22.3 bits), Expect = 1.8e-12, Sum P(3) = 1.8e-12
Identities = 14/41 (34%), Positives = 21/41 (51%)
Query: 677 LFDIKNDPCEKNNLAD--RSEDQRINHYTTEVGRFNQIAYP 715
L+D +NDP E +NL D R D R + + ++I P
Sbjct: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441
Score = 48 (22.0 bits), Expect = 2.3e-12, Sum P(3) = 2.3e-12
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 783 LFDIKNDPCEKNNLAD 798
L+D +NDP E +NL D
Sbjct: 401 LYDRRNDPNEMHNLID 416
>ZFIN|ZDB-GENE-050107-5 [details] [associations]
symbol:gnsa "glucosamine (N-acetyl)-6-sulfatase a"
species:7955 "Danio rerio" [GO:0030203 "glycosaminoglycan metabolic
process" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
activity" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 ZFIN:ZDB-GENE-050107-5 GO:GO:0005764
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0030203
HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:BC097128 IPI:IPI00499007
RefSeq:NP_001025379.1 UniGene:Dr.84802 ProteinModelPortal:Q4V902
STRING:Q4V902 GeneID:566506 KEGG:dre:566506 CTD:566506
InParanoid:Q4V902 NextBio:20888220 ArrayExpress:Q4V902
Uniprot:Q4V902
Length = 538
Score = 175 (66.7 bits), Expect = 2.6e-11, Sum P(2) = 2.6e-11
Identities = 71/253 (28%), Positives = 111/253 (43%)
Query: 34 IMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-A 92
++ ++ + TL V + ++ P+I+ IL DDL DV G+ IP L
Sbjct: 7 VLLHCIIVICVTLHCVNLAAAKTNPKPNIVLILTDDL---DVSIGGM--IPLVKTKKLIG 61
Query: 93 YSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CERGGLPLSEK--ILPQY 148
+GI N + LC PSR++I+TGK+P + + +N L G C ++ P +
Sbjct: 62 DAGITFTNAFVASPLCCPSRASILTGKYPHNHHVVNNTLEGNCSSTAWQKGQEPDAFPAF 121
Query: 149 L-KELGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
L K Y+T GK Y EY G H+ H + +++ + L +
Sbjct: 122 LQKHAAYQTFFAGK-----YLNEYGSKKAGGVEHVPLGWDHWFALERNSKYYN-YTLSVN 175
Query: 208 -RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS---ANP-YE---P 259
R + Y TDV ++D + N S P F+ ++ A HS A P Y+ P
Sbjct: 176 GRAQRHGQNYSEDYLTDVLANVSIDFLENKSNRRPFFMMVSTPAPHSPWTAAPQYDSSFP 235
Query: 260 -LQAP-DHYLNIH 270
L+AP D NIH
Sbjct: 236 DLKAPRDPNFNIH 248
Score = 63 (27.2 bits), Expect = 2.6e-11, Sum P(2) = 2.6e-11
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
+++ R ++ +L +D+ V K+V L+ R LSN+ ++F SD
Sbjct: 271 LDNAYRKRWRTLL-SVDDLVEKLVRKLDIRGELSNTYVIFTSD 312
>TIGR_CMR|CPS_0841 [details] [associations]
symbol:CPS_0841 "arylsulfatase" species:167879 "Colwellia
psychrerythraea 34H" [GO:0004065 "arylsulfatase activity"
evidence=ISS] [GO:0006790 "sulfur compound metabolic process"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 GO:GO:0004065 KO:K01130 HOGENOM:HOG000135353
RefSeq:YP_267590.1 ProteinModelPortal:Q488C5 STRING:Q488C5
GeneID:3522242 KEGG:cps:CPS_0841 PATRIC:21464977 OMA:SSRIMEV
BioCyc:CPSY167879:GI48-927-MONOMER Uniprot:Q488C5
Length = 584
Score = 170 (64.9 bits), Expect = 4.6e-11, Sum P(2) = 4.6e-11
Identities = 49/178 (27%), Positives = 89/178 (50%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAI 114
A + P+I+ ++ADD + D+G +G ++ TPN++ +A +GI N++ +C+ +RS +
Sbjct: 32 ADAKKPNILLLVADDTAFGDIGAYG-SEVHTPNMNEIANAGIRFTNFHVSPVCSVTRSML 90
Query: 115 MTGKHPIHTGM---QHNVLYGCERG-----GLPLSEKI-LPQYLKELGYRTRIVGKWHLG 165
TG I G+ ++V Y RG G + + + + L + GY GKWHLG
Sbjct: 91 FTGNDNIEVGLGSFDYSV-YPATRGKKGYEGYLTKDAVTISELLNDDGYEVYKSGKWHLG 149
Query: 166 FYKKEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMW---GLDMRRDLEPAWDLHGK 219
+ P GF G +G ++++ A GL+++R + W L+G+
Sbjct: 150 GEESGGKGPLEWGFTKEFGILSGGSNHWNDLAMTPNFKDPNGLNVKR--KENWTLNGE 205
Score = 67 (28.6 bits), Expect = 4.6e-11, Sum P(2) = 4.6e-11
Identities = 16/71 (22%), Positives = 38/71 (53%)
Query: 246 LAHAATHSANPYEPLQAPDHYLNIHRHIEDFK-RSKFAAILHKLDESVGKVVEALEQRRM 304
++H AT + P+ L L+ + K + +AA++ D +G++++ L +
Sbjct: 286 ISHEATEA--PFNNLTKKWQDLSQENKEKQAKIMATYAAMIEDQDNRIGQILDYLRESGQ 343
Query: 305 LSNSIIVFVSD 315
L N+++V+++D
Sbjct: 344 LDNTLVVYMTD 354
>UNIPROTKB|F1NGI6 [details] [associations]
symbol:SGSH "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AADN02053526
IPI:IPI00570654 Ensembl:ENSGALT00000011369 OMA:CYNPAVS
Uniprot:F1NGI6
Length = 119
Score = 162 (62.1 bits), Expect = 9.1e-11, P = 9.1e-11
Identities = 42/115 (36%), Positives = 66/115 (57%)
Query: 58 GPP--HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAI 114
G P +++ +LADD G+ G + I TPN+DALA G++ +N +T V C+PSR+++
Sbjct: 5 GAPARNVLLLLADDGGFES-GAYNNSAIRTPNLDALARRGLLFQNAFTSVSSCSPSRASV 63
Query: 115 MTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYLKELGYRTRIVGKWHLG 165
+TG P H N +YG +G + + LP L++ RT I+GK H+G
Sbjct: 64 LTGL-PQH----QNGMYGLHQGVHHFNSFDAVRSLPGLLRQANIRTGIIGKKHVG 113
>UNIPROTKB|O65931 [details] [associations]
symbol:atsB "Arylsulfatase" species:83332 "Mycobacterium
tuberculosis H37Rv" [GO:0005829 "cytosol" evidence=IDA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005829 GenomeReviews:AL123456_GR EMBL:BX842582
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 HSSP:P15289 KO:K01130
EMBL:CP003248 PIR:E70533 RefSeq:NP_217816.1 RefSeq:YP_006516776.1
ProteinModelPortal:O65931 PRIDE:O65931
EnsemblBacteria:EBMYCT00000000058 GeneID:13318122 GeneID:887500
KEGG:mtu:Rv3299c KEGG:mtv:RVBD_3299c PATRIC:18155953
TubercuList:Rv3299c HOGENOM:HOG000042725 OMA:EIMGSRA
ProtClustDB:CLSK792415 InterPro:IPR009200 Pfam:PF06897
Uniprot:O65931
Length = 970
Score = 167 (63.8 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
Identities = 61/215 (28%), Positives = 100/215 (46%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQLCTPSRSAIMTGKH 119
P+++ +L DD G+ G I TP + LA +G+I ++ +C+P+R+A++TG++
Sbjct: 212 PNVLIVLIDDAGFGGPDTFG-GAIRTPTLSRLAQNGLIYNRFHVTAVCSPTRAALLTGRN 270
Query: 120 PIHTGMQHNVLYG--CERGG--------LPLSEKILPQYLKELGYRTRIVGKWHL----- 164
H H V +G CE G P S LP+ L++ GY T GKWHL
Sbjct: 271 --H----HRVGFGSVCEFPGPYPGYSAVRPRSCAALPRILRDNGYVTGAFGKWHLTPDNV 324
Query: 165 -GFYKK-EYTPTFRGFESHLGYWTGHQDYFDHS-AEEMKMWGLDMRRDLEPAWDLHGKYS 221
G + P GF+ G+ +G +D +++ + G+ E D Y
Sbjct: 325 QGAAGPFDNWPLGWGFDHFWGFPSGAAGQYDPIISQDNSVIGIPEGSG-E---DGRPYYF 380
Query: 222 TDVFTAEAVDIIHN---HSTDEPLFLYLAHAATHS 253
D T +A++ +H + +P LY A ATH+
Sbjct: 381 PDDLTDKAIEWLHTVRAQNATKPWMLYYATGATHA 415
Score = 72 (30.4 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
Identities = 41/185 (22%), Positives = 73/185 (39%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLR-GVKNTLWEG 343
L+ LD + +E +EQ I + SN PL+ G + G
Sbjct: 540 LNGLDLDAERQLELIEQY----GGIAALGDEFTAPHFASAWAHASNTPLQWGKQMASHLG 595
Query: 344 GVRGAGLI-WSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYEN 402
G R ++ W + G V Q+ H D PT+L+A + P +V+ + P
Sbjct: 596 GTRDPLVVAWPARIRPDGRVRSQFTHCIDIAPTVLAAIGLPE-PTHVDGFEQE--PMDGT 652
Query: 403 SILR-YENGTHE--YNSPRIENSNTR--YENGTHEYNPKYENRYENGTHEYNPKYENRYE 457
S +R +++ E + EN +R Y++G R + + +P+ R+
Sbjct: 653 SFVRTFDDAEAEDRHTVQYFENFGSRAIYKDGWWACA-----RLDKAPWDLSPETMRRFA 707
Query: 458 NGTHE 462
GT++
Sbjct: 708 PGTYD 712
Score = 47 (21.6 bits), Expect = 4.6e-08, Sum P(2) = 4.6e-08
Identities = 8/33 (24%), Positives = 19/33 (57%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFV 313
FA D +VG++++A+E N+++ ++
Sbjct: 486 FAGFSENADWNVGRLLDAIEDLGESDNTLVFYI 518
>UNIPROTKB|P51688 [details] [associations]
symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0016250 "N-sulfoglucosamine sulfohydrolase
activity" evidence=IEA] [GO:0006029 "proteoglycan metabolic
process" evidence=TAS] [GO:0003824 "catalytic activity"
evidence=TAS] [GO:0005975 "carbohydrate metabolic process"
evidence=TAS] [GO:0006027 "glycosaminoglycan catabolic process"
evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
[GO:0044281 "small molecule metabolic process" evidence=TAS]
Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Reactome:REACT_116125 GO:GO:0003824
GO:GO:0044281 GO:GO:0046872 GO:GO:0005975 GO:GO:0043202
GO:GO:0006027 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GO:GO:0006029 EMBL:U30894 EMBL:U60111 EMBL:U60107 EMBL:U60108
EMBL:U60109 EMBL:U60110 EMBL:AK291257 EMBL:BC047318 IPI:IPI00019988
RefSeq:NP_000190.1 UniGene:Hs.31074 ProteinModelPortal:P51688
SMR:P51688 IntAct:P51688 STRING:P51688 PhosphoSite:P51688
DMDM:1711493 PaxDb:P51688 PRIDE:P51688 Ensembl:ENST00000326317
GeneID:6448 KEGG:hsa:6448 UCSC:uc002jxz.4 CTD:6448
GeneCards:GC17M078183 HGNC:HGNC:10818 HPA:HPA023436 HPA:HPA023451
MIM:252900 MIM:605270 neXtProt:NX_P51688 Orphanet:79269
PharmGKB:PA35726 HOGENOM:HOG000234731 HOVERGEN:HBG012598
InParanoid:P51688 KO:K01565 OMA:RDPHETQ OrthoDB:EOG4RXZ01
PhylomeDB:P51688 ChiTaRS:SGSH GenomeRNAi:6448 NextBio:25061
ArrayExpress:P51688 Bgee:P51688 CleanEx:HS_SGSH
Genevestigator:P51688 GermOnline:ENSG00000181523 GO:GO:0016250
Uniprot:P51688
Length = 502
Score = 154 (59.3 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
Identities = 42/130 (32%), Positives = 72/130 (55%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
P+ +++ V + + P + + +LADD G+ G + I TP++DALA ++ +N
Sbjct: 4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62
Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYR 155
+T V C+PSR++++TG P H N +YG + + +K+ LP L + G R
Sbjct: 63 AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117
Query: 156 TRIVGKWHLG 165
T I+GK H+G
Sbjct: 118 TGIIGKKHVG 127
Score = 77 (32.2 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
Identities = 27/92 (29%), Positives = 44/92 (47%)
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
++D+ VG V++ L +L++++++F SD +P G N W G
Sbjct: 245 RMDQGVGLVLQELRDAGVLNDTLVIFTSDNGIP-----------FP-SGRTNLYWPGTAE 292
Query: 347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
L+ SP R G V+E YV + D PT+L
Sbjct: 293 PL-LVSSPEHPKRWGQVSEAYVSLLDLTPTIL 323
Score = 41 (19.5 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 239 DEPLFLYLAHAATHSANPYEP 259
D P FLY+A H +P
Sbjct: 168 DRPFFLYVAFHDPHRCGHSQP 188
Score = 38 (18.4 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
Identities = 8/15 (53%), Positives = 9/15 (60%)
Query: 677 LFDIKNDPCEKNNLA 691
L+D DP E NLA
Sbjct: 438 LYDRSRDPHETQNLA 452
Score = 38 (18.4 bits), Expect = 1.8e-10, Sum P(4) = 1.8e-10
Identities = 8/15 (53%), Positives = 9/15 (60%)
Query: 783 LFDIKNDPCEKNNLA 797
L+D DP E NLA
Sbjct: 438 LYDRSRDPHETQNLA 452
>ASPGD|ASPL0000046382 [details] [associations]
symbol:AN11149 species:162425 "Emericella nidulans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012083
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF000972 GO:GO:0018958 EMBL:BN001307 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
HOGENOM:HOG000169239 ProteinModelPortal:C8VLL2
EnsemblFungi:CADANIAT00007963 OMA:TENDPAN Uniprot:C8VLL2
Length = 565
Score = 143 (55.4 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
Identities = 64/249 (25%), Positives = 109/249 (43%)
Query: 40 LPLAFTLSMVFVDLVASSGP---PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
+ L+ +++V V ++ + P P+ +F+ DD D+ + ++ +P + G+
Sbjct: 1 MKLSSLVALVGVSALSEASPRPKPNFVFVFTDD---QDLTMNSVEYMPHV-AGRIRDRGL 56
Query: 97 ILKNYY-TVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCER-GGLP------LSEKILPQY 148
N++ T LC PSR ++ TG+ +T NV + GG P +E P +
Sbjct: 57 DFTNHFVTTALCCPSRVSLWTGRQAHNT----NVTWVAPPYGGYPKFVSQGFNEDWFPLW 112
Query: 149 LKELGYRTRIVGKWHLGFYKKEYT-PTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMR 207
L++ GY T VGK Y P +GF + D F +S W +
Sbjct: 113 LQDAGYNTYYVGKLFNAHSVTTYNNPFVKGFNGS-DFLL---DPFTYS-----YWNSSYQ 163
Query: 208 RDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDE--PLFLYLAHAATH-------SAN--P 256
R+ E G+Y+TDV +A+ + + D+ P FL +A A H S++ P
Sbjct: 164 RNHEAPKSYAGQYTTDVTEEKALGFVDDALEDKERPFFLTVAPIAPHFEQDPGHSSDTPP 223
Query: 257 YEPLQAPDH 265
P+ AP H
Sbjct: 224 QAPIPAPRH 232
Score = 88 (36.0 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
Identities = 51/188 (27%), Positives = 83/188 (44%)
Query: 274 EDF-KRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
EDF R + A L +DE V K+++ LE+ L+N+ +++ SD +
Sbjct: 272 EDFFYRQRLRA-LQSVDEMVDKLLDRLERSGQLNNTYVIYSSDNGFHI--------GHHR 322
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPT---LLSAANKSDIPNYV 389
L K+T +E +R I P ++S G V + H+ D+ PT LL +SD
Sbjct: 323 LPPGKSTSYEEDIRVPFFIRGPGIKSGGKVTQVTTHI-DFAPTIFELLGLPPRSDFDGTP 381
Query: 390 NSTVEN--IIPRYENSILRYENG-----THEYNSPRIENSNTRYENGTHEYNPKYENRYE 442
+++ IP +E+ I+ Y T N+ R+ N T Y++ + KY Y
Sbjct: 382 MRIMKDSAAIP-HEHVIVEYWGQALMMVTAPTNTDRMPN--TTYKS-VRLLSEKYNLFYA 437
Query: 443 ---NGTHE 447
G HE
Sbjct: 438 VWCTGDHE 445
Score = 43 (20.2 bits), Expect = 2.3e-10, Sum P(3) = 2.3e-10
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 677 LFDIKNDPCEKNNL---ADRSEDQRIN 700
LFD+ DP + +N+ A RS R++
Sbjct: 446 LFDLNTDPYQMHNIYNTASRSFKNRLD 472
Score = 42 (19.8 bits), Expect = 2.9e-10, Sum P(3) = 2.9e-10
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 783 LFDIKNDPCEKNNL---ADRSEVQRIN 806
LFD+ DP + +N+ A RS R++
Sbjct: 446 LFDLNTDPYQMHNIYNTASRSFKNRLD 472
>UNIPROTKB|F1NFI0 [details] [associations]
symbol:IDS "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00640000091539 EMBL:AADN02013672 EMBL:AADN02013673
EMBL:AADN02013674 EMBL:AADN02013675 IPI:IPI00579251
Ensembl:ENSGALT00000014910 OMA:SELDYAY Uniprot:F1NFI0
Length = 525
Score = 162 (62.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
Identities = 61/216 (28%), Positives = 98/216 (45%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
+++FI+ DDL +G +G + + +PNID LA I+ N Y Q +C PSR + +TG+
Sbjct: 3 NVLFIVVDDLR-PVLGCYGDNLVKSPNIDQLASQSIVFSNAYAQQAVCAPSRVSFLTGRR 61
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLGF---YKKEYTPTF 175
P T + Y G + +PQY KE GY T VGK +H G Y +Y ++
Sbjct: 62 PDTTRLYDFYSYWRVHSG---NYSTMPQYFKENGYVTMSVGKVFHPGISSNYSDDYPYSW 118
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAE-AVDIIH 234
H D + L D+ ++ G D+ T E A+ +++
Sbjct: 119 SIPPFHPSTEKYENDKTCRGKDGRLYANLVCPIDVT---EMPGGTLPDIETTEEAIRLLN 175
Query: 235 NHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
T + F +LA H P+ PL+ P +L ++
Sbjct: 176 VMKTKKQKF-FLA-VGYHK--PHIPLRYPQEFLKLY 207
Score = 67 (28.6 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
Identities = 20/60 (33%), Positives = 35/60 (58%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY PL PD + + R +S +AA+ + LD VG ++ AL+ + +++I+VF +D
Sbjct: 249 PYGPL--PDDFQRLIR------QSYYAAVSY-LDMQVGLLLNALDYVGLSNSTIVVFTAD 299
>TIGR_CMR|CPS_2367 [details] [associations]
symbol:CPS_2367 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
RefSeq:YP_269085.1 ProteinModelPortal:Q482D3 STRING:Q482D3
GeneID:3522074 KEGG:cps:CPS_2367 PATRIC:21467819
HOGENOM:HOG000220675 OMA:TAGVCAP ProtClustDB:CLSK2525596
BioCyc:CPSY167879:GI48-2430-MONOMER Uniprot:Q482D3
Length = 558
Score = 142 (55.0 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
Identities = 45/156 (28%), Positives = 69/156 (44%)
Query: 42 LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNY 101
L L++ V A P+I+ I+A+D+ VG G TP +D LA S + N
Sbjct: 11 LCLALALSSVTSFAKEQRPNILLIVAEDMSAK-VGAFGDTVAKTPVLDELAKSSVRYPNT 69
Query: 102 YTVQ-LCTPSRSAIMTGKHPIHTGMQH---NVLYGCERGGLPLSE-KILPQYLKELGYRT 156
+T +C PSR++++TG H I G QH +P + K P+ L++ GY T
Sbjct: 70 FTTAGVCAPSRTSLITGVHQITVGGQHMRTRSFKASNYRAVPAPDVKAFPELLRKSGYYT 129
Query: 157 RIVGKWHLGFYKKE-YTPTFR--GFESHLGYWTGHQ 189
+ K F +T F +E W G +
Sbjct: 130 YVSSKLDYQFSNTSPHTGPFTIWNYEGKKPTWRGRE 165
Score = 67 (28.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
Identities = 37/163 (22%), Positives = 68/163 (41%)
Query: 285 LHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGG 344
+H +D VGK++ L++ + N+I+++ +D + P RG K +++ G
Sbjct: 230 IHAMDTQVGKLLAELKKDGLSDNTIVIWTTDHG-----------DSLP-RG-KREVYDSG 276
Query: 345 VRGAGLI-WS----PLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPR 399
++ +I W P G + Q + D P++L+ AN + P Y+ IP
Sbjct: 277 LKVPMIIHWPDKYRPSKTVNGSIDSQLLSFVDIAPSILAMAN-INTPAYIQGKAR--IPN 333
Query: 400 YENSILRYENGTHEYNSP-RIENSNTRYENGTHEYNPKYENRY 441
N+ + + Y S R++ R E KY Y
Sbjct: 334 -NNATNKIAKREYIYASKDRLDEFPFR-ERAVRNNKFKYIKNY 374
Score = 64 (27.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
Identities = 13/22 (59%), Positives = 17/22 (77%)
Query: 783 LFDIKNDPCEKNNLADRSEVQR 804
L+DI NDP E NNLA++ E Q+
Sbjct: 421 LYDIINDPEEVNNLAEKVEYQQ 442
Score = 64 (27.6 bits), Expect = 2.8e-10, Sum P(3) = 2.8e-10
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 677 LFDIKNDPCEKNNLADRSE-DQRIN 700
L+DI NDP E NNLA++ E Q++N
Sbjct: 421 LYDIINDPEEVNNLAEKVEYQQQLN 445
Score = 39 (18.8 bits), Expect = 9.0e-08, Sum P(3) = 9.0e-08
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 539
NR + Y D +V ++ + E ++ I+ N EWQ
Sbjct: 415 NRPGEELYDIINDPEEVNNLAEKVEYQQQLNIMRNALKEWQ 455
Score = 39 (18.8 bits), Expect = 9.0e-08, Sum P(3) = 9.0e-08
Identities = 11/41 (26%), Positives = 19/41 (46%)
Query: 564 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
NR + Y D +V ++ + E ++ I+ N EWQ
Sbjct: 415 NRPGEELYDIINDPEEVNNLAEKVEYQQQLNIMRNALKEWQ 455
>UNIPROTKB|F6PP52 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9796 "Equus
caballus" [GO:0001502 "cartilage condensation" evidence=ISS]
[GO:0001822 "kidney development" evidence=ISS] [GO:0001937
"negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
GO:GO:0005615 GO:GO:0009986 GO:GO:0048661 GO:GO:0010575
GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
OMA:SVRVTHK Ensembl:ENSECAT00000019009 Uniprot:F6PP52
Length = 1129
Score = 161 (61.7 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 69 (29.3 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
Identities = 28/131 (21%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + R L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETRELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 52 (23.4 bits), Expect = 4.1e-10, Sum P(3) = 4.1e-10
Identities = 13/52 (25%), Positives = 25/52 (48%)
Query: 448 YNPKYENRYENGTHEYNGPKNEN-TNPRYENGTHEYNIPRLENSINGNGTSE 498
++P+ + + N E + +E+ N + E + RLE +GNG +E
Sbjct: 986 FSPESKLEWNNNIPEVSRLNSEHWRNHKTEKWMEHEELNRLETDFSGNGMTE 1037
Score = 52 (23.4 bits), Expect = 8.9e-08, Sum P(2) = 8.9e-08
Identities = 19/87 (21%), Positives = 35/87 (40%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
+G + P++ Y N + P Y N ++ +Y GP + + N H +
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283
Query: 485 PRL---ENSING--NGTSENRSNDNSY 506
L ++S+ N E R +N+Y
Sbjct: 284 QTLMSVDDSVERLYNMLVETRELENTY 310
>UNIPROTKB|G3WVX3 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9305
"Sarcophilus harrisii" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 EMBL:AEFK01056197
EMBL:AEFK01056198 EMBL:AEFK01056199 EMBL:AEFK01056200
Ensembl:ENSSHAT00000019735 Uniprot:G3WVX3
Length = 870
Score = 166 (63.5 bits), Expect = 6.2e-10, Sum P(2) = 6.2e-10
Identities = 63/223 (28%), Positives = 97/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q D Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSDLYPNASQHI 245
Score = 65 (27.9 bits), Expect = 6.2e-10, Sum P(2) = 6.2e-10
Identities = 27/131 (20%), Positives = 53/131 (40%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV++ +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVSQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 48 (22.0 bits), Expect = 3.5e-08, Sum P(2) = 3.5e-08
Identities = 10/42 (23%), Positives = 19/42 (45%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ + Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSDLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>ASPGD|ASPL0000029545 [details] [associations]
symbol:AN5449 species:162425 "Emericella nidulans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] [GO:0008152 "metabolic
process" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:BN001305 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0008484 EMBL:AACD01000094 RefSeq:XP_663053.1
ProteinModelPortal:Q5B1Y1 EnsemblFungi:CADANIAT00003640
GeneID:2871741 KEGG:ani:AN5449.2 HOGENOM:HOG000217625 KO:K01133
OMA:YIMADQM OrthoDB:EOG45F0XM InterPro:IPR017785 InterPro:IPR025863
Pfam:PF12411 TIGRFAMs:TIGR03417 Uniprot:Q5B1Y1
Length = 594
Score = 161 (61.7 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
Identities = 65/254 (25%), Positives = 111/254 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQ-IPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTG 117
P+I++I+AD + + FH D I TPN++ LA G++ + Y LC PSR ++TG
Sbjct: 6 PNILYIMADQMAAPLLAFHDKDSPIKTPNLNKLAEEGVVFDSAYCNSPLCAPSRFVMVTG 65
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKKEYTPTFRG 177
+ P G N LP YL+ GY T + GK H F + G
Sbjct: 66 QLPSKIGAYDNA------ADLPADIPTYAHYLRREGYHTALAGKMH--FCGPDQ---LHG 114
Query: 178 FESHLGYWTGHQDY-FDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV--FTAEAV---- 230
+E L DY + + +E + LD ++ D T+ F E +
Sbjct: 115 YEQRLTSDIYPGDYGWSVNWDEPDV-RLDYYHNMSSVMDAGPVVRTNQLDFDEEVIYKSK 173
Query: 231 DIIHNH---STDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILH- 286
+++H TD+P L ++ TH P++P + +++ +E K AAI H
Sbjct: 174 QYLYDHVRQRTDQPFCLTVS--MTH---PHDPYAMTKEFWDLYEDVE-IPLPKHAAIPHD 227
Query: 287 KLDESVGKVVEALE 300
+ D ++++ ++
Sbjct: 228 QQDPHSQRILKCID 241
Score = 61 (26.5 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 675 PCLFDIKNDPCEKNNL 690
P LFD++NDP EK NL
Sbjct: 409 PMLFDVQNDPLEKVNL 424
Score = 61 (26.5 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 781 PCLFDIKNDPCEKNNL 796
P LFD++NDP EK NL
Sbjct: 409 PMLFDVQNDPLEKVNL 424
Score = 47 (21.6 bits), Expect = 6.7e-10, Sum P(3) = 6.7e-10
Identities = 13/45 (28%), Positives = 22/45 (48%)
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT---RGKWKLV 551
+DG+ + L+ + K +T+L E S + RG+WK V
Sbjct: 358 LDGVSLVPYLTGEDGVKTDTVLGEYMGEGTQSPVVMIRRGRWKFV 402
>UNIPROTKB|F1RZ89 [details] [associations]
symbol:LOC100737146 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:CU914710
Ensembl:ENSSSCT00000018673 ArrayExpress:F1RZ89 Uniprot:F1RZ89
Length = 496
Score = 150 (57.9 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
Identities = 44/131 (33%), Positives = 70/131 (53%)
Query: 41 PLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
P+ + L + A G +++ ILADD G+ G + I TP++DALA I+ +
Sbjct: 9 PVGWVLLLALGLCCAQGGRRRNVLLILADDGGFES-GAYNNSAITTPHLDALARRSIVFR 67
Query: 100 NYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGY 154
N +T V C+PSR++++TG P H N +YG + + +++ LP L G
Sbjct: 68 NAFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGV 122
Query: 155 RTRIVGKWHLG 165
RT I+GK H+G
Sbjct: 123 RTGIIGKKHVG 133
Score = 75 (31.5 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
Identities = 26/92 (28%), Positives = 44/92 (47%)
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
++D+ +G V++ L +L++++++F SD +P G N W G
Sbjct: 251 RMDQGIGLVLQELRGAGVLNDTLVIFTSDNGVP-----------FP-SGRTNLYWPGAAE 298
Query: 347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
L+ SP R G V+E YV + D PT+L
Sbjct: 299 PL-LVSSPEHPQRWGQVSEAYVSLLDLTPTVL 329
Score = 41 (19.5 bits), Expect = 7.1e-10, Sum P(3) = 7.1e-10
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 239 DEPLFLYLAHAATHSANPYEP 259
D P FLY+A H +P
Sbjct: 174 DRPFFLYVAFHDPHRCGHSQP 194
>MGI|MGI:96417 [details] [associations]
symbol:Ids "iduronate 2-sulfatase" species:10090 "Mus
musculus" [GO:0003824 "catalytic activity" evidence=IEA]
[GO:0004423 "iduronate-2-sulfatase activity" evidence=IEA]
[GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
evidence=IEA] [GO:0008152 "metabolic process" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IDA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0046872 "metal
ion binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 MGI:MGI:96417 GO:GO:0046872
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
CTD:3423 HOVERGEN:HBG006120 KO:K01136 OrthoDB:EOG49078W ChiTaRS:IDS
GO:GO:0004423 EMBL:AK166178 EMBL:BX294168 EMBL:L07921 EMBL:BN000750
IPI:IPI00125815 PIR:A47153 RefSeq:NP_034628.2 UniGene:Mm.233083
ProteinModelPortal:Q08890 SMR:Q08890 STRING:Q08890
PhosphoSite:Q08890 PRIDE:Q08890 DNASU:15931
Ensembl:ENSMUST00000101509 GeneID:15931 KEGG:mmu:15931
GeneTree:ENSGT00640000091539 InParanoid:Q32KI7 NextBio:288652
Bgee:Q08890 CleanEx:MM_IDS Genevestigator:Q08890
GermOnline:ENSMUSG00000035847 Uniprot:Q08890
Length = 552
Score = 144 (55.7 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
Identities = 38/108 (35%), Positives = 57/108 (52%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
+I+ I+ DDL +G +G + +PNID LA ++ +N + Q +C PSR + +TG+
Sbjct: 40 NILLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSVLFQNAFAQQAVCAPSRVSFLTGRR 98
Query: 120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
P T + N + G +PQY KE GY T VGK +H G
Sbjct: 99 PDTTRLYDFNSYWRVHSGNF----STIPQYFKENGYVTMSVGKVFHPG 142
Score = 82 (33.9 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
Identities = 21/46 (45%), Positives = 29/46 (63%)
Query: 274 EDFKR----SKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
EDF+R S FA++ + LD VG V+ AL+ R+ N+II F SD
Sbjct: 292 EDFQRKIRQSYFASVSY-LDTQVGHVLSALDDLRLAHNTIIAFTSD 336
>UNIPROTKB|F6PNP7 [details] [associations]
symbol:IDS "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 HOGENOM:HOG000014304 HOVERGEN:HBG006120 OMA:CREGKNL
OrthoDB:EOG49078W GeneTree:ENSGT00640000091539 EMBL:AAEX03027034
Ensembl:ENSCAFT00000030323 Uniprot:F6PNP7
Length = 468
Score = 150 (57.9 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
Identities = 39/113 (34%), Positives = 60/113 (53%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
++ P +++ I+ DDL +G +G + +PNID LA ++ +N + Q +C PSR +
Sbjct: 31 TTAPLNVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSF 89
Query: 115 MTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
+TG+ P T + N + G LPQY KE GY T VGK +H G
Sbjct: 90 LTGRRPDTTRLYDFNSYWRVHAGNF----STLPQYFKENGYVTMSVGKVFHPG 138
Score = 73 (30.8 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
Identities = 22/60 (36%), Positives = 37/60 (61%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY P+ P ++ R I ++S FA+I + LD VG ++ AL+ ++ +++IIVF SD
Sbjct: 282 PYGPI--P---VDFQRKI---RQSYFASISY-LDTQVGHLLSALDDLQLANSTIIVFASD 332
>UNIPROTKB|Q48QH2 [details] [associations]
symbol:betC "Choline sulfatase" species:264730 "Pseudomonas
syringae pv. phaseolicola 1448A" [GO:0006790 "sulfur compound
metabolic process" evidence=ISS] [GO:0030104 "water homeostasis"
evidence=ISS] [GO:0047753 "choline-sulfatase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000058
GenomeReviews:CP000058_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0030104 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00149
GO:GO:0006790 HOGENOM:HOG000217625 KO:K01133 InterPro:IPR017785
InterPro:IPR025863 Pfam:PF12411 TIGRFAMs:TIGR03417
RefSeq:YP_272344.1 ProteinModelPortal:Q48QH2 STRING:Q48QH2
GeneID:3556452 KEGG:psp:PSPPH_0030 PATRIC:19969019 OMA:MIRRGAY
ProtClustDB:CLSK864791 GO:GO:0047753 Uniprot:Q48QH2
Length = 501
Score = 133 (51.9 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
Identities = 32/104 (30%), Positives = 49/104 (47%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTGKH 119
+I+FI+AD + + F+ I PN+ LA G++ + Y LC PSR +++G+
Sbjct: 5 NILFIMADQMAAPMLPFYSRSPILMPNLSRLAADGVVFDSAYCNSPLCAPSRFTLVSGQL 64
Query: 120 PIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
P G N P YL+ LGY+T + GK H
Sbjct: 65 PSKIGAYDNA------ADFPADIPTYAHYLRALGYKTALAGKMH 102
Score = 76 (31.8 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
Identities = 28/109 (25%), Positives = 50/109 (45%)
Query: 273 IEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWP 332
I D +R+ F A + +D +VGK+++ L++ + ++I+VF D +W
Sbjct: 248 IRDARRAYFGACSY-IDLNVGKLMQTLDEVGLAEDTIVVFSGDHGDMLGEKGLWYKMHW- 305
Query: 333 LRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAAN 381
+E R +++SP G V+ V +D LPT + AN
Sbjct: 306 --------FEMAARVPLVVYSPGQFKPGRVSAS-VSTADLLPTFVEMAN 345
Score = 58 (25.5 bits), Expect = 8.1e-10, Sum P(3) = 8.1e-10
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 675 PCL-FDIKNDPCEKNNLADRSEDQRI-NHYTTE 705
PCL FD+K DP E+ +L+ +++ N + E
Sbjct: 402 PCLLFDVKKDPKEQKDLSQSPAHEKLFNDFLAE 434
Score = 56 (24.8 bits), Expect = 1.3e-09, Sum P(3) = 1.3e-09
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 781 PCL-FDIKNDPCEKNNLADRSEVQRI-NHYTTE 811
PCL FD+K DP E+ +L+ +++ N + E
Sbjct: 402 PCLLFDVKKDPKEQKDLSQSPAHEKLFNDFLAE 434
>RGD|1560491 [details] [associations]
symbol:Ids "iduronate 2-sulfatase" species:10116 "Rattus
norvegicus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005575 "cellular_component" evidence=ND] [GO:0008150
"biological_process" evidence=ND] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=ISO] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 RGD:1560491
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
HOGENOM:HOG000014304 HOVERGEN:HBG006120 OrthoDB:EOG49078W
GO:GO:0004423 EMBL:BN000743 IPI:IPI00764641
ProteinModelPortal:Q32KJ4 STRING:Q32KJ4 PhosphoSite:Q32KJ4
InParanoid:Q32KJ4 Genevestigator:Q32KJ4 Uniprot:Q32KJ4
Length = 543
Score = 149 (57.5 bits), Expect = 8.1e-10, Sum P(2) = 8.1e-10
Identities = 45/137 (32%), Positives = 71/137 (51%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R ++F++L F +++V S+ +I+ I+ DDL +G +G + +PNID L
Sbjct: 2 RQLSFSLLLGFFCIALVSAAQGNSATDALNILLIIVDDLR-PSLGCYGDKLVRSPNIDQL 60
Query: 92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYL 149
A I+ +N + Q +C PSR + +TG+ P T + N + G +PQY
Sbjct: 61 ASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHSGNF----STIPQYF 116
Query: 150 KELGYRTRIVGK-WHLG 165
KE GY T VGK +H G
Sbjct: 117 KENGYVTMSVGKVFHPG 133
Score = 76 (31.8 bits), Expect = 8.1e-10, Sum P(2) = 8.1e-10
Identities = 22/60 (36%), Positives = 36/60 (60%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY P+ P ++ R I ++S FA++ + LD VG ++ AL+ R+ N+II F+SD
Sbjct: 277 PYGPI--P---VDFQRKI---RQSYFASVSY-LDTQVGHLLSALDDLRLAHNTIIAFMSD 327
>ZFIN|ZDB-GENE-030131-4958 [details] [associations]
symbol:sgsh "N-sulfoglucosamine sulfohydrolase
(sulfamidase)" species:7955 "Danio rerio" [GO:0003824 "catalytic
activity" evidence=IEA] [GO:0008152 "metabolic process"
evidence=IEA] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-4958
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448
HOGENOM:HOG000234731 HOVERGEN:HBG012598 KO:K01565 OMA:RDPHETQ
OrthoDB:EOG4RXZ01 GeneTree:ENSGT00390000013080 EMBL:CU459096
IPI:IPI00616379 RefSeq:NP_001116740.1 UniGene:Dr.80125
Ensembl:ENSDART00000063147 GeneID:563849 KEGG:dre:563849
NextBio:20885106 Uniprot:B0V3V9
Length = 511
Score = 138 (53.6 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
Identities = 45/136 (33%), Positives = 67/136 (49%)
Query: 35 MAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYS 94
MAF L + F D V +++ I+ADD G+ + + + TP++ AL+
Sbjct: 1 MAFVFAWTLLCLLLCF-D-VGGCRSRNVLLIIADDGGF-ETDVYNNTVVQTPHLRALSKR 57
Query: 95 GIILKNYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSE----KILPQYL 149
+I KN +T V C+PSRS I+TG P H N +YG +G + + LP L
Sbjct: 58 SLIFKNAFTSVSSCSPSRSTILTGL-PQH----QNGMYGLHQGVHHFNSFDGVQSLPLLL 112
Query: 150 KELGYRTRIVGKWHLG 165
K T I+GK H+G
Sbjct: 113 KRANIHTGIIGKKHVG 128
Score = 72 (30.4 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
Identities = 32/128 (25%), Positives = 61/128 (47%)
Query: 257 YEP-LQAPDHYLNIHRHIEDFK--RSKFAA---ILHKLDESVGKVVEALEQRRMLSNSII 310
+EP +PD + + I D R+ AA + +LD+ +G V+E L + +++++
Sbjct: 219 WEPKYYSPDQ-VKVPYFIPDTPAARADIAAQYTTVSRLDQGIGLVLEELRKAGFENDTLV 277
Query: 311 VFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESR-GIVAEQYVHV 369
++ SD + P + L+ GV+ L+ SP + R G +++ YV +
Sbjct: 278 IYSSD-------------NGIPFPNGRTNLYGSGVKEPMLLSSPEHQQRWGKLSQAYVSL 324
Query: 370 SDWLPTLL 377
D PT+L
Sbjct: 325 LDITPTIL 332
Score = 58 (25.5 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 783 LFDIKNDPCEKNNLA---DRSEV 802
LFD++ DP EK NLA D SEV
Sbjct: 447 LFDVRTDPMEKVNLAGDLDYSEV 469
Score = 54 (24.1 bits), Expect = 2.2e-09, Sum P(4) = 2.2e-09
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 677 LFDIKNDPCEKNNLA 691
LFD++ DP EK NLA
Sbjct: 447 LFDVRTDPMEKVNLA 461
Score = 37 (18.1 bits), Expect = 9.1e-10, Sum P(4) = 9.1e-10
Identities = 7/21 (33%), Positives = 10/21 (47%)
Query: 239 DEPLFLYLAHAATHSANPYEP 259
+ P FLY+A H +P
Sbjct: 177 ERPFFLYVAFHDPHRCGHSQP 197
>FB|FBgn0033836 [details] [associations]
symbol:CG18278 species:7227 "Drosophila melanogaster"
[GO:0006044 "N-acetylglucosamine metabolic process" evidence=ISS]
[GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
"glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GeneTree:ENSGT00400000022041
GO:GO:0030203 KO:K01137 GO:GO:0008449 OMA:MCGYQTF EMBL:BT021205
RefSeq:NP_725289.1 UniGene:Dm.28273 SMR:Q5BIL9 STRING:Q5BIL9
EnsemblMetazoa:FBtr0087716 GeneID:36487 KEGG:dme:Dmel_CG18278
UCSC:CG18278-RA FlyBase:FBgn0033836 InParanoid:Q5BIL9
OrthoDB:EOG43TXB4 GenomeRNAi:36487 NextBio:798808 Uniprot:Q5BIL9
Length = 492
Score = 174 (66.3 bits), Expect = 1.2e-09, P = 1.2e-09
Identities = 70/253 (27%), Positives = 117/253 (46%)
Query: 39 VLPLAFTLSMVFVDL--VASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSG 95
++ LA + +V L AS P+I+ IL+DD DV G+ P + I+ L + G
Sbjct: 1 MISLAPLIILVLACLGNTASEKLPNILLILSDD---QDVELRGM--FPMEHTIEMLGFGG 55
Query: 96 IILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-VLYGC--ERGGLPLSEKILPQYLKE 151
+ N YT +C P+R++++TG + + G ++N V GC L + LP L++
Sbjct: 56 ALFHNAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPRALPYILQQ 115
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLE 211
GY T GK+ ++ P +G+ G G+ Y++++ +R E
Sbjct: 116 HGYNTFFGGKYLNQYWGAGDVP--KGWNHFYGLH-GNSRYYNYT----------LR---E 159
Query: 212 PAWDLH--GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLN 268
+ ++H Y TD+ A D + N + + EP F +A A H P+ P AP H
Sbjct: 160 NSGNVHYESTYLTDLLRDRAADFLRNATQSSEPFFAMVAPPAAHE--PFTP--APRHE-G 214
Query: 269 IHRHIEDFKRSKF 281
+ HIE + F
Sbjct: 215 VFSHIEALRTPSF 227
>UNIPROTKB|F1N2D5 [details] [associations]
symbol:IDS "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00640000091539 EMBL:DAAA02068060 EMBL:DAAA02068061
IPI:IPI00709383 Ensembl:ENSBTAT00000014683 OMA:CREGRNL
Uniprot:F1N2D5
Length = 546
Score = 149 (57.5 bits), Expect = 1.3e-09, Sum P(2) = 1.3e-09
Identities = 39/113 (34%), Positives = 60/113 (53%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAI 114
++ P +++ I+ DDL +G +G I +PNID LA ++ +N + Q +C PSR +
Sbjct: 30 ATDPLNVLLIIVDDLR-PSLGCYGNKLIRSPNIDQLASRSLLFQNAFAQQAVCAPSRVSF 88
Query: 115 MTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
+TG+ P T + N + G +PQY KE GY T VGK +H G
Sbjct: 89 LTGRRPDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 137
Score = 74 (31.1 bits), Expect = 1.3e-09, Sum P(2) = 1.3e-09
Identities = 20/60 (33%), Positives = 35/60 (58%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY P+ A + R I ++S FA + + LD VG+++ AL+ ++ S++I+ F SD
Sbjct: 281 PYGPIPA-----DFQRKI---RQSYFACVSY-LDTQVGRLLSALDDLQLASSTIVAFTSD 331
>UNIPROTKB|E1BFX4 [details] [associations]
symbol:SGSH "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 CTD:6448 KO:K01565
OMA:RDPHETQ GeneTree:ENSGT00390000013080 EMBL:DAAA02049454
RefSeq:NP_001095659.2 UniGene:Bt.12396 GeneID:535442
KEGG:bta:535442 NextBio:20876750 IPI:IPI00907105
ProteinModelPortal:E1BFX4 Ensembl:ENSBTAT00000020308
ArrayExpress:E1BFX4 Uniprot:E1BFX4
Length = 505
Score = 147 (56.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 39/112 (34%), Positives = 63/112 (56%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
P +++ ILADD G+ G + I TP++DALA ++ +N +T V C+PSR++++TG
Sbjct: 25 PRNVLLILADDGGFES-GAYNNSAISTPHLDALARRSLVFRNAFTSVSSCSPSRASLLTG 83
Query: 118 KHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYRTRIVGKWHLG 165
P H N +YG + + +++ LP L G T I+GK H+G
Sbjct: 84 L-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGIHTGIIGKKHVG 130
Score = 74 (31.1 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 26/92 (28%), Positives = 44/92 (47%)
Query: 287 KLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVR 346
++D+ +G V++ L +L++++++F SD +P G N W G
Sbjct: 248 RMDQGIGLVLQELRGAGVLNDTLVIFTSDNGIP-----------FP-SGRTNLYWPGTAE 295
Query: 347 GAGLIWSPLLESR-GIVAEQYVHVSDWLPTLL 377
L+ SP R G V+E YV + D PT+L
Sbjct: 296 PM-LVSSPEHPKRWGQVSEAYVSLLDLTPTIL 326
Score = 41 (19.5 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 239 DEPLFLYLAHAATHSANPYEP 259
D P FLY+A H +P
Sbjct: 171 DRPFFLYVAFHDPHRCGHSQP 191
Score = 39 (18.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 8/15 (53%), Positives = 10/15 (66%)
Query: 677 LFDIKNDPCEKNNLA 691
L+D DP E +NLA
Sbjct: 441 LYDRNQDPHETHNLA 455
Score = 39 (18.8 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 8/15 (53%), Positives = 10/15 (66%)
Query: 783 LFDIKNDPCEKNNLA 797
L+D DP E +NLA
Sbjct: 441 LYDRNQDPHETHNLA 455
>TIGR_CMR|SPO_A0121 [details] [associations]
symbol:SPO_A0121 "sulfatase family protein"
species:246200 "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic
process" evidence=ISS] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0008484 EMBL:CP000032 GenomeReviews:CP000032_GR
HOGENOM:HOG000230030 RefSeq:YP_164953.1 ProteinModelPortal:Q5LLA5
GeneID:3196629 KEGG:sil:SPOA0121 PATRIC:23381566 OMA:FDYLSCY
ProtClustDB:CLSK867183 Uniprot:Q5LLA5
Length = 552
Score = 139 (54.0 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 33/107 (30%), Positives = 61/107 (57%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTGKH 119
+I++I+ D L ++ + +G +++ TPNID LA G+ N Y +C PSR + TG++
Sbjct: 6 NILWIMCDQLRFDYLSCYGHERLNTPNIDKLAKRGVRFTNAYVQATVCGPSRMSAYTGRY 65
Query: 120 PIHTGMQHNVLYGCERGGLPL--SEKILPQYLKELGYRTRIVGKWHL 164
+ + +G + G+PL E L +L+++G R ++GK H+
Sbjct: 66 -VRS-------HGSTQNGIPLRVGEPTLGDHLRDVGMRNVLIGKTHM 104
Score = 71 (30.1 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 22/103 (21%), Positives = 49/103 (47%)
Query: 281 FAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTL 340
+ ++ ++D+ +G++ +++R + N++IVF +D +W G K
Sbjct: 294 YMGLIKQIDDQLGQLFAFMQERGLDENTMIVFTADHGDYLG-------DHW--MGEKYLF 344
Query: 341 WEGGVRGAGLIWSPLLES---RGIVAEQYVHVSDWLPTLLSAA 380
+E + +I+ P ++ RG V++ V + D PT + A
Sbjct: 345 YEAAAKVPLIIYDPSDKADATRGTVSDALVEMIDLAPTFVDYA 387
Score = 48 (22.0 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 18/61 (29%), Positives = 30/61 (49%)
Query: 204 LD-MRR-DLEPAWDLHGKYSTDVFTA-EAVDIIH--NHSTDEPLFL-YLAHAATHSANPY 257
LD M+R ++P ++ + F A + D +H + EP + YL HA + NP+
Sbjct: 108 LDGMKRLGIDPDSEIGARVGEGGFDAFDRDDGVHPTGYRKKEPAYNDYLRHAGFQAENPW 167
Query: 258 E 258
E
Sbjct: 168 E 168
Score = 46 (21.3 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 674 APCLFDIKNDPCEKNNLA-DRSEDQ 697
AP LFD++ DP E +L D S ++
Sbjct: 458 APILFDLEVDPDELKDLGRDPSAEE 482
Score = 46 (21.3 bits), Expect = 1.7e-09, Sum P(4) = 1.7e-09
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 780 APCLFDIKNDPCEKNNLADRSEVQRINHYTT 810
AP LFD++ DP E +L + + T
Sbjct: 458 APILFDLEVDPDELKDLGRDPSAEEVRQRLT 488
Score = 41 (19.5 bits), Expect = 1.4e-05, Sum P(3) = 1.4e-05
Identities = 14/41 (34%), Positives = 18/41 (43%)
Query: 224 VFTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPD 264
VFTA+ D + +H E Y A AA Y+P D
Sbjct: 324 VFTADHGDYLGDHWMGEKYLFYEA-AAKVPLIIYDPSDKAD 363
>RGD|708554 [details] [associations]
symbol:Sulf1 "sulfatase 1" species:10116 "Rattus norvegicus"
[GO:0001502 "cartilage condensation" evidence=ISS] [GO:0001822
"kidney development" evidence=ISO;ISS] [GO:0001937 "negative
regulation of endothelial cell proliferation" evidence=IEA;ISO;ISS]
[GO:0002063 "chondrocyte development" evidence=IEA;ISO;ISS]
[GO:0003094 "glomerular filtration" evidence=IEA;ISO] [GO:0004065
"arylsulfatase activity" evidence=IEA;ISO;ISS] [GO:0005509 "calcium
ion binding" evidence=IEA] [GO:0005615 "extracellular space"
evidence=ISO;ISS] [GO:0005783 "endoplasmic reticulum"
evidence=IEA;ISO;NAS;IDA] [GO:0005794 "Golgi apparatus"
evidence=IEA;IDA] [GO:0005795 "Golgi stack" evidence=NAS]
[GO:0005886 "plasma membrane" evidence=IEA;ISO] [GO:0007155 "cell
adhesion" evidence=ISS] [GO:0008152 "metabolic process"
evidence=NAS] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IEA;ISO;ISS] [GO:0009986 "cell surface"
evidence=IEA;ISO;ISS;NAS;IDA] [GO:0010575 "positive regulation
vascular endothelial growth factor production" evidence=IEA;ISO]
[GO:0014846 "esophagus smooth muscle contraction"
evidence=IEA;ISO;ISS] [GO:0016525 "negative regulation of
angiogenesis" evidence=IEA;ISO;ISS] [GO:0018741 "alkyl sulfatase
activity" evidence=NAS] [GO:0030177 "positive regulation of Wnt
receptor signaling pathway" evidence=IEA;ISO;ISS] [GO:0030201
"heparan sulfate proteoglycan metabolic process"
evidence=IEA;ISO;ISS] [GO:0030336 "negative regulation of cell
migration" evidence=IEA;ISO;ISS] [GO:0030513 "positive regulation
of BMP signaling pathway" evidence=IEA;ISO;ISS] [GO:0032836
"glomerular basement membrane development" evidence=IEA;ISO]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=IEA;ISO;ISS] [GO:0036022 "limb joint
morphogenesis" evidence=ISS] [GO:0040036 "regulation of fibroblast
growth factor receptor signaling pathway" evidence=ISO] [GO:0040037
"negative regulation of fibroblast growth factor receptor signaling
pathway" evidence=IEA;ISO;ISS] [GO:0045121 "membrane raft"
evidence=IEA;ISO;ISS] [GO:0048010 "vascular endothelial growth
factor receptor signaling pathway" evidence=IEA;ISO;ISS]
[GO:0048661 "positive regulation of smooth muscle cell
proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal system
development" evidence=IEA;ISO;ISS] [GO:0051216 "cartilage
development" evidence=ISO;ISS] [GO:0060348 "bone development"
evidence=IEA;ISO;ISS] [GO:0060384 "innervation"
evidence=IEA;ISO;ISS] [GO:0060686 "negative regulation of prostatic
bud formation" evidence=IEA;ISO;ISS] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554 GO:GO:0005783
GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
GO:GO:0048706 GO:GO:0048010 GO:GO:0018741 GO:GO:0060686
GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607
HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
InterPro:IPR024609 Pfam:PF12548 CTD:23213 OrthoDB:EOG4VT5WH
EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1 UniGene:Rn.161961
ProteinModelPortal:Q8VI60 STRING:Q8VI60 GeneID:171396
KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244 ArrayExpress:Q8VI60
Genevestigator:Q8VI60 GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
Length = 870
Score = 162 (62.1 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
Identities = 63/223 (28%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P L E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
Identities = 28/124 (22%), Positives = 51/124 (41%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A D P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DTP 377
Query: 387 NYVN 390
+ V+
Sbjct: 378 SDVD 381
Score = 47 (21.6 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>UNIPROTKB|Q8VI60 [details] [associations]
symbol:Sulf1 "Extracellular sulfatase Sulf-1" species:10116
"Rattus norvegicus" [GO:0005509 "calcium ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 RGD:708554
GO:GO:0005783 GO:GO:0005886 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0004065 GO:GO:0048706 GO:GO:0048010 GO:GO:0018741
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
OrthoDB:EOG4VT5WH EMBL:AF230072 IPI:IPI00331986 RefSeq:NP_599205.1
UniGene:Rn.161961 ProteinModelPortal:Q8VI60 STRING:Q8VI60
GeneID:171396 KEGG:rno:171396 UCSC:RGD:708554 NextBio:622244
ArrayExpress:Q8VI60 Genevestigator:Q8VI60
GermOnline:ENSRNOG00000009037 Uniprot:Q8VI60
Length = 870
Score = 162 (62.1 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
Identities = 63/223 (28%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P L E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQALHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 2.1e-09, Sum P(2) = 2.1e-09
Identities = 28/124 (22%), Positives = 51/124 (41%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELGNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A D P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DTP 377
Query: 387 NYVN 390
+ V+
Sbjct: 378 SDVD 381
Score = 47 (21.6 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>FB|FBgn0260475 [details] [associations]
symbol:CG30059 species:7227 "Drosophila melanogaster"
[GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
evidence=ISS] [GO:0006044 "N-acetylglucosamine metabolic process"
evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0030203
"glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 EMBL:AE013599
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GeneTree:ENSGT00400000022041 GO:GO:0030203
KO:K01137 GO:GO:0008449 OrthoDB:EOG43TXB4 EMBL:AY061585
RefSeq:NP_610872.1 UniGene:Dm.21320 SMR:Q95R73 STRING:Q95R73
EnsemblMetazoa:FBtr0087715 GeneID:246425 KEGG:dme:Dmel_CG30059
UCSC:CG30059-RA FlyBase:FBgn0260475 InParanoid:Q95R73 OMA:GNSQYYN
GenomeRNAi:246425 NextBio:842420 Uniprot:Q95R73
Length = 492
Score = 171 (65.3 bits), Expect = 2.6e-09, P = 2.6e-09
Identities = 68/249 (27%), Positives = 114/249 (45%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPN-IDALAYSGIILK 99
PL L + + AS P+I+ IL+DD DV G+ P + I+ L + G +
Sbjct: 6 PL-IVLVLACLGNTASEKLPNILLILSDD---QDVELRGM--FPMEHTIEMLGFGGALFH 59
Query: 100 NYYTVQ-LCTPSRSAIMTGKHPIHTGMQHN-VLYGC--ERGGLPLSEKILPQYLKELGYR 155
N YT +C P+R++++TG + + G ++N V GC L + LP L++ GY
Sbjct: 60 NAYTPSPICCPARTSLLTGMYAHNHGTRNNSVSGGCYGPHWRRALEPRALPYILQQHGYN 119
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWD 215
T GK+ ++ P +G+ + G G+ Y++++ +R E +
Sbjct: 120 TFFGGKYLNQYWGAGDVP--KGWNNFYGLH-GNSRYYNYT----------LR---ENTGN 163
Query: 216 LH--GKYSTDVFTAEAVDIIHNHS-TDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
+H Y +D+ A D + N + + EP F +A A H P+ P AP H + H
Sbjct: 164 VHYESTYLSDLLRDRAADFLRNATQSSEPFFAMVAPPAAHE--PFTP--APRHE-GVFSH 218
Query: 273 IEDFKRSKF 281
IE + F
Sbjct: 219 IEALRTPSF 227
>UNIPROTKB|Q32KH2 [details] [associations]
symbol:sulf1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0030201 "heparan sulfate proteoglycan
metabolic process" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0048706 "embryonic skeletal system
development" evidence=ISS] [GO:0001822 "kidney development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0014846
"esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
"positive regulation of smooth muscle cell proliferation"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
"membrane raft" evidence=ISS] [GO:0009986 "cell surface"
evidence=ISS] [GO:0005783 "endoplasmic reticulum" evidence=ISS]
[GO:0005615 "extracellular space" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
factor receptor signaling pathway" evidence=ISS] [GO:0030513
"positive regulation of BMP signaling pathway" evidence=ISS]
[GO:0030336 "negative regulation of cell migration" evidence=ISS]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
evidence=ISS] [GO:0001937 "negative regulation of endothelial cell
proliferation" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0060348 "bone development" evidence=ISS]
[GO:0060686 "negative regulation of prostatic bud formation"
evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic factor
receptor signaling pathway" evidence=ISS] [GO:0002063 "chondrocyte
development" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
evidence=ISS] [GO:0001502 "cartilage condensation" evidence=ISS]
[GO:0032836 "glomerular basement membrane development"
evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
growth factor production" evidence=IEA] [GO:0005886 "plasma
membrane" evidence=IEA] [GO:0003094 "glomerular filtration"
evidence=IEA] [GO:0005509 "calcium ion binding" evidence=IEA]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213
OrthoDB:EOG4VT5WH EMBL:AAEX03015848 EMBL:BN000765
RefSeq:NP_001041580.1 UniGene:Cfa.36649 Ensembl:ENSCAFT00000046451
GeneID:486986 KEGG:cfa:486986 InParanoid:Q32KH2 NextBio:20860674
Uniprot:Q32KH2
Length = 869
Score = 161 (61.7 bits), Expect = 4.3e-09, Sum P(2) = 4.3e-09
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 62 (26.9 bits), Expect = 4.3e-09, Sum P(2) = 4.3e-09
Identities = 27/131 (20%), Positives = 51/131 (38%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 49 (22.3 bits), Expect = 9.4e-08, Sum P(2) = 9.4e-08
Identities = 19/87 (21%), Positives = 34/87 (39%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
+G + P++ Y N + P Y N ++ +Y GP + + N H +
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283
Query: 485 PRL---ENSING--NGTSENRSNDNSY 506
L ++S+ N E DN+Y
Sbjct: 284 QTLMSVDDSVERLYNMLVETGELDNTY 310
>UNIPROTKB|G1PHQ1 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:59463
"Myotis lucifugus" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK
EMBL:AAPE02021694 Ensembl:ENSMLUT00000011203 Uniprot:G1PHQ1
Length = 871
Score = 161 (61.7 bits), Expect = 4.4e-09, Sum P(2) = 4.4e-09
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 62 (26.9 bits), Expect = 4.4e-09, Sum P(2) = 4.4e-09
Identities = 27/131 (20%), Positives = 51/131 (38%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDPPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>UNIPROTKB|F1Q233 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0009986 "cell surface" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0005783
"endoplasmic reticulum" evidence=IEA] [GO:0005509 "calcium ion
binding" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005794 GO:GO:0009986
GO:GO:0005509 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0008484 GeneTree:ENSGT00400000022041
InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK EMBL:AAEX03015848
Ensembl:ENSCAFT00000012295 Uniprot:F1Q233
Length = 891
Score = 161 (61.7 bits), Expect = 4.6e-09, Sum P(2) = 4.6e-09
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 62 (26.9 bits), Expect = 4.6e-09, Sum P(2) = 4.6e-09
Identities = 27/131 (20%), Positives = 51/131 (38%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELDNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 49 (22.3 bits), Expect = 1.0e-07, Sum P(2) = 1.0e-07
Identities = 19/87 (21%), Positives = 34/87 (39%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
+G + P++ Y N + P Y N ++ +Y GP + + N H +
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGPMLP-IHMEFTNVLHRKRL 283
Query: 485 PRL---ENSING--NGTSENRSNDNSY 506
L ++S+ N E DN+Y
Sbjct: 284 QTLMSVDDSVERLYNMLVETGELDNTY 310
>UNIPROTKB|G3T2L0 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9785
"Loxodonta africana" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
InterPro:IPR024609 Pfam:PF12548 OMA:QRKGDEC
Ensembl:ENSLAFT00000008824 Uniprot:G3T2L0
Length = 857
Score = 159 (61.0 bits), Expect = 5.4e-09, Sum P(2) = 5.4e-09
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 44 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 154
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 155 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 204
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 205 KMSKRLYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 246
Score = 63 (27.2 bits), Expect = 5.4e-09, Sum P(2) = 5.4e-09
Identities = 27/131 (20%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 269 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 328
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 329 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 379
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 380 DVDGKSVLKLL 390
Score = 47 (21.6 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 267
>UNIPROTKB|F6VXY6 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9483
"Callithrix jacchus" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
GO:GO:0008449 GO:GO:0030201 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
EMBL:ACFV01096449 EMBL:ACFV01096450 EMBL:ACFV01096451
EMBL:ACFV01096452 EMBL:ACFV01096453 EMBL:ACFV01096454
EMBL:ACFV01096455 EMBL:ACFV01096456 EMBL:ACFV01096457
EMBL:ACFV01096458 EMBL:ACFV01096459 EMBL:ACFV01096460
EMBL:ACFV01096461 RefSeq:XP_002759021.1 Ensembl:ENSCJAT00000009824
Ensembl:ENSCJAT00000053576 GeneID:100390937 Uniprot:F6VXY6
Length = 869
Score = 159 (61.0 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
Identities = 63/223 (28%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNSTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+V+
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESVNYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
Identities = 27/131 (20%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
Score = 43 (20.2 bits), Expect = 6.6e-09, Sum P(3) = 6.6e-09
Identities = 26/101 (25%), Positives = 42/101 (41%)
Query: 408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH--EYNPK-YENRYENGTHEYN 464
E+G S R GT +Y P++ + + + E+ + Y+ E +
Sbjct: 508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQTRSLSVEFEGEIYDINLEEEELQVL 567
Query: 465 GPKNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
P+N R++ G HE PR L+ S GN G + SN
Sbjct: 568 HPRN--IAKRHDEG-HEE--PRGLQASSGGNRGGMLADSSN 603
>UNIPROTKB|F1RU06 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0030513 "positive regulation of BMP signaling pathway"
evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0060348 "bone
development" evidence=ISS] [GO:0001822 "kidney development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0048706
"embryonic skeletal system development" evidence=ISS] [GO:0014846
"esophagus smooth muscle contraction" evidence=ISS] [GO:0048661
"positive regulation of smooth muscle cell proliferation"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0045121
"membrane raft" evidence=ISS] [GO:0005783 "endoplasmic reticulum"
evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
[GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
evidence=ISS] [GO:0004065 "arylsulfatase activity" evidence=ISS]
[GO:0048010 "vascular endothelial growth factor receptor signaling
pathway" evidence=ISS] [GO:0040037 "negative regulation of
fibroblast growth factor receptor signaling pathway" evidence=ISS]
[GO:0030336 "negative regulation of cell migration" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030177 "positive regulation of Wnt receptor
signaling pathway" evidence=ISS] [GO:0001937 "negative regulation
of endothelial cell proliferation" evidence=ISS] [GO:0005794 "Golgi
apparatus" evidence=ISS] [GO:0060686 "negative regulation of
prostatic bud formation" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0035860 "glial cell-derived
neurotrophic factor receptor signaling pathway" evidence=ISS]
[GO:0036022 "limb joint morphogenesis" evidence=ISS] [GO:0001502
"cartilage condensation" evidence=ISS] [GO:0032836 "glomerular
basement membrane development" evidence=IEA] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
[GO:0003094 "glomerular filtration" evidence=IEA] [GO:0005509
"calcium ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005509 GO:GO:0010575 GO:GO:0045121 GO:GO:0030336
GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
GO:GO:0032836 GO:GO:0060384 GO:GO:0008449 GO:GO:0030201
GO:GO:0014846 GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609
Pfam:PF12548 OMA:SVRVTHK EMBL:CU179692 EMBL:CU302274
Ensembl:ENSSSCT00000006792 Uniprot:F1RU06
Length = 871
Score = 159 (61.0 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 44 PNIILVLTDD---QDVELGSL-QVMNKTRKIMELGGATFTNAFVTTPMCCPSRSSMLTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKIL-PQ----YLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + + P+ YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNVYTNNENCSSPSWQAVHEPRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 155 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 204
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 205 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 246
Score = 62 (26.9 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
Identities = 27/131 (20%), Positives = 51/131 (38%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 269 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 328
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 329 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 379
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 380 DVDGKSVLKLL 390
Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 226 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 267
>MGI|MGI:2138563 [details] [associations]
symbol:Sulf1 "sulfatase 1" species:10090 "Mus musculus"
[GO:0001822 "kidney development" evidence=IGI] [GO:0001937
"negative regulation of endothelial cell proliferation"
evidence=ISO] [GO:0002063 "chondrocyte development" evidence=IMP]
[GO:0003094 "glomerular filtration" evidence=IGI] [GO:0003824
"catalytic activity" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=ISO] [GO:0005509 "calcium ion binding"
evidence=IEA] [GO:0005615 "extracellular space" evidence=ISO]
[GO:0005783 "endoplasmic reticulum" evidence=ISO] [GO:0005794
"Golgi apparatus" evidence=ISO] [GO:0005886 "plasma membrane"
evidence=IDA] [GO:0006790 "sulfur compound metabolic process"
evidence=ISO] [GO:0006915 "apoptotic process" evidence=IEA]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
evidence=IGI] [GO:0016525 "negative regulation of angiogenesis"
evidence=ISO] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
process" evidence=ISO;IMP] [GO:0030336 "negative regulation of cell
migration" evidence=ISO] [GO:0030513 "positive regulation of BMP
signaling pathway" evidence=ISO] [GO:0032836 "glomerular basement
membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
neurotrophic factor receptor signaling pathway" evidence=IDA]
[GO:0040036 "regulation of fibroblast growth factor receptor
signaling pathway" evidence=ISO] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISO;IGI;IDA] [GO:0045121 "membrane raft" evidence=ISO]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0048010 "vascular
endothelial growth factor receptor signaling pathway" evidence=ISO]
[GO:0048706 "embryonic skeletal system development" evidence=IGI]
[GO:0051216 "cartilage development" evidence=IMP] [GO:0060348 "bone
development" evidence=IGI] [GO:0060384 "innervation" evidence=IGI]
[GO:0060686 "negative regulation of prostatic bud formation"
evidence=IDA] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 MGI:MGI:2138563 GO:GO:0005783 GO:GO:0005886
GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005795 GO:GO:0005509 GO:GO:0010575
GO:GO:0045121 GO:GO:0030336 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 HOGENOM:HOG000290161
KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860
GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK
OrthoDB:EOG4VT5WH ChiTaRS:SULF1 EMBL:AY101178 EMBL:AK129278
EMBL:AK028285 EMBL:AK045002 EMBL:BC034547 EMBL:BC049276
IPI:IPI00111481 RefSeq:NP_001185494.1 RefSeq:NP_001185495.1
RefSeq:NP_758498.1 UniGene:Mm.45563 ProteinModelPortal:Q8K007
SMR:Q8K007 STRING:Q8K007 PhosphoSite:Q8K007 PRIDE:Q8K007
Ensembl:ENSMUST00000088585 Ensembl:ENSMUST00000177608
Ensembl:ENSMUST00000180062 GeneID:240725 KEGG:mmu:240725
UCSC:uc007aia.2 NextBio:384701 Bgee:Q8K007 CleanEx:MM_SULF1
Genevestigator:Q8K007 GermOnline:ENSMUSG00000016918 Uniprot:Q8K007
Length = 870
Score = 158 (60.7 bits), Expect = 7.2e-09, Sum P(2) = 7.2e-09
Identities = 62/223 (27%), Positives = 95/223 (42%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEQGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 63 (27.2 bits), Expect = 7.2e-09, Sum P(2) = 7.2e-09
Identities = 28/124 (22%), Positives = 51/124 (41%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVESGELDNTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A D P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSIVPQIVLNI-DLAPTILDIAGL-DSP 377
Query: 387 NYVN 390
+ V+
Sbjct: 378 SDVD 381
Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>DICTYBASE|DDB_G0287225 [details] [associations]
symbol:DDB_G0287225 species:44689 "Dictyostelium
discoideum" [GO:0051082 "unfolded protein binding" evidence=IEA]
[GO:0006457 "protein folding" evidence=IEA] InterPro:IPR002939
InterPro:IPR008971 Pfam:PF01556 InterPro:IPR001623
InterPro:IPR018253 dictyBase:DDB_G0287225 Pfam:PF00226
GO:GO:0006457 eggNOG:COG0484 Gene3D:1.10.287.110 PRINTS:PR00625
SMART:SM00271 SUPFAM:SSF46565 SUPFAM:SSF49493 PROSITE:PS00636
PROSITE:PS50076 EMBL:AAFI02000099 RefSeq:XP_637319.1
ProteinModelPortal:Q54KN8 EnsemblProtists:DDB0187373 GeneID:8626017
KEGG:ddi:DDB_G0287225 OMA:CDYTSIN Uniprot:Q54KN8
Length = 701
Score = 169 (64.5 bits), Expect = 7.6e-09, P = 7.6e-09
Identities = 48/191 (25%), Positives = 81/191 (42%)
Query: 387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
N NS++ +++ NS N + + NSN N + YN N N +
Sbjct: 50 NSNNSSISSLVNNSNNSDNNNNNNNNNNKNKNNNNSNNNNSNNNNNYNNNNNNNNNNNNN 109
Query: 447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
N N+ N + N N N N N + YN NS N N +S N++N NS
Sbjct: 110 NNNNNNNNKNNNNKNNNN---NNNNNYNNNNNNNNYNYNYNNNSNNSNNSS-NKNNSNSN 165
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI--SALTRGKWKL--VKENSINGNGTS 562
N ++ ++ + + + + N D+ +I S + + K + NSIN N ++
Sbjct: 166 SNSLNDLNQFMEIKEAYETLMDPTRKNKYDKSEILNSVILKHKSDFLPISLNSINNNISN 225
Query: 563 ENRSNDNSYQN 573
N +N+N+ N
Sbjct: 226 NNNNNNNNNNN 236
Score = 133 (51.9 bits), Expect = 5.9e-05, P = 5.9e-05
Identities = 51/213 (23%), Positives = 89/213 (41%)
Query: 373 LPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHE 432
+ +L++ +N SD N N+ N NS N + YN+ N+N N +
Sbjct: 56 ISSLVNNSNNSDNNNNNNNN-NNKNKNNNNSNNNNSNNNNNYNNNNNNNNNNNNNNNNNN 114
Query: 433 YNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSIN 492
N K N +N + N Y N N + YN N N + N +++ N NS+N
Sbjct: 115 NNNKNNNN-KNNNNNNNNNYNNNNNNNNYNYNYNNNSNNS---NNSSNKNNSNSNSNSLN 170
Query: 493 G-NGTSENRSN-----DNSYQNEIDGIDVW-SVLSRNEPSKRNTILHNIDDEWQISALTR 545
N E + D + +N+ D ++ SV+ +++ L++I++ IS
Sbjct: 171 DLNQFMEIKEAYETLMDPTRKNKYDKSEILNSVILKHKSDFLPISLNSINNN--ISNNNN 228
Query: 546 GKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
N+ N N + N +N+N+ N + I
Sbjct: 229 NNNNNNNNNNNNNNNNNNNNNNNNNSNNSNNNI 261
>UNIPROTKB|G1LHX9 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9646
"Ailuropoda melanoleuca" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
InterPro:IPR024609 Pfam:PF12548 OMA:SVRVTHK EMBL:ACTA01145671
EMBL:ACTA01153670 Ensembl:ENSAMET00000006800 Uniprot:G1LHX9
Length = 868
Score = 159 (61.0 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09
Identities = 61/223 (27%), Positives = 95/223 (42%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 61 (26.5 bits), Expect = 9.0e-09, Sum P(2) = 9.0e-09
Identities = 26/131 (19%), Positives = 51/131 (38%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLHRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E +V + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>UNIPROTKB|F7FJY3 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9544 "Macaca
mulatta" [GO:0001502 "cartilage condensation" evidence=ISS]
[GO:0001822 "kidney development" evidence=ISS] [GO:0001937
"negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
GO:GO:0005615 GO:GO:0009986 GO:GO:0048661 GO:GO:0010575
GO:GO:0045121 GO:GO:0030336 GO:GO:0001822 GO:GO:0001937
GO:GO:0030513 GO:GO:0016525 GO:GO:0001502 GO:GO:0060348
Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
OMA:QRKGDEC Ensembl:ENSMMUT00000032744 Ensembl:ENSMMUT00000032745
Uniprot:F7FJY3
Length = 759
Score = 158 (60.7 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 27/131 (20%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 2.2e-07, Sum P(2) = 2.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
Score = 40 (19.1 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 25/101 (24%), Positives = 41/101 (40%)
Query: 408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
E+G S R GT +Y P++ + + T + ++E Y+ E
Sbjct: 508 ESGYRASGSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565
Query: 467 --KNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
+ N R++ G H+ PR L+ S GN G + SN
Sbjct: 566 VLQPRNIAKRHDEG-HKG--PRDLQASSGGNRGGMLADSSN 603
>UNIPROTKB|O60597 [details] [associations]
symbol:IDS "Iduronate-2-sulfatase" species:9606 "Homo
sapiens" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
HSSP:P08842 EMBL:AC233288 UniGene:Hs.460960 HGNC:HGNC:5389
ChiTaRS:IDS EMBL:AF050145 IPI:IPI00640469 SMR:O60597 STRING:O60597
Ensembl:ENST00000428056 UCSC:uc011mxj.2 HOGENOM:HOG000207088
HOVERGEN:HBG053054 Uniprot:O60597
Length = 179
Score = 142 (55.0 bits), Expect = 1.2e-08, P = 1.2e-08
Identities = 37/108 (34%), Positives = 57/108 (52%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
+++ I+ DDL +G +G + +PNID LA ++ +N + Q +C PSR + +TG+
Sbjct: 38 NVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRR 96
Query: 120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
P T + N + G +PQY KE GY T VGK +H G
Sbjct: 97 PDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 140
>UNIPROTKB|G1SJB8 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9986
"Oryctolagus cuniculus" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0004065 GeneTree:ENSGT00400000022041
GO:GO:0048706 GO:GO:0048010 GO:GO:0060686 GO:GO:0002063
GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
GO:GO:0030201 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
InterPro:IPR024609 Pfam:PF12548 EMBL:AAGW02046925 EMBL:AAGW02046926
Ensembl:ENSOCUT00000003251 Uniprot:G1SJB8
Length = 869
Score = 161 (61.7 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFTNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 62 (26.9 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
Identities = 26/131 (19%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E +V + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 5.1e-07, Sum P(3) = 5.1e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
Score = 42 (19.8 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
Identities = 21/98 (21%), Positives = 37/98 (37%)
Query: 408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
E+G S R GT +Y P++ + + T + ++E Y+ E
Sbjct: 508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565
Query: 467 --KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
+ N R++ G + +S NG G + SN
Sbjct: 566 VLQPRNIAKRHDEGHRGLRGRQAGSSGNGAGMLADSSN 603
Score = 39 (18.8 bits), Expect = 1.3e-08, Sum P(4) = 1.3e-08
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 749 YSNEEEGMRKLRDAASIQCGPVKEVPCE 776
Y N+E+G+RK S P KE E
Sbjct: 681 YYNKEKGVRKQEKLKS-HLHPFKEAAQE 707
>UNIPROTKB|G3R9R9 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9595
"Gorilla gorilla gorilla" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005509 GO:GO:0045121 GO:GO:0030336 GO:GO:0001822
GO:GO:0001937 GO:GO:0030513 GO:GO:0016525 GO:GO:0001502
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065 GO:GO:0048706
GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
OMA:SVRVTHK RefSeq:XP_004047178.1 RefSeq:XP_004047179.1
Ensembl:ENSGGOT00000012515 GeneID:101141420 Uniprot:G3R9R9
Length = 869
Score = 158 (60.7 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
Identities = 27/131 (20%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
Score = 41 (19.5 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
Identities = 25/101 (24%), Positives = 41/101 (40%)
Query: 408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YENGTHEYNGP 466
E+G S R GT +Y P++ + + T + ++E Y+ E
Sbjct: 508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEELQ 565
Query: 467 --KNENTNPRYENGTHEYNIPR-LENSINGN--GTSENRSN 502
+ N R++ G H+ PR L+ S GN G + SN
Sbjct: 566 VLQPRNIAKRHDEG-HKR--PRDLQASSGGNRGGMLADSSN 603
>UNIPROTKB|P22304 [details] [associations]
symbol:IDS "Iduronate 2-sulfatase" species:9606 "Homo
sapiens" [GO:0046872 "metal ion binding" evidence=IEA] [GO:0004423
"iduronate-2-sulfatase activity" evidence=TAS] [GO:0005975
"carbohydrate metabolic process" evidence=TAS] [GO:0006027
"glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
"glycosaminoglycan metabolic process" evidence=TAS] [GO:0030204
"chondroitin sulfate metabolic process" evidence=TAS] [GO:0030207
"chondroitin sulfate catabolic process" evidence=TAS] [GO:0043202
"lysosomal lumen" evidence=TAS] [GO:0044281 "small molecule
metabolic process" evidence=TAS] Reactome:REACT_111217
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 Reactome:REACT_116125 GO:GO:0046872 GO:GO:0005975
EMBL:CH471171 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0030207 EMBL:AF011889 EMBL:M58342 EMBL:L13329 EMBL:L13321
EMBL:L13322 EMBL:L13323 EMBL:L13324 EMBL:L13325 EMBL:L13326
EMBL:L13327 EMBL:L13328 EMBL:L04586 EMBL:L04578 EMBL:L04579
EMBL:L04580 EMBL:L04581 EMBL:L04583 EMBL:L04582 EMBL:L04584
EMBL:L04585 EMBL:L40586 EMBL:AC233288 EMBL:BC006170 IPI:IPI00006121
IPI:IPI00013771 IPI:IPI00026104 PIR:A47535 RefSeq:NP_000193.1
RefSeq:NP_006114.1 UniGene:Hs.460960 ProteinModelPortal:P22304
IntAct:P22304 STRING:P22304 PhosphoSite:P22304 DMDM:124174
PRIDE:P22304 Ensembl:ENST00000340855 Ensembl:ENST00000370441
Ensembl:ENST00000370443 Ensembl:ENST00000466323 GeneID:3423
KEGG:hsa:3423 UCSC:uc004fcw.4 UCSC:uc011mxh.2 CTD:3423
GeneCards:GC0XM148558 HGNC:HGNC:5389 MIM:300823 MIM:309900
neXtProt:NX_P22304 Orphanet:217085 Orphanet:217093 PharmGKB:PA29636
HOGENOM:HOG000014304 HOVERGEN:HBG006120 InParanoid:P22304 KO:K01136
OMA:CREGKNL OrthoDB:EOG49078W PhylomeDB:P22304
BioCyc:MetaCyc:HS00286-MONOMER ChiTaRS:IDS GenomeRNAi:3423
NextBio:13500 PMAP-CutDB:P22304 ArrayExpress:P22304 Bgee:P22304
CleanEx:HS_IDS Genevestigator:P22304 GermOnline:ENSG00000010404
GO:GO:0004423 Uniprot:P22304
Length = 550
Score = 142 (55.0 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
Identities = 37/108 (34%), Positives = 57/108 (52%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKH 119
+++ I+ DDL +G +G + +PNID LA ++ +N + Q +C PSR + +TG+
Sbjct: 38 NVLLIIVDDLR-PSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRR 96
Query: 120 PIHTGM-QHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
P T + N + G +PQY KE GY T VGK +H G
Sbjct: 97 PDTTRLYDFNSYWRVHAGNF----STIPQYFKENGYVTMSVGKVFHPG 140
Score = 71 (30.1 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
Identities = 20/60 (33%), Positives = 37/60 (61%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY P+ P ++ R I ++S FA++ + LD VG+++ AL+ ++ +++II F SD
Sbjct: 284 PYGPI--P---VDFQRKI---RQSYFASVSY-LDTQVGRLLSALDDLQLANSTIIAFTSD 334
>UNIPROTKB|F1NI04 [details] [associations]
symbol:GNS "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:AADN02009911 IPI:IPI00596266
Ensembl:ENSGALT00000016025 Uniprot:F1NI04
Length = 546
Score = 174 (66.3 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
Identities = 73/253 (28%), Positives = 109/253 (43%)
Query: 31 RTRIM---AFAVLP--LAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPT 85
R RIM A A L LA +V A+ P+++ IL DD DV G+ +
Sbjct: 5 RRRIMSRSALAALARGLALAALLVLSPAQAARQRPNVVLILTDD---QDVFLGGMTPMKK 61
Query: 86 PNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE 142
N +A G+ N Y LC PSR++I+TGK+P + + +N L G C + + E
Sbjct: 62 TNA-LIAQMGVTFSNAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKLWQKIQE 120
Query: 143 -KILPQYLKEL-GYRTRIVGKWHLGFYKKEYTPTFRGFESHL----GYWTGHQDYFDHSA 196
P LK + GY+T GK Y EY G SH+ +W + +
Sbjct: 121 PNTFPALLKSMCGYQTFFAGK-----YLNEYGAEDAGGVSHVPPGWSFWYALEKNSKYYN 175
Query: 197 EEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHSANP 256
+ + G R + D Y TDV ++D + S EP F+ ++ A HS
Sbjct: 176 YTLSVNGKARRHGENYSVD----YLTDVLANMSLDFLEYKSNFEPFFMMISTPAPHSPWT 231
Query: 257 YEPLQAPDHYLNI 269
P Q + +LN+
Sbjct: 232 AAP-QYKNDFLNV 243
Score = 37 (18.1 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 450 PKYENRYENGTHEYNGPKNENTNPRYEN 477
P+Y+N + N + P+N N N +N
Sbjct: 234 PQYKNDFLN----VSAPRNSNFNIHGKN 257
>UNIPROTKB|Q8IWU6 [details] [associations]
symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9606
"Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0006915 "apoptotic process" evidence=IEA] [GO:0005795 "Golgi
stack" evidence=IEA] [GO:0004065 "arylsulfatase activity"
evidence=IMP;IDA] [GO:0005615 "extracellular space"
evidence=IDA;NAS] [GO:0009986 "cell surface" evidence=IDA]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=IDA;NAS] [GO:0030336 "negative regulation of cell
migration" evidence=IMP] [GO:0040036 "regulation of fibroblast
growth factor receptor signaling pathway" evidence=IMP] [GO:0040037
"negative regulation of fibroblast growth factor receptor signaling
pathway" evidence=ISS;IMP] [GO:0030513 "positive regulation of BMP
signaling pathway" evidence=IMP] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IMP;IDA]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=IDA] [GO:0045121 "membrane raft" evidence=IDA] [GO:0005783
"endoplasmic reticulum" evidence=IDA] [GO:0048010 "vascular
endothelial growth factor receptor signaling pathway" evidence=IDA]
[GO:0002063 "chondrocyte development" evidence=ISS] [GO:0035860
"glial cell-derived neurotrophic factor receptor signaling pathway"
evidence=ISS] [GO:0051216 "cartilage development" evidence=ISS]
[GO:0060686 "negative regulation of prostatic bud formation"
evidence=ISS] [GO:0005794 "Golgi apparatus" evidence=ISS]
[GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=ISS] [GO:0003094 "glomerular filtration" evidence=ISS]
[GO:0032836 "glomerular basement membrane development"
evidence=ISS] [GO:0016525 "negative regulation of angiogenesis"
evidence=IDA] [GO:0001937 "negative regulation of endothelial cell
proliferation" evidence=IDA] [GO:0014846 "esophagus smooth muscle
contraction" evidence=ISS] [GO:0048706 "embryonic skeletal system
development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
[GO:0001822 "kidney development" evidence=ISS] [GO:0060348 "bone
development" evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 EMBL:AY101175 GO:GO:0005783 GO:GO:0005886
GO:GO:0005794 GO:GO:0006915 GO:GO:0005615 GO:GO:0009986
GO:GO:0005795 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 Orphanet:2496
HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:23213
EMBL:AF545571 EMBL:AB029000 EMBL:AK074873 IPI:IPI00293203
RefSeq:NP_001121676.1 RefSeq:NP_001121677.1 RefSeq:NP_001121678.1
RefSeq:NP_055985.2 UniGene:Hs.409602 ProteinModelPortal:Q8IWU6
SMR:Q8IWU6 STRING:Q8IWU6 PhosphoSite:Q8IWU6 DMDM:33112447
PaxDb:Q8IWU6 PRIDE:Q8IWU6 DNASU:23213 Ensembl:ENST00000260128
Ensembl:ENST00000402687 Ensembl:ENST00000419716
Ensembl:ENST00000458141 GeneID:23213 KEGG:hsa:23213 UCSC:uc003xyd.2
GeneCards:GC08P070428 HGNC:HGNC:20391 MIM:610012 neXtProt:NX_Q8IWU6
PharmGKB:PA134861022 InParanoid:Q8IWU6 OMA:SVRVTHK
OrthoDB:EOG4VT5WH ChiTaRS:SULF1 GenomeRNAi:23213 NextBio:44771
ArrayExpress:Q8IWU6 Bgee:Q8IWU6 CleanEx:HS_SULF1
Genevestigator:Q8IWU6 Uniprot:Q8IWU6
Length = 871
Score = 158 (60.7 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
Identities = 62/223 (27%), Positives = 96/223 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ LG + +++++ G+ + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWLGLIKNSR-FYNYTVCRN---GIKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRMYPHRPVMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 64 (27.6 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
Identities = 27/131 (20%), Positives = 52/131 (39%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+SV ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNILQRKRLQTLMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E IV + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSIVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DVDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
Score = 38 (18.4 bits), Expect = 2.7e-08, Sum P(3) = 2.7e-08
Identities = 22/92 (23%), Positives = 37/92 (40%)
Query: 408 ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENR-YE---NGTHEY 463
E+G S R GT +Y P++ + + T + ++E Y+ E
Sbjct: 508 ESGYRASRSQRKSQRQFLRNQGTPKYKPRFVHTRQ--TRSLSVEFEGEIYDINLEEEEEL 565
Query: 464 NGPKNENTNPRYENGTHEYNIPR-LENSINGN 494
+ N R++ G H+ PR L+ S GN
Sbjct: 566 QVLQPRNIAKRHDEG-HKG--PRDLQASSGGN 594
>UNIPROTKB|G1KQZ3 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:28377
"Anolis carolinensis" [GO:0001502 "cartilage condensation"
evidence=ISS] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=ISS] [GO:0005783 "endoplasmic
reticulum" evidence=ISS] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0009986 "cell surface" evidence=ISS] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=ISS] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=ISS] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
OMA:SVRVTHK Ensembl:ENSACAT00000015364 Uniprot:G1KQZ3
Length = 878
Score = 147 (56.8 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
Identities = 59/223 (26%), Positives = 93/223 (41%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMESGGATFVNAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN+ E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ +G + +++++ GL + + A D Y TD+ T +++
Sbjct: 154 P--GWREWVGLIKNSR-FYNYTVCRN---GLKEKHGFDYAKD----YFTDLITNDSIHYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q Y N +HI
Sbjct: 204 KMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSKLYPNASQHI 245
Score = 60 (26.2 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
Identities = 25/131 (19%), Positives = 53/131 (40%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+S+ ++ L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLLSVDDSMERLYHMLVETGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E +V++ +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSIEPGSVVSQIVLNI-DLAPTVLDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DMDGKSVLKLL 389
Score = 53 (23.7 bits), Expect = 3.2e-08, Sum P(3) = 3.2e-08
Identities = 23/112 (20%), Positives = 41/112 (36%)
Query: 402 NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNP---KYENRYEN 458
+ IL T +S N + +Y R NP KY+ R+ +
Sbjct: 480 SDILAIRKRTRSIHSQGYNNQENECDCREADYRSSRTQRKNQRAFMRNPSMPKYKPRFVH 539
Query: 459 GTHEYNGPKNENTNPRYE-NGTHEYNIPRLENSINGNGTSENRSNDNSYQNE 509
T + E Y+ N E +IP+ ++ + +G+ +DN Q +
Sbjct: 540 -TRQTRSLSVEFEGEIYDINLEEELHIPQPKSIVKRHGSYSEEDDDNEDQEQ 590
Score = 47 (21.6 bits), Expect = 4.8e-06, Sum P(2) = 4.8e-06
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSKLYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>UNIPROTKB|Q90XB6 [details] [associations]
symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9091
"Coturnix coturnix" [GO:0001502 "cartilage condensation"
evidence=IDA] [GO:0001822 "kidney development" evidence=ISS]
[GO:0001937 "negative regulation of endothelial cell proliferation"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0004065 "arylsulfatase activity" evidence=ISS] [GO:0005615
"extracellular space" evidence=IDA] [GO:0005783 "endoplasmic
reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus"
evidence=ISS] [GO:0007155 "cell adhesion" evidence=IDA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
[GO:0009986 "cell surface" evidence=ISS;IDA] [GO:0014846 "esophagus
smooth muscle contraction" evidence=ISS] [GO:0016525 "negative
regulation of angiogenesis" evidence=ISS] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=ISS]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=IDA] [GO:0030336 "negative regulation of cell migration"
evidence=ISS] [GO:0030513 "positive regulation of BMP signaling
pathway" evidence=ISS] [GO:0035860 "glial cell-derived neurotrophic
factor receptor signaling pathway" evidence=ISS] [GO:0036022 "limb
joint morphogenesis" evidence=IDA] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0048010
"vascular endothelial growth factor receptor signaling pathway"
evidence=ISS] [GO:0048661 "positive regulation of smooth muscle
cell proliferation" evidence=IDA] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060070 "canonical Wnt receptor
signaling pathway" evidence=IDA] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005794 GO:GO:0005615 GO:GO:0009986 GO:GO:0048661
GO:GO:0005795 GO:GO:0005509 GO:GO:0045121 GO:GO:0030336
GO:GO:0001822 GO:GO:0001937 GO:GO:0030513 GO:GO:0016525
GO:GO:0001502 GO:GO:0060348 GO:GO:0060070 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0004065 GO:GO:0048706 GO:GO:0048010
GO:GO:0060686 GO:GO:0002063 GO:GO:0040037 GO:GO:0060384
GO:GO:0008449 GO:GO:0030201 EMBL:AF410802 ProteinModelPortal:Q90XB6
HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 GO:GO:0036022
InterPro:IPR024609 Pfam:PF12548 Uniprot:Q90XB6
Length = 867
Score = 150 (57.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
Identities = 59/223 (26%), Positives = 94/223 (42%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN+ E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ +G + +++++ G + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWVGL-VKNSRFYNYTISRN---GNKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + Y N +HI
Sbjct: 204 RMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSELYPNASQHI 245
Score = 62 (26.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
Identities = 25/131 (19%), Positives = 53/131 (40%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+S+ ++ + L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E +V + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DMDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSELYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>UNIPROTKB|E1BRF7 [details] [associations]
symbol:SULF1 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003094
"glomerular filtration" evidence=IEA] [GO:0005886 "plasma membrane"
evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
growth factor production" evidence=IEA] [GO:0032836 "glomerular
basement membrane development" evidence=IEA] [GO:0001502 "cartilage
condensation" evidence=ISS] [GO:0036022 "limb joint morphogenesis"
evidence=ISS] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0040037 "negative regulation of
fibroblast growth factor receptor signaling pathway" evidence=ISS]
[GO:0005794 "Golgi apparatus" evidence=ISS] [GO:0001937 "negative
regulation of endothelial cell proliferation" evidence=ISS]
[GO:0016525 "negative regulation of angiogenesis" evidence=ISS]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=ISS] [GO:0030201 "heparan sulfate proteoglycan metabolic
process" evidence=ISS] [GO:0030336 "negative regulation of cell
migration" evidence=ISS] [GO:0030513 "positive regulation of BMP
signaling pathway" evidence=ISS] [GO:0048010 "vascular endothelial
growth factor receptor signaling pathway" evidence=ISS] [GO:0004065
"arylsulfatase activity" evidence=ISS] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISS]
[GO:0005615 "extracellular space" evidence=ISS] [GO:0005783
"endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
evidence=ISS] [GO:0045121 "membrane raft" evidence=ISS] [GO:0007155
"cell adhesion" evidence=ISS] [GO:0048661 "positive regulation of
smooth muscle cell proliferation" evidence=ISS] [GO:0014846
"esophagus smooth muscle contraction" evidence=ISS] [GO:0048706
"embryonic skeletal system development" evidence=ISS] [GO:0001822
"kidney development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] [GO:0060384 "innervation" evidence=ISS] [GO:0060686
"negative regulation of prostatic bud formation" evidence=ISS]
InterPro:IPR000917 InterPro:IPR014615 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0005615 GO:GO:0009986
GO:GO:0048661 GO:GO:0005509 GO:GO:0010575 GO:GO:0045121
GO:GO:0030336 GO:GO:0001822 GO:GO:0001937 GO:GO:0030513
GO:GO:0016525 GO:GO:0001502 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
GO:GO:0048010 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 GO:GO:0014846
GO:GO:0035860 GO:GO:0036022 InterPro:IPR024609 Pfam:PF12548
OMA:SVRVTHK EMBL:AADN02048527 EMBL:AADN02048528 EMBL:AADN02048529
EMBL:AADN02048530 EMBL:AADN02048531 EMBL:AADN02048532
EMBL:AADN02048533 EMBL:AADN02048534 EMBL:AADN02048535
EMBL:AADN02048536 EMBL:AADN02048537 EMBL:AADN02048538
EMBL:AADN02048539 EMBL:AADN02048540 EMBL:AADN02048541
IPI:IPI00571776 ProteinModelPortal:E1BRF7
Ensembl:ENSGALT00000018383 ArrayExpress:E1BRF7 Uniprot:E1BRF7
Length = 868
Score = 150 (57.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
Identities = 59/223 (26%), Positives = 94/223 (42%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRRIMENGGASFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN+ E P + + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNIYTNNENCSSPSWQATHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ +G + +++++ G + + A D Y TD+ T E+++
Sbjct: 154 P--GWREWVGL-VKNSRFYNYTISRN---GNKEKHGFDYAKD----YFTDLITNESINYF 203
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + Y N +HI
Sbjct: 204 RMSKRIYPHRPIMMVISHAAPHGPEDSAP-QFSELYPNASQHI 245
Score = 62 (26.9 bits), Expect = 6.4e-08, Sum P(2) = 6.4e-08
Identities = 25/131 (19%), Positives = 53/131 (40%)
Query: 267 LNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXX 326
L IH + + K L +D+S+ ++ + L + L N+ I++ +D
Sbjct: 268 LPIHMEFTNVLQRKRLQTLMSVDDSMERLYQMLAEMGELENTYIIYTADHGYHIGQFGLV 327
Query: 327 XXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P ++ +R I P +E +V + +++ D PT+L A P
Sbjct: 328 KGKSMP--------YDFDIRVPFFIRGPSVEPGSVVPQIVLNI-DLAPTILDIAGLDTPP 378
Query: 387 NYVNSTVENII 397
+ +V ++
Sbjct: 379 DMDGKSVLKLL 389
Score = 47 (21.6 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGP 466
+G + P++ Y N + P Y N ++ +Y GP
Sbjct: 225 HGPEDSAPQFSELYPNASQHITPSYNYAPNMDKHWIMQYTGP 266
>DICTYBASE|DDB_G0281179 [details] [associations]
symbol:clkA "protein kinase, CMGC group"
species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
activity, transferring phosphorus-containing groups" evidence=IEA]
[GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
"ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
kinase activity" evidence=IEA;ISS] [GO:0004672 "protein kinase
activity" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
[GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
activity" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000719 InterPro:IPR008271
InterPro:IPR011009 InterPro:IPR017441 Pfam:PF00069 PROSITE:PS00107
PROSITE:PS00108 PROSITE:PS50011 dictyBase:DDB_G0281179
GO:GO:0005524 GenomeReviews:CM000152_GR eggNOG:COG0515
SUPFAM:SSF56112 EMBL:AAFI02000040 GO:GO:0004674 KO:K08287
HSSP:P49761 RefSeq:XP_640867.1 ProteinModelPortal:Q54UA9
EnsemblProtists:DDB0230105 GeneID:8622923 KEGG:ddi:DDB_G0281179
OMA:ICNENDY ProtClustDB:CLSZ2846791 Uniprot:Q54UA9
Length = 932
Score = 162 (62.1 bits), Expect = 6.5e-08, P = 6.5e-08
Identities = 48/165 (29%), Positives = 71/165 (43%)
Query: 409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
N ++ YN N+N Y N + YN Y N N ++ +N Y N N + YN N
Sbjct: 369 NNSNNYNHNN-SNNNGGYNNYNNGYN-NYNNNNSNNSN-HNSSYNN---NNNNNYNNNNN 422
Query: 469 ENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
N N N + N N+ N N + N +N+N+ N + ++ S N S N
Sbjct: 423 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNI----SNN--SNNN 476
Query: 529 TILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
+N D++ S G + NS N N + N +N NSY N
Sbjct: 477 NFNYNNDNDRNNS---NGNYN---NNSSNINNNNNNNNNSNSYHN 515
Score = 146 (56.5 bits), Expect = 3.4e-06, P = 3.4e-06
Identities = 50/202 (24%), Positives = 79/202 (39%)
Query: 378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
S N ++ NY NS N Y + Y + + YN+ N+N N ++ +
Sbjct: 216 SVNNNNNNRNYSNSYNNN---NYNDGNNNYNSNNYNYNNNNNNNNNINNNNNSNSNSNSN 272
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
N N N N Y N + YN K+ N RY + N+ N N
Sbjct: 273 SNSNSNSNSNSNSN-NNNYNN--YGYNNHKSNNGGNRYSDDDDNVFNNNNNNNNNNNNNY 329
Query: 498 ENRSNDNSYQNEIDGIDVW--SVLSRNEPSKRNTIL---HNIDDEWQISALTRGKWKLVK 552
N +++N+Y N+ D D ++ SRN + N +N ++ ++ G +
Sbjct: 330 NNYNSNNNYNNDYDYNDGKRANIYSRNNSNNNNNSKSGNNNSNNYNHNNSNNNGGYNNYN 389
Query: 553 ENSINGNGTSENRSNDNS-YQN 573
N N + N SN NS Y N
Sbjct: 390 NGYNNYNNNNSNNSNHNSSYNN 411
Score = 137 (53.3 bits), Expect = 3.2e-05, P = 3.2e-05
Identities = 49/201 (24%), Positives = 89/201 (44%)
Query: 381 NKSDIPNYVNSTVENIIPR-YENSILRYENGTHEYNSPRIENSNTR--YENGTHEYNPKY 437
N ++I N NS N EN+ + EN +++ + +S R +N H N
Sbjct: 99 NNNNINNNGNSNNNNNNSNGSENNYFQSENQSNKDQNSYFNSSYLRNPVDNYNHNNNNHN 158
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRL--EN---SIN 492
N ++N + + Y+N + + N+N N + +Y+I ++ EN S+N
Sbjct: 159 NNAFDNNNYNTQNLGDYSYKNDGYNNDNNNNDNNNSYGDTDREKYSIEKICNENDYDSVN 218
Query: 493 GNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
N + N SN + N DG + ++ + N + N +NI++ ++ +
Sbjct: 219 NNNNNRNYSNSYNNNNYNDGNNNYNSNNYNYNNNNNNN-NNINNNNNSNSNSNSN----- 272
Query: 553 ENSINGNGTSENRSNDNSYQN 573
NS N N S + SN+N+Y N
Sbjct: 273 SNS-NSNSNSNSNSNNNNYNN 292
Score = 135 (52.6 bits), Expect = 5.2e-05, P = 5.2e-05
Identities = 39/168 (23%), Positives = 70/168 (41%)
Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
Y N + YN+ + N RY + N N + YN N N ++YN
Sbjct: 290 YNN--YGYNNHKSNNGGNRYSDDDDNVFNNNNNNNNNNNNNYNNYNSNNNYNNDYDYNDG 347
Query: 467 KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS-YQNEIDGIDVWSVLSRNEPS 525
K N R N ++ N + N+ N N + N SN+N Y N +G + ++ + N +
Sbjct: 348 KRANIYSR--NNSNNNNNSKSGNN-NSNNYNHNNSNNNGGYNNYNNGYNNYNNNNSNNSN 404
Query: 526 KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
++ +N ++ + + N+ N N + N +N+N+ N
Sbjct: 405 HNSSYNNNNNNNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 452
Score = 130 (50.8 bits), Expect = 0.00018, P = 0.00018
Identities = 49/213 (23%), Positives = 85/213 (39%)
Query: 387 NYVNSTVENIIPRYEN-SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY---ENRYE 442
N+ N+ +N +N Y+N + ++ +N+N+ + +Y+ + EN Y+
Sbjct: 156 NHNNNAFDNNNYNTQNLGDYSYKNDGYNNDNNNNDNNNSYGDTDREKYSIEKICNENDYD 215
Query: 443 N-GTHEYNPKYENRYENGTHEYNGPKNENTNP-RYENGTHEYNIPRLENSINGNGTSENR 500
+ + N Y N Y N + +G N N+N Y N + N N+ N N S +
Sbjct: 216 SVNNNNNNRNYSNSYNNNNYN-DGNNNYNSNNYNYNNNNNNNNNINNNNNSNSNSNSNSN 274
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
SN NS N + ++ N N DD+ + N+ N
Sbjct: 275 SNSNSNSNSNSNNNNYNNYGYNNHKSNNGGNRYSDDDDNVFNNNNNN-NNNNNNNYNNYN 333
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
++ N +ND Y DG ++ SRN + N
Sbjct: 334 SNNNYNNDYDYN---DGKRA-NIYSRNNSNNNN 362
Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
Identities = 39/169 (23%), Positives = 59/169 (34%)
Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
Y N + Y + N N Y N H N Y N N + N N + N
Sbjct: 16 YSNNDYGYYNNNCSNVN--YNNDIHYKNNNYNNNNNNNNSNSGNNFNNNNNNNNNNNNNN 73
Query: 467 KNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 526
N N N N + Y NS N N N N N+ N +G + S N+ +K
Sbjct: 74 NNNNNNNN-NNNNYTYGNNNNNNSNNNNNNINNNGNSNNNNNNSNGSENNYFQSENQSNK 132
Query: 527 -RNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNE 574
+N+ ++ + N+ + N + D SY+N+
Sbjct: 133 DQNSYFNSSYLRNPVDNYNHNN-NNHNNNAFDNNNYNTQNLGDYSYKND 180
Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
Identities = 49/196 (25%), Positives = 73/196 (37%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N S+ N+ +S N Y N+ N + N+ N+N N + N N
Sbjct: 398 NNSNNSNHNSSYNNNNNNNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 457
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
N + N N N YN + N + NG YN NS N N + N
Sbjct: 458 NNNNNNNNNNNISNNSNNNNFNYNNDNDRNNS----NGN--YN----NNSSNINNNNNNN 507
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
+N NSY N S+N S +N N +++ Q +A S N
Sbjct: 508 NNSNSYHNSCISYSNGGSNSKN--SNKN----NYNNQ-QSNANGNHVGNSKNNESCNNTN 560
Query: 561 TSENRSNDNSYQNEID 576
T+ +SN + + +E D
Sbjct: 561 TNIEKSNKSMWDDEND 576
Score = 127 (49.8 bits), Expect = 0.00038, P = 0.00038
Identities = 43/194 (22%), Positives = 75/194 (38%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N ++I N NS + NS + ++ N +N + NG + Y+ +N
Sbjct: 255 NNNNINNNNNSNSNSNSNSNSNSNSNSNSNSNNNNYNNYGYNNHKSNNGGNRYSDDDDNV 314
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
+ N + N N Y N YN N N + Y +G NI NS N N +
Sbjct: 315 FNNNNNNNNNN-NNNYNN----YNSNNNYNNDYDYNDGKRA-NIYSRNNSNNNNNSKSGN 368
Query: 501 SNDNSYQ-NEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
+N N+Y N + ++ + + N +N + + + N+ N N
Sbjct: 369 NNSNNYNHNNSNNNGGYNNYNNGYNNYNNNNSNNSNHNSSYNNNNNNNYNNNNNNNNNNN 428
Query: 560 GTSENRSNDNSYQN 573
+ N +N+N+ N
Sbjct: 429 NNNNNNNNNNNNNN 442
Score = 127 (49.8 bits), Expect = 0.00038, P = 0.00038
Identities = 52/214 (24%), Positives = 78/214 (36%)
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE 447
Y N+ N+ Y N I Y+N + YN+ N++ N + N N N +
Sbjct: 23 YYNNNCSNV--NYNNDI-HYKN--NNYNNNNNNNNSNSGNNFNNNNNNNNNNNNNNNNNN 77
Query: 448 YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN--DNS 505
N N Y G + N N N N NG N S N SEN+SN NS
Sbjct: 78 NNNNNNNNYTYGNNNNNNSNNNNNNIN-NNGNSNNNNNNSNGSENNYFQSENQSNKDQNS 136
Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
Y N + + N + N N + + L G + + N N N
Sbjct: 137 YFNSSYLRNPVDNYNHNNNNHNNNAFDN--NNYNTQNL--GDYSYKNDGYNNDNN---NN 189
Query: 566 SNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHN 598
N+NSY + + + + + N+ N +N
Sbjct: 190 DNNNSYGDTDREKYSIEKICNENDYDSVNNNNNN 223
>UNIPROTKB|I3L2L4 [details] [associations]
symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH Ensembl:ENST00000570427
Bgee:I3L2L4 Uniprot:I3L2L4
Length = 188
Score = 135 (52.6 bits), Expect = 6.7e-08, Sum P(2) = 6.7e-08
Identities = 42/136 (30%), Positives = 72/136 (52%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
P+ +++ V + + P + + +LADD G+ G + I TP++DALA ++ +N
Sbjct: 4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62
Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYR 155
+T V C+PSR++++TG P H N +YG + + +K+ LP L + G R
Sbjct: 63 AFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVR 117
Query: 156 TR------IVGKWHLG 165
T I+GK H+G
Sbjct: 118 TGLSSRPGIIGKKHVG 133
Score = 37 (18.1 bits), Expect = 6.7e-08, Sum P(2) = 6.7e-08
Identities = 7/14 (50%), Positives = 8/14 (57%)
Query: 239 DEPLFLYLAHAATH 252
D P FLY+A H
Sbjct: 174 DRPFFLYVAFHDPH 187
>RGD|1305877 [details] [associations]
symbol:Gns "glucosamine (N-acetyl)-6-sulfatase" species:10116
"Rattus norvegicus" [GO:0005539 "glycosaminoglycan binding"
evidence=IPI] [GO:0005764 "lysosome" evidence=IDA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IDA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=ISO]
[GO:0042340 "keratan sulfate catabolic process" evidence=IDA]
[GO:0043199 "sulfate binding" evidence=IPI] InterPro:IPR000917
InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877
GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 PROSITE:PS00149 GO:GO:0042340 GO:GO:0043199
GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239 HOVERGEN:HBG005840
KO:K01137 GO:GO:0008449 PANTHER:PTHR10342:SF5 UniGene:Rn.228654
EMBL:BC087741 IPI:IPI00951484 RefSeq:NP_001011989.1 IntAct:Q5M918
STRING:Q5M918 Ensembl:ENSRNOT00000064349 GeneID:299825
KEGG:rno:299825 InParanoid:Q5M918 NextBio:645846
Genevestigator:Q5M918 Uniprot:Q5M918
Length = 519
Score = 156 (60.0 bits), Expect = 1.2e-07, P = 1.2e-07
Identities = 69/235 (29%), Positives = 104/235 (44%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
G +R+ A +LPL LS + LV ++ P+++ +L DD D G+ P
Sbjct: 12 GCPSRLPALLLLPL---LSGC-LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62
Query: 89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
AL G+ + Y LC PSR++I+TGK+P + + +N L G C + + E
Sbjct: 63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPY 122
Query: 145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
P LK + GY+T GK Y EY P G E LG YW + +
Sbjct: 123 TFPAILKLVCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYT 177
Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
+ + G R + D Y TDV ++D + S EP F+ ++ A HS
Sbjct: 178 LSINGKARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228
>TIGR_CMR|CPS_2358 [details] [associations]
symbol:CPS_2358 "sulfatase family protein" species:167879
"Colwellia psychrerythraea 34H" [GO:0008152 "metabolic process"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000083
GenomeReviews:CP000083_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
HOGENOM:HOG000014304 RefSeq:YP_269076.1 ProteinModelPortal:Q482E2
STRING:Q482E2 GeneID:3518855 KEGG:cps:CPS_2358 PATRIC:21467803
OMA:ETIRIDS BioCyc:CPSY167879:GI48-2421-MONOMER Uniprot:Q482E2
Length = 499
Score = 135 (52.6 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
Identities = 39/103 (37%), Positives = 53/103 (51%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALA-YSGIILKNYYTVQLCTPSRSAIMTGK 118
P+I+FI DDL + +G ++ TPNID LA S + + Y +C PSR +I+TG
Sbjct: 53 PNILFIAVDDLK-PLIRDYGTAKVQTPNIDKLASQSTVFTRAYSQYPVCGPSRMSILTGL 111
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK 161
P G+ + L R P S LPQ+ K GY T GK
Sbjct: 112 RPESNGIMN--LKDKIRDVNP-SVITLPQFFKNNGYETAATGK 151
Score = 65 (27.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
Identities = 26/105 (24%), Positives = 45/105 (42%)
Query: 219 KYSTDVFTAEAVDIIHNHSTDEPL-FLYLAHAATHSANPYEPLQAPDHYLNIH------- 270
KY D+++ E+ D+ S E YL H Y+P + +
Sbjct: 239 KYY-DLYSRESFDLASYQSAPEDADTTYLFHK-NQELRGYKPTPIKGGEIKPYPKGKLSS 296
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
H ++ FA++ +D VG+++E LE+ N++IVF D
Sbjct: 297 AHQKELLHGYFASVSF-IDSLVGELLEELEKTGQAENTVIVFWGD 340
Score = 45 (20.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 677 LFDIKNDPCEKNNLADRSE 695
L+D+ NDP E N+ + E
Sbjct: 458 LYDLINDPLETKNIINTPE 476
Score = 45 (20.9 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 783 LFDIKNDPCEKNNLADRSE 801
L+D+ NDP E N+ + E
Sbjct: 458 LYDLINDPLETKNIINTPE 476
>UNIPROTKB|I3L643 [details] [associations]
symbol:GNS "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0030203 "glycosaminoglycan metabolic process"
evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GeneTree:ENSGT00400000022041 GO:GO:0030203 GO:GO:0008449
PANTHER:PTHR10342:SF5 EMBL:AEMK01192095 EMBL:FP700150
Ensembl:ENSSSCT00000032527 OMA:FARAFAN Uniprot:I3L643
Length = 369
Score = 146 (56.5 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
Identities = 63/229 (27%), Positives = 98/229 (42%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRS 112
A S P+++ +LADD D G+ P AL G+ + Y LC PSR+
Sbjct: 42 ADSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRA 96
Query: 113 AIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYK 168
+I+TGK+P + + +N L G C + + E P L+ + GY+T GK Y
Sbjct: 97 SILTGKYPHNHHVVNNTLEGNCSSKSWQKIEEPNTFPAILRSVCGYQTFFAGK-----YL 151
Query: 169 KEYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDV 224
EY G +H LG YW + + + + G + + D Y TDV
Sbjct: 152 NEYGAPDAGGLAHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDV 207
Query: 225 FTAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
++D + S EP F+ ++ A HS P A Y N +++
Sbjct: 208 LANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNTFQNV 251
Score = 52 (23.4 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 290 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 326
Score = 40 (19.1 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 450 PKYENRYENGTHEYNGPKNENTN 472
P+Y+N ++N P+N+N N
Sbjct: 242 PQYQNTFQN----VFAPRNKNFN 260
Score = 40 (19.1 bits), Expect = 1.6e-06, Sum P(3) = 1.6e-06
Identities = 18/62 (29%), Positives = 26/62 (41%)
Query: 516 WSVLSRNEPSKRNTI--LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDN 569
W + P ++I L N WQ + ++ KLVK NG N T ++DN
Sbjct: 268 WLIRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDN 327
Query: 570 SY 571
Y
Sbjct: 328 GY 329
>UNIPROTKB|Q32KJ5 [details] [associations]
symbol:Gns "Glucosamine (N-acetyl)-6-sulfatase"
species:10116 "Rattus norvegicus" [GO:0005764 "lysosome"
evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic
process" evidence=IEA] InterPro:IPR000917 InterPro:IPR012251
InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036666 RGD:1305877 GO:GO:0005764
Gene3D:3.40.720.10 SUPFAM:SSF53649 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0030203
HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
GO:GO:0008449 PANTHER:PTHR10342:SF5 OMA:MCGYQTF EMBL:BN000742
IPI:IPI00366226 RefSeq:XP_003750373.1 UniGene:Rn.228654
IntAct:Q32KJ5 STRING:Q32KJ5 Ensembl:ENSRNOT00000006566
GeneID:100909505 KEGG:rno:100909505 InParanoid:Q32KJ5
Genevestigator:Q32KJ5 Uniprot:Q32KJ5
Length = 544
Score = 156 (60.0 bits), Expect = 1.3e-07, P = 1.3e-07
Identities = 69/235 (29%), Positives = 104/235 (44%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
G +R+ A +LPL LS + LV ++ P+++ +L DD D G+ P
Sbjct: 12 GCPSRLPALLLLPL---LSGC-LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62
Query: 89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
AL G+ + Y LC PSR++I+TGK+P + + +N L G C + + E
Sbjct: 63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPY 122
Query: 145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
P LK + GY+T GK Y EY P G E LG YW + +
Sbjct: 123 TFPAILKLVCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYT 177
Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
+ + G R + D Y TDV ++D + S EP F+ ++ A HS
Sbjct: 178 LSINGKARRHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228
>DICTYBASE|DDB_G0287057 [details] [associations]
symbol:gtaN "GATA zinc finger domain-containing
protein 14" species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
binding transcription factor activity" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] InterPro:IPR000679 Pfam:PF00320
PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401
dictyBase:DDB_G0287057 GenomeReviews:CM000153_GR GO:GO:0046872
GO:GO:0043565 GO:GO:0008270 GO:GO:0003700 eggNOG:COG5641
EMBL:AAFI02000096 HSSP:P17679 RefSeq:XP_637400.1
ProteinModelPortal:Q54KX0 EnsemblProtists:DDB0220469 GeneID:8625931
KEGG:ddi:DDB_G0287057 OMA:GANEDHL Uniprot:Q54KX0
Length = 953
Score = 159 (61.0 bits), Expect = 1.4e-07, P = 1.4e-07
Identities = 56/229 (24%), Positives = 104/229 (45%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
NK++ N N+ N + N+ + N + +N+ N+ + N + +N N
Sbjct: 448 NKNNHNNNHNNNNHNN-NNHNNNNNNHNNNNNNHNNNNNHNNQNNHNNQNNNHNNNQNNN 506
Query: 441 YENG-THEYNPK-YENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
Y N + YNP Y N Y N + YN N N NP N + +N N+ N N +
Sbjct: 507 YNNNQNNNYNPNNYGNNY-NPNNNYN---NSN-NPNNMNNNYNHN-QNNNNNNNNNNQNY 560
Query: 499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
N +++N + N+ + I S ++N ++ N HN +++ + + N+ N
Sbjct: 561 NNNHNNQFNNQNNQIHNQSN-NQNNYNQNNN--HNNNNQNNNNNNQNNNNNNNQNNNNNN 617
Query: 559 NGTSENRSNDNSYQNEIDGIDVWSVLSRNE-P--SKRNTIL-HNIDDEW 603
N + N +N+N+ N G+ + S++ P S N+ L +N ++E+
Sbjct: 618 NNINNNNNNNNNNNNGNTGLSSSTNNSKHSSPRSSPNNSPLNYNTNEEY 666
Score = 145 (56.1 bits), Expect = 4.5e-06, P = 4.5e-06
Identities = 48/197 (24%), Positives = 77/197 (39%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYE-NGTHEYNPKY-E 438
N ++ N NS + N NS +N + N+ + + N+ NG + K E
Sbjct: 282 NNNNNNNSSNSNINNNNNNSNNSNNNIDNSNNNNNNNNVRSGNSNVNANGHNRLKRKSKE 341
Query: 439 NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYEN--GTHEYNIPRLENSINGNGT 496
N Y N N + N+ N + +N N N N N T++ NI N N N
Sbjct: 342 NIYNNNNQNNNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNH 401
Query: 497 SENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
N + +N+YQN + S + N+ N N + + + K N
Sbjct: 402 QNNNNQNNNYQNNNNQN---SGNNNNQNHHNNKFNQNNNHNQNNHSNNQNK-NNHNNNHN 457
Query: 557 NGNGTSENRSNDNSYQN 573
N N + N +N+N+ N
Sbjct: 458 NNNHNNNNHNNNNNNHN 474
Score = 141 (54.7 bits), Expect = 1.2e-05, P = 1.2e-05
Identities = 45/214 (21%), Positives = 88/214 (41%)
Query: 387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEY--NPKYENRYENG 444
N N+ NI +N+I N + N N N Y+N ++ N +N + N
Sbjct: 372 NNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNNYQNNNNQNSGNNNNQNHHNNK 431
Query: 445 THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
++ N +N + N ++ N N N N + N H N N+ N + + N +N N
Sbjct: 432 FNQNNNHNQNNHSNNQNKNNHNNNHNNN-NHNNNNHNNNNNNHNNNNNNHNNNNNHNNQN 490
Query: 505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
++ N+ + + + N N +N + + + N++N N + N
Sbjct: 491 NHNNQNNNHNNNQNNNYNNNQNNNYNPNNYGNNYNPNNNYNNS---NNPNNMNNN-YNHN 546
Query: 565 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
++N+N+ N ++ N+ + +N +HN
Sbjct: 547 QNNNNNNNNNNQN---YNNNHNNQFNNQNNQIHN 577
Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
Identities = 54/248 (21%), Positives = 94/248 (37%)
Query: 354 PLLESRGIVAEQYVHVSDWLPTLLSAANK--SDIPNYVNSTVENIIPRYENSILRYENGT 411
P++ ++ I+ S L + S N D PN N+T N + S + T
Sbjct: 163 PIINTKSIIPSASQLQSQNLNIINSINNNFSKDSPNSQNNTSFNEDTIFIASTTYGSSNT 222
Query: 412 HEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENT 471
N+ I N+N+ N ++ N N N + N N + N + N N N
Sbjct: 223 PNNNNNNINNNNSNNNNNSNNSNNN-NNSTNNNNNSSNINSPNDFNNNHNNNNNNNNNNN 281
Query: 472 NPRYENGTHEYNIPRLEN-SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 530
N N + NI N S N N +N +N+N+ N G + N +++
Sbjct: 282 NNNNNNNSSNSNINNNNNNSNNSNNNIDNSNNNNNNNNVRSGNSNVNANGHNRLKRKSK- 340
Query: 531 LHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 590
NI + + + N+ N N + + +N N+ QN I + ++ + +
Sbjct: 341 -ENI---YNNNNQNNNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQ 396
Query: 591 KRNTILHN 598
N +N
Sbjct: 397 NNNNHQNN 404
Score = 135 (52.6 bits), Expect = 5.4e-05, P = 5.4e-05
Identities = 52/207 (25%), Positives = 85/207 (41%)
Query: 381 NKSDIPNYVNSTVENIIPR-YENSIL---RYENGTHEYNSPRIENSNTRYENGTHEYNPK 436
N + NY N+ N P Y N+ Y N + N N N N + N
Sbjct: 500 NNNQNNNYNNNQNNNYNPNNYGNNYNPNNNYNNSNNPNNMNNNYNHNQNNNNNNNNNNQN 559
Query: 437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPR--LENSINGN 494
Y N + N + N + N+ N + YN N N N + N ++ N N+ N N
Sbjct: 560 YNNNHNNQFNNQNNQIHNQ-SNNQNNYNQNNNHNNNNQNNNNNNQNNNNNNNQNNNNNNN 618
Query: 495 GTSENRSNDNSYQNEIDGIDVWSVLSRNE-P--SKRNTIL-HNIDDEWQISALTRGKWKL 550
+ N +N+N+ N G+ + S++ P S N+ L +N ++E+ S +
Sbjct: 619 NINNNNNNNNNNNNGNTGLSSSTNNSKHSSPRSSPNNSPLNYNTNEEYYNSGSSSPSSPG 678
Query: 551 VKENSI----NGNGTSENRSNDNSYQN 573
+SI +GN N++N N+ N
Sbjct: 679 SPNSSILQITDGNNGFNNQNNLNNGNN 705
Score = 134 (52.2 bits), Expect = 6.9e-05, P = 6.9e-05
Identities = 44/218 (20%), Positives = 90/218 (41%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N + N N+ N + N+ +N N I+N++ + N H+ N N
Sbjct: 351 NNNQNNNQNNNHNNNHNNNHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNN 410
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
Y+N ++ N N + +++N N N N + N ++ N N+ N N + N
Sbjct: 411 YQNNNNQ-NSGNNNNQNHHNNKFNQNNNHNQN-NHSNNQNKNNHNNNHNNNNHNNNNHNN 468
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
+N+N N + + + ++N + +N HN + + + N+ N N
Sbjct: 469 NNNNHNNNNNNHNNNNNHNNQNNHNNQNNN-HNNNQN--------NNYNNNQNNNYNPNN 519
Query: 561 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
N + +N+Y N + ++ + + N+ + N +N
Sbjct: 520 YGNNYNPNNNYNNSNNPNNMNNNYNHNQNNNNNNNNNN 557
Score = 129 (50.5 bits), Expect = 0.00024, P = 0.00024
Identities = 47/196 (23%), Positives = 76/196 (38%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N + NY N+ +N N+ + N N+ +N+++ +N + N N
Sbjct: 404 NNNQNNNYQNNNNQN---SGNNNNQNHHNNKFNQNNNHNQNNHSNNQNKNNHNNNHNNNN 460
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
+ N H N N N H N N N +N H N +N+ N N N
Sbjct: 461 HNNNNHNNNNNNHNN-NNNNHNNNNNHNNQNNHNNQNNNHNNN----QNN-NYNNNQNNN 514
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING-N 559
N N+Y N + + ++ S N + N HN ++ + + N N N
Sbjct: 515 YNPNNYGNNYNPNNNYNN-SNNPNNMNNNYNHNQNNN-NNNNNNNQNYNNNHNNQFNNQN 572
Query: 560 GTSENRSND-NSY-QN 573
N+SN+ N+Y QN
Sbjct: 573 NQIHNQSNNQNNYNQN 588
Score = 124 (48.7 bits), Expect = 0.00082, P = 0.00082
Identities = 41/194 (21%), Positives = 80/194 (41%)
Query: 380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN 439
+N ++ N V S N+ N + R ++ + YN+ +N+N + N + +N + N
Sbjct: 311 SNNNNNNNNVRSGNSNVNANGHNRLKR-KSKENIYNNNN-QNNNNQNNNQNNNHNNNHNN 368
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
+ N + +N +N + +N N + N + Y +NS GN ++N
Sbjct: 369 NHNNNQNNNQNNIQNTNQNNIQNNHNQQNNNNHQNNNNQNNNYQNNNNQNS--GNNNNQN 426
Query: 500 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
N+ QN + S ++N+ + N HN ++ + N+ N N
Sbjct: 427 HHNNKFNQNNNHNQNNHSN-NQNKNNHNNN--HNNNNHNNNNHNNNNNNHNNNNNNHNNN 483
Query: 560 GTSENRSNDNSYQN 573
N++N N+ N
Sbjct: 484 NNHNNQNNHNNQNN 497
>DICTYBASE|DDB_G0288501 [details] [associations]
symbol:ddx42 "DEAD/DEAH box helicase" species:44689
"Dictyostelium discoideum" [GO:0008026 "ATP-dependent helicase
activity" evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA]
[GO:0004386 "helicase activity" evidence=IEA] [GO:0003676 "nucleic
acid binding" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000629
InterPro:IPR001650 InterPro:IPR011545 Pfam:PF00270 Pfam:PF00271
PROSITE:PS00039 PROSITE:PS51194 SMART:SM00490
dictyBase:DDB_G0288501 GO:GO:0005524 GO:GO:0005634
GenomeReviews:CM000154_GR GO:GO:0003723 InterPro:IPR014001
SMART:SM00487 PROSITE:PS51192 EMBL:AAFI02000112 GO:GO:0008026
eggNOG:COG0513 InterPro:IPR014014 PROSITE:PS51195 HSSP:P09052
KO:K12835 RefSeq:XP_636700.1 ProteinModelPortal:Q54IV3
STRING:Q54IV3 EnsemblProtists:DDB0233432 GeneID:8626657
KEGG:ddi:DDB_G0288501 OMA:DRDKRGG Uniprot:Q54IV3
Length = 986
Score = 159 (61.0 bits), Expect = 1.5e-07, P = 1.5e-07
Identities = 54/202 (26%), Positives = 82/202 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
NK N +S N I NS N T+ + NSN+ + YN Y+N
Sbjct: 774 NKFSNNNSGSSNDRNSINYRNNSFNNNSNNTNNSGNSNFNNSNSNNGYSNNNYNNNYKNN 833
Query: 441 --YENGTHEYNPKYENRYENGTHE--YNGPKNENTNPR--YENGTHE--YNIPRL--ENS 490
Y N + N Y N N + YN N N N Y NG + YN N+
Sbjct: 834 SNYNNSNNNNNSYYNNNNSNNNNNSNYNNSSNNNNNNNNNYRNGNNNNNYNNNNYYNNNN 893
Query: 491 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRGKW 548
N N ++ N SN+NS N + + + + N+ S N L ++ ++ S +
Sbjct: 894 SNNNSSNNNNSNNNSSNNNFNN-NFNNNNNNNDNSNFNRALPFNDFNNNNNNSNNNNFNY 952
Query: 549 KLVKENSINGNGTSENRSNDNS 570
NS N N ++ ++N+NS
Sbjct: 953 NNNFNNSYNANNSNHYKNNNNS 974
Score = 149 (57.5 bits), Expect = 1.7e-06, P = 1.7e-06
Identities = 48/177 (27%), Positives = 71/177 (40%)
Query: 402 NSILRYENGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGT 460
NSI Y N + NS NS N+ + N N Y N N ++ N Y N N
Sbjct: 788 NSI-NYRNNSFNNNSNNTNNSGNSNFNNSNS--NNGYSNNNYNNNYKNNSNYNNSNNNNN 844
Query: 461 HEYNGPK-NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
YN N N N Y N ++ N N NGN + N +N+N Y N + +
Sbjct: 845 SYYNNNNSNNNNNSNYNNSSNNNNNNN-NNYRNGNNNN-NYNNNNYYNNNNSNNNSSNNN 902
Query: 520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
+ N S N +N ++ + + + + N N N + N +N N Y N +
Sbjct: 903 NSNNNSSNNNFNNNFNNNNNNNDNSNFN-RALPFNDFNNNNNNSNNNNFN-YNNNFN 957
Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
Identities = 47/200 (23%), Positives = 81/200 (40%)
Query: 401 ENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGT 460
+NS + EN N + N+N+ N + N + N + N ++ N + + N +
Sbjct: 758 DNSEINNENEKSINNENKFSNNNSGSSNDRNSINYR-NNSFNNNSNNTNNSGNSNFNN-S 815
Query: 461 HEYNGPKNENTNPRYENGTHEYNIPRLENSI--NGNGTSENRSNDNSYQNEIDGIDVWSV 518
+ NG N N N Y+N ++ N NS N N + N SN N+ N + +
Sbjct: 816 NSNNGYSNNNYNNNYKNNSNYNNSNNNNNSYYNNNNSNNNNNSNYNNSSNNNNNNNNNYR 875
Query: 519 LSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
N + N +N ++ S+ N+ N N + N +NDNS N
Sbjct: 876 NGNNNNNYNNNNYYNNNNSNNNSSNNNNSNNNSSNNNFNNNFNNNNNNNDNSNFNRALPF 935
Query: 579 DVWSVLSRNEPSKRNTILHN 598
+ ++ + N S N +N
Sbjct: 936 NDFN--NNNNNSNNNNFNYN 953
>UNIPROTKB|E1BZH8 [details] [associations]
symbol:SULF2 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005794
"Golgi apparatus" evidence=IEA] [GO:0003094 "glomerular filtration"
evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=IEA]
[GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005886
"plasma membrane" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
proteoglycan metabolic process" evidence=IEA] [GO:0032836
"glomerular basement membrane development" evidence=IEA]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=IEA] [GO:0048706 "embryonic skeletal system development"
evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
[GO:0060384 "innervation" evidence=IEA] [GO:0002063 "chondrocyte
development" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886 GO:GO:0005794
GO:GO:0009986 GO:GO:0005509 GO:GO:0010575 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0040037
GO:GO:0030201 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
OMA:PKYYGQG EMBL:AADN02019298 IPI:IPI00571119
ProteinModelPortal:E1BZH8 Ensembl:ENSGALT00000007309 Uniprot:E1BZH8
Length = 879
Score = 145 (56.1 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
Identities = 60/222 (27%), Positives = 90/222 (40%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEHGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ +H + C +I YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHNHNTYTNNENCSSPSWQAQHEIRTFAVYLNNTGYRTAFFGKY-LNEYNGSYVPP-- 155
Query: 177 GFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD--- 231
G++ +G + Y + +K G D RD Y TD+ T +++
Sbjct: 156 GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSRD----------YLTDLITNDSITFFR 205
Query: 232 IIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
I P+ + ++HAA H P Q + N +HI
Sbjct: 206 ISKKMYPHRPVLMVISHAAPHGPEDSAP-QYSHLFPNASQHI 246
Score = 65 (27.9 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
Identities = 46/231 (19%), Positives = 89/231 (38%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + +L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSHLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + L + L N+ I++ +D + P +E
Sbjct: 286 TLMSVDDSMEMIYNTLVETGELDNTYIIYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ + +++ D PT+L A DIP+ ++ ++I+ ++
Sbjct: 338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDSE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
R N H ++ + E G + K EN + E + PKY+
Sbjct: 394 --RPVNRFHLKKKVKVWRDSFLVERGKLLH--KRENEKVDAQEENFLPKYQ 440
Score = 43 (20.2 bits), Expect = 1.6e-07, Sum P(3) = 1.6e-07
Identities = 19/79 (24%), Positives = 33/79 (41%)
Query: 429 GTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
GT PKY NR + + N + EN Y+ + G + + + + ++ N
Sbjct: 489 GTSNLLPKYYNR---NSEDCNCE-ENEYKLS---HTGRRKKLFSKKKYKPSYARNRSTRS 541
Query: 489 NSINGNGTSENRSNDNSYQ 507
S+ NG N ++ YQ
Sbjct: 542 VSVELNGAMFNLGLEDGYQ 560
Score = 40 (19.1 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05
Identities = 13/56 (23%), Positives = 28/56 (50%)
Query: 387 NYVNSTVENIIPRYENSI--LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N VN+ +++ + + LR G + N PR N + ++G++E +++ R
Sbjct: 802 NAVNTLDRDVLNQLHVQLMELRSCKGYKQCN-PRTRNIDLGLKDGSYEQYRQFQRR 856
>DICTYBASE|DDB_G0275173 [details] [associations]
symbol:hbx2 "putative homeobox transcription factor"
species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0003700 "sequence-specific
DNA binding transcription factor activity" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0007275 "multicellular
organismal development" evidence=IEA] [GO:0006351 "transcription,
DNA-dependent" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
InterPro:IPR017970 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
SMART:SM00389 dictyBase:DDB_G0275173 GO:GO:0007275 GO:GO:0005634
GO:GO:0043565 GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
EMBL:AAFI02000013 Gene3D:1.10.10.60 SUPFAM:SSF46689 EMBL:AF036171
RefSeq:XP_643746.1 ProteinModelPortal:Q869W0
EnsemblProtists:DDB0185105 GeneID:8619790 KEGG:ddi:DDB_G0275173
eggNOG:NOG301813 InParanoid:Q869W0 OMA:HAPENIK
ProtClustDB:CLSZ2846877 Uniprot:Q869W0
Length = 942
Score = 158 (60.7 bits), Expect = 1.8e-07, P = 1.8e-07
Identities = 48/189 (25%), Positives = 77/189 (40%)
Query: 388 YVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH- 446
Y N+ N N I ++ +Y S N+N Y NG + YN N N +
Sbjct: 750 YFNNNNNNNNNNNNNRIS--DSSDDQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFNNNYM 807
Query: 447 -EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
YN Y N N + YN N N N + N + N +N+ N N ++ +N+ +
Sbjct: 808 NNYNNNYNNNNYNNNNSYN---NSNGNNNFNNNNNNNN----QNNNNNNNNNQYNNNNKN 860
Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHN-IDDEWQISALTRGKWKLVKENSINGNGTSEN 564
Y N I + E +RN++ ++ I + + K N+ N NG N
Sbjct: 861 YLNNIPSSKKHQLQGNYE--RRNSLPNSQIQNNFNGDNNNNNNNKNNNNNNQNNNGNGNN 918
Query: 565 RSNDNSYQN 573
+N+N+ N
Sbjct: 919 NNNNNNDNN 927
Score = 145 (56.1 bits), Expect = 4.4e-06, P = 4.4e-06
Identities = 54/228 (23%), Positives = 96/228 (42%)
Query: 353 SPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTH 412
+P+LES + +Q +++ + + NK+ +P+ S N N+ N +
Sbjct: 592 TPILESLNV--KQQNNINFFKNNNMDNNNKN-VPHLSLSNNNNNNNNNNNN--NNNNNNN 646
Query: 413 EYNSPRIENSNTRYENGTHEYNPKYENRYEN----GTHEYNP---KYENRYENGTHEYNG 465
N+ R N+N Y N + N NR +N G+ + + ++ N N ++ YN
Sbjct: 647 NNNNNRNRNNNNIYNNNNNNNNNNSNNRGKNFSDSGSSDSDSELNRHNNNNNNNSNNYNN 706
Query: 466 PKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
+ + N R N + YN N IN N + N + + E D + + N +
Sbjct: 707 GNSNSNNNRNNNNNYNYN-----NYINNNNYNNNNNRQHCDDEEEDEQYFNNNNNNNNNN 761
Query: 526 KRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
N I + DD++ S T +N NGN N +N+N++ N
Sbjct: 762 NNNRISDSSDDQY-FSDDTNNN----NDNYNNGNNNYNNNNNNNNFNN 804
Score = 139 (54.0 bits), Expect = 2.0e-05, P = 2.0e-05
Identities = 52/200 (26%), Positives = 94/200 (47%)
Query: 409 NGTHEYNSPRI-ENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
N + N+ RI ++S+ +Y + + N +N Y NG + YN N N + N
Sbjct: 756 NNNNNNNNNRISDSSDDQYFSD--DTNNNNDN-YNNGNNNYNNNNNNNNFNNNYMNNYNN 812
Query: 468 NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKR 527
N N N Y N + YN N+ N N + N++N+N+ N + + L+ SK+
Sbjct: 813 NYNNN-NYNNN-NSYNNSNGNNNFNNNNNNNNQNNNNNNNNNQYNNNNKNYLNNIPSSKK 870
Query: 528 NTILHNIDDEWQISALTRGKWKLVKENSING--NGTSENRSNDNSYQNEI-DGIDVWSVL 584
+ + N + ++L + +N+ NG N + N++N+N+ QN +G + +
Sbjct: 871 HQLQGNYERR---NSLPNSQI----QNNFNGDNNNNNNNKNNNNNNQNNNGNGNNNNNNN 923
Query: 585 SRNEPSKRNTILHNIDDEWQ 604
+ N KR H++DD+ Q
Sbjct: 924 NDNNIYKRR---HSMDDDCQ 940
Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
Identities = 40/177 (22%), Positives = 74/177 (41%)
Query: 380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHE--YNSPRIENSNTRYENGTHEYNPKY 437
++ SD Y + N Y N Y N + +N+ + N N Y N + N Y
Sbjct: 767 SDSSD-DQYFSDDTNNNNDNYNNGNNNYNNNNNNNNFNNNYMNNYNNNYNNNNYNNNNSY 825
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYEN-----------GTHEY--NI 484
N NG + +N N +N + N + N N Y N G +E ++
Sbjct: 826 NN--SNGNNNFNNNNNNNNQNNNNNNNNNQYNNNNKNYLNNIPSSKKHQLQGNYERRNSL 883
Query: 485 P--RLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 539
P +++N+ NG+ + N + +N+ N+ + + + + N + H++DD+ Q
Sbjct: 884 PNSQIQNNFNGDNNNNNNNKNNNNNNQNNNGNGNNNNNNNNDNNIYKRRHSMDDDCQ 940
>DICTYBASE|DDB_G0292550 [details] [associations]
symbol:DDB_G0292550 "protein kinase, CMGC group"
species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
activity, transferring phosphorus-containing groups" evidence=IEA]
[GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
"ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
kinase activity" evidence=IEA] [GO:0004672 "protein kinase
activity" evidence=IEA] [GO:0007049 "cell cycle" evidence=IEA]
[GO:0004693 "cyclin-dependent protein serine/threonine kinase
activity" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
[GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
activity" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000719 InterPro:IPR002290
InterPro:IPR008271 InterPro:IPR011009 InterPro:IPR017441
Pfam:PF00069 PROSITE:PS00107 PROSITE:PS00108 PROSITE:PS50011
SMART:SM00220 dictyBase:DDB_G0292550 GO:GO:0005524 eggNOG:COG0515
SUPFAM:SSF56112 GO:GO:0007049 EMBL:AAFI02000190 GO:GO:0004693
HSSP:P24941 RefSeq:XP_629621.1 ProteinModelPortal:Q54D75
EnsemblProtists:DDB0229424 GeneID:8628684 KEGG:ddi:DDB_G0292550
InParanoid:Q54D75 OMA:RRETSEY Uniprot:Q54D75
Length = 1397
Score = 159 (61.0 bits), Expect = 2.2e-07, P = 2.2e-07
Identities = 55/227 (24%), Positives = 86/227 (37%)
Query: 382 KSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTR-YENGTHEYNPKYENR 440
++ +PN++ V + + ILR + N+ + N+N Y N H N N
Sbjct: 379 QTSVPNHIYKEVYEVNQLLKQYILRLKQQKVNLNNNNLNNNNNNLYGNNNHNNNNNNNNN 438
Query: 441 YENGTHE--YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
N + YN N N H+ N N N N Y+N + N NS N N +
Sbjct: 439 NNNNNNNNNYNNNNNNHNNNYNHDNNNNNNYNNN-NYKNNNNSNNNFSFNNSNNNNNNNN 497
Query: 499 NRS----NDNSYQNEIDGIDVWSVLSRNEPSKRN-TILHNIDDEWQISALTRGKWKLVKE 553
N + N+N+ N + + ++ S N N N +D + V
Sbjct: 498 NNNRNNRNNNNNNNNNNNNNNYNNNSNNNSYNNNFNNGFNNNDNINDDNNNNNSYNNVNN 557
Query: 554 NSINGNGTSENRSND-NSYQNEIDGIDV-WSVLSRNEPSKRNTILHN 598
N+IN N + N N N+Y N + + + N S NT N
Sbjct: 558 NNINNNNNNNNGFNGFNNYGNNFNNSNNNGNQFGANNNSFNNTDFSN 604
Score = 137 (53.3 bits), Expect = 5.2e-05, P = 5.2e-05
Identities = 46/190 (24%), Positives = 72/190 (37%)
Query: 387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE-NGT 445
N N+ N N+ N H N N+N Y N ++ N N + N +
Sbjct: 430 NNNNNNNNNNNNNNNNNNYNNNNNNHNNNYNHDNNNNNNYNNNNYKNNNNSNNNFSFNNS 489
Query: 446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHE--YNIPRLENSINGNGTSENRSND 503
+ N N N + N N N N Y N ++ YN N NG ++N ++D
Sbjct: 490 NNNNNNNNNNNRNNRNNNNNNNNNNNNNNYNNNSNNNSYN----NNFNNGFNNNDNINDD 545
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSE 563
N+ N + ++ ++ + N + +N + + S G NS N S
Sbjct: 546 NNNNNSYNNVNNNNINNNNNNNNGFNGFNNYGNNFNNSN-NNGNQFGANNNSFNNTDFS- 603
Query: 564 NRSNDNSYQN 573
N SN SY N
Sbjct: 604 NDSNYGSYCN 613
Score = 129 (50.5 bits), Expect = 0.00037, P = 0.00037
Identities = 52/205 (25%), Positives = 75/205 (36%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N + NY + N Y N+ Y+N + N+ NSN N + N
Sbjct: 452 NNNHNNNYNHDNNNN--NNYNNN--NYKNNNNSNNNFSFNNSNNNNNNNNNNNRNNRNNN 507
Query: 441 YENGTHEYNPKYENRYENGTHE--YNGPKNENTNPRYENGTHE-YNIPRLENSINGNGTS 497
N + N Y N N ++ +N N N N +N + YN N+IN N +
Sbjct: 508 NNNNNNNNNNNYNNNSNNNSYNNNFNNGFNNNDNINDDNNNNNSYNNVN-NNNINNNNNN 566
Query: 498 ENRSND-NSYQNEIDGIDV-WSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
N N N+Y N + + + N S NT N D + + G L+ NS
Sbjct: 567 NNGFNGFNNYGNNFNNSNNNGNQFGANNNSFNNTDFSN-DSNY--GSYCNGLMDLINNNS 623
Query: 556 I--NGNGTSENRSNDNSYQNEIDGI 578
+ GN N S Q I I
Sbjct: 624 MYNGGNYYMNNASFHQRIQEHIQKI 648
>DICTYBASE|DDB_G0269922 [details] [associations]
symbol:xrn2 "CCHC-type zinc finger-containing
protein" species:44689 "Dictyostelium discoideum" [GO:0008270 "zinc
ion binding" evidence=IEA] [GO:0006139 "nucleobase-containing
compound metabolic process" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005622 "intracellular" evidence=IEA] [GO:0004534
"5'-3' exoribonuclease activity" evidence=IEA] [GO:0004527
"exonuclease activity" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001878 InterPro:IPR004859
InterPro:IPR017151 Pfam:PF03159 PIRSF:PIRSF037239 PROSITE:PS50158
SMART:SM00343 dictyBase:DDB_G0269922 GO:GO:0005634
EMBL:AAFI02000005 GenomeReviews:CM000150_GR GO:GO:0008270
GO:GO:0003676 Gene3D:4.10.60.10 eggNOG:COG5049 InterPro:IPR027073
PANTHER:PTHR12341 GO:GO:0004534 KO:K12619 RefSeq:XP_646407.1
STRING:Q55CS4 EnsemblProtists:DDB0237528 GeneID:8617364
KEGG:ddi:DDB_G0269922 InParanoid:Q55CS4 Uniprot:Q55CS4
Length = 1190
Score = 165 (63.1 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
Identities = 54/211 (25%), Positives = 88/211 (41%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYEN-GTHEYNPKYEN 439
N+ + NY N+ N N+ N + N+ N+N Y N + N Y+N
Sbjct: 919 NRFNNQNYNNNRYNNNNNNNNNN--NNNNNNNNNNNNNNNNNNNNYNNYNNYNNNNNYKN 976
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
N YN N Y N + YN N N Y NG + YN N+ NGNG + N
Sbjct: 977 NNYNNNGNYNGNNSNNYNNNNN-YNNSNYNNYNNSYNNGNN-YN----NNNNNGNGYNSN 1030
Query: 500 RSND--NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSIN 557
+N+ N+Y N + + ++ N + N N ++ + G + N N
Sbjct: 1031 YNNNYNNNYNNGNNNGNNYNNNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYD--NNNGFN 1088
Query: 558 GNGTSENRSNDN-SYQNEIDGIDVWSVLSRN 587
N + N +N+N SY + + ++ S++ N
Sbjct: 1089 NNNNNNNNNNNNNSYNYDFNNLNDPSLIDIN 1119
Score = 144 (55.7 bits), Expect = 4.1e-05, Sum P(2) = 4.1e-05
Identities = 48/196 (24%), Positives = 75/196 (38%)
Query: 407 YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGP 466
Y N YN R N N Y N + N N N + N N N + YN
Sbjct: 911 YNNKQIGYN--RFNNQN--YNNNRYNNNNNNNNNNNNNNNNNNNNNNNN-NNNNNNYNNY 965
Query: 467 KNENTNPRYENGTHEYNIPRLENSING--NGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
N N N Y+N + N N+ N N + N SN N+Y N + + ++ + N
Sbjct: 966 NNYNNNNNYKNNNYNNNGNYNGNNSNNYNNNNNYNNSNYNNYNNSYNNGNNYNNNNNNGN 1025
Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKENSIN-GNGTSENRSNDNSYQNE-IDGIDVWS 582
+ +N ++ + + N+ N GN N +N+N+Y N G D +
Sbjct: 1026 GYNSNYNNNYNNNYNNGNNNGNNYNNNYNNNYNNGNNNGFNNNNNNNYNNNNYGGYDNNN 1085
Query: 583 VLSRNEPSKRNTILHN 598
+ N + N +N
Sbjct: 1086 GFNNNNNNNNNNNNNN 1101
Score = 44 (20.5 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
Identities = 14/53 (26%), Positives = 24/53 (45%)
Query: 23 QYLKELGYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDV 75
QY++E + + + + P +F+D VA S ++ L D W DV
Sbjct: 149 QYMEEKKNKFKFDSNCITP-----GTLFMDRVAESLRTYVAEKLTTDPAWKDV 196
>UNIPROTKB|F1SBF1 [details] [associations]
symbol:LOC100739059 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00400000022041 OMA:TENDPAN EMBL:FP339597
RefSeq:XP_003484028.1 Ensembl:ENSSSCT00000008161 GeneID:100739169
KEGG:ssc:100739169 Uniprot:F1SBF1
Length = 527
Score = 141 (54.7 bits), Expect = 3.0e-07, Sum P(2) = 3.0e-07
Identities = 61/225 (27%), Positives = 89/225 (39%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G++ +G + Y + +K G D +D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202
Query: 232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246
Score = 59 (25.8 bits), Expect = 3.0e-07, Sum P(2) = 3.0e-07
Identities = 48/233 (20%), Positives = 89/233 (38%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + L + L N+ IV+ +D + P +E
Sbjct: 286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ + +++ D PT+L A DIP+ ++ ++I+ +
Sbjct: 338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDTE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGT--HEYNP-KYENRYENGTHEYNPKYE 453
R N H R+ + E G H+ + K + + EN + PKY+
Sbjct: 394 --RPANRFHLKKKMRVWRDSFLVERGKLLHKRDSDKVDAQEEN----FLPKYQ 440
>DICTYBASE|DDB_G0273013 [details] [associations]
symbol:uglB "uracil glycosylase" species:44689
"Dictyostelium discoideum" [GO:0006284 "base-excision repair"
evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA] [GO:0004844
"uracil DNA N-glycosylase activity" evidence=IEA]
InterPro:IPR002043 dictyBase:DDB_G0273013 HAMAP:MF_00148
Pfam:PF03167 GO:GO:0005737 GO:GO:0006284 GenomeReviews:CM000151_GR
EMBL:AAFI02000008 GO:GO:0004844 Gene3D:3.40.470.10
InterPro:IPR005122 SMART:SM00986 SUPFAM:SSF52141 eggNOG:COG0692
KO:K03648 PANTHER:PTHR11264 TIGRFAMs:TIGR00628
RefSeq:XP_001134629.1 ProteinModelPortal:Q1ZXM2
EnsemblProtists:DDB0232990 GeneID:8618639 KEGG:ddi:DDB_G0273013
InParanoid:Q1ZXM2 OMA:FININEP Uniprot:Q1ZXM2
Length = 597
Score = 151 (58.2 bits), Expect = 3.9e-07, Sum P(2) = 3.9e-07
Identities = 48/192 (25%), Positives = 76/192 (39%)
Query: 391 STVENIIPRYENSILRYENGTHEYNSPRIENSNTR------YENGTHEYNPKYENRYENG 444
ST NI+ + + S + ++ + N+ I N N Y N + N N N
Sbjct: 405 STANNILIQSQQSPIDWDLDNIDCNNNNINNKNKNLNLNVDYNNNNNNNNNNNNNNNNNN 464
Query: 445 THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
+ N N N + N N N N N + N N+IN N T+ N+SNDN
Sbjct: 465 NNNNNNNNNNNNNNNNNNNNNNNNNNINNNNHNNNNNNNNNNTNNNINNN-TNNNKSNDN 523
Query: 505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
+ N + + + S N +K N N + ++ + N+ N N ++ N
Sbjct: 524 NNHNNTNNNNT-NNNSNNIDNKENNNEENENVDFNNNNNNNNN-NNNNNNNNNINNSNNN 581
Query: 565 RSNDNSYQNEID 576
+ND S ID
Sbjct: 582 TNNDKSNSKSID 593
Score = 122 (48.0 bits), Expect = 0.00049, Sum P(2) = 0.00049
Identities = 41/160 (25%), Positives = 65/160 (40%)
Query: 384 DIPNYVNSTVENIIPRYENSILR--YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
D+ N ++ NI + +N L Y N + N+ N+N N + N N
Sbjct: 422 DLDN-IDCNNNNINNKNKNLNLNVDYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 480
Query: 442 ENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH--EYNIPRLENSINGNGTSEN 499
N + N N N + N N NTN N T+ + N N+ N N T+ N
Sbjct: 481 NNNNNNNNNNINNNNHNNNNNNN---NNNTNNNINNNTNNNKSNDNNNHNNTNNNNTNNN 537
Query: 500 RSN-DNSYQN--EIDGIDVWSVLSRNEPSKRNTILHNIDD 536
+N DN N E + +D + + N + N +NI++
Sbjct: 538 SNNIDNKENNNEENENVDFNNNNNNNNNNNNNNNNNNINN 577
Score = 49 (22.3 bits), Expect = 3.9e-07, Sum P(2) = 3.9e-07
Identities = 18/71 (25%), Positives = 33/71 (46%)
Query: 98 LKNYYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTR 157
L N +T + + I+ K +H ++N + E SE I L E G+R+R
Sbjct: 144 LFNGFTSPIDQNINNNIINNKR-LHLNNENNFININEPTNENFSENIKKSLL-EKGWRSR 201
Query: 158 IVGKWHLGFYK 168
+ G++ ++K
Sbjct: 202 LQGEFEKDYFK 212
>ZFIN|ZDB-GENE-061215-37 [details] [associations]
symbol:ids "iduronate 2-sulfatase" species:7955
"Danio rerio" [GO:0008152 "metabolic process" evidence=IEA]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0030512
"negative regulation of transforming growth factor beta receptor
signaling pathway" evidence=IMP] [GO:0005737 "cytoplasm"
evidence=IDA] [GO:0009790 "embryo development" evidence=IMP]
[GO:0060536 "cartilage morphogenesis" evidence=IMP] [GO:0004423
"iduronate-2-sulfatase activity" evidence=IDA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
ZFIN:ZDB-GENE-061215-37 GO:GO:0005737 GO:GO:0009790
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00149 GO:GO:0030512 GO:GO:0060536 OMA:CREGKNL
GO:GO:0004423 GeneTree:ENSGT00640000091539 EMBL:CR774199
IPI:IPI00495228 Ensembl:ENSDART00000106205 Bgee:F1R4Q5
Uniprot:F1R4Q5
Length = 561
Score = 135 (52.6 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
Identities = 38/113 (33%), Positives = 57/113 (50%)
Query: 55 ASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSA 113
A S ++++++ADDL +G + + +PNID LA ++ N Y Q +C PSR +
Sbjct: 26 AKSKDFNVLYLIADDLR-PSLGCYSDPVVKSPNIDQLASLSVVFHNAYAQQAVCGPSRVS 84
Query: 114 IMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGK-WHLG 165
+T + P T + Y G + LPQY K GY T VGK +H G
Sbjct: 85 FLTSRRPDTTKLYDFNSYWRVHAG---NYTTLPQYFKSNGYTTLSVGKVFHPG 134
Score = 69 (29.3 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
Identities = 19/60 (31%), Positives = 32/60 (53%)
Query: 256 PYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
PY P+ D L I +H FA++ + +D VGK+++ L+ + N+I+V SD
Sbjct: 278 PYGPIPK-DFQLRIRQHY-------FASVSY-VDAQVGKILQTLDDVGLAKNTIVVLSSD 328
Score = 44 (20.5 bits), Expect = 0.00012, Sum P(3) = 0.00012
Identities = 12/46 (26%), Positives = 22/46 (47%)
Query: 226 TAEAVDIIHN-HSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIH 270
T EA+ ++ + + +P FL A P+ P + P YL ++
Sbjct: 196 TEEAIRLLRSMKGSQKPFFL-----AVGFYKPHIPFRIPQEYLKLY 236
Score = 38 (18.4 bits), Expect = 4.1e-07, Sum P(3) = 4.1e-07
Identities = 11/41 (26%), Positives = 18/41 (43%)
Query: 775 CEPQIAPC----LFDIKNDPCEKNNLADR-SEVQRINHYTT 810
C+P + L+ + DP + NNL D +N + T
Sbjct: 496 CKPNMTEIHAGELYILTEDPGQDNNLFDEFGHAALLNKFGT 536
Score = 37 (18.1 bits), Expect = 5.2e-07, Sum P(3) = 5.2e-07
Identities = 9/28 (32%), Positives = 14/28 (50%)
Query: 669 CEPQIAPC----LFDIKNDPCEKNNLAD 692
C+P + L+ + DP + NNL D
Sbjct: 496 CKPNMTEIHAGELYILTEDPGQDNNLFD 523
>UNIPROTKB|Q1LZH9 [details] [associations]
symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9913
"Bos taurus" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0030203
"glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10
SUPFAM:SSF53649 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
PROSITE:PS00149 GO:GO:0030203 EMBL:BC115990 IPI:IPI00703612
RefSeq:NP_001069030.1 UniGene:Bt.20235 ProteinModelPortal:Q1LZH9
STRING:Q1LZH9 PRIDE:Q1LZH9 GeneID:512444 KEGG:bta:512444 CTD:2799
HOGENOM:HOG000169239 HOVERGEN:HBG005840 InParanoid:Q1LZH9 KO:K01137
OrthoDB:EOG4NGGMF NextBio:20870390 GO:GO:0008449
PANTHER:PTHR10342:SF5 Uniprot:Q1LZH9
Length = 560
Score = 147 (56.8 bits), Expect = 4.3e-07, Sum P(2) = 4.3e-07
Identities = 63/228 (27%), Positives = 97/228 (42%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSA 113
SS P+++ +LADD D G+ P AL G+ + Y LC PSR++
Sbjct: 51 SSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRAS 105
Query: 114 IMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKK 169
I+TGK+P + + +N L G C + + E P L+ + GY+T GK Y
Sbjct: 106 ILTGKYPHNLHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK-----YLN 160
Query: 170 EYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
EY G H LG YW + + + + G + + D Y TDV
Sbjct: 161 EYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDVL 216
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
++D + S EP F+ ++ A HS P A Y N +++
Sbjct: 217 ANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNAFQNV 259
Score = 52 (23.4 bits), Expect = 4.3e-07, Sum P(2) = 4.3e-07
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 298 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 334
Score = 41 (19.5 bits), Expect = 5.8e-06, Sum P(2) = 5.8e-06
Identities = 27/105 (25%), Positives = 42/105 (40%)
Query: 473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI-- 530
P+Y+N PR +N N +GT+++ W + P ++I
Sbjct: 250 PQYQNAFQNVFAPRNKN-FNIHGTNKH----------------WLIRQAKTPMTNSSIQF 292
Query: 531 LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDNSY 571
L N WQ + ++ KLVK NG N T ++DN Y
Sbjct: 293 LDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDNGY 337
Score = 40 (19.1 bits), Expect = 7.3e-06, Sum P(2) = 7.3e-06
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 450 PKYENRYENGTHEYNGPKNENTN 472
P+Y+N ++N P+N+N N
Sbjct: 250 PQYQNAFQN----VFAPRNKNFN 268
>DICTYBASE|DDB_G0277905 [details] [associations]
symbol:snfA "AMP-activated protein kinase alpha
subunit" species:44689 "Dictyostelium discoideum" [GO:0046956
"positive phototaxis" evidence=IMP] [GO:0008283 "cell
proliferation" evidence=IMP] [GO:0007005 "mitochondrion
organization" evidence=IMP] [GO:0006754 "ATP biosynthetic process"
evidence=IMP] [GO:0016772 "transferase activity, transferring
phosphorus-containing groups" evidence=IEA] [GO:0006468 "protein
phosphorylation" evidence=IEA;ISS] [GO:0005524 "ATP binding"
evidence=IEA] [GO:0004674 "protein serine/threonine kinase
activity" evidence=IEA] [GO:0004672 "protein kinase activity"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
[GO:0007165 "signal transduction" evidence=ISS] [GO:0004679
"AMP-activated protein kinase activity" evidence=ISS] [GO:0016740
"transferase activity" evidence=IEA] [GO:0016310 "phosphorylation"
evidence=IEA] [GO:0016301 "kinase activity" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000719
InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR011009
InterPro:IPR017441 Pfam:PF00069 PROSITE:PS00107 PROSITE:PS00108
PROSITE:PS50011 SMART:SM00220 dictyBase:DDB_G0277905
InterPro:IPR001772 GO:GO:0005524 GO:GO:0007165
GenomeReviews:CM000152_GR eggNOG:COG0515 GO:GO:0008283
EMBL:AAFI02000023 SUPFAM:SSF56112 GO:GO:0004679 GO:GO:0006754
HSSP:P06782 GO:GO:0007005 EMBL:AF118151 RefSeq:XP_642250.1
ProteinModelPortal:Q54YF2 SMR:Q54YF2 STRING:Q54YF2
EnsemblProtists:DDB0215396 GeneID:8621459 KEGG:ddi:DDB_G0277905
OMA:KREANSI GO:GO:0046956 Pfam:PF02149 PROSITE:PS50032
Uniprot:Q54YF2
Length = 727
Score = 153 (58.9 bits), Expect = 4.3e-07, P = 4.3e-07
Identities = 52/208 (25%), Positives = 79/208 (37%)
Query: 375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
T + +N + I N N+ N N+ N N+ I N+N N + N
Sbjct: 386 TGFNPSNSNSISNNNNNNNNNNNNTTNNNNNTTNNNNSIINNNNINNNNINNNNNNNNNN 445
Query: 435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNI-PRLENSING 493
N N + N N N + N N N N GT ++I P L NS N
Sbjct: 446 INNNNIINNNNNNNN----NNNNNNNNNNNNNNNNNNNSSISGGTEVFSISPNLNNSYNS 501
Query: 494 NGT-SENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
N + + N SN N+ N D + + N + N +N ++ + + L
Sbjct: 502 NSSGNSNGSNSNNNSNNNTNNDNNNNNNNNNNNNNNNNNNNNNNNNNNNCIDSVNNSLNN 561
Query: 553 ENSING---NGTSENRSNDNSYQNEIDG 577
EN +N N + N S+D S N +G
Sbjct: 562 ENDVNNSNINNNNNNNSDDGSNNNSYEG 589
Score = 137 (53.3 bits), Expect = 2.3e-05, P = 2.3e-05
Identities = 50/222 (22%), Positives = 87/222 (39%)
Query: 378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
S N+ + PN V+ I+ + S + + T +N P NS + N + N
Sbjct: 353 SYENEINSPNLVSPITTPIMSSAQKSPIMFTTTTG-FN-PSNSNSISNNNNNNNNNNNNT 410
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
N N T+ N N N + N N N N N + N N+ N N +
Sbjct: 411 TNNNNNTTNNNNSIINNNNINNNNINNNNNNNNNNINNNNIINNNNNNNNNNNNNNNNNN 470
Query: 498 ENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
N +N+N+ + G +V+S+ N N+ ++ ++ N+
Sbjct: 471 NNNNNNNNNSSISGGTEVFSISPNLNNSYNSNSSGNSNGSNSNNNSNNNTNNDNNNNNNN 530
Query: 557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
N N + N +N+N+ N + ID + NE N+ ++N
Sbjct: 531 NNNNNNNNNNNNNNNNNNNNCIDSVNNSLNNENDVNNSNINN 572
>UNIPROTKB|Q32KH1 [details] [associations]
symbol:sulf2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0060384 "innervation" evidence=IEA]
[GO:0060348 "bone development" evidence=IEA] [GO:0048706 "embryonic
skeletal system development" evidence=IEA] [GO:0040037 "negative
regulation of fibroblast growth factor receptor signaling pathway"
evidence=IEA] [GO:0035860 "glial cell-derived neurotrophic factor
receptor signaling pathway" evidence=IEA] [GO:0032836 "glomerular
basement membrane development" evidence=IEA] [GO:0030201 "heparan
sulfate proteoglycan metabolic process" evidence=IEA] [GO:0030177
"positive regulation of Wnt receptor signaling pathway"
evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
evidence=IEA] [GO:0010575 "positive regulation vascular endothelial
growth factor production" evidence=IEA] [GO:0009986 "cell surface"
evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
[GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
[GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
GO:GO:0003094 eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 GO:GO:0060384
GO:GO:0030201 HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
CTD:55959 OMA:PKYYGQG OrthoDB:EOG49KFPX EMBL:AAEX03013985
EMBL:AAEX03013986 EMBL:BN000766 RefSeq:NP_001041555.1
UniGene:Cfa.6393 STRING:Q32KH1 Ensembl:ENSCAFT00000017345
GeneID:477254 KEGG:cfa:477254 InParanoid:Q32KH1 NextBio:20852774
Uniprot:Q32KH1
Length = 869
Score = 142 (55.0 bits), Expect = 4.6e-07, Sum P(2) = 4.6e-07
Identities = 61/225 (27%), Positives = 89/225 (39%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G++ +G + Y + +K G D +D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202
Query: 232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSGLFPNASQHI 246
Score = 62 (26.9 bits), Expect = 4.6e-07, Sum P(2) = 4.6e-07
Identities = 40/188 (21%), Positives = 75/188 (39%)
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX 328
IH + + K L +D+S+ + L + L N+ IV+ +D
Sbjct: 271 IHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKG 330
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNY 388
+ P +E +R + P +E+ + +++ D PT+L A DIP+
Sbjct: 331 KSMP--------YEFDIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSD 380
Query: 389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGT 445
++ ++I+ + R N H R+ + E G H+ N K + + EN
Sbjct: 381 MDG--KSILKLLDTE--RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN-- 434
Query: 446 HEYNPKYE 453
+ PKY+
Sbjct: 435 --FLPKYQ 440
Score = 44 (20.5 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
Identities = 10/42 (23%), Positives = 15/42 (35%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYENRYENGTH---EYNGP 466
+G + P+Y + N + P Y H Y GP
Sbjct: 226 HGPEDSAPQYSGLFPNASQHITPSYNYAPNPDKHWIMRYTGP 267
>UNIPROTKB|E9PJL8 [details] [associations]
symbol:SULF1 "Extracellular sulfatase Sulf-1" species:9606
"Homo sapiens" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] [GO:0002063 "chondrocyte development" evidence=IEA]
[GO:0003094 "glomerular filtration" evidence=IEA] [GO:0005886
"plasma membrane" evidence=IEA] [GO:0010575 "positive regulation
vascular endothelial growth factor production" evidence=IEA]
[GO:0014846 "esophagus smooth muscle contraction" evidence=IEA]
[GO:0030201 "heparan sulfate proteoglycan metabolic process"
evidence=IEA] [GO:0032836 "glomerular basement membrane
development" evidence=IEA] [GO:0035860 "glial cell-derived
neurotrophic factor receptor signaling pathway" evidence=IEA]
[GO:0040037 "negative regulation of fibroblast growth factor
receptor signaling pathway" evidence=IEA] [GO:0048706 "embryonic
skeletal system development" evidence=IEA] [GO:0060348 "bone
development" evidence=IEA] [GO:0060384 "innervation" evidence=IEA]
[GO:0060686 "negative regulation of prostatic bud formation"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005886 GO:GO:0005794
GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GO:GO:0048706 GO:GO:0060686 GO:GO:0002063 GO:GO:0040037
GO:GO:0032836 GO:GO:0060384 GO:GO:0030201 EMBL:AC091047
GO:GO:0014846 GO:GO:0035860 HGNC:HGNC:20391 ChiTaRS:SULF1
EMBL:AC013746 EMBL:AC022790 IPI:IPI00978157
ProteinModelPortal:E9PJL8 SMR:E9PJL8 Ensembl:ENST00000525999
ArrayExpress:E9PJL8 Bgee:E9PJL8 Uniprot:E9PJL8
Length = 172
Score = 127 (49.8 bits), Expect = 5.0e-07, P = 5.0e-07
Identities = 43/130 (33%), Positives = 59/130 (45%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV L Q+ + + G N + T +C PSRS+++TGK
Sbjct: 43 PNIILVLTDD---QDVELGSL-QVMNKTRKIMEHGGATFINAFVTTPMCCPSRSSMLTGK 98
Query: 119 HPIHTGMQHNVLYGCERGGLP----LSE-KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HNV E P + E + YL GYRT GK+ L Y Y P
Sbjct: 99 Y-VHN---HNVYTNNENCSSPSWQAMHEPRTFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 153
Query: 174 TFRGFESHLG 183
G+ LG
Sbjct: 154 P--GWREWLG 161
>WB|WBGene00006308 [details] [associations]
symbol:sul-1 species:6239 "Caenorhabditis elegans"
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008484 "sulfuric
ester hydrolase activity" evidence=IEA] [GO:0003824 "catalytic
activity" evidence=IEA] [GO:0016021 "integral to membrane"
evidence=IEA] [GO:0015015 "heparan sulfate proteoglycan
biosynthetic process, enzymatic modification" evidence=IMP]
[GO:0017095 "heparan sulfate 6-O-sulfotransferase activity"
evidence=IMP] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783 GO:GO:0009986
GO:GO:0046872 GO:GO:0005795 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0015015
GO:GO:0017095 EMBL:FO081118 PIR:T16584 RefSeq:NP_508560.1
ProteinModelPortal:Q21376 SMR:Q21376 STRING:Q21376
EnsemblMetazoa:K09C4.8 GeneID:180619 KEGG:cel:CELE_K09C4.8
UCSC:K09C4.8 CTD:180619 WormBase:K09C4.8 HOGENOM:HOG000290161
InParanoid:Q21376 KO:K14607 OMA:TVEDRWR NextBio:910136
Uniprot:Q21376
Length = 709
Score = 148 (57.2 bits), Expect = 5.2e-07, Sum P(2) = 5.2e-07
Identities = 65/233 (27%), Positives = 104/233 (44%)
Query: 37 FAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGI 96
F ++P+ T S+ FVD ++I IL DD D+ +D +P + + G
Sbjct: 18 FLIIPIKVT-SIHFVD-----SQHNVILILTDD---QDIELGSMDFMPKTS-QIMKERGT 67
Query: 97 -ILKNYYTVQLCTPSRSAIMTG----KHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKE 151
Y T +C PSRS I+TG H +HT Q+ G E + +K + YL+E
Sbjct: 68 EFTSGYVTTPICCPSRSTILTGLYVHNHHVHTNNQNCT--GVEWRKVH-EKKSIGVYLQE 124
Query: 152 LGYRTRIVGKWHLGFYKKEYTPTFRGFES-H-LGYWTGHQDYFDHSAEEMKMWGLDMRRD 209
GYRT +GK+ L Y Y P G++ H + + +Y +S E + +G + +D
Sbjct: 125 AGYRTAYLGKY-LNEYDGSYIPP--GWDEWHAIVKNSKFYNYTMNSNGEREKFGSEYEKD 181
Query: 210 LEPAWDLHGKYSTDVFTAEAVDIIHNH---STDEPLFLYLAHAATHSANPYEP 259
Y TD+ T ++ I H +P L +++ A H P +P
Sbjct: 182 ----------YFTDLVTNRSLKFIDKHIKIRAWQPFALIISYPAPHG--PEDP 222
Score = 53 (23.7 bits), Expect = 5.2e-07, Sum P(2) = 5.2e-07
Identities = 26/121 (21%), Positives = 45/121 (37%)
Query: 260 LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXX 319
LQ ++H D + L +DE + ++ L + L N+ ++ SD
Sbjct: 253 LQRTGKMNDVHISFTDLLHRRRLQTLQSVDEGIERLFNLLRELNQLWNTYAIYTSDHGYH 312
Query: 320 XXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSA 379
L+G KN +E +R + P + R + + V D PT+L
Sbjct: 313 LGQFGL-------LKG-KNMPYEFDIRVPFFMRGPGIP-RNVTFNEIVTNVDIAPTMLHI 363
Query: 380 A 380
A
Sbjct: 364 A 364
>UNIPROTKB|F1MXZ0 [details] [associations]
symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9913
"Bos taurus" [GO:0030203 "glycosaminoglycan metabolic process"
evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GeneTree:ENSGT00400000022041 GO:GO:0030203 IPI:IPI00703612
UniGene:Bt.20235 GO:GO:0008449 PANTHER:PTHR10342:SF5 OMA:MCGYQTF
EMBL:DAAA02013337 ProteinModelPortal:F1MXZ0
Ensembl:ENSBTAT00000023218 ArrayExpress:F1MXZ0 Uniprot:F1MXZ0
Length = 560
Score = 146 (56.5 bits), Expect = 5.5e-07, Sum P(2) = 5.5e-07
Identities = 63/228 (27%), Positives = 97/228 (42%)
Query: 56 SSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSA 113
SS P+++ +LADD D G+ P AL G+ + Y LC PSR++
Sbjct: 51 SSRRPNVVLLLADD---QDEVLGGMT--PLKKTKALIGEMGMTFSSAYVPSALCCPSRAS 105
Query: 114 IMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKK 169
I+TGK+P + + +N L G C + + E P L+ + GY+T GK Y
Sbjct: 106 ILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQTFFAGK-----YLN 160
Query: 170 EYTPTFRGFESH--LG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVF 225
EY G H LG YW + + + + G + + D Y TDV
Sbjct: 161 EYGAPDAGGLGHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVD----YLTDVL 216
Query: 226 TAEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
++D + S EP F+ ++ A HS P A Y N +++
Sbjct: 217 ANVSLDFLDYKSNSEPFFMMISTPAPHS-----PWTAAPQYQNAFQNV 259
Score = 52 (23.4 bits), Expect = 5.5e-07, Sum P(2) = 5.5e-07
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 298 RKRWQTLL-SVDDLVEKLVKRLEFNGELNNTYIFYTSD 334
Score = 41 (19.5 bits), Expect = 7.4e-06, Sum P(2) = 7.4e-06
Identities = 27/105 (25%), Positives = 42/105 (40%)
Query: 473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI-- 530
P+Y+N PR +N N +GT+++ W + P ++I
Sbjct: 250 PQYQNAFQNVFAPRNKN-FNIHGTNKH----------------WLIRQAKTPMTNSSIQF 292
Query: 531 LHN-IDDEWQ-ISALTRGKWKLVKENSING--NGTSENRSNDNSY 571
L N WQ + ++ KLVK NG N T ++DN Y
Sbjct: 293 LDNAFRKRWQTLLSVDDLVEKLVKRLEFNGELNNTYIFYTSDNGY 337
Score = 40 (19.1 bits), Expect = 9.4e-06, Sum P(2) = 9.4e-06
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 450 PKYENRYENGTHEYNGPKNENTN 472
P+Y+N ++N P+N+N N
Sbjct: 250 PQYQNAFQN----VFAPRNKNFN 268
>MGI|MGI:1922862 [details] [associations]
symbol:Gns "glucosamine (N-acetyl)-6-sulfatase"
species:10090 "Mus musculus" [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0005539 "glycosaminoglycan binding" evidence=ISO]
[GO:0005764 "lysosome" evidence=ISO] [GO:0008152 "metabolic
process" evidence=IEA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=ISO] [GO:0008484 "sulfuric ester hydrolase
activity" evidence=ISO] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0030203 "glycosaminoglycan metabolic process"
evidence=IEA] [GO:0042340 "keratan sulfate catabolic process"
evidence=ISO] [GO:0043199 "sulfate binding" evidence=ISO]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666 MGI:MGI:1922862
GO:GO:0046872 GO:GO:0005764 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0042340
GO:GO:0043199 GO:GO:0005539 CTD:2799 HOGENOM:HOG000169239
HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF GO:GO:0008449
PANTHER:PTHR10342:SF5 ChiTaRS:GNS EMBL:AK030773 EMBL:AK049162
EMBL:AK054046 EMBL:AK083597 EMBL:AK159562 EMBL:AK169485
EMBL:AK165180 EMBL:AK170791 EMBL:BC055328 IPI:IPI00221426
RefSeq:NP_083640.1 UniGene:Mm.207683 ProteinModelPortal:Q8BFR4
SMR:Q8BFR4 STRING:Q8BFR4 PhosphoSite:Q8BFR4 PaxDb:Q8BFR4
PRIDE:Q8BFR4 Ensembl:ENSMUST00000040344 GeneID:75612 KEGG:mmu:75612
UCSC:uc007hfo.1 InParanoid:Q8BFR4 OMA:MCGYQTF NextBio:343508
Bgee:Q8BFR4 CleanEx:MM_GNS Genevestigator:Q8BFR4 Uniprot:Q8BFR4
Length = 544
Score = 150 (57.9 bits), Expect = 5.9e-07, P = 5.9e-07
Identities = 66/235 (28%), Positives = 101/235 (42%)
Query: 29 GYRTRIMAFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNI 88
G R+ A +LPL + LV ++ P+++ +L DD D G+ P
Sbjct: 12 GRPRRLPALLLLPLLGGC----LGLVGAARRPNVLLLLTDD---QDAELGGMT--PLKKT 62
Query: 89 DAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSEKI 144
AL G+ + Y LC PSR++I+TGK+P + + +N L G C + + E
Sbjct: 63 KALIGEKGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKAWQKIQEPY 122
Query: 145 -LPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEE 198
P LK + GY+T GK Y EY P G E LG YW + +
Sbjct: 123 TFPAILKSVCGYQTFFAGK-----YLNEYGAPDAGGLEHIPLGWSYWYALEKNSKYYNYT 177
Query: 199 MKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
+ + G + + D Y TDV ++D + S EP F+ ++ A HS
Sbjct: 178 LSINGKARKHGENYSVD----YLTDVLANLSLDFLDYKSNSEPFFMMISTPAPHS 228
>TIGR_CMR|SPO_3593 [details] [associations]
symbol:SPO_3593 "sulfatase family protein" species:246200
"Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic process"
evidence=ISS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0008484 HOGENOM:HOG000230030 ProtClustDB:CLSK867183
RefSeq:YP_168788.1 ProteinModelPortal:Q5LMH0 GeneID:3195684
KEGG:sil:SPO3593 PATRIC:23380663 OMA:MNILFIM Uniprot:Q5LMH0
Length = 552
Score = 121 (47.7 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
Identities = 32/107 (29%), Positives = 55/107 (51%)
Query: 61 HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKH 119
+I+FI+ D L W+ + +G + TP+ID LA G+ + Y +C SR + TG++
Sbjct: 2 NILFIMFDQLRWDYLSCYGHKTLNTPHIDRLAAKGVRFDRAYIQSPICGSSRMSTYTGRY 61
Query: 120 PIHTGMQHNVLYGCERGGLPLS--EKILPQYLKELGYRTRIVGKWHL 164
+H+ +G G+PL E + +L+ G +VGK H+
Sbjct: 62 -VHS-------HGASWNGIPLKVGEMTMGDHLRAAGMGCWLVGKTHM 100
Score = 69 (29.3 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
Identities = 28/131 (21%), Positives = 54/131 (41%)
Query: 271 RHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSN 330
+ + D + ++ + D+ +G++ + LE + +++IV SD +
Sbjct: 282 QEVRDAVIPAYMGLIKQADDQMGRLFKWLEDTGRMQDTMIVLTSDHGDFLG-------DH 334
Query: 331 WPLRGVKNTLWEGGVRGAGLIWSPLLES---RGIVAEQYVHVSDWLPTLLSAANKSDIPN 387
W G K + R +I+ P E+ RG V + V D PT + AA +
Sbjct: 335 W--MGEKTFFHDASTRVPLIIYDPRPEADATRGSVCDALVESIDLAPTFVEAAGGKPAMH 392
Query: 388 YVNSTVENIIP 398
+ E++IP
Sbjct: 393 ILEG--ESLIP 401
Score = 51 (23.0 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 665 KEVPCEPQIAPCLFDIKNDPCE 686
K + E P LFD+KNDP E
Sbjct: 448 KLIHFEADPRPMLFDLKNDPQE 469
Score = 51 (23.0 bits), Expect = 6.0e-07, Sum P(3) = 6.0e-07
Identities = 11/22 (50%), Positives = 13/22 (59%)
Query: 771 KEVPCEPQIAPCLFDIKNDPCE 792
K + E P LFD+KNDP E
Sbjct: 448 KLIHFEADPRPMLFDLKNDPQE 469
>DICTYBASE|DDB_G0286855 [details] [associations]
symbol:gtaD "GATA zinc finger domain-containing
protein 4" species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0003700 "sequence-specific DNA
binding transcription factor activity" evidence=IEA] [GO:0046872
"metal ion binding" evidence=IEA] InterPro:IPR000679
PROSITE:PS00344 PROSITE:PS50114 dictyBase:DDB_G0286855
GenomeReviews:CM000153_GR GO:GO:0046872 GO:GO:0043565 GO:GO:0008270
GO:GO:0003700 EMBL:AAFI02000090 RefSeq:XP_637531.1 PRIDE:Q54L72
EnsemblProtists:DDB0220503 GeneID:8625829 KEGG:ddi:DDB_G0286855
eggNOG:NOG258313 Uniprot:Q54L72
Length = 530
Score = 149 (57.5 bits), Expect = 7.3e-07, P = 7.3e-07
Identities = 46/185 (24%), Positives = 75/185 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N ++ N NS N Y N+ + + G N+ N+N + N + N Y N
Sbjct: 267 NNNNNNNNNNSNNNNNNNNYFNNNKKNKIGDCNSNNSN-NNNNNNHNNNNNNNNYNYNNN 325
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS-EN 499
N + N N N + N N N N N + NI N+ N N + N
Sbjct: 326 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNKNNNNINNNNNNNNNNNNNINN 385
Query: 500 RSNDNSYQNEIDGIDVWSVLS-RNEPSKRNTILHNIDDE--WQISALTRGKWKLVKENSI 556
+N+NS N I+ + ++ + N N++ +N + W+ S+ L+KE S+
Sbjct: 386 NNNNNSINNIINNNNNFNNNNINNNLFNNNSMNYNKKENYNWESSSSEEDNNNLIKEQSV 445
Query: 557 NGNGT 561
N T
Sbjct: 446 KKNET 450
Score = 145 (56.1 bits), Expect = 2.0e-06, P = 2.0e-06
Identities = 46/200 (23%), Positives = 76/200 (38%)
Query: 374 PTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEY 433
PT++S+ + N N+ N N+ N + N+ N+N Y N +
Sbjct: 236 PTIISSNSPLKTRNKNNNN--NYNNNNNNNNNNNNNNNNNNNNSNNNNNNNNYFNNNKK- 292
Query: 434 NPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSING 493
N + N + N + N N + YN N N N N + N N+ N
Sbjct: 293 NKIGDCNSNNSNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 352
Query: 494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
N + N SN+N N I+ + N + N ++N ++ I+ +
Sbjct: 353 NNNNNNNSNNNKNNNNINN-------NNNNNNNNNNNINNNNNNNSINNIINNNNNF-NN 404
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+IN N + N N N +N
Sbjct: 405 NNINNNLFNNNSMNYNKKEN 424
Score = 137 (53.3 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 43/190 (22%), Positives = 73/190 (38%)
Query: 386 PNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
P ST II R +N + YN+ N+N N + N N N
Sbjct: 228 PQSSQSTTPTIISSNSPLKTRNKNNNNNYNN---NNNNNNNNNNNNNNNNNNSNNNNNNN 284
Query: 446 HEYNPKYENRYEN-GTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDN 504
+ +N +N+ + ++ N N N N N + YN N+ N N + N +N+N
Sbjct: 285 NYFNNNKKNKIGDCNSNNSNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNN 344
Query: 505 SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSEN 564
+ N + + + S N + N +N ++ + + N IN N N
Sbjct: 345 NNNNNNNNNNNNNNNSNNNKNNNNINNNNNNNNNNNNNINNNNNNNSINNIINNNNNFNN 404
Query: 565 RS-NDNSYQN 573
+ N+N + N
Sbjct: 405 NNINNNLFNN 414
Score = 128 (50.1 bits), Expect = 0.00014, P = 0.00014
Identities = 47/230 (20%), Positives = 89/230 (38%)
Query: 375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
T+ ++ S P + ENI ++++ N + Y+ P S+ N
Sbjct: 185 TICIPSHHSPSPQFPVYYTENINNATPSTVV--SNSPNNYSQPISPQSSQSTTPTIISSN 242
Query: 435 PKYENRYENGTHEYNPKYENRYENGTHEYNG---PKNENTNPRYENGTHEYNIP--RLEN 489
+ R +N + YN N N + N N N N Y N + I N
Sbjct: 243 SPLKTRNKNNNNNYNNNNNNNNNNNNNNNNNNNNSNNNNNNNNYFNNNKKNKIGDCNSNN 302
Query: 490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
S N N + N +N+N+ N + + + + N + N +N ++ +
Sbjct: 303 SNNNNNNNHNNNNNNNNYNYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNN 362
Query: 550 LVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNI 599
N+IN N + N +N+N+ N + + ++++ N N I +N+
Sbjct: 363 NKNNNNINNNNNNNN-NNNNNINNNNNNNSINNIINNNNNFNNNNINNNL 411
>UNIPROTKB|F1LLW8 [details] [associations]
symbol:Ids "Protein Ids" species:10116 "Rattus norvegicus"
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
InterPro:IPR000917 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 RGD:1560491 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
GeneTree:ENSGT00640000091539 OMA:CREGRNL IPI:IPI00569342
Ensembl:ENSRNOT00000042925 ArrayExpress:F1LLW8 Uniprot:F1LLW8
Length = 544
Score = 149 (57.5 bits), Expect = 7.6e-07, P = 7.6e-07
Identities = 45/137 (32%), Positives = 71/137 (51%)
Query: 33 RIMAFAVLPLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDAL 91
R ++F++L F +++V S+ +I+ I+ DDL +G +G + +PNID L
Sbjct: 2 RQLSFSLLLGFFCIALVSAAQGNSATDALNILLIIVDDLR-PSLGCYGDKLVRSPNIDQL 60
Query: 92 AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSEKILPQYL 149
A I+ +N + Q +C PSR + +TG+ P T + N + G +PQY
Sbjct: 61 ASHSIVFENAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHSGNF----STIPQYF 116
Query: 150 KELGYRTRIVGK-WHLG 165
KE GY T VGK +H G
Sbjct: 117 KENGYVTMSVGKVFHPG 133
>TIGR_CMR|SPO_1083 [details] [associations]
symbol:SPO_1083 "choline sulfatase" species:246200
"Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
InterPro:IPR017785 InterPro:IPR025863 Pfam:PF12411
TIGRFAMs:TIGR03417 ProtClustDB:CLSK864791 GO:GO:0047753
RefSeq:YP_166334.1 ProteinModelPortal:Q5LUH1 GeneID:3195014
KEGG:sil:SPO1083 PATRIC:23375467 OMA:QEAIILF Uniprot:Q5LUH1
Length = 502
Score = 109 (43.4 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
Identities = 31/105 (29%), Positives = 46/105 (43%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ ++ D L D + PN+ LA N YT LC P R++ M+G+
Sbjct: 4 PNILILMVDQLNGTLFPDGPADWLHAPNLKRLAARSTRFANAYTASPLCAPGRASFMSGQ 63
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIVGKWH 163
P TG+ N R +P +L+ GY T + GK H
Sbjct: 64 LPSRTGVYDNAAEF--RSDIPT----YAHHLRRAGYYTCLSGKMH 102
Score = 76 (31.8 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
Identities = 34/123 (27%), Positives = 57/123 (46%)
Query: 274 EDFKRSKFA--AILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNW 331
ED ++S+ A A + LD+ +G+++E LE R +II+FVSD W
Sbjct: 248 EDIRKSRRAYFANISYLDDKLGEILEVLETTRQ--EAIILFVSDHGDMLGERGL-----W 300
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTL--LSAANKSDIPNYV 389
K +EG R ++ +P +E I + V D PTL L+ + ++I +
Sbjct: 301 ----FKMNFYEGSARVPLMVAAPGMEPGRI--DTPVSTIDVTPTLGELAGVDMAEIAPWT 354
Query: 390 NST 392
+ T
Sbjct: 355 DGT 357
Score = 54 (24.1 bits), Expect = 7.9e-07, Sum P(3) = 7.9e-07
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 677 LFDIKNDPCEKNNLADRSEDQ 697
LFD+ DP E NLAD + Q
Sbjct: 405 LFDLDADPHEMTNLADHPDHQ 425
Score = 52 (23.4 bits), Expect = 1.3e-06, Sum P(3) = 1.3e-06
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 783 LFDIKNDPCEKNNLADRSEVQ 803
LFD+ DP E NLAD + Q
Sbjct: 405 LFDLDADPHEMTNLADHPDHQ 425
>UNIPROTKB|Q3L472 [details] [associations]
symbol:Sulf2 "Protein Sulf2" species:10116 "Rattus
norvegicus" [GO:0002063 "chondrocyte development" evidence=IEA]
[GO:0003094 "glomerular filtration" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IEA] [GO:0005509 "calcium ion
binding" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
[GO:0005886 "plasma membrane" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0009986 "cell surface" evidence=IEA] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=IEA] [GO:0014846 "esophagus smooth muscle contraction"
evidence=IEA] [GO:0030177 "positive regulation of Wnt receptor
signaling pathway" evidence=IEA] [GO:0030201 "heparan sulfate
proteoglycan metabolic process" evidence=IEA] [GO:0032836
"glomerular basement membrane development" evidence=IEA]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=IEA] [GO:0040037 "negative regulation
of fibroblast growth factor receptor signaling pathway"
evidence=IEA] [GO:0048706 "embryonic skeletal system development"
evidence=IEA] [GO:0060348 "bone development" evidence=IEA]
[GO:0060384 "innervation" evidence=IEA] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 RGD:1305078 GO:GO:0005783
GO:GO:0005886 GO:GO:0005794 GO:GO:0009986 GO:GO:0005509
GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0030177 GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523
GO:GO:0004065 GeneTree:ENSGT00400000022041 GO:GO:0048706
GO:GO:0002063 GO:GO:0040037 GO:GO:0032836 EMBL:CH474005
GO:GO:0060384 GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431
GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
CTD:55959 OMA:PKYYGQG EMBL:AY742216 IPI:IPI00767654
RefSeq:NP_001030099.1 UniGene:Rn.4228 STRING:Q3L472
Ensembl:ENSRNOT00000008478 GeneID:311642 KEGG:rno:311642
InParanoid:Q3L472 NextBio:663979 Genevestigator:Q3L472
Uniprot:Q3L472
Length = 875
Score = 141 (54.7 bits), Expect = 9.5e-07, Sum P(2) = 9.5e-07
Identities = 58/223 (26%), Positives = 93/223 (41%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G++ +G + +++++ + G+ + + + D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSR-FYNYT---LCRNGMKEKHGSDYSTD----YLTDLITNDSVSFF 204
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 205 RTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246
Score = 60 (26.2 bits), Expect = 9.5e-07, Sum P(2) = 9.5e-07
Identities = 48/231 (20%), Positives = 87/231 (37%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + + L + L N+ IV+ +D + P +E
Sbjct: 286 TLMSVDDSMETIYDMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ + +++ D PT+L A DIP ++ ++I+ ++
Sbjct: 338 DIRVPFYVRGPSVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPADMDG--KSILKLLDSE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
R N H R+ + E G + K E N E + PKY+
Sbjct: 394 --RPVNRFHLKKKLRVWRDSFLVERGKLLH--KREGDKVNAQEENFLPKYQ 440
>UNIPROTKB|I3L2I6 [details] [associations]
symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH Ensembl:ENST00000574505
Uniprot:I3L2I6
Length = 106
Score = 124 (48.7 bits), Expect = 1.0e-06, P = 1.0e-06
Identities = 35/103 (33%), Positives = 57/103 (55%)
Query: 59 PPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYT-VQLCTPSRSAIMTG 117
P + + +LADD G+ G + I TP++DALA ++ +N +T V C+PSR++++TG
Sbjct: 4 PRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTG 62
Query: 118 KHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGYRT 156
P H N +YG + + +K+ LP L + G RT
Sbjct: 63 L-PQH----QNGMYGLHQDVHHFNSFDKVRSLPLLLSQAGVRT 100
>UNIPROTKB|E1BIY5 [details] [associations]
symbol:SULF2 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0060384 "innervation" evidence=IEA] [GO:0060348 "bone
development" evidence=IEA] [GO:0048706 "embryonic skeletal system
development" evidence=IEA] [GO:0040037 "negative regulation of
fibroblast growth factor receptor signaling pathway" evidence=IEA]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=IEA] [GO:0032836 "glomerular basement
membrane development" evidence=IEA] [GO:0030201 "heparan sulfate
proteoglycan metabolic process" evidence=IEA] [GO:0030177 "positive
regulation of Wnt receptor signaling pathway" evidence=IEA]
[GO:0014846 "esophagus smooth muscle contraction" evidence=IEA]
[GO:0010575 "positive regulation vascular endothelial growth factor
production" evidence=IEA] [GO:0009986 "cell surface" evidence=IEA]
[GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
[GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0004065
"arylsulfatase activity" evidence=IEA] [GO:0003094 "glomerular
filtration" evidence=IEA] [GO:0002063 "chondrocyte development"
evidence=IEA] [GO:0005794 "Golgi apparatus" evidence=IEA]
[GO:0005509 "calcium ion binding" evidence=IEA] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005886
GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 GO:GO:0010575
GO:GO:0060348 Gene3D:3.40.720.10 SUPFAM:SSF53649 GO:GO:0030177
GO:GO:0003094 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0030201 KO:K14607
GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548
CTD:55959 OMA:PKYYGQG EMBL:DAAA02036810 IPI:IPI00698144
RefSeq:NP_001179867.1 UniGene:Bt.90452 ProteinModelPortal:E1BIY5
Ensembl:ENSBTAT00000009852 GeneID:533264 KEGG:bta:533264
NextBio:20875979 Uniprot:E1BIY5
Length = 862
Score = 143 (55.4 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
Identities = 62/225 (27%), Positives = 89/225 (39%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G++ +G + Y + +K G D +D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGFDYSKD----------YLTDLITNDSVS 202
Query: 232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + L+HAA H P Q + N +HI
Sbjct: 203 FFRASKKMYPHRPVLMVLSHAAPHGPEDSAP-QYSSLFPNASQHI 246
Score = 57 (25.1 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
Identities = 48/233 (20%), Positives = 89/233 (38%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSSLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMQFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + L + L N+ IV+ +D + P +E
Sbjct: 286 TLLSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ + +++ D PT+L A DIP+ ++ ++I+ +
Sbjct: 338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPSDMDG--KSILKLLDTE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGT--HEYNP-KYENRYENGTHEYNPKYE 453
R N H R+ + E G H+ + K + + EN + PKY+
Sbjct: 394 --RPANRFHLKKKLRVWRDSFLVERGKLLHKRDSDKVDAQEEN----FLPKYQ 440
>UNIPROTKB|G3XAE6 [details] [associations]
symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
"Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
"Golgi apparatus" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] [GO:0009986 "cell surface"
evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 GO:GO:0005783 GO:GO:0005794 EMBL:CH471077
GO:GO:0009986 GO:GO:0005509 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AL034418
InterPro:IPR024609 Pfam:PF12548 EMBL:AL354813 UniGene:Hs.162016
HGNC:HGNC:20392 EMBL:AL121777 ProteinModelPortal:G3XAE6 SMR:G3XAE6
PRIDE:G3XAE6 Ensembl:ENST00000361612 ArrayExpress:G3XAE6
Bgee:G3XAE6 Uniprot:G3XAE6
Length = 852
Score = 139 (54.0 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
Identities = 61/225 (27%), Positives = 89/225 (39%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G++ +G + Y + +K G D +D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSKD----------YLTDLITNDSVS 202
Query: 232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246
Score = 61 (26.5 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
Identities = 51/233 (21%), Positives = 87/233 (37%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + L + L N+ IV+ +D + P +E
Sbjct: 286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ G + V D PT+L A DIP ++ ++I+ +
Sbjct: 338 DIRVPFYVRGPNVEA-GCLNPHIVLNIDLAPTILDIAGL-DIPADMDG--KSILKLLDTE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGTHEYNPKYE 453
R N H R+ + E G H+ N K + + EN + PKY+
Sbjct: 394 --RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN----FLPKYQ 440
>UNIPROTKB|Q8IWU5 [details] [associations]
symbol:SULF2 "Extracellular sulfatase Sulf-2" species:9606
"Homo sapiens" [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0005795 "Golgi stack" evidence=IEA] [GO:0004065 "arylsulfatase
activity" evidence=IMP;IDA] [GO:0005615 "extracellular space"
evidence=NAS] [GO:0009986 "cell surface" evidence=IDA] [GO:0030201
"heparan sulfate proteoglycan metabolic process" evidence=IDA;NAS]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=IDA] [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=IDA;IMP] [GO:0005783 "endoplasmic reticulum"
evidence=IDA] [GO:0002063 "chondrocyte development" evidence=ISS]
[GO:0014846 "esophagus smooth muscle contraction" evidence=ISS]
[GO:0035860 "glial cell-derived neurotrophic factor receptor
signaling pathway" evidence=ISS] [GO:0048706 "embryonic skeletal
system development" evidence=ISS] [GO:0051216 "cartilage
development" evidence=ISS] [GO:0060384 "innervation" evidence=ISS]
[GO:0005886 "plasma membrane" evidence=ISS] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=ISS] [GO:0040037 "negative regulation of fibroblast growth
factor receptor signaling pathway" evidence=ISS] [GO:0003094
"glomerular filtration" evidence=ISS] [GO:0032836 "glomerular
basement membrane development" evidence=ISS] [GO:0001822 "kidney
development" evidence=ISS] [GO:0060348 "bone development"
evidence=ISS] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 EMBL:AY101176 GO:GO:0005783 GO:GO:0005886
EMBL:CH471077 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
GO:GO:0048706 GO:GO:0002063 GO:GO:0040037 GO:GO:0032836
GO:GO:0060384 GO:GO:0008449 GO:GO:0030201 EMBL:AL034418 KO:K14607
HOVERGEN:HBG056431 GO:GO:0014846 GO:GO:0035860 InterPro:IPR024609
Pfam:PF12548 EMBL:AB033073 EMBL:AY358461 EMBL:CR749319
EMBL:AL354813 EMBL:BC020962 EMBL:BC110539 EMBL:AL133001
IPI:IPI00297252 IPI:IPI00555879 RefSeq:NP_001155313.1
RefSeq:NP_061325.1 RefSeq:NP_940998.2 UniGene:Hs.162016
ProteinModelPortal:Q8IWU5 SMR:Q8IWU5 IntAct:Q8IWU5 STRING:Q8IWU5
PhosphoSite:Q8IWU5 DMDM:33112446 PaxDb:Q8IWU5 PRIDE:Q8IWU5
DNASU:55959 Ensembl:ENST00000359930 Ensembl:ENST00000467815
Ensembl:ENST00000484875 GeneID:55959 KEGG:hsa:55959 UCSC:uc002xto.3
UCSC:uc002xtr.3 CTD:55959 GeneCards:GC20M046285 H-InvDB:HIX0027735
HGNC:HGNC:20392 HPA:HPA002325 MIM:610013 neXtProt:NX_Q8IWU5
PharmGKB:PA134902131 InParanoid:Q8IWU5 OMA:PKYYGQG
OrthoDB:EOG49KFPX PhylomeDB:Q8IWU5 GenomeRNAi:55959 NextBio:61367
ArrayExpress:Q8IWU5 Bgee:Q8IWU5 CleanEx:HS_SULF2
Genevestigator:Q8IWU5 GermOnline:ENSG00000196562 Uniprot:Q8IWU5
Length = 870
Score = 139 (54.0 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
Identities = 61/225 (27%), Positives = 89/225 (39%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDY-FDHSAEEMK-MWGLDMRRDLEPAWDLHGKYSTDVFTAEAVD 231
G++ +G + Y + +K G D +D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSKD----------YLTDLITNDSVS 202
Query: 232 IIHNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 203 FFRTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246
Score = 61 (26.5 bits), Expect = 1.2e-06, Sum P(2) = 1.2e-06
Identities = 51/233 (21%), Positives = 87/233 (37%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + L + L N+ IV+ +D + P +E
Sbjct: 286 TLMSVDDSMETIYNMLVETGELDNTYIVYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ G + V D PT+L A DIP ++ ++I+ +
Sbjct: 338 DIRVPFYVRGPNVEA-GCLNPHIVLNIDLAPTILDIAGL-DIPADMDG--KSILKLLDTE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGT--HEY-NPKYENRYENGTHEYNPKYE 453
R N H R+ + E G H+ N K + + EN + PKY+
Sbjct: 394 --RPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN----FLPKYQ 440
>DICTYBASE|DDB_G0282469 [details] [associations]
symbol:gnt13 "putative
beta-1,3-N-acetylglucosaminyltransferase" species:44689
"Dictyostelium discoideum" [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0003674 "molecular_function" evidence=ND] [GO:0016021 "integral
to membrane" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
dictyBase:DDB_G0282469 GO:GO:0016021 EMBL:AAFI02000047
GenomeReviews:CM000152_GR RefSeq:XP_640067.1
EnsemblProtists:DDB0231851 GeneID:8623596 KEGG:ddi:DDB_G0282469
eggNOG:NOG279004 InParanoid:Q54SH2 ProtClustDB:CLSZ2430453
Uniprot:Q54SH2
Length = 635
Score = 147 (56.8 bits), Expect = 1.6e-06, P = 1.6e-06
Identities = 39/153 (25%), Positives = 62/153 (40%)
Query: 421 NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH 480
N+++ + N + EN Y + N +N Y N + N N N N N +
Sbjct: 277 NTDSEFNNINYNMENLNENEYLKNINNNNNNNDNNYNNNNNNNNNNNNNNNNNNNNNNNN 336
Query: 481 EYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 540
N N+ N N + N + DN+ N+ID ID N N I NI++ I
Sbjct: 337 NNNNNNNNNN-NNNNNNNNNNIDNNIDNKIDNID-------NNIDNNNNI-DNINNNNNI 387
Query: 541 SALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
+ + N+ N N + N +N+N+ N
Sbjct: 388 NNIDNNNSNYNDNNNNNNNNNNNNNNNNNNNNN 420
>DICTYBASE|DDB_G0287637 [details] [associations]
symbol:mybD "myb domain-containing protein"
species:44689 "Dictyostelium discoideum" [GO:0003677 "DNA binding"
evidence=IEA;ISS] [GO:0003682 "chromatin binding" evidence=IEA]
[GO:0008150 "biological_process" evidence=ND] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR001005 InterPro:IPR009057 Pfam:PF00249
SMART:SM00717 dictyBase:DDB_G0287637 GO:GO:0005634 GO:GO:0006355
GO:GO:0003677 GO:GO:0006351 GO:GO:0003682 GenomeReviews:CM000154_GR
EMBL:AAFI02000103 Gene3D:1.10.10.60 SUPFAM:SSF46689
InterPro:IPR017930 PROSITE:PS51294 RefSeq:XP_637145.1
ProteinModelPortal:Q54K19 EnsemblProtists:DDB0220512 GeneID:8626240
KEGG:ddi:DDB_G0287637 eggNOG:NOG321969 OMA:HNNYINH
ProtClustDB:CLSZ2846665 Uniprot:Q54K19
Length = 595
Score = 146 (56.5 bits), Expect = 1.9e-06, P = 1.9e-06
Identities = 45/191 (23%), Positives = 84/191 (43%)
Query: 384 DIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN-RYE 442
D+ + N+ NI NSI YEN + P+ N N +Y++ + N +++ +
Sbjct: 16 DLSDNYNNNNSNINTNNNNSINDYENQNNGLVVPQ-SNQNQQYQD---DQNDSFDDDSMD 71
Query: 443 NGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
G + N + +N + N +EN N N + NI EN+I+ N + N +N
Sbjct: 72 EGEEKSNLIIDESQQNSLNNNNN-NSENNNI---NNSENNNINNSENNIHNNNNNNNNNN 127
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTS 562
+N+ N + + + + N + N ++N ++ I+ N+IN N
Sbjct: 128 NNNNNNNNNNNNNNNNNNNNNNNNNNNTINNNNNNNNINNNINNNNNNYNNNNINNNNNI 187
Query: 563 ENRSNDNSYQN 573
N +N+N+ N
Sbjct: 188 NNNNNNNNENN 198
Score = 134 (52.2 bits), Expect = 3.7e-05, P = 3.7e-05
Identities = 45/196 (22%), Positives = 75/196 (38%)
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
++ + ++ + N N++ N I EN+ I EN H N+ N+N N + N
Sbjct: 80 IIDESQQNSLNNNNNNSENNNINNSENNNINNSENNIHNNNNNNNNNNNNNNNNNNNNNN 139
Query: 435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSIN-G 493
N N + N N N + N N N N N + NI N+ N
Sbjct: 140 NNNNNNNNNNNNNNNTINNNNNNNNIN--NNINNNNNNYNNNNINNNNNINNNNNNNNEN 197
Query: 494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
N +EN +N+N N+ G + N + N +N ++ + +
Sbjct: 198 NNNNENNNNNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNNNKNN----NN 253
Query: 554 NSINGNGTSENRSNDN 569
N+ N N + NR D+
Sbjct: 254 NNNNNNNNNNNRKFDD 269
Score = 131 (51.2 bits), Expect = 7.8e-05, P = 7.8e-05
Identities = 43/174 (24%), Positives = 69/174 (39%)
Query: 401 ENSILRYENGTHEYNSPRIENSN-TRYENGTHEYNPKYENRYENGTHEYNPKYENRYENG 459
+NS+ N + N EN+N EN H N N N + N N N
Sbjct: 86 QNSLNNNNNNSENNNINNSENNNINNSENNIHNNNNNNNNNNNNNNNNNNNNNNNNNNN- 144
Query: 460 THEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
N N N N N + NI N+IN N + N +N N+ N I+ + +
Sbjct: 145 ----NNNNNNNNNNTINNNNNNNNI---NNNINNNNNNYNNNNINN-NNNINNNNNNNNE 196
Query: 520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
+ N N +N ++ + G+ ++ N+ N N + N +N+N+ N
Sbjct: 197 NNNNNENNN---NNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNN 247
Score = 127 (49.8 bits), Expect = 0.00021, P = 0.00021
Identities = 44/197 (22%), Positives = 77/197 (39%)
Query: 413 EYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTN 472
+ NS N+N+ N + N N EN H N N N + N N N N
Sbjct: 85 QQNSLNNNNNNSENNNINNSENNNINNS-ENNIHNNNNNNNNNNNNNNNNNNNNNNNNNN 143
Query: 473 PRYENGTHEYNIPRLENSINGNGTSEN-RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL 531
N + N + N+ N N + N +N+N+Y N + I+ + ++ N +
Sbjct: 144 NNNNNNNNNNNT--INNNNNNNNINNNINNNNNNYNN--NNINNNNNINNNNNNNNENNN 199
Query: 532 HNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 591
+N ++ + + E IN N + N +N+N+ N + + + + N +
Sbjct: 200 NNENNNNNNENNNKFIGSPMGEPQINNNNNNNNNNNNNNNNNNNNNNNNKNNNNNNNNNN 259
Query: 592 RNTILHNIDDEWQISAL 608
N DD+ I L
Sbjct: 260 NNNNNRKFDDQQIIKDL 276
Score = 121 (47.7 bits), Expect = 0.00094, P = 0.00094
Identities = 42/187 (22%), Positives = 67/187 (35%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N ++ N N+ N N+ N + N+ N+N N + N Y N
Sbjct: 120 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTINNNNNNNNINNNINNNNNNYNNN 179
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
N + N N EN + NEN N EN P E IN N + N
Sbjct: 180 NINNNNNINNNNNNNNENNNN------NENNNNNNENNNKFIGSPMGEPQINNNNNNNNN 233
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
+N+N+ N + + + + N + N DD+ I L + K N +
Sbjct: 234 NNNNNNNNNNNNNNNKNNNNNNNNNNNNNNNRKFDDQQIIKDLENRLKEAKKTNQLLDEK 293
Query: 561 TSENRSN 567
++ + N
Sbjct: 294 CNQLKKN 300
>UNIPROTKB|Q6MX51 [details] [associations]
symbol:Rv0296c "Sulfatase" species:83332 "Mycobacterium
tuberculosis H37Rv" [GO:0004065 "arylsulfatase activity"
evidence=IDA] [GO:0005618 "cell wall" evidence=IDA] [GO:0046872
"metal ion binding" evidence=IDA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005618
GenomeReviews:AL123456_GR GO:GO:0046872 EMBL:BX842573
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0004065 GO:GO:0008484 KO:K01567 EMBL:CP003248
PIR:F70837 RefSeq:YP_006513622.1 RefSeq:YP_177712.1
ProteinModelPortal:Q6MX51 SMR:Q6MX51 PRIDE:Q6MX51
EnsemblBacteria:EBMYCT00000002598 GeneID:13316285 GeneID:886600
KEGG:mtu:Rv0296c KEGG:mtv:RVBD_0296c PATRIC:18149150
TubercuList:Rv0296c HOGENOM:HOG000045150 OMA:DPGMAEP
ProtClustDB:CLSK799699 Uniprot:Q6MX51
Length = 465
Score = 133 (51.9 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 36/104 (34%), Positives = 56/104 (53%)
Query: 72 WNDVG-FHGLDQIP---TPNIDALAYSGIIL-KNYYTVQLCTPSRSAIMTGKHPIHTGMQ 126
W+D+G + G+ P +P +D LA GI+ + + T LCTPSR ++ TG++P G+
Sbjct: 18 WHDLGRYLGVYHHPDVYSPRLDRLAAEGILFTRAHATAPLCTPSRGSLFTGRYPQSNGLV 77
Query: 127 HNVLYGCE-RGGLPLSEKILPQYLKELGYRTRIVGKWHLGFYKK 169
+G E R G+ + LPQ L E G+ + + G H Y K
Sbjct: 78 GLAHHGWEYRTGV----QTLPQLLSESGWYSALFGMQHETSYPK 117
Score = 58 (25.5 bits), Expect = 1.9e-06, Sum P(2) = 1.9e-06
Identities = 17/55 (30%), Positives = 25/55 (45%)
Query: 636 YLSGLSDREWLALAMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 690
Y+ + R L L A + P+ + P PQ L+D++ DP E NNL
Sbjct: 340 YIENYAPRPLLDLPWDIQESPAGMAVAPLVKAP-RPQRE--LYDLRADPTETNNL 391
Score = 52 (23.4 bits), Expect = 7.8e-06, Sum P(2) = 7.8e-06
Identities = 13/34 (38%), Positives = 19/34 (55%)
Query: 763 ASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNL 796
A + P+ + P PQ L+D++ DP E NNL
Sbjct: 361 AGMAVAPLVKAP-RPQRE--LYDLRADPTETNNL 391
>CGD|CAL0006287 [details] [associations]
symbol:SHE3 species:5476 "Candida albicans" [GO:0008298
"intracellular mRNA localization" evidence=IMP] [GO:0003729 "mRNA
binding" evidence=IDA] [GO:0001897 "cytolysis by symbiont of host
cells" evidence=IMP] [GO:0030447 "filamentous growth" evidence=IMP]
[GO:0009267 "cellular response to starvation" evidence=IMP]
[GO:0071216 "cellular response to biotic stimulus" evidence=IMP]
[GO:0005934 "cellular bud tip" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] [GO:0036170 "filamentous growth of a
population of unicellular organisms in response to starvation"
evidence=IMP] [GO:0036180 "filamentous growth of a population of
unicellular organisms in response to biotic stimulus" evidence=IMP]
[GO:0048309 "endoplasmic reticulum inheritance" evidence=IEA]
[GO:0007533 "mating type switching" evidence=IEA] CGD:CAL0006287
GO:GO:0071216 GO:GO:0036180 GO:GO:0005789 GO:GO:0003729
GO:GO:0009267 GO:GO:0036170 GO:GO:0008298 GO:GO:0051028
EMBL:AACQ01000034 EMBL:AACQ01000033 RefSeq:XP_719156.1
RefSeq:XP_719272.1 GeneID:3639162 GeneID:3639277
KEGG:cal:CaO19.13040 KEGG:cal:CaO19.5595 eggNOG:NOG245845
GO:GO:0001897 Uniprot:Q5ABV6
Length = 519
Score = 145 (56.1 bits), Expect = 1.9e-06, P = 1.9e-06
Identities = 39/126 (30%), Positives = 61/126 (48%)
Query: 409 NGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
NG + N+ R NS ++R +NG H N Y++R ++G H P +N N + YN
Sbjct: 390 NGNNNINNHRRNNSVDSRSDNGQHRRNNSYDSRSDHGQHRRQPSQQNNNYNNNN-YNNNN 448
Query: 468 NENTNPRYENG--THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
N N N NG ++ + N N NG + N N N++ N+ ++ + N S
Sbjct: 449 NNNNN-NSNNGFVKRSGSVRNVNNYNNNNGNANN--NGNNHGNKSKRRSTYNN-NNNNNS 504
Query: 526 KRNTIL 531
KRN+ L
Sbjct: 505 KRNSQL 510
>UNIPROTKB|Q5ABV6 [details] [associations]
symbol:SHE3 "SWI5-dependent HO expression protein 3"
species:237561 "Candida albicans SC5314" [GO:0001897 "cytolysis by
symbiont of host cells" evidence=IMP] [GO:0003729 "mRNA binding"
evidence=IDA] [GO:0008298 "intracellular mRNA localization"
evidence=IMP] [GO:0009267 "cellular response to starvation"
evidence=IMP] [GO:0030447 "filamentous growth" evidence=IMP]
[GO:0036170 "filamentous growth of a population of unicellular
organisms in response to starvation" evidence=IMP] [GO:0036180
"filamentous growth of a population of unicellular organisms in
response to biotic stimulus" evidence=IMP] [GO:0071216 "cellular
response to biotic stimulus" evidence=IMP] CGD:CAL0006287
GO:GO:0071216 GO:GO:0036180 GO:GO:0005789 GO:GO:0003729
GO:GO:0009267 GO:GO:0036170 GO:GO:0008298 GO:GO:0051028
EMBL:AACQ01000034 EMBL:AACQ01000033 RefSeq:XP_719156.1
RefSeq:XP_719272.1 GeneID:3639162 GeneID:3639277
KEGG:cal:CaO19.13040 KEGG:cal:CaO19.5595 eggNOG:NOG245845
GO:GO:0001897 Uniprot:Q5ABV6
Length = 519
Score = 145 (56.1 bits), Expect = 1.9e-06, P = 1.9e-06
Identities = 39/126 (30%), Positives = 61/126 (48%)
Query: 409 NGTHEYNSPRIENS-NTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
NG + N+ R NS ++R +NG H N Y++R ++G H P +N N + YN
Sbjct: 390 NGNNNINNHRRNNSVDSRSDNGQHRRNNSYDSRSDHGQHRRQPSQQNNNYNNNN-YNNNN 448
Query: 468 NENTNPRYENG--THEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 525
N N N NG ++ + N N NG + N N N++ N+ ++ + N S
Sbjct: 449 NNNNN-NSNNGFVKRSGSVRNVNNYNNNNGNANN--NGNNHGNKSKRRSTYNN-NNNNNS 504
Query: 526 KRNTIL 531
KRN+ L
Sbjct: 505 KRNSQL 510
>ZFIN|ZDB-GENE-030131-9242 [details] [associations]
symbol:sulf1 "sulfatase 1" species:7955 "Danio
rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
[GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0009986 "cell
surface" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 ZFIN:ZDB-GENE-030131-9242
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0008484 GeneTree:ENSGT00400000022041
HOGENOM:HOG000290161 KO:K14607 HOVERGEN:HBG056431
InterPro:IPR024609 Pfam:PF12548 CTD:23213 OMA:SVRVTHK EMBL:CR385071
EMBL:CR382282 EMBL:AY332604 IPI:IPI00509599 RefSeq:NP_001003846.1
UniGene:Dr.81473 Ensembl:ENSDART00000056081 GeneID:337298
KEGG:dre:337298 InParanoid:Q6EFA1 NextBio:20812164 Uniprot:Q6EFA1
Length = 1099
Score = 136 (52.9 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
Identities = 57/223 (25%), Positives = 93/223 (41%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II I+ DD DV L Q+ + G N + T +C PSRS+++TGK
Sbjct: 42 PNIILIMTDD---QDVELGSL-QVMNKTRKIMEDGGTSFTNAFVTTPMCCPSRSSMLTGK 97
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + + YL GYRT GK+ L Y Y P
Sbjct: 98 Y-VHN---HNTYTNNENCSSPSWQAQHEPRSFAVYLNNTGYRTAFFGKY-LNEYNGSYIP 152
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ +G + +++++ G + + A D Y TD+ T ++++
Sbjct: 153 P--GWREWVGLIKNSR-FYNYTVCRN---GNKEKHGADYAKD----YFTDLITNDSINYF 202
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + + N +HI
Sbjct: 203 RTSKRMFPHRPVMMVISHAAPHGPEDSAP-QYSELFPNASQHI 244
Score = 63 (27.2 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
IH ++ K L +D+SV KV AL L N+ I++ +D
Sbjct: 269 IHMEFTNYLHRKRLQTLMSVDDSVEKVYNALVDTGELDNTYIIYTAD 315
Score = 52 (23.4 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
Identities = 18/87 (20%), Positives = 34/87 (39%)
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYE---NRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
+G + P+Y + N + P Y N ++ +Y GP + + N H +
Sbjct: 224 HGPEDSAPQYSELFPNASQHITPSYNYAPNMDKHWIMQYTGPMKP-IHMEFTNYLHRKRL 282
Query: 485 PRL---ENSING--NGTSENRSNDNSY 506
L ++S+ N + DN+Y
Sbjct: 283 QTLMSVDDSVEKVYNALVDTGELDNTY 309
Score = 47 (21.6 bits), Expect = 2.0e-06, Sum P(3) = 2.0e-06
Identities = 12/39 (30%), Positives = 16/39 (41%)
Query: 537 EWQISALTRGKWKLVK-ENSINGNGTSENRS-NDNSYQN 573
+W GKW+L K + S+ RS SY N
Sbjct: 461 KWHCVEEVSGKWRLQKCKGSLKEGSKKRTRSLRSRSYDN 499
>DICTYBASE|DDB_G0280253 [details] [associations]
symbol:DDB_G0280253 "putative GTPase activating
protein (GAP)" species:44689 "Dictyostelium discoideum" [GO:0032851
"positive regulation of Rab GTPase activity" evidence=IEA]
[GO:0032313 "regulation of Rab GTPase activity" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0005097 "Rab GTPase
activator activity" evidence=IEA] [GO:0043547 "positive regulation
of GTPase activity" evidence=IEA] [GO:0005096 "GTPase activator
activity" evidence=IEA] InterPro:IPR000195 Pfam:PF00566
PROSITE:PS50086 SMART:SM00164 dictyBase:DDB_G0280253
GenomeReviews:CM000152_GR GO:GO:0005622 EMBL:AAFI02000035
eggNOG:COG5210 GO:GO:0005097 GO:GO:0032851 SUPFAM:SSF47923
RefSeq:XP_641332.1 ProteinModelPortal:Q54VM3
EnsemblProtists:DDB0235314 GeneID:8622466 KEGG:ddi:DDB_G0280253
InParanoid:Q54VM3 OMA:ISHDISR ProtClustDB:CLSZ2846777
Uniprot:Q54VM3
Length = 1173
Score = 149 (57.5 bits), Expect = 2.2e-06, P = 2.2e-06
Identities = 52/205 (25%), Positives = 89/205 (43%)
Query: 380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRI---ENSNT-RYENGTHEYNP 435
+N S+ N N+ + N R N+ N Y + EN+++ Y + + ++
Sbjct: 74 SNNSNNSNNNNNNINNNNNRNNNNFNNNNNNNVNYFEQDVDFGENAHSSNYGDNNNIFSD 133
Query: 436 KYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
+ N Y N + + N Y+N + YN NEN N Y N + N N+ N N
Sbjct: 134 E-SNNYNNNNNNNDYNNNNYYDN--NNYNENYNENYNENYNNNNNNNNNNNNNNN-NNNN 189
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
+ N +N+N+Y NE + + ++N +N ++E+ I+ +NS
Sbjct: 190 NNNNNNNNNNYYNENNN---------QQQLQQNYSNNNYNNEY-INNFNNN------DNS 233
Query: 556 INGNGTSENR-SNDNSYQNEIDGID 579
N N + N SN N+Y N +G D
Sbjct: 234 YNNNNNNNNNNSNFNNYNNNNNGYD 258
Score = 135 (52.6 bits), Expect = 6.9e-05, P = 6.9e-05
Identities = 37/137 (27%), Positives = 54/137 (39%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N D NY + EN Y N+ N + N+ N+N N + N +
Sbjct: 151 NYYDNNNYNENYNENYNENYNNNNNNNNNNNNNNNNNNNNNNNNNNNNNYYNENNNQQQL 210
Query: 441 YEN-GTHEYNPKYENRYENGTHEYNGPKNENTNP----RYENGTHEYNIPRLENSINGN- 494
+N + YN +Y N + N + YN N N N Y N + Y+ NS N N
Sbjct: 211 QQNYSNNNYNNEYINNFNNNDNSYNNNNNNNNNNSNFNNYNNNNNGYD-NSYSNSNNNNY 269
Query: 495 --GTSENRSNDNSYQNE 509
++ N NDN Y +
Sbjct: 270 YDNSNNNSKNDNQYNQQ 286
>DICTYBASE|DDB_G0267636 [details] [associations]
symbol:mybM "putative myb transcription factor"
species:44689 "Dictyostelium discoideum" [GO:0003682 "chromatin
binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0008150 "biological_process" evidence=ND] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001005
InterPro:IPR009057 SMART:SM00717 dictyBase:DDB_G0267636
GO:GO:0005634 GenomeReviews:CM000150_GR GO:GO:0006355 GO:GO:0003677
EMBL:AAFI02000003 GO:GO:0006351 GO:GO:0003682 Gene3D:1.10.10.60
SUPFAM:SSF46689 InterPro:IPR017930 PROSITE:PS51294
InterPro:IPR017877 PROSITE:PS50090 HSSP:P06876 RefSeq:XP_647181.1
ProteinModelPortal:Q55GK3 EnsemblProtists:DDB0220517 GeneID:8615985
KEGG:ddi:DDB_G0267636 eggNOG:NOG244606 OMA:KRICKRT Uniprot:Q55GK3
Length = 669
Score = 146 (56.5 bits), Expect = 2.2e-06, P = 2.2e-06
Identities = 47/209 (22%), Positives = 85/209 (40%)
Query: 365 QYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNT 424
+Y+ ++ ++L N+S++ + +NS+ N N L+Y+ + + +
Sbjct: 218 RYLQLTGKGGSILPPLNQSNVSS-LNSSSANTF----NQQLQYQQQQQQQQQQQQQQQQQ 272
Query: 425 RYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNI 484
+ + + YN Y N N + YN + N+ N + +N N N + YN
Sbjct: 273 QQQQMNNNYNNNYNNNNNNINNNYNNNHNNQNNNNNNNHNHYNNHYNQMNNNNNNNHYN- 331
Query: 485 PRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 544
N+ N N N +N+N YQ + S + N S + L +I D S
Sbjct: 332 ----NNNNNNNNINNNNNNNMYQMNNNN----SNSNNNNKSHNLSPLSSIIDSNTSSPSF 383
Query: 545 RGKWKLVKENSINGNGTSENRSNDNSYQN 573
G N+ N N + N +N+N+ N
Sbjct: 384 EGCEDNNNNNNNNNNNNNNNNNNNNNNNN 412
Score = 104 (41.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
Identities = 35/133 (26%), Positives = 52/133 (39%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N ++I N N+ N N+ Y N H YN N+N Y N + N N
Sbjct: 288 NNNNINNNYNNNHNNQNNNNNNNHNHYNN--H-YNQMNNNNNNNHYNNNNNNNN-NINNN 343
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNE----NTN-PRYENGTHEYNIPRLENSINGNG 495
N ++ N N N P + NT+ P +E G + N N+ N N
Sbjct: 344 NNNNMYQMNNNNSNSNNNNKSHNLSPLSSIIDSNTSSPSFE-GCEDNNNNNNNNNNNNNN 402
Query: 496 TSENRSNDNSYQN 508
+ N +N+N+ N
Sbjct: 403 NNNNNNNNNNSNN 415
Score = 74 (31.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
Identities = 32/116 (27%), Positives = 52/116 (44%)
Query: 490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
SIN SEN +N+N+ N+ D I V NEP + + E+ + + K
Sbjct: 503 SINNIIDSENNNNNNN--NDNDNIKVEDN-GCNEPVMKKV---RSNGEFYYQPI-KNKLN 555
Query: 550 LVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
N+ N N + N +N+N+ N +G + S S N S + +L + + QI
Sbjct: 556 NNNNNNNNNNNNNNNNNNNNNNNNNNNGNNTLSYNSDN--SSDDDMLPKLKNNKQI 609
>UNIPROTKB|P15586 [details] [associations]
symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0008449 "N-acetylglucosamine-6-sulfatase activity"
evidence=IEA] [GO:0006027 "glycosaminoglycan catabolic process"
evidence=TAS] [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IDA] [GO:0005975 "carbohydrate metabolic process"
evidence=TAS] [GO:0030203 "glycosaminoglycan metabolic process"
evidence=TAS] [GO:0042339 "keratan sulfate metabolic process"
evidence=TAS] [GO:0042340 "keratan sulfate catabolic process"
evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
[GO:0044281 "small molecule metabolic process" evidence=TAS]
[GO:0005515 "protein binding" evidence=IPI] Reactome:REACT_11123
Reactome:REACT_111217 InterPro:IPR000917 InterPro:IPR012251
InterPro:IPR015981 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036666 Reactome:REACT_116125 GO:GO:0046872
GO:GO:0005975 GO:GO:0043202 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 CTD:2799
HOGENOM:HOG000169239 HOVERGEN:HBG005840 KO:K01137 OrthoDB:EOG4NGGMF
GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:Z12173 EMBL:AK223484
EMBL:AC025262 EMBL:BC012482 IPI:IPI00012102 PIR:S27164
RefSeq:NP_002067.1 UniGene:Hs.334534 ProteinModelPortal:P15586
SMR:P15586 IntAct:P15586 STRING:P15586 PhosphoSite:P15586
DMDM:232126 PaxDb:P15586 PeptideAtlas:P15586 PRIDE:P15586
DNASU:2799 Ensembl:ENST00000258145 GeneID:2799 KEGG:hsa:2799
UCSC:uc001ssf.3 GeneCards:GC12M065107 H-InvDB:HIX0010785
HGNC:HGNC:4422 HPA:CAB026011 HPA:HPA013695 MIM:252940 MIM:607664
neXtProt:NX_P15586 Orphanet:79272 PharmGKB:PA28802
InParanoid:P15586 PhylomeDB:P15586 BioCyc:MetaCyc:HS06046-MONOMER
BRENDA:3.1.6.14 SABIO-RK:P15586 ChiTaRS:GNS GenomeRNAi:2799
NextBio:11033 ArrayExpress:P15586 Bgee:P15586 CleanEx:HS_GNS
Genevestigator:P15586 GermOnline:ENSG00000135677 Uniprot:P15586
Length = 552
Score = 141 (54.7 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
Identities = 65/228 (28%), Positives = 98/228 (42%)
Query: 36 AFAVLPLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDAL-AYS 94
A +L L L VF + A + P+++ +L DD D G+ P AL
Sbjct: 25 ALLLLVLGGCLG-VF-GVAAGTRRPNVVLLLTDD---QDEVLGGMT--PLKKTKALIGEM 77
Query: 95 GIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLK 150
G+ + Y LC PSR++I+TGK+P + + +N L G C + + E P L+
Sbjct: 78 GMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILR 137
Query: 151 EL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEEMKMWGLD 205
+ GY+T GK Y EY P G E LG YW + + + + G
Sbjct: 138 SMCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKA 192
Query: 206 MRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
+ + D Y TDV ++D + S EP F+ +A A HS
Sbjct: 193 RKHGENYSVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHS 236
Score = 51 (23.0 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 290 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 326
>DICTYBASE|DDB_G0282835 [details] [associations]
symbol:srfB "putative MADS-box transcription factor"
species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=ISS] [GO:0019933
"cAMP-mediated signaling" evidence=IMP] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA;ISS] [GO:0003700
"sequence-specific DNA binding transcription factor activity"
evidence=ISS] [GO:0046983 "protein dimerization activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
InterPro:IPR002100 Pfam:PF00319 PRINTS:PR00404 PROSITE:PS00350
PROSITE:PS50066 SMART:SM00432 dictyBase:DDB_G0282835 GO:GO:0005634
EMBL:AAFI02000047 GenomeReviews:CM000152_GR GO:GO:0019933
GO:GO:0043565 GO:GO:0003700 GO:GO:0006351 eggNOG:COG5068
SUPFAM:SSF55455 HSSP:P11831 ProtClustDB:CLSZ2430546
RefSeq:XP_639351.1 ProteinModelPortal:Q54RY6 SMR:Q54RY6
PRIDE:Q54RY6 EnsemblProtists:DDB0220492 GeneID:8623792
KEGG:ddi:DDB_G0282835 OMA:WSTASSC Uniprot:Q54RY6
Length = 467
Score = 117 (46.2 bits), Expect = 3.0e-06, Sum P(2) = 3.0e-06
Identities = 37/137 (27%), Positives = 57/137 (41%)
Query: 375 TLLSAA-NKSDIP--NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
TL+ A N D+P + + N N+ N ++ N+ N NT NG +
Sbjct: 108 TLIQACLNTPDVPPVSKDDGNNNNGNNSNNNNNSNNNNSSNNNNNGNNNNGNTNNNNGNN 167
Query: 432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
N N N + N Y N N + N N N N + E + N + +N+I
Sbjct: 168 N-NSNNNNSGNNNNNNNNNSYNNNNNNNNNNNNN--NNNNNCKEEQNMNIPNERKSKNNI 224
Query: 492 NGNGTSENRSNDNSYQN 508
N N ++N +N N+ QN
Sbjct: 225 NNNNNNQN-NNQNNNQN 240
Score = 73 (30.8 bits), Expect = 3.0e-06, Sum P(2) = 3.0e-06
Identities = 27/85 (31%), Positives = 36/85 (42%)
Query: 491 INGNGTSENRSNDNSYQN--EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
INGNG + N +N+N+ N E + + S N S N +N + T
Sbjct: 316 INGNGMNGNNNNNNNSNNIPEYGQVIIQSYRGSN--SGGNNSSNNTSTNTNTNTNTNTNN 373
Query: 549 KLVKENSINGNGTSENRSNDNSYQN 573
NS NGN S N SN+ QN
Sbjct: 374 NNNNSNSSNGNN-SNNNSNNILPQN 397
>MGI|MGI:1919293 [details] [associations]
symbol:Sulf2 "sulfatase 2" species:10090 "Mus musculus"
[GO:0001822 "kidney development" evidence=IGI] [GO:0002063
"chondrocyte development" evidence=IMP] [GO:0003094 "glomerular
filtration" evidence=IGI] [GO:0003824 "catalytic activity"
evidence=IEA] [GO:0004065 "arylsulfatase activity" evidence=ISO]
[GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005615
"extracellular space" evidence=ISO] [GO:0005783 "endoplasmic
reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
evidence=IEA] [GO:0005886 "plasma membrane" evidence=IDA]
[GO:0006790 "sulfur compound metabolic process" evidence=ISO]
[GO:0008152 "metabolic process" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=ISO;IMP]
[GO:0008484 "sulfuric ester hydrolase activity" evidence=IEA]
[GO:0009986 "cell surface" evidence=ISO] [GO:0010575 "positive
regulation vascular endothelial growth factor production"
evidence=IGI] [GO:0014846 "esophagus smooth muscle contraction"
evidence=IGI] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0030177 "positive regulation of Wnt receptor signaling pathway"
evidence=ISO] [GO:0030201 "heparan sulfate proteoglycan metabolic
process" evidence=ISO;IMP] [GO:0032836 "glomerular basement
membrane development" evidence=IGI] [GO:0035860 "glial cell-derived
neurotrophic factor receptor signaling pathway" evidence=IDA]
[GO:0040037 "negative regulation of fibroblast growth factor
receptor signaling pathway" evidence=IGI] [GO:0046872 "metal ion
binding" evidence=IEA] [GO:0048706 "embryonic skeletal system
development" evidence=IGI] [GO:0051216 "cartilage development"
evidence=IMP] [GO:0060348 "bone development" evidence=IGI]
[GO:0060384 "innervation" evidence=IGI] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 MGI:MGI:1919293 GO:GO:0005783
GO:GO:0005886 GO:GO:0005615 GO:GO:0009986 GO:GO:0005795
GO:GO:0005509 GO:GO:0010575 GO:GO:0060348 Gene3D:3.40.720.10
SUPFAM:SSF53649 GO:GO:0030177 GO:GO:0003094 eggNOG:COG3119
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0004065
GeneTree:ENSGT00400000022041 GO:GO:0048706 GO:GO:0002063
GO:GO:0040037 GO:GO:0032836 GO:GO:0060384 GO:GO:0008449
GO:GO:0030201 KO:K14607 HOVERGEN:HBG056431 GO:GO:0014846
GO:GO:0035860 InterPro:IPR024609 Pfam:PF12548 CTD:55959 OMA:PKYYGQG
OrthoDB:EOG49KFPX EMBL:AY101177 EMBL:AK008108 EMBL:AK028874
EMBL:AK034712 EMBL:AK036685 EMBL:AK049170 EMBL:AK081643
EMBL:AK133336 EMBL:AK165183 EMBL:AL589873 EMBL:BC027238
EMBL:BC141086 IPI:IPI00268030 RefSeq:NP_001239507.1
RefSeq:NP_001239508.1 RefSeq:NP_082348.2 UniGene:Mm.1011
ProteinModelPortal:Q8CFG0 SMR:Q8CFG0 STRING:Q8CFG0
PhosphoSite:Q8CFG0 PRIDE:Q8CFG0 Ensembl:ENSMUST00000088086
Ensembl:ENSMUST00000109249 GeneID:72043 KEGG:mmu:72043
InParanoid:B2RUD5 NextBio:335292 Bgee:Q8CFG0 CleanEx:MM_SULF2
Genevestigator:Q8CFG0 GermOnline:ENSMUSG00000006800 Uniprot:Q8CFG0
Length = 875
Score = 140 (54.3 bits), Expect = 3.1e-06, Sum P(2) = 3.1e-06
Identities = 58/223 (26%), Positives = 93/223 (41%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II +L DD DV G Q+ + G N + T +C PSRS+I+TGK
Sbjct: 44 PNIILVLTDD---QDVEL-GSMQVMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGK 99
Query: 119 HPIHTGMQHNVLYGCERGGLPL-----SEKILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ +H HN E P + YL GYRT GK+ L Y Y P
Sbjct: 100 Y-VHN---HNTYTNNENCSSPSWQAQHESRTFAVYLNSTGYRTAFFGKY-LNEYNGSYVP 154
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G++ +G + +++++ + G+ + + + D Y TD+ T ++V
Sbjct: 155 P--GWKEWVGLLKNSR-FYNYT---LCRNGVKEKHGSDYSTD----YLTDLITNDSVSFF 204
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 205 RTSKKMYPHRPVLMVISHAAPHGPEDSAP-QYSRLFPNASQHI 246
Score = 56 (24.8 bits), Expect = 3.1e-06, Sum P(2) = 3.1e-06
Identities = 47/231 (20%), Positives = 87/231 (37%)
Query: 234 HNHSTDEPLFLYL-AHAATHSANPYEPLQAPD-HYLN--------IHRHIEDFKRSKFAA 283
H P + L +A+ H Y PD H++ IH + + K
Sbjct: 226 HGPEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQ 285
Query: 284 ILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEG 343
L +D+S+ + + L + L N+ I++ +D + P +E
Sbjct: 286 TLMSVDDSMETIYDMLVETGELDNTYILYTADHGYHIGQFGLVKGKSMP--------YEF 337
Query: 344 GVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENS 403
+R + P +E+ + +++ D PT+L A DIP ++ ++I+ ++
Sbjct: 338 DIRVPFYVRGPNVEAGSLNPHIVLNI-DLAPTILDIAGL-DIPADMDG--KSILKLLDSE 393
Query: 404 ILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE-YNPKYE 453
R N H R+ + E G + K E N E + PKY+
Sbjct: 394 --RPVNRFHLKKKLRVWRDSFLVERGKLLH--KREGDKVNAQEENFLPKYQ 440
>DICTYBASE|DDB_G0275409 [details] [associations]
symbol:DDB_G0275409 "RNA-binding region RNP-1
domain-containing protein" species:44689 "Dictyostelium discoideum"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
"nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
SMART:SM00360 dictyBase:DDB_G0275409 GO:GO:0000166
Gene3D:3.30.70.330 GO:GO:0003676 EMBL:AAFI02000013
RefSeq:XP_001134599.1 ProteinModelPortal:Q1ZXL1
EnsemblProtists:DDB0233346 GeneID:8619946 KEGG:ddi:DDB_G0275409
eggNOG:NOG288151 InParanoid:Q1ZXL1 OMA:DIKNGYA Uniprot:Q1ZXL1
Length = 737
Score = 145 (56.1 bits), Expect = 3.2e-06, P = 3.2e-06
Identities = 56/208 (26%), Positives = 91/208 (43%)
Query: 377 LSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPK 436
L+ N ++ N VN + + R SI R+ +G + N+ R N+N Y+N + YN
Sbjct: 453 LNDTNGNNTDNGVNYSQQRDRSR-SRSIERFRDGRNNRNNFR-NNNNNNYQNNNN-YNRN 509
Query: 437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTN-PRYENGT-HEYNIPRLENS---I 491
N N +YN NR YNG N+ N RY N H N + N+
Sbjct: 510 NINN-SNNNRDYNNSDRNR-----EFYNGNDNDRNNGDRYSNNNRHNINFNKRNNNDRNY 563
Query: 492 NGNGTSENRSNDNSYQNEIDGIDV-WSVLSRNEPSK--RNTILHNIDDEWQISALTRGKW 548
N N N +N+N+ + +G DV ++ ++ N + R+ N ++E++ + T
Sbjct: 564 NNNNNRFNNNNNNNNNSSNNGRDVDFNGINNNNNNNNYRDDNNFNNNEEFENNRRTYNND 623
Query: 549 KLVKENSINGNGTSENRSND---NSYQN 573
K + G S + S D N+Y N
Sbjct: 624 KKRSRSHSRGRSRSRSHSGDRRNNNYNN 651
>DICTYBASE|DDB_G0287645 [details] [associations]
symbol:DDB_G0287645 "DUF1222 family protein"
species:44689 "Dictyostelium discoideum" [GO:0008150
"biological_process" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
dictyBase:DDB_G0287645 EMBL:AAFI02000103 eggNOG:NOG81106
InterPro:IPR009613 Pfam:PF06762 RefSeq:XP_637151.1
EnsemblProtists:DDB0238347 GeneID:8626246 KEGG:ddi:DDB_G0287645
Uniprot:Q54K13
Length = 771
Score = 145 (56.1 bits), Expect = 3.4e-06, P = 3.4e-06
Identities = 48/190 (25%), Positives = 75/190 (39%)
Query: 403 SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHE 462
S+L Y T + I RY+ T N N +N + N N N +
Sbjct: 535 SLLEYSPFTTDKPPIYIRAQKYRYKFTTFN-NENINNNNDNNNNNDNNNNNNNNNNNNNN 593
Query: 463 YNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRN 522
N N N N N + N N+ N N + N SN+N+Y N + D + N
Sbjct: 594 NNNNNNNNNNNNNNNNNNNNN----NNNNNNNNNNNNDSNNNNYSNNNNNND-----NNN 644
Query: 523 EPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEI--DGIDV 580
+ + +N +N ++ + N+ N N + N +NDN+ QN + D +
Sbjct: 645 DNNNKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDNNSQNPVSYDNNEE 704
Query: 581 WSVLSRNEPS 590
+S NEPS
Sbjct: 705 DRNISTNEPS 714
Score = 133 (51.9 bits), Expect = 6.7e-05, P = 6.7e-05
Identities = 38/147 (25%), Positives = 58/147 (39%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N +D N N+ N N+ N + N+ N+N N + N N
Sbjct: 576 NNNDNNNNNNNNNNN---NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNDSNNNN 632
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
Y N + + +N +N + N N N N N + N N+ N N + N
Sbjct: 633 YSNNNNNNDNNNDNNNKNNNNNNNNNNNNNNNNNNNNNNNNNN-----NNNNNNNNNNNN 687
Query: 501 SNDNSYQNEI--DGIDVWSVLSRNEPS 525
+NDN+ QN + D + +S NEPS
Sbjct: 688 NNDNNSQNPVSYDNNEEDRNISTNEPS 714
>UNIPROTKB|F1RZ87 [details] [associations]
symbol:SGSH "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008484 "sulfuric ester hydrolase activity"
evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
GeneTree:ENSGT00390000013080 EMBL:CU655945
Ensembl:ENSSSCT00000018675 OMA:LCRAHRA Uniprot:F1RZ87
Length = 231
Score = 133 (51.9 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
Identities = 44/133 (33%), Positives = 69/133 (51%)
Query: 41 PLAFTLSMVFVDLVASSGPP-HIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILK 99
P+ + L + A G +++ ILADD G+ G + I TP++DALA I+ +
Sbjct: 6 PVGWVLLLALGLCCAQGGRRRNVLLILADDGGFES-GAYNNSAITTPHLDALARRSIVFR 64
Query: 100 NYYT-VQLCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLS--EKI--LPQYLKELGY 154
N +T V C+PSR++++TG P H N +YG + + +++ LP L G
Sbjct: 65 NAFTSVSSCSPSRASLLTGL-PQH----QNGMYGLHQDVHHFNSFDRVQSLPLLLGRAGV 119
Query: 155 RT--RIVGKWHLG 165
RT R GK H+G
Sbjct: 120 RTGSRHHGKKHVG 132
Score = 41 (19.5 bits), Expect = 3.7e-06, Sum P(2) = 3.7e-06
Identities = 8/21 (38%), Positives = 10/21 (47%)
Query: 239 DEPLFLYLAHAATHSANPYEP 259
D P FLY+A H +P
Sbjct: 173 DRPFFLYVAFHDPHRCGHSQP 193
>RGD|1306654 [details] [associations]
symbol:Bub3 "budding uninhibited by benzimidazoles 3 homolog (S.
cerevisiae)" species:10116 "Rattus norvegicus" [GO:0000070 "mitotic
sister chromatid segregation" evidence=ISO] [GO:0000776
"kinetochore" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO]
[GO:0007059 "chromosome segregation" evidence=ISO] [GO:0008608
"attachment of spindle microtubules to kinetochore" evidence=ISO]
[GO:0051983 "regulation of chromosome segregation" evidence=ISO]
[GO:0071173 "spindle assembly checkpoint" evidence=ISO] [GO:0005730
"nucleolus" evidence=ISO] InterPro:IPR017986 InterPro:IPR001680
InterPro:IPR015943 Pfam:PF00400 PROSITE:PS50082 PROSITE:PS50294
SMART:SM00320 RGD:1306654 Gene3D:2.130.10.10 SUPFAM:SSF50978
HOVERGEN:HBG002942 EMBL:AY325173 IPI:IPI00382243 UniGene:Rn.6897
ProteinModelPortal:Q7TP72 IntAct:Q7TP72 PRIDE:Q7TP72
UCSC:RGD:1306654 InParanoid:Q7TP72 ArrayExpress:Q7TP72
Genevestigator:Q7TP72 Uniprot:Q7TP72
Length = 628
Score = 143 (55.4 bits), Expect = 4.2e-06, P = 4.2e-06
Identities = 57/230 (24%), Positives = 86/230 (37%)
Query: 347 GAGLIWSPLLESRGIVAEQYV--HVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSI 404
G G SP + ++A Q+ H S+ S N S+ N N+ N NS
Sbjct: 369 GEGKSGSPKSQKHFLLALQFFIWHNSNSNNNNNSNNNNSNNNNSNNNNNSNNSSS-NNS- 426
Query: 405 LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN 464
N ++ NS SN N ++ N N N ++ N + N + N
Sbjct: 427 --NSNNSNSNNSSSNSTSNNSNSNNSNSNNSNNNNNNSNNSNSNNSNSNSNNSNNKNSNN 484
Query: 465 GPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
N N+N N + N N+ N N S N +N N+ N S S N+
Sbjct: 485 NSNNNNSNSNSNNSNNSSNNSSSSNNSNSNNNSNNSNNSNNSSNNSSS----SNNSNNKN 540
Query: 525 SKRNTILHNIDDEWQISALTRGKWKL-VKENSINGNGTSENRSNDNSYQN 573
+ N +N ++ S+ +S N N +S N SN+NS N
Sbjct: 541 NSNNNNSNNNNNSNSSSSNNNSNSNNNSNSSSSNNNSSSNNNSNNNSNNN 590
Score = 139 (54.0 bits), Expect = 1.1e-05, P = 1.1e-05
Identities = 47/190 (24%), Positives = 72/190 (37%)
Query: 409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
N ++ NS NSN N ++ N N N T + + N + N N
Sbjct: 405 NNSNNNNSNNNNNSNNSSSNNSNSNNSNSNNSSSNSTSNNSNSNNSNSNNSNNNNNNSNN 464
Query: 469 ENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRN 528
N+N N + N NS N N S + +++NS N + S + N + N
Sbjct: 465 SNSNNSNSNSNNSNNKNSNNNSNNNNSNSNSNNSNNSSNNSSSSNN--SNSNNNSNNSNN 522
Query: 529 TILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
+ +N + S + K NS N N ++ + SN+NS N S S N
Sbjct: 523 S--NNSSNNSSSSNNSNNKNNSNNNNSNNNNNSNSSSSNNNSNSNNNSN----SSSSNNN 576
Query: 589 PSKRNTILHN 598
S N +N
Sbjct: 577 SSSNNNSNNN 586
>DICTYBASE|DDB_G0281825 [details] [associations]
symbol:comB "Rab GTPase domain-containing protein"
species:44689 "Dictyostelium discoideum" [GO:0005525 "GTP binding"
evidence=ISS] [GO:0031154 "culmination involved in sorocarp
development" evidence=IMP] [GO:0016021 "integral to membrane"
evidence=ISS] [GO:0015031 "protein transport" evidence=IEA]
[GO:0007264 "small GTPase mediated signal transduction"
evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
InterPro:IPR001806 InterPro:IPR003579 InterPro:IPR005225
InterPro:IPR011990 Pfam:PF00071 PRINTS:PR00449 SMART:SM00175
dictyBase:DDB_G0281825 TIGRFAMs:TIGR00231 GO:GO:0016021
GO:GO:0007264 GO:GO:0000166 GenomeReviews:CM000152_GR GO:GO:0015031
Gene3D:1.25.40.10 EMBL:AAFI02000043 GO:GO:0031154 eggNOG:COG1100
InterPro:IPR025697 Pfam:PF13236 RefSeq:XP_640497.1
ProteinModelPortal:Q54T92 EnsemblProtists:DDB0214836 GeneID:8623311
KEGG:ddi:DDB_G0281825 InParanoid:Q54T92 OMA:NELASKF Uniprot:Q54T92
Length = 2107
Score = 149 (57.5 bits), Expect = 4.2e-06, P = 4.2e-06
Identities = 57/264 (21%), Positives = 107/264 (40%)
Query: 332 PLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIPNYVNS 391
PL + ++ G + G G LL S G+ EQ S+ L ++ S + N V+S
Sbjct: 312 PLNSSNHYIFSGSISG-GSNRDQLLSSNGL-REQD---SNSLSVNSNSGLASSV-NSVSS 365
Query: 392 TVE--NIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN 449
T N++ +S+ N ++ N+ + N+N N + N N N N
Sbjct: 366 TSSGSNLLTSSNSSVNNNSNNSNSINNNNV-NNNININNNNNTNNTNNNNIINNNNININ 424
Query: 450 PKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNE 509
+ + N N N+N N + N N+ N N + N +N N+ N
Sbjct: 425 ENSTSGINSNNSGNNINNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNN 484
Query: 510 IDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
+ I+ + + N + N +N ++ S+++ N+ N N + N +N+N
Sbjct: 485 TNSINNNNNNNNNNNNNNNNNNNNNNNN-NNSSISNNNNNNNNNNNNNNNNNNNNNNNNN 543
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRN 593
+ + + I+ ++ + N S N
Sbjct: 544 NSSSSNNNINNNNINTDNNSSNNN 567
Score = 140 (54.3 bits), Expect = 0.00019, Sum P(2) = 0.00019
Identities = 49/203 (24%), Positives = 79/203 (38%)
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY--ENGTHEY 433
LL+++N S N NS N N + N T+ N+ I N+N EN T
Sbjct: 372 LLTSSNSSVNNNSNNSNSINNNNVNNNININNNNNTNNTNNNNIINNNNININENSTSGI 431
Query: 434 NPKYE-NRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYN--IPRLENS 490
N N N + N N N + N N N N N ++ N + N+
Sbjct: 432 NSNNSGNNINNNNNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNNTNSINNN 491
Query: 491 INGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKL 550
N N + N +N+N+ N + S + N + N +N ++ + +
Sbjct: 492 NNNNNNNNNNNNNNNNNNNNNNNSSISNNNNNNNNNNNNNNNNNNNNNNNNNNSSSSNNN 551
Query: 551 VKENSINGNGTSENRSNDNSYQN 573
+ N+IN + S N +N+N N
Sbjct: 552 INNNNINTDNNSSNNNNNNMNNN 574
Score = 47 (21.6 bits), Expect = 0.00019, Sum P(2) = 0.00019
Identities = 14/45 (31%), Positives = 24/45 (53%)
Query: 559 NGTSENRSNDNSYQNEIDG--IDVWSVLSRNEPSKRN--TILHNI 599
+ T+ N SN+N+ N++ S LS N+P N T ++N+
Sbjct: 770 SSTNNNNSNNNNNNNQLQPPQTPTSSSLSVNQPFNLNSSTNINNL 814
>DICTYBASE|DDB_G0269328 [details] [associations]
symbol:DDB_G0269328 species:44689 "Dictyostelium
discoideum" [GO:0016021 "integral to membrane" evidence=IEA]
InterPro:IPR004240 Pfam:PF02990 dictyBase:DDB_G0269328
GO:GO:0016021 EMBL:AAFI02000005 RefSeq:XP_645881.2
EnsemblProtists:DDB0190180 GeneID:8616821 KEGG:ddi:DDB_G0269328
eggNOG:KOG1277 OMA:RVINECK Uniprot:Q55EA0
Length = 1140
Score = 146 (56.5 bits), Expect = 4.4e-06, P = 4.4e-06
Identities = 49/194 (25%), Positives = 80/194 (41%)
Query: 400 YENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENG 459
Y+ +Y N + +N+ I NS N + N N N + N N N
Sbjct: 680 YKFKYNQYFNYYNTFNNNSINNSINN-NNNINNINSIINNNNNNNNNNNNNNNNNNNNNN 738
Query: 460 THEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 519
+ N N N N N ++ N NS N N + N +N+N+ N + ID +++
Sbjct: 739 NNNNNNNNNNNNNNNNNNNSNS-NSSSNSNSNNNNNNNNNNNNNNNNNNNNNSIDNNNII 797
Query: 520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
+ N N NI++ + +K V N+IN S + +N NS N I+ I+
Sbjct: 798 NNNNNIISNINNSNINNNISSDGINTNGYK-VNNNNINN---SNDVNNINSATN-INNIN 852
Query: 580 VWSVLSRNEPSKRN 593
+ + + N S N
Sbjct: 853 ISNGNNNNNNSINN 866
Score = 133 (51.9 bits), Expect = 0.00026, Sum P(2) = 0.00026
Identities = 49/194 (25%), Positives = 74/194 (38%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N + I N +N+ NI NSI+ N + N+ N+N N + N N
Sbjct: 695 NNNSINNSINNN-NNI--NNINSIINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 751
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
N + + N N + N N N N N + + N N IN N +
Sbjct: 752 NNNNNNSNSNSSSNSNSNNNNNNNNNNNNNNNNNNNNNSIDNN-----NIINNNNNIISN 806
Query: 501 SNDNSYQNEI--DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
N+++ N I DGI+ N + ++NI+ I+ + NSIN
Sbjct: 807 INNSNINNNISSDGINTNGYKVNNNNINNSNDVNNINSATNINNINISNGNNNNNNSINN 866
Query: 559 NG-TSENR-SNDNS 570
N S N S +NS
Sbjct: 867 NNIVSGNIISGNNS 880
Score = 47 (21.6 bits), Expect = 0.00026, Sum P(2) = 0.00026
Identities = 10/20 (50%), Positives = 13/20 (65%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
NS N N S N +N+N+Y N
Sbjct: 1043 NSNNNNNNS-NSNNNNNYNN 1061
>DICTYBASE|DDB_G0280599 [details] [associations]
symbol:fhkB "forkhead-associated kinase protein B"
species:44689 "Dictyostelium discoideum" [GO:0016772 "transferase
activity, transferring phosphorus-containing groups" evidence=IEA]
[GO:0006468 "protein phosphorylation" evidence=IEA] [GO:0005524
"ATP binding" evidence=IEA] [GO:0004674 "protein serine/threonine
kinase activity" evidence=IEA] [GO:0004672 "protein kinase
activity" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] [GO:0016740 "transferase activity" evidence=IEA]
[GO:0016310 "phosphorylation" evidence=IEA] [GO:0016301 "kinase
activity" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000253 InterPro:IPR000719
InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR008984
InterPro:IPR011009 InterPro:IPR017441 Pfam:PF00069 Pfam:PF00498
PROSITE:PS00107 PROSITE:PS00108 PROSITE:PS50006 PROSITE:PS50011
SMART:SM00220 SMART:SM00240 dictyBase:DDB_G0280599 GO:GO:0005524
GenomeReviews:CM000152_GR eggNOG:COG0515 SUPFAM:SSF56112
GO:GO:0004674 Gene3D:2.60.200.20 SUPFAM:SSF49879 EMBL:AAFI02000037
InterPro:IPR020636 PANTHER:PTHR24347 HSSP:O43293
RefSeq:XP_001134559.1 ProteinModelPortal:Q1ZXH2
EnsemblProtists:DDB0233266 GeneID:8622630 KEGG:ddi:DDB_G0280599
InParanoid:Q1ZXH2 Uniprot:Q1ZXH2
Length = 1142
Score = 146 (56.5 bits), Expect = 4.4e-06, P = 4.4e-06
Identities = 35/135 (25%), Positives = 55/135 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYE-N 439
N ++ N + + N Y NS + N H +N N N N H +N + N
Sbjct: 993 NNNNNNNTNTNNINNNNNNYNNSH-NHNNNNHNHN----HNLNNHNHNNNHHHNHNHNHN 1047
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
N H +N + + + + H N N N N N + N N+ N N + N
Sbjct: 1048 HNHNHNHNHNHNHNHNHNHNNHNNNNNNNNNNNNNNNNNNNNNNN---NNNNNNNNNNNN 1104
Query: 500 RSNDNSYQNEIDGID 514
+N+N Y N I+ I+
Sbjct: 1105 NNNNNYYNNNINNIN 1119
Score = 137 (53.3 bits), Expect = 4.1e-05, P = 4.1e-05
Identities = 35/131 (26%), Positives = 53/131 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN- 439
N ++ N N+ N I N+I N + N+ N+N N + N Y N
Sbjct: 958 NNNNNNNNNNNNNNNNINNNNNNI--NNNNINNNNNNNNNNNNNTNTNNINNNNNNYNNS 1015
Query: 440 -RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-NGTHEYNIPRLENSINGNGTS 497
+ N H +N N N H +N N N N + N H +N N+ N N +
Sbjct: 1016 HNHNNNNHNHNHNLNNHNHNNNHHHNHNHNHNHNHNHNHNHNHNHNHNHNHNNHNNNNNN 1075
Query: 498 ENRSNDNSYQN 508
N +N+N+ N
Sbjct: 1076 NNNNNNNNNNN 1086
>DICTYBASE|DDB_G0275313 [details] [associations]
symbol:dhx9 "ATP-dependent RNA helicase"
species:44689 "Dictyostelium discoideum" [GO:0008026 "ATP-dependent
helicase activity" evidence=IEA] [GO:0005524 "ATP binding"
evidence=IEA] [GO:0004386 "helicase activity" evidence=IEA]
[GO:0003725 "double-stranded RNA binding" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0005737 "cytoplasm"
evidence=ISS] [GO:0005634 "nucleus" evidence=ISS] [GO:0004004
"ATP-dependent RNA helicase activity" evidence=ISS] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR001159 InterPro:IPR001650
InterPro:IPR002464 InterPro:IPR007502 InterPro:IPR011545
Pfam:PF00035 Pfam:PF00270 Pfam:PF00271 Pfam:PF04408 PROSITE:PS00690
PROSITE:PS50137 PROSITE:PS51194 SMART:SM00358 SMART:SM00490
SMART:SM00847 dictyBase:DDB_G0275313 GO:GO:0005524 GO:GO:0005634
GO:GO:0005737 GenomeReviews:CM000151_GR EMBL:AAFI02000013
GO:GO:0003725 GO:GO:0004003 InterPro:IPR014001 SMART:SM00487
PROSITE:PS51192 eggNOG:COG1643 InterPro:IPR011709 Pfam:PF07717
GO:GO:0004004 RefSeq:XP_643861.1 ProteinModelPortal:Q869Z1
EnsemblProtists:DDB0233740 GeneID:8619912 KEGG:ddi:DDB_G0275313
InParanoid:Q869Z1 OMA:CEYLLEN Uniprot:Q869Z1
Length = 1472
Score = 147 (56.8 bits), Expect = 4.6e-06, P = 4.6e-06
Identities = 49/222 (22%), Positives = 85/222 (38%)
Query: 387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
+Y NS N NS N + N+ N+N N + YN + + + +
Sbjct: 6 SYNNSYNSN--NNNNNSYNSNNNNNNNNNNNNNNNNNNNSNNNNNNYNNNFSSGGRSNYN 63
Query: 447 EYNP--KYENRYENGTHEYNGPK---NENTNPRYENGTHEYNIPRLE---NSINGNGTSE 498
YN Y N + N + Y G N+N + + G YN N+ N N +
Sbjct: 64 NYNNYNSYNNDFNNSNNNYRGNSVFGNKNNSYLNKGGNKVYNTSNSNINYNNNNNNNNNN 123
Query: 499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE--NSI 556
N +N+NS N + + + N + N +N + + ++ NS+
Sbjct: 124 NNNNNNSNNNNNNNNQGQTSYNNNNNNSNNNNNNNNQGQTSYNNNNNINNNIINSGLNSL 183
Query: 557 NGNGTSENRSNDNSYQNEIDGIDVWS-VLSRNEPSKRNTILH 597
N N + N +N + +N I+ +L + +P N I H
Sbjct: 184 NNNNNNNNNNNYSGLENNINNYQQTPPILQQQQPLLSNPINH 225
>DICTYBASE|DDB_G0287625 [details] [associations]
symbol:DDB_G0287625 species:44689 "Dictyostelium
discoideum" [GO:0005615 "extracellular space" evidence=IDA]
dictyBase:DDB_G0287625 GO:GO:0005615 EMBL:AAFI02000103
RefSeq:XP_637128.1 EnsemblProtists:DDB0187557 GeneID:8626223
KEGG:ddi:DDB_G0287625 eggNOG:NOG285146 OMA:ESTERNE Uniprot:Q54K36
Length = 981
Score = 145 (56.1 bits), Expect = 4.7e-06, P = 4.7e-06
Identities = 47/208 (22%), Positives = 88/208 (42%)
Query: 390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE---NGTH 446
NST ENI ++ +L+ +N + N + + + + E+ E N +R++ + T
Sbjct: 566 NSTPENISTDLDSPLLK-KN--QQLNLIKEQTNKLKTEDSIDESNNNGNDRFKTKCSSTE 622
Query: 447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
N EN N + N P N N N N + N N+ N N + N +N+N+
Sbjct: 623 NENKNRENEKNNSENSKNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 682
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDD-EWQISALTRGKWKLVKENSINGNGTSENR 565
N + + + + N + N +N ++ + K + +N+ N + S N
Sbjct: 683 NNNNNNNNNNNNNNNNSNNNNNPNNYNNNNPNNNPNNNNNNNNKNINKNNSNNSNNSNNS 742
Query: 566 SNDNSYQNEIDGIDVWSVLSRNEPSKRN 593
SN + N + + + L+ N P+ N
Sbjct: 743 SNSRNNSNNSNNNNNNNNLNNNNPNNNN 770
Score = 140 (54.3 bits), Expect = 1.6e-05, P = 1.6e-05
Identities = 40/167 (23%), Positives = 65/167 (38%)
Query: 409 NGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKN 468
N + N+ N+N N + N N N + N + N + YN N
Sbjct: 654 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNPNNYNN-NN 712
Query: 469 ENTNPRYENGTHEYNIPR--LENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK 526
N NP N + NI + NS N N +S +R+N N+ N + + L+ N P+
Sbjct: 713 PNNNPNNNNNNNNKNINKNNSNNSNNSNNSSNSRNNSNNSNNNNNNNN----LNNNNPNN 768
Query: 527 RNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
N +N ++ + N+ N N + N +N+N N
Sbjct: 769 NNPNNNNPNNNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNKNNN 815
Score = 133 (51.9 bits), Expect = 9.1e-05, P = 9.1e-05
Identities = 40/161 (24%), Positives = 61/161 (37%)
Query: 380 ANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYEN 439
+N ++ PN N+ N P N+ ++ NS NSN N + N N
Sbjct: 699 SNNNNNPNNYNNNNPNNNPNNNNN--NNNKNINKNNSNNSNNSNNS-SNSRNNSNNSNNN 755
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
N + NP N N + N P N N N N + N N+ N N N
Sbjct: 756 NNNNNLNNNNPNNNNPNNNNPNN-NNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNKNN 814
Query: 500 RSNDNSYQNEIDGIDVWSVLSRNEPS--KRNTILHNIDDEW 538
+N+NS+ E + + + P K N + ++ EW
Sbjct: 815 NNNNNSFSEEEEEEGSLNQVRNISPKIGKYNEDISFLEKEW 855
Score = 131 (51.2 bits), Expect = 0.00015, P = 0.00015
Identities = 45/187 (24%), Positives = 67/187 (35%)
Query: 391 STVENIIPRYENSILRYENGTHE--YNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
S+ EN EN EN + N+P N+N N + N N N +
Sbjct: 619 SSTENENKNRENEKNNSENSKNNPNNNNPNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 678
Query: 449 NPKYENRYENGTHEYNGPKNENTN--PRYENGTHEYNIPRLENSINGNGTSENRSNDNSY 506
N N N + N N N N P N + N P N+ N ++N SN+++
Sbjct: 679 NNNNNNNNNNNNNNNNNNNNSNNNNNPNNYNNNNPNNNPNNNNNNNNKNINKNNSNNSNN 738
Query: 507 QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRS 566
N S S N + N L+N + N+ N N + N +
Sbjct: 739 SNNSSNSRNNSNNSNNNNNNNN--LNNNNPNNNNPNNNNPNNNNPNNNNPNNNNNNNNNN 796
Query: 567 NDNSYQN 573
N+N+ N
Sbjct: 797 NNNNNNN 803
>DICTYBASE|DDB_G0283357 [details] [associations]
symbol:DDB_G0283357 "unknown" species:44689
"Dictyostelium discoideum" [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0003674 "molecular_function" evidence=ND]
dictyBase:DDB_G0283357 EMBL:AAFI02000054 RefSeq:XP_639110.1
ProteinModelPortal:Q54R73 EnsemblProtists:DDB0302625 GeneID:8624043
KEGG:ddi:DDB_G0283357 OMA:NSANEND Uniprot:Q54R73
Length = 1247
Score = 146 (56.5 bits), Expect = 4.9e-06, P = 4.9e-06
Identities = 59/208 (28%), Positives = 78/208 (37%)
Query: 374 PTLLSAANKSDIPNYV---NSTVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYE 427
PT S SD N V N+ N P NS N + N+ NS+
Sbjct: 152 PTFKSLDLSSDTVNSVGAANNGSSNSSPTINGISNSNTMNNNNNNNNNNNNNSNSSNNNN 211
Query: 428 NGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN-GPKNENTNPRYENGTHEYNIPR 486
NG + N Y N + N T N N Y N T+ N G N N N N N
Sbjct: 212 NGNNNNNNNY-NSFVNITKNNNNTNSNNYNNSTNSNNNGYNNNNNNNSISNSNSNSNSNS 270
Query: 487 LENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRG 546
NS N N S + SN NS N + S S N + N +N ++ S+ + G
Sbjct: 271 NSNS-NSNSNSNSNSNSNSNSNSNSSSN--SSSSSNNNNNNNNNNNNNNNSSSSSSNSNG 327
Query: 547 KWKLVKENSINGNGTSENRSNDN-SYQN 573
N+ + G S ++ N SY N
Sbjct: 328 N----NNNNYHSYGYSNSKYNQQKSYNN 351
Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
Identities = 52/204 (25%), Positives = 89/204 (43%)
Query: 398 PRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHE--YNPKYEN- 454
P + N I +N T++ S + N++ Y N N + N Y NG++ YN N
Sbjct: 4 PIFSNDIDSIKNNTYQAKSYQKYNNSNNYNNNN---NNSFNN-YSNGSNYGGYNNSGNNS 59
Query: 455 RYENGTHEYNGPK--NENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDG 512
Y N + YN N N N N + N N+IN N ++ N +N+N+ N +
Sbjct: 60 NYNNNNNLYNNNNINNNNNNNNNNNINNNNNNINNNNNINNNNSNNNNNNNNNNSNSNNS 119
Query: 513 IDVWSVLSRNEPSKRNTILHN---IDDEWQISALTRGKWKLVKENSINGNGTSENRSNDN 569
I+ S N P++ H+ I+ + T L +++N G + N S+++
Sbjct: 120 INSNSY-KVNTPTQNGKSSHSPPLINANANVVFPTFKSLDL-SSDTVNSVGAANNGSSNS 177
Query: 570 SYQNEIDGIDVWSVLSRNEPSKRN 593
S I+GI + ++ N + N
Sbjct: 178 S--PTINGISNSNTMNNNNNNNNN 199
Score = 125 (49.1 bits), Expect = 0.00088, P = 0.00088
Identities = 51/202 (25%), Positives = 74/202 (36%)
Query: 402 NSILRYENGTHEYNSPRIE---NSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYEN 458
NS+ NG+ +SP I NSNT N + N + N + N N Y +
Sbjct: 165 NSVGAANNGSSN-SSPTINGISNSNTMNNNNNNNNNNNNNSNSSNNNNNGNNNNNNNYNS 223
Query: 459 GTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSV 518
+ N N+N Y N T+ N N+ N N S + SN NS N + S
Sbjct: 224 FVNITKNNNNTNSN-NYNNSTNSNN-NGYNNNNNNNSISNSNSNSNSNSNSNSNSNSNSN 281
Query: 519 LSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGI 578
+ N S N+ N S+ N+ N N +S + SN N N
Sbjct: 282 SNSNSNSNSNS---NSSSNSSSSSNNNNN---NNNNNNNNNNSSSSSSNSNGNNNNNYHS 335
Query: 579 DVWSVLSRNEPSKRNTILHNID 600
+S N+ N H ++
Sbjct: 336 YGYSNSKYNQQKSYNNAPHQLN 357
>DICTYBASE|DDB_G0277589 [details] [associations]
symbol:gtaC "GATA zinc finger domain-containing
protein 3" species:44689 "Dictyostelium discoideum" [GO:0005634
"nucleus" evidence=IDA] [GO:0031149 "sorocarp stalk cell
differentiation" evidence=IMP] [GO:0005737 "cytoplasm"
evidence=IDA] [GO:0043565 "sequence-specific DNA binding"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0006355 "regulation of transcription, DNA-dependent"
evidence=IEA] [GO:0003700 "sequence-specific DNA binding
transcription factor activity" evidence=IEA] [GO:0046872 "metal ion
binding" evidence=IEA] InterPro:IPR000679 InterPro:IPR013088
Pfam:PF00320 PROSITE:PS00344 PROSITE:PS50114 SMART:SM00401
dictyBase:DDB_G0277589 GO:GO:0005737 GO:GO:0046872 GO:GO:0043565
GO:GO:0008270 Gene3D:3.30.50.10 GenomeReviews:CM000151_GR
GO:GO:0003700 EMBL:AAFI02000020 eggNOG:COG5641 GO:GO:0031149
RefSeq:XP_642533.1 HSSP:P17678 ProteinModelPortal:Q75JZ1
EnsemblProtists:DDB0220470 GeneID:8621095 KEGG:ddi:DDB_G0277589
OMA:SNIRVEE Uniprot:Q75JZ1
Length = 587
Score = 142 (55.0 bits), Expect = 4.9e-06, P = 4.9e-06
Identities = 53/235 (22%), Positives = 101/235 (42%)
Query: 372 WLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
++P+ + + S + N VN ++ N+ N+ Y N + YN+ I N+N N +
Sbjct: 5 YIPSPIYSDQNSGVHN-VNKSLHNLNINNGNNNYNYSN--NNYNN-NINNNNN-INNNIN 59
Query: 432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
N N N ++Y+ + ++Y + + N N N N + NI N+I
Sbjct: 60 NNNNNNNNNNNNNINQYHQNHYDQYSDNNCNNSNSNNINNNNNINNNINNNNINNNNNNI 119
Query: 492 NGNGTSENRSNDNSYQN--EIDGIDVW--SVLSRNEPSKRNTILHNIDDEWQISALTRGK 547
N N + N +N+N+ N +I +++ V N S N + + I + +S +
Sbjct: 120 NSNNNNNNNNNNNNNNNLLKIPQLNISPNGVGGGNGISNGNGV-NKIFSKLDLSKVPNS- 177
Query: 548 WKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE--PSKRNTILHNID 600
++L +S+ + TS N S ++ + S+L P+ + HN D
Sbjct: 178 YQLAHNSSMPNSPTSSNISPSTPTSMALNLSSLKSILDSPPAAPAHSASSSHNND 232
>DICTYBASE|DDB_G0283697 [details] [associations]
symbol:DDB_G0283697 "unknown" species:44689
"Dictyostelium discoideum" [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0003674 "molecular_function" evidence=ND]
dictyBase:DDB_G0283697 EMBL:AAFI02000056 RefSeq:XP_638971.1
HSSP:Q9BYW2 ProteinModelPortal:Q54QQ2 EnsemblProtists:DDB0237901
GeneID:8624217 KEGG:ddi:DDB_G0283697 OMA:PPKENFF Uniprot:Q54QQ2
Length = 853
Score = 144 (55.7 bits), Expect = 5.0e-06, P = 5.0e-06
Identities = 40/183 (21%), Positives = 74/183 (40%)
Query: 414 YNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYE-NRYENGTHEYNGPKNENTN 472
Y ++ +N N ++ + N + N + YN N + T YN N N+N
Sbjct: 502 YRDDSLQQNNDNNNNSSNNNSNNSNNNFNNDNNPYNNSNNYNMNNSNTSPYNNSNNSNSN 561
Query: 473 PRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILH 532
Y N N N+ N N + N +N+N+ N + + ++ + N + +
Sbjct: 562 SSYYNDNDYNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNFNNSNSNSSESKPNYFN 621
Query: 533 NIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVL--SRNEPS 590
N+ + + + +T+ + K N N N + N N N + ID + L S++ P
Sbjct: 622 NLSNVF--NQITK-PLENYKNNGKNENNNNNNNKNKNEDEKRIDLVQTKLSLKSSKSTPQ 678
Query: 591 KRN 593
N
Sbjct: 679 TYN 681
Score = 130 (50.8 bits), Expect = 0.00016, P = 0.00016
Identities = 35/172 (20%), Positives = 74/172 (43%)
Query: 398 PRYENSILR-YENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRY 456
P ++ + + Y + + + N+ +N+N N ++ N + N +N + + Y N
Sbjct: 492 PNFQIPLSKPYRDDSLQQNN---DNNNNSSNNNSNNSNNNFNN--DNNPYNNSNNY-NMN 545
Query: 457 ENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVW 516
+ T YN N N+N Y N N N+ N N + N +N+N+ N + + +
Sbjct: 546 NSNTSPYNNSNNSNSNSSYYNDNDYNNNNNNNNNSNNNNNNNNNNNNNNNNNNNNNNNNF 605
Query: 517 SVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSND 568
+ + N + +N+ + + +K +N N N ++N++ D
Sbjct: 606 NNSNSNSSESKPNYFNNLSNVFNQITKPLENYKNNGKNENNNNNNNKNKNED 657
>DICTYBASE|DDB_G0292046 [details] [associations]
symbol:DDB_G0292046 "Ubiquitin carboxyl-terminal
hydrolase 34" species:44689 "Dictyostelium discoideum" [GO:0006511
"ubiquitin-dependent protein catabolic process" evidence=IEA]
[GO:0004221 "ubiquitin thiolesterase activity" evidence=IEA]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008234
"cysteine-type peptidase activity" evidence=IEA] [GO:0008233
"peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
evidence=IEA] InterPro:IPR000626 InterPro:IPR001394
InterPro:IPR018200 Pfam:PF00443 PROSITE:PS00972 PROSITE:PS00973
PROSITE:PS50235 SMART:SM00213 dictyBase:DDB_G0292046
EMBL:AAFI02000187 GO:GO:0008234 GO:GO:0006511 GO:GO:0004221
eggNOG:COG5077 RefSeq:XP_629788.1 ProteinModelPortal:Q54DT4
EnsemblProtists:DDB0184183 GeneID:8628465 KEGG:ddi:DDB_G0292046
InParanoid:Q54DT4 OMA:ISKECTH Uniprot:Q54DT4
Length = 3240
Score = 150 (57.9 bits), Expect = 5.3e-06, P = 5.3e-06
Identities = 48/194 (24%), Positives = 77/194 (39%)
Query: 406 RYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNG 465
+ +N + N+ N+N N N EN N + YN Y N Y N N
Sbjct: 427 KQDNNNNNNNNNNNNNNNNNNNNNNVNCNFNSENS-NNNNNNYNNNYNNNYNNSNSSSNN 485
Query: 466 PKNENTNPR-YENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
N N N NG + N N+ N N + N +N+N+ N + + N
Sbjct: 486 NNNSNDNGNGNSNGINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN-------NNNNN 538
Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVL 584
+ N +N ++ ++++ G NS N N ++ + SN NS N + + +V
Sbjct: 539 NNNNNNNNNNNNNNNGNSISNGNNN--SNNSNNSNNSNNSNSNSNSNNNNSNNNNNSNVN 596
Query: 585 SRNEPSKRNTILHN 598
S N + IL N
Sbjct: 597 SPNPQILYDWILKN 610
Score = 140 (54.3 bits), Expect = 0.00023, Sum P(2) = 0.00023
Identities = 51/194 (26%), Positives = 72/194 (37%)
Query: 382 KSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
K D N N+ N N+ N +NS ENSN N + YN Y N Y
Sbjct: 427 KQDNNNNNNNNNNN---NNNNNNNNNNNVNCNFNS---ENSN----NNNNNYNNNYNNNY 476
Query: 442 ENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS 501
N N N +NG NG N N N N + N N+ N N + N +
Sbjct: 477 NNSNSSSNNN-NNSNDNGNGNSNGINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 535
Query: 502 NDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGT 561
N+N+ N + + N + N+I + ++ + NS N N
Sbjct: 536 NNNNNNNNNNN-------NNNNNNNGNSISNGNNNSNNSNNSNNSNNSNSNSNSNNNNSN 588
Query: 562 SENRSNDNSYQNEI 575
+ N SN NS +I
Sbjct: 589 NNNNSNVNSPNPQI 602
Score = 50 (22.7 bits), Expect = 0.00023, Sum P(2) = 0.00023
Identities = 14/43 (32%), Positives = 23/43 (53%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE--PSKRNT 594
N+ N N + N +N+N+ N +G + S+ S +E P NT
Sbjct: 1592 NNNNNNNNNNNNNNNNNNNNNSNG-NSNSLTSSSERMPGTPNT 1633
>DICTYBASE|DDB_G0288611 [details] [associations]
symbol:DDB_G0288611 species:44689 "Dictyostelium
discoideum" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
dictyBase:DDB_G0288611 GO:GO:0000166 Gene3D:3.30.70.330
GO:GO:0003676 EMBL:AAFI02000118 RefSeq:XP_636639.1
EnsemblProtists:DDB0220605 GeneID:8626713 KEGG:ddi:DDB_G0288611
eggNOG:NOG283861 InParanoid:Q54IP7 Uniprot:Q54IP7
Length = 524
Score = 141 (54.7 bits), Expect = 5.3e-06, P = 5.3e-06
Identities = 38/135 (28%), Positives = 59/135 (43%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN---PKY 437
N+++ NY NS+ +N Y N+ Y N + N + + + +E N P
Sbjct: 280 NRNNRDNYNNSSRDNYNNNYNNNYNNYNNNNNNNNDDSYRGAVSFNDENNNEENSIVPNN 339
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
EN+ + Y YE+ Y +G+ N N N N Y N EYN + +NS N +
Sbjct: 340 ENKSNDFNKGYGAFYESDYYDGSQFNNNNNNRNINNDYNN---EYN--KHKNSYNSENNN 394
Query: 498 ENRSNDNSYQNEIDG 512
N N+N+ N G
Sbjct: 395 NNNYNNNNNNNNNGG 409
Score = 121 (47.7 bits), Expect = 0.00079, P = 0.00079
Identities = 48/178 (26%), Positives = 73/178 (41%)
Query: 408 ENGTHEYNSPRIENSNTRYE--NGTHEY-NPKYENRYENGTHEYNPKYENRYENGTHEYN 464
EN + E N P+ + Y+ +G E N N +N + Y N Y N + YN
Sbjct: 249 EN-SFENNKPKHSQFSKEYQFLDGLIENDNRNNRNNRDNYNNSSRDNYNNNYNNNYNNYN 307
Query: 465 GPKNENTNPRYENGTHEYNIPRL--ENSINGNGTSENRSND-NSYQNEIDGIDVWSVLSR 521
N N + Y G +N ENSI N +EN+SND N D +
Sbjct: 308 NNNNNNNDDSYR-GAVSFNDENNNEENSIVPN--NENKSNDFNKGYGAFYESDYYDGSQF 364
Query: 522 NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
N + I ++ ++E+ + K EN+ N N + N +N+N E +G D
Sbjct: 365 NNNNNNRNINNDYNNEYN-----KHKNSYNSENNNNNNYNNNNNNNNNGGYGE-EGYD 416
>DICTYBASE|DDB_G0279041 [details] [associations]
symbol:DDB_G0279041 "unknown" species:44689
"Dictyostelium discoideum" [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0003674 "molecular_function" evidence=ND]
dictyBase:DDB_G0279041 EMBL:AAFI02000026 RefSeq:XP_641933.1
EnsemblProtists:DDB0266471 GeneID:8621845 KEGG:ddi:DDB_G0279041
OMA:NNGMMNQ Uniprot:Q54XC8
Length = 637
Score = 142 (55.0 bits), Expect = 5.5e-06, P = 5.5e-06
Identities = 57/224 (25%), Positives = 97/224 (43%)
Query: 383 SDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENGTHEYNPKYENRY 441
+D P++++ +++IP+Y N + N YN+ N+N N + YN N +
Sbjct: 11 NDSPSFLS---DDLIPQYNNQFQSLQQNPQLNYNNNN-NNNNNNNNNNNNNYNNNNNNNF 66
Query: 442 ENGTHEYNPK---YENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSE 498
+N N ++N N + N N N + N + N N+IN T
Sbjct: 67 KNDNLFQNSNLFVFQNDLNNNINININNNNFNNNNNFNNNINFNNFNN--NNINNGFTYS 124
Query: 499 NRSNDNSYQNEIDGIDV----WSVLSRNEPS----KRNTILHNIDDEWQISALTRGKWKL 550
N N+N N +G DV SV+S S N ++N+++ + T L
Sbjct: 125 NNQNNNFKPNN-NGCDVEYSDHSVISTPTSSIYNENENNNINNLNNNINNTDNTCNI--L 181
Query: 551 VKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRN-EPSKRN 593
N+ N N + N +N+++ QNE+ I+ S +S N E +N
Sbjct: 182 NNNNNSNNNDMNNNNNNNSNNQNEVTNIN--SNISPNYENQNQN 223
Score = 134 (52.2 bits), Expect = 4.1e-05, P = 4.1e-05
Identities = 55/203 (27%), Positives = 82/203 (40%)
Query: 390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN 449
N N I N+I +N + N+ NSN N + N +N N +
Sbjct: 157 NENENNNINNLNNNINNTDNTCNILNNNN--NSNNNDMNNNNNNNSNNQNEVTNINSNIS 214
Query: 450 PKYENRYENGTHEYNGPKNENTNPR---YENGTHEY----NI------PRL---ENSING 493
P YEN+ +N N N N P EN T++ NI P+L EN IN
Sbjct: 215 PNYENQNQNQNENENNSNNNNNKPNDNLVENNTNQITNPNNIDQQQEQPQLNQVENKINN 274
Query: 494 NGTSENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRGKWKLVK 552
N + N +N+N+ E V+ V + NE S IL D ++ +G ++
Sbjct: 275 NSNNNNINNNNNNSGEFCPDYVYFVNKQLNEFSNCLPILEK--DMPDFASTIKG---IIS 329
Query: 553 ENSINGNGTSENRSNDNSYQNEI 575
N + + +EN+S NS I
Sbjct: 330 PNIVGSSIKNENKSTPNSTSTSI 352
>UNIPROTKB|H0YB91 [details] [associations]
symbol:IDS "Iduronate 2-sulfatase 14 kDa chain"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149 GO:GO:0008484
EMBL:AC233288 HGNC:HGNC:5389 ChiTaRS:IDS Ensembl:ENST00000464251
Bgee:H0YB91 Uniprot:H0YB91
Length = 106
Score = 117 (46.2 bits), Expect = 5.9e-06, P = 5.9e-06
Identities = 29/79 (36%), Positives = 41/79 (51%)
Query: 85 TPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGM-QHNVLYGCERGGLPLSE 142
+PNID LA ++ +N + Q +C PSR + +TG+ P T + N + G
Sbjct: 2 SPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNF---- 57
Query: 143 KILPQYLKELGYRTRIVGK 161
+PQY KE GY T VGK
Sbjct: 58 STIPQYFKENGYVTMSVGK 76
>FB|FBgn0040271 [details] [associations]
symbol:Sulf1 "Sulfated" species:7227 "Drosophila
melanogaster" [GO:0008449 "N-acetylglucosamine-6-sulfatase
activity" evidence=ISS] [GO:0007389 "pattern specification process"
evidence=IMP] [GO:0018741 "alkyl sulfatase activity" evidence=ISS]
[GO:0008152 "metabolic process" evidence=ISS] [GO:0005783
"endoplasmic reticulum" evidence=ISS] [GO:0009986 "cell surface"
evidence=ISS] [GO:0005795 "Golgi stack" evidence=ISS] [GO:0017015
"regulation of transforming growth factor beta receptor signaling
pathway" evidence=IMP] [GO:0030111 "regulation of Wnt receptor
signaling pathway" evidence=IMP] [GO:0045880 "positive regulation
of smoothened signaling pathway" evidence=IMP] [GO:0045879
"negative regulation of smoothened signaling pathway" evidence=IMP]
[GO:0042059 "negative regulation of epidermal growth factor
receptor signaling pathway" evidence=IGI] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 GO:GO:0005783
EMBL:AE014297 GO:GO:0009986 GO:GO:0030111 GO:GO:0046872
GO:GO:0005795 GO:GO:0042059 Gene3D:3.40.720.10 SUPFAM:SSF53649
eggNOG:COG3119 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0008484 GeneTree:ENSGT00400000022041 GO:GO:0017015
GO:GO:0045879 GO:GO:0045880 KO:K14607 InterPro:IPR024609
Pfam:PF12548 EMBL:AY119658 EMBL:AF211192 RefSeq:NP_524987.1
UniGene:Dm.13781 ProteinModelPortal:Q9VEX0 SMR:Q9VEX0
DIP:DIP-21001N MINT:MINT-1598983 STRING:Q9VEX0 PaxDb:Q9VEX0
PRIDE:Q9VEX0 EnsemblMetazoa:FBtr0083273 GeneID:53437
KEGG:dme:Dmel_CG6725 UCSC:CG6725-RA CTD:23213 FlyBase:FBgn0040271
InParanoid:Q9VEX0 OMA:QWILQVT OrthoDB:EOG4GB5N2 PhylomeDB:Q9VEX0
GenomeRNAi:53437 NextBio:841154 Bgee:Q9VEX0 GermOnline:CG6725
Uniprot:Q9VEX0
Length = 1114
Score = 122 (48.0 bits), Expect = 6.3e-06, Sum P(2) = 6.3e-06
Identities = 54/222 (24%), Positives = 92/222 (41%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+II IL DD DV L+ +P + L G ++ YT +C P+RS+++TG
Sbjct: 54 PNIILILTDD---QDVELGSLNFMPR-TLRLLRDGGAEFRHAYTTTPMCCPARSSLLTGM 109
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKI--LPQYLKELGYRTRIVGKWHLGFYKKEYTPTFR 176
+ +H M C + + YL GYRT GK+ L Y Y P
Sbjct: 110 Y-VHNHMVFTNNDNCSSPQWQATHETRSYATYLSNAGYRTGYFGKY-LNKYNGSYIPP-- 165
Query: 177 GFESHLGYWTG---HQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ W G + Y+++S + + G ++ + A D Y D+ +++ +
Sbjct: 166 GWRE----WGGLIMNSKYYNYS---INLNGQKIKHGFDYAKD----YYPDLIANDSIAFL 214
Query: 234 HNHSTD---EPLFLYLAHAATHSANPYEPLQAPDHYLNIHRH 272
+ +P+ L ++ A H P Q + N+ H
Sbjct: 215 RSSKQQNQRKPVLLTMSFPAPHGPEDSAP-QYSHLFFNVTTH 255
Score = 74 (31.1 bits), Expect = 6.3e-06, Sum P(2) = 6.3e-06
Identities = 34/149 (22%), Positives = 62/149 (41%)
Query: 252 HSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSI 309
H+ NP + L+ + +H+ + +K L +D +V +V L++ L N+
Sbjct: 262 HAPNPDKQWILRVTEPMQPVHKRFTNLLMTKRLQTLQSVDVAVERVYNELKELGELDNTY 321
Query: 310 IVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHV 369
IV+ SD ++P +E VR LI P +++ +V E ++V
Sbjct: 322 IVYTSDHGYHLGQFGLIKGKSFP--------FEFDVRVPFLIRGPGIQASKVVNEIVLNV 373
Query: 370 SDWLPTLLSAANKSDIPNYVNSTVENIIP 398
D PT L +P + +I+P
Sbjct: 374 -DLAPTFLDMGG---VPTPQHMDGRSILP 398
Score = 54 (24.1 bits), Expect = 0.00069, Sum P(2) = 0.00069
Identities = 14/51 (27%), Positives = 26/51 (50%)
Query: 381 NKSDIPNYVNSTVENIIPRYENS--ILRYENGTHEYNSPRIENSNTRYENG 429
+K D+P N T+ +I + +++ IL + HE ++ +S YE G
Sbjct: 674 SKRDLPASSNETIAQVIQQIQSTLEILELKFNEHELHASN--SSGNSYERG 722
>DICTYBASE|DDB_G0273645 [details] [associations]
symbol:hbx5-2 "putative homeobox transcription
factor" species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003700
"sequence-specific DNA binding transcription factor activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007275
"multicellular organismal development" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] InterPro:IPR001356 InterPro:IPR002048
InterPro:IPR009057 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
PROSITE:PS50222 SMART:SM00389 dictyBase:DDB_G0273645
dictyBase:DDB_G0273127 GO:GO:0007275 GO:GO:0005634 GO:GO:0043565
GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
EMBL:AAFI02000011 EMBL:AAFI02000009 Gene3D:1.10.10.60
SUPFAM:SSF46689 RefSeq:XP_644439.1 RefSeq:XP_644811.1
ProteinModelPortal:Q557C9 EnsemblProtists:DDB0220481
EnsemblProtists:DDB0266662 GeneID:8618913 GeneID:8619064
KEGG:ddi:DDB_G0273127 KEGG:ddi:DDB_G0273645 OMA:THHINIF
ProtClustDB:CLSZ2431129 Uniprot:Q557C9
Length = 1723
Score = 153 (58.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 48/204 (23%), Positives = 82/204 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENG---THEYNPK 436
N ++ ++N+ N +S + ++N + +N+ NSN N +++YN
Sbjct: 85 NNNNNNQHMNNQYSNSFHNNNSSGFMAFQNNSSNFNNQNNNNSNNNNNNNNINSYDYNNS 144
Query: 437 YENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN 494
N Y N TH N N N + +N N N N N + N N+ N N
Sbjct: 145 NNNNYNNNNNTHSNNSNNNNNNNNSNY-WNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSN 203
Query: 495 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK--RNTILHNIDD---EWQISALTRGKWK 549
+ N +N+N + + +++P+ N I HN +D Q + G
Sbjct: 204 NNNNNNNNNNHHHHHHQ--------QQSQPTSPYNNPIQHNPNDMKFNGQHNPFN-GNQM 254
Query: 550 LVKENSINGNGTSENRSNDNSYQN 573
++ N+ N N + N N NS N
Sbjct: 255 VMDNNNNNNNNNNSNVFNSNSNSN 278
Score = 134 (52.2 bits), Expect = 0.00065, Sum P(2) = 0.00065
Identities = 47/188 (25%), Positives = 76/188 (40%)
Query: 387 NYVNSTVENIIPRYENSILRY-ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
++V+ ++N P+ Y +NG YN NSN N H N +Y N + N
Sbjct: 52 SFVSPNLDNNNPQIHVQSNNYNQNGFVGYN-----NSNNNNNNNQH-MNNQYSNSFHNNN 105
Query: 446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
++N N ++ N N N N N +++YN N N N T N SN+N+
Sbjct: 106 SSGFMAFQNNSSNFNNQNNNNSNNNNNNNNIN-SYDYNNSNNNNYNNNNNTHSNNSNNNN 164
Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
N + W+ + N + N +N ++ N+ N N S N
Sbjct: 165 NNNNSN---YWNNNNNNNNNNNNNNNNNNNNN----------------NNNNNNNNSNNN 205
Query: 566 SNDNSYQN 573
+N+N+ N
Sbjct: 206 NNNNNNNN 213
Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDG 577
N+ N N + N +N+N N + G
Sbjct: 552 NNNNNNNNNNNNNNNNITNNPLSG 575
Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
N+ N N + N +N+N+ N + I ++ + NE
Sbjct: 1689 NNNNNNNNNNNNNNNNNNNNNNNNIINNNITTINE 1723
Score = 44 (20.5 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
Identities = 21/109 (19%), Positives = 39/109 (35%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTXXXX 613
N+ N N + N +N+N+ N + S N T N+ +Q ++ +
Sbjct: 898 NNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTVQSGTTSNSNL--VFQQTSNSNTLS 955
Query: 614 XXXXXXXXMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
+ Q + G LSD ++ L + +A+ CG
Sbjct: 956 PSQQQQQQTQQQQSINGSST----GSLSDAQYQDLGIHLDTSSANSGCG 1000
Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N +N +N+N+ N
Sbjct: 1347 NNQNNNNNDQNNNNNNNNNN 1366
Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
Identities = 9/42 (21%), Positives = 20/42 (47%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 595
N+ N N + N +N+N+ N + + + + + NT+
Sbjct: 892 NNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTV 933
Score = 42 (19.8 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N + N +N+N+ N
Sbjct: 551 NNNNNNNNNNNNNNNNNITN 570
Score = 39 (18.8 bits), Expect = 3.5e-05, Sum P(2) = 3.5e-05
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDG 577
+SIN N + N N N+ N +G
Sbjct: 1119 SSINSNINNVNNCNINNNSNSNNG 1142
Score = 37 (18.1 bits), Expect = 5.6e-05, Sum P(2) = 5.6e-05
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N S SN N+ N
Sbjct: 1360 NNNNNNNNSTTNSNVNNNNN 1379
>DICTYBASE|DDB_G0273127 [details] [associations]
symbol:hbx5-1 "putative homeobox transcription
factor" species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0005509 "calcium ion binding" evidence=IEA] [GO:0003700
"sequence-specific DNA binding transcription factor activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007275
"multicellular organismal development" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] InterPro:IPR001356 InterPro:IPR002048
InterPro:IPR009057 Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071
PROSITE:PS50222 SMART:SM00389 dictyBase:DDB_G0273645
dictyBase:DDB_G0273127 GO:GO:0007275 GO:GO:0005634 GO:GO:0043565
GenomeReviews:CM000151_GR GO:GO:0003700 GO:GO:0006351
EMBL:AAFI02000011 EMBL:AAFI02000009 Gene3D:1.10.10.60
SUPFAM:SSF46689 RefSeq:XP_644439.1 RefSeq:XP_644811.1
ProteinModelPortal:Q557C9 EnsemblProtists:DDB0220481
EnsemblProtists:DDB0266662 GeneID:8618913 GeneID:8619064
KEGG:ddi:DDB_G0273127 KEGG:ddi:DDB_G0273645 OMA:THHINIF
ProtClustDB:CLSZ2431129 Uniprot:Q557C9
Length = 1723
Score = 153 (58.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 48/204 (23%), Positives = 82/204 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENS-ILRYENGTHEYNSPRIENSNTRYENG---THEYNPK 436
N ++ ++N+ N +S + ++N + +N+ NSN N +++YN
Sbjct: 85 NNNNNNQHMNNQYSNSFHNNNSSGFMAFQNNSSNFNNQNNNNSNNNNNNNNINSYDYNNS 144
Query: 437 YENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN 494
N Y N TH N N N + +N N N N N + N N+ N N
Sbjct: 145 NNNNYNNNNNTHSNNSNNNNNNNNSNY-WNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSN 203
Query: 495 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSK--RNTILHNIDD---EWQISALTRGKWK 549
+ N +N+N + + +++P+ N I HN +D Q + G
Sbjct: 204 NNNNNNNNNNHHHHHHQ--------QQSQPTSPYNNPIQHNPNDMKFNGQHNPFN-GNQM 254
Query: 550 LVKENSINGNGTSENRSNDNSYQN 573
++ N+ N N + N N NS N
Sbjct: 255 VMDNNNNNNNNNNSNVFNSNSNSN 278
Score = 134 (52.2 bits), Expect = 0.00065, Sum P(2) = 0.00065
Identities = 47/188 (25%), Positives = 76/188 (40%)
Query: 387 NYVNSTVENIIPRYENSILRY-ENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGT 445
++V+ ++N P+ Y +NG YN NSN N H N +Y N + N
Sbjct: 52 SFVSPNLDNNNPQIHVQSNNYNQNGFVGYN-----NSNNNNNNNQH-MNNQYSNSFHNNN 105
Query: 446 HEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNS 505
++N N ++ N N N N N +++YN N N N T N SN+N+
Sbjct: 106 SSGFMAFQNNSSNFNNQNNNNSNNNNNNNNIN-SYDYNNSNNNNYNNNNNTHSNNSNNNN 164
Query: 506 YQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
N + W+ + N + N +N ++ N+ N N S N
Sbjct: 165 NNNNSN---YWNNNNNNNNNNNNNNNNNNNNN----------------NNNNNNNNSNNN 205
Query: 566 SNDNSYQN 573
+N+N+ N
Sbjct: 206 NNNNNNNN 213
Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDG 577
N+ N N + N +N+N N + G
Sbjct: 552 NNNNNNNNNNNNNNNNITNNPLSG 575
Score = 46 (21.3 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNE 588
N+ N N + N +N+N+ N + I ++ + NE
Sbjct: 1689 NNNNNNNNNNNNNNNNNNNNNNNNIINNNITTINE 1723
Score = 44 (20.5 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
Identities = 21/109 (19%), Positives = 39/109 (35%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTXXXX 613
N+ N N + N +N+N+ N + S N T N+ +Q ++ +
Sbjct: 898 NNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTVQSGTTSNSNL--VFQQTSNSNTLS 955
Query: 614 XXXXXXXXMRYQVDLTGGPDQVYLSGLSDREWLALAMRKLRDAASIQCG 662
+ Q + G LSD ++ L + +A+ CG
Sbjct: 956 PSQQQQQQTQQQQSINGSST----GSLSDAQYQDLGIHLDTSSANSGCG 1000
Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N +N +N+N+ N
Sbjct: 1347 NNQNNNNNDQNNNNNNNNNN 1366
Score = 43 (20.2 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
Identities = 9/42 (21%), Positives = 20/42 (47%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTI 595
N+ N N + N +N+N+ N + + + + + NT+
Sbjct: 892 NNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTSANTV 933
Score = 42 (19.8 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N + N +N+N+ N
Sbjct: 551 NNNNNNNNNNNNNNNNNITN 570
Score = 39 (18.8 bits), Expect = 3.5e-05, Sum P(2) = 3.5e-05
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDG 577
+SIN N + N N N+ N +G
Sbjct: 1119 SSINSNINNVNNCNINNNSNSNNG 1142
Score = 37 (18.1 bits), Expect = 5.6e-05, Sum P(2) = 5.6e-05
Identities = 8/20 (40%), Positives = 10/20 (50%)
Query: 554 NSINGNGTSENRSNDNSYQN 573
N+ N N S SN N+ N
Sbjct: 1360 NNNNNNNNSTTNSNVNNNNN 1379
>DICTYBASE|DDB_G0291424 [details] [associations]
symbol:DDB_G0291424 "Transcription factor SKN7"
species:44689 "Dictyostelium discoideum" [GO:0035556 "intracellular
signal transduction" evidence=IEA] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA] [GO:0000160
"phosphorelay signal transduction system" evidence=IEA] [GO:0000156
"phosphorelay response regulator activity" evidence=IEA]
InterPro:IPR001789 Pfam:PF00072 PROSITE:PS50110 SMART:SM00448
dictyBase:DDB_G0291424 EMBL:AAFI02000177 GO:GO:0006355
GO:GO:0035556 GO:GO:0005622 GO:GO:0000156 InterPro:IPR011006
SUPFAM:SSF52172 eggNOG:COG0784 RefSeq:XP_635201.1
ProteinModelPortal:Q54EN9 EnsemblProtists:DDB0183884 GeneID:8628146
KEGG:ddi:DDB_G0291424 InParanoid:Q54EN9 OMA:MCANITD
ProtClustDB:CLSZ2429563 Uniprot:Q54EN9
Length = 902
Score = 143 (55.4 bits), Expect = 6.9e-06, P = 6.9e-06
Identities = 53/235 (22%), Positives = 95/235 (40%)
Query: 390 NSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYE-NGTHEY 448
N++ N I N+ + NG + YN+ N+N Y N + +N Y N Y N + +Y
Sbjct: 131 NNSSNNNINNNNNNNINNNNGDN-YNNYNNNNNNNNYNN--NNFNNNYNNNYNGNNSFDY 187
Query: 449 NPKYE----------NRYENGTHEYNGPKNENTNPRYENGTH---EYNIPRLENSINGNG 495
N N Y N ++YN N NTN T+ N N+ N
Sbjct: 188 NNNNNSNVYFNNDRGNNYNNSYNDYNNNNNNNTNTNTNTNTNTNTNTNTNTNTNTNTNNN 247
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEP-----SKRNTILHNIDDEWQISALTRGKWKL 550
S N +N+N+ N + ++ N+P + N +N ++ + R K
Sbjct: 248 NSFNNNNNNNNNNNFNNSSNYNYDYNNKPYVNSNNNNNNNNNNFNNNINNNNNNRNKSPP 307
Query: 551 VK-ENSING---NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDD 601
+ + I+ N + + N+ + E++ D +L++N K+ +H + D
Sbjct: 308 PQYQTQISQQQPNNQQQQQLNNKNENLELEDEDENLILNQNRKPKKTKTIHRLMD 362
Score = 127 (49.8 bits), Expect = 0.00036, P = 0.00036
Identities = 46/207 (22%), Positives = 86/207 (41%)
Query: 409 NGTHEYNSPRIENSN-TRYENGTHEYNPKYENRYENGTHEYNPKYENRYE-NGTHEYNGP 466
N + N+ I N+N Y N + N N Y N + +N Y N Y N + +YN
Sbjct: 136 NNINNNNNNNINNNNGDNYNNYNNNNN---NNNYNN--NNFNNNYNNNYNGNNSFDYNNN 190
Query: 467 KNENT---NPR---YENGTHEYNIPRLENSINGNGTSEN-RSNDNSYQNEIDGIDVWSVL 519
N N N R Y N ++YN N+ T+ N +N N+ N + +
Sbjct: 191 NNSNVYFNNDRGNNYNNSYNDYNNNNNNNTNTNTNTNTNTNTNTNTNTNTNTNTNNNNSF 250
Query: 520 SRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGID 579
+ N + N +N + + N+ N N + N +N+N+ +N+
Sbjct: 251 NNNNNNNNNNNFNN-SSNYNYDYNNKPYVNSNNNNNNNNNNFNNNINNNNNNRNKSPPPQ 309
Query: 580 VWSVLSRNEPS-KRNTILHNIDDEWQI 605
+ +S+ +P+ ++ L+N ++ ++
Sbjct: 310 YQTQISQQQPNNQQQQQLNNKNENLEL 336
>DICTYBASE|DDB_G0282019 [details] [associations]
symbol:DDB_G0282019 species:44689 "Dictyostelium
discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000315
Pfam:PF00643 dictyBase:DDB_G0282019 GO:GO:0008270 GO:GO:0005622
EMBL:AAFI02000044 RefSeq:XP_640410.1 ProteinModelPortal:Q54T41
EnsemblProtists:DDB0205090 GeneID:8623365 KEGG:ddi:DDB_G0282019
InParanoid:Q54T41 OMA:CNYSYNC ProtClustDB:CLSZ2846638
Uniprot:Q54T41
Length = 402
Score = 138 (53.6 bits), Expect = 7.1e-06, P = 7.1e-06
Identities = 56/240 (23%), Positives = 91/240 (37%)
Query: 367 VHVSDWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRY 426
+ + ++ +LL N + I N +N + ++N IL + + IEN Y
Sbjct: 164 IEMDEYQKSLLILNNNNIIDN------DNKLKDFKNQILSFN---YSLIKNIIENFKLIY 214
Query: 427 ENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPR 486
G + N N N + N N N + N N N+N Y N + YN
Sbjct: 215 SFGDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNYSY-NCNYSYNCNY 273
Query: 487 LENSINGNGTSENRSNDNSYQN-EIDGIDVWSVLSRNEPSKRNTILHNID-----DEWQI 540
N N N N SN NS + + + + ++ S + N I ++ D D +
Sbjct: 274 SYNCNNNNNYRNNNSNSNSNNSYDCNNDNNNNIFSNSNGHNDNDIGNDFDNDNDNDSYID 333
Query: 541 SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID 600
G + K N+ N N + N +N+N+ N + + N S N L N D
Sbjct: 334 DDNNDGDYNNNKNNNYNNNNNNNNNNNNNNNNNNKN--------NNNNNSNNNNKLSNAD 385
>DICTYBASE|DDB_G0284321 [details] [associations]
symbol:DDB_G0284321 "putative polypyrimidine tract
binding protein (PTBP1)" species:44689 "Dictyostelium discoideum"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
"nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000504 InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360
dictyBase:DDB_G0284321 GO:GO:0000166 Gene3D:3.30.70.330
GO:GO:0003676 EMBL:AAFI02000064 eggNOG:NOG263741 OMA:DATENEI
RefSeq:XP_638677.1 ProteinModelPortal:Q54PW8 SMR:Q54PW8
EnsemblProtists:DDB0233645 GeneID:8624506 KEGG:ddi:DDB_G0284321
InParanoid:Q54PW8 Uniprot:Q54PW8
Length = 892
Score = 142 (55.0 bits), Expect = 8.7e-06, P = 8.7e-06
Identities = 50/214 (23%), Positives = 86/214 (40%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYEN-GTHEYNPKYEN 439
NK + N N++ N I + + EN E + EN N EN T++ + K EN
Sbjct: 81 NKKNNNNNNNNSSSNNIKETDGNKNDVENEISEVDFEGSENEN---ENKNTNQNDIKNEN 137
Query: 440 RYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSEN 499
+N + N N N + N N N N N N EN +EN
Sbjct: 138 ENDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNENENENENENENENENENEN 197
Query: 500 ---RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
+ N+N + E D D + S+ P ++ L ++ ++ + N+
Sbjct: 198 ENAKENENENEKEKDNED--NKESKTSPPQKIKNLDESNNNSNSNSNSNNNNNNNNNNNN 255
Query: 557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPS 590
N N + N +N+N+ N + ++ V++ N+ S
Sbjct: 256 NNNNNNNNNNNNNNNNNNKNNKNLNGVINENKRS 289
Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
Identities = 47/229 (20%), Positives = 89/229 (38%)
Query: 376 LLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNP 435
L+S N + +N+ N + + N + N + + EN E +
Sbjct: 57 LISEPNNRNNSETLNNNNNNNNKNNKKNNNNNNNNSSSNNIKETDGNKNDVENEISEVDF 116
Query: 436 K-YENRYEN-GTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSING 493
+ EN EN T++ + K EN +N + N N N N N + N N+ N
Sbjct: 117 EGSENENENKNTNQNDIKNENENDNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNS 176
Query: 494 NGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKE 553
N N+N +NE + + + + NE K N + + + +
Sbjct: 177 NENENENENENENENENENENENAKENENENEKEKDNEDNKESKTSPPQKIKNLDESNNN 236
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
++ N N + N +N+N+ N + + + + N +K N L+ + +E
Sbjct: 237 SNSNSNSNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNKNNKNLNGVINE 285
>DICTYBASE|DDB_G0291348 [details] [associations]
symbol:DDB_G0291348 "fungal transcriptional
regulatory protein, N-terminal domain-containing protein"
species:44689 "Dictyostelium discoideum" [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0006357 "regulation of
transcription from RNA polymerase II promoter" evidence=IEA]
[GO:0006355 "regulation of transcription, DNA-dependent"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0000981
"sequence-specific DNA binding RNA polymerase II transcription
factor activity" evidence=IEA] InterPro:IPR001138 Pfam:PF00172
PROSITE:PS00463 PROSITE:PS50048 SMART:SM00066
dictyBase:DDB_G0291348 GO:GO:0005634 EMBL:AAFI02000177
GO:GO:0008270 GO:GO:0006357 GO:GO:0006366 GO:GO:0000981
Gene3D:4.10.240.10 SUPFAM:SSF57701 RefSeq:XP_635156.1
ProteinModelPortal:Q54ET4 EnsemblProtists:DDB0220623 GeneID:8628102
KEGG:ddi:DDB_G0291348 eggNOG:NOG295150 InParanoid:Q54ET4
ProtClustDB:CLSZ2429552 Uniprot:Q54ET4
Length = 771
Score = 141 (54.7 bits), Expect = 9.2e-06, P = 9.2e-06
Identities = 40/197 (20%), Positives = 74/197 (37%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N + N + +N + +S + N ++ N I N+N Y N + N N
Sbjct: 93 NNNHSHNNCHDNNQNNSHNHNHSNIISNNIQNQINGNLITNNNNNYNNNNNNNNDNNNNN 152
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
N ++ N N N + N N N N Y N +N + N N + +
Sbjct: 153 NNNNNNDNNNNNNNNNNNNNNNNNNNNNNNNNNNYNNLNENFNNQNFNQNFNQNFNNVDN 212
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
++ + N + ++ SV ++ + T+ N + + N+ N N
Sbjct: 213 MHNQLFNNSNNYLNNNSVKTKQNENLIETLSKNKNKQNLNINNNNNNNNNNNNNNNNNNN 272
Query: 561 TSENRSNDNSYQNEIDG 577
+ N +N+N+ N DG
Sbjct: 273 NNNNNNNNNNNNNNGDG 289
Score = 139 (54.0 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 53/202 (26%), Positives = 84/202 (41%)
Query: 378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYE-NGTHEYNPK 436
S +N S NY N N + N +N +H +N I ++N + + NG N
Sbjct: 78 SQSNHSQ-SNY-NHNHTNNNHSHNNCHDNNQNNSHNHNHSNIISNNIQNQINGNLITNNN 135
Query: 437 YENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGT 496
N Y N + N N N ++ N N N N N + N N+ N N
Sbjct: 136 --NNYNNNNNNNNDNNNNNNNNNNNDNNNNNNNNNNNNNNNNNNNNNN---NNNNNYNNL 190
Query: 497 SENRSNDNSYQN---EIDGID-VWSVLSRNEPSK-RNTILHNIDDEWQISALTRGKWKLV 551
+EN +N N QN + +D + + L N + N + +E I L++ K K
Sbjct: 191 NENFNNQNFNQNFNQNFNNVDNMHNQLFNNSNNYLNNNSVKTKQNENLIETLSKNKNK-- 248
Query: 552 KENSINGNGTSENRSNDNSYQN 573
+ +IN N + N +N+N+ N
Sbjct: 249 QNLNINNNNNNNNNNNNNNNNN 270
Score = 136 (52.9 bits), Expect = 3.2e-05, P = 3.2e-05
Identities = 51/222 (22%), Positives = 88/222 (39%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N S N+ N NI + +++ N + YN+ N++ N + N N
Sbjct: 107 NNSHNHNHSNIISNNIQNQINGNLIT--NNNNNYNNNNNNNNDNNNNNNNNNNNDNNNNN 164
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGN-GTSEN 499
N + N N N + YN NEN N + N N ++N N S N
Sbjct: 165 NNNNNNNNNNNNNNNNNNNNNNYNN-LNENFNNQNFNQNFNQNFNNVDNMHNQLFNNSNN 223
Query: 500 RSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGN 559
N+NS + + ++ LS+N+ +K+N ++N ++ + N+ N N
Sbjct: 224 YLNNNSVKTK-QNENLIETLSKNK-NKQNLNINNNNNNNNNNNNNNNNNN--NNNNNNNN 279
Query: 560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPS-KRNTILHNID 600
+ N + D + N I + + L N+ + K NID
Sbjct: 280 NNNNNNNGDGNNGNNIVKSPILNFLVNNQNAMKTQKTQSNID 321
Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
Identities = 47/219 (21%), Positives = 91/219 (41%)
Query: 385 IPNYVNSTVENIIPRYEN-SILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
I N +T N N SI +N ++ N N + N H +N ++N +N
Sbjct: 49 IKNNQTTTTTNSTTNPNNQSIKNIQNQNQSQSNHSQSNYNHNHTNNNHSHNNCHDNN-QN 107
Query: 444 GTHEYNPKYENRYENGT-HEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSN 502
+H +N + N N ++ NG N N Y N + N N+ N N + N +N
Sbjct: 108 NSHNHN--HSNIISNNIQNQINGNLITNNNNNYNNNNNNNNDNNNNNNNNNNNDNNNNNN 165
Query: 503 DNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRG-KWKLVKENSINGN-- 559
+N+ N + + N + N +N+++ + + ++++
Sbjct: 166 NNNNNNNNNN-------NNNNNNNNNNNYNNLNENFNNQNFNQNFNQNFNNVDNMHNQLF 218
Query: 560 GTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHN 598
S N N+NS + + ++ LS+N+ +K+N ++N
Sbjct: 219 NNSNNYLNNNSVKTK-QNENLIETLSKNK-NKQNLNINN 255
>ZFIN|ZDB-GENE-030131-775 [details] [associations]
symbol:sulf2l "sulfatase 2, like" species:7955
"Danio rerio" [GO:0003824 "catalytic activity" evidence=IEA]
[GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0008152 "metabolic
process" evidence=IEA] [GO:0009986 "cell surface" evidence=IEA]
[GO:0005509 "calcium ion binding" evidence=IEA] [GO:0008484
"sulfuric ester hydrolase activity" evidence=IEA] [GO:0005783
"endoplasmic reticulum" evidence=IEA] InterPro:IPR000917
InterPro:IPR014615 InterPro:IPR017849 InterPro:IPR017850
Pfam:PF00884 PIRSF:PIRSF036665 ZFIN:ZDB-GENE-030131-775
GO:GO:0005783 GO:GO:0005794 GO:GO:0009986 GO:GO:0005509
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 GO:GO:0008484 HOGENOM:HOG000290161 KO:K14607
HOVERGEN:HBG056431 InterPro:IPR024609 Pfam:PF12548
OrthoDB:EOG49KFPX EMBL:AY332607 IPI:IPI00499289
RefSeq:NP_001003833.2 UniGene:Dr.12108 ProteinModelPortal:Q6EF98
GeneID:322056 KEGG:dre:322056 CTD:322056 NextBio:20807645
ArrayExpress:Q6EF98 Uniprot:Q6EF98
Length = 885
Score = 125 (49.1 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
Identities = 56/223 (25%), Positives = 91/223 (40%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN-YYTVQLCTPSRSAIMTGK 118
P+II IL DD D+ G Q+ + G N + T +C PSRS+++TGK
Sbjct: 46 PNIILILTDD---QDIEL-GSMQVMNKTRRIMEQGGTHFSNAFVTTPMCCPSRSSMLTGK 101
Query: 119 HPIHTGMQHNVLYGCERGGLPLSE-----KILPQYLKELGYRTRIVGKWHLGFYKKEYTP 173
+ H HN E P + + YL GYRT GK+ L Y Y P
Sbjct: 102 YA-HN---HNTYTNNENCSSPSWQAQHEPRTFGVYLNNTGYRTAFFGKY-LNEYNGTYIP 156
Query: 174 TFRGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDII 233
G+ + + +++++ + G+ + E D Y TD+ T ++++
Sbjct: 157 P--GWREWVAM-VKNSRFYNYT---LCRNGVREKHGFEYPKD----YLTDLITNDSINYF 206
Query: 234 HNHST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + ++HAA H P Q + N +HI
Sbjct: 207 RMSKKIYPHRPVLMVISHAAPHGPEDAAP-QYTTAFPNASQHI 248
Score = 66 (28.3 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
Identities = 28/118 (23%), Positives = 47/118 (39%)
Query: 269 IHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSDXXXXXXXXXXXXX 328
IH + + K L +D+SV KV L L N+ +++ +D
Sbjct: 273 IHMEFTNMLQRKRLQTLLSVDDSVEKVYNMLVDTGELDNTYVIYTADHGYHIGQFGLVKG 332
Query: 329 SNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIVAEQYVHVSDWLPTLLSAANKSDIP 386
+ P +E +R I P +E+ GI +++ D PT+L A D+P
Sbjct: 333 KSMP--------YEFDIRVPFYIRGPNVEAGGINPHIVLNI-DLAPTILDIAGM-DVP 380
>UNIPROTKB|I3L4C9 [details] [associations]
symbol:SGSH "N-sulphoglucosamine sulphohydrolase"
species:9606 "Homo sapiens" [GO:0008484 "sulfuric ester hydrolase
activity" evidence=IEA] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484 EMBL:AC087741
EMBL:AC123764 HGNC:HGNC:10818 ChiTaRS:SGSH
ProteinModelPortal:I3L4C9 SMR:I3L4C9 Ensembl:ENST00000576941
Bgee:I3L4C9 Uniprot:I3L4C9
Length = 108
Score = 114 (45.2 bits), Expect = 1.2e-05, P = 1.2e-05
Identities = 25/78 (32%), Positives = 47/78 (60%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
P+ +++ V + + P + + +LADD G+ G + I TP++DALA ++ +N
Sbjct: 4 PVPACCALLLVLGLCRARPRNALLLLADDGGFES-GAYNNSAIATPHLDALARRSLLFRN 62
Query: 101 YYT-VQLCTPSRSAIMTG 117
+T V C+PSR++++TG
Sbjct: 63 AFTSVSSCSPSRASLLTG 80
>GENEDB_PFALCIPARUM|PFL1370w [details] [associations]
symbol:Pfnek-1 "NIMA-related protein kinase,
Pfnek-1" species:5833 "Plasmodium falciparum" [GO:0007067 "mitosis"
evidence=ISS] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000719 InterPro:IPR002290 InterPro:IPR008271
InterPro:IPR011009 Pfam:PF00069 PROSITE:PS00108 PROSITE:PS50011
SMART:SM00220 GO:GO:0005524 SUPFAM:SSF56112 GO:GO:0004674
EMBL:AE014188 KO:K08286 GenomeReviews:AE014188_GR HSSP:Q00535
RefSeq:XP_001350680.1 ProteinModelPortal:Q8I5D5 IntAct:Q8I5D5
MINT:MINT-1689491 EnsemblProtists:PFL1370w:mRNA GeneID:811326
KEGG:pfa:PFL1370w EuPathDB:PlasmoDB:PF3D7_1228300
HOGENOM:HOG000281114 OMA:CINDEEN Uniprot:Q8I5D5
Length = 1057
Score = 141 (54.7 bits), Expect = 1.4e-05, P = 1.4e-05
Identities = 45/211 (21%), Positives = 91/211 (43%)
Query: 371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENG 429
D + +LL + + I N + N N+ +G + ++ + SNT ENG
Sbjct: 664 DEINSLLKKKSINTISNKNTQSYSNSSTHINNNYNVVNCHGAYNNHNTLSQYSNTSVENG 723
Query: 430 THEYNPKYENRYENGTHE-YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
++Y KY+ N + + YN ++ Y + +E N++ N + N + N+ +
Sbjct: 724 KYKYENKYQGNIRNTSKDVYNENMDSAYRSPKYEKGYDDNKSVNKKKMNSNNMGNMNNMN 783
Query: 489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRG 546
N N N + N SN+N+ N + + ++ N + N + I + + + R
Sbjct: 784 NMNNMNNNNNNNSNNNNNSNNSNS----NYMNNNHHTNNNNSCTSNRISNMYFNDSSRRS 839
Query: 547 KWKLVKENSINGNGTSENRSNDNSY-QNEID 576
+ N+++ +S +DN Y QN ++
Sbjct: 840 VSAMPNVNNVSRRKSSVYLCDDNMYNQNNVE 870
>UNIPROTKB|Q8I5D5 [details] [associations]
symbol:nek-1 "NIMA-related protein kinase, Pfnek-1"
species:36329 "Plasmodium falciparum 3D7" [GO:0005575
"cellular_component" evidence=ND] InterPro:IPR000719
InterPro:IPR002290 InterPro:IPR008271 InterPro:IPR011009
Pfam:PF00069 PROSITE:PS00108 PROSITE:PS50011 SMART:SM00220
GO:GO:0005524 SUPFAM:SSF56112 GO:GO:0004674 EMBL:AE014188 KO:K08286
GenomeReviews:AE014188_GR HSSP:Q00535 RefSeq:XP_001350680.1
ProteinModelPortal:Q8I5D5 IntAct:Q8I5D5 MINT:MINT-1689491
EnsemblProtists:PFL1370w:mRNA GeneID:811326 KEGG:pfa:PFL1370w
EuPathDB:PlasmoDB:PF3D7_1228300 HOGENOM:HOG000281114 OMA:CINDEEN
Uniprot:Q8I5D5
Length = 1057
Score = 141 (54.7 bits), Expect = 1.4e-05, P = 1.4e-05
Identities = 45/211 (21%), Positives = 91/211 (43%)
Query: 371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYE-NGTHEYNSPRIENSNTRYENG 429
D + +LL + + I N + N N+ +G + ++ + SNT ENG
Sbjct: 664 DEINSLLKKKSINTISNKNTQSYSNSSTHINNNYNVVNCHGAYNNHNTLSQYSNTSVENG 723
Query: 430 THEYNPKYENRYENGTHE-YNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLE 488
++Y KY+ N + + YN ++ Y + +E N++ N + N + N+ +
Sbjct: 724 KYKYENKYQGNIRNTSKDVYNENMDSAYRSPKYEKGYDDNKSVNKKKMNSNNMGNMNNMN 783
Query: 489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTIL--HNIDDEWQISALTRG 546
N N N + N SN+N+ N + + ++ N + N + I + + + R
Sbjct: 784 NMNNMNNNNNNNSNNNNNSNNSNS----NYMNNNHHTNNNNSCTSNRISNMYFNDSSRRS 839
Query: 547 KWKLVKENSINGNGTSENRSNDNSY-QNEID 576
+ N+++ +S +DN Y QN ++
Sbjct: 840 VSAMPNVNNVSRRKSSVYLCDDNMYNQNNVE 870
>TIGR_CMR|SPO_2214 [details] [associations]
symbol:SPO_2214 "choline sulfatase" species:246200
"Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
InterPro:IPR024607 PROSITE:PS00149 HOGENOM:HOG000217625 KO:K01133
ProtClustDB:CLSK864791 GO:GO:0047753 RefSeq:YP_167440.1
ProteinModelPortal:Q5LRB5 GeneID:3194829 KEGG:sil:SPO2214
PATRIC:23377781 OMA:LLIMADQ Uniprot:Q5LRB5
Length = 498
Score = 103 (41.3 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
Identities = 55/225 (24%), Positives = 87/225 (38%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+I+ I+AD + + G T ++ LA + N YT +C P+RS MTG
Sbjct: 17 PNILLIMADQMTPFMLEACGGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTGL 76
Query: 119 HPIHTGMQHNVLYGCERGGLPLSEKILP---QYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
+ TG C G P LP YL GY T + GK H F +
Sbjct: 77 YTSTTG--------CYDNGDPY-HSFLPTFAHYLTNAGYETVLSGKMH--FIGADQ---L 122
Query: 176 RGFESHLG---YWTGHQDYF------DHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFT 226
GF+ L Y +G + D S + + ++ P W +Y +
Sbjct: 123 HGFQRRLNPDIYPSGFLWSYPLPPDGDASFQAFDFTPQYLAENIGPGWSKELQYDEET-Q 181
Query: 227 AEAVDIIHNHSTDEPLFLYLAHAATHSANPYEPLQAPDHYLNIHR 271
A++ + H+ D P L ++ NP+ P P Y +++
Sbjct: 182 FRALEYLR-HAPDTPWMLTVSFT-----NPHPPYVVPRPYWEMYK 220
Score = 77 (32.2 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
Identities = 16/64 (25%), Positives = 33/64 (51%)
Query: 252 HSANPYEPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIV 311
H+ + L H + R++ +R FAA+ H +D+ +G ++E L++ ++I+
Sbjct: 242 HALRRWHGLHQRGHEVRDPRNLIAMRRG-FAALAHYVDDKIGALLEVLDETGQRDETVII 300
Query: 312 FVSD 315
SD
Sbjct: 301 VTSD 304
Score = 46 (21.3 bits), Expect = 1.7e-05, Sum P(3) = 1.7e-05
Identities = 14/52 (26%), Positives = 25/52 (48%)
Query: 751 NEEEGMRKLRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEV 802
+E G +R + ++ G K C AP L+++ DP E +N A ++
Sbjct: 387 SEYHGEGIMRPSFMVRLGDWKYHYCHGS-APQLYNLARDPGEWHNRAGEPDL 437
Score = 43 (20.2 bits), Expect = 3.3e-05, Sum P(3) = 3.3e-05
Identities = 18/62 (29%), Positives = 31/62 (50%)
Query: 653 LRDAASIQCGPVKEVPCEPQIAPCLFDIKNDPCEKNNLA---DRSEDQ-RINHYTTEVGR 708
+R + ++ G K C AP L+++ DP E +N A D +E + R++ T G
Sbjct: 395 MRPSFMVRLGDWKYHYCHGS-APQLYNLARDPGEWHNRAGEPDLAETEARLDRVITG-GS 452
Query: 709 FN 710
F+
Sbjct: 453 FD 454
>DICTYBASE|DDB_G0271052 [details] [associations]
symbol:snf2b "SNF2-related protein Snf2a"
species:44689 "Dictyostelium discoideum" [GO:0016818 "hydrolase
activity, acting on acid anhydrides, in phosphorus-containing
anhydrides" evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0005524 "ATP binding" evidence=IEA] [GO:0004386 "helicase
activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0006357
"regulation of transcription from RNA polymerase II promoter"
evidence=ISS] [GO:0005654 "nucleoplasm" evidence=ISS]
InterPro:IPR000330 InterPro:IPR001487 InterPro:IPR001650
InterPro:IPR014978 Pfam:PF00176 Pfam:PF00271 PRINTS:PR00503
PROSITE:PS50014 PROSITE:PS51194 SMART:SM00297 SMART:SM00490
SMART:SM00951 dictyBase:DDB_G0271052 GO:GO:0005524 GO:GO:0005654
EMBL:AAFI02000005 GO:GO:0003677 GO:GO:0006357 GO:GO:0004386
InterPro:IPR011050 SUPFAM:SSF51126 eggNOG:COG0553
InterPro:IPR014001 SMART:SM00487 PROSITE:PS51192 SUPFAM:SSF47370
KO:K11647 InterPro:IPR014012 PROSITE:PS51204 RefSeq:XP_646649.1
ProteinModelPortal:Q55C32 EnsemblProtists:DDB0220695 GeneID:8617621
KEGG:ddi:DDB_G0271052 InParanoid:Q55C32 OMA:NINDNPN Uniprot:Q55C32
Length = 3247
Score = 144 (55.7 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
Identities = 56/217 (25%), Positives = 78/217 (35%)
Query: 362 VAEQYVHVSDWL-P-TLLSAANKSDIP-NYVNSTVENIIPRYENSILRYENGTHEYNSPR 418
+ E+Y + P T ++ ++ S + N NS V N NS + N NS
Sbjct: 632 ITEEYYGILQLAHPSTFINQSSPSVVQMNTNNSNVNNNNNNNSNSNMNNNNMNSNNNSNM 691
Query: 419 IENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENG 478
N+N NG + N N N N N N N N N N N
Sbjct: 692 --NNNMNNNNGVNNMNNNMNNNNTNNNSNNNNMNHNNMNNNNGMNNNMNNNNNNNNNMNN 749
Query: 479 THEYNIPRLENSINGNGTSENRSNDNSY--QNEIDGIDVWSVLSRNEPSKRNTILHNIDD 536
NI NS N S N SN+N N I+ I + S N + N +N ++
Sbjct: 750 NTNSNINSNNNSGNSTNNSANISNNNGNIGNNNINNISYNNNNSNNNSNNNNNSNNNSNN 809
Query: 537 EWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
S + NS N N + N +N N+ N
Sbjct: 810 NNNSSGNSNSNSNN-NSNSNNNNNNNNNNNNSNTSGN 845
Score = 57 (25.1 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
Identities = 13/49 (26%), Positives = 25/49 (51%)
Query: 554 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
N+ N N + N +N+N YQ+ R++P+K+ + +DD+
Sbjct: 3181 NNYNNNNYNSNHNNNNQYQHH--SYQQQQHQQRHQPNKKQRF-NPLDDD 3226
>GENEDB_PFALCIPARUM|PF11_0176 [details] [associations]
symbol:PF11_0176 "hypothetical protein"
species:5833 "Plasmodium falciparum" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR016196 SUPFAM:SSF103473 EMBL:AE014186
RefSeq:XP_001347847.2 ProteinModelPortal:Q8IIJ7
EnsemblProtists:PF11_0176:mRNA GeneID:810723 KEGG:pfa:PF11_0176
EuPathDB:PlasmoDB:PF3D7_1117000 Uniprot:Q8IIJ7
Length = 1283
Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
Identities = 66/229 (28%), Positives = 95/229 (41%)
Query: 389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
+N+T N EN+ N +E N+ EN+N N +E N EN N +
Sbjct: 754 INTTTTNNNNNNENNNNNENNNNNENNNNN-ENNNNNENNNNNENNNNNENNNNNENNNN 812
Query: 449 NPKYENRYENGTHEYNGPKNE-NTNPRY-ENGTHEYNI--PRLENS-INGNGTSENRSND 503
N N N +E N N N N + +N H NI P +N IN T+E N
Sbjct: 813 NENNNNNENNNNNENNNNNNHHNHNHNHNQNNHHNQNINYPNPQNERINYPFTNEFIHNH 872
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA---LTRGKWKLVKENSINGN- 559
+ Y N I L+ + NTIL N ++ I+ T + L+KEN I +
Sbjct: 873 HEYVNNI-------ALTPKQQIIDNTILENKQNDEDINKKKLTTHSQKNLLKENLIITDE 925
Query: 560 ---GTSENRSNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
T N+ +N+ QN I + L R+E + I NI +E Q
Sbjct: 926 YFINTDTNQYMNNAQNQNNIC-LPKGIYLDRSEECEPKNIW-NIQNESQ 972
Score = 125 (49.1 bits), Expect = 0.00091, P = 0.00091
Identities = 52/183 (28%), Positives = 81/183 (44%)
Query: 392 TVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
T+ NII + N+ + NG+ N+ N+N N +E N EN N +
Sbjct: 730 TLNNIITQSNIPINNTNQNINGS-PINTTTTNNNNNNENNNNNENNNNNENN-NNNENNN 787
Query: 449 NPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS-NDNSYQ 507
N + N EN + N NEN N N +E N EN+ N N + N + N N++
Sbjct: 788 NNENNNNNENNNNNENNNNNENNNNNENNNNNENNNNN-ENNNNNNHHNHNHNHNQNNHH 846
Query: 508 NEIDGIDVWSVLSR--NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
N+ I+ + + N P N +HN + ALT K +++ +N+I EN+
Sbjct: 847 NQ--NINYPNPQNERINYPFT-NEFIHNHHEYVNNIALTP-KQQII-DNTI-----LENK 896
Query: 566 SND 568
ND
Sbjct: 897 QND 899
>UNIPROTKB|Q8IIJ7 [details] [associations]
symbol:PF11_0176 "Conserved Plasmodium membrane protein"
species:36329 "Plasmodium falciparum 3D7" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR016196 SUPFAM:SSF103473 EMBL:AE014186
RefSeq:XP_001347847.2 ProteinModelPortal:Q8IIJ7
EnsemblProtists:PF11_0176:mRNA GeneID:810723 KEGG:pfa:PF11_0176
EuPathDB:PlasmoDB:PF3D7_1117000 Uniprot:Q8IIJ7
Length = 1283
Score = 141 (54.7 bits), Expect = 1.7e-05, P = 1.7e-05
Identities = 66/229 (28%), Positives = 95/229 (41%)
Query: 389 VNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
+N+T N EN+ N +E N+ EN+N N +E N EN N +
Sbjct: 754 INTTTTNNNNNNENNNNNENNNNNENNNNN-ENNNNNENNNNNENNNNNENNNNNENNNN 812
Query: 449 NPKYENRYENGTHEYNGPKNE-NTNPRY-ENGTHEYNI--PRLENS-INGNGTSENRSND 503
N N N +E N N N N + +N H NI P +N IN T+E N
Sbjct: 813 NENNNNNENNNNNENNNNNNHHNHNHNHNQNNHHNQNINYPNPQNERINYPFTNEFIHNH 872
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISA---LTRGKWKLVKENSINGN- 559
+ Y N I L+ + NTIL N ++ I+ T + L+KEN I +
Sbjct: 873 HEYVNNI-------ALTPKQQIIDNTILENKQNDEDINKKKLTTHSQKNLLKENLIITDE 925
Query: 560 ---GTSENRSNDNSY-QNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQ 604
T N+ +N+ QN I + L R+E + I NI +E Q
Sbjct: 926 YFINTDTNQYMNNAQNQNNIC-LPKGIYLDRSEECEPKNIW-NIQNESQ 972
Score = 125 (49.1 bits), Expect = 0.00091, P = 0.00091
Identities = 52/183 (28%), Positives = 81/183 (44%)
Query: 392 TVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEY 448
T+ NII + N+ + NG+ N+ N+N N +E N EN N +
Sbjct: 730 TLNNIITQSNIPINNTNQNINGS-PINTTTTNNNNNNENNNNNENNNNNENN-NNNENNN 787
Query: 449 NPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRS-NDNSYQ 507
N + N EN + N NEN N N +E N EN+ N N + N + N N++
Sbjct: 788 NNENNNNNENNNNNENNNNNENNNNNENNNNNENNNNN-ENNNNNNHHNHNHNHNQNNHH 846
Query: 508 NEIDGIDVWSVLSR--NEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENR 565
N+ I+ + + N P N +HN + ALT K +++ +N+I EN+
Sbjct: 847 NQ--NINYPNPQNERINYPFT-NEFIHNHHEYVNNIALTP-KQQII-DNTI-----LENK 896
Query: 566 SND 568
ND
Sbjct: 897 QND 899
>ZFIN|ZDB-GENE-030131-5846 [details] [associations]
symbol:gnsb "glucosamine (N-acetyl)-6-sulfatase
(Sanfilippo disease IIID), b" species:7955 "Danio rerio"
[GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
[GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
"metabolic process" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0005764 "lysosome" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] InterPro:IPR000917
InterPro:IPR012251 InterPro:IPR015981 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 PIRSF:PIRSF036666
ZFIN:ZDB-GENE-030131-5846 GO:GO:0005764 Gene3D:3.40.720.10
SUPFAM:SSF53649 GeneTree:ENSGT00400000022041 GO:GO:0030203
GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:CU896586 IPI:IPI00971874
Ensembl:ENSDART00000112103 ArrayExpress:F1QJ04 Bgee:F1QJ04
Uniprot:F1QJ04
Length = 507
Score = 136 (52.9 bits), Expect = 1.8e-05, P = 1.8e-05
Identities = 58/226 (25%), Positives = 93/226 (41%)
Query: 41 PLAFTLSMVFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
PL + F A S +II IL DD D G+ + + + +G N
Sbjct: 14 PLKLLVLFFFFFTCAFSSKNNIILILTDD---QDEQMGGMTPMKKTR-ELIGDAGATFSN 69
Query: 101 YYT-VQLCTPSRSAIMTGKHPIHTGMQHN--VLYGCERGGLPLSEK--ILPQYLKELGYR 155
+T LC PSRS+ ++G++P H + HN V C + + P YL ++ Y+
Sbjct: 70 AFTSTPLCCPSRSSFLSGRYP-HNHLVHNNSVEGNCSSAAWQKTAEPFAFPVYLNKMRYQ 128
Query: 156 TRIVGKWHLGFYKKEYTPTFRGFESHL--GY--W---TGHQDYFDHSAEEMKMWGLDMRR 208
T GK Y +Y G +H+ G+ W G+ Y++++ L +
Sbjct: 129 TFYCGK-----YLNQYGSKDAGGVAHVPPGWDQWHALVGNSKYYNYT--------LSVNG 175
Query: 209 DLEPAWDLHGK-YSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
E D + K Y TD+ ++ + S P F+ L A HS
Sbjct: 176 KEEKHGDSYEKDYLTDLVLNRSLHFLEERSPSHPFFMMLCPPAPHS 221
>UNIPROTKB|F5H260 [details] [associations]
symbol:GNS "N-acetylglucosamine-6-sulfatase" species:9606
"Homo sapiens" [GO:0005764 "lysosome" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR015981 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 GO:GO:0005764 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0042340 GO:GO:0043199 GO:GO:0005539 GO:GO:0008449
PANTHER:PTHR10342:SF5 EMBL:AC025262 HGNC:HGNC:4422 ChiTaRS:GNS
IPI:IPI01010051 ProteinModelPortal:F5H260 SMR:F5H260
Ensembl:ENST00000545471 ArrayExpress:F5H260 Bgee:F5H260
Uniprot:F5H260
Length = 344
Score = 126 (49.4 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
Identities = 48/161 (29%), Positives = 71/161 (44%)
Query: 101 YYTVQLCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGGLPLSE-KILPQYLKEL-GYRT 156
Y LC PSR++I+TGK+P + + +N L G C + + E P L+ + GY+T
Sbjct: 22 YVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSWQKIQEPNTFPAILRSMCGYQT 81
Query: 157 RIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDYFDHSAEEMKMWGLDMRRDLEP 212
GK Y EY P G E LG YW + + + + G +
Sbjct: 82 FFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENY 136
Query: 213 AWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAATHS 253
+ D Y TDV ++D + S EP F+ +A A HS
Sbjct: 137 SVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAPHS 173
Score = 51 (23.0 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 227 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 263
>DICTYBASE|DDB_G0278425 [details] [associations]
symbol:DDB_G0278425 species:44689 "Dictyostelium
discoideum" [GO:0016779 "nucleotidyltransferase activity"
evidence=IEA] InterPro:IPR002934 Pfam:PF01909
dictyBase:DDB_G0278425 Pfam:PF03828 EMBL:AAFI02000023
eggNOG:COG5260 InterPro:IPR002058 GO:GO:0016779 RefSeq:XP_642359.1
EnsemblProtists:DDB0205447 GeneID:8621564 KEGG:ddi:DDB_G0278425
InParanoid:Q54Y43 OMA:TEESINT Uniprot:Q54Y43
Length = 1090
Score = 147 (56.8 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
Identities = 51/205 (24%), Positives = 91/205 (44%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N D PN +N+ N + N + + H +N+ N+N N ++ N N
Sbjct: 43 NGVDNPNDINNGNNNSHHKKNNHHNHHYH--HHHNNNNNNNNNNNNNNNSNNNNNNNSNN 100
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTH-EYNIPRLENSINGNGTSEN 499
N + N N NG H +N +N N+N Y+ T +++I N+IN N + N
Sbjct: 101 NNNNNNNNNNNNNNN-NNG-HHHNNTQN-NSNFTYQPKTKKDHHIQNNNNNINNNNINNN 157
Query: 500 RSND-NSYQNEIDGIDVWSVLSRNEPSKRNTI----LHNIDDEWQISALTRGKWKLVKEN 554
N+ N+ N +G +V ++S N + N ++N ++ + + G + +
Sbjct: 158 NINNINNNINTNNGNEVGHIVSNNNNNNNNNNNNNNINNNNNNINNNTINGGNSNINNQF 217
Query: 555 SINGNGTSENRSNDNSYQNEIDGID 579
N N + N ++D +Y E DGI+
Sbjct: 218 D-NENNNNNNINDDGNYIYE-DGIE 240
Score = 135 (52.6 bits), Expect = 0.00038, Sum P(2) = 0.00038
Identities = 50/219 (22%), Positives = 90/219 (41%)
Query: 387 NYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTH 446
+Y N+ N N+ +NG N N+N+ ++ H +N Y + + N +
Sbjct: 21 HYKNNNNNNNNNNNNNNNKNNQNGVDNPNDINNGNNNSHHKKNNH-HNHHYHHHHNNNNN 79
Query: 447 EYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI--NGNGTSENRSN-D 503
N N N + N N N N N + N N+ N N T + ++ D
Sbjct: 80 NNNNNNNNNNSNNNNNNNSNNNNNNNNNNNNNNNNNNNGHHHNNTQNNSNFTYQPKTKKD 139
Query: 504 NSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSE 563
+ QN + I+ ++ + N + N I N +E + + N+ N N +
Sbjct: 140 HHIQNNNNNINNNNINNNNINNINNNINTNNGNE--VGHIVSNN---NNNNNNNNNNNNI 194
Query: 564 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
N +N+N N I+G + S ++ N+ N +NI+D+
Sbjct: 195 NNNNNNINNNTINGGN--SNIN-NQFDNENNNNNNINDD 230
Score = 43 (20.2 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
Identities = 7/15 (46%), Positives = 10/15 (66%)
Query: 670 EPQIAPCLFDIKNDP 684
+P I PCL ++ N P
Sbjct: 934 QPPILPCLQELANGP 948
Score = 43 (20.2 bits), Expect = 2.1e-05, Sum P(2) = 2.1e-05
Identities = 7/15 (46%), Positives = 10/15 (66%)
Query: 776 EPQIAPCLFDIKNDP 790
+P I PCL ++ N P
Sbjct: 934 QPPILPCLQELANGP 948
>ZFIN|ZDB-GENE-040426-759 [details] [associations]
symbol:sulf2 "sulfatase 2" species:7955 "Danio
rerio" [GO:0003824 "catalytic activity" evidence=IEA] [GO:0005794
"Golgi apparatus" evidence=IEA] [GO:0008152 "metabolic process"
evidence=IEA] [GO:0009986 "cell surface" evidence=IEA] [GO:0005509
"calcium ion binding" evidence=IEA] [GO:0008484 "sulfuric ester
hydrolase activity" evidence=IEA] [GO:0005783 "endoplasmic
reticulum" evidence=IEA] InterPro:IPR000917 InterPro:IPR014615
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036665 ZFIN:ZDB-GENE-040426-759 GO:GO:0005783
GO:GO:0005794 GO:GO:0009986 GO:GO:0005509 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 GO:GO:0008484
KO:K14607 HOVERGEN:HBG056431 InterPro:IPR024609 Pfam:PF12548
CTD:55959 EMBL:BC045403 IPI:IPI00482734 RefSeq:NP_957230.1
UniGene:Dr.75551 ProteinModelPortal:Q7ZVU8 PRIDE:Q7ZVU8
GeneID:393910 KEGG:dre:393910 InParanoid:Q7ZVU8 NextBio:20814887
ArrayExpress:Q7ZVU8 Bgee:Q7ZVU8 Uniprot:Q7ZVU8
Length = 873
Score = 120 (47.3 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
Identities = 53/221 (23%), Positives = 89/221 (40%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQI-PTPNIDALAYSGIILKNYY-TVQLCTPSRSAIMTG 117
P++I IL DD D+ + + T I + G N + T +C PSRS I+TG
Sbjct: 46 PNMILILTDD---QDIELGSMQAMNKTKRI--MMQGGTHFSNAFATTPMCCPSRSTILTG 100
Query: 118 KHPIHTGMQHNVLYGCERGGLPLSEK--ILPQYLKELGYRTRIVGKWHLGFYKKEYTPTF 175
K+ +H + C + +L GYRT GK+ L Y Y P
Sbjct: 101 KY-VHNHHTYTNNENCSSPSWQAHHEPHTFAVHLNNSGYRTAFFGKY-LNEYNGSYVPP- 157
Query: 176 RGFESHLGYWTGHQDYFDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHN 235
G+ + + +++++ + G+ + + D Y TDV T ++++
Sbjct: 158 -GWREWVAL-VKNSRFYNYT---LCRNGIREKHGTQYPKD----YLTDVITNDSINFFRM 208
Query: 236 HST---DEPLFLYLAHAATHSANPYEPLQAPDHYLNIHRHI 273
P+ + L+HAA H P Q + N +HI
Sbjct: 209 SKRMYPHRPVMMVLSHAAPHGPEDAAP-QYSSAFPNASQHI 248
Score = 75 (31.5 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
Identities = 41/190 (21%), Positives = 76/190 (40%)
Query: 245 YLAHAATHSANPYEP--LQAPDHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQR 302
++ + H+ NP + L+ +H + + + L +D+SV KV L +
Sbjct: 247 HITPSYNHAPNPDKHWILRYTGPMKPVHMQFTNMLQRRRLQTLLSVDDSVEKVYNMLVET 306
Query: 303 RMLSNSIIVFVSDXXXXXXXXXXXXXSNWPLRGVKNTLWEGGVRGAGLIWSPLLESRGIV 362
L N+ I+++SD + P +E +R + P +E+ G +
Sbjct: 307 GELDNTYIIYMSDHGYHIGQFGLVKGKSMP--------YEFDIRIPFYVRGPNVEA-GAI 357
Query: 363 AEQYVHVSDWLPTLLSAANKSDIPNYVNS-TVENIIP--RYENSILRYENGTHEYNSPRI 419
V +D PTLL A DIP ++ ++ ++ R NS R+ H Y ++
Sbjct: 358 NPHIVLNTDLAPTLLDMAG-IDIPQDMDGKSILKLLETERPVNSFTRF----HSYKKAKL 412
Query: 420 ENSNTRYENG 429
+ E G
Sbjct: 413 WRDSFLVERG 422
Score = 38 (18.4 bits), Expect = 2.1e-05, Sum P(3) = 2.1e-05
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 678 FDIKNDPCEKNNLADRSEDQRINH 701
FD+ DP + N + +NH
Sbjct: 790 FDLNTDPYQLMNGVSTLDRDAVNH 813
Score = 37 (18.1 bits), Expect = 2.6e-05, Sum P(3) = 2.6e-05
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 784 FDIKNDPCEKNNLADRSEVQRINH 807
FD+ DP + N + +NH
Sbjct: 790 FDLNTDPYQLMNGVSTLDRDAVNH 813
>GENEDB_PFALCIPARUM|MAL13P1.44 [details] [associations]
symbol:MAL13P1.44 "protein phosphatase 2c-like
protein, putative" species:5833 "Plasmodium falciparum" [GO:0008287
"protein serine/threonine phosphatase complex" evidence=ISS]
[GO:0004722 "protein serine/threonine phosphatase activity"
evidence=ISS] [GO:0006468 "protein phosphorylation" evidence=ISS]
InterPro:IPR001932 Pfam:PF00481 SMART:SM00331 SMART:SM00332
GO:GO:0004722 GO:GO:0006468 Gene3D:3.60.40.10 SUPFAM:SSF81606
KO:K01090 EMBL:AL844509 InterPro:IPR015655 PANTHER:PTHR13832
GO:GO:0008287 RefSeq:XP_001349820.1 ProteinModelPortal:Q8IEM2
EnsemblProtists:MAL13P1.44:mRNA GeneID:813933 KEGG:pfa:MAL13P1.44
EuPathDB:PlasmoDB:PF3D7_1309200 ProtClustDB:CLSZ2432578
Uniprot:Q8IEM2
Length = 827
Score = 138 (53.6 bits), Expect = 2.1e-05, P = 2.1e-05
Identities = 69/278 (24%), Positives = 119/278 (42%)
Query: 430 THEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYNIPRLE 488
TH KY + E N + E EN EY ++ NTN + G + +N L+
Sbjct: 27 THSQKNKYRDAINKYAQENNSRGE--CENYCDEYYSRRSNNTNIKLNRGMKYSHNNNGLK 84
Query: 489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRG 546
+ + N + N S+D + N DGI + ++ N + N N++ ++ + SA +
Sbjct: 85 KNDHFNCNNSNISSDENENNMNDGISINNIKQNNLDNVNNVDYDNLNIKEKKEESAFDKW 144
Query: 547 KWKLVKENSINGNGTSENRSNDN-SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
K K K+NS + ++N ++D+ +Y+NE D + + N + N I N +
Sbjct: 145 KKKKKKKNSDQFSELAKNNNSDHVNYKNEKREYDNNNNNNNNNNNNNNNIFSN--NNCNN 202
Query: 606 SALTXXXXXXXXXXXXMRYQVDLTGGPDQ-VY-LSGLSDREWLALAMRKLRDAASIQCGP 663
S++ + DL G ++ V+ L GL+ RK D +
Sbjct: 203 SSIIYDNNVFSDNYKYYNDKCDLCNGQEKCVHRLGGLNCTHDEDDKTRKCTDENINKKLL 262
Query: 664 VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINH 701
+K E I + DI ND E NN+ + +E IN+
Sbjct: 263 IKND--EDSIDYSVDDI-NDEYENNNIIN-NESHIINN 296
>UNIPROTKB|Q8IEM2 [details] [associations]
symbol:MAL13P1.44 "Protein phosphatase 2c-like protein,
putative" species:36329 "Plasmodium falciparum 3D7" [GO:0004722
"protein serine/threonine phosphatase activity" evidence=ISS]
[GO:0006468 "protein phosphorylation" evidence=ISS] [GO:0008287
"protein serine/threonine phosphatase complex" evidence=ISS]
InterPro:IPR001932 Pfam:PF00481 SMART:SM00331 SMART:SM00332
GO:GO:0004722 GO:GO:0006468 Gene3D:3.60.40.10 SUPFAM:SSF81606
KO:K01090 EMBL:AL844509 InterPro:IPR015655 PANTHER:PTHR13832
GO:GO:0008287 RefSeq:XP_001349820.1 ProteinModelPortal:Q8IEM2
EnsemblProtists:MAL13P1.44:mRNA GeneID:813933 KEGG:pfa:MAL13P1.44
EuPathDB:PlasmoDB:PF3D7_1309200 ProtClustDB:CLSZ2432578
Uniprot:Q8IEM2
Length = 827
Score = 138 (53.6 bits), Expect = 2.1e-05, P = 2.1e-05
Identities = 69/278 (24%), Positives = 119/278 (42%)
Query: 430 THEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGT-HEYNIPRLE 488
TH KY + E N + E EN EY ++ NTN + G + +N L+
Sbjct: 27 THSQKNKYRDAINKYAQENNSRGE--CENYCDEYYSRRSNNTNIKLNRGMKYSHNNNGLK 84
Query: 489 NSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNID--DEWQISALTRG 546
+ + N + N S+D + N DGI + ++ N + N N++ ++ + SA +
Sbjct: 85 KNDHFNCNNSNISSDENENNMNDGISINNIKQNNLDNVNNVDYDNLNIKEKKEESAFDKW 144
Query: 547 KWKLVKENSINGNGTSENRSNDN-SYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQI 605
K K K+NS + ++N ++D+ +Y+NE D + + N + N I N +
Sbjct: 145 KKKKKKKNSDQFSELAKNNNSDHVNYKNEKREYDNNNNNNNNNNNNNNNIFSN--NNCNN 202
Query: 606 SALTXXXXXXXXXXXXMRYQVDLTGGPDQ-VY-LSGLSDREWLALAMRKLRDAASIQCGP 663
S++ + DL G ++ V+ L GL+ RK D +
Sbjct: 203 SSIIYDNNVFSDNYKYYNDKCDLCNGQEKCVHRLGGLNCTHDEDDKTRKCTDENINKKLL 262
Query: 664 VKEVPCEPQIAPCLFDIKNDPCEKNNLADRSEDQRINH 701
+K E I + DI ND E NN+ + +E IN+
Sbjct: 263 IKND--EDSIDYSVDDI-NDEYENNNIIN-NESHIINN 296
>DICTYBASE|DDB_G0279085 [details] [associations]
symbol:cycA "cyclin" species:44689 "Dictyostelium
discoideum" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004367
Pfam:PF02984 dictyBase:DDB_G0279085 GO:GO:0005634
GenomeReviews:CM000152_GR EMBL:AAFI02000027 Gene3D:1.10.472.10
InterPro:IPR013763 InterPro:IPR006671 Pfam:PF00134 SMART:SM00385
SUPFAM:SSF47954 eggNOG:COG5024 PROSITE:PS00292
RefSeq:XP_001134569.1 ProteinModelPortal:Q1ZXI1
EnsemblProtists:DDB0231774 GeneID:8621862 KEGG:ddi:DDB_G0279085
InParanoid:Q1ZXI1 OMA:ACAFFIA Uniprot:Q1ZXI1
Length = 588
Score = 136 (52.9 bits), Expect = 2.2e-05, P = 2.2e-05
Identities = 46/201 (22%), Positives = 83/201 (41%)
Query: 378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
+A N S+ N N+ NI N+I N + N+ N+N N + N K
Sbjct: 108 TATNNSNNNNNNNNN-NNINNNNNNNINIISNNNNNNNNNNNNNNNNNNNNNNNNNNNKL 166
Query: 438 ENRYENG--THEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
+++ NG E P N N + + N+ + +N +E P N+ N N
Sbjct: 167 KSQTVNGGIKTENLPSKNNNDNNSNSDDSNNSNKTNQTQQDNSNNEIAPPTKPNNNNNNN 226
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
+ N +N+N+ N + + L+ NE ++ N I +N ++ + N+
Sbjct: 227 NNNNNNNNNNNNNNNNNNN----LTENENNELNNIKNNNNNNNNNN----------NNNN 272
Query: 556 INGNGTSENRSNDNSYQNEID 576
N N + N +N+N N ++
Sbjct: 273 NNNNNNNNNNNNNNKENNSLE 293
Score = 123 (48.4 bits), Expect = 0.00056, P = 0.00056
Identities = 49/201 (24%), Positives = 82/201 (40%)
Query: 412 HEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYN-PK-YENRYENGTHEYNGPKNE 469
+ YN+ N+N Y N P N+ + N P + N N ++ N N
Sbjct: 63 NNYNNNNNNNNNNNYNNKNLMAKPIQSNKNNSIITASNIPSTFNNTATNNSNNNNN--NN 120
Query: 470 NTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWS------VLSRNE 523
N N N + NI N+ N N + N +N+N+ N + + S + + N
Sbjct: 121 NNNNINNNNNNNINIISNNNNNNNNNNNNNNNNNNNNNNNNNNNKLKSQTVNGGIKTENL 180
Query: 524 PSKRNTILHNIDDEWQISALTRGKWKLVKENSI------NGNGTSENRSNDNSYQNEIDG 577
PSK N ++ D+ S T + N I N N + N +N+N+ N +
Sbjct: 181 PSKNNNDNNSNSDDSNNSNKTNQTQQDNSNNEIAPPTKPNNNNNNNNNNNNNNNNNNNNN 240
Query: 578 IDVWSVLSRNEPSKRNTILHN 598
+ + L+ NE ++ N I +N
Sbjct: 241 NNN-NNLTENENNELNNIKNN 260
>DICTYBASE|DDB_G0291197 [details] [associations]
symbol:hbx3 "putative homeobox transcription factor"
species:44689 "Dictyostelium discoideum" [GO:0043565
"sequence-specific DNA binding" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0003700 "sequence-specific DNA binding transcription factor
activity" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0007275 "multicellular organismal development" evidence=IEA]
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001356
InterPro:IPR008422 InterPro:IPR009057 Pfam:PF05920 PROSITE:PS00027
PROSITE:PS50071 SMART:SM00389 dictyBase:DDB_G0291197 GO:GO:0007275
GO:GO:0005634 GO:GO:0043565 GO:GO:0003700 GO:GO:0006351
GenomeReviews:CM000154_GR Gene3D:1.10.10.60 SUPFAM:SSF46689
EMBL:AAFI02000175 eggNOG:NOG248144 RefSeq:XP_635379.1 HSSP:P40424
ProteinModelPortal:Q54F11 EnsemblProtists:DDB0220480 GeneID:8628027
KEGG:ddi:DDB_G0291197 InParanoid:Q54F11 Uniprot:Q54F11
Length = 667
Score = 136 (52.9 bits), Expect = 2.6e-05, P = 2.6e-05
Identities = 54/215 (25%), Positives = 96/215 (44%)
Query: 374 PTLLSAANK--SDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTH 431
P L+S+ SD+ + NS++ + P EN +L + Y + +I N ++N +
Sbjct: 89 PLLISSQTSYPSDLSS--NSSISHS-P-IENQLLDNNLDINNYLN-KINIFNNHFQN-SD 142
Query: 432 EYNPKYENRYENGTHEYNPKYENRYENGTHEYNG----PKNENTNPRYENGTHEYNIPRL 487
N + N++EN + N N EN ++ YN P N N N N + N
Sbjct: 143 LINTTFFNQFENNNYINN---NNNKENNSYFYNNNVNIPNNNNLNINNNNNNNNNNNNNN 199
Query: 488 ENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTILHNIDDEWQISALTRG 546
N+ N N + N +N+N+ N + + +V + N P+ N L N+ + LT
Sbjct: 200 NNNNNNNNNNNNNNNNNNNNNNNNNNNKNTVYNNVNIPNNNNFNL-NLSNNNNNLNLTNN 258
Query: 547 KWKLVKENSINGNGTS-ENRSNDNSYQNEIDGIDV 580
+NS+N N + N +N+N++ + +V
Sbjct: 259 N---NNKNSVNNNNVNISNNNNNNNFNVNLSNNNV 290
>UNIPROTKB|Q5LVA2 [details] [associations]
symbol:SPO0800 "Choline sulfatase, putative" species:246200
"Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur compound metabolic
process" evidence=ISS] [GO:0047753 "choline-sulfatase activity"
evidence=ISS] InterPro:IPR000917 InterPro:IPR017849
InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
Uniprot:Q5LVA2
Length = 482
Score = 102 (41.0 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 23/66 (34%), Positives = 39/66 (59%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ I++D+ + +G G + TPN+DALA G + + YT +C P+R+A+ TG
Sbjct: 5 PNLLVIVSDEHRKDAMGCAGHPIVKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGD 64
Query: 119 HPIHTG 124
TG
Sbjct: 65 WIHRTG 70
Score = 63 (27.2 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 14/52 (26%), Positives = 29/52 (55%)
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
D Y + + E ++ + + +D+ VG+V+ ALE N+++++VSD
Sbjct: 236 DAYFDAQKMRE--AKAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSD 285
Score = 57 (25.1 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 17/45 (37%), Positives = 23/45 (51%)
Query: 674 APCLFDIKNDPCEKNNLADRS-EDQRINHYTTE-VGRFNQIAYPD 716
AP LFD++ DP E +LA R+ ED + E R I P+
Sbjct: 397 APQLFDLERDPQELTDLAPRAAEDPDMRALLAEGEHRLRAICNPE 441
Score = 54 (24.1 bits), Expect = 5.5e-05, Sum P(4) = 5.5e-05
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 780 APCLFDIKNDPCEKNNLADRS 800
AP LFD++ DP E +LA R+
Sbjct: 397 APQLFDLERDPQELTDLAPRA 417
Score = 39 (18.8 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 14/47 (29%), Positives = 21/47 (44%)
Query: 183 GYW---TGHQDYFDHSAEEMKMWGLDMR---RDLEPAWDLHGKYSTD 223
G W TGH D A + + W D+R R++ LH + + D
Sbjct: 63 GDWIHRTGHWDSATPYAGQPRSWMHDLRDAGREVVSIGKLHFRATED 109
>TIGR_CMR|SPO_0800 [details] [associations]
symbol:SPO_0800 "choline sulfatase, putative"
species:246200 "Ruegeria pomeroyi DSS-3" [GO:0006790 "sulfur
compound metabolic process" evidence=ISS] [GO:0047753
"choline-sulfatase activity" evidence=ISS] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:CP000031
GenomeReviews:CP000031_GR Gene3D:3.40.720.10 SUPFAM:SSF53649
GO:GO:0006790 GO:GO:0047753 RefSeq:YP_166053.1
ProteinModelPortal:Q5LVA2 GeneID:3195931 KEGG:sil:SPO0800
PATRIC:23374875 HOGENOM:HOG000061225 ProtClustDB:CLSK279175
Uniprot:Q5LVA2
Length = 482
Score = 102 (41.0 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 23/66 (34%), Positives = 39/66 (59%)
Query: 60 PHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKNYYTVQ-LCTPSRSAIMTGK 118
P+++ I++D+ + +G G + TPN+DALA G + + YT +C P+R+A+ TG
Sbjct: 5 PNLLVIVSDEHRKDAMGCAGHPIVKTPNLDALAARGTMFEAAYTPSPMCVPTRAALATGD 64
Query: 119 HPIHTG 124
TG
Sbjct: 65 WIHRTG 70
Score = 63 (27.2 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 14/52 (26%), Positives = 29/52 (55%)
Query: 264 DHYLNIHRHIEDFKRSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
D Y + + E ++ + + +D+ VG+V+ ALE N+++++VSD
Sbjct: 236 DAYFDAQKMRE--AKAAYYGLTSFMDDCVGRVLAALEAGGKADNTVVLYVSD 285
Score = 57 (25.1 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 17/45 (37%), Positives = 23/45 (51%)
Query: 674 APCLFDIKNDPCEKNNLADRS-EDQRINHYTTE-VGRFNQIAYPD 716
AP LFD++ DP E +LA R+ ED + E R I P+
Sbjct: 397 APQLFDLERDPQELTDLAPRAAEDPDMRALLAEGEHRLRAICNPE 441
Score = 54 (24.1 bits), Expect = 5.5e-05, Sum P(4) = 5.5e-05
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 780 APCLFDIKNDPCEKNNLADRS 800
AP LFD++ DP E +LA R+
Sbjct: 397 APQLFDLERDPQELTDLAPRA 417
Score = 39 (18.8 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
Identities = 14/47 (29%), Positives = 21/47 (44%)
Query: 183 GYW---TGHQDYFDHSAEEMKMWGLDMR---RDLEPAWDLHGKYSTD 223
G W TGH D A + + W D+R R++ LH + + D
Sbjct: 63 GDWIHRTGHWDSATPYAGQPRSWMHDLRDAGREVVSIGKLHFRATED 109
>FB|FBgn0035445 [details] [associations]
symbol:CG12014 species:7227 "Drosophila melanogaster"
[GO:0004423 "iduronate-2-sulfatase activity" evidence=ISS]
[GO:0008152 "metabolic process" evidence=IEA] InterPro:IPR000917
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884 EMBL:AE014296
Gene3D:3.40.720.10 SUPFAM:SSF53649 InterPro:IPR024607
PROSITE:PS00523 HSSP:P15289 KO:K01136 GO:GO:0004423
GeneTree:ENSGT00640000091539 RefSeq:NP_647814.1 UniGene:Dm.15756
ProteinModelPortal:Q9VZP8 STRING:Q9VZP8 PRIDE:Q9VZP8
EnsemblMetazoa:FBtr0073077 GeneID:38423 KEGG:dme:Dmel_CG12014
UCSC:CG12014-RA FlyBase:FBgn0035445 InParanoid:Q9VZP8 OMA:ERVIPAY
OrthoDB:EOG45DV4P PhylomeDB:Q9VZP8 GenomeRNAi:38423 NextBio:808590
ArrayExpress:Q9VZP8 Bgee:Q9VZP8 Uniprot:Q9VZP8
Length = 512
Score = 134 (52.2 bits), Expect = 3.0e-05, P = 3.0e-05
Identities = 42/127 (33%), Positives = 61/127 (48%)
Query: 42 LAFTLSM-VFVDLVASSGPPHIIFILADDLGWNDVGFHGLDQIPTPNIDALAYSGIILKN 100
L +L M V +D A P+++ ++ DDL +G +G TP +D A I
Sbjct: 6 LLLSLMMPVLLDAAAPPRRPNVVMVIFDDLR-PVIGAYGDTLASTPYLDNFARGSHIFTR 64
Query: 101 YYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYGCERGGLPLSEKILPQYLKELGYRTRIV 159
Y+ Q LC PSR++++TG+ P + Y G + LPQY KE GY T
Sbjct: 65 VYSQQSLCAPSRNSLLTGRRPDTLHLYDFYSYWRTFTG---NFTTLPQYFKEHGYYTYSC 121
Query: 160 GK-WHLG 165
GK +H G
Sbjct: 122 GKVFHPG 128
>DICTYBASE|DDB_G0268506 [details] [associations]
symbol:DDB_G0268506 "putative histone-like
transcription factor" species:44689 "Dictyostelium discoideum"
[GO:0046982 "protein heterodimerization activity" evidence=IEA]
[GO:0043565 "sequence-specific DNA binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR003958
InterPro:IPR009072 Pfam:PF00808 dictyBase:DDB_G0268506
GO:GO:0005634 GO:GO:0043565 EMBL:AAFI02000003 Gene3D:1.10.20.10
SUPFAM:SSF47113 eggNOG:COG5208 ProtClustDB:CLSZ2846877
RefSeq:XP_647243.3 ProteinModelPortal:Q55GE1
EnsemblProtists:DDB0304567 GeneID:8616048 KEGG:ddi:DDB_G0268506
InParanoid:Q55GE1 OMA:DENEEDQ Uniprot:Q55GE1
Length = 1120
Score = 138 (53.6 bits), Expect = 3.1e-05, P = 3.1e-05
Identities = 34/169 (20%), Positives = 67/169 (39%)
Query: 405 LRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYN 464
+ ++ G H +N + +N + Y + Y+ N N + N N N + N
Sbjct: 856 INHQLGMHHHNPHQNQNQHPMYSHQFQNYSQVAFNNNNNNNNNNNNNNNNNNNNNNNNNN 915
Query: 465 GPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEP 524
N N N N + N N+ N N ++ N +N+N+ N + + + + N
Sbjct: 916 NNNNNNNNNNSNNSNNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNN--NSNNNNNS 973
Query: 525 SKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQN 573
+ N +N + + + + N+ N N + N +N+N+ N
Sbjct: 974 NNSNNNNNNNYNNYNGNNNNYNNYNSSSNNNSNNNNNNNNNNNNNNNNN 1022
Score = 133 (51.9 bits), Expect = 0.00011, P = 0.00011
Identities = 46/200 (23%), Positives = 75/200 (37%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N ++ N N+ N N+ N ++ N+ NSN N + N N
Sbjct: 906 NNNNNNNNNNNNNNNNNNNNSNNSNNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNN 965
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNE--NTNPRYENGTHEYNIPRLENSINGNGTSE 498
N + N N N + YNG N N N N ++ N N+ N N +
Sbjct: 966 NSNNNNNSNNSNNNN-NNNYNNYNGNNNNYNNYNSSSNNNSNNNNNNNNNNNNNNNNNNN 1024
Query: 499 NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSING 558
N +N+N+ N +G + + ++ +P N + I+ NS N
Sbjct: 1025 NNNNNNNSNNNNNGNNNFENINPFQP--HNHMQSQYYYNQSINQYQNQNHNNNNNNSNNN 1082
Query: 559 NGTSENRSN--DNSYQNEID 576
N ++N +N Y+NE D
Sbjct: 1083 NSNNQNSNNIYTRQYENEED 1102
Score = 129 (50.5 bits), Expect = 0.00029, P = 0.00029
Identities = 39/181 (21%), Positives = 70/181 (38%)
Query: 394 ENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYENGTHEYNPKYE 453
+N P Y + Y N+ N+N N + N N N + N
Sbjct: 871 QNQHPMYSHQFQNYSQVAFNNNNNN-NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNNS 929
Query: 454 NRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQ-NEIDG 512
N N ++ N N N+N N + N N+ N N S N +N+N+ N +G
Sbjct: 930 NNSNNSSNNNNNNNNNNSNNNNNNNNNNNNNNNNNNNSNNNNNSNNSNNNNNNNYNNYNG 989
Query: 513 IDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNSYQ 572
+ + + N S N+ +N ++ + N+ N N ++ N + +N+++
Sbjct: 990 NNN-NYNNYNSSSNNNSNNNNNNNNNNNNNNNNNN-----NNNNNNNNSNNNNNGNNNFE 1043
Query: 573 N 573
N
Sbjct: 1044 N 1044
>DICTYBASE|DDB_G0278995 [details] [associations]
symbol:DDB_G0278995 species:44689 "Dictyostelium
discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] InterPro:IPR015880
SMART:SM00355 dictyBase:DDB_G0278995 EMBL:AAFI02000026
GO:GO:0008270 GO:GO:0005622 RefSeq:XP_641895.1
EnsemblProtists:DDB0215287 GeneID:8621821 KEGG:ddi:DDB_G0278995
OMA:RRPERYQ Uniprot:Q54XF5
Length = 1055
Score = 143 (55.4 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
Identities = 48/193 (24%), Positives = 75/193 (38%)
Query: 381 NKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENR 440
N+ P Y NS N+ N++ N + N+ +N N + N KY N
Sbjct: 838 NQQSSPQYYNSLNMNV----NNNVNGNNNNNNNNNNNNNNINNNINNNNNNNVNSKYNNN 893
Query: 441 YENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENR 500
N + N N N + N N N N N + YN NS N N + N
Sbjct: 894 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN-SNSNNNYNN---NNSNNNNNNNNNN 949
Query: 501 SNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNG 560
+N+N+ N + + + + N P N+ N+ S R + N+IN +
Sbjct: 950 NNNNNNNNNNNNNNNNNNNNSNFPGN-NSNYCNLSVNNSTSPFNRPQTPPKPINNINISN 1008
Query: 561 TSENRSNDNSYQN 573
+ N SN+N+ N
Sbjct: 1009 NNNN-SNNNNINN 1020
Score = 45 (20.9 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
Identities = 17/51 (33%), Positives = 23/51 (45%)
Query: 245 YLAHAATHSANPY---EPLQAPDHYLNIHRHIEDFKRSKFAAILHKLDESV 292
YL H H N + E L D Y N + DF +FA +LD++V
Sbjct: 271 YLFHLKIHENNNHCLLENLLQNDGYSNQNN---DFFSGEFATESGQLDQTV 318
>DICTYBASE|DDB_G0271832 [details] [associations]
symbol:DDB_G0271832 "Zinc finger CCHC
domain-containing protein 7" species:44689 "Dictyostelium
discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR001878
PROSITE:PS50158 SMART:SM00343 dictyBase:DDB_G0271832 GO:GO:0008270
GO:GO:0003676 EMBL:AAFI02000006 Gene3D:4.10.60.10 SUPFAM:SSF57756
RefSeq:XP_645525.1 ProteinModelPortal:Q55AJ7
EnsemblProtists:DDB0216923 GeneID:8618154 KEGG:ddi:DDB_G0271832
eggNOG:NOG260401 InParanoid:Q55AJ7 Uniprot:Q55AJ7
Length = 772
Score = 136 (52.9 bits), Expect = 3.2e-05, P = 3.2e-05
Identities = 54/207 (26%), Positives = 84/207 (40%)
Query: 387 NYVNSTVENIIPRYE---NSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKYENRYEN 443
N NST NI R++ N RY N + YN+ NS Y N H+ N + +Y++
Sbjct: 473 NRYNST--NINNRFDGKYNKNNRYNNNNNNYNN---NNSYNDYSNYNHKNNKDF-GKYQD 526
Query: 444 GTHEYNPKYENRY------------ENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSI 491
G+ +YN +N Y + +H N N N N N + N E+S
Sbjct: 527 GSDDYNDDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNNKNNSNNSNKESSE 586
Query: 492 NGNGTSENRSNDNSYQN---EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKW 548
N + + N Q E D D + K N I +D + ++ K
Sbjct: 587 EKNDKKKKKKNKKGNQEKEKEKDKKDKNHKTRERDREKDNDIDSMVDLD-KVKNNNNNKN 645
Query: 549 KLVKENSINGNGTSENRSNDNSYQNEI 575
K +N+ N N + N +N+N+ N+I
Sbjct: 646 KNNNKNN-NNNNNNNNNNNNNNNNNKI 671
Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
Identities = 48/209 (22%), Positives = 88/209 (42%)
Query: 399 RYENSILRYENGTHEYNSPRIEN-------SNTRYENGTHEYNPKYE-NRYENGTHEYNP 450
RY+ + R+ ++ YNS I N N RY N + YN N Y N H+ N
Sbjct: 461 RYDRNE-RFNYNSNRYNSTNINNRFDGKYNKNNRYNNNNNNYNNNNSYNDYSNYNHKNNK 519
Query: 451 KYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTSENRSNDNSYQNEI 510
+ +Y++G+ +YN +++ N ++ + + S N N + N N++S N+
Sbjct: 520 DF-GKYQDGSDDYN---DDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNNKN 575
Query: 511 DGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSINGNGTSENRSNDNS 570
+ + S E S+ +D+ + +G + KE + + R D
Sbjct: 576 NSNN-----SNKESSEEK------NDKKKKKKNKKGNQEKEKEKD-KKDKNHKTRERDRE 623
Query: 571 YQNEIDG-IDVWSVLSRNEPSKRNTILHN 598
N+ID +D+ V + N +N +N
Sbjct: 624 KDNDIDSMVDLDKVKNNNNNKNKNNNKNN 652
Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
Identities = 43/186 (23%), Positives = 88/186 (47%)
Query: 421 NSNTRYE-NGTHEYNPKYE--NRYENGTHEYNPKYENRYENGTHEYNGPKNE-NTNPRYE 476
N+N RY+ N ++ N +Y+ +RY+ + PK + + N +YN + + N R+
Sbjct: 411 NNNNRYDRNDRYDRNDRYDRYDRYDRYDKDGFPK-DIDHSNNNGQYNQDYHRYDRNERFN 469
Query: 477 NGTHEYNIPRLENSINGNGTSENR--SNDNSYQNEIDGIDVWSVLSRNEPS----KRNTI 530
++ YN + N +G NR +N+N+Y N D + +N + +
Sbjct: 470 YNSNRYNSTNINNRFDGKYNKNNRYNNNNNNYNNNNSYNDYSNYNHKNNKDFGKYQDGSD 529
Query: 531 LHNIDDE--WQI--SALTRGKWKLVKENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSR 586
+N DD+ +++ S + K + N+ N N ++N S++N+ +N + + S +
Sbjct: 530 DYNDDDQNDYRVKDSYSRKSKKQKTSHNNNNNNNNNDNNSSNNN-KNNSNNSNKESSEEK 588
Query: 587 NEPSKR 592
N+ K+
Sbjct: 589 NDKKKK 594
>DICTYBASE|DDB_G0289337 [details] [associations]
symbol:DDB_G0289337 species:44689 "Dictyostelium
discoideum" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
dictyBase:DDB_G0289337 GO:GO:0000166 Gene3D:3.30.70.330
GO:GO:0003676 EMBL:AAFI02000139 eggNOG:NOG145324 RefSeq:XP_636266.1
ProteinModelPortal:Q54HN5 EnsemblProtists:DDB0188369 GeneID:8627085
KEGG:ddi:DDB_G0289337 InParanoid:Q54HN5 OMA:MESINIS Uniprot:Q54HN5
Length = 1528
Score = 139 (54.0 bits), Expect = 3.5e-05, P = 3.5e-05
Identities = 48/193 (24%), Positives = 80/193 (41%)
Query: 371 DWLPTLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGT 430
D PT+ K I N ++ EN + EN +N + N EN N
Sbjct: 1045 DRSPTIKKNKEKEIIKNNHDNDNENE-NKNENE-KENDNQNEKENENENENKNKNENKNE 1102
Query: 431 HEYNPKYENRYENGTHEYNP-KYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLEN 489
+E + EN EN N + EN+ EN + N +N+N N +E + N
Sbjct: 1103 NEIKNENENENENENENENENENENKNENENEKENENENKNKNENVNENKNEQEEEKENN 1162
Query: 490 SINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWK 549
+ N N + N +N+N+ N + +S + + +N +L I+++ + +
Sbjct: 1163 NNNNNNNNNNNNNNNNNNNNNNNNRKQKNISEQKDNPKNELLIGIENKEKKIIVNSNLEN 1222
Query: 550 LVKENSINGNGTS 562
ENSI GN +S
Sbjct: 1223 DQDENSIVGNLSS 1235
Score = 127 (49.8 bits), Expect = 0.00068, P = 0.00068
Identities = 61/228 (26%), Positives = 101/228 (44%)
Query: 381 NKSDIPNYVNSTVENI--IPRYENS-ILRYENGT--HEYNSPRIENSNTRYENGTHEYNP 435
N ++ N NST N +P E+S L N T + S I+ S T +N E
Sbjct: 1001 NNNNNNNNNNSTNLNQSKVPTNESSSTLTASNDTIIKNFRSFEIDRSPTIKKNKEKEI-- 1058
Query: 436 KYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNG 495
+N ++N N K EN EN N +NEN N EN N + EN I
Sbjct: 1059 -IKNNHDNDNENEN-KNENEKENDNQ--NEKENENEN---ENKNKNEN--KNENEIKNEN 1109
Query: 496 TSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENS 555
+EN N+N +NE + + + NE K N + +E ++ + + + KEN+
Sbjct: 1110 ENENE-NENENENENENENK----NENENEKENENENKNKNE-NVNE-NKNEQEEEKENN 1162
Query: 556 INGNGTSENRSNDNSYQNEIDGID-VWSVLSRNEPSKRNTILHNIDDE 602
N N + N +N+N+ N + + +S + + +N +L I+++
Sbjct: 1163 NNNNNNNNNNNNNNNNNNNNNNNNRKQKNISEQKDNPKNELLIGIENK 1210
>DICTYBASE|DDB_G0272108 [details] [associations]
symbol:DDB_G0272108 "RNA-binding region RNP-1
domain-containing protein" species:44689 "Dictyostelium discoideum"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
"nucleotide binding" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
SMART:SM00360 dictyBase:DDB_G0272108 GO:GO:0000166
Gene3D:3.30.70.330 GenomeReviews:CM000151_GR GO:GO:0003676
EMBL:AAFI02000008 RefSeq:XP_645138.2 ProteinModelPortal:Q55A46
EnsemblProtists:DDB0220129 GeneID:8618308 KEGG:ddi:DDB_G0272108
InParanoid:Q55A46 Uniprot:Q55A46
Length = 469
Score = 126 (49.4 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
Identities = 34/111 (30%), Positives = 50/111 (45%)
Query: 409 NGTHEYNSPRIENSNT-RYENGTHEYNPKYENRYENGTHEYNPKYENRYENGTHEYNGPK 467
NG R + NT +E+G +E +NR N + YN N Y+NG NG
Sbjct: 97 NGQERDGIKRFRSDNTTNFEDGEYEEQVMNDNRNNNNNNNYNNS-NNNYKNGNENGNGNG 155
Query: 468 NENTNPRYENGTHE---YNIPRLENSING-NGTSENRSNDNSYQNEIDGID 514
N N +P H+ +N EN+ N N + N +N+N+ N +G D
Sbjct: 156 NGNGSPYGMVERHKPPPFNYENGENNDNKYNNNNNNNNNNNNNNNNNNGFD 206
Score = 53 (23.7 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
Identities = 17/52 (32%), Positives = 25/52 (48%)
Query: 554 NSINGNGTSE--NRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEW 603
N+IN N TS N N N+ N I S+ S +E N+ L +D ++
Sbjct: 374 NNINNNNTSSSYNNYNSNNVNNSIQFSPTTSI-SNSETISSNSNLP-VDQDY 423
Score = 41 (19.5 bits), Expect = 0.00061, Sum P(2) = 0.00061
Identities = 10/33 (30%), Positives = 18/33 (54%)
Query: 564 NRSNDNSYQNEIDGIDVWSVLSR-NEPSKRNTI 595
N +N+N+Y N I+ + S + N + N+I
Sbjct: 365 NNNNNNNYNNNINNNNTSSSYNNYNSNNVNNSI 397
>UNIPROTKB|H7C3P4 [details] [associations]
symbol:GNS "Glucosamine (N-acetyl)-6-sulfatase (Sanfilippo
disease IIID), isoform CRA_b" species:9606 "Homo sapiens"
[GO:0005764 "lysosome" evidence=IEA] [GO:0008449
"N-acetylglucosamine-6-sulfatase activity" evidence=IEA]
[GO:0030203 "glycosaminoglycan metabolic process" evidence=IEA]
InterPro:IPR000917 InterPro:IPR012251 InterPro:IPR015981
InterPro:IPR017849 InterPro:IPR017850 Pfam:PF00884
PIRSF:PIRSF036666 EMBL:CH471054 GO:GO:0005764 Gene3D:3.40.720.10
SUPFAM:SSF53649 InterPro:IPR024607 PROSITE:PS00523 PROSITE:PS00149
GO:GO:0030203 GO:GO:0008449 PANTHER:PTHR10342:SF5 EMBL:AC025262
UniGene:Hs.334534 HGNC:HGNC:4422 ChiTaRS:GNS SMR:H7C3P4
Ensembl:ENST00000418919 Uniprot:H7C3P4
Length = 496
Score = 128 (50.1 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
Identities = 53/182 (29%), Positives = 78/182 (42%)
Query: 82 QIPTPNIDAL-AYSGIILKNYYTVQ-LCTPSRSAIMTGKHPIHTGMQHNVLYG-CE-RGG 137
Q P AL G+ + Y LC PSR++I+TGK+P + + +N L G C +
Sbjct: 8 QTPLKKTKALIGEMGMTFSSAYVPSALCCPSRASILTGKYPHNHHVVNNTLEGNCSSKSW 67
Query: 138 LPLSE-KILPQYLKEL-GYRTRIVGKWHLGFYKKEY-TPTFRGFES-HLG--YWTGHQDY 191
+ E P L+ + GY+T GK Y EY P G E LG YW +
Sbjct: 68 QKIQEPNTFPAILRSMCGYQTFFAGK-----YLNEYGAPDAGGLEHVPLGWSYWYALEKN 122
Query: 192 FDHSAEEMKMWGLDMRRDLEPAWDLHGKYSTDVFTAEAVDIIHNHSTDEPLFLYLAHAAT 251
+ + + G + + D Y TDV ++D + S EP F+ +A A
Sbjct: 123 SKYYNYTLSINGKARKHGENYSVD----YLTDVLANVSLDFLDYKSNFEPFFMMIATPAP 178
Query: 252 HS 253
HS
Sbjct: 179 HS 180
Score = 51 (23.0 bits), Expect = 4.2e-05, Sum P(2) = 4.2e-05
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 278 RSKFAAILHKLDESVGKVVEALEQRRMLSNSIIVFVSD 315
R ++ +L +D+ V K+V+ LE L+N+ I + SD
Sbjct: 234 RKRWQTLL-SVDDLVEKLVKRLEFTGELNNTYIFYTSD 270
>DICTYBASE|DDB_G0284187 [details] [associations]
symbol:Dd5P2 "inositol 5-phosphatase" species:44689
"Dictyostelium discoideum" [GO:0046856 "phosphatidylinositol
dephosphorylation" evidence=IDA] [GO:0046855 "inositol phosphate
dephosphorylation" evidence=IDA] [GO:0034485
"phosphatidylinositol-3,4,5-trisphosphate 5-phosphatase activity"
evidence=IDA] [GO:0004445 "inositol-polyphosphate 5-phosphatase
activity" evidence=IDA] [GO:0004439
"phosphatidylinositol-4,5-bisphosphate 5-phosphatase activity"
evidence=IDA] [GO:0046854 "phosphatidylinositol phosphorylation"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR000300 SMART:SM00128 dictyBase:DDB_G0284187
INTERPRO:IPR000408 Pfam:PF00415 GenomeReviews:CM000153_GR
Gene3D:2.130.10.30 InterPro:IPR009091 SUPFAM:SSF50985
PRINTS:PR00633 PROSITE:PS00626 PROSITE:PS50012 InterPro:IPR005135
SUPFAM:SSF56219 EMBL:AAFI02000064 GO:GO:0046854 GO:GO:0046855
GO:GO:0004439 GO:GO:0046856 GO:GO:0004445 eggNOG:COG5411
GO:GO:0034485 RefSeq:XP_638694.1 ProteinModelPortal:Q54PV1
EnsemblProtists:DDB0191414 GeneID:8624525 KEGG:ddi:DDB_G0284187
InParanoid:Q54PV1 OMA:SHEKMER Uniprot:Q54PV1
Length = 1800
Score = 139 (54.0 bits), Expect = 4.2e-05, P = 4.2e-05
Identities = 53/226 (23%), Positives = 92/226 (40%)
Query: 378 SAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYNPKY 437
S+++ ++ N + + +I PR S +N E +ENS N + N
Sbjct: 1273 SSSSNNNSTNNLGDYISSISPRAITSTTLTKNPKQEIER-ELENS-VNNSNNNNSINNNS 1330
Query: 438 ENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYENGTHEYNIPRLENSINGNGTS 497
N N T+ N N N T+ N N N N N + N N+ N N +
Sbjct: 1331 NNNNNNNTNNNNNTNNN---NNTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1387
Query: 498 ENRSNDNSYQN-EIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALTRGKWKLVKENSI 556
N +N+NS +N + + + S + N +N I+ NI + Q++ K E +
Sbjct: 1388 NNNNNNNSDKNSDSEEASIGSGILGNIDDIQN-IIGNIKNGDQVNKNLNHKKSNSVEVVV 1446
Query: 557 NGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDE 602
+ EN SND Y +D ++ + N + N +N +++
Sbjct: 1447 VEHHDEENCSNDIFYIEPFTIVDQYTNNNNNNNNNNNNNNNNNNND 1492
Score = 129 (50.5 bits), Expect = 0.00050, P = 0.00050
Identities = 51/212 (24%), Positives = 82/212 (38%)
Query: 375 TLLSAANKSDIPNYVNSTVENIIPRYENSILRYENGTHEYNSPRIENSNTRYENGTHEYN 434
T L+ K +I + ++V N NSI N + N+ N+NT N T+ N
Sbjct: 1299 TTLTKNPKQEIERELENSVNN--SNNNNSINNNSNNNNNNNTNN--NNNTNNNNNTNNNN 1354
Query: 435 PKYENRYENGTHEYNPKYENRYENGTHEYNGPKNENTNPRYE-NGTHEY---------NI 484
N N + N N N + N N N N + N E NI
Sbjct: 1355 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSDKNSDSEEASIGSGILGNI 1414
Query: 485 PRLENSINGNGTSENRSNDNSYQNEIDGIDVWSVLSRNEPSKRNTILHNIDDEWQISALT 544
++N I GN + ++ N N + + ++V V +E + N I + I+ + T
Sbjct: 1415 DDIQNII-GNIKNGDQVNKNLNHKKSNSVEVVVVEHHDEENCSNDIFY-IEPFTIVDQYT 1472
Query: 545 RGKWKLVKENSINGNGTSENRSNDNSYQNEID 576
N+ N N + + +NDN+ N D
Sbjct: 1473 NNNNNNNNNNNNNNNNNNNDNNNDNNNDNNND 1504
WARNING: HSPs involving 73 database sequences were not reported due to the
limiting value of parameter B = 250.
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.316 0.135 0.416 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 905 822 0.00079 122 3 11 22 0.44 34
37 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 323
No. of states in DFA: 629 (67 KB)
Total size of DFA: 478 KB (2218 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 79.72u 0.10s 79.82t Elapsed: 00:00:38
Total cpu time: 79.82u 0.10s 79.92t Elapsed: 00:00:38
Start: Thu Aug 15 12:35:45 2013 End: Thu Aug 15 12:36:23 2013
WARNINGS ISSUED: 2