Citrus Sinensis ID: 045385


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------19
MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKECPISSSDSSN
cccccccccEEEEEcccHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccHHHHHHHHHHcccEEEEcccccccccccccccHHHEEEcccccEEEEEEEccEEEEccccccccEEccHHHHHHHHHHHHHccEEEEEEEcccccccccccccccc
ccccccccEEEEHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHccHHHHHHHHHcccccccHHHHcccccccccccccccHHHHHHHHHHcccEEEEcccccHHHHccccHHcEEEEEEccccEEEEEEEccEEEEccccccccccccHHHHHHHHHHHHHcccEEEEEEcccccccccccccccc
megasnggmlyHEVQESKLCAVHCVNtvlqgpffsefDLAALASDLDRSERQVMLegrvpaassgdflteeshnvsldgdFSIQVLQKALEVWdlqviplncpiaepaqidpeLENAFICHLHDHWFCIrkvngewynfdslyaapeHLSKFYLSAYLDSLKGFGWSIFIVrgnfpkecpisssdssn
megasnggMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPkecpisssdssn
MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKECPISSSDSSN
*********LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASD****************************NVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFP************
***********HEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRS*****************FLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKEC*********
********MLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKEC*********
******GGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKECP********
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKECPISSSDSSN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query188 2.2.26 [Sep-21-2011]
Q9M391280 Ataxin-3 homolog OS=Arabi yes no 0.962 0.646 0.862 3e-90
Q8LQ36 336 Putative ataxin-3 homolog yes no 0.978 0.547 0.760 6e-78
P54252 364 Ataxin-3 OS=Homo sapiens yes no 0.888 0.458 0.437 5e-38
Q9CVD2 355 Ataxin-3 OS=Mus musculus yes no 0.888 0.470 0.437 7e-38
O35815 355 Ataxin-3 OS=Rattus norveg yes no 0.888 0.470 0.437 7e-38
Q9W689 363 Ataxin-3 OS=Gallus gallus yes no 0.888 0.460 0.437 1e-37
Q9H3M9 355 Putative ataxin-3-like pr no no 0.888 0.470 0.431 9e-35
O17850 317 Ataxin-3 homolog OS=Caeno yes no 0.851 0.504 0.339 2e-24
Q60XN1 319 Ataxin-3 homolog OS=Caeno N/A no 0.845 0.498 0.323 3e-22
O82391 360 Josephin-like protein OS= no no 0.675 0.352 0.257 9e-07
>sp|Q9M391|ATX3H_ARATH Ataxin-3 homolog OS=Arabidopsis thaliana GN=At3g54130 PE=2 SV=1 Back     alignment and function desciption
 Score =  330 bits (846), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 157/182 (86%), Positives = 166/182 (91%), Gaps = 1/182 (0%)

Query: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGR-V 59
           ME  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD  ERQVMLEG  V
Sbjct: 1   MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query: 60  PAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFI 119
              + GDFL EESHNVSL GDFSIQVLQKALEVWDLQVIPLNCP AEPAQIDPELE+AFI
Sbjct: 61  GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query: 120 CHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKEC 179
           CHLHDHWFCIRKVNGEWYNFDSL AAP+HLSKFYLSA+LDSLKG GWSIFIV+GNFP+EC
Sbjct: 121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query: 180 PI 181
           P+
Sbjct: 181 PM 182




Interacts with key regulators of transcription and represses transcription. Acts as a histone-binding protein that regulates transcription. Acts as a deubiquitinating enzyme.
Arabidopsis thaliana (taxid: 3702)
EC: 3EC: .EC: 4EC: .EC: 1EC: 9EC: .EC: 1EC: 2
>sp|Q8LQ36|ATX3_ORYSJ Putative ataxin-3 homolog OS=Oryza sativa subsp. japonica GN=Os01g0851400 PE=3 SV=1 Back     alignment and function description
>sp|P54252|ATX3_HUMAN Ataxin-3 OS=Homo sapiens GN=ATXN3 PE=1 SV=4 Back     alignment and function description
>sp|Q9CVD2|ATX3_MOUSE Ataxin-3 OS=Mus musculus GN=Atxn3 PE=1 SV=2 Back     alignment and function description
>sp|O35815|ATX3_RAT Ataxin-3 OS=Rattus norvegicus GN=Atxn3 PE=1 SV=1 Back     alignment and function description
>sp|Q9W689|ATX3_CHICK Ataxin-3 OS=Gallus gallus GN=ATXN3 PE=2 SV=1 Back     alignment and function description
>sp|Q9H3M9|ATX3L_HUMAN Putative ataxin-3-like protein OS=Homo sapiens GN=ATXN3L PE=5 SV=2 Back     alignment and function description
>sp|O17850|ATX3_CAEEL Ataxin-3 homolog OS=Caenorhabditis elegans GN=atx-3 PE=1 SV=1 Back     alignment and function description
>sp|Q60XN1|ATX3_CAEBR Ataxin-3 homolog OS=Caenorhabditis briggsae GN=atx-3 PE=3 SV=1 Back     alignment and function description
>sp|O82391|JOSL_ARATH Josephin-like protein OS=Arabidopsis thaliana GN=At2g29640 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query188
449439932 426 PREDICTED: ataxin-3 homolog [Cucumis sat 0.978 0.431 0.882 2e-96
388500356 345 unknown [Lotus japonicus] 0.984 0.536 0.872 1e-94
356496152 338 PREDICTED: ataxin-3 homolog [Glycine max 0.962 0.535 0.893 6e-94
356504297 336 PREDICTED: ataxin-3 homolog [Glycine max 0.957 0.535 0.893 6e-93
224140363280 predicted protein [Populus trichocarpa] 1.0 0.671 0.851 2e-91
255573895266 Ataxin-3, putative [Ricinus communis] gi 0.984 0.695 0.856 3e-91
15232434280 Ataxin-3 like protein [Arabidopsis thali 0.962 0.646 0.862 2e-88
147823359 445 hypothetical protein VITISV_014341 [Viti 0.984 0.415 0.824 2e-88
357468589 330 Ataxin-3-like protein [Medicago truncatu 0.941 0.536 0.864 3e-88
225440596272 PREDICTED: ataxin-3 homolog [Vitis vinif 0.984 0.680 0.824 4e-88
>gi|449439932|ref|XP_004137739.1| PREDICTED: ataxin-3 homolog [Cucumis sativus] gi|449483438|ref|XP_004156591.1| PREDICTED: ataxin-3 homolog [Cucumis sativus] Back     alignment and taxonomy information
 Score =  357 bits (915), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 166/188 (88%), Positives = 178/188 (94%), Gaps = 4/188 (2%)

Query: 1   MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVP 60
           M+GA NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDR ERQ+ML G   
Sbjct: 1   MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSG--- 57

Query: 61  AASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFIC 120
            +++GDFL+EESHNVSLDGDFSIQVLQKALEVWDLQVIPLN P+AEPAQIDP+LENAFIC
Sbjct: 58  -STTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPQLENAFIC 116

Query: 121 HLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKECP 180
           HL DHWFCIRKVNGEWYNFDSLYAAP+HLSKFYLSAYLDSLKGFGWSIFIVRGNFPK+ P
Sbjct: 117 HLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFP 176

Query: 181 ISSSDSSN 188
           ISSS++SN
Sbjct: 177 ISSSEASN 184




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388500356|gb|AFK38244.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|356496152|ref|XP_003516934.1| PREDICTED: ataxin-3 homolog [Glycine max] Back     alignment and taxonomy information
>gi|356504297|ref|XP_003520933.1| PREDICTED: ataxin-3 homolog [Glycine max] Back     alignment and taxonomy information
>gi|224140363|ref|XP_002323552.1| predicted protein [Populus trichocarpa] gi|222868182|gb|EEF05313.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255573895|ref|XP_002527866.1| Ataxin-3, putative [Ricinus communis] gi|223532717|gb|EEF34497.1| Ataxin-3, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|15232434|ref|NP_190981.1| Ataxin-3 like protein [Arabidopsis thaliana] gi|29336976|sp|Q9M391.1|ATX3H_ARATH RecName: Full=Ataxin-3 homolog; AltName: Full=MJD1a-like; AltName: Full=Machado-Joseph disease-like protein gi|6822059|emb|CAB70987.1| Machado-Joseph disease MJD1a-like protein [Arabidopsis thaliana] gi|38564306|gb|AAR23732.1| At3g54130 [Arabidopsis thaliana] gi|38604054|gb|AAR24770.1| At3g54130 [Arabidopsis thaliana] gi|332645670|gb|AEE79191.1| Ataxin-3 like protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|147823359|emb|CAN64200.1| hypothetical protein VITISV_014341 [Vitis vinifera] Back     alignment and taxonomy information
>gi|357468589|ref|XP_003604579.1| Ataxin-3-like protein [Medicago truncatula] gi|355505634|gb|AES86776.1| Ataxin-3-like protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|225440596|ref|XP_002277594.1| PREDICTED: ataxin-3 homolog [Vitis vinifera] gi|297740255|emb|CBI30437.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query188
TAIR|locus:2080330280 AT3G54130 [Arabidopsis thalian 1.0 0.671 0.852 1.1e-85
UNIPROTKB|E2QUQ9 359 ATXN3 "Uncharacterized protein 0.898 0.470 0.435 2.7e-36
UNIPROTKB|F5H211 370 ATXN3 "Ataxin-3" [Homo sapiens 0.898 0.456 0.435 2.7e-36
UNIPROTKB|P54252 364 ATXN3 "Ataxin-3" [Homo sapiens 0.898 0.464 0.435 2.7e-36
UNIPROTKB|Q06AV3 353 ATXN3 "Uncharacterized protein 0.898 0.478 0.435 2.7e-36
MGI|MGI:1099442 355 Atxn3 "ataxin 3" [Mus musculus 0.898 0.476 0.435 4.4e-36
RGD|621567 355 Atxn3 "ataxin 3" [Rattus norve 0.898 0.476 0.435 4.4e-36
UNIPROTKB|Q9W689 363 ATXN3 "Ataxin-3" [Gallus gallu 0.898 0.465 0.435 9.2e-36
UNIPROTKB|G3V3R7 329 ATXN3 "Ataxin-3" [Homo sapiens 0.893 0.510 0.435 1.9e-35
UNIPROTKB|F1N3H4 369 ATXN3 "Uncharacterized protein 0.898 0.457 0.432 2.4e-35
TAIR|locus:2080330 AT3G54130 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 857 (306.7 bits), Expect = 1.1e-85, P = 1.1e-85
 Identities = 162/190 (85%), Positives = 173/190 (91%)

Query:     1 MEGASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGR-V 59
             ME  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD  ERQVMLEG  V
Sbjct:     1 MERTSNGGMLYHEVQESNLCAVHCVNTVLQGPFFSEFDLAAVAADLDGKERQVMLEGAAV 60

Query:    60 PAASSGDFLTEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFI 119
                + GDFL EESHNVSL GDFSIQVLQKALEVWDLQVIPLNCP AEPAQIDPELE+AFI
Sbjct:    61 GGFAPGDFLAEESHNVSLGGDFSIQVLQKALEVWDLQVIPLNCPDAEPAQIDPELESAFI 120

Query:   120 CHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKEC 179
             CHLHDHWFCIRKVNGEWYNFDSL AAP+HLSKFYLSA+LDSLKG GWSIFIV+GNFP+EC
Sbjct:   121 CHLHDHWFCIRKVNGEWYNFDSLLAAPQHLSKFYLSAFLDSLKGAGWSIFIVKGNFPQEC 180

Query:   180 PISSS-DSSN 188
             P+SSS ++SN
Sbjct:   181 PMSSSSEASN 190




GO:0005634 "nucleus" evidence=ISM
GO:0008242 "omega peptidase activity" evidence=IEA
GO:0010200 "response to chitin" evidence=RCA
GO:0042732 "D-xylose metabolic process" evidence=RCA
UNIPROTKB|E2QUQ9 ATXN3 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F5H211 ATXN3 "Ataxin-3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|P54252 ATXN3 "Ataxin-3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q06AV3 ATXN3 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
MGI|MGI:1099442 Atxn3 "ataxin 3" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
RGD|621567 Atxn3 "ataxin 3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q9W689 ATXN3 "Ataxin-3" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|G3V3R7 ATXN3 "Ataxin-3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1N3H4 ATXN3 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9M391ATX3H_ARATH3, ., 4, ., 1, 9, ., 1, 20.86260.96270.6464yesno
Q8LQ36ATX3_ORYSJ3, ., 4, ., 1, 9, ., 1, 20.76060.97870.5476yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.4.190.963
3rd Layer3.4.19.120.946

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query188
pfam02099158 pfam02099, Josephin, Josephin 2e-85
>gnl|CDD|216874 pfam02099, Josephin, Josephin Back     alignment and domain information
 Score =  248 bits (635), Expect = 2e-85
 Identities = 92/161 (57%), Positives = 117/161 (72%), Gaps = 3/161 (1%)

Query: 14  VQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESH 73
            QE +LCA HC+N +LQGP+FSEFDLA++A +LD+ ER  +        +   FL E SH
Sbjct: 1   RQEGQLCAQHCLNNLLQGPYFSEFDLASIAKELDQKERNKLSVE---HWTPKSFLDEPSH 57

Query: 74  NVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVN 133
           NV   G+FSIQVL+ ALEVW L VIP + P    + +DP+LE  FIC+L +HWF IRKVN
Sbjct: 58  NVDDTGNFSIQVLETALEVWGLSVIPWHSPRVGASSLDPDLERGFICNLSEHWFTIRKVN 117

Query: 134 GEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGN 174
           G WYN +SL AAPEH+SKFYLSA+LD LK  G+S+F+VRG+
Sbjct: 118 GTWYNLNSLLAAPEHISKFYLSAFLDQLKSEGYSVFVVRGD 158


Length = 158

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 188
PF02099157 Josephin: Josephin; InterPro: IPR006155 Human gene 100.0
KOG2935 315 consensus Ataxin 3/Josephin [General function pred 100.0
KOG2934204 consensus Uncharacterized conserved protein, conta 100.0
cd02424129 Peptidase_C39E A sub-family of peptidase family C3 91.96
cd02420125 Peptidase_C39D A sub-family of peptidase family C3 87.8
cd02549141 Peptidase_C39A A sub-family of peptidase family C3 87.25
cd02419127 Peptidase_C39C A sub-family of peptidase family C3 86.2
PF03412131 Peptidase_C39: Peptidase C39 family This is family 84.52
cd02418136 Peptidase_C39B A sub-family of peptidase family C3 82.52
PF01088214 Peptidase_C12: Ubiquitin carboxyl-terminal hydrola 82.15
cd02259122 Peptidase_C39_like Peptidase family C39 mostly con 80.86
>PF02099 Josephin: Josephin; InterPro: IPR006155 Human genes containing triplet repeats can markedly expand in length, leading to neuropsychiatric disease Back     alignment and domain information
Probab=100.00  E-value=5.1e-65  Score=411.51  Aligned_cols=157  Identities=48%  Similarity=0.869  Sum_probs=141.7

Q ss_pred             eccchhhHHHHHHhhhCCCCCCHHHHHHHHhhhCHHHHHHhhhCCCCCCCCCccccccccccCCCCCchHHHHHHHHhhC
Q 045385           14 VQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQKALEVW   93 (188)
Q Consensus        14 kQ~~~lCalHaLNnLlQ~~~ft~~dL~~Ia~~Ld~~er~~~~~~~~~~~~~~~~~~~~s~n~~~~GNydinVL~~AL~~~   93 (188)
                      ||+++|||+||||||||||+||+.||++||++||+.||..|.+   ++.++.+++.++|+||+++||||||||++||++|
T Consensus         1 kQ~~~lCalHaLNnLlQ~~~ft~~dL~~Ia~~Ld~~E~~~~~~---~~~~~~~~~~~~s~n~~~~GnysinVL~~AL~~~   77 (157)
T PF02099_consen    1 KQELQLCALHALNNLLQGPYFTAVDLDEIAQELDEEERSLMAE---DSWTPLSFLFNPSRNVDGTGNYSINVLMAALQVW   77 (157)
T ss_dssp             S-STTSHHHHHHHHHCTSS-S-HHHHHHHHHHHHHHHHHHHHC---TSHHHHHHHTSTSSTCSTTSTCECHHHHHHHHCC
T ss_pred             CcHHHHHHHHHHHHHhhhhhcCHHHHHHHHHHhChhhhhhhhc---cCccchhhccccccCccccCCcCHHHHHHHHHHc
Confidence            8999999999999999999999999999999999999999987   4556777888899999999999999999999999


Q ss_pred             CceeEEcCCCCCCCCCCCcccceeeeeccCCceEEEEeeCCEEEEeeCCCCCceEcchhhHHHHHHHHHhCCeEEEEEEC
Q 045385           94 DLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRG  173 (188)
Q Consensus        94 ~~~~~~~d~~~~~~~~~~~~~~~gFI~N~~rHWfaIRki~g~wyNLDS~l~~P~~i~~~~L~~~L~~l~~~g~sIFvV~g  173 (188)
                      |++++||..++.....++|+.+.|||||.+|||||||||+|+||||||++++|+.|+++||.+||++++++||+||||+|
T Consensus        78 g~~~~~~~~~~~~~~~~~~~~~~gfI~N~~~HWf~iRki~~~wyNLDS~l~~P~~i~~~~l~~fL~~l~~~g~~ifvV~~  157 (157)
T PF02099_consen   78 GLELVPWFDKRMQEASIDPDNEFGFICNLSRHWFAIRKIGGQWYNLDSKLKEPELISDFYLSAFLQQLQSEGYSIFVVRG  157 (157)
T ss_dssp             T-EEEETTSHHHHHCC--CCCSSEEEEECTTEEEEEEEETTEEEEECTTTSS-EEE-HHHHHHHHHHHHCCTEEEEEEES
T ss_pred             CceEEEccCccchhhhcCchhceEEEeccCcceEEEEeeCCeeEeccCCCCCCcccCHHHHHHHHHHHHhCCcEEEEEeC
Confidence            99999887777788889999999999999999999999999999999999999999999999999999999999999986



Expansion of triplet repeats explains the phenomenon of anticipation, i.e. the increasing severity or earlier age of onset in successive generations in a pedigree []. A novel gene containing CAG repeats has been identified and mapped to chromosome 14q32.1, the genetic locus for Machado-Joseph disease (MJD). Normally, the gene contains 13-36 CAG repeats, but most clinically diagnosed patients and all affected members of a family with the clinical and pathological diagnosis of MJD show expansion of the repeat number, from 68-79 []. Similar abnormalities in related genes may give rise to diseases similar to MJD. MJD is a neurodegenerative disorder characterised by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, external ophtalmoplegia, facial and lingual fasciculation and bulging. The disease is autosomal dominant, with late onset of symptoms, generally after the fourth decade.; GO: 0008242 omega peptidase activity; PDB: 3O65_G 1YZB_A 2JRI_A 2DOS_A 2AGA_A.

>KOG2935 consensus Ataxin 3/Josephin [General function prediction only] Back     alignment and domain information
>KOG2934 consensus Uncharacterized conserved protein, contains Josephin domain [General function prediction only] Back     alignment and domain information
>cd02424 Peptidase_C39E A sub-family of peptidase family C39 Back     alignment and domain information
>cd02420 Peptidase_C39D A sub-family of peptidase family C39 Back     alignment and domain information
>cd02549 Peptidase_C39A A sub-family of peptidase family C39 Back     alignment and domain information
>cd02419 Peptidase_C39C A sub-family of peptidase family C39 Back     alignment and domain information
>PF03412 Peptidase_C39: Peptidase C39 family This is family C39 in the peptidase classification Back     alignment and domain information
>cd02418 Peptidase_C39B A sub-family of peptidase family C39 Back     alignment and domain information
>PF01088 Peptidase_C12: Ubiquitin carboxyl-terminal hydrolase, family 1; InterPro: IPR001578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families Back     alignment and domain information
>cd02259 Peptidase_C39_like Peptidase family C39 mostly contains bacteriocin-processing endopeptidases from bacteria Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query188
2dos_A176 Structural Basis For The Recognition Of Lys48-Linke 9e-39
2aga_A190 De-Ubiquitinating Function Of Ataxin-3: Insights Fr 9e-39
2jri_A182 Solution Structure Of The Josephin Domain Of Ataxin 1e-38
3o65_A191 Crystal Structure Of A Josephin-Ubiquitin Complex: 3e-32
>pdb|2DOS|A Chain A, Structural Basis For The Recognition Of Lys48-Linked Polyubiquitin Chain By The Josephin Domain Of Ataxin-3, A Putative Deubiquitinating Enzyme Length = 176 Back     alignment and structure

Iteration: 1

Score = 155 bits (393), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 73/167 (43%), Positives = 107/167 (64%) Query: 10 LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLT 69 ++HE QE LCA HC+N +LQG +FS +L+++A LD ER M EG V + FL Sbjct: 9 IFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRTFLQ 68 Query: 70 EESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCI 129 + S N+ G FSIQV+ AL+VW L++I N P + +IDP E +FIC+ +HWF + Sbjct: 69 QPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHWFTV 128 Query: 130 RKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFP 176 RK+ +W+N +SL PE +S YL+ +L L+ G+SIF+V+G+ P Sbjct: 129 RKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLP 175
>pdb|2AGA|A Chain A, De-Ubiquitinating Function Of Ataxin-3: Insights From The Solution Structure Of The Josephin Domain Length = 190 Back     alignment and structure
>pdb|2JRI|A Chain A, Solution Structure Of The Josephin Domain Of Ataxin-3 In Complex With Ubiquitin Molecule. Length = 182 Back     alignment and structure
>pdb|3O65|A Chain A, Crystal Structure Of A Josephin-Ubiquitin Complex: Evolutionary Restraints On Ataxin-3 Deubiquitinating Activity Length = 191 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query188
3o65_A191 Putative ataxin-3-like protein; papain-like fold, 1e-69
>3o65_A Putative ataxin-3-like protein; papain-like fold, hydrolase-protein complex; 2.70A {Homo sapiens} PDB: 2aga_A 1yzb_A 2jri_A 2dos_A Length = 191 Back     alignment and structure
 Score =  209 bits (532), Expect = 1e-69
 Identities = 72/168 (42%), Positives = 101/168 (60%)

Query: 9   MLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFL 68
            ++HE QE  LCA HC+N +LQG +FS  +LA++A  LD  ER  M EG V +     FL
Sbjct: 4   FIFHEKQEGFLCAQHCLNNLLQGEYFSPVELASIAHQLDEEERMRMAEGGVTSEEYLAFL 63

Query: 69  TEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFC 128
            + S N+   G FSIQV+  AL+ W L++I  N P  +   IDP  E +FIC+   HWF 
Sbjct: 64  QQPSENMDDTGFFSIQVISNALKFWGLEIIHFNNPEYQKLGIDPINERSFICNYKQHWFT 123

Query: 129 IRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSIFIVRGNFP 176
           IRK    W+N +SL A PE +S   L+ +L  L+   +S+F+V+G+ P
Sbjct: 124 IRKFGKHWFNLNSLLAGPELISDTCLANFLARLQQQAYSVFVVKGDLP 171


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query188
3o65_A191 Putative ataxin-3-like protein; papain-like fold, 100.0
3zua_A142 CLD, alpha-hemolysin translocation ATP-binding pro 85.29
1xd3_A230 Ubiquitin carboxyl-terminal esterase L3; enzyme-li 81.21
2wdt_A232 Ubiquitin carboxyl-terminal hydrolase L3; hydrolas 80.17
>3o65_A Putative ataxin-3-like protein; papain-like fold, hydrolase-protein complex; 2.70A {Homo sapiens} PDB: 2aga_A 1yzb_A 2jri_A 2dos_A Back     alignment and structure
Probab=100.00  E-value=8.5e-74  Score=473.52  Aligned_cols=170  Identities=42%  Similarity=0.778  Sum_probs=164.7

Q ss_pred             CceeeeccchhhHHHHHHhhhCCCCCCHHHHHHHHhhhCHHHHHHhhhCCCCCCCCCccccccccccCCCCCchHHHHHH
Q 045385            9 MLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVSLDGDFSIQVLQK   88 (188)
Q Consensus         9 ~iYhEkQ~~~lCalHaLNnLlQ~~~ft~~dL~~Ia~~Ld~~er~~~~~~~~~~~~~~~~~~~~s~n~~~~GNydinVL~~   88 (188)
                      .||||||+.+|||+|||||||||++||+.||++||++||++||..|.++|.++.+|.+++.++|+||+++||||||||++
T Consensus         4 ~IYhEkQ~~~LCalHaLNnLLQg~~Ft~~dL~~Ia~~Ld~~e~~~m~e~g~~~~d~~~~~~~ps~n~~~~GnfsInVl~~   83 (191)
T 3o65_A            4 FIFHEKQEGFLCAQHCLNNLLQGEYFSPVELASIAHQLDEEERMRMAEGGVTSEEYLAFLQQPSENMDDTGFFSIQVISN   83 (191)
T ss_dssp             TSCCCCCCTTCHHHHHHHHHHTSSCCCHHHHHHHHHHHHHHHHHHHGGGCTTSHHHHHHHTSCCSSBCTTCCBBHHHHHH
T ss_pred             cceeeccchhhHHHHHHHHHhccccCCHHHHHHHHHHcCHHHHHHHhhcCCChHHHHHHhcCCCCCcccCCCccHHHHHH
Confidence            49999999999999999999999999999999999999999999999999888888889888999999999999999999


Q ss_pred             HHhhCCceeEEcCCCCCCCCCCCcccceeeeeccCCceEEEEeeCCEEEEeeCCCCCceEcchhhHHHHHHHHHhCCeEE
Q 045385           89 ALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSKFYLSAYLDSLKGFGWSI  168 (188)
Q Consensus        89 AL~~~~~~~~~~d~~~~~~~~~~~~~~~gFI~N~~rHWfaIRki~g~wyNLDS~l~~P~~i~~~~L~~~L~~l~~~g~sI  168 (188)
                      ||++||++++|+|++++...++||..+.|||||.+|||||||||+|+||||||++++|+.|+++||.+||+||+.+||||
T Consensus        84 AL~~~glelv~~d~~~~~~~~~~p~~~~~FI~N~~~HWfaiRkigg~WyNLdS~l~~P~~i~~~yL~~fL~ql~~eg~si  163 (191)
T 3o65_A           84 ALKFWGLEIIHFNNPEYQKLGIDPINERSFICNYKQHWFTIRKFGKHWFNLNSLLAGPELISDTCLANFLARLQQQAYSV  163 (191)
T ss_dssp             HHHTTTCEEEETTCHHHHHTCCCGGGCCEEEEEESSCEEEEEEETTEEEECCTTSSSCEEECHHHHHHHHGGGGSTTEEE
T ss_pred             HHHHcCCeEEECCCchhhhhcCChhhhhhhhhccccceeeEEecCCEEEEccCCCCCCCCcChHHHHHHHHHHHHCCCEE
Confidence            99999999999999987667899999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEECCCCCC
Q 045385          169 FIVRGNFPKE  178 (188)
Q Consensus       169 FvV~g~lp~~  178 (188)
                      |||+|+||..
T Consensus       164 FvV~g~lP~~  173 (191)
T 3o65_A          164 FVVKGDLPDC  173 (191)
T ss_dssp             EEEESCCCCC
T ss_pred             EEEcCCCCCC
Confidence            9999999983



>3zua_A CLD, alpha-hemolysin translocation ATP-binding protein; C39 peptidase-like domain, ABC transporter, haemolysin, HYDR heteronuclear; NMR {Escherichia coli} Back     alignment and structure
>1xd3_A Ubiquitin carboxyl-terminal esterase L3; enzyme-ligand complex, active site crossover loop, hydrolase; HET: GVE; 1.45A {Homo sapiens} SCOP: d.3.1.6 PDB: 1uch_A* Back     alignment and structure
>2wdt_A Ubiquitin carboxyl-terminal hydrolase L3; hydrolase-protein binding complex, enzyme-ligand complex, UB isopeptidase, UCH-L superfamily, cystein proteinase, peptid hydrolase; 2.30A {Plasmodium falciparum} PDB: 2we6_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query188
d1xd3a_229 Ubiquitin carboxyl-terminal hydrolase UCH-l3 {Huma 89.02
d2etla1223 Ubiquitin carboxyl-terminal hydrolase isozyme l1 { 85.38
>d1xd3a_ d.3.1.6 (A:) Ubiquitin carboxyl-terminal hydrolase UCH-l3 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Ubiquitin carboxyl-terminal hydrolase UCH-L
domain: Ubiquitin carboxyl-terminal hydrolase UCH-l3
species: Human (Homo sapiens) [TaxId: 9606]
Probab=89.02  E-value=1.4  Score=34.64  Aligned_cols=107  Identities=17%  Similarity=0.205  Sum_probs=66.3

Q ss_pred             CCCCceeeecc-chhhHH----HHHHhhhCCCCC----CHHHHHHHHhhhCHHHHHHhhhCCCCCCCCCccccccccccC
Q 045385            6 NGGMLYHEVQE-SKLCAV----HCVNTVLQGPFF----SEFDLAALASDLDRSERQVMLEGRVPAASSGDFLTEESHNVS   76 (188)
Q Consensus         6 ~~~~iYhEkQ~-~~lCal----HaLNnLlQ~~~f----t~~dL~~Ia~~Ld~~er~~~~~~~~~~~~~~~~~~~~s~n~~   76 (188)
                      ....|||=||. .--||.    |+|-|....-..    .-..|-+.+..+++.+|...++..       +.+. .-||. 
T Consensus        79 ~~~~v~f~kQti~NACgT~Allh~l~N~~~~~~i~~gs~L~~f~~~t~~~~p~eRg~~l~~~-------~~l~-~aH~s-  149 (229)
T d1xd3a_          79 VTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLKKFLEESVSMSPEERARYLENY-------DAIR-VTHET-  149 (229)
T ss_dssp             CCTTCCCCCCCSBTCHHHHHHHHHHHTTGGGCCBCTTCHHHHHHHHHTTSCHHHHHHHHHTC-------HHHH-HHHHH-
T ss_pred             CCccceehhhhhhhhhHHHHHHHHHhcCccccccCCcHHHHHHHHHcCCCCHHHHHHHhhcC-------HHHH-HHHHH-
Confidence            34569999999 688885    555554332112    356677889999999998887631       0111 11221 


Q ss_pred             CCCCchHHHHHHHHhhCCceeEEcCCCCCCCCCCCcccceeeeeccCCceEEEEeeCCEEEEeeCCCCCceEcch
Q 045385           77 LDGDFSIQVLQKALEVWDLQVIPLNCPIAEPAQIDPELENAFICHLHDHWFCIRKVNGEWYNFDSLYAAPEHLSK  151 (188)
Q Consensus        77 ~~GNydinVL~~AL~~~~~~~~~~d~~~~~~~~~~~~~~~gFI~N~~rHWfaIRki~g~wyNLDS~l~~P~~i~~  151 (188)
                                   .-..|-      . +  .  -+++...+      -|.+|.-.++|.=|-||-.++.|+..|.
T Consensus       150 -------------~A~~g~------t-~--~--p~~~~~~~------~HFI~fV~~~G~lyELDG~k~~Pi~hG~  194 (229)
T d1xd3a_         150 -------------SAHEGQ------T-E--A--PSIDEKVD------LHFIALVHVDGHLYELDGRKPFPINHGE  194 (229)
T ss_dssp             -------------HHTCSS------S-C--C--CCTTSCCC------EEEEEEEEETTEEEEECTTSSSCEEEEE
T ss_pred             -------------hccccC------C-C--C--CCcccccc------eeEEEEEeeCCEEEEccCCCCCCcccCC
Confidence                         111111      0 0  0  01111222      4999999999999999999999999974



>d2etla1 d.3.1.6 (A:1-223) Ubiquitin carboxyl-terminal hydrolase isozyme l1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure