Citrus Sinensis ID: 027380
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 224 | ||||||
| 359492114 | 223 | PREDICTED: uncharacterized protein LOC10 | 0.991 | 0.995 | 0.897 | 1e-111 | |
| 225458376 | 228 | PREDICTED: uncharacterized protein LOC10 | 0.991 | 0.973 | 0.878 | 1e-109 | |
| 255538556 | 225 | conserved hypothetical protein [Ricinus | 0.986 | 0.982 | 0.889 | 1e-109 | |
| 224136578 | 228 | predicted protein [Populus trichocarpa] | 0.991 | 0.973 | 0.886 | 1e-109 | |
| 363807704 | 222 | uncharacterized protein LOC100806535 [Gl | 0.991 | 1.0 | 0.857 | 1e-108 | |
| 356519050 | 223 | PREDICTED: uncharacterized protein LOC10 | 0.991 | 0.995 | 0.866 | 1e-108 | |
| 224067361 | 224 | predicted protein [Populus trichocarpa] | 0.973 | 0.973 | 0.884 | 1e-108 | |
| 363808178 | 224 | uncharacterized protein LOC100787225 [Gl | 0.991 | 0.991 | 0.858 | 1e-107 | |
| 449470011 | 223 | PREDICTED: uncharacterized protein LOC10 | 0.991 | 0.995 | 0.844 | 1e-107 | |
| 363807457 | 218 | uncharacterized protein LOC100784573 [Gl | 0.968 | 0.995 | 0.854 | 1e-104 |
| >gi|359492114|ref|XP_002283430.2| PREDICTED: uncharacterized protein LOC100247103 isoform 1 [Vitis vinifera] gi|302142443|emb|CBI19646.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/225 (89%), Positives = 212/225 (94%), Gaps = 3/225 (1%)
Query: 1 MLETMESSVNGGFSQLQSYGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
MLE +ESSVNGGFSQLQS GDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG
Sbjct: 1 MLEAVESSVNGGFSQLQSCGDSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVG 60
Query: 61 LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFDNVPWNISDLASDDTQKSHRS 120
LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEF+N+ WN SD+ASDDTQKSHRS
Sbjct: 61 LGGWHWLVLTNGIEVKLQRNALSVIEAPTGNEEDDDLEFENMQWNGSDMASDDTQKSHRS 120
Query: 121 KLRTHKSTGSSHKTTSRSLSCDSHSQSKGSISTP-GSMKIDLSKLEMGALWRYWRHFNLV 179
+ RTHKS+GSSHKT SRSLSCD SQSKGSISTP GS K+DLSKLEM ALWRY RHFNLV
Sbjct: 121 RHRTHKSSGSSHKTMSRSLSCD--SQSKGSISTPRGSTKVDLSKLEMAALWRYLRHFNLV 178
Query: 180 DAIPNPSREQLIDVVQRHFMSQQMDELQVIVGFVQAAKRLKTVCK 224
D IPNPS+EQLIDVVQRHF++QQMDELQVIVGFVQAAKRLKTVCK
Sbjct: 179 DGIPNPSKEQLIDVVQRHFVTQQMDELQVIVGFVQAAKRLKTVCK 223
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225458376|ref|XP_002283437.1| PREDICTED: uncharacterized protein LOC100247103 isoform 2 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255538556|ref|XP_002510343.1| conserved hypothetical protein [Ricinus communis] gi|223551044|gb|EEF52530.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224136578|ref|XP_002326895.1| predicted protein [Populus trichocarpa] gi|222835210|gb|EEE73645.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|363807704|ref|NP_001242423.1| uncharacterized protein LOC100806535 [Glycine max] gi|255641384|gb|ACU20969.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356519050|ref|XP_003528187.1| PREDICTED: uncharacterized protein LOC100809195 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|224067361|ref|XP_002302473.1| predicted protein [Populus trichocarpa] gi|222844199|gb|EEE81746.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|363808178|ref|NP_001241972.1| uncharacterized protein LOC100787225 [Glycine max] gi|255644949|gb|ACU22974.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449470011|ref|XP_004152712.1| PREDICTED: uncharacterized protein LOC101220556 [Cucumis sativus] gi|449496053|ref|XP_004160024.1| PREDICTED: uncharacterized LOC101220556 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|363807457|ref|NP_001242646.1| uncharacterized protein LOC100784573 [Glycine max] gi|255639437|gb|ACU20013.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 224 | ||||||
| TAIR|locus:2037286 | 242 | AT1G75060 "AT1G75060" [Arabido | 0.950 | 0.880 | 0.711 | 3.9e-76 | |
| UNIPROTKB|I3LU53 | 124 | SAP30 "Uncharacterized protein | 0.294 | 0.532 | 0.342 | 8.8e-05 |
| TAIR|locus:2037286 AT1G75060 "AT1G75060" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 767 (275.1 bits), Expect = 3.9e-76, P = 3.9e-76
Identities = 155/218 (71%), Positives = 177/218 (81%)
Query: 11 GGFSQLQS-YGD-SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 68
GGFSQLQS +GD SSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV
Sbjct: 17 GGFSQLQSCFGDCSSEEELSVLPRHTKVVVTGNNRTKSVLVGLQGVVKKAVGLGGWHWLV 76
Query: 69 LTNGIEVKLQRNALSVIEAPTGNEEDDDLEFDN-VPWNI-SDLASDDTQKSHRSKLXXXX 126
LTNGIEVKLQRNALSV+E PTGNEED+DLE D+ WN SD+ ++DT K H+SK
Sbjct: 77 LTNGIEVKLQRNALSVLEHPTGNEEDNDLEVDHSTQWNHPSDMTTEDTLKPHKSKKRGHR 136
Query: 127 XXXXXXXXXXXXLSCDSHSQSKGSISTPGSMKIDLSKLEMGALWRYWRHFNLVDAIPNPS 186
+SCDSHS+ SI+ +MK+DL+KL+M AL RYWRHFNLVDA+PNP+
Sbjct: 137 SSRLSQKALYREVSCDSHSKIS-SITPRLNMKVDLTKLDMAALLRYWRHFNLVDALPNPT 195
Query: 187 REQLIDVVQRHFMSQQMDELQVIVGFVQAAKRLKTVCK 224
+EQLID++QRHFMSQQMDELQVIVGFVQAA +K C+
Sbjct: 196 KEQLIDIIQRHFMSQQMDELQVIVGFVQAATGMKKACQ 233
|
|
| UNIPROTKB|I3LU53 SAP30 "Uncharacterized protein" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 224 | |||
| pfam13867 | 53 | pfam13867, SAP30_Sin3_bdg, Sin3 binding region of | 8e-23 |
| >gnl|CDD|206038 pfam13867, SAP30_Sin3_bdg, Sin3 binding region of histone deacetylase complex subunit SAP30 | Back alignment and domain information |
|---|
Score = 86.9 bits (216), Expect = 8e-23
Identities = 30/54 (55%), Positives = 37/54 (68%), Gaps = 1/54 (1%)
Query: 164 LEMGALWRYWRHFNLVDAIPNPSREQLIDVVQRHFMSQQMDELQVIVGFVQAAK 217
L + L RY RHF LV PN S+EQL + VQ+HFMSQ +DE +VI GF+ A K
Sbjct: 1 LGVNTLRRYKRHFKLVTR-PNSSKEQLAEAVQKHFMSQPLDEQEVIAGFLYAVK 53
|
This C-terminal domain of the SAP30 proteins appears to be the binding region for Sin3. Length = 53 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 224 | |||
| PF13867 | 53 | SAP30_Sin3_bdg: Sin3 binding region of histone dea | 99.84 | |
| PRK05886 | 109 | yajC preprotein translocase subunit YajC; Validate | 95.77 | |
| PF04648 | 13 | MF_alpha: Yeast mating factor alpha hormone; Inter | 94.51 | |
| PF02037 | 35 | SAP: SAP domain; InterPro: IPR003034 The SAP (afte | 93.51 | |
| PF10281 | 38 | Ish1: Putative stress-responsive nuclear envelope | 92.25 | |
| smart00513 | 35 | SAP Putative DNA-binding (bihelical) motif predict | 88.14 |
| >PF13867 SAP30_Sin3_bdg: Sin3 binding region of histone deacetylase complex subunit SAP30; PDB: 2LD7_A | Back alignment and domain information |
|---|
Probab=99.84 E-value=3.2e-21 Score=135.39 Aligned_cols=53 Identities=42% Similarity=0.658 Sum_probs=41.0
Q ss_pred cchhHHHHHHHhcccccCCCCCCHHHHHHHHHhhhccccCChHHHHHHHHHHHh
Q 027380 164 LEMGALWRYWRHFNLVDAIPNPSREQLIDVVQRHFMSQQMDELQVIVGFVQAAK 217 (224)
Q Consensus 164 Le~~TLrRYkkhfkL~~~~P~~sKeQLv~aV~rHF~sq~VdE~eVI~~Fi~avK 217 (224)
|+++|||||++||+|+ ..|+.+|+||+.+|.+||++|+|||.+||++|||+||
T Consensus 1 L~~~tLrrY~~~~~l~-~~~~~sK~qLa~~V~kHF~s~~v~E~evI~~Fly~vk 53 (53)
T PF13867_consen 1 LQTPTLRRYKKHYKLP-ERPRSSKEQLANAVRKHFNSQPVDENEVIANFLYKVK 53 (53)
T ss_dssp S-HHHHHHHHHHTT-----SS--HHHHHHHHHHHHTT----HHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHhCCC-CCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhC
Confidence 7899999999999994 4679999999999999999999999999999999997
|
|
| >PRK05886 yajC preprotein translocase subunit YajC; Validated | Back alignment and domain information |
|---|
| >PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha] | Back alignment and domain information |
|---|
| >PF02037 SAP: SAP domain; InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis [] | Back alignment and domain information |
|---|
| >PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [] | Back alignment and domain information |
|---|
| >smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 224 | ||||
| 2ld7_A | 94 | Solution Structure Of The Msin3a Pah3-Sap30 Sid Com | 2e-04 |
| >pdb|2LD7|A Chain A, Solution Structure Of The Msin3a Pah3-Sap30 Sid Complex Length = 94 | Back alignment and structure |
|
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 224 | |||
| 2ld7_A | 94 | Histone deacetylase complex subunit SAP30; transcr | 1e-23 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 5e-04 |
| >2ld7_A Histone deacetylase complex subunit SAP30; transcription; NMR {Mus musculus} Length = 94 | Back alignment and structure |
|---|
Score = 89.7 bits (222), Expect = 1e-23
Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 145 SQSKGSISTPGSMKIDLSKLEMGALWRYWRHFNLVDAIPNPSREQLIDVVQRHFMSQQMD 204
+ + ++DL +L++ L RY RHF L P ++ QL+++V HF S ++
Sbjct: 8 DGGDSPVQDIDTPEVDLYQLQVNTLRRYKRHFKLPTR-PGLNKAQLVEIVGCHFKSIPVN 66
Query: 205 ELQVIVGFVQAAKRLKT 221
E + F+ + + K
Sbjct: 67 EKDTLTCFIYSVRNDKN 83
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 224 | |||
| 2ld7_A | 94 | Histone deacetylase complex subunit SAP30; transcr | 99.9 | |
| 2do1_A | 55 | Nuclear protein HCC-1; SAP domain, structural geno | 92.71 | |
| 1zrj_A | 50 | E1B-55KDA-associated protein 5 isoform C; SAP doma | 89.1 | |
| 2rnn_A | 114 | E3 SUMO-protein ligase SIZ1; SUMO ligase, DNA bind | 88.79 | |
| 1jjr_A | 151 | KU70, thyroid autoantigen; DNA repair protein, pro | 87.68 | |
| 1h1j_S | 51 | THO1 protein; SAP domain, DNA binding; NMR {Saccha | 87.59 |
| >2ld7_A Histone deacetylase complex subunit SAP30; transcription; NMR {Mus musculus} | Back alignment and structure |
|---|
Probab=99.90 E-value=4.1e-24 Score=165.42 Aligned_cols=68 Identities=29% Similarity=0.507 Sum_probs=64.7
Q ss_pred CCCccccccccchhHHHHHHHhcccccCCCCCCHHHHHHHHHhhhccccCChHHHHHHHHHHHhhcccc
Q 027380 154 PGSMKIDLSKLEMGALWRYWRHFNLVDAIPNPSREQLIDVVQRHFMSQQMDELQVIVGFVQAAKRLKTV 222 (224)
Q Consensus 154 ~~~p~VDL~KLe~~TLrRYkkhfkL~~~~P~~sKeQLv~aV~rHF~sq~VdE~eVI~~Fi~avKr~Kt~ 222 (224)
...|+|||++|+++|||||++||+| .+.|+.+|+||+.+|.+||++|+|+|.|||++|||+||+.++-
T Consensus 17 ~~~p~vdf~kL~~~tLrrY~r~y~L-~~~~~~sK~qLa~aV~kHF~s~~VdE~evI~~Fly~VK~~~~k 84 (94)
T 2ld7_A 17 IDTPEVDLYQLQVNTLRRYKRHFKL-PTRPGLNKAQLVEIVGCHFKSIPVNEKDTLTCFIYSVRNDKNK 84 (94)
T ss_dssp SCCCCCCCSSSCHHHHHHHHHHTTC-CCCSSCCHHHHHHHHHHHHTTCCCCHHHHHHHHHHHHHHCSSC
T ss_pred CCCCCcCHHHCCHHHHHHHHHHhCC-CCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhcCcc
Confidence 3478899999999999999999999 8999999999999999999999999999999999999999864
|
| >2do1_A Nuclear protein HCC-1; SAP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: a.140.2.1 | Back alignment and structure |
|---|
| >1zrj_A E1B-55KDA-associated protein 5 isoform C; SAP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: a.140.2.1 | Back alignment and structure |
|---|
| >2rnn_A E3 SUMO-protein ligase SIZ1; SUMO ligase, DNA binding, sumoylation, metal-binding, nucLeu phosphoprotein, UBL conjugation pathway; NMR {Saccharomyces cerevisiae} | Back alignment and structure |
|---|
| >1h1j_S THO1 protein; SAP domain, DNA binding; NMR {Saccharomyces cerevisiae} SCOP: a.140.2.1 PDB: 2wqg_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 224 | |||
| d1jeqa1 | 51 | DNA binding C-terminal domain of ku70 {Human (Homo | 95.8 | |
| d1h1js_ | 44 | S/mar DNA-binding protein Tho1 {Baker's yeast (Sac | 84.19 |
| >d1jeqa1 a.140.2.1 (A:559-609) DNA binding C-terminal domain of ku70 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: All alpha proteins fold: LEM/SAP HeH motif superfamily: SAP domain family: SAP domain domain: DNA binding C-terminal domain of ku70 species: Human (Homo sapiens) [TaxId: 9606]
Probab=95.80 E-value=0.0055 Score=40.81 Aligned_cols=39 Identities=23% Similarity=0.409 Sum_probs=34.5
Q ss_pred cccccccchhHHHHHHHhcccccCCCCCCHHHHHHHHHhhhc
Q 027380 158 KIDLSKLEMGALWRYWRHFNLVDAIPNPSREQLIDVVQRHFM 199 (224)
Q Consensus 158 ~VDL~KLe~~TLrRYkkhfkL~~~~P~~sKeQLv~aV~rHF~ 199 (224)
+-+|.+|.++-||-|.|.++|+ -...|++|++.|..||.
T Consensus 12 ~~~L~kltV~~LK~~lk~~gL~---~sGkKa~Li~Ri~~~l~ 50 (51)
T d1jeqa1 12 KGTLGKFTVPMLKEACRAYGLK---SGLKKQELLEALTKHFQ 50 (51)
T ss_dssp HTCGGGCCHHHHHHHHHHTTCC---CCSSHHHHHHHHHHHHT
T ss_pred cCcHhhCcHHHHHHHHHHcCCC---CCCcHHHHHHHHHHHhh
Confidence 3589999999999999999993 34699999999999995
|
| >d1h1js_ a.140.2.1 (S:) S/mar DNA-binding protein Tho1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|