Citrus Sinensis ID: 032431
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 141 | ||||||
| 326415786 | 193 | AGAMOUS-like E 24-like protein 1 [Aquile | 0.765 | 0.559 | 0.648 | 1e-31 | |
| 224141161 | 215 | predicted protein [Populus trichocarpa] | 0.751 | 0.493 | 0.622 | 9e-30 | |
| 255545954 | 161 | mads box protein, putative [Ricinus comm | 0.815 | 0.714 | 0.606 | 2e-29 | |
| 449451385 | 228 | PREDICTED: MADS-box protein SVP-like [Cu | 0.765 | 0.473 | 0.633 | 5e-29 | |
| 224141159 | 169 | predicted protein [Populus trichocarpa] | 0.751 | 0.627 | 0.613 | 5e-29 | |
| 375155240 | 215 | SVP4 [Actinidia chinensis] | 0.921 | 0.604 | 0.555 | 5e-29 | |
| 118482054 | 215 | unknown [Populus trichocarpa] | 0.751 | 0.493 | 0.613 | 8e-29 | |
| 316890774 | 227 | MADS-box protein STMADS11 subfamily [Cof | 0.794 | 0.493 | 0.589 | 8e-29 | |
| 83999600 | 229 | incomposita protein [Antirrhinum majus] | 0.893 | 0.550 | 0.559 | 2e-28 | |
| 255563302 | 184 | MADS-box transcription factor, putative | 0.751 | 0.576 | 0.638 | 2e-28 |
| >gi|326415786|gb|ADZ72840.1| AGAMOUS-like E 24-like protein 1 [Aquilegia formosa] | Back alignment and taxonomy information |
|---|
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 70/108 (64%), Positives = 92/108 (85%)
Query: 1 MKQVIDRHNLHSQNLHKFDQPSLQLQLESSTYAILSKEMADRTRELRQMKGEELQELNME 60
MK++++RHNLHS+NLHK DQPSL+LQLE+ YA LSKE+A+R+R+LR M+GEELQ LN+E
Sbjct: 31 MKEILERHNLHSKNLHKLDQPSLELQLENGNYARLSKEVAERSRQLRNMRGEELQGLNIE 90
Query: 61 ELMRLEKSLEGGLSRVVQTKGERLLNEIDALRRKEAQLTEENLRLKQR 108
EL +LEKSLE GLSRV++TK + ++NEI L+ K A+L EEN RLKQ+
Sbjct: 91 ELQQLEKSLETGLSRVLETKSDWIMNEISTLQAKGAKLMEENERLKQK 138
|
Source: Aquilegia formosa Species: Aquilegia formosa Genus: Aquilegia Family: Ranunculaceae Order: Ranunculales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224141161|ref|XP_002323943.1| predicted protein [Populus trichocarpa] gi|222866945|gb|EEF04076.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255545954|ref|XP_002514037.1| mads box protein, putative [Ricinus communis] gi|223547123|gb|EEF48620.1| mads box protein, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|449451385|ref|XP_004143442.1| PREDICTED: MADS-box protein SVP-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|224141159|ref|XP_002323942.1| predicted protein [Populus trichocarpa] gi|222866944|gb|EEF04075.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|375155240|gb|AFA37970.1| SVP4 [Actinidia chinensis] | Back alignment and taxonomy information |
|---|
| >gi|118482054|gb|ABK92958.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|316890774|gb|ADU56833.1| MADS-box protein STMADS11 subfamily [Coffea arabica] | Back alignment and taxonomy information |
|---|
| >gi|83999600|emb|CAG27846.1| incomposita protein [Antirrhinum majus] | Back alignment and taxonomy information |
|---|
| >gi|255563302|ref|XP_002522654.1| MADS-box transcription factor, putative [Ricinus communis] gi|223538130|gb|EEF39741.1| MADS-box transcription factor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 141 | ||||||
| TAIR|locus:2041233 | 240 | SVP "AT2G22540" [Arabidopsis t | 0.773 | 0.454 | 0.486 | 2.1e-22 | |
| UNIPROTKB|Q5K4R0 | 246 | MADS47 "MADS-box transcription | 0.950 | 0.544 | 0.401 | 1.5e-19 | |
| UNIPROTKB|Q9XJ66 | 228 | MADS22 "MADS-box transcription | 0.765 | 0.473 | 0.416 | 8.3e-19 | |
| TAIR|locus:2121924 | 220 | AGL24 "AT4G24540" [Arabidopsis | 0.851 | 0.545 | 0.347 | 2.9e-16 | |
| TAIR|locus:2121070 | 228 | AGL21 "AT4G37940" [Arabidopsis | 0.921 | 0.570 | 0.235 | 8.7e-05 | |
| TAIR|locus:2082618 | 240 | AGL16 "AGAMOUS-like 16" [Arabi | 0.978 | 0.575 | 0.257 | 9.8e-05 |
| TAIR|locus:2041233 SVP "AT2G22540" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 260 (96.6 bits), Expect = 2.1e-22, P = 2.1e-22
Identities = 54/111 (48%), Positives = 79/111 (71%)
Query: 1 MKQVIDRHNLHSQNLHKFDQPSLQLQL-ESSTYAILSKEMADRTRELRQMKGXXXXXXXX 59
MK+V++RHNL S+NL K DQPSL+LQL E+S +A +SKE+AD++ LRQM+G
Sbjct: 62 MKEVLERHNLQSKNLEKLDQPSLELQLVENSDHARMSKEIADKSHRLRQMRGEELQGLDI 121
Query: 60 XXXXXXXKSLEGGLSRVVQTKGERLLNEIDALRRKEAQLTEENLRLKQRQG 110
K+LE GL+RV++TK +++++EI L++K QL +EN RL+Q QG
Sbjct: 122 EELQQLEKALETGLTRVIETKSDKIMSEISELQKKGMQLMDENKRLRQ-QG 171
|
|
| UNIPROTKB|Q5K4R0 MADS47 "MADS-box transcription factor 47" [Oryza sativa Japonica Group (taxid:39947)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9XJ66 MADS22 "MADS-box transcription factor 22" [Oryza sativa Japonica Group (taxid:39947)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2121924 AGL24 "AT4G24540" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2121070 AGL21 "AT4G37940" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2082618 AGL16 "AGAMOUS-like 16" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 141 | |||
| pfam01486 | 100 | pfam01486, K-box, K-box region | 2e-11 |
| >gnl|CDD|216525 pfam01486, K-box, K-box region | Back alignment and domain information |
|---|
Score = 56.4 bits (137), Expect = 2e-11
Identities = 30/65 (46%), Positives = 45/65 (69%)
Query: 44 RELRQMKGEELQELNMEELMRLEKSLEGGLSRVVQTKGERLLNEIDALRRKEAQLTEENL 103
RE R + GE+L L+++EL +LE+ LE GL + K E LLN+I+ L++KE +L EEN
Sbjct: 33 REQRHLLGEDLGSLSLKELQQLEQQLENGLKHIRSRKNELLLNQIEELQKKERELQEENR 92
Query: 104 RLKQR 108
L+Q+
Sbjct: 93 NLRQK 97
|
The K-box region is commonly found associated with SRF-type transcription factors see pfam00319. The K-box is a possible coiled-coil structure. Possible role in multimer formation. Length = 100 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 141 | |||
| PF01486 | 100 | K-box: K-box region; InterPro: IPR002487 MADS gene | 99.92 | |
| KOG0014 | 195 | consensus MADS box transcription factor [Transcrip | 97.24 | |
| PF06005 | 72 | DUF904: Protein of unknown function (DUF904); Inte | 95.32 | |
| COG3074 | 79 | Uncharacterized protein conserved in bacteria [Fun | 91.81 | |
| PRK15422 | 79 | septal ring assembly protein ZapB; Provisional | 90.17 | |
| PF06156 | 107 | DUF972: Protein of unknown function (DUF972); Inte | 89.1 | |
| COG2433 | 652 | Uncharacterized conserved protein [Function unknow | 88.86 | |
| PRK13169 | 110 | DNA replication intiation control protein YabA; Re | 86.66 | |
| PF06698 | 59 | DUF1192: Protein of unknown function (DUF1192); In | 85.15 | |
| cd07429 | 108 | Cby_like Chibby, a nuclear inhibitor of Wnt/beta-c | 83.05 |
| >PF01486 K-box: K-box region; InterPro: IPR002487 MADS genes in plants encode key developmental regulators of vegetative and reproductive development | Back alignment and domain information |
|---|
Probab=99.92 E-value=9.3e-25 Score=151.96 Aligned_cols=90 Identities=37% Similarity=0.517 Sum_probs=86.3
Q ss_pred hhHHhhhHHHHHHHHHHHHHHHHHhhcccccCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 032431 22 SLQLQLESSTYAILSKEMADRTRELRQMKGEELQELNMEELMRLEKSLEGGLSRVVQTKGERLLNEIDALRRKEAQLTEE 101 (141)
Q Consensus 22 ~~~~~~~~~e~~~Lk~e~e~L~~~~R~l~GedL~~Lsl~EL~~LE~qLe~al~~IR~rK~qll~~~i~~Lk~Ke~~L~ee 101 (141)
....+.|..++++|+.+++.|+..+|||+|+||++||++||++||++|+.||.+||+||+++|+++|+.|++|++.+.++
T Consensus 11 ~~~~e~~~~e~~~L~~~~~~L~~~~R~~~GedL~~Ls~~eL~~LE~~Le~aL~~VR~rK~~~l~~~i~~l~~ke~~l~~e 90 (100)
T PF01486_consen 11 DSQHEELQQEIAKLRKENESLQKELRHLMGEDLESLSLKELQQLEQQLESALKRVRSRKDQLLMEQIEELKKKERELEEE 90 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45578899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHh
Q 032431 102 NLRLKQRQGL 111 (141)
Q Consensus 102 N~~L~~~~~~ 111 (141)
|..|+.++.+
T Consensus 91 n~~L~~~~~e 100 (100)
T PF01486_consen 91 NNQLRQKIEE 100 (100)
T ss_pred HHHHHHHhcC
Confidence 9999999864
|
The majority of the plant MADS proteins share a stereotypical MIKC structure. It comprises (from N- to C-terminal) an N-terminal domain, which is, however, present only in a minority of proteins; a MADS domain (see PDOC00302 from PROSITEDOC, IPR002100 from INTERPRO), which is the major determinant of DNA-binding but which also performs dimerisation and accessory factor binding functions; a weakly conserved intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNA-binding dimers; a keratin-like (K-box) domain, which promotes protein dimerisation; and a C-terminal (C) domain, which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes. The 80-amino acid K-box domain was originally identified as a region with low but significant similarity to a region of keratin, which is part of the coiled-coil sequence constituting the central rod-shaped domain of keratin [, , ]. The K-box protein-protein interaction domain which mediates heterodimerization of MIKC-type MADS proteins contains several heptad repeats in which the first and the fourth positions are occupied by hydrophobic amino acids suggesting that the K-box domain forms three amphipathic alpha-helices referred to as K1, K2, and K3 [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus |
| >KOG0014 consensus MADS box transcription factor [Transcription] | Back alignment and domain information |
|---|
| >PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation | Back alignment and domain information |
|---|
| >COG3074 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >PRK15422 septal ring assembly protein ZapB; Provisional | Back alignment and domain information |
|---|
| >PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication | Back alignment and domain information |
|---|
| >COG2433 Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
| >PRK13169 DNA replication intiation control protein YabA; Reviewed | Back alignment and domain information |
|---|
| >PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length | Back alignment and domain information |
|---|
| >cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 141 | |||
| 2jee_A | 81 | YIIU; FTSZ, septum, coiled-coil, cell division, ce | 92.49 | |
| 2dgc_A | 63 | Protein (GCN4); basic domain, leucine zipper, DNA | 86.64 | |
| 2wt7_A | 63 | Proto-oncogene protein C-FOS; transcription, trans | 85.75 | |
| 1dh3_A | 55 | Transcription factor CREB; protein-DNA complex, tr | 85.25 | |
| 1t2k_D | 61 | Cyclic-AMP-dependent transcription factor ATF-2; p | 84.05 | |
| 1jnm_A | 62 | Proto-oncogene C-JUN; BZIP, protein-DNA complex, t | 80.74 |
| >2jee_A YIIU; FTSZ, septum, coiled-coil, cell division, cell cycle, hypothetical protein; 2.8A {Escherichia coli} | Back alignment and structure |
|---|
Probab=92.49 E-value=1.4 Score=28.62 Aligned_cols=43 Identities=23% Similarity=0.420 Sum_probs=33.6
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 032431 57 LNMEELMRLEKSLEGGLSRVVQTKGERLLNEIDALRRKEAQLTEENLR 104 (141)
Q Consensus 57 Lsl~EL~~LE~qLe~al~~IR~rK~qll~~~i~~Lk~Ke~~L~eeN~~ 104 (141)
+|++=|.+||..++.|+..| .++..+|+.|+.|-..|.++|..
T Consensus 3 MS~ElleqLE~KIq~avdtI-----~lLqmEieELKekN~~L~~e~~e 45 (81)
T 2jee_A 3 MSLEVFEKLEAKVQQAIDTI-----TLLQMEIEELKEKNNSLSQEVQN 45 (81)
T ss_dssp CCHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH
Confidence 78899999999999999986 45566777787777665555555
|
| >2dgc_A Protein (GCN4); basic domain, leucine zipper, DNA binding, eukaryotic regulatory protein, transcription/DNA complex; HET: DNA; 2.20A {Saccharomyces cerevisiae} SCOP: h.1.3.1 PDB: 1dgc_A* 1ld4_E 1ysa_C* 3p8m_D | Back alignment and structure |
|---|
| >2wt7_A Proto-oncogene protein C-FOS; transcription, transcription regulation, nucleus, activator, repressor, DNA-binding, phosphoprotein, differentiation; 2.30A {Mus musculus} PDB: 1fos_E* 1a02_F* 1s9k_D | Back alignment and structure |
|---|
| >1dh3_A Transcription factor CREB; protein-DNA complex, transcription/DNA complex; HET: DNA; 3.00A {Mus musculus} SCOP: h.1.3.1 | Back alignment and structure |
|---|
| >1t2k_D Cyclic-AMP-dependent transcription factor ATF-2; protein DNA complex, transcription/DNA complex; 3.00A {Homo sapiens} SCOP: h.1.3.1 | Back alignment and structure |
|---|
| >1jnm_A Proto-oncogene C-JUN; BZIP, protein-DNA complex, transcription/DNA complex; 2.20A {Homo sapiens} SCOP: h.1.3.1 PDB: 1fos_F 2h7h_A 1t2k_C 1a02_J* 1s9k_E 1jun_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00