Citrus Sinensis ID: 042608
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 192 | ||||||
| 255541878 | 236 | hypothetical protein RCOM_1618730 [Ricin | 0.958 | 0.779 | 0.599 | 2e-54 | |
| 255641569 | 225 | unknown [Glycine max] | 0.864 | 0.737 | 0.603 | 5e-51 | |
| 357477723 | 204 | Ethylene-responsive transcription factor | 0.781 | 0.735 | 0.583 | 5e-42 | |
| 449441127 | 199 | PREDICTED: ethylene-responsive transcrip | 0.937 | 0.904 | 0.497 | 5e-35 | |
| 357477623 | 220 | Ethylene-responsive transcription factor | 0.473 | 0.413 | 0.673 | 7e-30 | |
| 388499190 | 335 | unknown [Medicago truncatula] | 0.317 | 0.182 | 0.786 | 9e-21 | |
| 356506302 | 341 | PREDICTED: uncharacterized protein LOC10 | 0.390 | 0.219 | 0.636 | 2e-20 | |
| 357502029 | 463 | Ethylene-responsive transcription factor | 0.322 | 0.133 | 0.725 | 2e-20 | |
| 297742491 | 748 | unnamed protein product [Vitis vinifera] | 0.338 | 0.086 | 0.723 | 2e-20 | |
| 147781798 | 345 | hypothetical protein VITISV_011417 [Viti | 0.296 | 0.165 | 0.789 | 3e-20 |
| >gi|255541878|ref|XP_002512003.1| hypothetical protein RCOM_1618730 [Ricinus communis] gi|223549183|gb|EEF50672.1| hypothetical protein RCOM_1618730 [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 217 bits (553), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/207 (59%), Positives = 145/207 (70%), Gaps = 23/207 (11%)
Query: 2 LQTNNNNLHENG--------------NRSKSRFLGVRQRPSGRWVAEIKESSQKLRLWLG 47
LQTN+N + + G N ++LGVRQRPSGRWVAEIK++SQKLRLWLG
Sbjct: 23 LQTNDNTIEKGGKKMPMKQAKRGIIVNNKGRKYLGVRQRPSGRWVAEIKDASQKLRLWLG 82
Query: 48 TFDKAEEAAMAYDIAARLLRGRNAKTNFI--HHG-IINFHEEINYSLLGKNPRLHQLLQQ 104
TFD AEEAA++YD AARLLRGRNAKTNF HHG ++N H+EI LLGKNPRLHQLLQ
Sbjct: 83 TFDTAEEAALSYDSAARLLRGRNAKTNFPNDHHGNMVNTHQEI-CRLLGKNPRLHQLLQH 141
Query: 105 AAVTKSHAISSSSK---FLNNNMANRDDESINFDTLVEETIFCSSTSTADCSVKSHGNFD 161
AV K+H S S+ +N N N+ D S NFD+LVE+TI CSS+S D G+
Sbjct: 142 -AVLKNHGKSLLSRRIPLVNQNERNQVDSSFNFDSLVEDTIICSSSSADDHDHNQDGSDK 200
Query: 162 -NGLSFGSSKVYSSVVVAPSFSASLCQ 187
GLSFGS KVYSSV+VAPSFSASLCQ
Sbjct: 201 FCGLSFGSCKVYSSVIVAPSFSASLCQ 227
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255641569|gb|ACU21058.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357477723|ref|XP_003609147.1| Ethylene-responsive transcription factor [Medicago truncatula] gi|355510202|gb|AES91344.1| Ethylene-responsive transcription factor [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449441127|ref|XP_004138335.1| PREDICTED: ethylene-responsive transcription factor ERF086-like [Cucumis sativus] gi|449477490|ref|XP_004155038.1| PREDICTED: ethylene-responsive transcription factor ERF086-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|357477623|ref|XP_003609097.1| Ethylene-responsive transcription factor, partial [Medicago truncatula] gi|355510152|gb|AES91294.1| Ethylene-responsive transcription factor, partial [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|388499190|gb|AFK37661.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|356506302|ref|XP_003521924.1| PREDICTED: uncharacterized protein LOC100810022 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|357502029|ref|XP_003621303.1| Ethylene-responsive transcription factor [Medicago truncatula] gi|124360260|gb|ABN08273.1| Pathogenesis-related transcriptional factor and ERF [Medicago truncatula] gi|355496318|gb|AES77521.1| Ethylene-responsive transcription factor [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|297742491|emb|CBI34640.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|147781798|emb|CAN65440.1| hypothetical protein VITISV_011417 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 192 | ||||||
| TAIR|locus:2183174 | 253 | RAP2.11 "related to AP2 11" [A | 0.479 | 0.363 | 0.432 | 9.8e-16 | |
| TAIR|locus:2182993 | 348 | PUCHI [Arabidopsis thaliana (t | 0.479 | 0.264 | 0.378 | 1.3e-13 | |
| TAIR|locus:2008021 | 225 | ERF3 "ethylene responsive elem | 0.333 | 0.284 | 0.484 | 4.5e-11 | |
| TAIR|locus:2159068 | 211 | LEP "LEAFY PETIOLE" [Arabidops | 0.385 | 0.350 | 0.391 | 7.4e-11 | |
| TAIR|locus:2009655 | 185 | ERF8 "ethylene response factor | 0.359 | 0.372 | 0.442 | 1.2e-10 | |
| TAIR|locus:2087065 | 222 | ERF4 "ethylene responsive elem | 0.348 | 0.301 | 0.434 | 2e-10 | |
| TAIR|locus:2198978 | 256 | AT1G80580 [Arabidopsis thalian | 0.296 | 0.222 | 0.491 | 4.3e-10 | |
| TAIR|locus:2092339 | 244 | ERF7 "ethylene response factor | 0.328 | 0.258 | 0.460 | 4.5e-10 | |
| TAIR|locus:2167659 | 200 | ERF9 "erf domain protein 9" [A | 0.317 | 0.305 | 0.459 | 5.2e-10 | |
| TAIR|locus:2170101 | 220 | ERF110 "ethylene response fact | 0.338 | 0.295 | 0.415 | 6.6e-10 |
| TAIR|locus:2183174 RAP2.11 "related to AP2 11" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 197 (74.4 bits), Expect = 9.8e-16, P = 9.8e-16
Identities = 42/97 (43%), Positives = 56/97 (57%)
Query: 4 TNNNNLHENGNRSKSRFLGVRQRPSGRWVAEIKESSQKLRLWLGTFDKXXXXXXXXXXXX 63
T E +K++F+GVRQRPSG+WVAEIK+++QK+R+WLGTF+
Sbjct: 6 TPKQKTKEKSKGNKTKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFETAEEAARAYDEAA 65
Query: 64 RLLRGRNAKTNFIHHGIINFHEEINYSLLGKNPRLHQ 100
LLRG N +TNF +H F SL +N LHQ
Sbjct: 66 CLLRGSNTRTNFANH----FPNNSQLSLKIRN-LLHQ 97
|
|
| TAIR|locus:2182993 PUCHI [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2008021 ERF3 "ethylene responsive element binding factor 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2159068 LEP "LEAFY PETIOLE" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2009655 ERF8 "ethylene response factor 8" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2087065 ERF4 "ethylene responsive element binding factor 4" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2198978 AT1G80580 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2092339 ERF7 "ethylene response factor 7" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2167659 ERF9 "erf domain protein 9" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2170101 ERF110 "ethylene response factor 110" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 192 | |||
| smart00380 | 64 | smart00380, AP2, DNA-binding domain in plant prote | 6e-32 | |
| cd00018 | 61 | cd00018, AP2, DNA-binding domain found in transcri | 2e-30 | |
| pfam00847 | 53 | pfam00847, AP2, AP2 domain | 2e-11 |
| >gnl|CDD|197689 smart00380, AP2, DNA-binding domain in plant proteins such as APETALA2 and EREBPs | Back alignment and domain information |
|---|
Score = 109 bits (275), Expect = 6e-32
Identities = 36/57 (63%), Positives = 45/57 (78%)
Query: 19 RFLGVRQRPSGRWVAEIKESSQKLRLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNF 75
++ GVRQRP G+WVAEI++ S+ R+WLGTFD AEEAA AYD AA RGR+A+ NF
Sbjct: 1 KYRGVRQRPWGKWVAEIRDPSKGKRVWLGTFDTAEEAARAYDRAAFKFRGRSARLNF 57
|
Length = 64 |
| >gnl|CDD|237985 cd00018, AP2, DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) | Back alignment and domain information |
|---|
| >gnl|CDD|216148 pfam00847, AP2, AP2 domain | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 192 | |||
| cd00018 | 61 | AP2 DNA-binding domain found in transcription regu | 99.86 | |
| smart00380 | 64 | AP2 DNA-binding domain in plant proteins such as A | 99.85 | |
| PHA00280 | 121 | putative NHN endonuclease | 99.61 | |
| PF00847 | 56 | AP2: AP2 domain; InterPro: IPR001471 Pathogenesis- | 99.25 | |
| PF14657 | 46 | Integrase_AP2: AP2-like DNA-binding integrase doma | 87.11 |
| >cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) | Back alignment and domain information |
|---|
Probab=99.86 E-value=1.3e-21 Score=135.27 Aligned_cols=61 Identities=61% Similarity=0.933 Sum_probs=57.5
Q ss_pred CceeEEEECCCCcEEEEEEeCCCCcEEeeccccCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 042608 18 SRFLGVRQRPSGRWVAEIKESSQKLRLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNFIHH 78 (192)
Q Consensus 18 S~yrGV~~r~~GkW~A~I~~~~~kkri~LGtFdt~EEAA~AYd~Aa~~~~G~~a~~NFp~~ 78 (192)
|+|+||+++++|||+|+|+++..++++|||+|+|+|||+.|||.++++++|.++.+|||.+
T Consensus 1 s~~~GV~~~~~gkw~A~I~~~~~gk~~~lG~f~t~eeAa~Ayd~a~~~~~g~~a~~Nf~~~ 61 (61)
T cd00018 1 SKYRGVRQRPWGKWVAEIRDPSGGRRIWLGTFDTAEEAARAYDRAALKLRGSSAVLNFPDS 61 (61)
T ss_pred CCccCEEECCCCcEEEEEEeCCCCceEccCCCCCHHHHHHHHHHHHHHhcCCccccCCCCC
Confidence 6899999999999999999876688999999999999999999999999999999999953
|
In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies. |
| >smart00380 AP2 DNA-binding domain in plant proteins such as APETALA2 and EREBPs | Back alignment and domain information |
|---|
| >PHA00280 putative NHN endonuclease | Back alignment and domain information |
|---|
| >PF00847 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-related genes transcriptional activator binds to the GCC-box pathogenesis-related promoter element and activates the plant's defence genes | Back alignment and domain information |
|---|
| >PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 192 | ||||
| 1gcc_A | 63 | Solution Nmr Structure Of The Complex Of Gcc-Box Bi | 1e-04 | ||
| 2gcc_A | 70 | Solution Structure Of The Gcc-Box Binding Domain, N | 6e-04 |
| >pdb|1GCC|A Chain A, Solution Nmr Structure Of The Complex Of Gcc-Box Binding Domain Of Aterf1 And Gcc-Box Dna, Minimized Average Structure Length = 63 | Back alignment and structure |
|
| >pdb|2GCC|A Chain A, Solution Structure Of The Gcc-Box Binding Domain, Nmr, Minimized Mean Structure Length = 70 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 192 | |||
| 1gcc_A | 63 | Ethylene responsive element binding factor 1; tran | 4e-30 |
| >1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Length = 63 | Back alignment and structure |
|---|
Score = 104 bits (261), Expect = 4e-30
Identities = 30/58 (51%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
Query: 19 RFLGVRQRPSGRWVAEIKESSQK-LRLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNF 75
+ GVRQRP G++ AEI++ ++ R+WLGTF+ AE+AA+AYD AA +RG A NF
Sbjct: 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 59
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 192 | |||
| 1gcc_A | 63 | Ethylene responsive element binding factor 1; tran | 99.91 | |
| 3igm_B | 77 | PF14_0633 protein; AP2 domain, specific transcript | 91.02 | |
| 3jtz_A | 88 | Integrase; four stranded beta-sheet, DNA binding p | 85.82 | |
| 1u3e_M | 174 | HNH homing endonuclease; HNH catalytic motif, heli | 82.45 |
| >1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A | Back alignment and structure |
|---|
Probab=99.91 E-value=5.3e-25 Score=153.97 Aligned_cols=60 Identities=50% Similarity=0.838 Sum_probs=56.6
Q ss_pred ceeEEEECCCCcEEEEEEeCCC-CcEEeeccccCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 042608 19 RFLGVRQRPSGRWVAEIKESSQ-KLRLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNFIHH 78 (192)
Q Consensus 19 ~yrGV~~r~~GkW~A~I~~~~~-kkri~LGtFdt~EEAA~AYd~Aa~~~~G~~a~~NFp~~ 78 (192)
+||||+++++|||+|+|+++.+ ++++|||+|+|+||||+|||.|+++++|.++.+|||.+
T Consensus 2 ~yrGV~~r~~gkw~A~I~~~~~~g~r~~LGtf~T~eeAA~AyD~Aa~~~~G~~a~~NFp~~ 62 (63)
T 1gcc_A 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR 62 (63)
T ss_dssp CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred CcccEeeCCCCcEEEEEccccCCCeEEEeeeCCCHHHHHHHHHHHHHHhcCcccccCCCCc
Confidence 6999999899999999999874 68999999999999999999999999999999999964
|
| >3igm_B PF14_0633 protein; AP2 domain, specific transcription FA protein-DNA complex, transcription-DNA complex; 2.20A {Plasmodium falciparum} | Back alignment and structure |
|---|
| >3jtz_A Integrase; four stranded beta-sheet, DNA binding protein; 1.30A {Yersinia pestis} PDB: 3rmp_A | Back alignment and structure |
|---|
| >1u3e_M HNH homing endonuclease; HNH catalytic motif, helix-turn-helix DNA binding domain, PR complex, DNA binding protein-DNA complex; 2.92A {Bacillus phage SPO1} SCOP: d.4.1.3 d.285.1.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 192 | ||||
| d1gcca_ | 63 | d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cr | 1e-26 |
| >d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 63 | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: DNA-binding domain superfamily: DNA-binding domain family: GCC-box binding domain domain: GCC-box binding domain species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Score = 94.4 bits (235), Expect = 1e-26
Identities = 30/58 (51%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
Query: 19 RFLGVRQRPSGRWVAEIKESSQKL-RLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNF 75
+ GVRQRP G++ AEI++ ++ R+WLGTF+ AE+AA+AYD AA +RG A NF
Sbjct: 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 59
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 192 | |||
| d1gcca_ | 63 | GCC-box binding domain {Mouse-ear cress (Arabidops | 99.91 |
| >d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: DNA-binding domain superfamily: DNA-binding domain family: GCC-box binding domain domain: GCC-box binding domain species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.91 E-value=2.7e-25 Score=154.23 Aligned_cols=60 Identities=50% Similarity=0.834 Sum_probs=55.8
Q ss_pred ceeEEEECCCCcEEEEEEeCC-CCcEEeeccccCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 042608 19 RFLGVRQRPSGRWVAEIKESS-QKLRLWLGTFDKAEEAAMAYDIAARLLRGRNAKTNFIHH 78 (192)
Q Consensus 19 ~yrGV~~r~~GkW~A~I~~~~-~kkri~LGtFdt~EEAA~AYd~Aa~~~~G~~a~~NFp~~ 78 (192)
+||||+++++|||+|+|+++. +++++|||+|+|+||||+|||.|+++++|.++.+|||+.
T Consensus 2 ~yrGVr~r~~gkw~A~Ir~~~~~~~r~~LGtf~t~eeAArAYD~aa~~~~G~~a~~NFP~~ 62 (63)
T d1gcca_ 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR 62 (63)
T ss_dssp CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred CcceEeECCCCCEEEEEecCCCCCcEeccccccCHHHHHHHHHHHHHHhcCCCcccCCCcc
Confidence 599999999999999999875 558999999999999999999999999999999999954
|