Citrus Sinensis ID: 041015
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 211 | ||||||
| 255550908 | 270 | Transcriptional factor TINY, putative [R | 0.587 | 0.459 | 0.669 | 3e-39 | |
| 224138644 | 221 | AP2/ERF domain-containing transcription | 0.535 | 0.511 | 0.700 | 1e-38 | |
| 255549882 | 241 | Transcriptional factor TINY, putative [R | 0.521 | 0.456 | 0.690 | 4e-38 | |
| 356504603 | 287 | PREDICTED: ethylene-responsive transcrip | 0.549 | 0.404 | 0.655 | 1e-37 | |
| 255541118 | 276 | Transcriptional factor TINY, putative [R | 0.824 | 0.630 | 0.550 | 1e-37 | |
| 255647266 | 287 | unknown [Glycine max] | 0.549 | 0.404 | 0.655 | 2e-37 | |
| 297824531 | 290 | hypothetical protein ARALYDRAFT_483623 [ | 0.563 | 0.410 | 0.644 | 2e-37 | |
| 297811277 | 236 | hypothetical protein ARALYDRAFT_487996 [ | 0.549 | 0.491 | 0.622 | 3e-37 | |
| 21592664 | 295 | putative AP2 domain transcription factor | 0.431 | 0.308 | 0.791 | 7e-37 | |
| 15225381 | 295 | ethylene-responsive transcription factor | 0.431 | 0.308 | 0.791 | 7e-37 |
| >gi|255550908|ref|XP_002516502.1| Transcriptional factor TINY, putative [Ricinus communis] gi|223544322|gb|EEF45843.1| Transcriptional factor TINY, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 167 bits (422), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/127 (66%), Positives = 100/127 (78%), Gaps = 3/127 (2%)
Query: 22 SSKSQQANKELMSPTIDNENESNKKICNNGANALNKYPTYRGVRMRSWGKWVSEIRQPRK 81
+SK+Q+ + L+ D E+ +NKKI NK+P YRGVRMR+WGKWVSEIR+PRK
Sbjct: 17 ASKTQR-QEALVEEQQDPESPTNKKIKR--IRDTNKHPVYRGVRMRNWGKWVSEIREPRK 73
Query: 82 KSRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPELADELPRPASAAPKDIQAAAAKA 141
KSRIWLG+FPT EMAARAHDVAAL+IKG+SA LNFPELAD LPRPAS AP+D+Q AAAKA
Sbjct: 74 KSRIWLGTFPTPEMAARAHDVAALSIKGNSAILNFPELADSLPRPASLAPRDVQIAAAKA 133
Query: 142 AAAAIFD 148
A FD
Sbjct: 134 AQMDTFD 140
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224138644|ref|XP_002326654.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] gi|148372079|gb|ABQ62971.1| TINY-like protein [Populus trichocarpa] gi|222833976|gb|EEE72453.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255549882|ref|XP_002515992.1| Transcriptional factor TINY, putative [Ricinus communis] gi|223544897|gb|EEF46412.1| Transcriptional factor TINY, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|356504603|ref|XP_003521085.1| PREDICTED: ethylene-responsive transcription factor ERF034-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|255541118|ref|XP_002511623.1| Transcriptional factor TINY, putative [Ricinus communis] gi|223548803|gb|EEF50292.1| Transcriptional factor TINY, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|255647266|gb|ACU24100.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297824531|ref|XP_002880148.1| hypothetical protein ARALYDRAFT_483623 [Arabidopsis lyrata subsp. lyrata] gi|297325987|gb|EFH56407.1| hypothetical protein ARALYDRAFT_483623 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|297811277|ref|XP_002873522.1| hypothetical protein ARALYDRAFT_487996 [Arabidopsis lyrata subsp. lyrata] gi|297319359|gb|EFH49781.1| hypothetical protein ARALYDRAFT_487996 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|21592664|gb|AAM64613.1| putative AP2 domain transcription factor [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|15225381|ref|NP_182021.1| ethylene-responsive transcription factor ERF034 [Arabidopsis thaliana] gi|148839467|sp|Q8LBQ7.2|ERF34_ARATH RecName: Full=Ethylene-responsive transcription factor ERF034 gi|20196923|gb|AAM14835.1| putative AP2 domain transcription factor [Arabidopsis thaliana] gi|30793877|gb|AAP40391.1| putative AP2 domain transcription factor [Arabidopsis thaliana] gi|30794084|gb|AAP40485.1| putative AP2 domain transcription factor [Arabidopsis thaliana] gi|110739089|dbj|BAF01461.1| putative AP2 domain transcription factor [Arabidopsis thaliana] gi|330255394|gb|AEC10488.1| ethylene-responsive transcription factor ERF034 [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 211 | ||||||
| TAIR|locus:2055007 | 295 | AT2G44940 [Arabidopsis thalian | 0.407 | 0.291 | 0.727 | 4.9e-30 | |
| TAIR|locus:2129111 | 179 | AT4G16750 [Arabidopsis thalian | 0.459 | 0.541 | 0.646 | 1.6e-29 | |
| TAIR|locus:2144296 | 236 | TINY2 "AT5G11590" [Arabidopsis | 0.454 | 0.406 | 0.649 | 7.1e-29 | |
| TAIR|locus:2103301 | 256 | AT3G60490 [Arabidopsis thalian | 0.407 | 0.335 | 0.724 | 1.5e-28 | |
| TAIR|locus:2058764 | 194 | ERF38 "ERF family protein 38" | 0.336 | 0.365 | 0.816 | 5e-28 | |
| TAIR|locus:2134128 | 221 | AT4G32800 [Arabidopsis thalian | 0.331 | 0.316 | 0.828 | 1.7e-27 | |
| TAIR|locus:2145249 | 218 | tny "TINY" [Arabidopsis thalia | 0.393 | 0.380 | 0.670 | 2.8e-27 | |
| TAIR|locus:2043495 | 225 | ESE2 "ethylene and salt induci | 0.308 | 0.288 | 0.892 | 3.5e-27 | |
| TAIR|locus:2094897 | 236 | AT3G16280 [Arabidopsis thalian | 0.459 | 0.411 | 0.608 | 5.7e-27 | |
| TAIR|locus:2195985 | 244 | AT1G77200 [Arabidopsis thalian | 0.436 | 0.377 | 0.621 | 2.2e-25 |
| TAIR|locus:2055007 AT2G44940 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
Identities = 64/88 (72%), Positives = 74/88 (84%)
Query: 38 DNENESNKKICNNGANALNKYPTYRGVRMRSWGKWVSEIRQPRKKSRIWLGSFPTAEMAA 97
D++N K N +K+PTYRGVRMRSWGKWVSEIR+PRKKSRIWLG++PTAEMAA
Sbjct: 79 DDQNGGGGK--RRKTNGGDKHPTYRGVRMRSWGKWVSEIREPRKKSRIWLGTYPTAEMAA 136
Query: 98 RAHDVAALAIKGHSAHLNFPELADELPR 125
RAHDVAALAIKG +A+LNFP+LA ELPR
Sbjct: 137 RAHDVAALAIKGTTAYLNFPKLAGELPR 164
|
|
| TAIR|locus:2129111 AT4G16750 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2144296 TINY2 "AT5G11590" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2103301 AT3G60490 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2058764 ERF38 "ERF family protein 38" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2134128 AT4G32800 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2145249 tny "TINY" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2043495 ESE2 "ethylene and salt inducible 2" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2094897 AT3G16280 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2195985 AT1G77200 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| fgenesh2_kg.4__2683__AT2G44940.1 | annotation not avaliable (290 aa) | |||||||
(Arabidopsis lyrata) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 211 | |||
| smart00380 | 64 | smart00380, AP2, DNA-binding domain in plant prote | 2e-33 | |
| cd00018 | 61 | cd00018, AP2, DNA-binding domain found in transcri | 4e-31 | |
| pfam00847 | 53 | pfam00847, AP2, AP2 domain | 3e-12 |
| >gnl|CDD|197689 smart00380, AP2, DNA-binding domain in plant proteins such as APETALA2 and EREBPs | Back alignment and domain information |
|---|
Score = 114 bits (287), Expect = 2e-33
Identities = 38/59 (64%), Positives = 43/59 (72%)
Query: 61 YRGVRMRSWGKWVSEIRQPRKKSRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPEL 119
YRGVR R WGKWV+EIR P K R+WLG+F TAE AARA+D AA +G SA LNFP
Sbjct: 2 YRGVRQRPWGKWVAEIRDPSKGKRVWLGTFDTAEEAARAYDRAAFKFRGRSARLNFPNS 60
|
Length = 64 |
| >gnl|CDD|237985 cd00018, AP2, DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) | Back alignment and domain information |
|---|
| >gnl|CDD|216148 pfam00847, AP2, AP2 domain | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 211 | |||
| cd00018 | 61 | AP2 DNA-binding domain found in transcription regu | 99.87 | |
| smart00380 | 64 | AP2 DNA-binding domain in plant proteins such as A | 99.85 | |
| PHA00280 | 121 | putative NHN endonuclease | 99.53 | |
| PF00847 | 56 | AP2: AP2 domain; InterPro: IPR001471 Pathogenesis- | 99.18 |
| >cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) | Back alignment and domain information |
|---|
Probab=99.87 E-value=3.6e-22 Score=139.99 Aligned_cols=61 Identities=64% Similarity=1.085 Sum_probs=57.9
Q ss_pred CceeEEEeCCCCcEEEEEeeCCCCeEEecCCCCCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 041015 59 PTYRGVRMRSWGKWVSEIRQPRKKSRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPEL 119 (211)
Q Consensus 59 S~yRGVr~r~~GkW~A~I~~p~~~kri~LGtF~T~EeAArAYD~Aal~l~G~~A~lNFp~s 119 (211)
|+||||+++++|||+|+|+++..++++|||+|+|+|||++|||.++++++|.++.+|||++
T Consensus 1 s~~~GV~~~~~gkw~A~I~~~~~gk~~~lG~f~t~eeAa~Ayd~a~~~~~g~~a~~Nf~~~ 61 (61)
T cd00018 1 SKYRGVRQRPWGKWVAEIRDPSGGRRIWLGTFDTAEEAARAYDRAALKLRGSSAVLNFPDS 61 (61)
T ss_pred CCccCEEECCCCcEEEEEEeCCCCceEccCCCCCHHHHHHHHHHHHHHhcCCccccCCCCC
Confidence 6899999998999999999966699999999999999999999999999999999999985
|
In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies. |
| >smart00380 AP2 DNA-binding domain in plant proteins such as APETALA2 and EREBPs | Back alignment and domain information |
|---|
| >PHA00280 putative NHN endonuclease | Back alignment and domain information |
|---|
| >PF00847 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-related genes transcriptional activator binds to the GCC-box pathogenesis-related promoter element and activates the plant's defence genes | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 211 | ||||
| 2gcc_A | 70 | Solution Structure Of The Gcc-Box Binding Domain, N | 2e-12 | ||
| 1gcc_A | 63 | Solution Nmr Structure Of The Complex Of Gcc-Box Bi | 4e-12 |
| >pdb|2GCC|A Chain A, Solution Structure Of The Gcc-Box Binding Domain, Nmr, Minimized Mean Structure Length = 70 | Back alignment and structure |
|
| >pdb|1GCC|A Chain A, Solution Nmr Structure Of The Complex Of Gcc-Box Binding Domain Of Aterf1 And Gcc-Box Dna, Minimized Average Structure Length = 63 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 211 | |||
| 1gcc_A | 63 | Ethylene responsive element binding factor 1; tran | 3e-28 |
| >1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Length = 63 | Back alignment and structure |
|---|
Score = 100 bits (251), Expect = 3e-28
Identities = 34/59 (57%), Positives = 42/59 (71%), Gaps = 1/59 (1%)
Query: 61 YRGVRMRSWGKWVSEIRQPRKK-SRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPE 118
YRGVR R WGK+ +EIR P K +R+WLG+F TAE AA A+D AA ++G A LNFP
Sbjct: 3 YRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPL 61
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 211 | |||
| 1gcc_A | 63 | Ethylene responsive element binding factor 1; tran | 99.93 | |
| 3igm_B | 77 | PF14_0633 protein; AP2 domain, specific transcript | 84.25 | |
| 1u3e_M | 174 | HNH homing endonuclease; HNH catalytic motif, heli | 80.79 |
| >1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A | Back alignment and structure |
|---|
Probab=99.93 E-value=1.3e-26 Score=164.28 Aligned_cols=60 Identities=57% Similarity=0.945 Sum_probs=57.3
Q ss_pred ceeEEEeCCCCcEEEEEeeCCC-CeEEecCCCCCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 041015 60 TYRGVRMRSWGKWVSEIRQPRK-KSRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPEL 119 (211)
Q Consensus 60 ~yRGVr~r~~GkW~A~I~~p~~-~kri~LGtF~T~EeAArAYD~Aal~l~G~~A~lNFp~s 119 (211)
+||||++++||||+|+|++|.+ |++||||+|+|+||||+|||.|+++++|..+.||||++
T Consensus 2 ~yrGV~~r~~gkw~A~I~~~~~~g~r~~LGtf~T~eeAA~AyD~Aa~~~~G~~a~~NFp~~ 62 (63)
T 1gcc_A 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR 62 (63)
T ss_dssp CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred CcccEeeCCCCcEEEEEccccCCCeEEEeeeCCCHHHHHHHHHHHHHHhcCcccccCCCCc
Confidence 5999999999999999999974 79999999999999999999999999999999999986
|
| >3igm_B PF14_0633 protein; AP2 domain, specific transcription FA protein-DNA complex, transcription-DNA complex; 2.20A {Plasmodium falciparum} | Back alignment and structure |
|---|
| >1u3e_M HNH homing endonuclease; HNH catalytic motif, helix-turn-helix DNA binding domain, PR complex, DNA binding protein-DNA complex; 2.92A {Bacillus phage SPO1} SCOP: d.4.1.3 d.285.1.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 211 | ||||
| d1gcca_ | 63 | d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cr | 3e-30 |
| >d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 63 | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: DNA-binding domain superfamily: DNA-binding domain family: GCC-box binding domain domain: GCC-box binding domain species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Score = 104 bits (260), Expect = 3e-30
Identities = 34/59 (57%), Positives = 42/59 (71%), Gaps = 1/59 (1%)
Query: 61 YRGVRMRSWGKWVSEIRQPRKK-SRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPE 118
YRGVR R WGK+ +EIR P K +R+WLG+F TAE AA A+D AA ++G A LNFP
Sbjct: 3 YRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPL 61
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 211 | |||
| d1gcca_ | 63 | GCC-box binding domain {Mouse-ear cress (Arabidops | 99.92 |
| >d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: DNA-binding domain superfamily: DNA-binding domain family: GCC-box binding domain domain: GCC-box binding domain species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.92 E-value=2.5e-26 Score=161.63 Aligned_cols=60 Identities=55% Similarity=0.932 Sum_probs=56.1
Q ss_pred ceeEEEeCCCCcEEEEEeeCC-CCeEEecCCCCCHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 041015 60 TYRGVRMRSWGKWVSEIRQPR-KKSRIWLGSFPTAEMAARAHDVAALAIKGHSAHLNFPEL 119 (211)
Q Consensus 60 ~yRGVr~r~~GkW~A~I~~p~-~~kri~LGtF~T~EeAArAYD~Aal~l~G~~A~lNFp~s 119 (211)
.||||+++++|||+|+|++|. ++++||||+|+|+||||+|||+|+++++|..+.+|||..
T Consensus 2 ~yrGVr~r~~gkw~A~Ir~~~~~~~r~~LGtf~t~eeAArAYD~aa~~~~G~~a~~NFP~~ 62 (63)
T d1gcca_ 2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR 62 (63)
T ss_dssp CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred CcceEeECCCCCEEEEEecCCCCCcEeccccccCHHHHHHHHHHHHHHhcCCCcccCCCcc
Confidence 499999998999999999974 668999999999999999999999999999999999964
|