Citrus Sinensis ID: 036154


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190---
MVNSRHSVIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGDATDSSSDEDCEVYKRPRVKKFLSEIMIASSSSSSEIDVVSKSRISKCKKKKKSGKVGAPATLKPAAKQPAGKKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATPPPKFPSQIKQSLSS
cccccccccccccccccccccccccccccccccccccEEEEEEccccccccccccccccccccccccccEEEEccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccEEEEEEccccccEEEEEccccHHHHHHHHHHHHHHHcccccccccccccccccccccccccc
cccccccccccEEEEEEEEEEEEEccccccccccccEEEEEEEEcccccccccccccccccccccEEEEEEEEccccccccccccccccccccccccccccccccccccccccccccccEEEEEEEcccccEEEEEEccccccEEEEcccccHHHHHHHHHHHHHHHcccccEEccccccccccccccccccc
mvnsrhsvippvkytehrkhtklmtpfstngqemkprVVRISvtdgdatdsssdedcevykrprvKKFLSEIMIasssssseidvvSKSRiskckkkkksgkvgapatlkpaakqpagkkfrgvrqrpwgkwaaEIRDPLRRVRLWLGTYDTAEEAAMVYDNAaiqlrgpdaltnfatpppkfpsqikqslss
mvnsrhsvippvkytehrkhtklmtpfstngqemkpRVVRISvtdgdatdsssdedcevykrprvkKFLSEIMiasssssseidvvsksriskckkkkksgkvgapatlkpaakqpagkkfrgvrqrpwgkwaaeirdplrrVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFatpppkfpsqikqslss
MVNSRHSVIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGDATDSSSDEDCEVYKRPRVKKFLseimiasssssseidvvsksriskckkkkksgkVGAPATLKPAAKQPAGKKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATPPPKFPSQIKQSLSS
***************************************************************************************************************************VRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDAL********************
****************************************ISVTDGDATDS****************FLS******************************************************RQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFA****************
MVNSRHSVIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVT**************VYKRPRVKKFLSEIMIA****************************************************PWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATPPPK***********
*******VIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGD***************PRVKKFLSEIMIAS****************************************AGKKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATP**************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MVNSRHSVIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGDATDSSSDEDCEVYKRPRVKKFLSEIMIASSSSSSEIDVVSKSRISKCKKKKKSGKVGAPATLKPAAKQPAGKKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATPPPKFPSQIKQSLSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query193 2.2.26 [Sep-21-2011]
Q9SUQ2 343 Ethylene-responsive trans yes no 0.854 0.481 0.560 2e-41
Q9SUE3 335 Ethylene-responsive trans no no 0.870 0.501 0.530 2e-40
O82503 287 Ethylene-responsive trans no no 0.720 0.484 0.546 2e-33
Q9FK12 354 Ethylene-responsive trans no no 0.839 0.457 0.516 9e-32
Q9M374 315 Ethylene-responsive trans no no 0.865 0.530 0.491 1e-29
Q9C995161 Ethylene-responsive trans no no 0.673 0.807 0.503 7e-29
O82339 294 Ethylene-responsive trans no no 0.777 0.510 0.452 7e-28
O04682248 Pathogenesis-related gene N/A no 0.746 0.580 0.447 7e-24
Q8W4I5159 Ethylene-responsive trans no no 0.658 0.798 0.533 1e-23
Q84XB3244 Ethylene-responsive trans N/A no 0.373 0.295 0.616 2e-20
>sp|Q9SUQ2|CRF2_ARATH Ethylene-responsive transcription factor CRF2 OS=Arabidopsis thaliana GN=CRF2 PE=1 SV=1 Back     alignment and function desciption
 Score =  168 bits (426), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 102/182 (56%), Positives = 127/182 (69%), Gaps = 17/182 (9%)

Query: 8   VIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGDATDSSSDEDCE-----VYKR 62
           V+P +K+TEH+ +T  +    TN  +   R++RISVTD DATDSSSD++ E     V KR
Sbjct: 8   VLPRIKFTEHKTNTTTIVSELTNTHQT--RILRISVTDPDATDSSSDDEEEEHQRFVSKR 65

Query: 63  PRVKKFLSEIMIASSSSSSEIDVVSKSRISKCKKKKKSGKVGAPATLKP---AAKQPAG- 118
            RVKKF++E+ + S +      VV+ S      KK++   V + +T+ P   A     G 
Sbjct: 66  RRVKKFVNEVYLDSGA------VVTGSCGQMESKKRQKRAVKSESTVSPVVSATTTTTGE 119

Query: 119 KKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFAT 178
           KKFRGVRQRPWGKWAAEIRDPL+RVRLWLGTY+TAEEAAMVYDNAAIQLRGPDALTNF+ 
Sbjct: 120 KKFRGVRQRPWGKWAAEIRDPLKRVRLWLGTYNTAEEAAMVYDNAAIQLRGPDALTNFSV 179

Query: 179 PP 180
            P
Sbjct: 180 TP 181




Component of the cytokinin signaling pathway involved in cotyledons, leaves, and embryos development. Probably acts as a transcriptional activator. Binds to the GCC-box pathogenesis-related promoter element. May be involved in the regulation of gene expression by stress factors and by components of stress signal transduction pathways.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9SUE3|CRF4_ARATH Ethylene-responsive transcription factor CRF4 OS=Arabidopsis thaliana GN=CRF4 PE=1 SV=2 Back     alignment and function description
>sp|O82503|CRF1_ARATH Ethylene-responsive transcription factor CRF1 OS=Arabidopsis thaliana GN=CRF1 PE=1 SV=1 Back     alignment and function description
>sp|Q9FK12|CRF3_ARATH Ethylene-responsive transcription factor CRF3 OS=Arabidopsis thaliana GN=CRF3 PE=1 SV=1 Back     alignment and function description
>sp|Q9M374|CRF6_ARATH Ethylene-responsive transcription factor CRF6 OS=Arabidopsis thaliana GN=CRF6 PE=1 SV=1 Back     alignment and function description
>sp|Q9C995|ERF70_ARATH Ethylene-responsive transcription factor ERF070 OS=Arabidopsis thaliana GN=ERF070 PE=1 SV=1 Back     alignment and function description
>sp|O82339|CRF5_ARATH Ethylene-responsive transcription factor CRF5 OS=Arabidopsis thaliana GN=CRF5 PE=1 SV=1 Back     alignment and function description
>sp|O04682|PTI6_SOLLC Pathogenesis-related genes transcriptional activator PTI6 OS=Solanum lycopersicum GN=PTI6 PE=2 SV=1 Back     alignment and function description
>sp|Q8W4I5|ERF69_ARATH Ethylene-responsive transcription factor ERF069 OS=Arabidopsis thaliana GN=ERF069 PE=1 SV=1 Back     alignment and function description
>sp|Q84XB3|ERF1_SOLLC Ethylene-responsive transcription factor 1 OS=Solanum lycopersicum GN=ERF1 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query193
147801286292 hypothetical protein VITISV_007771 [Viti 0.891 0.589 0.659 8e-58
225426038 327 PREDICTED: ethylene-responsive transcrip 0.891 0.525 0.659 2e-57
449452420 318 PREDICTED: ethylene-responsive transcrip 0.844 0.512 0.642 2e-55
449501843 318 PREDICTED: ethylene-responsive transcrip 0.844 0.512 0.642 5e-55
224058399 347 AP2/ERF domain-containing transcription 0.891 0.495 0.606 3e-49
449464098 319 PREDICTED: ethylene-responsive transcrip 0.813 0.492 0.613 2e-48
449463464 326 PREDICTED: ethylene-responsive transcrip 0.844 0.5 0.594 3e-48
224118152 365 AP2/ERF domain-containing transcription 0.901 0.476 0.550 1e-47
224072103 348 AP2/ERF domain-containing transcription 0.880 0.488 0.588 5e-47
255545072 309 Ethylene-responsive transcription factor 0.865 0.540 0.546 1e-46
>gi|147801286|emb|CAN77118.1| hypothetical protein VITISV_007771 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  228 bits (581), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 118/179 (65%), Positives = 140/179 (78%), Gaps = 7/179 (3%)

Query: 8   VIPPVKYTEHRKHTKLMTPFST--NGQEMK---PRVVRISVTDGDATDSSSDEDCEVYKR 62
           ++ P+KY+EHRK TKL+T  S    G+ M    PR+VRISVTDGDATDSSSDE+ E++ R
Sbjct: 1   MLSPIKYSEHRKRTKLLTTSSEKFTGRSMNADGPRIVRISVTDGDATDSSSDEEGELFAR 60

Query: 63  PRVKKFLSEIMIAS-SSSSSEIDVVSKSRISKCKKKKKSGKVGAPATLKPAAKQPAGKKF 121
            RVKKF++EI I S S  +S+   V +SR ++  +KK +GK G    ++  AK   GKKF
Sbjct: 61  QRVKKFVNEITIQSCSGXNSDGTTVWRSRSARNGRKKSTGK-GEAQAVRQLAKPSTGKKF 119

Query: 122 RGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATPP 180
           RGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFA+PP
Sbjct: 120 RGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFASPP 178




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225426038|ref|XP_002271526.1| PREDICTED: ethylene-responsive transcription factor CRF2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449452420|ref|XP_004143957.1| PREDICTED: ethylene-responsive transcription factor CRF2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449501843|ref|XP_004161474.1| PREDICTED: ethylene-responsive transcription factor CRF2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224058399|ref|XP_002299493.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] gi|222846751|gb|EEE84298.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449464098|ref|XP_004149766.1| PREDICTED: ethylene-responsive transcription factor CRF2-like [Cucumis sativus] gi|449527296|ref|XP_004170648.1| PREDICTED: ethylene-responsive transcription factor CRF2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449463464|ref|XP_004149454.1| PREDICTED: ethylene-responsive transcription factor CRF4-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224118152|ref|XP_002317743.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] gi|222858416|gb|EEE95963.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224072103|ref|XP_002303625.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] gi|222841057|gb|EEE78604.1| AP2/ERF domain-containing transcription factor [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255545072|ref|XP_002513597.1| Ethylene-responsive transcription factor, putative [Ricinus communis] gi|223547505|gb|EEF49000.1| Ethylene-responsive transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query193
TAIR|locus:2128429 343 CRF2 "AT4G23750" [Arabidopsis 0.906 0.510 0.510 1.3e-38
TAIR|locus:2132962 335 CRF4 "AT4G27950" [Arabidopsis 0.870 0.501 0.469 2.2e-34
TAIR|locus:2136078 287 CRF1 "AT4G11140" [Arabidopsis 0.404 0.271 0.687 3.4e-34
TAIR|locus:2154272 354 CRF3 "AT5G53290" [Arabidopsis 0.865 0.471 0.486 4.6e-34
TAIR|locus:2076720 315 CRF6 "AT3G61630" [Arabidopsis 0.388 0.238 0.666 1.6e-32
TAIR|locus:2062954 294 CRF5 "AT2G46310" [Arabidopsis 0.352 0.231 0.661 3.3e-30
TAIR|locus:505006142159 CRF7 "AT1G22985" [Arabidopsis 0.383 0.465 0.666 4.5e-27
TAIR|locus:2026301161 CRF8 "AT1G71130" [Arabidopsis 0.373 0.447 0.607 1e-25
TAIR|locus:2032510 189 ERF12 "ERF domain protein 12" 0.305 0.312 0.644 1.5e-19
TAIR|locus:2183861 212 Rap2.6L "related to AP2 6l" [A 0.347 0.316 0.597 3.1e-19
TAIR|locus:2128429 CRF2 "AT4G23750" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 413 (150.4 bits), Expect = 1.3e-38, P = 1.3e-38
 Identities = 95/186 (51%), Positives = 112/186 (60%)

Query:     1 MVNSRHSVIPPVKYTEHRKHTKLMTPFSTNGQEMKPRVVRISVTDGDATDSSSDEDCE-- 58
             M   +  V+P +K+TEH+ +T  +    TN  +   R++RISVTD DATDSSSD++ E  
Sbjct:     1 MEAEKKMVLPRIKFTEHKTNTTTIVSELTNTHQT--RILRISVTDPDATDSSSDDEEEEH 58

Query:    59 ---VYKRPRVKKFLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVGAPATLKPAAKQ 115
                V KR RVKKF+                                 V +P     A   
Sbjct:    59 QRFVSKRRRVKKFVNEVYLDSGAVVTGSCGQMESKKRQKRAVKSESTV-SPVV--SATTT 115

Query:   116 PAG-KKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALT 174
               G KKFRGVRQRPWGKWAAEIRDPL+RVRLWLGTY+TAEEAAMVYDNAAIQLRGPDALT
Sbjct:   116 TTGEKKFRGVRQRPWGKWAAEIRDPLKRVRLWLGTYNTAEEAAMVYDNAAIQLRGPDALT 175

Query:   175 NFATPP 180
             NF+  P
Sbjct:   176 NFSVTP 181




GO:0003677 "DNA binding" evidence=ISS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0005634 "nucleus" evidence=ISM;ISS;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA;ISS
GO:0042991 "transcription factor import into nucleus" evidence=RCA;IDA
GO:0048825 "cotyledon development" evidence=IMP
GO:0005515 "protein binding" evidence=IPI
GO:0048364 "root development" evidence=IEP
GO:0042802 "identical protein binding" evidence=IPI
GO:0006606 "protein import into nucleus" evidence=RCA
TAIR|locus:2132962 CRF4 "AT4G27950" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2136078 CRF1 "AT4G11140" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2154272 CRF3 "AT5G53290" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2076720 CRF6 "AT3G61630" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062954 CRF5 "AT2G46310" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006142 CRF7 "AT1G22985" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026301 CRF8 "AT1G71130" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2032510 ERF12 "ERF domain protein 12" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2183861 Rap2.6L "related to AP2 6l" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query193
smart0038064 smart00380, AP2, DNA-binding domain in plant prote 7e-34
cd0001861 cd00018, AP2, DNA-binding domain found in transcri 3e-32
pfam0084753 pfam00847, AP2, AP2 domain 1e-12
>gnl|CDD|197689 smart00380, AP2, DNA-binding domain in plant proteins such as APETALA2 and EREBPs Back     alignment and domain information
 Score =  114 bits (288), Expect = 7e-34
 Identities = 39/62 (62%), Positives = 44/62 (70%)

Query: 120 KFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATP 179
           K+RGVRQRPWGKW AEIRDP +  R+WLGT+DTAEEAA  YD AA + RG  A  NF   
Sbjct: 1   KYRGVRQRPWGKWVAEIRDPSKGKRVWLGTFDTAEEAARAYDRAAFKFRGRSARLNFPNS 60

Query: 180 PP 181
             
Sbjct: 61  LY 62


Length = 64

>gnl|CDD|237985 cd00018, AP2, DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) Back     alignment and domain information
>gnl|CDD|216148 pfam00847, AP2, AP2 domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 193
cd0001861 AP2 DNA-binding domain found in transcription regu 99.84
smart0038064 AP2 DNA-binding domain in plant proteins such as A 99.82
PHA00280121 putative NHN endonuclease 99.44
PF0084756 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis- 99.18
PF1465746 Integrase_AP2: AP2-like DNA-binding integrase doma 86.22
>cd00018 AP2 DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein) Back     alignment and domain information
Probab=99.84  E-value=5.1e-21  Score=132.41  Aligned_cols=61  Identities=66%  Similarity=1.170  Sum_probs=57.5

Q ss_pred             CceeeEEECCCCcEEEEEecCCCCeEeecCCCCCHHHHHHHHHHHHHHhhCCCCCCCCCCC
Q 036154          119 KKFRGVRQRPWGKWAAEIRDPLRRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATP  179 (193)
Q Consensus       119 S~yRGVr~r~~GkW~A~I~~~~~~kri~LGtfdT~EeAA~AYD~Aa~~~~G~~A~~NFp~~  179 (193)
                      |+|+||+++++|+|+|+|+++..++++|||+|+|+||||.|||.+++.++|.++.+|||.+
T Consensus         1 s~~~GV~~~~~gkw~A~I~~~~~gk~~~lG~f~t~eeAa~Ayd~a~~~~~g~~a~~Nf~~~   61 (61)
T cd00018           1 SKYRGVRQRPWGKWVAEIRDPSGGRRIWLGTFDTAEEAARAYDRAALKLRGSSAVLNFPDS   61 (61)
T ss_pred             CCccCEEECCCCcEEEEEEeCCCCceEccCCCCCHHHHHHHHHHHHHHhcCCccccCCCCC
Confidence            6899998888899999999976699999999999999999999999999999999999964



In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies.

>smart00380 AP2 DNA-binding domain in plant proteins such as APETALA2 and EREBPs Back     alignment and domain information
>PHA00280 putative NHN endonuclease Back     alignment and domain information
>PF00847 AP2: AP2 domain; InterPro: IPR001471 Pathogenesis-related genes transcriptional activator binds to the GCC-box pathogenesis-related promoter element and activates the plant's defence genes Back     alignment and domain information
>PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query193
2gcc_A70 Solution Structure Of The Gcc-Box Binding Domain, N 2e-17
1gcc_A63 Solution Nmr Structure Of The Complex Of Gcc-Box Bi 2e-16
>pdb|2GCC|A Chain A, Solution Structure Of The Gcc-Box Binding Domain, Nmr, Minimized Mean Structure Length = 70 Back     alignment and structure

Iteration: 1

Score = 85.1 bits (209), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 39/60 (65%), Positives = 49/60 (81%), Gaps = 1/60 (1%) Query: 118 GKKFRGVRQRPWGKWAAEIRDPLRR-VRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNF 176 GK +RGVRQRPWGK+AAEIRDP + R+WLGT++TAE+AA+ YD AA ++RG AL NF Sbjct: 3 GKHYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 62
>pdb|1GCC|A Chain A, Solution Nmr Structure Of The Complex Of Gcc-Box Binding Domain Of Aterf1 And Gcc-Box Dna, Minimized Average Structure Length = 63 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query193
1gcc_A63 Ethylene responsive element binding factor 1; tran 1e-35
>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Length = 63 Back     alignment and structure
 Score =  118 bits (298), Expect = 1e-35
 Identities = 38/59 (64%), Positives = 48/59 (81%), Gaps = 1/59 (1%)

Query: 119 KKFRGVRQRPWGKWAAEIRDPLRR-VRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNF 176
           K +RGVRQRPWGK+AAEIRDP +   R+WLGT++TAE+AA+ YD AA ++RG  AL NF
Sbjct: 1   KHYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 59


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query193
1gcc_A63 Ethylene responsive element binding factor 1; tran 99.91
3igm_B77 PF14_0633 protein; AP2 domain, specific transcript 88.81
1u3e_M174 HNH homing endonuclease; HNH catalytic motif, heli 80.19
>1gcc_A Ethylene responsive element binding factor 1; transcription factor, protein-DNA complex, ethylene inducible; HET: DNA; NMR {Arabidopsis thaliana} SCOP: d.10.1.2 PDB: 2gcc_A 3gcc_A Back     alignment and structure
Probab=99.91  E-value=3e-25  Score=155.40  Aligned_cols=60  Identities=62%  Similarity=1.146  Sum_probs=57.0

Q ss_pred             ceeeEEECCCCcEEEEEecCCC-CeEeecCCCCCHHHHHHHHHHHHHHhhCCCCCCCCCCC
Q 036154          120 KFRGVRQRPWGKWAAEIRDPLR-RVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATP  179 (193)
Q Consensus       120 ~yRGVr~r~~GkW~A~I~~~~~-~kri~LGtfdT~EeAA~AYD~Aa~~~~G~~A~~NFp~~  179 (193)
                      +|+||++++||||+|+|++|.. ++++|||+|+|+||||+|||.|++.++|.+|.+|||.+
T Consensus         2 ~yrGV~~r~~gkw~A~I~~~~~~g~r~~LGtf~T~eeAA~AyD~Aa~~~~G~~a~~NFp~~   62 (63)
T 1gcc_A            2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR   62 (63)
T ss_dssp             CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred             CcccEeeCCCCcEEEEEccccCCCeEEEeeeCCCHHHHHHHHHHHHHHhcCcccccCCCCc
Confidence            6999988899999999999875 79999999999999999999999999999999999975



>3igm_B PF14_0633 protein; AP2 domain, specific transcription FA protein-DNA complex, transcription-DNA complex; 2.20A {Plasmodium falciparum} Back     alignment and structure
>1u3e_M HNH homing endonuclease; HNH catalytic motif, helix-turn-helix DNA binding domain, PR complex, DNA binding protein-DNA complex; 2.92A {Bacillus phage SPO1} SCOP: d.4.1.3 d.285.1.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 193
d1gcca_63 d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cr 2e-29
>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 63 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: GCC-box binding domain
domain: GCC-box binding domain
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
 Score =  101 bits (253), Expect = 2e-29
 Identities = 38/59 (64%), Positives = 48/59 (81%), Gaps = 1/59 (1%)

Query: 119 KKFRGVRQRPWGKWAAEIRDPLR-RVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNF 176
           K +RGVRQRPWGK+AAEIRDP +   R+WLGT++TAE+AA+ YD AA ++RG  AL NF
Sbjct: 1   KHYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNF 59


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query193
d1gcca_63 GCC-box binding domain {Mouse-ear cress (Arabidops 99.91
>d1gcca_ d.10.1.2 (A:) GCC-box binding domain {Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: DNA-binding domain
superfamily: DNA-binding domain
family: GCC-box binding domain
domain: GCC-box binding domain
species: Mouse-ear cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.91  E-value=3.4e-25  Score=153.91  Aligned_cols=60  Identities=62%  Similarity=1.133  Sum_probs=55.9

Q ss_pred             ceeeEEECCCCcEEEEEecCC-CCeEeecCCCCCHHHHHHHHHHHHHHhhCCCCCCCCCCC
Q 036154          120 KFRGVRQRPWGKWAAEIRDPL-RRVRLWLGTYDTAEEAAMVYDNAAIQLRGPDALTNFATP  179 (193)
Q Consensus       120 ~yRGVr~r~~GkW~A~I~~~~-~~kri~LGtfdT~EeAA~AYD~Aa~~~~G~~A~~NFp~~  179 (193)
                      +||||+++++|||.|+|++|. +++++|||+|+|+||||+|||.|++.++|.+|.+|||..
T Consensus         2 ~yrGVr~r~~gkw~A~Ir~~~~~~~r~~LGtf~t~eeAArAYD~aa~~~~G~~a~~NFP~~   62 (63)
T d1gcca_           2 HYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFPLR   62 (63)
T ss_dssp             CCTTEEEETTTEEEEEEEETTTTSEEEEEEEESSHHHHHHHHHHHHHHHHSSCCCCSSCTT
T ss_pred             CcceEeECCCCCEEEEEecCCCCCcEeccccccCHHHHHHHHHHHHHHhcCCCcccCCCcc
Confidence            599998888999999999975 558999999999999999999999999999999999964