Citrus Sinensis ID: 021987


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300----
MGNNEDGKSFKSEKPSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEAPTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEADGSTDGSDGNTVRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRLRKQVK
cccccccccccccccccccccccccccccccHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHcc
cccccccccccccccccccccccccccccccHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcHHHHHHHHHHHccHHHHHHHHHHHHcc
mgnnedgksfksekpsspppsdqgnihmytDWAAMQayygprvaippyynspiasghapqpymwgpaqpmmppygapyaaiystggvyahpavplgshahnhgvptspaavtplnteaptkssgnadRGLAKKLKGLDGLAMSIGnasaesaeggaeqrpsqseadgstdgsdgntvragqsrkkrsregtpiaggdgktdiqstpvpvgvnatpdKVLATAvaptsvsgkpvgpvlspgmptklelrnapgmnvkasptsvpqpcavlppetwiQNERELKRERRKQSNRESARRSRLRKQVK
mgnnedgksfksekpsspppsdQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTeaptkssgnadrGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSqseadgstdgsdgntvragqsrkkrsregtpiaggdgktdiqstpvPVGVNATPDKVLATAVAptsvsgkpvgpvlspgMPTKLELRNAPGMNVKASptsvpqpcavlppetwiqnerelkrerrkqsnresarrsrlrkqvk
MGNNEDGksfksekpsspppsDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWgpaqpmmppygapyaaIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEAPTKSSGNadrglakklkgldglaMsignasaesaeggaeQRPSQSEADGSTDGSDGNTVRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRLRKQVK
*************************IHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPL*****************************************************************************************************************************************************************************************************************
*****************************TDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPL*****************************************************************************************************************************************VGPVLSPGMPTKLEL*********************************************************
**********************QGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTE*********DRGLAKKLKGLDGLAMSIGNA*********************************************IAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNER*************************
************************NIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSP**************************************************************************************************************************KPVGPVLSPGMPTKLELRN*******************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGNNEDGKSFKSEKPSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEAPTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQSEADGSTDGSDGNTVRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPVLSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARRSRLRKQVK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query304 2.2.26 [Sep-21-2011]
Q99089 411 Common plant regulatory f N/A no 0.950 0.703 0.519 1e-79
P42776382 G-box-binding factor 3 OS yes no 0.884 0.704 0.540 1e-71
P42775360 G-box-binding factor 2 OS no no 0.884 0.747 0.457 6e-56
Q99142265 Transcriptional activator N/A no 0.690 0.792 0.513 1e-48
P42774315 G-box-binding factor 1 OS no no 0.786 0.758 0.331 5e-23
Q99091296 Light-inducible protein C N/A no 0.562 0.577 0.285 3e-13
P25032354 DNA-binding protein EMBP- N/A no 0.085 0.073 0.961 1e-06
P23922349 Transcription factor HBP- N/A no 0.101 0.088 0.696 6e-05
>sp|Q99089|CPRF1_PETCR Common plant regulatory factor 1 OS=Petroselinum crispum GN=CPRF1 PE=2 SV=1 Back     alignment and function desciption
 Score =  296 bits (758), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 160/308 (51%), Positives = 206/308 (66%), Gaps = 19/308 (6%)

Query: 1   MGNNEDGKSFKSEK---PSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGH 57
           MGN +D K+ K EK   P  P   DQ N H+Y DWAAMQAYYGPRVA+PPY+N  +ASG 
Sbjct: 1   MGNTDDVKAVKPEKLSSPPPPAAPDQSNSHVYPDWAAMQAYYGPRVALPPYFNPAVASGQ 60

Query: 58  APQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTE 117
           +P PYMWGP QP+MPPYG PYAA+Y+ GGVYAHP VPL              A +P++ +
Sbjct: 61  SPHPYMWGPPQPVMPPYGVPYAALYAHGGVYAHPGVPL--------------AASPMSMD 106

Query: 118 APTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQS-EADGSTDGSDGNT 176
              KSSG  + GL KKLKG D LAMSIGN  A+S+EG  E+  SQS E +GS+DGS+ N+
Sbjct: 107 THAKSSGTNEHGLIKKLKGHDDLAMSIGNGKADSSEGEMERTLSQSKETEGSSDGSNENS 166

Query: 177 VRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPV 236
            RA  + +KR R+  P   G+ K + QS+ +P    A  +K+L   VA   V+GK VG V
Sbjct: 167 KRAAVNGRKRGRDEAPNMIGEVKIETQSSVIP-SPRAKSEKLLGITVATPMVAGKVVGTV 225

Query: 237 LSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARR 296
           +SP M + LEL+++P  +   SP    QP  ++P ++W+ N+R+LKRERRKQSNRESARR
Sbjct: 226 VSPSMTSSLELKDSPKEHAVNSPAGGQQPSTMMPNDSWLHNDRDLKRERRKQSNRESARR 285

Query: 297 SRLRKQVK 304
           SRLRKQ +
Sbjct: 286 SRLRKQAE 293




Binds to the G-box-like motif (5'-ACGTGGC-3') of the chalcone synthase (CHS) gene promoter. G-box and G-box-like motifs are defined in promoters of certain plant genes which are regulated by such diverse stimuli as light-induction or hormone control.
Petroselinum crispum (taxid: 4043)
>sp|P42776|GBF3_ARATH G-box-binding factor 3 OS=Arabidopsis thaliana GN=GBF3 PE=1 SV=2 Back     alignment and function description
>sp|P42775|GBF2_ARATH G-box-binding factor 2 OS=Arabidopsis thaliana GN=GBF2 PE=1 SV=1 Back     alignment and function description
>sp|Q99142|TAF1_TOBAC Transcriptional activator TAF-1 (Fragment) OS=Nicotiana tabacum GN=TAF1 PE=2 SV=1 Back     alignment and function description
>sp|P42774|GBF1_ARATH G-box-binding factor 1 OS=Arabidopsis thaliana GN=GBF1 PE=2 SV=2 Back     alignment and function description
>sp|Q99091|CPRF3_PETCR Light-inducible protein CPRF3 OS=Petroselinum crispum GN=CPRF3 PE=2 SV=1 Back     alignment and function description
>sp|P25032|EMBP1_WHEAT DNA-binding protein EMBP-1 OS=Triticum aestivum PE=1 SV=1 Back     alignment and function description
>sp|P23922|HBP1A_WHEAT Transcription factor HBP-1a OS=Triticum aestivum PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query304
357512481 425 G-box binding factor [Medicago truncatul 0.990 0.708 0.665 1e-109
388501482 425 unknown [Medicago truncatula] 0.990 0.708 0.662 1e-109
255541782401 Common plant regulatory factor CPRF-1, p 0.914 0.693 0.669 1e-106
364521140 421 bZIP protein [Medicago sativa] 0.990 0.714 0.649 1e-106
449432082 419 PREDICTED: common plant regulatory facto 0.986 0.715 0.645 1e-105
307136354 419 bzip transcription factor [Cucumis melo 0.986 0.715 0.642 1e-104
51870705 424 bZip transcription factor [Psophocarpus 0.986 0.707 0.642 1e-103
356560615 424 PREDICTED: common plant regulatory facto 0.986 0.707 0.637 1e-103
359489695 430 PREDICTED: common plant regulatory facto 1.0 0.706 0.697 1e-102
169959 423 G-box binding factor, partial [Glycine m 0.983 0.706 0.636 1e-102
>gi|357512481|ref|XP_003626529.1| G-box binding factor [Medicago truncatula] gi|355501544|gb|AES82747.1| G-box binding factor [Medicago truncatula] Back     alignment and taxonomy information
 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 205/308 (66%), Positives = 239/308 (77%), Gaps = 7/308 (2%)

Query: 1   MGNNEDGKSFKSEKPSSPPPSDQGN---IHMYTDWAAMQAYYGPRVAIPPYYNSPIASGH 57
           MGN+++ KS K+EKPSSP   DQ N   +H+Y DWAAMQAYYGPRVA+PPYYNSP+ASGH
Sbjct: 1   MGNSDEEKSTKTEKPSSPVTVDQTNQTNVHVYPDWAAMQAYYGPRVAMPPYYNSPVASGH 60

Query: 58  APQPYMWGPAQPMMPPYGAPYAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTE 117
            P PYMWGP QPMMPPYG PYAA+Y  GGVY HPAVP+G H H+ G+ +SPA  TPL+ E
Sbjct: 61  TPHPYMWGPPQPMMPPYGHPYAAMYPHGGVYTHPAVPIGPHPHSQGISSSPATGTPLSIE 120

Query: 118 APTKSSGNADRGLAKKLKGLDGLAMSIGNASAESAEGGAEQRPSQS-EADGSTDGSDGNT 176
            P KSSGN D+GL KKLKG DGLAMSIGN  AESAE GAE R SQS   +GS+DGSDGNT
Sbjct: 121 TPPKSSGNTDQGLMKKLKGFDGLAMSIGNGHAESAEPGAESRQSQSVNTEGSSDGSDGNT 180

Query: 177 VRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVSGKPVGPV 236
             A Q+R+KRSREGTP   G+GKT+ Q + +   + A+ DK++  AVAP  V+G+ VGPV
Sbjct: 181 SGANQTRRKRSREGTPTTDGEGKTNTQGSQISKEIAAS-DKMM--AVAPAGVTGQLVGPV 237

Query: 237 LSPGMPTKLELRNAPGMNVKASPTSVPQPCAVLPPETWIQNERELKRERRKQSNRESARR 296
            S  M T LELRN+  ++ K +PTS PQP AVLPPE WIQNERELKRERRKQSNRESARR
Sbjct: 238 ASSAMTTALELRNSSSVHSKTNPTSTPQPSAVLPPEAWIQNERELKRERRKQSNRESARR 297

Query: 297 SRLRKQVK 304
           SRLRKQ +
Sbjct: 298 SRLRKQAE 305




Source: Medicago truncatula

Species: Medicago truncatula

Genus: Medicago

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|388501482|gb|AFK38807.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|255541782|ref|XP_002511955.1| Common plant regulatory factor CPRF-1, putative [Ricinus communis] gi|223549135|gb|EEF50624.1| Common plant regulatory factor CPRF-1, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|364521140|gb|AEW66884.1| bZIP protein [Medicago sativa] Back     alignment and taxonomy information
>gi|449432082|ref|XP_004133829.1| PREDICTED: common plant regulatory factor 1-like [Cucumis sativus] gi|449480276|ref|XP_004155848.1| PREDICTED: common plant regulatory factor 1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|307136354|gb|ADN34168.1| bzip transcription factor [Cucumis melo subsp. melo] Back     alignment and taxonomy information
>gi|51870705|dbj|BAD42432.1| bZip transcription factor [Psophocarpus tetragonolobus] Back     alignment and taxonomy information
>gi|356560615|ref|XP_003548586.1| PREDICTED: common plant regulatory factor 1 [Glycine max] Back     alignment and taxonomy information
>gi|359489695|ref|XP_002279966.2| PREDICTED: common plant regulatory factor 1-like [Vitis vinifera] gi|297745388|emb|CBI40468.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|169959|gb|AAB00097.1| G-box binding factor, partial [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query304
TAIR|locus:2124938360 GBF2 "G-box binding factor 2" 0.746 0.630 0.377 6.8e-49
TAIR|locus:2063020382 GBF3 "G-box binding factor 3" 0.875 0.696 0.369 1.2e-35
TAIR|locus:2062455409 bZIP16 "basic region/leucine z 0.861 0.640 0.290 2.9e-21
TAIR|locus:2031705389 bZIP68 "basic region/leucine z 0.444 0.347 0.360 1.9e-20
TAIR|locus:2115250315 GBF1 "G-box binding factor 1" 0.424 0.409 0.333 1.9e-16
TAIR|locus:2124938 GBF2 "G-box binding factor 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 379 (138.5 bits), Expect = 6.8e-49, Sum P(2) = 6.8e-49
 Identities = 88/233 (37%), Positives = 113/233 (48%)

Query:     1 MGNNEDGXXXXXXXXXXXXXX-DQGNIHMYT-DWAAMQAYYGPRVAIPPYYNSPIASGHA 58
             MG+NE+G               +Q N+H+Y  DWAAMQAYYGPRV IP YYNS +A GHA
Sbjct:     1 MGSNEEGNPTNNSDKPSQAAAPEQSNVHVYHHDWAAMQAYYGPRVGIPQYYNSNLAPGHA 60

Query:    59 PQPYMWXXXXXXXXXXXXXXXXIYSTGGVYAHPAVPLGSHAHNHGVPTSPAAVTPLNTEA 118
             P PYMW                    GGVYAHP V +GS        ++    TPL  +A
Sbjct:    61 PPPYMWASPSPMMAPYGAPYPPFCPPGGVYAHPGVQMGSQPQGPVSQSASGVTTPLTIDA 120

Query:   119 PTKSSGNXXXXXXXXXXXXXXXXMXXXXXXXXXXXXXXXQ-RPSQS-EADGSTDGSDGNT 176
             P  S+GN                M               + R SQS E DGS++GSDGNT
Sbjct:   121 PANSAGNSDHGFMKKLKEFDGLAMSISNNKVGSAEHSSSEHRSSQSSENDGSSNGSDGNT 180

Query:   177 VRAGQSRKKRSREGTPIAGGDGKTDIQSTPVPVGVNATPDKVLATAVAPTSVS 229
                 QSR+KR ++ +P + G+  +   S P+  G N  PD  + T V PT++S
Sbjct:   181 TGGEQSRRKRRQQRSP-STGERPSSQNSLPLR-GENEKPDVTMGTPVMPTAMS 231


GO:0003677 "DNA binding" evidence=IEA;ISS;IDA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IEA;IDA
GO:0006351 "transcription, DNA-dependent" evidence=IEA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA;IDA
GO:0046983 "protein dimerization activity" evidence=IEA
GO:0009410 "response to xenobiotic stimulus" evidence=RCA
GO:0030968 "endoplasmic reticulum unfolded protein response" evidence=RCA
GO:0005737 "cytoplasm" evidence=IDA
GO:0009637 "response to blue light" evidence=IDA
TAIR|locus:2063020 GBF3 "G-box binding factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062455 bZIP16 "basic region/leucine zipper transcription factor 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031705 bZIP68 "basic region/leucine zipper transcription factor 68" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2115250 GBF1 "G-box binding factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
P42776GBF3_ARATHNo assigned EC number0.54050.88480.7041yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query304
pfam07777189 pfam07777, MFMR, G-box binding protein MFMR 2e-52
pfam0017064 pfam00170, bZIP_1, bZIP transcription factor 3e-11
smart0033865 smart00338, BRLZ, basic region leucin zipper 3e-11
cd1219354 cd12193, B_zip1, basic leucine zipper DNA-binding 1e-04
pfam0771654 pfam07716, bZIP_2, Basic region leucine zipper 0.001
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR Back     alignment and domain information
 Score =  170 bits (431), Expect = 2e-52
 Identities = 95/187 (50%), Positives = 114/187 (60%), Gaps = 13/187 (6%)

Query: 1   MGNNEDGKSFKSEKPSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQ 60
           MG++E+GK  KS  P +    D     +Y DW+AMQAYYGPR   PPY+NS +AS   P 
Sbjct: 1   MGSSEEGKPSKS-SPKTSVQEDTPTPTVYPDWSAMQAYYGPRPP-PPYFNSSVASSPQPH 58

Query: 61  PYMWGPAQPMMPPYGAP--YAAIYSTGGVYAHPAVPLGSHAHNHGVPTSPAA--VTPLNT 116
           PYMWGP QPMMPPYG P  YAA+Y  GGVYAHP++P GSH  +     S      TPL+ 
Sbjct: 59  PYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPGSTPLSM 118

Query: 117 EAPTKSSGNADRGLAKKLKGLD---GLAMSIGNASAESAEGG-AEQRPSQSE---ADGST 169
           E   KSS N D+G  KK KG D   GLAMS  N  +  A G  A    SQS    +DGS+
Sbjct: 119 ETDAKSSDNKDKGSIKKSKGSDGSLGLAMSGKNGESGKASGSSANGGSSQSSESGSDGSS 178

Query: 170 DGSDGNT 176
           +GSDGN+
Sbjct: 179 EGSDGNS 185


This region is found to the N-terminus of the pfam00170 transcription factor domain. It is between 150 and 200 amino acids in length. The N-terminal half is rather rich in proline residues and has been termed the PRD (proline rich domain), whereas the C-terminal half is more polar and has been called the MFMR (multifunctional mosaic region). It has been suggested that this family is composed of three sub-families called A, B and C, classified according to motif composition. It has been suggested that some of these motifs may be involved in mediating protein-protein interactions. The MFMR region contains a nuclear localisation signal in bZIP opaque and GBF-2. The MFMR also contains a transregulatory activity in TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention signals. Length = 189

>gnl|CDD|201054 pfam00170, bZIP_1, bZIP transcription factor Back     alignment and domain information
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper Back     alignment and domain information
>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and multimerization region of GCN4 and related proteins Back     alignment and domain information
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 304
PF07777189 MFMR: G-box binding protein MFMR; InterPro: IPR012 100.0
PF0017064 bZIP_1: bZIP transcription factor cAMP response el 96.96
smart0033865 BRLZ basic region leucin zipper. 96.89
PF0771654 bZIP_2: Basic region leucine zipper; InterPro: IPR 96.21
KOG0709 472 consensus CREB/ATF family transcription factor [Tr 93.11
KOG4343 655 consensus bZIP transcription factor ATF6 [Transcri 92.8
PF0313192 bZIP_Maf: bZIP Maf transcription factor; InterPro: 91.14
KOG2236483 consensus Uncharacterized conserved protein [Funct 81.02
>PF07777 MFMR: G-box binding protein MFMR; InterPro: IPR012900 This region is found to the N terminus of IPR011616 from INTERPRO, which is a transcription factor domain Back     alignment and domain information
Probab=100.00  E-value=2.9e-74  Score=512.30  Aligned_cols=179  Identities=60%  Similarity=1.043  Sum_probs=168.5

Q ss_pred             CCCCCCCCCcCCCCCCCCCCCCCCCcCcCCchhhhhhhcCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCCCCCCC--c
Q 021987            1 MGNNEDGKSFKSEKPSSPPPSDQGNIHMYTDWAAMQAYYGPRVAIPPYYNSPIASGHAPQPYMWGPAQPMMPPYGAP--Y   78 (304)
Q Consensus         1 MG~~E~~~~~K~~K~ss~~~~eq~~~~vyPDWsamQAYYgp~v~~Pp~f~s~VAs~p~pHPYMWGp~QpmmPPYGtP--Y   78 (304)
                      ||++|++|++|++|++++++++|+++|+||||++||||||+| ++|+||++.||++|++||||||+|||||||||||  |
T Consensus         1 MG~~E~~~~~k~~k~~s~~~~~~~~~~~ypDWs~mQAYyg~~-~~p~~f~s~va~sp~phPYMWG~~q~mmPPYGtP~pY   79 (189)
T PF07777_consen    1 MGSSEEGKPSKSSKPSSPPPEDQPTPHVYPDWSAMQAYYGPG-APPPYFNSAVASSPQPHPYMWGPQQPMMPPYGTPVPY   79 (189)
T ss_pred             CCCccCCcCCCCCCCCCCCcCCCCCCccCCccHhhhhccCCC-CCCcccCcccCCCCCCCCcccCCCccccCCCCCCCCC
Confidence            999999999999999887655899999999999999999999 8899999999999999999999999999999997  9


Q ss_pred             cccccCCccccCCCCCCCCCCCCCCCCCCCC--CCCCCCccC-CCCCCCCcccchhhhhcccC-CcccccCCCCCCCCCC
Q 021987           79 AAIYSTGGVYAHPAVPLGSHAHNHGVPTSPA--AVTPLNTEA-PTKSSGNADRGLAKKLKGLD-GLAMSIGNASAESAEG  154 (304)
Q Consensus        79 ~AmYp~GGvYAHP~mp~gs~p~~~~~~~sp~--~~tp~s~e~-~~Kss~~kdk~~~Kk~Kg~~-Glams~G~g~~~~~~g  154 (304)
                      +||||||||||||+||+++|||+++.+++..  ++|++++|+ ++|++++|||+++||||||| ||+|++||++.+|+++
T Consensus        80 ~A~YphGgvYAHP~mp~~s~p~~~~~~~s~~~~~~tp~s~E~~p~Kss~~kd~~~~KksKg~~g~~a~s~~n~~~gk~~~  159 (189)
T PF07777_consen   80 PAMYPHGGVYAHPSMPPGSHPFSPYAMPSPETPAATPLSTETDPGKSSGNKDKGSMKKSKGFDGGLAMSIKNGESGKTSG  159 (189)
T ss_pred             ccccCCCccccCCCCCcccccCCCcccccccccCCCCcccccccccCcCccccccccccccccccceeeccCCccCcccc
Confidence            9999999999999999999999999999854  479999999 69999999999999999999 6999999999999998


Q ss_pred             C-CCCCCCCc---CCCCCCCCCCCCccccc
Q 021987          155 G-AEQRPSQS---EADGSTDGSDGNTVRAG  180 (304)
Q Consensus       155 ~-s~~~~SqS---esegSSdGSD~Ns~~~s  180 (304)
                      + +|++.|||   .+||||||||+|+++++
T Consensus       160 ~s~n~~~Sqs~eSgsegSSdgSD~Nt~~~~  189 (189)
T PF07777_consen  160 SSANDGSSQSSESGSEGSSDGSDGNTNNDS  189 (189)
T ss_pred             CCCCCccCccccccccccccCcCccccCCC
Confidence            4 58899997   47999999999999874



It is between 150 and 200 amino acids in length. The N-terminal half is rather rich in proline residues and has been termed the PRD (proline rich domain) [], whereas the C-terminal half is more polar and has been called the MFMR (multifunctional mosaic region). It has been suggested that this family is composed of three sub-families called A, B and C [], classified according to motif composition. It has been suggested that some of these motifs may be involved in mediating protein-protein interactions []. The MFMR region contains a nuclear localisation signal in bZIP opaque and GBF-2 []. The MFMR also contains a transregulatory activity in TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention signals []. ; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus

>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization Back     alignment and domain information
>smart00338 BRLZ basic region leucin zipper Back     alignment and domain information
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization Back     alignment and domain information
>KOG0709 consensus CREB/ATF family transcription factor [Transcription] Back     alignment and domain information
>KOG4343 consensus bZIP transcription factor ATF6 [Transcription] Back     alignment and domain information
>PF03131 bZIP_Maf: bZIP Maf transcription factor; InterPro: IPR004826 There are several different types of Maf transcription factors with different roles in the cell Back     alignment and domain information
>KOG2236 consensus Uncharacterized conserved protein [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query304
1dh3_A55 Transcription factor CREB; protein-DNA complex, tr 2e-08
1ci6_A63 Transcription factor ATF-4; BZIP; 2.60A {Homo sapi 4e-07
1t2k_D61 Cyclic-AMP-dependent transcription factor ATF-2; p 5e-06
1jnm_A62 Proto-oncogene C-JUN; BZIP, protein-DNA complex, t 8e-06
2wt7_A63 Proto-oncogene protein C-FOS; transcription, trans 9e-06
3a5t_A107 Transcription factor MAFG; protein-DNA complex, BZ 4e-05
2wt7_B90 Transcription factor MAFB; transcription, transcri 2e-04
1m2v_B 926 SEC24, protein transport protein SEC24, SEC24P, SE 4e-04
>1dh3_A Transcription factor CREB; protein-DNA complex, transcription/DNA complex; HET: DNA; 3.00A {Mus musculus} SCOP: h.1.3.1 Length = 55 Back     alignment and structure
 Score = 48.9 bits (117), Expect = 2e-08
 Identities = 12/21 (57%), Positives = 15/21 (71%)

Query: 282 KRERRKQSNRESARRSRLRKQ 302
           KRE R   NRE+AR SR +K+
Sbjct: 1   KREVRLMKNREAARESRRKKK 21


>1ci6_A Transcription factor ATF-4; BZIP; 2.60A {Homo sapiens} SCOP: h.1.3.1 Length = 63 Back     alignment and structure
>1t2k_D Cyclic-AMP-dependent transcription factor ATF-2; protein DNA complex, transcription/DNA complex; 3.00A {Homo sapiens} SCOP: h.1.3.1 Length = 61 Back     alignment and structure
>1jnm_A Proto-oncogene C-JUN; BZIP, protein-DNA complex, transcription/DNA complex; 2.20A {Homo sapiens} SCOP: h.1.3.1 PDB: 1fos_F 2h7h_A 1t2k_C 1a02_J* 1s9k_E 1jun_A Length = 62 Back     alignment and structure
>2wt7_A Proto-oncogene protein C-FOS; transcription, transcription regulation, nucleus, activator, repressor, DNA-binding, phosphoprotein, differentiation; 2.30A {Mus musculus} PDB: 1fos_E* 1a02_F* 1s9k_D Length = 63 Back     alignment and structure
>3a5t_A Transcription factor MAFG; protein-DNA complex, BZIP factor, acetylation, DNA-binding, isopeptide bond, nucleus; 2.80A {Mus musculus} Length = 107 Back     alignment and structure
>2wt7_B Transcription factor MAFB; transcription, transcription regulation, nucleus, activator, repressor, DNA-binding, phosphoprotein, differentiation; 2.30A {Mus musculus} PDB: 2wty_A* 1k1v_A Length = 90 Back     alignment and structure
>1m2v_B SEC24, protein transport protein SEC24, SEC24P, SEC24 protein, abnormal nuclear; zinc-finger, beta barrel, VWA domain, gelsolin domain,; 2.75A {Saccharomyces cerevisiae} SCOP: a.71.2.1 b.2.8.1 c.62.1.2 d.109.2.1 g.41.10.1 Length = 926 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query304
1dh3_A55 Transcription factor CREB; protein-DNA complex, tr 98.14
2dgc_A63 Protein (GCN4); basic domain, leucine zipper, DNA 97.16
2wt7_A63 Proto-oncogene protein C-FOS; transcription, trans 97.08
3a5t_A107 Transcription factor MAFG; protein-DNA complex, BZ 96.64
1jnm_A62 Proto-oncogene C-JUN; BZIP, protein-DNA complex, t 96.03
1t2k_D61 Cyclic-AMP-dependent transcription factor ATF-2; p 95.84
1ci6_A63 Transcription factor ATF-4; BZIP; 2.60A {Homo sapi 93.71
1gd2_E70 Transcription factor PAP1; basic leucine zipper, p 82.98
>1dh3_A Transcription factor CREB; protein-DNA complex, transcription/DNA complex; HET: DNA; 3.00A {Mus musculus} SCOP: h.1.3.1 Back     alignment and structure
Probab=98.14  E-value=6.9e-07  Score=65.10  Aligned_cols=22  Identities=55%  Similarity=0.638  Sum_probs=20.7

Q ss_pred             HHHHHHhhhhHHHHHhHHhhhc
Q 021987          282 KRERRKQSNRESARRSRLRKQV  303 (304)
Q Consensus       282 ~r~~r~~snresarrsr~rkq~  303 (304)
                      ||+||+++|||||+|||.||++
T Consensus         1 kr~rR~~~NResA~rSR~RKk~   22 (55)
T 1dh3_A            1 KREVRLMKNREAARESRRKKKE   22 (55)
T ss_dssp             CHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ChHHHHHhhHHHHHHHHHHHHH
Confidence            5899999999999999999986



>2dgc_A Protein (GCN4); basic domain, leucine zipper, DNA binding, eukaryotic regulatory protein, transcription/DNA complex; HET: DNA; 2.20A {Saccharomyces cerevisiae} SCOP: h.1.3.1 PDB: 1dgc_A* 1ld4_E 1ysa_C* 3p8m_D Back     alignment and structure
>2wt7_A Proto-oncogene protein C-FOS; transcription, transcription regulation, nucleus, activator, repressor, DNA-binding, phosphoprotein, differentiation; 2.30A {Mus musculus} PDB: 1fos_E* 1a02_F* 1s9k_D Back     alignment and structure
>3a5t_A Transcription factor MAFG; protein-DNA complex, BZIP factor, acetylation, DNA-binding, isopeptide bond, nucleus; 2.80A {Mus musculus} Back     alignment and structure
>1jnm_A Proto-oncogene C-JUN; BZIP, protein-DNA complex, transcription/DNA complex; 2.20A {Homo sapiens} SCOP: h.1.3.1 PDB: 1fos_F 2h7h_A 1t2k_C 1a02_J* 1s9k_E 1jun_A Back     alignment and structure
>1t2k_D Cyclic-AMP-dependent transcription factor ATF-2; protein DNA complex, transcription/DNA complex; 3.00A {Homo sapiens} SCOP: h.1.3.1 Back     alignment and structure
>1ci6_A Transcription factor ATF-4; BZIP; 2.60A {Homo sapiens} SCOP: h.1.3.1 Back     alignment and structure
>1gd2_E Transcription factor PAP1; basic leucine zipper, protein-DNA complex, transcription/DNA complex; HET: DNA; 2.00A {Schizosaccharomyces pombe} SCOP: h.1.3.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00