Citrus Sinensis ID: 040019


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160------
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPLSSALEGESIDGGSSQPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTRQPDAPAFVKPESSTPS
cccccccccccccccHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHcccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHccccccccccccccccHHHHccccccccccccccccccc
cEEEEcccccHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHccccccccccccccccccccccccccccccHHHHHHHHHHHcHHHHHHHHHHccccEcHHHHHHHHHccccccccccccccccccc
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELvpscnktdrAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVAdvplssalegesidggssqpewekwsndgTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYrtrqpdapafvkpesstps
mavclcchqqmFSLFFVVQLRRERIADRMRALqelvpscnktdraAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPLSSALEGesidggssqpewekwsNDGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTRQPDApafvkpesstps
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRlgaagavaqlvaDVPLSSALEGESIDGGSSQPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTRQPDAPAFVKPESSTPS
**VCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPL**********************************EEDIGAAMQFLQSKALCIMPISLASAIYRT*****************
************************IADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLS**************************************************************FLQSKALCIMPISLASAIYRTRQPDAPA***P******
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPLSSA***********************EQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTRQPDAP***********
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADV*************************DGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYR******************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAVCLCCHQQMFSLFFVVQLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPLSSALEGESIDGGSSQPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTRQPDAPAFVKPESSTPS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query166 2.2.26 [Sep-21-2011]
O22768310 Transcription factor UNE1 yes no 0.807 0.432 0.748 7e-54
Q93Y00302 Transcription factor bHLH no no 0.777 0.427 0.720 4e-49
Q9ZUG9350 Transcription factor bHLH no no 0.825 0.391 0.578 2e-39
Q8S3D5310 Transcription factor bHLH no no 0.777 0.416 0.577 2e-37
Q9LSQ3297 Transcription factor bHLH no no 0.843 0.471 0.522 6e-36
O80536524 Transcription factor PIF3 no no 0.283 0.089 0.617 8e-13
Q8GZM7478 Transcription factor PIF1 no no 0.283 0.098 0.595 3e-12
Q6NKN9366 Transcription factor bHLH no no 0.295 0.133 0.62 5e-11
Q9FUA4373 Transcription factor SPAT no no 0.283 0.126 0.574 2e-10
Q84WK0352 Transcription factor bHLH no no 0.403 0.190 0.507 2e-10
>sp|O22768|UNE12_ARATH Transcription factor UNE12 OS=Arabidopsis thaliana GN=UNE12 PE=2 SV=2 Back     alignment and function desciption
 Score =  209 bits (532), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 101/135 (74%), Positives = 118/135 (87%), Gaps = 1/135 (0%)

Query: 32  LQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQLVADVPLSSALEG 91
           LQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLG AGAVA LV D+PLSS++E 
Sbjct: 175 LQELVPTVNKTDRAAMIDEIVDYVKFLRLQVKVLSMSRLGGAGAVAPLVTDMPLSSSVED 234

Query: 92  ESIDGG-SSQPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCIMPISLASAIYRTR 150
           E+ +GG + QP WEKWSNDGTE+QVAKLMEE++GAAMQ LQSKALC+MPISLA AIY ++
Sbjct: 235 ETGEGGRTPQPAWEKWSNDGTERQVAKLMEENVGAAMQLLQSKALCMMPISLAMAIYHSQ 294

Query: 151 QPDAPAFVKPESSTP 165
            PD  + VKPE++ P
Sbjct: 295 PPDTSSVVKPENNPP 309




Required for ovule fertilization.
Arabidopsis thaliana (taxid: 3702)
>sp|Q93Y00|BH007_ARATH Transcription factor bHLH7 OS=Arabidopsis thaliana GN=BHLH7 PE=2 SV=1 Back     alignment and function description
>sp|Q9ZUG9|BH066_ARATH Transcription factor bHLH66 OS=Arabidopsis thaliana GN=BHLH66 PE=2 SV=1 Back     alignment and function description
>sp|Q8S3D5|BH069_ARATH Transcription factor bHLH69 OS=Arabidopsis thaliana GN=BHLH69 PE=2 SV=2 Back     alignment and function description
>sp|Q9LSQ3|BH082_ARATH Transcription factor bHLH82 OS=Arabidopsis thaliana GN=BHLH82 PE=2 SV=1 Back     alignment and function description
>sp|O80536|PIF3_ARATH Transcription factor PIF3 OS=Arabidopsis thaliana GN=PIF3 PE=1 SV=1 Back     alignment and function description
>sp|Q8GZM7|PIF1_ARATH Transcription factor PIF1 OS=Arabidopsis thaliana GN=PIF1 PE=1 SV=1 Back     alignment and function description
>sp|Q6NKN9|BH074_ARATH Transcription factor bHLH74 OS=Arabidopsis thaliana GN=BHLH74 PE=2 SV=1 Back     alignment and function description
>sp|Q9FUA4|SPT_ARATH Transcription factor SPATULA OS=Arabidopsis thaliana GN=SPT PE=1 SV=1 Back     alignment and function description
>sp|Q84WK0|BH085_ARATH Transcription factor bHLH85 OS=Arabidopsis thaliana GN=BHLH85 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query166
255580949 326 conserved hypothetical protein [Ricinus 0.885 0.450 0.852 7e-66
449530006219 PREDICTED: transcription factor UNE12-li 0.879 0.666 0.836 3e-64
449435746 318 PREDICTED: transcription factor UNE12-li 0.879 0.459 0.836 7e-64
359487888 331 PREDICTED: transcription factor UNE12-li 0.885 0.444 0.824 1e-63
224131266182 predicted protein [Populus trichocarpa] 0.885 0.807 0.804 2e-63
298204851261 unnamed protein product [Vitis vinifera] 0.885 0.563 0.824 2e-63
225432880 313 PREDICTED: transcription factor UNE12 [V 0.885 0.469 0.790 5e-62
223702396 342 putative basic helix-loop-helix protein 0.873 0.423 0.821 6e-62
147771925 337 hypothetical protein VITISV_032727 [Viti 0.885 0.436 0.790 1e-61
224123420178 predicted protein [Populus trichocarpa] 0.885 0.825 0.777 1e-60
>gi|255580949|ref|XP_002531293.1| conserved hypothetical protein [Ricinus communis] gi|223529126|gb|EEF31106.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  254 bits (650), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 127/149 (85%), Positives = 139/149 (93%), Gaps = 2/149 (1%)

Query: 19  QLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQ 78
           +LRRERIA+RM+ALQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGAAGAVAQ
Sbjct: 179 RLRRERIAERMKALQELVPTANKTDRAAMIDEIVDYVKFLRLQVKVLSMSRLGAAGAVAQ 238

Query: 79  LVADVPLSSALEGESIDGGS-SQPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCI 137
           LVADVPL+S +EGESIDG + +Q  WEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCI
Sbjct: 239 LVADVPLAS-VEGESIDGAAANQQTWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCI 297

Query: 138 MPISLASAIYRTRQPDAPAFVKPESSTPS 166
           MPISLASAI RT  PDAP+ +KPES+TPS
Sbjct: 298 MPISLASAILRTHPPDAPSIIKPESNTPS 326




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449530006|ref|XP_004171988.1| PREDICTED: transcription factor UNE12-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|449435746|ref|XP_004135655.1| PREDICTED: transcription factor UNE12-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|359487888|ref|XP_002274833.2| PREDICTED: transcription factor UNE12-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|224131266|ref|XP_002328496.1| predicted protein [Populus trichocarpa] gi|222838211|gb|EEE76576.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|298204851|emb|CBI34158.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225432880|ref|XP_002284047.1| PREDICTED: transcription factor UNE12 [Vitis vinifera] gi|297737150|emb|CBI26351.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|223702396|gb|ACN21629.1| putative basic helix-loop-helix protein BHLH2 [Lotus japonicus] gi|388506732|gb|AFK41432.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|147771925|emb|CAN66761.1| hypothetical protein VITISV_032727 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224123420|ref|XP_002319074.1| predicted protein [Populus trichocarpa] gi|222857450|gb|EEE94997.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query166
TAIR|locus:2132303310 UNE12 "AT4G02590" [Arabidopsis 0.885 0.474 0.689 4.1e-49
UNIPROTKB|Q7Y1H4294 OSJNBa0094F01.11 "Putative unc 0.879 0.496 0.702 2e-47
TAIR|locus:2007534302 AT1G03040 "AT1G03040" [Arabido 0.849 0.466 0.662 1.2e-44
TAIR|locus:2047555350 LRL1 "AT2G24260" [Arabidopsis 0.771 0.365 0.585 2.6e-31
UNIPROTKB|Q6K8Y4463 OJ1695_H09.18 "Basic helix-loo 0.765 0.274 0.566 1.4e-30
TAIR|locus:2126624310 LRL2 "AT4G30980" [Arabidopsis 0.813 0.435 0.530 2.3e-30
TAIR|locus:2147760297 LRL3 "AT5G58010" [Arabidopsis 0.873 0.488 0.486 1.9e-28
UNIPROTKB|Q6EPZ6499 P0014G10.34 "BHLH transcriptio 0.759 0.252 0.553 1.1e-27
UNIPROTKB|Q69Y51401 P0698A06.26-2 "Putative bHLH t 0.849 0.351 0.472 3.2e-26
UNIPROTKB|Q6Z7E7524 P0020C11.18 "Putative bHLH tra 0.849 0.269 0.471 9e-26
TAIR|locus:2132303 UNE12 "AT4G02590" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 102/148 (68%), Positives = 122/148 (82%)

Query:    19 QLRRERIADRMRALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRXXXXXXXXX 78
             +LRRERIA+R+RALQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSR         
Sbjct:   162 RLRRERIAERIRALQELVPTVNKTDRAAMIDEIVDYVKFLRLQVKVLSMSRLGGAGAVAP 221

Query:    79 XXXDVPLSSALEGESIDGGSS-QPEWEKWSNDGTEQQVAKLMEEDIGAAMQFLQSKALCI 137
                D+PLSS++E E+ +GG + QP WEKWSNDGTE+QVAKLMEE++GAAMQ LQSKALC+
Sbjct:   222 LVTDMPLSSSVEDETGEGGRTPQPAWEKWSNDGTERQVAKLMEENVGAAMQLLQSKALCM 281

Query:   138 MPISLASAIYRTRQPDAPAFVKPESSTP 165
             MPISLA AIY ++ PD  + VKPE++ P
Sbjct:   282 MPISLAMAIYHSQPPDTSSVVKPENNPP 309




GO:0003677 "DNA binding" evidence=ISS
GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0009567 "double fertilization forming a zygote and endosperm" evidence=IMP
GO:0031347 "regulation of defense response" evidence=IMP
UNIPROTKB|Q7Y1H4 OSJNBa0094F01.11 "Putative uncharacterized protein OSJNBa0094F01.11" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2007534 AT1G03040 "AT1G03040" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2047555 LRL1 "AT2G24260" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q6K8Y4 OJ1695_H09.18 "Basic helix-loop-helix (BHLH)-like" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2126624 LRL2 "AT4G30980" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2147760 LRL3 "AT5G58010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q6EPZ6 P0014G10.34 "BHLH transcription factor PTF1-like protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q69Y51 P0698A06.26-2 "Putative bHLH transcription factor PTF1" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q6Z7E7 P0020C11.18 "Putative bHLH transcription factor PTF1" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
O22768UNE12_ARATHNo assigned EC number0.74810.80720.4322yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query166
smart0035353 smart00353, HLH, helix loop helix domain 4e-08
cd0008360 cd00083, HLH, Helix-loop-helix domain, found in sp 4e-07
pfam0001052 pfam00010, HLH, Helix-loop-helix DNA-binding domai 6e-06
>gnl|CDD|197674 smart00353, HLH, helix loop helix domain Back     alignment and domain information
 Score = 47.2 bits (113), Expect = 4e-08
 Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 3/46 (6%)

Query: 21 RRERIADRMRALQELVPSCN---KTDRAAMLDEIVDYVKFLRLQVK 63
          RR +I +    L+ L+P+     K  +A +L   ++Y+K L+ +++
Sbjct: 7  RRRKINEAFDELRSLLPTLPKNKKLSKAEILRLAIEYIKSLQEELQ 52


Length = 53

>gnl|CDD|238036 cd00083, HLH, Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>gnl|CDD|215654 pfam00010, HLH, Helix-loop-helix DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 166
cd0008360 HLH Helix-loop-helix domain, found in specific DNA 99.52
smart0035353 HLH helix loop helix domain. 99.45
PF0001055 HLH: Helix-loop-helix DNA-binding domain only nucl 99.44
KOG1318411 consensus Helix loop helix transcription factor EB 99.16
KOG1319229 consensus bHLHZip transcription factor BIGMAX [Tra 98.75
KOG4304250 consensus Transcriptional repressors of the hairy/ 98.45
KOG3561 803 consensus Aryl-hydrocarbon receptor nuclear transl 98.24
KOG2588 953 consensus Predicted DNA-binding protein [Transcrip 98.06
PLN0321793 transcription factor ATBS1; Provisional 97.86
KOG2483232 consensus Upstream transcription factor 2/L-myc-2 97.68
KOG0561 373 consensus bHLH transcription factor [Transcription 97.3
KOG4029228 consensus Transcription factor HAND2/Transcription 97.0
KOG3960284 consensus Myogenic helix-loop-helix transcription 96.97
KOG4447173 consensus Transcription factor TWIST [Transcriptio 94.34
KOG3910632 consensus Helix loop helix transcription factor [T 91.29
KOG3558 768 consensus Hypoxia-inducible factor 1/Neuronal PAS 82.02
>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
Probab=99.52  E-value=1.9e-14  Score=95.13  Aligned_cols=54  Identities=31%  Similarity=0.450  Sum_probs=50.8

Q ss_pred             cccccchHHHHHHHHHHHHHHHHHhhcCCCC---CCCChhhHHHHHHHHHHHHHHHH
Q 040019            9 QQMFSLFFVVQLRRERIADRMRALQELVPSC---NKTDRAAMLDEIVDYVKFLRLQV   62 (166)
Q Consensus         9 ~a~~~hs~aER~RRerIneri~aLq~LVP~~---~K~DkasVL~eAI~YIk~Lq~qV   62 (166)
                      ..+..|+..||.||++||+.|..|+.+||+.   .|+||++||+.||+||++|+.++
T Consensus         3 ~~r~~~~~~Er~RR~~~n~~~~~L~~llp~~~~~~k~~k~~iL~~a~~yI~~L~~~~   59 (60)
T cd00083           3 SRREAHNLRERRRRERINDAFDELRSLLPTLPPSKKLSKAEILRKAVDYIKSLQELL   59 (60)
T ss_pred             HHHHHHhHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHh
Confidence            4567899999999999999999999999998   78999999999999999999876



A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and

>smart00353 HLH helix loop helix domain Back     alignment and domain information
>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt) Back     alignment and domain information
>KOG1318 consensus Helix loop helix transcription factor EB [Transcription] Back     alignment and domain information
>KOG1319 consensus bHLHZip transcription factor BIGMAX [Transcription] Back     alignment and domain information
>KOG4304 consensus Transcriptional repressors of the hairy/E(spl) family (contains HLH) [Transcription] Back     alignment and domain information
>KOG3561 consensus Aryl-hydrocarbon receptor nuclear translocator [Transcription] Back     alignment and domain information
>KOG2588 consensus Predicted DNA-binding protein [Transcription] Back     alignment and domain information
>PLN03217 transcription factor ATBS1; Provisional Back     alignment and domain information
>KOG2483 consensus Upstream transcription factor 2/L-myc-2 protein [Transcription] Back     alignment and domain information
>KOG0561 consensus bHLH transcription factor [Transcription] Back     alignment and domain information
>KOG4029 consensus Transcription factor HAND2/Transcription factor TAL1/TAL2/LYL1 [Transcription] Back     alignment and domain information
>KOG3960 consensus Myogenic helix-loop-helix transcription factor [Transcription] Back     alignment and domain information
>KOG4447 consensus Transcription factor TWIST [Transcription] Back     alignment and domain information
>KOG3910 consensus Helix loop helix transcription factor [Transcription] Back     alignment and domain information
>KOG3558 consensus Hypoxia-inducible factor 1/Neuronal PAS domain protein NPAS1 [Signal transduction mechanisms; Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query166
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 1e-10
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 3e-07
1hlo_A80 Protein (transcription factor MAX); transcriptiona 1e-05
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 6e-05
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 5e-04
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Length = 82 Back     alignment and structure
 Score = 53.9 bits (130), Expect = 1e-10
 Identities = 14/48 (29%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 21 RRERIADRMRALQELVPSCN-KTDRAAMLDEIVDYVKFLRLQVKVLSM 67
           R  I D++  L++LV     K +++A+L + +DY++FL+   + L  
Sbjct: 17 YRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQ 64


>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Length = 65 Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Length = 80 Back     alignment and structure
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Length = 83 Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Length = 88 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query166
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 99.69
4h10_B71 Circadian locomoter output cycles protein kaput; B 99.64
4ati_A118 MITF, microphthalmia-associated transcription fact 99.64
1a0a_A63 BHLH, protein (phosphate system positive regulator 99.62
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 99.59
4h10_A73 ARYL hydrocarbon receptor nuclear translocator-LI 99.57
1hlo_A80 Protein (transcription factor MAX); transcriptiona 99.51
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 99.51
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 99.47
3u5v_A76 Protein MAX, transcription factor E2-alpha chimer; 99.42
1nlw_A80 MAD protein, MAX dimerizer; transcription factor, 99.3
4f3l_A 361 Mclock, circadian locomoter output cycles protein 99.1
4f3l_B 387 BMAL1B; BHLH, PAS, circadian rhythm proteins, tran 99.03
1mdy_A68 Protein (MYOD BHLH domain); protein-DNA complex, t 99.02
2ql2_B60 Neurod1, neurogenic differentiation factor 1; basi 98.98
4ath_A83 MITF, microphthalmia-associated transcription fact 98.75
2lfh_A68 DNA-binding protein inhibitor ID-3; structural gen 98.45
4aya_A97 DNA-binding protein inhibitor ID-2; cell cycle; 2. 97.9
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Back     alignment and structure
Probab=99.69  E-value=2e-17  Score=118.20  Aligned_cols=61  Identities=23%  Similarity=0.386  Sum_probs=57.0

Q ss_pred             cccccchHHHHHHHHHHHHHHHHHhhcCCCC-CCCChhhHHHHHHHHHHHHHHHHHHhchhh
Q 040019            9 QQMFSLFFVVQLRRERIADRMRALQELVPSC-NKTDRAAMLDEIVDYVKFLRLQVKVLSMSR   69 (166)
Q Consensus         9 ~a~~~hs~aER~RRerIneri~aLq~LVP~~-~K~DkasVL~eAI~YIk~Lq~qVk~Ls~~~   69 (166)
                      +.+.+|+.+||+||++||++|.+|+.|||++ .|+||++||.+||+||++||.+++.|....
T Consensus         5 ~rr~~H~~~ErrRR~~in~~f~~L~~lvP~~~~k~~Ka~IL~~Ai~YI~~Lq~~~~~L~~e~   66 (82)
T 1am9_A            5 EKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQEN   66 (82)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHhhhhHHHHHHHHHHHHHHHHHHhccCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4577899999999999999999999999998 799999999999999999999999998654



>4h10_B Circadian locomoter output cycles protein kaput; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>4ati_A MITF, microphthalmia-associated transcription factor; DNA-binding protein-DNA complex, melanoma; 2.60A {Mus musculus} PDB: 4atk_A Back     alignment and structure
>1a0a_A BHLH, protein (phosphate system positive regulatory protein PHO4); transcription factor, basic helix loop helix; HET: DNA; 2.80A {Saccharomyces cerevisiae} SCOP: a.38.1.1 Back     alignment and structure
>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>4h10_A ARYL hydrocarbon receptor nuclear translocator-LI 1; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>3u5v_A Protein MAX, transcription factor E2-alpha chimer; basic helix-loop-helix (BHLH); 1.70A {Mus musculus} PDB: 2ql2_A* Back     alignment and structure
>1nlw_A MAD protein, MAX dimerizer; transcription factor, DNA, BHLHZ, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>4f3l_A Mclock, circadian locomoter output cycles protein kaput; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>4f3l_B BMAL1B; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>1mdy_A Protein (MYOD BHLH domain); protein-DNA complex, transcription/DNA complex; HET: DNA; 2.80A {Mus musculus} SCOP: a.38.1.1 PDB: 1mdy_B* Back     alignment and structure
>2ql2_B Neurod1, neurogenic differentiation factor 1; basic-helix-loop-helix; HET: DNA; 2.50A {Mus musculus} Back     alignment and structure
>4ath_A MITF, microphthalmia-associated transcription factor; DNA binding protein, melanoma; HET: MSE; 1.95A {Mus musculus} Back     alignment and structure
>2lfh_A DNA-binding protein inhibitor ID-3; structural genomics, northeast structural genomics consortiu PSI-biology, protein structure initiative; NMR {Homo sapiens} Back     alignment and structure
>4aya_A DNA-binding protein inhibitor ID-2; cell cycle; 2.10A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 166
d1uklc_61 a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId 2e-13
d1a0aa_63 a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Sa 5e-08
d1nkpa_88 a.38.1.1 (A:) Myc proto-oncogene protein {Human (H 5e-08
d1nkpb_83 a.38.1.1 (B:) Max protein {Human (Homo sapiens) [T 8e-08
d1am9a_80 a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxI 2e-07
d1mdya_68 a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus muscul 7e-07
d1an4a_65 a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapien 8e-07
d1nlwa_79 a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [T 3e-06
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-2
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 59.4 bits (144), Expect = 2e-13
 Identities = 13/47 (27%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 22 RERIADRMRALQELVPSCN-KTDRAAMLDEIVDYVKFLRLQVKVLSM 67
          R  I D++  L++LV   + K  ++ +L + +DY+K+L+     L  
Sbjct: 1  RSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQVNHKLRQ 47


>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 63 Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Length = 88 Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Length = 83 Back     information, alignment and structure
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Length = 80 Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 68 Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Length = 65 Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Length = 79 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query166
d1a0aa_63 Pho4 B/HLH domain {Baker's yeast (Saccharomyces ce 99.58
d1am9a_80 SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} 99.58
d1mdya_68 Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10 99.55
d1nlwa_79 Mad protein {Human (Homo sapiens) [TaxId: 9606]} 99.51
d1nkpb_83 Max protein {Human (Homo sapiens) [TaxId: 9606]} 99.5
d1nkpa_88 Myc proto-oncogene protein {Human (Homo sapiens) [ 99.47
d1uklc_61 SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} 99.47
d1an4a_65 Usf B/HLH domain {Human (Homo sapiens) [TaxId: 960 99.46
>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: Pho4 B/HLH domain
species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=99.58  E-value=5e-16  Score=104.13  Aligned_cols=52  Identities=23%  Similarity=0.349  Sum_probs=48.0

Q ss_pred             cccchHHHHHHHHHHHHHHHHHhhcCCC-------CCCCChhhHHHHHHHHHHHHHHHH
Q 040019           11 MFSLFFVVQLRRERIADRMRALQELVPS-------CNKTDRAAMLDEIVDYVKFLRLQV   62 (166)
Q Consensus        11 ~~~hs~aER~RRerIneri~aLq~LVP~-------~~K~DkasVL~eAI~YIk~Lq~qV   62 (166)
                      +..|+.+||+||++||+.|..|+.|||+       ..|++|++||+.||+||++||++|
T Consensus         3 R~~H~~~Er~RR~~in~~~~~L~~llP~~~~~~~~~~k~sKa~iL~~Av~yI~~Lq~~v   61 (63)
T d1a0aa_           3 RESHKHAEQARRNRLAVALHELASLIPAEWKQQNVSAAPSKATTVEAACRYIRHLQQNG   61 (63)
T ss_dssp             TTGGGGGTHHHHHHHHHHHHHHHHTSCHHHHTSSCCCCSCTTHHHHHHHHHHHHHHTCS
T ss_pred             cchHHHHHHHHHHHHHHHHHHHHHHCCCcccccCCcccccHHHHHHHHHHHHHHHHHhh
Confidence            5679999999999999999999999994       468999999999999999999875



>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure