Citrus Sinensis ID: 043478


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-----
MSHIQLKELQEGNHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE
ccccccccccccccccccccccccccccEEEEEEEHHccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHc
cccccHHHHccccccccccccccccccccHHHHHHHHHccccccccccccccccccccHHHHEccccccccccccccHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcccccHHcHHHHHHHHHHHHHHHHHHHHc
mshiqlkelqegnhmklssldlgidddayYKKTLFAIFGrsnqlndnlSFLSVEHKssfenlfsvplmyggcahrspkeicrkycpltmesenFCEEHVSLDkrtknekfMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE
MSHIQLkelqegnhmklSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLdkrtknekfMLLRSVVpyisevdkasilndTIKYLKRLKARVEE
MSHIQLKELQEGNHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE
******************SLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRL******
***********************************************************************************************************EKFMLLRSVVPYISEVDKASILNDTIKYLKRLKA****
MSHIQLKELQEGNHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE
***IQLKEL***NHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCA*******************NFC*****LDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARV**
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSHIQLKELQEGNHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFENLFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEHVSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query145 2.2.26 [Sep-21-2011]
Q8W2F1 526 Transcription factor MYC1 yes no 0.868 0.239 0.356 4e-12
Q9CAD0 596 Transcription factor EGL1 no no 0.855 0.208 0.335 2e-11
Q9FN69 637 Transcription factor GLAB no no 0.834 0.189 0.309 5e-09
Q9FT81 518 Transcription factor TT8 no no 0.331 0.092 0.490 3e-07
P13027 612 Anthocyanin regulatory R- N/A no 0.344 0.081 0.452 5e-06
P13526 610 Anthocyanin regulatory Lc N/A no 0.344 0.081 0.452 5e-06
Q9ZPY8 566 Transcription factor ABA- no no 0.324 0.083 0.46 1e-05
Q39204 623 Transcription factor MYC2 no no 0.462 0.107 0.385 2e-05
Q9LSE2 494 Transcription factor ICE1 no no 0.296 0.087 0.488 3e-05
Q9LPW3 450 Transcription factor SCRE no no 0.268 0.086 0.538 3e-05
>sp|Q8W2F1|BH012_ARATH Transcription factor MYC1 OS=Arabidopsis thaliana GN=BHLH12 PE=1 SV=1 Back     alignment and function desciption
 Score = 70.1 bits (170), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 78/146 (53%), Gaps = 20/146 (13%)

Query: 20  LDLGI-DDDAYYKKTLFAIFG----RSNQLNDNL-----SFLSVEHKSSFENLFSVPLMY 69
           L LGI D+D +YK+T+  +      RS + + N+     + ++ E  SSF          
Sbjct: 243 LPLGISDEDLHYKRTISTVLNYSADRSGKNDKNIRHRQPNIVTSEPGSSFLRWKQCEQQV 302

Query: 70  GGCAHRSPKE-ICRKY---CPLTMESENFCEEHVSL------DKRTKNEKFMLLRSVVPY 119
            G   +   + + RK     PL      F  ++  L      D+R +NEKF +LR++VP 
Sbjct: 303 SGFVQKKKSQNVLRKILHDVPLMHTKRMFPSQNSGLNQDDPSDRRKENEKFSVLRTMVPT 362

Query: 120 ISEVDKASILNDTIKYLKRLKARVEE 145
           ++EVDK SILN+TIKYL+ L+ARVEE
Sbjct: 363 VNEVDKESILNNTIKYLQELEARVEE 388




Trancsription activator, when associated with MYB75/PAP1 or MYB90/PAP2.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9CAD0|EGL1_ARATH Transcription factor EGL1 OS=Arabidopsis thaliana GN=BHLH2 PE=1 SV=1 Back     alignment and function description
>sp|Q9FN69|GL3_ARATH Transcription factor GLABRA 3 OS=Arabidopsis thaliana GN=GL3 PE=1 SV=1 Back     alignment and function description
>sp|Q9FT81|TT8_ARATH Transcription factor TT8 OS=Arabidopsis thaliana GN=TT8 PE=1 SV=2 Back     alignment and function description
>sp|P13027|ARRS_MAIZE Anthocyanin regulatory R-S protein OS=Zea mays GN=R-S PE=2 SV=1 Back     alignment and function description
>sp|P13526|ARLC_MAIZE Anthocyanin regulatory Lc protein OS=Zea mays GN=LC PE=2 SV=1 Back     alignment and function description
>sp|Q9ZPY8|AIB_ARATH Transcription factor ABA-INDUCIBLE bHLH-TYPE OS=Arabidopsis thaliana GN=AIB PE=2 SV=2 Back     alignment and function description
>sp|Q39204|RAP1_ARATH Transcription factor MYC2 OS=Arabidopsis thaliana GN=RAP1 PE=1 SV=2 Back     alignment and function description
>sp|Q9LSE2|ICE1_ARATH Transcription factor ICE1 OS=Arabidopsis thaliana GN=SCRM PE=1 SV=1 Back     alignment and function description
>sp|Q9LPW3|SCRM2_ARATH Transcription factor SCREAM2 OS=Arabidopsis thaliana GN=SCRM2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query145
150416421 656 MYC2 [Citrus sinensis] 1.0 0.221 0.706 3e-60
150416419 656 MYC2 [Citrus sinensis] 1.0 0.221 0.706 4e-60
224127668 649 predicted protein [Populus trichocarpa] 1.0 0.223 0.445 2e-28
224064031 654 predicted protein [Populus trichocarpa] 1.0 0.221 0.445 2e-27
147834195 583 hypothetical protein VITISV_040405 [Viti 1.0 0.248 0.407 2e-26
297745295 639 unnamed protein product [Vitis vinifera] 1.0 0.226 0.407 2e-26
187936004 654 myc anthocyanin regulatory protein [Viti 1.0 0.221 0.407 2e-26
225454234 654 PREDICTED: transcription factor GLABRA 3 1.0 0.221 0.407 2e-26
359489618 659 PREDICTED: transcription factor GLABRA 3 1.0 0.220 0.407 2e-26
255541590 656 transcription factor, putative [Ricinus 0.986 0.217 0.409 8e-22
>gi|150416421|gb|ABR68793.1| MYC2 [Citrus sinensis] Back     alignment and taxonomy information
 Score =  236 bits (601), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 118/167 (70%), Positives = 133/167 (79%), Gaps = 22/167 (13%)

Query: 1   MSHIQLKELQEGNHMKLSSLDLGIDDDAYYKKTLFAIFGRSNQLNDNLSFLSVEHKSSFE 60
           MSHIQLKELQEGNH KLS LDLGIDD A+Y+KTL AIFG SN+L +N  FLSVEHKSSF 
Sbjct: 343 MSHIQLKELQEGNHTKLSLLDLGIDDGAHYRKTLSAIFGSSNRLTENPCFLSVEHKSSFV 402

Query: 61  N----------------------LFSVPLMYGGCAHRSPKEICRKYCPLTMESENFCEEH 98
           +                      LFSVPLM+GGC HRS KEICRKYCP+TMES+NFCEEH
Sbjct: 403 SWKKGGMVKRHWPGIQQNLLKKILFSVPLMHGGCTHRSQKEICRKYCPVTMESDNFCEEH 462

Query: 99  VSLDKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE 145
           +S DKRT+NEKFM+LRS+VPYISEVDKASIL+DTIKYLK+L+ARVEE
Sbjct: 463 ISSDKRTENEKFMVLRSMVPYISEVDKASILSDTIKYLKKLEARVEE 509




Source: Citrus sinensis

Species: Citrus sinensis

Genus: Citrus

Family: Rutaceae

Order: Sapindales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|150416419|gb|ABR68792.1| MYC2 [Citrus sinensis] Back     alignment and taxonomy information
>gi|224127668|ref|XP_002320131.1| predicted protein [Populus trichocarpa] gi|222860904|gb|EEE98446.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224064031|ref|XP_002301357.1| predicted protein [Populus trichocarpa] gi|222843083|gb|EEE80630.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147834195|emb|CAN75308.1| hypothetical protein VITISV_040405 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297745295|emb|CBI40375.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|187936004|gb|ABM92332.3| myc anthocyanin regulatory protein [Vitis vinifera] Back     alignment and taxonomy information
>gi|225454234|ref|XP_002274163.1| PREDICTED: transcription factor GLABRA 3 isoform 1 [Vitis vinifera] Back     alignment and taxonomy information
>gi|359489618|ref|XP_003633950.1| PREDICTED: transcription factor GLABRA 3 isoform 2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|255541590|ref|XP_002511859.1| transcription factor, putative [Ricinus communis] gi|223549039|gb|EEF50528.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query145
TAIR|locus:2127023 580 ATMYC1 [Arabidopsis thaliana ( 0.303 0.075 0.659 2.2e-12
TAIR|locus:2026629 596 EGL3 "AT1G63650" [Arabidopsis 0.931 0.226 0.352 1.6e-11
TAIR|locus:504954829 637 GL3 "AT5G41315" [Arabidopsis t 0.386 0.087 0.440 1.7e-11
UNIPROTKB|Q7XPS7 567 OSJNBa0065O17.4 "OSJNBa0065O17 0.393 0.100 0.5 3.7e-09
UNIPROTKB|Q657A4 370 P0022F12.30 "Regulatory protei 0.344 0.135 0.509 7.2e-08
UNIPROTKB|Q53L62 458 LOC_Os11g15210 "Helix-loop-hel 0.406 0.128 0.467 1e-07
TAIR|locus:2039094 566 AIB "AT2G46510" [Arabidopsis t 0.855 0.219 0.325 2.4e-07
TAIR|locus:2118524 518 TT8 "AT4G09820" [Arabidopsis t 0.331 0.092 0.490 3.4e-07
UNIPROTKB|Q84LF9 310 RERJ1 "Transcription Factor" [ 0.303 0.141 0.533 1.3e-06
UNIPROTKB|Q948Y2451 Plw-OSB2 "R-type basic helix-l 0.813 0.261 0.309 1.6e-06
TAIR|locus:2127023 ATMYC1 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 151 (58.2 bits), Expect = 2.2e-12, Sum P(2) = 2.2e-12
 Identities = 29/44 (65%), Positives = 39/44 (88%)

Query:   102 DKRTKNEKFMLLRSVVPYISEVDKASILNDTIKYLKRLKARVEE 145
             D+R +NEKF +LR++VP ++EVDK SILN+TIKYL+ L+ARVEE
Sbjct:   345 DRRKENEKFSVLRTMVPTVNEVDKESILNNTIKYLQELEARVEE 388


GO:0005634 "nucleus" evidence=ISM
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0005515 "protein binding" evidence=IPI
GO:0048629 "trichome patterning" evidence=IMP
GO:0001708 "cell fate specification" evidence=RCA
GO:0009913 "epidermal cell differentiation" evidence=RCA
GO:0009957 "epidermal cell fate specification" evidence=RCA
GO:0003677 "DNA binding" evidence=ISS
TAIR|locus:2026629 EGL3 "AT1G63650" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504954829 GL3 "AT5G41315" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q7XPS7 OSJNBa0065O17.4 "OSJNBa0065O17.4 protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q657A4 P0022F12.30 "Regulatory protein B-Peru-like" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q53L62 LOC_Os11g15210 "Helix-loop-helix DNA-binding domain, putative" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2039094 AIB "AT2G46510" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118524 TT8 "AT4G09820" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q84LF9 RERJ1 "Transcription Factor" [Oryza sativa (taxid:4530)] Back     alignment and assigned GO terms
UNIPROTKB|Q948Y2 Plw-OSB2 "R-type basic helix-loop-helix protein" [Oryza sativa (taxid:4530)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_Genewise1_v1.C_LG_XIV1259
hypothetical protein (650 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query145
smart0035353 smart00353, HLH, helix loop helix domain 1e-05
cd0008360 cd00083, HLH, Helix-loop-helix domain, found in sp 3e-04
pfam0001052 pfam00010, HLH, Helix-loop-helix DNA-binding domai 0.003
>gnl|CDD|197674 smart00353, HLH, helix loop helix domain Back     alignment and domain information
 Score = 40.3 bits (95), Expect = 1e-05
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 104 RTKNEKFMLLRSVVPY---ISEVDKASILNDTIKYLKRLKARVEE 145
           R  NE F  LRS++P      ++ KA IL   I+Y+K L+  +++
Sbjct: 9   RKINEAFDELRSLLPTLPKNKKLSKAEILRLAIEYIKSLQEELQK 53


Length = 53

>gnl|CDD|238036 cd00083, HLH, Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>gnl|CDD|215654 pfam00010, HLH, Helix-loop-helix DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 145
cd0008360 HLH Helix-loop-helix domain, found in specific DNA 99.43
smart0035353 HLH helix loop helix domain. 99.4
PF0001055 HLH: Helix-loop-helix DNA-binding domain only nucl 99.35
KOG1318 411 consensus Helix loop helix transcription factor EB 98.4
KOG1319 229 consensus bHLHZip transcription factor BIGMAX [Tra 98.06
KOG2588 953 consensus Predicted DNA-binding protein [Transcrip 97.86
KOG4304 250 consensus Transcriptional repressors of the hairy/ 97.83
KOG3561 803 consensus Aryl-hydrocarbon receptor nuclear transl 97.79
KOG2483 232 consensus Upstream transcription factor 2/L-myc-2 97.71
KOG0561 373 consensus bHLH transcription factor [Transcription 97.67
KOG3960 284 consensus Myogenic helix-loop-helix transcription 97.36
KOG4029228 consensus Transcription factor HAND2/Transcription 97.31
KOG3558 768 consensus Hypoxia-inducible factor 1/Neuronal PAS 94.0
KOG3910632 consensus Helix loop helix transcription factor [T 92.77
PLN0321793 transcription factor ATBS1; Provisional 92.72
KOG4447173 consensus Transcription factor TWIST [Transcriptio 87.81
KOG3560 712 consensus Aryl-hydrocarbon receptor [Transcription 83.89
KOG3898 254 consensus Transcription factor NeuroD and related 83.8
KOG3559 598 consensus Transcriptional regulator SIM1 [Transcri 80.39
>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
Probab=99.43  E-value=2.8e-13  Score=87.54  Aligned_cols=50  Identities=32%  Similarity=0.578  Sum_probs=47.0

Q ss_pred             hhhhhhHHHHhH---HHhHHHHHhcCCCC---CCcchhhhHHHHHHHHHHHHHHhh
Q 043478           95 CEEHVSLDKRTK---NEKFMLLRSVVPYI---SEVDKASILNDTIKYLKRLKARVE  144 (145)
Q Consensus        95 ~~~h~~~ERrRR---ne~f~~LrslvP~~---~K~dKaSIL~daI~Yik~Lq~~V~  144 (145)
                      ...|+..||+||   |+.|..|+++||..   .|+||++||..||+||+.|+.+++
T Consensus         5 r~~~~~~Er~RR~~~n~~~~~L~~llp~~~~~~k~~k~~iL~~a~~yI~~L~~~~~   60 (60)
T cd00083           5 REAHNLRERRRRERINDAFDELRSLLPTLPPSKKLSKAEILRKAVDYIKSLQELLQ   60 (60)
T ss_pred             HHHHhHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHhC
Confidence            567999999999   99999999999988   899999999999999999999874



A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and

>smart00353 HLH helix loop helix domain Back     alignment and domain information
>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt) Back     alignment and domain information
>KOG1318 consensus Helix loop helix transcription factor EB [Transcription] Back     alignment and domain information
>KOG1319 consensus bHLHZip transcription factor BIGMAX [Transcription] Back     alignment and domain information
>KOG2588 consensus Predicted DNA-binding protein [Transcription] Back     alignment and domain information
>KOG4304 consensus Transcriptional repressors of the hairy/E(spl) family (contains HLH) [Transcription] Back     alignment and domain information
>KOG3561 consensus Aryl-hydrocarbon receptor nuclear translocator [Transcription] Back     alignment and domain information
>KOG2483 consensus Upstream transcription factor 2/L-myc-2 protein [Transcription] Back     alignment and domain information
>KOG0561 consensus bHLH transcription factor [Transcription] Back     alignment and domain information
>KOG3960 consensus Myogenic helix-loop-helix transcription factor [Transcription] Back     alignment and domain information
>KOG4029 consensus Transcription factor HAND2/Transcription factor TAL1/TAL2/LYL1 [Transcription] Back     alignment and domain information
>KOG3558 consensus Hypoxia-inducible factor 1/Neuronal PAS domain protein NPAS1 [Signal transduction mechanisms; Transcription] Back     alignment and domain information
>KOG3910 consensus Helix loop helix transcription factor [Transcription] Back     alignment and domain information
>PLN03217 transcription factor ATBS1; Provisional Back     alignment and domain information
>KOG4447 consensus Transcription factor TWIST [Transcription] Back     alignment and domain information
>KOG3560 consensus Aryl-hydrocarbon receptor [Transcription] Back     alignment and domain information
>KOG3898 consensus Transcription factor NeuroD and related HTH proteins [Transcription] Back     alignment and domain information
>KOG3559 consensus Transcriptional regulator SIM1 [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query145
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 9e-05
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Length = 82 Back     alignment and structure
 Score = 37.7 bits (88), Expect = 9e-05
 Identities = 9/40 (22%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 107 NEKFMLLRSVVPYI-SEVDKASILNDTIKYLKRLKARVEE 145
           N+K + L+ +V    ++++K+++L   I Y++ L+   ++
Sbjct: 22  NDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQK 61


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query145
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 99.59
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 99.55
4h10_B71 Circadian locomoter output cycles protein kaput; B 99.54
1a0a_A63 BHLH, protein (phosphate system positive regulator 99.49
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 99.48
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 99.47
1hlo_A80 Protein (transcription factor MAX); transcriptiona 99.46
4ati_A118 MITF, microphthalmia-associated transcription fact 99.46
4h10_A73 ARYL hydrocarbon receptor nuclear translocator-LI 99.45
3u5v_A76 Protein MAX, transcription factor E2-alpha chimer; 99.33
1nlw_A80 MAD protein, MAX dimerizer; transcription factor, 99.33
1mdy_A68 Protein (MYOD BHLH domain); protein-DNA complex, t 99.22
2ql2_B60 Neurod1, neurogenic differentiation factor 1; basi 99.13
4f3l_A 361 Mclock, circadian locomoter output cycles protein 98.99
4f3l_B 387 BMAL1B; BHLH, PAS, circadian rhythm proteins, tran 98.85
2lfh_A68 DNA-binding protein inhibitor ID-3; structural gen 98.6
4ath_A83 MITF, microphthalmia-associated transcription fact 98.19
4aya_A97 DNA-binding protein inhibitor ID-2; cell cycle; 2. 98.0
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Back     alignment and structure
Probab=99.59  E-value=8.4e-16  Score=107.22  Aligned_cols=51  Identities=24%  Similarity=0.450  Sum_probs=48.3

Q ss_pred             hhhhhhhHHHHhH---HHhHHHHHhcCCCC-CCcchhhhHHHHHHHHHHHHHHhh
Q 043478           94 FCEEHVSLDKRTK---NEKFMLLRSVVPYI-SEVDKASILNDTIKYLKRLKARVE  144 (145)
Q Consensus        94 ~~~~h~~~ERrRR---ne~f~~LrslvP~~-~K~dKaSIL~daI~Yik~Lq~~V~  144 (145)
                      ...+|+.+||+||   |++|..|+++||+. .|+|||+||.+||+||++|+.+++
T Consensus         6 rr~~H~~~ErrRR~~in~~f~~L~~lvP~~~~k~~Ka~IL~~Ai~YI~~Lq~~~~   60 (82)
T 1am9_A            6 KRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQ   60 (82)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHhhhhHHHHHHHHHHHHHHHHHHhccCCCCCCCHHHHHHHHHHHHHHHHHHHH
Confidence            4678999999999   99999999999998 899999999999999999999886



>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>4h10_B Circadian locomoter output cycles protein kaput; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>1a0a_A BHLH, protein (phosphate system positive regulatory protein PHO4); transcription factor, basic helix loop helix; HET: DNA; 2.80A {Saccharomyces cerevisiae} SCOP: a.38.1.1 Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>4ati_A MITF, microphthalmia-associated transcription factor; DNA-binding protein-DNA complex, melanoma; 2.60A {Mus musculus} PDB: 4atk_A Back     alignment and structure
>4h10_A ARYL hydrocarbon receptor nuclear translocator-LI 1; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>3u5v_A Protein MAX, transcription factor E2-alpha chimer; basic helix-loop-helix (BHLH); 1.70A {Mus musculus} PDB: 2ql2_A* Back     alignment and structure
>1nlw_A MAD protein, MAX dimerizer; transcription factor, DNA, BHLHZ, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1mdy_A Protein (MYOD BHLH domain); protein-DNA complex, transcription/DNA complex; HET: DNA; 2.80A {Mus musculus} SCOP: a.38.1.1 PDB: 1mdy_B* Back     alignment and structure
>2ql2_B Neurod1, neurogenic differentiation factor 1; basic-helix-loop-helix; HET: DNA; 2.50A {Mus musculus} Back     alignment and structure
>4f3l_A Mclock, circadian locomoter output cycles protein kaput; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>4f3l_B BMAL1B; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>2lfh_A DNA-binding protein inhibitor ID-3; structural genomics, northeast structural genomics consortiu PSI-biology, protein structure initiative; NMR {Homo sapiens} Back     alignment and structure
>4ath_A MITF, microphthalmia-associated transcription factor; DNA binding protein, melanoma; HET: MSE; 1.95A {Mus musculus} Back     alignment and structure
>4aya_A DNA-binding protein inhibitor ID-2; cell cycle; 2.10A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 145
d1uklc_61 a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId 4e-07
d1mdya_68 a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus muscul 8e-05
d1nkpa_88 a.38.1.1 (A:) Myc proto-oncogene protein {Human (H 6e-04
d1nlwa_79 a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [T 0.001
d1an4a_65 a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapien 0.002
d1nkpb_83 a.38.1.1 (B:) Max protein {Human (Homo sapiens) [T 0.002
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-2
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 42.8 bits (101), Expect = 4e-07
 Identities = 11/44 (25%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 104 RTK-NEKFMLLRSVVPYI-SEVDKASILNDTIKYLKRLKARVEE 145
           R+  N+K + L+ +V    +++ K+ +L   I Y+K L+    +
Sbjct: 1   RSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQVNHK 44


>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 68 Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Length = 88 Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Length = 79 Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Length = 65 Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Length = 83 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query145
d1mdya_68 Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10 99.53
d1a0aa_63 Pho4 B/HLH domain {Baker's yeast (Saccharomyces ce 99.52
d1am9a_80 SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} 99.48
d1nlwa_79 Mad protein {Human (Homo sapiens) [TaxId: 9606]} 99.43
d1nkpb_83 Max protein {Human (Homo sapiens) [TaxId: 9606]} 99.4
d1an4a_65 Usf B/HLH domain {Human (Homo sapiens) [TaxId: 960 99.39
d1nkpa_88 Myc proto-oncogene protein {Human (Homo sapiens) [ 99.37
d1uklc_61 SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} 99.32
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: Myod B/HLH domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.53  E-value=3e-15  Score=99.58  Aligned_cols=54  Identities=22%  Similarity=0.358  Sum_probs=49.3

Q ss_pred             cchhhhhhhHHHHhH---HHhHHHHHhcCCCC--CCcchhhhHHHHHHHHHHHHHHhhC
Q 043478           92 ENFCEEHVSLDKRTK---NEKFMLLRSVVPYI--SEVDKASILNDTIKYLKRLKARVEE  145 (145)
Q Consensus        92 ~~~~~~h~~~ERrRR---ne~f~~LrslvP~~--~K~dKaSIL~daI~Yik~Lq~~V~e  145 (145)
                      .+....|+..||+||   |+.|..|+++||.+  .|++|++||..||+||++||+.|+|
T Consensus        10 ~~rR~~hn~~Er~Rr~~in~~~~~L~~llP~~~~~k~sK~~iL~~Ai~YI~~Lq~~l~e   68 (68)
T d1mdya_          10 ADRRKAATMRERRRLSKVNEAFETLKRSTSSNPNQRLPKVEILRNAIRYIEGLQALLRD   68 (68)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHTTSCSCTTSCCCHHHHHHHHHHHHHHHHHTTCC
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhC
Confidence            345678999999999   99999999999976  6889999999999999999999986



>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure