Citrus Sinensis ID: 039972


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------17
MEYVNAKTLASGKGNKKKKSSGSKSNPMKLSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDFNVFQGEESETLMHLDGFMKY
ccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
ccEEcccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccHHHHHccccccHHHccccccccccEEEEEEEEEEc
meyvnaktlasgkgnkkkkssgsksnpmklstdpqsVAARQRRHRISDRFKILqsmvpggtkmdtASMLDEAINYVKFLKAVIWFHENIINfadhhhhehdppvafgqlpgsfpyspeslyssinnlaaapvpqqdsmqasetqpfqdfnvfqgeesetlmhldgfmky
meyvnaktlasgkgnkkkkssgsksnpmklstdpqsvaarqrRHRIsdrfkilqsmvpggTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDFNVFQGEESetlmhldgfmky
MEYVNAKTLAsgkgnkkkkssgsksnPMKLSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDFNVFQGEESETLMHLDGFMKY
***************************************************IL**********DTASMLDEAINYVKFLKAVIWFHENIINF*****************************************************************************
*******************************************HRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHE***************************************************************************LDGFMKY
*******************************************HRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAP***********TQPFQDFNVFQGEESETLMHLDGFMKY
**********************************QSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDFNVFQGEESETLMHLDGFMKY
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEYVNAKTLASGKGNKKKKSSGSKSNPMKLSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDFNVFQGEESETLMHLDGFMKY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query169 2.2.26 [Sep-21-2011]
Q9M041 912 Transcription factor bHLH yes no 0.585 0.108 0.583 1e-27
Q7XAQ6215 Transcription factor LAX yes no 0.366 0.288 0.709 2e-21
Q8S3D2373 Transcription factor bHLH no no 0.502 0.227 0.538 5e-21
O81313198 Transcription factor IND no no 0.313 0.267 0.660 4e-15
Q9C707298 Transcription factor bHLH no no 0.396 0.224 0.535 1e-13
Q3E7L7223 Transcription factor bHLH no no 0.562 0.426 0.364 1e-13
Q84WK0352 Transcription factor bHLH no no 0.313 0.150 0.584 2e-13
Q7XHI9328 Transcription factor bHLH no no 0.313 0.161 0.584 3e-13
Q9FJ00307 Putative transcription fa no no 0.396 0.218 0.521 7e-13
Q8LEG1258 Transcription factor bHLH no no 0.479 0.313 0.404 1e-12
>sp|Q9M041|BH140_ARATH Transcription factor bHLH140 OS=Arabidopsis thaliana GN=BHLH140 PE=4 SV=1 Back     alignment and function desciption
 Score =  122 bits (306), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 63/108 (58%), Positives = 76/108 (70%), Gaps = 9/108 (8%)

Query: 11  SGKGNKKKKSSGSKSNPMKLSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLD 70
           S  GN K+  S S      LSTDPQSVAAR RRHRISDRFKILQSMVPGG KMDT SMLD
Sbjct: 29  SETGNTKRSRSTST-----LSTDPQSVAARDRRHRISDRFKILQSMVPGGAKMDTVSMLD 83

Query: 71  EAINYVKFLKAVIWFHENIINFADHHHHEHDPPVAFGQLPGSFPYSPE 118
           EAI+YVKFLKA IW+H+N++ F +    +H+   +    PG+  + P+
Sbjct: 84  EAISYVKFLKAQIWYHQNMLLFIN----DHETTSSCTYSPGAGEFGPK 127





Arabidopsis thaliana (taxid: 3702)
>sp|Q7XAQ6|LAX_ORYSJ Transcription factor LAX PANICLE OS=Oryza sativa subsp. japonica GN=LAX PE=1 SV=1 Back     alignment and function description
>sp|Q8S3D2|BH087_ARATH Transcription factor bHLH87 OS=Arabidopsis thaliana GN=BHLH87 PE=2 SV=1 Back     alignment and function description
>sp|O81313|IND_ARATH Transcription factor IND OS=Arabidopsis thaliana GN=IND PE=1 SV=3 Back     alignment and function description
>sp|Q9C707|BH083_ARATH Transcription factor bHLH83 OS=Arabidopsis thaliana GN=BHLH83 PE=2 SV=1 Back     alignment and function description
>sp|Q3E7L7|BH139_ARATH Transcription factor bHLH139 OS=Arabidopsis thaliana GN=BHLH139 PE=4 SV=1 Back     alignment and function description
>sp|Q84WK0|BH085_ARATH Transcription factor bHLH85 OS=Arabidopsis thaliana GN=BHLH85 PE=2 SV=1 Back     alignment and function description
>sp|Q7XHI9|BH084_ARATH Transcription factor bHLH84 OS=Arabidopsis thaliana GN=BHLH84 PE=2 SV=1 Back     alignment and function description
>sp|Q9FJ00|BH086_ARATH Putative transcription factor bHLH086 OS=Arabidopsis thaliana GN=BHLH86 PE=4 SV=2 Back     alignment and function description
>sp|Q8LEG1|BH054_ARATH Transcription factor bHLH54 OS=Arabidopsis thaliana GN=BHLH54 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query169
147838921161 hypothetical protein VITISV_022410 [Viti 0.668 0.701 0.541 3e-27
449461799195 PREDICTED: transcription factor bHLH87-l 0.721 0.625 0.534 3e-27
224091024165 predicted protein [Populus trichocarpa] 0.810 0.830 0.477 6e-27
15240948 912 aprataxin [Arabidopsis thaliana] gi|7533 0.585 0.108 0.583 6e-26
297810265 898 basic helix-loop-helix family protein [A 0.479 0.090 0.686 2e-25
357506069153 LAX [Medicago truncatula] gi|355498338|g 0.585 0.647 0.559 2e-25
224091579163 predicted protein [Populus trichocarpa] 0.414 0.429 0.771 1e-24
357131240228 PREDICTED: transcription factor LAX PANI 0.461 0.342 0.594 6e-21
227016421211 barren stalk1-like protein [Leymus ciner 0.366 0.293 0.709 4e-20
227016419211 barren stalk1-like protein [Leymus ciner 0.366 0.293 0.709 4e-20
>gi|147838921|emb|CAN77067.1| hypothetical protein VITISV_022410 [Vitis vinifera] gi|297735648|emb|CBI18142.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  126 bits (316), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 71/131 (54%), Positives = 85/131 (64%), Gaps = 18/131 (13%)

Query: 30  LSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENI 89
           LSTDPQSVAAR+RRHRISDRFKILQS+VPGGTKMDT SML+EAI+YVK+LK  IW H+ +
Sbjct: 36  LSTDPQSVAARERRHRISDRFKILQSLVPGGTKMDTVSMLEEAIHYVKYLKTQIWLHQTM 95

Query: 90  INFADHHHHEHDPPVAFGQLPGSFP-YSPESLYSSINNLAAAPVPQQDSMQ--ASETQPF 146
           INF D      DP + +     S P +  E+ YS    LA         MQ   + T P 
Sbjct: 96  INFVD-----DDPSLLYQS--DSLPSHQQEAFYSPDETLAM--------MQPYITTTAPV 140

Query: 147 QDFNVFQGEES 157
           +D   FQGEE+
Sbjct: 141 RDSYCFQGEET 151




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449461799|ref|XP_004148629.1| PREDICTED: transcription factor bHLH87-like [Cucumis sativus] gi|449532778|ref|XP_004173357.1| PREDICTED: transcription factor bHLH87-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224091024|ref|XP_002309150.1| predicted protein [Populus trichocarpa] gi|222855126|gb|EEE92673.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|15240948|ref|NP_195751.1| aprataxin [Arabidopsis thaliana] gi|75335734|sp|Q9M041.1|BH140_ARATH RecName: Full=Transcription factor bHLH140; AltName: Full=Basic helix-loop-helix protein 140; Short=AtbHLH140; Short=bHLH 140; AltName: Full=Transcription factor EN 122; AltName: Full=bHLH transcription factor bHLH140 gi|7320709|emb|CAB81914.1| putative protein [Arabidopsis thaliana] gi|332002943|gb|AED90326.1| aprataxin [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297810265|ref|XP_002873016.1| basic helix-loop-helix family protein [Arabidopsis lyrata subsp. lyrata] gi|297318853|gb|EFH49275.1| basic helix-loop-helix family protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|357506069|ref|XP_003623323.1| LAX [Medicago truncatula] gi|355498338|gb|AES79541.1| LAX [Medicago truncatula] Back     alignment and taxonomy information
>gi|224091579|ref|XP_002334947.1| predicted protein [Populus trichocarpa] gi|222875273|gb|EEF12404.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|357131240|ref|XP_003567247.1| PREDICTED: transcription factor LAX PANICLE-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|227016421|gb|ACP18672.1| barren stalk1-like protein [Leymus cinereus x Leymus triticoides] Back     alignment and taxonomy information
>gi|227016419|gb|ACP18671.1| barren stalk1-like protein [Leymus cinereus x Leymus triticoides] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query169
TAIR|locus:2179122 912 APTX "APRATAXIN-like" [Arabido 0.781 0.144 0.485 5.7e-24
UNIPROTKB|Q5MP56219 BA1 "Uncharacterized protein" 0.349 0.269 0.745 1.2e-19
TAIR|locus:2144791224 HEC3 "AT5G09750" [Arabidopsis 0.331 0.25 0.75 2.4e-19
TAIR|locus:2089418373 AT3G21330 "AT3G21330" [Arabido 0.514 0.233 0.531 4.3e-19
UNIPROTKB|Q6Z9R3246 P0461F06.33 "BHLH protein fami 0.502 0.345 0.545 6.5e-19
UNIPROTKB|Q67TR8236 B1342C04.6 "Basic helix-loop-h 0.331 0.237 0.732 4.1e-17
TAIR|locus:2074865231 HEC2 "AT3G50330" [Arabidopsis 0.313 0.229 0.773 4.1e-17
TAIR|locus:2155543241 HEC1 "HECATE 1" [Arabidopsis t 0.313 0.219 0.773 4.1e-17
UNIPROTKB|Q657D1484 P0697C12.40 "BHLH transcriptio 0.331 0.115 0.678 6.7e-16
UNIPROTKB|Q69TX2211 P0021C04.13 "BHLH protein-like 0.550 0.440 0.478 7.7e-16
TAIR|locus:2179122 APTX "APRATAXIN-like" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 286 (105.7 bits), Expect = 5.7e-24, P = 5.7e-24
 Identities = 68/140 (48%), Positives = 86/140 (61%)

Query:    30 LSTDPQSVAARQRRHRISDRFKILQSMVPGGTKMDTASMLDEAINYVKFLKAVIWFHENI 89
             LSTDPQSVAAR RRHRISDRFKILQSMVPGG KMDT SMLDEAI+YVKFLKA IW+H+N+
Sbjct:    43 LSTDPQSVAARDRRHRISDRFKILQSMVPGGAKMDTVSMLDEAISYVKFLKAQIWYHQNM 102

Query:    90 INFADHHHHEHDPPVAFGQLPGSFPYSPESLYSSINNLAAAPVPQQDSMQASETQPFQDF 149
             + F + H  E      +   PG+  + P+ L+   ++ A  P+    S     T     +
Sbjct:   103 LLFINDH--ETTSSCTYS--PGAGEFGPK-LFGYDDDYA--PIMDTYSQGVPLTVADSKY 155

Query:   150 NVFQGEESETLMHLDGFMKY 169
               + G   +   H+  F KY
Sbjct:   156 TPWFGSVDDEQEHVTYF-KY 174




GO:0003824 "catalytic activity" evidence=IEA
GO:0005622 "intracellular" evidence=IEA
GO:0005634 "nucleus" evidence=ISM
GO:0008270 "zinc ion binding" evidence=IEA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0006790 "sulfur compound metabolic process" evidence=IDA
GO:0009150 "purine ribonucleotide metabolic process" evidence=IDA
GO:0047627 "adenylylsulfatase activity" evidence=IDA
GO:0000278 "mitotic cell cycle" evidence=RCA
GO:0006259 "DNA metabolic process" evidence=RCA
GO:0006310 "DNA recombination" evidence=RCA
GO:0006396 "RNA processing" evidence=RCA
GO:0007062 "sister chromatid cohesion" evidence=RCA
GO:0007126 "meiosis" evidence=RCA
GO:0007131 "reciprocal meiotic recombination" evidence=RCA
GO:0010332 "response to gamma radiation" evidence=RCA
GO:0010413 "glucuronoxylan metabolic process" evidence=RCA
GO:0032204 "regulation of telomere maintenance" evidence=RCA
GO:0033044 "regulation of chromosome organization" evidence=RCA
GO:0042138 "meiotic DNA double-strand break formation" evidence=RCA
GO:0043247 "telomere maintenance in response to DNA damage" evidence=RCA
GO:0045132 "meiotic chromosome segregation" evidence=RCA
GO:0045492 "xylan biosynthetic process" evidence=RCA
UNIPROTKB|Q5MP56 BA1 "Uncharacterized protein" [Zea mays (taxid:4577)] Back     alignment and assigned GO terms
TAIR|locus:2144791 HEC3 "AT5G09750" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089418 AT3G21330 "AT3G21330" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q6Z9R3 P0461F06.33 "BHLH protein family-like" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q67TR8 B1342C04.6 "Basic helix-loop-helix (BHLH)-like protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2074865 HEC2 "AT3G50330" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2155543 HEC1 "HECATE 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q657D1 P0697C12.40 "BHLH transcription-like" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q69TX2 P0021C04.13 "BHLH protein-like" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query169
smart0035353 smart00353, HLH, helix loop helix domain 3e-13
cd0008360 cd00083, HLH, Helix-loop-helix domain, found in sp 7e-13
pfam0001052 pfam00010, HLH, Helix-loop-helix DNA-binding domai 4e-10
>gnl|CDD|197674 smart00353, HLH, helix loop helix domain Back     alignment and domain information
 Score = 60.3 bits (147), Expect = 3e-13
 Identities = 16/50 (32%), Positives = 28/50 (56%), Gaps = 3/50 (6%)

Query: 36 SVAARQRRHRISDRFKILQSMVPG---GTKMDTASMLDEAINYVKFLKAV 82
          +   R+RR +I++ F  L+S++P      K+  A +L  AI Y+K L+  
Sbjct: 1  NARERRRRRKINEAFDELRSLLPTLPKNKKLSKAEILRLAIEYIKSLQEE 50


Length = 53

>gnl|CDD|238036 cd00083, HLH, Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>gnl|CDD|215654 pfam00010, HLH, Helix-loop-helix DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 169
cd0008360 HLH Helix-loop-helix domain, found in specific DNA 99.53
PF0001055 HLH: Helix-loop-helix DNA-binding domain only nucl 99.52
smart0035353 HLH helix loop helix domain. 99.51
KOG1318411 consensus Helix loop helix transcription factor EB 99.22
KOG4304250 consensus Transcriptional repressors of the hairy/ 98.98
KOG1319229 consensus bHLHZip transcription factor BIGMAX [Tra 98.96
KOG2483232 consensus Upstream transcription factor 2/L-myc-2 98.68
KOG3561 803 consensus Aryl-hydrocarbon receptor nuclear transl 98.57
KOG2588 953 consensus Predicted DNA-binding protein [Transcrip 98.21
KOG0561 373 consensus bHLH transcription factor [Transcription 98.05
KOG3960284 consensus Myogenic helix-loop-helix transcription 98.02
PLN0321793 transcription factor ATBS1; Provisional 97.98
KOG4029228 consensus Transcription factor HAND2/Transcription 97.86
KOG3910632 consensus Helix loop helix transcription factor [T 95.7
KOG4447173 consensus Transcription factor TWIST [Transcriptio 95.68
KOG3560 712 consensus Aryl-hydrocarbon receptor [Transcription 95.13
KOG3898254 consensus Transcription factor NeuroD and related 93.62
KOG3558 768 consensus Hypoxia-inducible factor 1/Neuronal PAS 92.71
KOG3559 598 consensus Transcriptional regulator SIM1 [Transcri 91.31
KOG4395285 consensus Transcription factor Atonal, contains HT 80.24
>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
Probab=99.53  E-value=1.7e-14  Score=95.24  Aligned_cols=51  Identities=33%  Similarity=0.660  Sum_probs=48.6

Q ss_pred             CcccHHHHHHHHHHHHHHHHHHhcCCCC---CCCChhhHHHHHHHHHHHHHHHH
Q 039972           33 DPQSVAARQRRHRISDRFKILQSMVPGG---TKMDTASMLDEAINYVKFLKAVI   83 (169)
Q Consensus        33 ~~h~~~ER~RR~kin~~~~~Lr~lvP~~---~K~dkasiL~~aI~YIk~Lq~~v   83 (169)
                      ..|+..||+||++||++|..|+++||..   .|+||++||+.||+||+.|+.++
T Consensus         6 ~~~~~~Er~RR~~~n~~~~~L~~llp~~~~~~k~~k~~iL~~a~~yI~~L~~~~   59 (60)
T cd00083           6 EAHNLRERRRRERINDAFDELRSLLPTLPPSKKLSKAEILRKAVDYIKSLQELL   59 (60)
T ss_pred             HHHhHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHh
Confidence            4699999999999999999999999987   89999999999999999999876



A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and

>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt) Back     alignment and domain information
>smart00353 HLH helix loop helix domain Back     alignment and domain information
>KOG1318 consensus Helix loop helix transcription factor EB [Transcription] Back     alignment and domain information
>KOG4304 consensus Transcriptional repressors of the hairy/E(spl) family (contains HLH) [Transcription] Back     alignment and domain information
>KOG1319 consensus bHLHZip transcription factor BIGMAX [Transcription] Back     alignment and domain information
>KOG2483 consensus Upstream transcription factor 2/L-myc-2 protein [Transcription] Back     alignment and domain information
>KOG3561 consensus Aryl-hydrocarbon receptor nuclear translocator [Transcription] Back     alignment and domain information
>KOG2588 consensus Predicted DNA-binding protein [Transcription] Back     alignment and domain information
>KOG0561 consensus bHLH transcription factor [Transcription] Back     alignment and domain information
>KOG3960 consensus Myogenic helix-loop-helix transcription factor [Transcription] Back     alignment and domain information
>PLN03217 transcription factor ATBS1; Provisional Back     alignment and domain information
>KOG4029 consensus Transcription factor HAND2/Transcription factor TAL1/TAL2/LYL1 [Transcription] Back     alignment and domain information
>KOG3910 consensus Helix loop helix transcription factor [Transcription] Back     alignment and domain information
>KOG4447 consensus Transcription factor TWIST [Transcription] Back     alignment and domain information
>KOG3560 consensus Aryl-hydrocarbon receptor [Transcription] Back     alignment and domain information
>KOG3898 consensus Transcription factor NeuroD and related HTH proteins [Transcription] Back     alignment and domain information
>KOG3558 consensus Hypoxia-inducible factor 1/Neuronal PAS domain protein NPAS1 [Signal transduction mechanisms; Transcription] Back     alignment and domain information
>KOG3559 consensus Transcriptional regulator SIM1 [Transcription] Back     alignment and domain information
>KOG4395 consensus Transcription factor Atonal, contains HTH domain [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query169
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 2e-10
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 1e-07
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 1e-07
1hlo_A80 Protein (transcription factor MAX); transcriptiona 6e-07
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 2e-06
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Length = 82 Back     alignment and structure
 Score = 53.2 bits (128), Expect = 2e-10
 Identities = 13/46 (28%), Positives = 30/46 (65%), Gaps = 1/46 (2%)

Query: 36 SVAARQRRHRISDRFKILQSMVPG-GTKMDTASMLDEAINYVKFLK 80
          +   ++ R  I+D+   L+ +V G   K++ +++L +AI+Y++FL+
Sbjct: 11 NAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQ 56


>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Length = 83 Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Length = 88 Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Length = 80 Back     alignment and structure
>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Length = 65 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query169
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 99.72
4ati_A118 MITF, microphthalmia-associated transcription fact 99.66
4h10_B71 Circadian locomoter output cycles protein kaput; B 99.66
1a0a_A63 BHLH, protein (phosphate system positive regulator 99.65
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 99.64
1hlo_A80 Protein (transcription factor MAX); transcriptiona 99.61
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 99.61
4h10_A73 ARYL hydrocarbon receptor nuclear translocator-LI 99.61
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 99.58
1nlw_A80 MAD protein, MAX dimerizer; transcription factor, 99.51
3u5v_A76 Protein MAX, transcription factor E2-alpha chimer; 99.51
1mdy_A68 Protein (MYOD BHLH domain); protein-DNA complex, t 99.26
2ql2_B60 Neurod1, neurogenic differentiation factor 1; basi 99.23
4f3l_A 361 Mclock, circadian locomoter output cycles protein 99.17
4f3l_B 387 BMAL1B; BHLH, PAS, circadian rhythm proteins, tran 99.03
2lfh_A68 DNA-binding protein inhibitor ID-3; structural gen 98.88
4ath_A83 MITF, microphthalmia-associated transcription fact 98.84
4aya_A97 DNA-binding protein inhibitor ID-2; cell cycle; 2. 98.44
1pyi_A96 Protein (pyrimidine pathway regulator 1); protein- 84.19
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Back     alignment and structure
Probab=99.72  E-value=6.8e-18  Score=119.53  Aligned_cols=62  Identities=21%  Similarity=0.333  Sum_probs=57.4

Q ss_pred             CcccHHHHHHHHHHHHHHHHHHhcCCCC-CCCChhhHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 039972           33 DPQSVAARQRRHRISDRFKILQSMVPGG-TKMDTASMLDEAINYVKFLKAVIWFHENIINFAD   94 (169)
Q Consensus        33 ~~h~~~ER~RR~kin~~~~~Lr~lvP~~-~K~dkasiL~~aI~YIk~Lq~~v~~L~~~~~~~~   94 (169)
                      ..|+.+||+||++||++|..|++|||++ .|+|||+||++||+||++|+.+++.|+.....+.
T Consensus         8 ~~H~~~ErrRR~~in~~f~~L~~lvP~~~~k~~Ka~IL~~Ai~YI~~Lq~~~~~L~~e~~~L~   70 (82)
T 1am9_A            8 TAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENLSLR   70 (82)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HhhhhHHHHHHHHHHHHHHHHHHhccCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4699999999999999999999999997 9999999999999999999999999988666553



>4ati_A MITF, microphthalmia-associated transcription factor; DNA-binding protein-DNA complex, melanoma; 2.60A {Mus musculus} PDB: 4atk_A Back     alignment and structure
>4h10_B Circadian locomoter output cycles protein kaput; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>1a0a_A BHLH, protein (phosphate system positive regulatory protein PHO4); transcription factor, basic helix loop helix; HET: DNA; 2.80A {Saccharomyces cerevisiae} SCOP: a.38.1.1 Back     alignment and structure
>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Back     alignment and structure
>4h10_A ARYL hydrocarbon receptor nuclear translocator-LI 1; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1nlw_A MAD protein, MAX dimerizer; transcription factor, DNA, BHLHZ, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>3u5v_A Protein MAX, transcription factor E2-alpha chimer; basic helix-loop-helix (BHLH); 1.70A {Mus musculus} PDB: 2ql2_A* Back     alignment and structure
>1mdy_A Protein (MYOD BHLH domain); protein-DNA complex, transcription/DNA complex; HET: DNA; 2.80A {Mus musculus} SCOP: a.38.1.1 PDB: 1mdy_B* Back     alignment and structure
>2ql2_B Neurod1, neurogenic differentiation factor 1; basic-helix-loop-helix; HET: DNA; 2.50A {Mus musculus} Back     alignment and structure
>4f3l_A Mclock, circadian locomoter output cycles protein kaput; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>4f3l_B BMAL1B; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>2lfh_A DNA-binding protein inhibitor ID-3; structural genomics, northeast structural genomics consortiu PSI-biology, protein structure initiative; NMR {Homo sapiens} Back     alignment and structure
>4ath_A MITF, microphthalmia-associated transcription factor; DNA binding protein, melanoma; HET: MSE; 1.95A {Mus musculus} Back     alignment and structure
>4aya_A DNA-binding protein inhibitor ID-2; cell cycle; 2.10A {Homo sapiens} Back     alignment and structure
>1pyi_A Protein (pyrimidine pathway regulator 1); protein-DNA complex, transcription/DNA complex, GAL4, zinc finger, Zn2Cys6, binuclear cluster; HET: DNA; 3.20A {Saccharomyces cerevisiae} SCOP: g.38.1.1 h.1.3.1 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 169
d1uklc_61 a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId 3e-13
d1a0aa_63 a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Sa 4e-12
d1nkpb_83 a.38.1.1 (B:) Max protein {Human (Homo sapiens) [T 1e-11
d1nkpa_88 a.38.1.1 (A:) Myc proto-oncogene protein {Human (H 1e-10
d1mdya_68 a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus muscul 2e-10
d1an4a_65 a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapien 5e-10
d1nlwa_79 a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [T 1e-09
d1am9a_80 a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxI 9e-09
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-2
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 59.0 bits (143), Expect = 3e-13
 Identities = 14/40 (35%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 43 RHRISDRFKILQSMVPGGT-KMDTASMLDEAINYVKFLKA 81
          R  I+D+   L+ +V G   KM  + +L +AI+Y+K+L+ 
Sbjct: 1  RSSINDKIIELKDLVMGTDAKMHKSGVLRKAIDYIKYLQQ 40


>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 63 Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Length = 83 Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Length = 88 Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 68 Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Length = 65 Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Length = 79 Back     information, alignment and structure
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Length = 80 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query169
d1am9a_80 SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} 99.64
d1a0aa_63 Pho4 B/HLH domain {Baker's yeast (Saccharomyces ce 99.64
d1nlwa_79 Mad protein {Human (Homo sapiens) [TaxId: 9606]} 99.63
d1mdya_68 Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10 99.59
d1nkpb_83 Max protein {Human (Homo sapiens) [TaxId: 9606]} 99.58
d1nkpa_88 Myc proto-oncogene protein {Human (Homo sapiens) [ 99.57
d1an4a_65 Usf B/HLH domain {Human (Homo sapiens) [TaxId: 960 99.54
d1uklc_61 SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} 99.53
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-1a
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.64  E-value=1.6e-16  Score=109.95  Aligned_cols=61  Identities=21%  Similarity=0.340  Sum_probs=55.7

Q ss_pred             CcccHHHHHHHHHHHHHHHHHHhcCCCC-CCCChhhHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 039972           33 DPQSVAARQRRHRISDRFKILQSMVPGG-TKMDTASMLDEAINYVKFLKAVIWFHENIINFA   93 (169)
Q Consensus        33 ~~h~~~ER~RR~kin~~~~~Lr~lvP~~-~K~dkasiL~~aI~YIk~Lq~~v~~L~~~~~~~   93 (169)
                      ..|+.+||+||++||++|..|++|||++ .|+||++||..||+||+.|+.+++.|......+
T Consensus         8 ~~H~~~Er~RR~~in~~~~~L~~llP~~~~k~~K~~iL~~a~~yI~~L~~~~~~L~~e~~~l   69 (80)
T d1am9a_           8 TAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENLSL   69 (80)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3699999999999999999999999985 799999999999999999999999888755544



>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure