Citrus Sinensis ID: 044343


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-
VPWNFSVASGDSEKGQSSRNENSFDFLESSASQLPTSWHQTSHSLDTDVDDFSDATEISEGMERKPDNRPKRTHSAEVHNQSERRRRDRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIGSLGQSASFISS
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccHEEccEEccccHHHccccccccEEcc
vpwnfsvasgdsekgqssrnensFDFLEssasqlptswhqtshsldtdvddfsdateisegmerkpdnrpkrthsaevhnqserRRRDRINKRLKSLKElvpncnksdrasvLDDAVNYVKALRHQLEMMSaqggaifqspfmspsghqstevpqispnvpispklgmgsgigtGMVDVLFSwssrdiigslgqsasfiss
vpwnfsvasgdsekgqssrNENSFDFLESSASQLPTSWHQTSHSLDTDVDDFSDATEisegmerkpdnrpkrthsaevhnqserrrrdriNKRLKslkelvpncnksdrasvlDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIgslgqsasfiss
VPWNFSVASGDSEKGQSSRNENSFDFLESSASQLPTSWHQTSHSLDTDVDDFSDATEISEGMERKPDNRPKRTHSAEVHNQSErrrrdrINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIGSLGQSASFISS
***************************************************************************************************************VLDDAVNYVKALRHQLE***************************************MGSGIGTGMVDVLFSWSSRDIIGS**********
*PWNF**********************************************************************************DRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLE*************************************************************************
*********************NSFDFLESSASQLPTSWHQTSHSLDTDVDDFSDATEISE**************************RDRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFM***********QISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIGSLGQSASFISS
************************************************************************************RRRDRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIGSLGQ*******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
VPWNFSVASGDSEKGQSSRNENSFDFLESSASQLPTSWHQTSHSLDTDVDDFSDATEISEGMERKPDNRPKRTHSAEVHxxxxxxxxxxxxxxxxxxxxxVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGSGIGTGMVDVLFSWSSRDIIGSLGQSASFISS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query201 2.2.26 [Sep-21-2011]
O80536524 Transcription factor PIF3 yes no 0.572 0.219 0.439 3e-20
Q8GZM7478 Transcription factor PIF1 no no 0.303 0.127 0.639 8e-19
Q8L5W8416 Transcription factor PIL1 no no 0.487 0.235 0.435 1e-17
Q9SVU7445 Putative transcription fa no no 0.497 0.224 0.398 1e-15
Q8GZ38399 Transcription factor UNE1 no no 0.303 0.152 0.557 1e-15
Q9SVU6413 Transcription factor bHLH no no 0.502 0.244 0.372 2e-15
Q9FUA4373 Transcription factor SPAT no no 0.417 0.225 0.416 2e-14
Q7XHI7307 Transcription factor bHLH no no 0.398 0.260 0.432 3e-14
Q8S3D5310 Transcription factor bHLH no no 0.368 0.238 0.486 1e-13
Q84LH8444 Transcription factor PIF5 no no 0.432 0.195 0.467 3e-13
>sp|O80536|PIF3_ARATH Transcription factor PIF3 OS=Arabidopsis thaliana GN=PIF3 PE=1 SV=1 Back     alignment and function desciption
 Score = 98.2 bits (243), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/123 (43%), Positives = 78/123 (63%), Gaps = 8/123 (6%)

Query: 32  SQLPTSWHQTSHSLDTDVDDFSDATEISEGMERKPDNRP-------KRTHSAEVHNQSER 84
           S+ P+   +  HS   D+D  S+  E   G  RK +  P       KR+ SAEVHN SER
Sbjct: 295 SESPSLSLKRKHSNIQDIDCHSEDVEEESGDGRK-EAGPSRTGLGSKRSRSAEVHNLSER 353

Query: 85  RRRDRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMS 144
           RRRDRIN+++++L+EL+PNCNK D+AS+LD+A+ Y+K+L+ Q+++MS   G       M 
Sbjct: 354 RRRDRINEKMRALQELIPNCNKVDKASMLDEAIEYLKSLQLQVQIMSMASGYYLPPAVMF 413

Query: 145 PSG 147
           P G
Sbjct: 414 PPG 416




Transcription factor acting positively in the phytochrome signaling pathway. Activates transcription by binding to the G box (5'-CACGTG-3').
Arabidopsis thaliana (taxid: 3702)
>sp|Q8GZM7|PIF1_ARATH Transcription factor PIF1 OS=Arabidopsis thaliana GN=PIF1 PE=1 SV=1 Back     alignment and function description
>sp|Q8L5W8|PIL1_ARATH Transcription factor PIL1 OS=Arabidopsis thaliana GN=PIL1 PE=1 SV=1 Back     alignment and function description
>sp|Q9SVU7|BH056_ARATH Putative transcription factor bHLH056 OS=Arabidopsis thaliana GN=BHLH56 PE=4 SV=2 Back     alignment and function description
>sp|Q8GZ38|UNE10_ARATH Transcription factor UNE10 OS=Arabidopsis thaliana GN=UNE10 PE=2 SV=1 Back     alignment and function description
>sp|Q9SVU6|BH023_ARATH Transcription factor bHLH23 OS=Arabidopsis thaliana GN=BHLH23 PE=2 SV=1 Back     alignment and function description
>sp|Q9FUA4|SPT_ARATH Transcription factor SPATULA OS=Arabidopsis thaliana GN=SPT PE=1 SV=1 Back     alignment and function description
>sp|Q7XHI7|BH127_ARATH Transcription factor bHLH127 OS=Arabidopsis thaliana GN=BHLH127 PE=2 SV=1 Back     alignment and function description
>sp|Q8S3D5|BH069_ARATH Transcription factor bHLH69 OS=Arabidopsis thaliana GN=BHLH69 PE=2 SV=2 Back     alignment and function description
>sp|Q84LH8|PIF5_ARATH Transcription factor PIF5 OS=Arabidopsis thaliana GN=PIF5 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query201
168023762 1015 predicted protein [Physcomitrella patens 0.527 0.104 0.504 3e-23
356575289 633 PREDICTED: transcription factor PIF3-lik 0.447 0.142 0.505 2e-21
356534971 665 PREDICTED: transcription factor PIF3-lik 0.562 0.169 0.434 7e-21
238015240 435 unknown [Zea mays] 0.592 0.273 0.436 1e-20
238008194 638 unknown [Zea mays] gi|413946840|gb|AFW79 0.572 0.180 0.442 2e-20
56784181 565 putative BP-5 protein [Oryza sativa Japo 0.547 0.194 0.445 2e-20
297606720 447 Os07g0143200 [Oryza sativa Japonica Grou 0.502 0.225 0.435 3e-20
297596562 637 Os01g0286100 [Oryza sativa Japonica Grou 0.547 0.172 0.445 3e-20
23495742 417 transcription factor BHLH9-like protein 0.502 0.242 0.435 3e-20
222618221 705 hypothetical protein OsJ_01348 [Oryza sa 0.547 0.156 0.445 3e-20
>gi|168023762|ref|XP_001764406.1| predicted protein [Physcomitrella patens subsp. patens] gi|162684270|gb|EDQ70673.1| predicted protein [Physcomitrella patens subsp. patens] Back     alignment and taxonomy information
 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 55/109 (50%), Positives = 82/109 (75%), Gaps = 3/109 (2%)

Query: 71  KRTHSAEVHNQSERRRRDRINKRLKSLKELVPNCNKSDRASVLDDAVNYVKALRHQLEMM 130
           KR+ +AEVHNQSERRRRDRIN+++++L+EL+PN NK+D+AS+LD+A++Y+K L+ QL+MM
Sbjct: 731 KRSRAAEVHNQSERRRRDRINEKMRALQELIPNSNKTDKASMLDEAIDYLKILQLQLQMM 790

Query: 131 SAQGGAIFQSPFMSPSGHQSTEVPQISPNVPISPKLGMGS-GIGTGMVD 178
           S + G     P + P G Q  ++PQ+ P V   P +GM   G+G GM++
Sbjct: 791 SIRTGMTL-PPMVMPPGLQHMQMPQM-PQVAAMPSMGMVQMGLGMGMME 837




Source: Physcomitrella patens subsp. patens

Species: Physcomitrella patens

Genus: Physcomitrella

Family: Funariaceae

Order: Funariales

Class: Bryopsida

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356575289|ref|XP_003555774.1| PREDICTED: transcription factor PIF3-like [Glycine max] Back     alignment and taxonomy information
>gi|356534971|ref|XP_003536024.1| PREDICTED: transcription factor PIF3-like [Glycine max] Back     alignment and taxonomy information
>gi|238015240|gb|ACR38655.1| unknown [Zea mays] Back     alignment and taxonomy information
>gi|238008194|gb|ACR35132.1| unknown [Zea mays] gi|413946840|gb|AFW79489.1| putative HLH DNA-binding domain superfamily protein [Zea mays] Back     alignment and taxonomy information
>gi|56784181|dbj|BAD81566.1| putative BP-5 protein [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|297606720|ref|NP_001058876.2| Os07g0143200 [Oryza sativa Japonica Group] gi|255677508|dbj|BAF20790.2| Os07g0143200 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|297596562|ref|NP_001042775.2| Os01g0286100 [Oryza sativa Japonica Group] gi|215717101|dbj|BAG95464.1| unnamed protein product [Oryza sativa Japonica Group] gi|255673123|dbj|BAF04689.2| Os01g0286100 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|23495742|dbj|BAC19953.1| transcription factor BHLH9-like protein [Oryza sativa Japonica Group] gi|24059945|dbj|BAC21408.1| transcription factor BHLH9-like protein [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|222618221|gb|EEE54353.1| hypothetical protein OsJ_01348 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query201
UNIPROTKB|Q5NAE0 565 P0498A12.33 "Putative BP-5 pro 0.636 0.226 0.397 1.2e-19
TAIR|locus:2012345524 PIF3 "AT1G09530" [Arabidopsis 0.835 0.320 0.337 8.5e-17
UNIPROTKB|Q8GRJ1417 OJ1343_B12.103 "Transcription 0.442 0.213 0.432 4.5e-16
TAIR|locus:2053733430 PIF4 "AT2G43010" [Arabidopsis 0.761 0.355 0.345 1e-15
UNIPROTKB|Q7FA23181 OSJNBa0058K23.6 "Os04g0618600 0.517 0.574 0.405 5.4e-15
TAIR|locus:2061634478 PIL5 "phytochrome interacting 0.567 0.238 0.391 6.2e-15
UNIPROTKB|Q948F6 298 OSJNBa0049O12.18 "Putative SPA 0.487 0.328 0.393 8.5e-15
TAIR|locus:2041369416 PIL1 "phytochrome interacting 0.537 0.259 0.383 9.4e-15
TAIR|locus:2126876399 UNE10 "AT4G00050" [Arabidopsis 0.681 0.343 0.346 1.1e-14
TAIR|locus:2077680444 PIL6 "AT3G59060" [Arabidopsis 0.731 0.331 0.318 3e-14
UNIPROTKB|Q5NAE0 P0498A12.33 "Putative BP-5 protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
 Score = 242 (90.2 bits), Expect = 1.2e-19, P = 1.2e-19
 Identities = 56/141 (39%), Positives = 85/141 (60%)

Query:    40 QTSHSLDTDVDDFSDATEISEGMERKPDNRP-KRTHSAEVHNQSEXXXXXXINKRLKSLK 98
             + S S D D+DD         G+ RK   R  KR+ +AEVHN SE      IN+++++L+
Sbjct:   284 ECSASQDDDLDD-------EPGVLRKSGTRSTKRSRTAEVHNLSERRRRDRINEKMRALQ 336

Query:    99 ELVPNCNKSDRASVLDDAVNYVKALRHQLEMMSAQGGAIFQSPFMSPSGHQSTEVPQISP 158
             EL+PNCNK D+AS+LD+A+ Y+K L+ Q++MMS  G  +   P + P+  Q  ++P ++ 
Sbjct:   337 ELIPNCNKIDKASMLDEAIEYLKTLQLQVQMMS-MGTGLCIPPMLLPTAMQHLQIPPMAH 395

Query:   159 NVPISPKLGMGSGIGTGMVDV 179
                  P LGMG G G G+ D+
Sbjct:   396 ----FPHLGMGLGYGMGVFDM 412




GO:0005634 "nucleus" evidence=IC
TAIR|locus:2012345 PIF3 "AT1G09530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q8GRJ1 OJ1343_B12.103 "Transcription factor BHLH9-like protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2053733 PIF4 "AT2G43010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q7FA23 OSJNBa0058K23.6 "Os04g0618600 protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2061634 PIL5 "phytochrome interacting factor 3-like 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q948F6 OSJNBa0049O12.18 "Putative SPATULA" [Oryza sativa (taxid:4530)] Back     alignment and assigned GO terms
TAIR|locus:2041369 PIL1 "phytochrome interacting factor 3-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126876 UNE10 "AT4G00050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077680 PIL6 "AT3G59060" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query201
smart0035353 smart00353, HLH, helix loop helix domain 1e-14
cd0008360 cd00083, HLH, Helix-loop-helix domain, found in sp 2e-14
pfam0001052 pfam00010, HLH, Helix-loop-helix DNA-binding domai 2e-13
>gnl|CDD|197674 smart00353, HLH, helix loop helix domain Back     alignment and domain information
 Score = 65.3 bits (160), Expect = 1e-14
 Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 80  NQSERRRRDRINKRLKSLKELVPNCN---KSDRASVLDDAVNYVKALRHQLE 128
           N  ERRRR +IN+    L+ L+P      K  +A +L  A+ Y+K+L+ +L+
Sbjct: 1   NARERRRRRKINEAFDELRSLLPTLPKNKKLSKAEILRLAIEYIKSLQEELQ 52


Length = 53

>gnl|CDD|238036 cd00083, HLH, Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>gnl|CDD|215654 pfam00010, HLH, Helix-loop-helix DNA-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 201
PF0001055 HLH: Helix-loop-helix DNA-binding domain only nucl 99.59
cd0008360 HLH Helix-loop-helix domain, found in specific DNA 99.57
smart0035353 HLH helix loop helix domain. 99.5
KOG3561 803 consensus Aryl-hydrocarbon receptor nuclear transl 99.45
KOG1318411 consensus Helix loop helix transcription factor EB 99.41
KOG1319229 consensus bHLHZip transcription factor BIGMAX [Tra 99.28
KOG4304250 consensus Transcriptional repressors of the hairy/ 99.15
KOG2483232 consensus Upstream transcription factor 2/L-myc-2 98.83
KOG2588 953 consensus Predicted DNA-binding protein [Transcrip 98.63
KOG3960284 consensus Myogenic helix-loop-helix transcription 98.53
KOG0561 373 consensus bHLH transcription factor [Transcription 98.34
KOG4029228 consensus Transcription factor HAND2/Transcription 98.3
PLN0321793 transcription factor ATBS1; Provisional 98.17
KOG3910632 consensus Helix loop helix transcription factor [T 97.71
KOG4447173 consensus Transcription factor TWIST [Transcriptio 96.79
KOG3898254 consensus Transcription factor NeuroD and related 95.8
KOG3560 712 consensus Aryl-hydrocarbon receptor [Transcription 95.03
KOG4395285 consensus Transcription factor Atonal, contains HT 95.02
KOG3558 768 consensus Hypoxia-inducible factor 1/Neuronal PAS 92.31
KOG3559 598 consensus Transcriptional regulator SIM1 [Transcri 91.35
>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt) Back     alignment and domain information
Probab=99.59  E-value=2.2e-15  Score=101.45  Aligned_cols=49  Identities=49%  Similarity=0.899  Sum_probs=46.6

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHHHhhcCCC-----CCcchHhHHHHHHHHHHHHH
Q 044343           76 AEVHNQSERRRRDRINKRLKSLKELVPNC-----NKSDRASVLDDAVNYVKALR  124 (201)
Q Consensus        76 ~~~H~~~ER~RR~~in~~~~~Lr~lvP~~-----~K~dKasiL~~AI~YIk~Lq  124 (201)
                      |..|+..||+||++||++|.+|+.+||.+     .|++|++||+.||+||++||
T Consensus         2 R~~h~~~Er~RR~~i~~~~~~L~~llp~~~~~~~~k~~K~~iL~~ai~yI~~Lq   55 (55)
T PF00010_consen    2 RQKHNERERRRRDRINDCFDELRELLPSCSAGSSRKLSKASILQKAIDYIKQLQ   55 (55)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHCCSSHHCCTTSSSSHHHHHHHHHHHHHHHH
T ss_pred             CchHHHHHHHHHHHHHHHHHHHHHhccchhccccccCCHHHHHHHHHHHHHHhC
Confidence            67899999999999999999999999987     38999999999999999997



; InterPro: IPR011598 The helix-loop-helix (HLH) DNA-binding domain consists of a closed bundle of four helices in a left-handed twist with two crossover connections. The HLH domain directs dimerisation, and is juxtaposed to basic regions to create a DNA interaction interface surface that recognises specific DNA sequences. Basic region/HLH (bHLH) proteins regulate diverse biological pathways []. bHLH proteins include MyoD [], SREBPs (sterol regulatory element binding proteins) [], and yeast Pho4 (phosphatase system) []. In certain proteins the bHLH domain contains a leucine-zipper motif. The bHLH/leucine zipper (bHLHZip) domain specifies dimerisation within a network of proteins and determines sequence-specific DNA binding []. bHLHZip domains occur in the transcription factors Myc, Mad, Max and Usf [, ]. This entry is bHLHZip, which covers the bHLH domain and the leucine zipper motif, when present.; PDB: 1NLW_A 1NKP_D 1A93_A 2A93_A 1AM9_C 3U5V_A 1A0A_B 2QL2_C 1UKL_C 1AN4_B ....

>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long Back     alignment and domain information
>smart00353 HLH helix loop helix domain Back     alignment and domain information
>KOG3561 consensus Aryl-hydrocarbon receptor nuclear translocator [Transcription] Back     alignment and domain information
>KOG1318 consensus Helix loop helix transcription factor EB [Transcription] Back     alignment and domain information
>KOG1319 consensus bHLHZip transcription factor BIGMAX [Transcription] Back     alignment and domain information
>KOG4304 consensus Transcriptional repressors of the hairy/E(spl) family (contains HLH) [Transcription] Back     alignment and domain information
>KOG2483 consensus Upstream transcription factor 2/L-myc-2 protein [Transcription] Back     alignment and domain information
>KOG2588 consensus Predicted DNA-binding protein [Transcription] Back     alignment and domain information
>KOG3960 consensus Myogenic helix-loop-helix transcription factor [Transcription] Back     alignment and domain information
>KOG0561 consensus bHLH transcription factor [Transcription] Back     alignment and domain information
>KOG4029 consensus Transcription factor HAND2/Transcription factor TAL1/TAL2/LYL1 [Transcription] Back     alignment and domain information
>PLN03217 transcription factor ATBS1; Provisional Back     alignment and domain information
>KOG3910 consensus Helix loop helix transcription factor [Transcription] Back     alignment and domain information
>KOG4447 consensus Transcription factor TWIST [Transcription] Back     alignment and domain information
>KOG3898 consensus Transcription factor NeuroD and related HTH proteins [Transcription] Back     alignment and domain information
>KOG3560 consensus Aryl-hydrocarbon receptor [Transcription] Back     alignment and domain information
>KOG4395 consensus Transcription factor Atonal, contains HTH domain [Transcription] Back     alignment and domain information
>KOG3558 consensus Hypoxia-inducible factor 1/Neuronal PAS domain protein NPAS1 [Signal transduction mechanisms; Transcription] Back     alignment and domain information
>KOG3559 consensus Transcriptional regulator SIM1 [Transcription] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query201
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 5e-18
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 1e-14
1hlo_A80 Protein (transcription factor MAX); transcriptiona 5e-14
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 6e-14
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 1e-13
1nlw_A80 MAD protein, MAX dimerizer; transcription factor, 4e-10
4f3l_B 387 BMAL1B; BHLH, PAS, circadian rhythm proteins, tran 5e-08
4f3l_A 361 Mclock, circadian locomoter output cycles protein 5e-08
1a0a_A63 BHLH, protein (phosphate system positive regulator 1e-07
3u5v_A76 Protein MAX, transcription factor E2-alpha chimer; 2e-04
1mdy_A68 Protein (MYOD BHLH domain); protein-DNA complex, t 4e-04
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Length = 82 Back     alignment and structure
 Score = 74.0 bits (182), Expect = 5e-18
 Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 71  KRTHSAEVHNQSERRRRDRINKRLKSLKELVPNCN-KSDRASVLDDAVNYVKALRHQLEM 129
            R      HN  E+R R  IN ++  LK+LV     K ++++VL  A++Y++ L+H  + 
Sbjct: 2   SRGEKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQK 61

Query: 130 MSAQ 133
           +  +
Sbjct: 62  LKQE 65


>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Length = 83 Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Length = 80 Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Length = 88 Back     alignment and structure
>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Length = 65 Back     alignment and structure
>1nlw_A MAD protein, MAX dimerizer; transcription factor, DNA, BHLHZ, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.38.1.1 Length = 80 Back     alignment and structure
>4f3l_B BMAL1B; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Length = 387 Back     alignment and structure
>4f3l_A Mclock, circadian locomoter output cycles protein kaput; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Length = 361 Back     alignment and structure
>1a0a_A BHLH, protein (phosphate system positive regulatory protein PHO4); transcription factor, basic helix loop helix; HET: DNA; 2.80A {Saccharomyces cerevisiae} SCOP: a.38.1.1 Length = 63 Back     alignment and structure
>3u5v_A Protein MAX, transcription factor E2-alpha chimer; basic helix-loop-helix (BHLH); 1.70A {Mus musculus} PDB: 2ql2_A* Length = 76 Back     alignment and structure
>1mdy_A Protein (MYOD BHLH domain); protein-DNA complex, transcription/DNA complex; HET: DNA; 2.80A {Mus musculus} SCOP: a.38.1.1 PDB: 1mdy_B* Length = 68 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query201
1am9_A82 Srebp-1A, protein (sterol regulatory element bindi 99.75
4h10_B71 Circadian locomoter output cycles protein kaput; B 99.72
4ati_A118 MITF, microphthalmia-associated transcription fact 99.69
1an4_A65 Protein (upstream stimulatory factor); protein-DNA 99.68
1a0a_A63 BHLH, protein (phosphate system positive regulator 99.67
4h10_A73 ARYL hydrocarbon receptor nuclear translocator-LI 99.67
1hlo_A80 Protein (transcription factor MAX); transcriptiona 99.66
1nkp_B83 MAX protein, MYC proto-oncogene protein; transcrip 99.65
1nkp_A88 C-MYC, MYC proto-oncogene protein; transcription, 99.62
3u5v_A76 Protein MAX, transcription factor E2-alpha chimer; 99.61
1nlw_A80 MAD protein, MAX dimerizer; transcription factor, 99.6
1mdy_A68 Protein (MYOD BHLH domain); protein-DNA complex, t 99.48
2ql2_B60 Neurod1, neurogenic differentiation factor 1; basi 99.43
4f3l_A 361 Mclock, circadian locomoter output cycles protein 99.38
4f3l_B 387 BMAL1B; BHLH, PAS, circadian rhythm proteins, tran 99.37
2lfh_A68 DNA-binding protein inhibitor ID-3; structural gen 99.08
4ath_A83 MITF, microphthalmia-associated transcription fact 99.04
4aya_A97 DNA-binding protein inhibitor ID-2; cell cycle; 2. 98.69
>1am9_A Srebp-1A, protein (sterol regulatory element binding protein 1A); basic-helix-loop- helix-leucine zipper, transcription factor; HET: DNA; 2.30A {Homo sapiens} SCOP: a.38.1.1 PDB: 1ukl_C Back     alignment and structure
Probab=99.75  E-value=1.6e-18  Score=125.64  Aligned_cols=63  Identities=29%  Similarity=0.488  Sum_probs=58.3

Q ss_pred             ccchhhhhHHHHHHHHHHHHHHHHHHhhcCCC-CCcchHhHHHHHHHHHHHHHHHHHHHhhccC
Q 044343           73 THSAEVHNQSERRRRDRINKRLKSLKELVPNC-NKSDRASVLDDAVNYVKALRHQLEMMSAQGG  135 (201)
Q Consensus        73 ~~~~~~H~~~ER~RR~~in~~~~~Lr~lvP~~-~K~dKasiL~~AI~YIk~Lq~~v~~l~~~~~  135 (201)
                      ..++..|+.+||+||++||++|.+|+.+||++ .|++|++||.+||+||++||.+++.|+.++.
T Consensus         4 ~~rr~~H~~~ErrRR~~in~~f~~L~~lvP~~~~k~~Ka~IL~~Ai~YI~~Lq~~~~~L~~e~~   67 (82)
T 1am9_A            4 GEKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQENL   67 (82)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hHHHHhhhhHHHHHHHHHHHHHHHHHHhccCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34678999999999999999999999999998 6999999999999999999999999987653



>4h10_B Circadian locomoter output cycles protein kaput; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>4ati_A MITF, microphthalmia-associated transcription factor; DNA-binding protein-DNA complex, melanoma; 2.60A {Mus musculus} PDB: 4atk_A Back     alignment and structure
>1an4_A Protein (upstream stimulatory factor); protein-DNA complex, double helix, overhanging base, transcription/DNA complex; HET: DNA; 2.90A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1a0a_A BHLH, protein (phosphate system positive regulatory protein PHO4); transcription factor, basic helix loop helix; HET: DNA; 2.80A {Saccharomyces cerevisiae} SCOP: a.38.1.1 Back     alignment and structure
>4h10_A ARYL hydrocarbon receptor nuclear translocator-LI 1; BHLH, circadian transcription, transcription-DNA complex; 2.40A {Homo sapiens} Back     alignment and structure
>1hlo_A Protein (transcription factor MAX); transcriptional regulation, DNA binding, complex (transcription factor MAX/DNA), transcription/DNA complex; HET: DNA; 2.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1nkp_B MAX protein, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 PDB: 1an2_A* 1r05_A 1nlw_B Back     alignment and structure
>1nkp_A C-MYC, MYC proto-oncogene protein; transcription, DNA, BHLHZ, heterodimer, transcription/DNA complex; 1.80A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>3u5v_A Protein MAX, transcription factor E2-alpha chimer; basic helix-loop-helix (BHLH); 1.70A {Mus musculus} PDB: 2ql2_A* Back     alignment and structure
>1nlw_A MAD protein, MAX dimerizer; transcription factor, DNA, BHLHZ, transcription/DNA complex; 2.00A {Homo sapiens} SCOP: a.38.1.1 Back     alignment and structure
>1mdy_A Protein (MYOD BHLH domain); protein-DNA complex, transcription/DNA complex; HET: DNA; 2.80A {Mus musculus} SCOP: a.38.1.1 PDB: 1mdy_B* Back     alignment and structure
>2ql2_B Neurod1, neurogenic differentiation factor 1; basic-helix-loop-helix; HET: DNA; 2.50A {Mus musculus} Back     alignment and structure
>4f3l_A Mclock, circadian locomoter output cycles protein kaput; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>4f3l_B BMAL1B; BHLH, PAS, circadian rhythm proteins, transcription-activato; 2.27A {Mus musculus} Back     alignment and structure
>2lfh_A DNA-binding protein inhibitor ID-3; structural genomics, northeast structural genomics consortiu PSI-biology, protein structure initiative; NMR {Homo sapiens} Back     alignment and structure
>4ath_A MITF, microphthalmia-associated transcription factor; DNA binding protein, melanoma; HET: MSE; 1.95A {Mus musculus} Back     alignment and structure
>4aya_A DNA-binding protein inhibitor ID-2; cell cycle; 2.10A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 201
d1nkpb_83 a.38.1.1 (B:) Max protein {Human (Homo sapiens) [T 6e-16
d1nkpa_88 a.38.1.1 (A:) Myc proto-oncogene protein {Human (H 2e-15
d1mdya_68 a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus muscul 1e-14
d1a0aa_63 a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Sa 1e-14
d1nlwa_79 a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [T 4e-14
d1am9a_80 a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxI 8e-14
d1an4a_65 a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapien 1e-13
d1uklc_61 a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId 2e-13
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Length = 83 Back     information, alignment and structure

class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: Max protein
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 67.6 bits (165), Expect = 6e-16
 Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 6/65 (9%)

Query: 71  KRTHSAEVHNQSERRRRDRINKRLKSLKELVPNCN--KSDRASVLDDAVNYVKALRHQLE 128
           KR H    HN  ER+RRD I     SL++ VP+    K+ RA +LD A  Y++ +R +  
Sbjct: 2   KRAH----HNALERKRRDHIKDSFHSLRDSVPSLQGEKASRAQILDKATEYIQYMRRKNH 57

Query: 129 MMSAQ 133
                
Sbjct: 58  THQQD 62


>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Length = 88 Back     information, alignment and structure
>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 68 Back     information, alignment and structure
>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 63 Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Length = 79 Back     information, alignment and structure
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Length = 80 Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Length = 65 Back     information, alignment and structure
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query201
d1am9a_80 SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} 99.69
d1mdya_68 Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10 99.67
d1a0aa_63 Pho4 B/HLH domain {Baker's yeast (Saccharomyces ce 99.67
d1nlwa_79 Mad protein {Human (Homo sapiens) [TaxId: 9606]} 99.67
d1nkpb_83 Max protein {Human (Homo sapiens) [TaxId: 9606]} 99.63
d1nkpa_88 Myc proto-oncogene protein {Human (Homo sapiens) [ 99.62
d1an4a_65 Usf B/HLH domain {Human (Homo sapiens) [TaxId: 960 99.6
d1uklc_61 SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} 99.49
>d1am9a_ a.38.1.1 (A:) SREBP-1a {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: All alpha proteins
fold: HLH-like
superfamily: HLH, helix-loop-helix DNA-binding domain
family: HLH, helix-loop-helix DNA-binding domain
domain: SREBP-1a
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.69  E-value=1.2e-17  Score=118.50  Aligned_cols=61  Identities=30%  Similarity=0.540  Sum_probs=56.6

Q ss_pred             cchhhhhHHHHHHHHHHHHHHHHHHhhcCCCC-CcchHhHHHHHHHHHHHHHHHHHHHhhcc
Q 044343           74 HSAEVHNQSERRRRDRINKRLKSLKELVPNCN-KSDRASVLDDAVNYVKALRHQLEMMSAQG  134 (201)
Q Consensus        74 ~~~~~H~~~ER~RR~~in~~~~~Lr~lvP~~~-K~dKasiL~~AI~YIk~Lq~~v~~l~~~~  134 (201)
                      .+|..|+.+||+||++||++|.+|+.|||++. |++|++||..||+||++|+.+++.|..++
T Consensus         5 ~kR~~H~~~Er~RR~~in~~~~~L~~llP~~~~k~~K~~iL~~a~~yI~~L~~~~~~L~~e~   66 (80)
T d1am9a_           5 EKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQKLKQEN   66 (80)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHTCSSCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35778999999999999999999999999865 99999999999999999999999998765



>d1mdya_ a.38.1.1 (A:) Myod B/HLH domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1a0aa_ a.38.1.1 (A:) Pho4 B/HLH domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1nlwa_ a.38.1.1 (A:) Mad protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpb_ a.38.1.1 (B:) Max protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1nkpa_ a.38.1.1 (A:) Myc proto-oncogene protein {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1an4a_ a.38.1.1 (A:) Usf B/HLH domain {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1uklc_ a.38.1.1 (C:) SREBP-2 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure