Citrus Sinensis ID: 017877


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360----
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEALLSKLPSVVPPPPSGDYGESQPPLMSPFVSAAHQRPMEFIQAEKVAKEDTDEDYCRRPHDRDVGECSSSMSGYQHHFHHHQN
ccccccccHHHHHHHHHHHHHHcccccccEEEEEEEEccccccccccccccccccccccccccEEEEccccccccccHHHHccccccccccccccccccccccccHHHHHHHcccEEcccccccccEEEccccEEEEcccccccHHHHHHcccccccccHHHHHHHHccccEEEEEEEcccEEEEccccHHcccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHccccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccc
cHHcccHHHHHHHHHHHHHHHHHccccccEEEEEEEEcccccccccccccccccccccccccEEEEEcccccccHHHHHHHHHccccccccccccccccccccccHHHHHHHHHHHcccccccccEEEEcccEEEEEcccccccccHHHHHHcccccccHHHHHHHHcccEEEEEEEccccEEEEcccccccccHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHccccccccccccccccccccccccccccHHHcHHHHHHcHEcccccccHcccccccccccccccccccHHHHHcccccc
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRilprnypppkwdgqgaydrsrgnrrnwILVWEDGFCNFAAStaaeinsgdcpsssvygncefqhyqglqpeLFFKMSHEIYNYGEGLIGKVaadhshkwiykepndqeiNFLSAWhnaadshprtwEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFsyiesipgvllphpsssaypfkidgyhapaeawhhyqiqnplaaalpldhhhhqyydhfnqpssamkitpSMSSLEALLSklpsvvppppsgdygesqpplmspfvsaahqrpmEFIQAEKvakedtdedycrrphdrdvgecsssmsgyqhhfhhhqn
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLgavnkvvedLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEALLSKLPSVVPPPPSGDYGESQPPLMSPFVSAAHQRPMEFIQAekvakedtdedycRRPHDRDVGECSssmsgyqhhFHHHQN
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNplaaalpldhhhhQYYDHFNQPSSAMKITPSMSSLEALLSKLpsvvppppsGDYGESQPPLMSPFVSAAHQRPMEFIQAEKVAKEDTDEDYCRRPHDRDVGECSSSMSGYQHHFHHHQN
*******LAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNPLA********************************************************************************************************************
************LLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDR*RGNRRNWILVWEDGFCNFA***************************GLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIES****************************************************************************************************************************DYCRR*************************
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEALLSKLPSVVP*********SQPPLMSPFVSAAHQRPMEFIQAEK************RPHDRDVGECSSSMSGYQHHFHHHQN
***HLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAA********************YQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLL*****************************************************A*KI*PSMSSLEALLSKLPSVVPPPP*********PLMSPFVSAAHQRPMEFIQAEKVAKEDT********************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHHYQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEALLSKLPSVVPPPPSGDYGESQPPLMSPFVSAAHQRPMEFIQAEKVAKEDTDEDYCRRPHDRDVGECSSSMSGYQHHFHHHQN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query364 2.2.26 [Sep-21-2011]
P0C7P8 734 Transcription factor EMB1 no no 0.387 0.192 0.303 1e-15
Q58G01 720 Transcription factor bHLH no no 0.384 0.194 0.307 2e-13
Q9XIN0 650 Transcription factor LHW no no 0.456 0.255 0.278 3e-11
O23090423 Transcription factor bHLH no no 0.423 0.364 0.256 6e-05
>sp|P0C7P8|LHWL1_ARATH Transcription factor EMB1444 OS=Arabidopsis thaliana GN=EMB1444 PE=2 SV=1 Back     alignment and function desciption
 Score = 84.3 bits (207), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 92/211 (43%), Gaps = 70/211 (33%)

Query: 10  MTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWED 69
           M + LQ  LRS+C   N+ W YAVFW++   N+  P                  +L  ED
Sbjct: 1   MGYTLQQILRSIC--SNTDWNYAVFWKL---NHHSP-----------------MVLTLED 38

Query: 70  GFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELF--------------FKMSHE 115
            +C                           H +GL PE                 KMS+ 
Sbjct: 39  VYC-------------------------VNHERGLMPESLHGGRHAHDPLGLAVAKMSYH 73

Query: 116 IYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIAL 175
           +++ GEG++G+VA    H+WI+ E       +L+  H+    H   WE+Q  +GIKTI +
Sbjct: 74  VHSLGEGIVGQVAISGQHQWIFSE-------YLNDSHSTLQVH-NGWESQISAGIKTILI 125

Query: 176 IAVRE-GVVQLGAVNKVVEDLNYVVLLRKKF 205
           +AV   GVVQLG++ KV ED   V  +R  F
Sbjct: 126 VAVGSCGVVQLGSLCKVEEDPALVTHIRHLF 156




Transcription factor that may regulate root development.
Arabidopsis thaliana (taxid: 3702)
>sp|Q58G01|LHWL3_ARATH Transcription factor bHLH155 OS=Arabidopsis thaliana GN=BHLH155 PE=2 SV=1 Back     alignment and function description
>sp|Q9XIN0|LHW_ARATH Transcription factor LHW OS=Arabidopsis thaliana GN=LHW PE=1 SV=1 Back     alignment and function description
>sp|O23090|BH014_ARATH Transcription factor bHLH14 OS=Arabidopsis thaliana GN=BHLH14 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query364
255560027366 conserved hypothetical protein [Ricinus 0.953 0.948 0.773 1e-156
224108141366 predicted protein [Populus trichocarpa] 0.914 0.909 0.775 1e-150
388491326366 unknown [Lotus japonicus] 0.923 0.918 0.764 1e-147
356497894376 PREDICTED: uncharacterized protein LOC10 0.945 0.914 0.750 1e-146
225424721346 PREDICTED: uncharacterized protein LOC10 0.914 0.962 0.757 1e-145
357485981362 Basic helix-loop-helix protein, putative 0.939 0.944 0.745 1e-144
255637625385 unknown [Glycine max] 0.903 0.854 0.713 1e-141
356501548385 PREDICTED: uncharacterized protein LOC10 0.909 0.859 0.720 1e-141
147780838346 hypothetical protein VITISV_035287 [Viti 0.876 0.921 0.699 1e-130
18406603386 serine/threonine-protein kinase WNK (Wit 0.796 0.751 0.767 1e-127
>gi|255560027|ref|XP_002521032.1| conserved hypothetical protein [Ricinus communis] gi|223539869|gb|EEF41449.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 284/367 (77%), Positives = 309/367 (84%), Gaps = 20/367 (5%)

Query: 1   MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR 60
           MEEHL+PLA+THLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR
Sbjct: 1   MEEHLSPLAVTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR 60

Query: 61  RNWILVWEDGFCNFAASTAAEINSGDCPSSSVYGNCEFQHYQGLQPELFFKMSHEIYNYG 120
           RNWILVWEDGFCNF+AS AAEINSGDCPSSS YGNCEFQ YQGLQPELFFKMSHEIYNYG
Sbjct: 61  RNWILVWEDGFCNFSAS-AAEINSGDCPSSSAYGNCEFQPYQGLQPELFFKMSHEIYNYG 119

Query: 121 EGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVRE 180
           EGLIGKVAADHSHKWIYKEPNDQEINFLS+WHN+ADSHPRTWEAQFQSGIKTIALIAVRE
Sbjct: 120 EGLIGKVAADHSHKWIYKEPNDQEINFLSSWHNSADSHPRTWEAQFQSGIKTIALIAVRE 179

Query: 181 GVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAEAWHH 240
           GVVQLGAV+KV+EDL+YVVLLRKKFSYIESIPGVLLPHPSSSA+P+K+DGY  P ++WH+
Sbjct: 180 GVVQLGAVHKVIEDLSYVVLLRKKFSYIESIPGVLLPHPSSSAFPYKVDGYGTP-DSWHY 238

Query: 241 YQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEALLSKLPSVVPPPPSGDYG 300
                P     P +++H    DHFNQ     KITPSMSSLEALLSKLPSVVPP P   Y 
Sbjct: 239 QGTIAP-----PTEYYH----DHFNQ--VPFKITPSMSSLEALLSKLPSVVPPQPGSGYC 287

Query: 301 ESQPPLMSPFVSAAHQRPMEFIQAEKVAKEDTDEDYCRRPHDRDVGECSSSMSGY---QH 357
            ++    S ++S   QRP  F+  EKVAKE+ DEDY R  H  D+GE SSSMS Y   Q 
Sbjct: 288 TTESQPQSQYLST--QRPAAFMGMEKVAKEEIDEDYNRTEH--DMGETSSSMSAYRRQQF 343

Query: 358 HFHHHQN 364
           H HH Q+
Sbjct: 344 HQHHGQD 350




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224108141|ref|XP_002314734.1| predicted protein [Populus trichocarpa] gi|222863774|gb|EEF00905.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388491326|gb|AFK33729.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|356497894|ref|XP_003517791.1| PREDICTED: uncharacterized protein LOC100816910 [Glycine max] Back     alignment and taxonomy information
>gi|225424721|ref|XP_002265760.1| PREDICTED: uncharacterized protein LOC100258629 [Vitis vinifera] Back     alignment and taxonomy information
>gi|357485981|ref|XP_003613278.1| Basic helix-loop-helix protein, putative [Medicago truncatula] gi|355514613|gb|AES96236.1| Basic helix-loop-helix protein, putative [Medicago truncatula] Back     alignment and taxonomy information
>gi|255637625|gb|ACU19137.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356501548|ref|XP_003519586.1| PREDICTED: uncharacterized protein LOC100799671 [Glycine max] Back     alignment and taxonomy information
>gi|147780838|emb|CAN65927.1| hypothetical protein VITISV_035287 [Vitis vinifera] Back     alignment and taxonomy information
>gi|18406603|ref|NP_564753.1| serine/threonine-protein kinase WNK (With No lysine)-related protein [Arabidopsis thaliana] gi|4249385|gb|AAD14482.1|AAD14482 T2K10.11 [Arabidopsis thaliana] gi|21537011|gb|AAM61352.1| unknown [Arabidopsis thaliana] gi|225898034|dbj|BAH30349.1| hypothetical protein [Arabidopsis thaliana] gi|332195533|gb|AEE33654.1| serine/threonine-protein kinase WNK (With No lysine)-related protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query364
TAIR|locus:2202735386 AT1G60060 "AT1G60060" [Arabido 0.777 0.733 0.754 1.3e-118
TAIR|locus:2154719377 AT5G53900 "AT5G53900" [Arabido 0.497 0.480 0.390 1.3e-31
TAIR|locus:2087070362 AT3G15240 "AT3G15240" [Arabido 0.450 0.453 0.373 3.6e-25
TAIR|locus:2042551 720 CPUORF7 "conserved peptide ups 0.239 0.120 0.427 7.5e-14
UNIPROTKB|Q2QXE3 881 LOC_Os12g06330 "Expressed prot 0.329 0.136 0.338 2.7e-09
TAIR|locus:2039580 650 LHW "AT2G27230" [Arabidopsis t 0.266 0.149 0.283 2.1e-07
UNIPROTKB|Q5N9E6 904 P0505D12.13 "BHLH transcriptio 0.521 0.210 0.272 1.3e-05
TAIR|locus:2202735 AT1G60060 "AT1G60060" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1168 (416.2 bits), Expect = 1.3e-118, P = 1.3e-118
 Identities = 230/305 (75%), Positives = 250/305 (81%)

Query:     1 MEEHLNPLAMTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR 60
             MEEHLNPLA+THLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR
Sbjct:     1 MEEHLNPLAVTHLLQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNR 60

Query:    61 RNWILVWEDGFCNFAASTAAEINSGDCPS----SSVYGNCEFQHYQGLQPELFFKMSHEI 116
             RNWILVWEDGFCNFAAS AAE++SG+       S+ YGN +FQ YQGLQPELFFKMSHEI
Sbjct:    61 RNWILVWEDGFCNFAAS-AAEMSSGEGSGGGGGSAAYGNSDFQQYQGLQPELFFKMSHEI 119

Query:   117 YNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALI 176
             YNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHN+ADS+PRTWEAQFQSGIKTIALI
Sbjct:   120 YNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNSADSYPRTWEAQFQSGIKTIALI 179

Query:   177 AVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPFKIDGYHAPAE 236
             +VREGVVQLGAV+KV+EDL+YVV+LRKK SYIESIPGVLLPHPSSS YPF I+   +P++
Sbjct:   180 SVREGVVQLGAVHKVIEDLSYVVMLRKKLSYIESIPGVLLPHPSSSGYPF-INA--SPSD 236

Query:   237 AWHHYQI----QNXXXXXXXXXXXXXQYYDHFNQPSS----------AMKITPSMSSLEA 282
              WH   +    Q                  H NQP +          +MKITPSMSSLEA
Sbjct:   237 TWHFPGVAPPHQQPEHQFYHSDHNHRFLIGHHNQPQAVGGAAPPLPLSMKITPSMSSLEA 296

Query:   283 LLSKL 287
             LLSKL
Sbjct:   297 LLSKL 301




GO:0003674 "molecular_function" evidence=ND
GO:0005576 "extracellular region" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2154719 AT5G53900 "AT5G53900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2087070 AT3G15240 "AT3G15240" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2042551 CPUORF7 "conserved peptide upstream open reading frame 7" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q2QXE3 LOC_Os12g06330 "Expressed protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2039580 LHW "AT2G27230" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q5N9E6 P0505D12.13 "BHLH transcription factor-like protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query364
pfam14215171 pfam14215, bHLH-MYC_N, bHLH-MYC and R2R3-MYB trans 1e-53
>gnl|CDD|222598 pfam14215, bHLH-MYC_N, bHLH-MYC and R2R3-MYB transcription factors N-terminal Back     alignment and domain information
 Score =  174 bits (443), Expect = 1e-53
 Identities = 54/207 (26%), Positives = 76/207 (36%), Gaps = 51/207 (24%)

Query: 14  LQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCN 73
           LQ  L++L   E+ QW YA+FW++ P                      + +L W DG+ N
Sbjct: 1   LQQRLQALV--ESEQWSYAIFWQLSPDQ--------------------SGVLGWGDGYYN 38

Query: 74  FAASTAAEINSGDCPSSSVYGNCEFQ--------HYQGLQPELFFK-------MSHEIYN 118
               T   +              E                 +L          M+   + 
Sbjct: 39  GEIKTRKTVAEELGLQRK-KVLRELYSLLSGSRDAAALSPEDLTDTEWFYLVSMTF-SFP 96

Query: 119 YGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAV 178
            GEGL GK  A  SH W+         N             R+W AQ  +GI+TI  I V
Sbjct: 97  PGEGLPGKAFASGSHIWLSGANELDSSNC-----------SRSWLAQS-AGIQTIVCIPV 144

Query: 179 REGVVQLGAVNKVVEDLNYVVLLRKKF 205
             GVV+LG+  K+ EDLN V  ++  F
Sbjct: 145 PGGVVELGSTEKIPEDLNLVQHVKSLF 171


This is the N-terminal region of a family of MYB and MYC transcription factors. The DNA-binding HLH domain is further downstream, pfam00010. Members of the MYB and MYC family regulate the biosynthesis of phenylpropanoids in several plant species (DOI:10.1007/s11295-009-0232-y). Length = 171

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 364
PF14215163 bHLH-MYC_N: bHLH-MYC and R2R3-MYB transcription fa 100.0
>PF14215 bHLH-MYC_N: bHLH-MYC and R2R3-MYB transcription factors N-terminal Back     alignment and domain information
Probab=100.00  E-value=7.3e-50  Score=356.34  Aligned_cols=155  Identities=39%  Similarity=0.632  Sum_probs=129.6

Q ss_pred             HHHHHHHhccCCCCCceEEEEeeecCCCCCCCCCCCCCccccCCCCCCceeEEecCccccCCccchhhcC------CCC-
Q 017877           14 LQHTLRSLCIHENSQWVYAVFWRILPRNYPPPKWDGQGAYDRSRGNRRNWILVWEDGFCNFAASTAAEIN------SGD-   86 (364)
Q Consensus        14 LQ~~LrsLc~~~~~~WtYAVFWqisprn~ppP~w~~~g~~D~S~~~sg~~iLvWgDGycng~~~~~~E~~------~~~-   86 (364)
                      |||+||+||+  +++|+||||||+++++                     ++|+||||||++++++++..+      +.. 
T Consensus         1 Lq~~Lr~lv~--~~~W~YaVFWk~~~~~---------------------~~L~W~DG~~~g~~~~~~~~~~~~~~~~~~l   57 (163)
T PF14215_consen    1 LQQRLRSLVE--NSQWTYAVFWKLSPDN---------------------SVLVWGDGYCNGPKETRKNGEEEQEQRSKVL   57 (163)
T ss_pred             ChHHHHHHhC--CCCCcEEEEeEEcCCC---------------------CeeeEcceeecCCcccccchhhccchhhhHH
Confidence            7999999999  8899999999999775                     499999999999876543210      000 


Q ss_pred             CCCCCCCCCccccccCCcchHHHHhhccc-cccCCCCeeeeEeeCCCeeeeeCCCCchhhhhhhhcccCCCCCCchhhhh
Q 017877           87 CPSSSVYGNCEFQHYQGLQPELFFKMSHE-IYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQ  165 (364)
Q Consensus        87 ~~~ss~~g~~~~~~~~~~d~EwFflMSm~-sF~~GeGLpGKaaasg~h~WI~~~~~~~e~~f~s~w~~s~d~~~Ra~~AQ  165 (364)
                      ....+.++++...+.+++++||||++||+ +|  |+|+|||||++|+|+||++++..           ..+.|+|+|+||
T Consensus        58 ~~l~~~~~~~~~~~~~v~~~e~f~~~s~~~sf--g~G~~G~a~~sg~~~Wi~~~~~~-----------~~~~~~r~~~aq  124 (163)
T PF14215_consen   58 RELHSSFSSYALSPEEVTDTEWFYLVSMSYSF--GEGIPGRAAASGQHIWISGANEL-----------DSSYCERAWLAQ  124 (163)
T ss_pred             HHHhhhccccccccchhHHHHHHhhceeeEEe--cCCccEEEeecCccEEEeCCCcc-----------ccccchhhhhhc
Confidence            01122223445678899999999999994 66  99999999999999999999742           345789999999


Q ss_pred             hccCcceEEEeEeCCceEeecccccccCChHHHHHHHHHh
Q 017877          166 FQSGIKTIALIAVREGVVQLGAVNKVVEDLNYVVLLRKKF  205 (364)
Q Consensus       166 FsaGIQTIVcIPV~~GVVELGSte~I~Ed~~lV~~IK~lF  205 (364)
                       .+||||||||||++||||||||++|+||.+||++||++|
T Consensus       125 -~~~~~Tiv~IPv~~GVvELGSt~~I~Ed~~~v~~vk~~F  163 (163)
T PF14215_consen  125 -FAGIQTIVCIPVPNGVVELGSTEKIPEDSNLVQRVKSLF  163 (163)
T ss_pred             -ccccceEEEEEecCCEEEeeeeeeeccCHHHHHHHHhhC
Confidence             688889999999999999999999999999999999998




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query364
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 46.0 bits (108), Expect = 1e-05
 Identities = 51/350 (14%), Positives = 89/350 (25%), Gaps = 98/350 (28%)

Query: 75  AASTAAEINSGDCPSSSVY----GNCEFQHYQGLQPELFFKMSHEIY-----NYGEGLIG 125
           A                ++     NC         PE   +M  ++      N+      
Sbjct: 167 ALDVCLSYKVQCKMDFKIFWLNLKNCN-------SPETVLEMLQKLLYQIDPNWTSR--- 216

Query: 126 KVAADHSHKWIYK-EPNDQEINFLSAWHNAADS--------HPRTWEAQFQSGIKTI--- 173
              +DHS     +      E+  L       +         + + W A F    K +   
Sbjct: 217 ---SDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNA-FNLSCKILLTT 272

Query: 174 -------ALIAVREGVVQLGAVNKVVEDLNYVVLLRKKFSYIESIPGVLLPHPSSSAYPF 226
                   L A     + L   +  +       LL K   Y++  P   LP    +  P 
Sbjct: 273 RFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLK---YLDCRPQ-DLPREVLTTNPR 328

Query: 227 KIDGYHAPAEAWHHYQIQNPLAAALPLDHHHHQYYDHFNQPSSAMKITPSMSSLEAL--- 283
           ++                  +A ++         + H N       I  S++ LE     
Sbjct: 329 RLSI----------------IAESIRDGLATWDNWKHVNCDKLTTIIESSLNVLEPAEYR 372

Query: 284 -----LSKLPSVVPPPPS--------GDYGESQPPLMSPFV--SAAHQRPMEF------I 322
                LS  P     P              +    +++     S   ++P E       I
Sbjct: 373 KMFDRLSVFPPSAHIPTILLSLIWFDVIKSDVM-VVVNKLHKYSLVEKQPKESTISIPSI 431

Query: 323 QAEKVAKEDTD--------EDYCRRPHDRDVGECS-SSMSGY-QHHFHHH 362
             E   K + +        + Y   P   D  +     +  Y   H  HH
Sbjct: 432 YLELKVKLENEYALHRSIVDHY-NIPKTFDSDDLIPPYLDQYFYSHIGHH 480


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query364
1f5m_A180 GAF; CGMP binding, signaling protein; 1.90A {Sacch 94.87
3dba_A180 CONE CGMP-specific 3',5'-cyclic phosphodiesterase 94.79
3trc_A171 Phosphoenolpyruvate-protein phosphotransferase; si 94.54
3e0y_A181 Conserved domain protein; APC87688.2, geobacter su 93.67
1vhm_A195 Protein YEBR; structural genomics, unknown functio 93.25
3ci6_A171 Phosphoenolpyruvate-protein phosphotransferase; PE 92.98
2w3g_A153 DOSS, two component sensor histidine kinase DEVS ( 92.51
3k2n_A177 Sigma-54-dependent transcriptional regulator; PSI- 92.32
3mmh_A167 FRMSR, methionine-R-sulfoxide reductase; oxidoredu 91.74
2e4s_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phosphod 90.83
3rfb_A171 Putative uncharacterized protein; FRMSR, GAF, oxid 90.06
2vjw_A149 GAF-B, GAF family protein; histidine kinase, hypox 89.78
3ksh_A160 Putative uncharacterized protein; FRMSR, free-Met- 89.65
3oov_A169 Methyl-accepting chemotaxis protein, putative; str 86.97
2zmf_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 85.16
3hcy_A151 Putative two-component sensor histidine kinase PR; 84.6
3ibj_A 691 CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE 80.13
>1f5m_A GAF; CGMP binding, signaling protein; 1.90A {Saccharomyces cerevisiae} SCOP: d.110.2.1 PDB: 3ko6_A* Back     alignment and structure
Probab=94.87  E-value=0.011  Score=51.72  Aligned_cols=72  Identities=17%  Similarity=0.301  Sum_probs=52.4

Q ss_pred             cccCCCCeeeeEeeCCCeeeeeCCCCchhhhhhhhcccCCCCCCchhhhhhccCcceEEEeEeCC------ceEeecccc
Q 017877          116 IYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVRE------GVVQLGAVN  189 (364)
Q Consensus       116 sF~~GeGLpGKaaasg~h~WI~~~~~~~e~~f~s~w~~s~d~~~Ra~~AQFsaGIQTIVcIPV~~------GVVELGSte  189 (364)
                      .|..|+|+.|+|+.+++.+.|.+....                +| |.+. ..+++.++|||+..      |||++.+.+
T Consensus        93 ~i~~g~Gi~G~aa~~g~~v~v~Dv~~d----------------p~-~~~~-~~~~~S~l~vPi~~~~g~viGVL~l~s~~  154 (180)
T 1f5m_A           93 MIQFGKGVCGTAASTKETQIVPDVNKY----------------PG-HIAC-DGETKSEIVVPIISNDGKTLGVIDIDCLD  154 (180)
T ss_dssp             EEETTSHHHHHHHHHTSCEEESCGGGS----------------TT-CCCS-STTCCEEEEEEEECTTSCEEEEEEEEESS
T ss_pred             eecCCCcchhhhhhcCCEEEeCCcccC----------------cc-cccc-CcccceEEEEEEEcCCCeEEEEEEeccCC
Confidence            688899999999999999999765421                11 1222 46889999999865      999999876


Q ss_pred             cc---cCChHHHHHHHHHh
Q 017877          190 KV---VEDLNYVVLLRKKF  205 (364)
Q Consensus       190 ~I---~Ed~~lV~~IK~lF  205 (364)
                      .-   .+|..+++.+-...
T Consensus       155 ~~~F~~~d~~~L~~la~~~  173 (180)
T 1f5m_A          155 YEGFDHVDKEFLEKLAKLI  173 (180)
T ss_dssp             TTCCCHHHHHHHHHHHHHH
T ss_pred             CCCcCHHHHHHHHHHHHHH
Confidence            42   34556666655443



>3dba_A CONE CGMP-specific 3',5'-cyclic phosphodiesterase alpha'; 3', GAF domain, cyclic nucleotide phosphodiesterase hydrolase, lipoprotein, membrane; HET: 35G; 2.57A {Gallus gallus} Back     alignment and structure
>3trc_A Phosphoenolpyruvate-protein phosphotransferase; signal transduction; HET: MSE; 1.65A {Coxiella burnetii} Back     alignment and structure
>3e0y_A Conserved domain protein; APC87688.2, geobacter sulfurreducens PCA, structural genomics, PSI-2, midwest center for structural G MCSG; 3.10A {Geobacter sulfurreducens} Back     alignment and structure
>1vhm_A Protein YEBR; structural genomics, unknown function; HET: MES; 2.10A {Escherichia coli} SCOP: d.110.2.1 Back     alignment and structure
>3ci6_A Phosphoenolpyruvate-protein phosphotransferase; PEP-phosphotransferase, GAF domain, structura genomics, PSI-2, protein structure initiative; HET: MSE P4G; 1.55A {Acinetobacter SP} Back     alignment and structure
>2w3g_A DOSS, two component sensor histidine kinase DEVS (GAF family protein); redox sensor, heme, hypoxia, GAF domain, transferase; HET: HEM; 1.40A {Mycobacterium tuberculosis} PDB: 2w3d_A* 2w3f_A* 2w3e_A* 2w3h_A* 2y79_A* 2y8h_A* 2vzw_A* Back     alignment and structure
>3k2n_A Sigma-54-dependent transcriptional regulator; PSI-2, protein structure initiative, structural genomics; 2.50A {Chlorobium tepidum tls} Back     alignment and structure
>3mmh_A FRMSR, methionine-R-sulfoxide reductase; oxidoreductase; HET: SME MRD; 1.25A {Neisseria meningitidis} SCOP: d.110.2.0 Back     alignment and structure
>3rfb_A Putative uncharacterized protein; FRMSR, GAF, oxidoreductase, SME; HET: SME; 2.30A {Streptococcus pneumoniae} Back     alignment and structure
>2vjw_A GAF-B, GAF family protein; histidine kinase, hypoxia sensing, hydrolase; HET: MSE; 2.0A {Mycobacterium smegmatis} PDB: 2vks_A Back     alignment and structure
>3ksh_A Putative uncharacterized protein; FRMSR, free-Met-R-SO, oxidoreductase; 1.50A {Staphylococcus aureus} SCOP: d.110.2.0 PDB: 3ksf_A 3ksi_A 3ksg_A* Back     alignment and structure
>3oov_A Methyl-accepting chemotaxis protein, putative; structural genomics, PSI-2, protein structure initiative; 2.20A {Geobacter sulfurreducens} Back     alignment and structure
>2zmf_A CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 10A; GAF domain, phosphodiesterase, CGMP-binding, HYD nucleotide-binding, structural genomics; HET: MSE CMP; 2.10A {Homo sapiens} Back     alignment and structure
>3hcy_A Putative two-component sensor histidine kinase PR; two-component sensor histidine kinase protein, structural GE PSI, MCSG; 2.80A {Sinorhizobium meliloti} Back     alignment and structure
>3ibj_A CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE2A, GAF-domains, allosteric regulation hydrolase, membrane; 3.02A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query364
d1vhma_159 Hypothetical protein YebR {Escherichia coli [TaxId 93.83
d1f5ma_176 Hypothetical protein ykl069wp {Baker's yeast (Sacc 93.26
d1mc0a1187 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 87.57
>d1vhma_ d.110.2.1 (A:) Hypothetical protein YebR {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Profilin-like
superfamily: GAF domain-like
family: GAF domain
domain: Hypothetical protein YebR
species: Escherichia coli [TaxId: 562]
Probab=93.83  E-value=0.018  Score=46.92  Aligned_cols=70  Identities=14%  Similarity=0.087  Sum_probs=50.4

Q ss_pred             cccCCCCeeeeEeeCCCeeeeeCCCCchhhhhhhhcccCCCCCCchhhhhhccCcceEEEeEeC--C---ceEeeccccc
Q 017877          116 IYNYGEGLIGKVAADHSHKWIYKEPNDQEINFLSAWHNAADSHPRTWEAQFQSGIKTIALIAVR--E---GVVQLGAVNK  190 (364)
Q Consensus       116 sF~~GeGLpGKaaasg~h~WI~~~~~~~e~~f~s~w~~s~d~~~Ra~~AQFsaGIQTIVcIPV~--~---GVVELGSte~  190 (364)
                      .++.|+|+.|+|+.+|+++.+.+....                +|..  .+..+++.++|||+.  +   |||.++|.+.
T Consensus        67 ~i~~g~g~~g~v~~~~~~~~v~d~~~d----------------~~~~--~~~~~~~S~l~vPl~~~~~viGvl~v~s~~~  128 (159)
T d1vhma_          67 RIPVGRGVCGTAVARNQVQRIEDVHVF----------------DGHI--ACDAASNSEIVLPLVVKNQIIGVLDIDSTVF  128 (159)
T ss_dssp             EEETTSHHHHHHHHHTSCEEESCTTTC----------------TTCC--CSCCCCSEEEEEEEEETTEEEEEEEEEESST
T ss_pred             EEecCCChHHHHHHcCCcEEecChHHh----------------hhhh--hcccccceeEEeCEEECCEEEEEEEecCCCC
Confidence            367899999999999999999876421                2221  125788999999983  2   9999999765


Q ss_pred             cc---CChHHHHHHHH
Q 017877          191 VV---EDLNYVVLLRK  203 (364)
Q Consensus       191 I~---Ed~~lV~~IK~  203 (364)
                      =.   +|..+++.|-.
T Consensus       129 ~~F~~~d~~ll~~la~  144 (159)
T d1vhma_         129 GRFTDEDEQGLRQLVA  144 (159)
T ss_dssp             TCCCHHHHHHHHHHHH
T ss_pred             CCCCHHHHHHHHHHHH
Confidence            44   34555555443



>d1f5ma_ d.110.2.1 (A:) Hypothetical protein ykl069wp {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mc0a1 d.110.2.1 (A:215-401) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure