Citrus Sinensis ID: 033800


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-
MDSPWAIQPSYSTLSISTSRFSAESSGTDSFGSRMSPITGGEASCHSPDLDSPHLEARENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEGYDSDATDVT
ccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHcccccccEEEcccccccccccc
ccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHccccccEEEEHHHHHHHccccccccEEEEcccccccccccc
mdspwaiqpsystlsistsrfsaessgtdsfgsrmspitggeaschspdldsphlEARENAMMRYKEKKKArrqdkqiqypsrKARADARKRVQGRflktegydsdatdvt
MDSPWAIQPSYSTLSISTSRFSAESSGTDSFGSRMSPITGGEASCHSPDLDSPHLEARENAMMRYKEkkkarrqdkqiqypsrkaradarkrvqgrflktegydsdatdvt
MDSPWAIQPSYSTLSISTSRFSAESSGTDSFGSRMSPITGGEASCHSPDLDSPHLEARENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEGYDSDATDVT
***************************************************************************************************************
************************************************************************RQDKQIQYPSRKARADARKRVQGRFL*************
MDSPWAIQPSYSTLSIST**************SRMSPITGGEASCHSPDLDSPHLEARENAMMRY**************************RVQGRFLKTEG*********
***************************************************S*H*EARENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEGY********
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDSPWAIQPSYSTLSISTSRFSAESSGTDSFGSRMSPITGGEASCHSPDLDSPHLEARENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEGYDSDATDVT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query111 2.2.26 [Sep-21-2011]
Q9SSE5372 Zinc finger protein CONST yes no 0.900 0.268 0.458 3e-14
Q9LUA9373 Zinc finger protein CONST no no 0.558 0.166 0.575 5e-13
O22800402 Zinc finger protein CONST no no 0.396 0.109 0.659 2e-10
Q9LJ44364 Zinc finger protein CONST no no 0.504 0.153 0.578 8e-10
Q9C7E8433 Zinc finger protein CONST no no 0.396 0.101 0.659 2e-09
O82256332 Zinc finger protein CONST no no 0.801 0.268 0.346 1e-08
Q9FDX8395 Zinc finger protein HD1 O no no 0.378 0.106 0.571 7e-07
O50055355 Zinc finger protein CONST no no 0.459 0.143 0.490 7e-07
Q96502347 Zinc finger protein CONST no no 0.378 0.121 0.571 8e-07
Q940T9362 Zinc finger protein CONST no no 0.540 0.165 0.4 1e-06
>sp|Q9SSE5|COL9_ARATH Zinc finger protein CONSTANS-LIKE 9 OS=Arabidopsis thaliana GN=COL9 PE=2 SV=1 Back     alignment and function desciption
 Score = 77.0 bits (188), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/109 (45%), Positives = 66/109 (60%), Gaps = 9/109 (8%)

Query: 5   WAIQPSYSTLSISTSRFSAESSGTD--SFGSRMSPITGGEASCHSPDLD----SPHLEAR 58
           +A +P++S +S S    + ESS  D    G+  S    GE   + P L       H   R
Sbjct: 258 FASKPAHSNISFSG--VTGESSAGDFQECGASSSIQLSGEPPWYPPTLQDNNACSHSVTR 315

Query: 59  ENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKT-EGYDSD 106
            NA+MRYKEKKKAR+ DK+++Y SRKARAD R+RV+GRF+K  E YD D
Sbjct: 316 NNAVMRYKEKKKARKFDKRVRYASRKARADVRRRVKGRFVKAGEAYDYD 364





Arabidopsis thaliana (taxid: 3702)
>sp|Q9LUA9|COL10_ARATH Zinc finger protein CONSTANS-LIKE 10 OS=Arabidopsis thaliana GN=COL10 PE=1 SV=1 Back     alignment and function description
>sp|O22800|COL14_ARATH Zinc finger protein CONSTANS-LIKE 14 OS=Arabidopsis thaliana GN=COL14 PE=2 SV=2 Back     alignment and function description
>sp|Q9LJ44|COL12_ARATH Zinc finger protein CONSTANS-LIKE 12 OS=Arabidopsis thaliana GN=COL12 PE=2 SV=2 Back     alignment and function description
>sp|Q9C7E8|COL15_ARATH Zinc finger protein CONSTANS-LIKE 15 OS=Arabidopsis thaliana GN=COL15 PE=2 SV=1 Back     alignment and function description
>sp|O82256|COL13_ARATH Zinc finger protein CONSTANS-LIKE 13 OS=Arabidopsis thaliana GN=COL13 PE=2 SV=1 Back     alignment and function description
>sp|Q9FDX8|HD1_ORYSJ Zinc finger protein HD1 OS=Oryza sativa subsp. japonica GN=HD1 PE=2 SV=1 Back     alignment and function description
>sp|O50055|COL1_ARATH Zinc finger protein CONSTANS-LIKE 1 OS=Arabidopsis thaliana GN=COL1 PE=1 SV=1 Back     alignment and function description
>sp|Q96502|COL2_ARATH Zinc finger protein CONSTANS-LIKE 2 OS=Arabidopsis thaliana GN=COL2 PE=1 SV=1 Back     alignment and function description
>sp|Q940T9|COL4_ARATH Zinc finger protein CONSTANS-LIKE 4 OS=Arabidopsis thaliana GN=COL4 PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query111
359472914133 PREDICTED: zinc finger protein CONSTANS- 0.981 0.819 0.531 1e-20
297737757153 unnamed protein product [Vitis vinifera] 0.981 0.712 0.531 1e-20
224099423 497 predicted protein [Populus trichocarpa] 0.837 0.187 0.572 6e-19
116787086 555 unknown [Picea sitchensis] 0.945 0.189 0.485 3e-16
255570685 315 hypothetical protein RCOM_0577390 [Ricin 0.720 0.253 0.592 9e-16
297739315 338 unnamed protein product [Vitis vinifera] 0.864 0.284 0.474 1e-12
359485671 448 PREDICTED: putative zinc finger protein 0.864 0.214 0.474 1e-12
15231491 372 zinc finger protein CONSTANS-LIKE 9 [Ara 0.900 0.268 0.458 1e-12
224093218 332 predicted protein [Populus trichocarpa] 0.864 0.289 0.474 5e-12
297795479 263 hypothetical protein ARALYDRAFT_357028 [ 0.558 0.235 0.575 2e-11
>gi|359472914|ref|XP_003631214.1| PREDICTED: zinc finger protein CONSTANS-LIKE 12-like [Vitis vinifera] Back     alignment and taxonomy information
 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/111 (53%), Positives = 74/111 (66%), Gaps = 2/111 (1%)

Query: 1   MDSPWAIQPSYSTLSISTSRFSAESSGTDSFGSRMSPITGGEASCHSPDLDSPHLEAREN 60
           ++S   I+PS+S +S S S  S ESS TD   S +SP T       S DL+S H EAREN
Sbjct: 22  VNSHCPIRPSFSAMSFSVSILSPESSSTDYLDSGLSPTTTRGPPFDSSDLESAHSEAREN 81

Query: 61  AMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEGYDSDATDVT 111
           AMMRYKEKKK R  +KQI++  RK  AD RKRV+G+ LK EGY+SD+ + T
Sbjct: 82  AMMRYKEKKKVRMNEKQIRHTPRK--ADPRKRVKGQSLKAEGYESDSVNAT 130




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297737757|emb|CBI26958.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224099423|ref|XP_002311478.1| predicted protein [Populus trichocarpa] gi|222851298|gb|EEE88845.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|116787086|gb|ABK24370.1| unknown [Picea sitchensis] Back     alignment and taxonomy information
>gi|255570685|ref|XP_002526297.1| hypothetical protein RCOM_0577390 [Ricinus communis] gi|223534378|gb|EEF36086.1| hypothetical protein RCOM_0577390 [Ricinus communis] Back     alignment and taxonomy information
>gi|297739315|emb|CBI28966.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|359485671|ref|XP_002274384.2| PREDICTED: putative zinc finger protein CONSTANS-LIKE 11-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|15231491|ref|NP_187422.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|42572309|ref|NP_974250.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|79313159|ref|NP_001030659.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|186509893|ref|NP_001118599.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|17433084|sp|Q9SSE5.1|COL9_ARATH RecName: Full=Zinc finger protein CONSTANS-LIKE 9 gi|6466948|gb|AAF13083.1|AC009176_10 unknown protein [Arabidopsis thaliana] gi|119360037|gb|ABL66747.1| At3g07650 [Arabidopsis thaliana] gi|222423327|dbj|BAH19639.1| AT3G07650 [Arabidopsis thaliana] gi|332641059|gb|AEE74580.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|332641060|gb|AEE74581.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|332641061|gb|AEE74582.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] gi|332641062|gb|AEE74583.1| zinc finger protein CONSTANS-LIKE 9 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224093218|ref|XP_002309838.1| predicted protein [Populus trichocarpa] gi|222852741|gb|EEE90288.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297795479|ref|XP_002865624.1| hypothetical protein ARALYDRAFT_357028 [Arabidopsis lyrata subsp. lyrata] gi|297311459|gb|EFH41883.1| hypothetical protein ARALYDRAFT_357028 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query111
TAIR|locus:2091082372 COL9 "CONSTANS-like 9" [Arabid 0.900 0.268 0.467 8.5e-16
TAIR|locus:2164718373 BBX8 "B-box domain protein 8" 0.945 0.281 0.434 1.5e-14
TAIR|locus:2129860330 BBX9 "B-box domain protein 9" 0.837 0.281 0.431 2.1e-11
TAIR|locus:2088748364 BBX10 "B-box domain protein 10 0.792 0.241 0.440 7.7e-11
TAIR|locus:2010504433 BBX13 "B-box domain protein 13 0.828 0.212 0.414 3e-10
TAIR|locus:2051104402 BBX12 "B-box domain protein 12 0.378 0.104 0.690 4.3e-10
TAIR|locus:2043288332 AT2G47890 [Arabidopsis thalian 0.540 0.180 0.5 7.6e-09
TAIR|locus:2047246294 COL3 "CONSTANS-like 3" [Arabid 0.900 0.340 0.330 9.3e-09
UNIPROTKB|Q9FDX8395 HD1 "Zinc finger protein HD1" 0.765 0.215 0.413 3.8e-08
UNIPROTKB|Q9FE92407 Hd1 "Heading day 1" [Oryza sat 0.765 0.208 0.413 4e-08
TAIR|locus:2091082 COL9 "CONSTANS-like 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 202 (76.2 bits), Expect = 8.5e-16, P = 8.5e-16
 Identities = 51/109 (46%), Positives = 67/109 (61%)

Query:     5 WAIQPSYSTLSISTSRFSAESSGTD--SFGSRMSPITGGEASCHSPDLDS----PHLEAR 58
             +A +P++S  +IS S  + ESS  D    G+  S    GE   + P L       H   R
Sbjct:   258 FASKPAHS--NISFSGVTGESSAGDFQECGASSSIQLSGEPPWYPPTLQDNNACSHSVTR 315

Query:    59 ENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKT-EGYDSD 106
              NA+MRYKEKKKAR+ DK+++Y SRKARAD R+RV+GRF+K  E YD D
Sbjct:   316 NNAVMRYKEKKKARKFDKRVRYASRKARADVRRRVKGRFVKAGEAYDYD 364




GO:0005622 "intracellular" evidence=IEA
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0008270 "zinc ion binding" evidence=IEA;ISS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0006355 "regulation of transcription, DNA-dependent" evidence=TAS
GO:0048579 "negative regulation of long-day photoperiodism, flowering" evidence=IMP
GO:0007623 "circadian rhythm" evidence=IEP;RCA
TAIR|locus:2164718 BBX8 "B-box domain protein 8" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2129860 BBX9 "B-box domain protein 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2088748 BBX10 "B-box domain protein 10" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2010504 BBX13 "B-box domain protein 13" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051104 BBX12 "B-box domain protein 12" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2043288 AT2G47890 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2047246 COL3 "CONSTANS-like 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9FDX8 HD1 "Zinc finger protein HD1" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q9FE92 Hd1 "Heading day 1" [Oryza sativa (taxid:4530)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00081165
hypothetical protein (497 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query111
pfam0620345 pfam06203, CCT, CCT motif 2e-14
>gnl|CDD|203407 pfam06203, CCT, CCT motif Back     alignment and domain information
 Score = 61.5 bits (150), Expect = 2e-14
 Identities = 26/42 (61%), Positives = 36/42 (85%)

Query: 58 RENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLK 99
          RE A++RYKEK+K R+ DK+I+Y SRKA A++R RV+GRF+K
Sbjct: 1  REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVK 42


This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 111
PF0620345 CCT: CCT motif; InterPro: IPR010402 The CCT (CONST 99.86
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 96.9
PF0942527 CCT_2: Divergent CCT motif; InterPro: IPR018467 Th 95.89
>PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction Back     alignment and domain information
Probab=99.86  E-value=9.4e-23  Score=127.19  Aligned_cols=45  Identities=53%  Similarity=0.846  Sum_probs=43.0

Q ss_pred             HHHHHHHHHHHHhhhccCCcccccchhhhhhhccCCCcceecCCC
Q 033800           58 RENAMMRYKEKKKARRQDKQIQYPSRKARADARKRVQGRFLKTEG  102 (111)
Q Consensus        58 R~~~v~RYreKkk~R~f~KkIrY~~RK~~Ad~RpRvKGRFvk~~~  102 (111)
                      |+++|+||+|||++|+|+|+|+|++||++||.|||||||||+.++
T Consensus         1 R~~~l~Ry~~Kr~~R~f~kkirY~~Rk~~A~~R~RvkGRFvk~~e   45 (45)
T PF06203_consen    1 REEKLQRYREKRKRRNFEKKIRYESRKAVADKRPRVKGRFVKKSE   45 (45)
T ss_pred             CHHHHHHHHHHHHhhcccccCCcchHHHHHhhCCccCCcccCCCC
Confidence            689999999999999999999999999999999999999999853



The CCT domain is found in association with other domains, such as the B-box zinc finger, the GATA-type zinc finger, the ZIM motif or the response regulatory domain. The CCT domain contains a putative nuclear localisation signal within the second half of the CCT motif and has been shown to be involved in nuclear localization and probably also has a role in protein-protein interaction [].; GO: 0005515 protein binding

>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query111
3ogl_Q21 JAZ1 incomplete degron peptide; leucine-rich repea 95.38
3ogk_Q22 JAZ1 incomplete degron peptide; leucine rich repea 93.43
>3ogl_Q JAZ1 incomplete degron peptide; leucine-rich repeats, ubiquitin ligase, SCF, protein binding; HET: 7JA; 3.18A {Arabidopsis thaliana} PDB: 3ogm_Q* Back     alignment and structure
Probab=95.38  E-value=0.0088  Score=31.54  Aligned_cols=18  Identities=39%  Similarity=0.549  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHhhh
Q 033800           55 LEARENAMMRYKEKKKAR   72 (111)
Q Consensus        55 ~~~R~~~v~RYreKkk~R   72 (111)
                      +.+|.+.|+||.||||.|
T Consensus         3 p~aRk~SLqRFleKRk~R   20 (21)
T 3ogl_Q            3 PIARRASLHRFLEKRKDR   20 (26)
T ss_pred             chhHHHHHHHHHHHhhcc
Confidence            468999999999999987



>3ogk_Q JAZ1 incomplete degron peptide; leucine rich repeat, ubiquitin ligase, SCF, protein binding; HET: OGK; 2.80A {Arabidopsis thaliana} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00