Citrus Sinensis ID: 041672


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------
DHEEQKNPCKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLSGNSGSAHSASDEKVERRGVVVGRPAPADDDDHQKRMMMRKRFRT
ccccccccccccEEHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccHHHHccccccccccccccccccccccccccccc
cccccccccccEEEEHHHHHHHHHHccccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHcccccccccccc
dheeqknpckkVVRYKECLKNHaaaiggsatdgcgefmpsgeegsieslkcsacnchrnfhrkeigcdahyHHLHQQAMMMlsgnsgsahsasdekverrgvvvgrpapaddddhQKRMMMRKRFRT
dheeqknpckkVVRYKECLKNHAAaiggsatdgCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLSGNSGSAHSASDEKverrgvvvgrpapaddddhqkrMMMRKRFRT
DHEEQKNPCKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLSGNSGSAHSASDEKVErrgvvvgrpapaDDDDHQKRMMMRKRFRT
**********KVVRYKECLKNHAAAIGGSATDGCGEF********IESLKCSACNCHRNFHRKEIGCDAHYHHLHQQ**************************************************
*************RYKECLKNHAAAIGGSATDGCGEFMP****GSIESLKCSACNCHRNFHRKEIG*******************************************************RKRFR*
********CKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLS***************RRGVVVGRPAPADDDDHQKRMMMRKRFRT
*******PCKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLSGNSGSAHSASDEKVE*****************************
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
DHEEQKNPCKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEIGCDAHYHHLHQQAMMMLSGNSGSAHSASDEKVERRGVVVGRPAPADDDDHQKRMMMRKRFRT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query127 2.2.26 [Sep-21-2011]
Q9SB61220 ZF-HD homeobox protein At no no 0.456 0.263 0.688 9e-19
Q9FKP8279 ZF-HD homeobox protein At no no 0.417 0.189 0.698 4e-18
Q9SEZ1242 Transcription factor HB29 no no 0.889 0.466 0.385 3e-15
>sp|Q9SB61|Y4466_ARATH ZF-HD homeobox protein At4g24660 OS=Arabidopsis thaliana GN=At4g24660 PE=1 SV=1 Back     alignment and function desciption
 Score = 92.0 bits (227), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 42/61 (68%), Positives = 50/61 (81%), Gaps = 3/61 (4%)

Query: 13  VRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKE---IGCDA 69
           +RY+ECLKNHA  IGG A DGC EFMPSGE+G++++LKC+AC CHRNFHRKE   IG  A
Sbjct: 47  IRYRECLKNHAVNIGGHAVDGCCEFMPSGEDGTLDALKCAACGCHRNFHRKETESIGGRA 106

Query: 70  H 70
           H
Sbjct: 107 H 107




Putative transcription factor.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9FKP8|Y5541_ARATH ZF-HD homeobox protein At5g65410 OS=Arabidopsis thaliana GN=At5g65410 PE=1 SV=1 Back     alignment and function description
>sp|Q9SEZ1|HB29_ARATH Transcription factor HB29 OS=Arabidopsis thaliana GN=HB29 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query127
224129806 271 predicted protein [Populus trichocarpa] 0.866 0.405 0.507 2e-29
164562219 258 zinc finger-homeodomain protein 2 [Sarum 0.795 0.391 0.598 8e-26
224119848177 predicted protein [Populus trichocarpa] 0.779 0.559 0.507 8e-26
255569812 311 transcription factor, putative [Ricinus 0.850 0.347 0.434 7e-25
359486213 281 PREDICTED: ZF-HD homeobox protein At4g24 0.866 0.391 0.440 3e-24
356522109 283 PREDICTED: ZF-HD homeobox protein At4g24 0.496 0.222 0.782 6e-24
224168739175 predicted protein [Populus trichocarpa] 0.834 0.605 0.5 1e-23
224080087212 predicted protein [Populus trichocarpa] 0.850 0.509 0.435 1e-23
356563811 286 PREDICTED: ZF-HD homeobox protein At4g24 0.921 0.409 0.454 2e-23
116783202 249 unknown [Picea sitchensis] 0.984 0.502 0.418 3e-23
>gi|224129806|ref|XP_002328807.1| predicted protein [Populus trichocarpa] gi|222839105|gb|EEE77456.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  133 bits (335), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/142 (50%), Positives = 91/142 (64%), Gaps = 32/142 (22%)

Query: 8   PCKKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEI-- 65
           P KK+VRY+ECLKNHAA++GG+ATDGCGEFMPSGEEGSIE+L CSACNCHRNFHRKEI  
Sbjct: 72  PYKKMVRYRECLKNHAASMGGNATDGCGEFMPSGEEGSIEALTCSACNCHRNFHRKEIEG 131

Query: 66  ------------GCDAHYHHLHQQAM--------MMLSGNSGSAHSASDEKVERRGVVVG 105
                           H++ L ++ +        M++S N GS  S SDE+ +  GV++ 
Sbjct: 132 EHTSCTGDHCYHNSPVHFNRLGRKVILASAPHHQMIMSYNMGSLPSESDEQEDGGGVLMA 191

Query: 106 RPAPADDDDHQKRMMMRKRFRT 127
           RPA           +M+KRFRT
Sbjct: 192 RPA----------QLMKKRFRT 203




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|164562219|gb|ABY61024.1| zinc finger-homeodomain protein 2 [Saruma henryi] Back     alignment and taxonomy information
>gi|224119848|ref|XP_002331077.1| predicted protein [Populus trichocarpa] gi|222872805|gb|EEF09936.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255569812|ref|XP_002525870.1| transcription factor, putative [Ricinus communis] gi|223534875|gb|EEF36564.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|359486213|ref|XP_003633412.1| PREDICTED: ZF-HD homeobox protein At4g24660 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356522109|ref|XP_003529692.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Glycine max] Back     alignment and taxonomy information
>gi|224168739|ref|XP_002339186.1| predicted protein [Populus trichocarpa] gi|222874604|gb|EEF11735.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224080087|ref|XP_002306012.1| predicted protein [Populus trichocarpa] gi|222848976|gb|EEE86523.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356563811|ref|XP_003550152.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Glycine max] Back     alignment and taxonomy information
>gi|116783202|gb|ABK22835.1| unknown [Picea sitchensis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query127
TAIR|locus:2012602 312 HB31 "AT1G14440" [Arabidopsis 0.440 0.179 0.821 4.2e-24
TAIR|locus:2025121 309 HB33 "AT1G75240" [Arabidopsis 0.590 0.242 0.597 8.7e-24
TAIR|locus:2065304 310 HB21 "homeobox protein 21" [Ar 0.417 0.170 0.811 3.4e-22
TAIR|locus:2150871 271 HB30 "AT5G15210" [Arabidopsis 0.496 0.232 0.6 2.9e-21
TAIR|locus:2121989220 HB22 "AT4G24660" [Arabidopsis 0.456 0.263 0.688 4.4e-20
TAIR|locus:2168240279 HB25 "AT5G65410" [Arabidopsis 0.480 0.218 0.625 1.9e-19
TAIR|locus:2062141262 HB24 "AT2G18350" [Arabidopsis 0.409 0.198 0.730 8.3e-19
TAIR|locus:2167052 334 HB23 "AT5G39760" [Arabidopsis 0.480 0.182 0.6 1.6e-18
TAIR|locus:2077957249 HB28 "homeobox protein 28" [Ar 0.937 0.477 0.357 4.6e-18
TAIR|locus:2026734242 ZFHD1 "AT1G69600" [Arabidopsis 0.409 0.214 0.666 8.9e-18
TAIR|locus:2012602 HB31 "AT1G14440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 276 (102.2 bits), Expect = 4.2e-24, P = 4.2e-24
 Identities = 46/56 (82%), Positives = 54/56 (96%)

Query:    10 KKVVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEI 65
             K +++YKECLKNHAAA+GG+ATDGCGEFMPSGE+GSIE+L CSACNCHRNFHRKE+
Sbjct:    85 KPMIKYKECLKNHAAAMGGNATDGCGEFMPSGEDGSIEALTCSACNCHRNFHRKEV 140




GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2025121 HB33 "AT1G75240" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2065304 HB21 "homeobox protein 21" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2150871 HB30 "AT5G15210" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2121989 HB22 "AT4G24660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2168240 HB25 "AT5G65410" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062141 HB24 "AT2G18350" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167052 HB23 "AT5G39760" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077957 HB28 "homeobox protein 28" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2026734 ZFHD1 "AT1G69600" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query127
pfam0477060 pfam04770, ZF-HD_dimer, ZF-HD protein dimerisation 1e-30
TIGR0156653 TIGR01566, ZF_HD_prot_N, ZF-HD homeobox protein Cy 6e-25
>gnl|CDD|218256 pfam04770, ZF-HD_dimer, ZF-HD protein dimerisation region Back     alignment and domain information
 Score =  103 bits (260), Expect = 1e-30
 Identities = 42/54 (77%), Positives = 48/54 (88%)

Query: 12 VVRYKECLKNHAAAIGGSATDGCGEFMPSGEEGSIESLKCSACNCHRNFHRKEI 65
           VRY+ECLKNHAA+IGG A DGCGEFMPSGEEG+ +SLKC+AC CHRNFHR+E 
Sbjct: 3  GVRYRECLKNHAASIGGHAVDGCGEFMPSGEEGTPDSLKCAACGCHRNFHRREP 56


This family of proteins has are plant transcription factors, and have been named ZF-HD for zinc finger homeodomain proteins, on the basis of similarity to proteins of known structure. This region is thought to be involved in the formation of homo and heterodimers, and may form a zinc finger. Length = 60

>gnl|CDD|130629 TIGR01566, ZF_HD_prot_N, ZF-HD homeobox protein Cys/His-rich dimerization domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 127
PF0477060 ZF-HD_dimer: ZF-HD protein dimerisation region; In 100.0
TIGR0156653 ZF_HD_prot_N ZF-HD homeobox protein Cys/His-rich d 100.0
>PF04770 ZF-HD_dimer: ZF-HD protein dimerisation region; InterPro: IPR006456 The homeodomain (HD) is a 60-amino acid DNA-binding domain found in many transcription factors Back     alignment and domain information
Probab=100.00  E-value=1.7e-40  Score=224.41  Aligned_cols=58  Identities=76%  Similarity=1.422  Sum_probs=55.3

Q ss_pred             ceeehHHHhhhhhhhcCCccccCccccccC-CCCCCccccccccccccccccccccCCC
Q 041672           11 KVVRYKECLKNHAAAIGGSATDGCGEFMPS-GEEGSIESLKCSACNCHRNFHRKEIGCD   68 (127)
Q Consensus        11 ~~v~Y~EC~kNHAa~~GghavDGCgEFmps-g~~gt~~aL~CaACgCHRnFHRre~~~~   68 (127)
                      .+|+|+||||||||+||||+||||+||||+ +++|++++|+||||||||||||||++++
T Consensus         2 ~~v~Y~EC~kNHAa~~Gg~a~DGCgEFm~~~g~eg~~~al~CaACgCHRnFHRre~~~e   60 (60)
T PF04770_consen    2 KVVRYRECLKNHAASIGGHAVDGCGEFMPSPGEEGTPEALKCAACGCHRNFHRREVEGE   60 (60)
T ss_pred             CceeHHHHHhhHhHhhCCcccccccccccCCCCCCCcccceecccCcchhcccCCcCCC
Confidence            479999999999999999999999999999 8899999999999999999999998764



HD-containing proteins are found in diverse organisms such as humans, Drosophila, nematode worms, and plants, where they play important roles in development. Zinc-finger-homeodomain (ZF- HD) subfamily proteins have only been identified in plants, and likely play plant specific roles. ZF-HD proteins are expressed predominantly or exclusively in floral tissue, indicating a likely regulatory role during floral development []. The ZF-HD class of homeodomain proteins may also be involved in the photosynthesis-related mesophyll-specific gene expression of phosphoenolpyruvate carboxylase in C4 species [] and in pathogen signaling and plant defense mechanisms []. These proteins share three domains of high sequence similarity: the homeodomain (II) located at the carboxy-terminus, and two other segments (Ia and Ib) located in the amino-terminal part. These N-terminal domains contain five conserved cysteine residues and at least three conserved histidine residues whose spacing ressembles zinc-binding domains involved in dimerization of transcription factors. Although the two domains contain at least eight potential zinc-binding amino-acids, the unique spacing of the conserved cysteine and histidine residues within domain Ib suggests that both domains form one rather than two zinc finger structures. The two conserved motifs Ia and Ib constitute a dimerization domain which is sufficient for the formation of homo- and heterodimers []. This entry represents the N-terminal Cysteine/Histidine-rich dimerization domain. The companion ZF-HD homeobox domain is described in IPR006455 from INTERPRO.

>TIGR01566 ZF_HD_prot_N ZF-HD homeobox protein Cys/His-rich dimerization domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00