Citrus Sinensis ID: 043591


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-----
MRKRQVVLRREEPSRNSTASSVTVRSVRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDTEVVCECSSPSSTAT
ccccEEEEccccccccccccccccccEEEHHHHHHHHccccccccccccccccccccccHHHHHHHccccccccccccccccccccccccccccc
cccEEEEEEEccccccccccccccEEEEEHHHHHHHHHHccccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccc
mrkrqvvlrreepsrnstassvTVRSVRYAEcqknhaagvggyavdgcrefmasgeeGTAAALTCAacgchrsfhrrevdtevvcecsspsstat
mrkrqvvlrreepsrnstassvtvrsVRYAECQknhaagvggYAVDGCREFMASGEEGTAAALTCAACGCHRSfhrrevdtevvcecsspsstat
MRKRQVVLRREEPsrnstassvtvrsvrYAECQKNHAAGVGGYAVDGCREFMASgeegtaaaltcaacgcHRSFHRREVDTEVVCECSSPSSTAT
***********************VRSVRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDTEVVCE*********
****************************YAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDT**************
************************RSVRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREV****************
*RKRQVVLRREE********SVTVRSVRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFH********************
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MRKRQVVLRREEPSRNSTASSVTVRSVRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDTEVVCECSSPSSTAT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query95 2.2.26 [Sep-21-2011]
Q9SB61 220 ZF-HD homeobox protein At no no 0.578 0.25 0.672 1e-17
Q9FKP8 279 ZF-HD homeobox protein At no no 0.557 0.189 0.641 3e-15
Q9SEZ1 242 Transcription factor HB29 no no 0.684 0.268 0.507 1e-12
>sp|Q9SB61|Y4466_ARATH ZF-HD homeobox protein At4g24660 OS=Arabidopsis thaliana GN=At4g24660 PE=1 SV=1 Back     alignment and function desciption
 Score = 88.2 bits (217), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 37/55 (67%), Positives = 45/55 (81%)

Query: 27  VRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDT 81
           +RY EC KNHA  +GG+AVDGC EFM SGE+GT  AL CAACGCHR+FHR+E ++
Sbjct: 47  IRYRECLKNHAVNIGGHAVDGCCEFMPSGEDGTLDALKCAACGCHRNFHRKETES 101




Putative transcription factor.
Arabidopsis thaliana (taxid: 3702)
>sp|Q9FKP8|Y5541_ARATH ZF-HD homeobox protein At5g65410 OS=Arabidopsis thaliana GN=At5g65410 PE=1 SV=1 Back     alignment and function description
>sp|Q9SEZ1|HB29_ARATH Transcription factor HB29 OS=Arabidopsis thaliana GN=HB29 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query95
25558679498 transcription factor, putative [Ricinus 1.0 0.969 0.826 4e-36
44946192792 PREDICTED: ZF-HD homeobox protein At4g24 0.936 0.967 0.855 5e-36
22545281595 PREDICTED: ZF-HD homeobox protein At4g24 0.989 0.989 0.776 9e-36
224080193130 predicted protein [Populus trichocarpa] 0.989 0.723 0.781 1e-35
29608291689 unnamed protein product [Vitis vinifera] 0.884 0.943 0.845 1e-34
225452819123 PREDICTED: ZF-HD homeobox protein At4g24 0.884 0.682 0.845 2e-34
449522936110 PREDICTED: ZF-HD homeobox protein At4g24 0.873 0.754 0.857 2e-33
25554740995 transcription factor, putative [Ricinus 0.978 0.978 0.720 3e-32
42572555100 mini zinc finger 2 [Arabidopsis thaliana 1.0 0.95 0.78 1e-31
297815192100 hypothetical protein ARALYDRAFT_484661 [ 1.0 0.95 0.78 1e-31
>gi|255586794|ref|XP_002534013.1| transcription factor, putative [Ricinus communis] gi|223525986|gb|EEF28371.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  155 bits (391), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 81/98 (82%), Positives = 87/98 (88%), Gaps = 3/98 (3%)

Query: 1  MRKRQVVLRREEPSRNS-TASSVTVRSVRYAECQKNHAAGVGGYAVDGCREFMAS-GEEG 58
          MRKRQVV+RREEPSR+S T S+ TVRSVRY ECQKNHAA VGGYAVDGCREFMAS GEEG
Sbjct: 1  MRKRQVVVRREEPSRSSSTTSAFTVRSVRYGECQKNHAANVGGYAVDGCREFMASNGEEG 60

Query: 59 TAAALTCAACGCHRSFHRREVDTE-VVCECSSPSSTAT 95
          T AALTCAACGCHR+FHRREV+TE VVC+CSSPSS   
Sbjct: 61 TTAALTCAACGCHRNFHRREVETEQVVCDCSSPSSNGN 98




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449461927|ref|XP_004148693.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|225452815|ref|XP_002283546.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|224080193|ref|XP_002306048.1| predicted protein [Populus trichocarpa] gi|222849012|gb|EEE86559.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|296082916|emb|CBI22217.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225452819|ref|XP_002283568.1| PREDICTED: ZF-HD homeobox protein At4g24660-like isoform 2 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449522936|ref|XP_004168481.1| PREDICTED: ZF-HD homeobox protein At4g24660-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255547409|ref|XP_002514762.1| transcription factor, putative [Ricinus communis] gi|223546366|gb|EEF47868.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|42572555|ref|NP_974373.1| mini zinc finger 2 [Arabidopsis thaliana] gi|9294233|dbj|BAB02135.1| unnamed protein product [Arabidopsis thaliana] gi|21555769|gb|AAM63930.1| unknown [Arabidopsis thaliana] gi|62319539|dbj|BAD94968.1| hypothetical protein [Arabidopsis thaliana] gi|88900352|gb|ABD57488.1| At3g28917 [Arabidopsis thaliana] gi|332643985|gb|AEE77506.1| mini zinc finger 2 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297815192|ref|XP_002875479.1| hypothetical protein ARALYDRAFT_484661 [Arabidopsis lyrata subsp. lyrata] gi|297321317|gb|EFH51738.1| hypothetical protein ARALYDRAFT_484661 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query95
TAIR|locus:1006230234100 MIF2 "mini zinc finger 2" [Ara 0.978 0.93 0.561 2.6e-22
TAIR|locus:401071345288 MIF3 "mini zinc finger 3" [Ara 0.905 0.977 0.5 2.8e-18
TAIR|locus:2019190102 MIF1 "AT1G74660" [Arabidopsis 0.968 0.901 0.425 6.7e-17
TAIR|locus:2062141 262 HB24 "AT2G18350" [Arabidopsis 0.547 0.198 0.519 6.2e-10
TAIR|locus:2012602 312 HB31 "AT1G14440" [Arabidopsis 0.652 0.198 0.451 1.8e-09
TAIR|locus:2077957 249 HB28 "homeobox protein 28" [Ar 0.536 0.204 0.529 1.9e-09
TAIR|locus:2095157 312 HB34 "AT3G28920" [Arabidopsis 0.705 0.214 0.450 5.1e-09
TAIR|locus:2121989 220 HB22 "AT4G24660" [Arabidopsis 0.557 0.240 0.452 1.2e-08
TAIR|locus:2150871 271 HB30 "AT5G15210" [Arabidopsis 0.547 0.191 0.5 1.2e-08
TAIR|locus:2065304 310 HB21 "homeobox protein 21" [Ar 0.568 0.174 0.481 3e-08
TAIR|locus:1006230234 MIF2 "mini zinc finger 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 259 (96.2 bits), Expect = 2.6e-22, P = 2.6e-22
 Identities = 55/98 (56%), Positives = 59/98 (60%)

Query:     1 MRKRQVVLRR---EEPXXXXXXXXXXXXXXX-YAECQKNHAAGVGGYAVDGCREFMASXX 56
             MRKRQVVLRR   EEP                Y ECQKNHAA VGGYAVDGCREFMAS  
Sbjct:     1 MRKRQVVLRRASPEEPSRSSSTASSLTVRTVRYGECQKNHAAAVGGYAVDGCREFMASRG 60

Query:    57 XXXXXXXXXXXXXX-HRSFHRREVDTEVVCECSSPSST 93
                            HRSFHRRE++TEVVC+C+SP ST
Sbjct:    61 EEGTVAALTCAACGCHRSFHRREIETEVVCDCNSPPST 98




GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0003677 "DNA binding" evidence=ISS
TAIR|locus:4010713452 MIF3 "mini zinc finger 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2019190 MIF1 "AT1G74660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062141 HB24 "AT2G18350" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2012602 HB31 "AT1G14440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077957 HB28 "homeobox protein 28" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2095157 HB34 "AT3G28920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2121989 HB22 "AT4G24660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2150871 HB30 "AT5G15210" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2065304 HB21 "homeobox protein 21" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00041072
hypothetical protein (130 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query95
pfam0477060 pfam04770, ZF-HD_dimer, ZF-HD protein dimerisation 2e-30
TIGR0156653 TIGR01566, ZF_HD_prot_N, ZF-HD homeobox protein Cy 2e-21
>gnl|CDD|218256 pfam04770, ZF-HD_dimer, ZF-HD protein dimerisation region Back     alignment and domain information
 Score =  101 bits (254), Expect = 2e-30
 Identities = 42/57 (73%), Positives = 47/57 (82%)

Query: 27 VRYAECQKNHAAGVGGYAVDGCREFMASGEEGTAAALTCAACGCHRSFHRREVDTEV 83
          VRY EC KNHAA +GG+AVDGC EFM SGEEGT  +L CAACGCHR+FHRRE + EV
Sbjct: 4  VRYRECLKNHAASIGGHAVDGCGEFMPSGEEGTPDSLKCAACGCHRNFHRREPEGEV 60


This family of proteins has are plant transcription factors, and have been named ZF-HD for zinc finger homeodomain proteins, on the basis of similarity to proteins of known structure. This region is thought to be involved in the formation of homo and heterodimers, and may form a zinc finger. Length = 60

>gnl|CDD|130629 TIGR01566, ZF_HD_prot_N, ZF-HD homeobox protein Cys/His-rich dimerization domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 95
PF0477060 ZF-HD_dimer: ZF-HD protein dimerisation region; In 100.0
TIGR0156653 ZF_HD_prot_N ZF-HD homeobox protein Cys/His-rich d 100.0
>PF04770 ZF-HD_dimer: ZF-HD protein dimerisation region; InterPro: IPR006456 The homeodomain (HD) is a 60-amino acid DNA-binding domain found in many transcription factors Back     alignment and domain information
Probab=100.00  E-value=3.8e-42  Score=223.28  Aligned_cols=58  Identities=74%  Similarity=1.318  Sum_probs=55.5

Q ss_pred             cceehHHhhhhhccccCCccccCccccccC-CCCCCcccccccCccccccCccccccCc
Q 043591           25 RSVRYAECQKNHAAGVGGYAVDGCREFMAS-GEEGTAAALTCAACGCHRSFHRREVDTE   82 (95)
Q Consensus        25 ~~v~Y~EC~kNHAa~~GghavDGCgEFmps-g~egt~~aL~CaACGCHRNFHRre~~~~   82 (95)
                      ..|+|+|||||||++||||+||||+||||+ |++|++++|+||||||||||||||+++|
T Consensus         2 ~~v~Y~EC~kNHAa~~Gg~a~DGCgEFm~~~g~eg~~~al~CaACgCHRnFHRre~~~e   60 (60)
T PF04770_consen    2 KVVRYRECLKNHAASIGGHAVDGCGEFMPSPGEEGTPEALKCAACGCHRNFHRREVEGE   60 (60)
T ss_pred             CceeHHHHHhhHhHhhCCcccccccccccCCCCCCCcccceecccCcchhcccCCcCCC
Confidence            469999999999999999999999999999 8999999999999999999999999875



HD-containing proteins are found in diverse organisms such as humans, Drosophila, nematode worms, and plants, where they play important roles in development. Zinc-finger-homeodomain (ZF- HD) subfamily proteins have only been identified in plants, and likely play plant specific roles. ZF-HD proteins are expressed predominantly or exclusively in floral tissue, indicating a likely regulatory role during floral development []. The ZF-HD class of homeodomain proteins may also be involved in the photosynthesis-related mesophyll-specific gene expression of phosphoenolpyruvate carboxylase in C4 species [] and in pathogen signaling and plant defense mechanisms []. These proteins share three domains of high sequence similarity: the homeodomain (II) located at the carboxy-terminus, and two other segments (Ia and Ib) located in the amino-terminal part. These N-terminal domains contain five conserved cysteine residues and at least three conserved histidine residues whose spacing ressembles zinc-binding domains involved in dimerization of transcription factors. Although the two domains contain at least eight potential zinc-binding amino-acids, the unique spacing of the conserved cysteine and histidine residues within domain Ib suggests that both domains form one rather than two zinc finger structures. The two conserved motifs Ia and Ib constitute a dimerization domain which is sufficient for the formation of homo- and heterodimers []. This entry represents the N-terminal Cysteine/Histidine-rich dimerization domain. The companion ZF-HD homeobox domain is described in IPR006455 from INTERPRO.

>TIGR01566 ZF_HD_prot_N ZF-HD homeobox protein Cys/His-rich dimerization domain Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00