Citrus Sinensis ID: 041598


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-----
MKRKNYMAPRGDNMSPLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPGCTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMGDSEMNGSVKQESVPTSHTDNEISQTKSQTEVAESSSAQIGLDLNCYPNREDMQPEESRVSIMTLDRVASMPLEDCHPNGLASLSSCMLQQVNNGDHERLLSDEGFLASVGWGRDQEPNLD
ccccccccccccccccccccccEEccccccccccccccccccccEEEcccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHcccccccccccccccccccccccccccccEEEEEcccccccccccccccccccHHHHHHHHHHHHHHHcccHHHHHHHHccccccccccccccccccccccccccccccccccccHHHHccccccccccccccccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccEEEccccccccccccc
ccHHHHccccccccccccccccEEEEcccHHHccccccccccccEEEccccccHcHccccccccccHHHHHHHccccHHHHHHccHHHHcccHccccccccHHHHHHHHcccccccccccccccccccccccccccEEEEEEcccccccccccccccHHHHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccEEcccccccHHHccccccccHHHHHHHccccHHHHHHcccccHcccccccccccccccEccccccEHHHccccccccccc
mkrknymaprgdnmsplweqeqwapcddcskwrklptdallppkwtcsdnvwdsirCSCSAAEEMIQKDLENLLRVGRESKKRKIVESQrrdqqnepsglDALATAAVLgdyagdagepssvgatttrhprhrpgctcivciqppsgkgkhkptctcnvcmtVKRRFKTLMLRKKRRQSEREVEVAQKnhsdqmgdsemngsvkqesvptshtdneisqtksqtevaesssaqigldlncypnredmqpeesrvsimtldrvasmpledchpnglaslSSCMLQqvnngdherllsdegflasvgwgrdqepnld
mkrknymaprgdnmsplweqEQWAPCDDCSKWRKlptdallppkwtcsdnvWDSIRCSCSAAEEMIQKDLENLlrvgreskkrkivesqrrdqqnepsgLDALATAAVLGDYAGDAGEPSSVgatttrhprhrpgCTCIVCIqppsgkgkhkptctcnvcmtvkrrFKTLMlrkkrrqserevevaqknhsdqmgdsemngsVKQEsvptshtdneisqtksqtevaesssaqigLDLNCYPNREDMQPEESRVSIMTLDRVASMPLEDCHPNGLASLSSCMLQQVNNGDHERLLSDEgflasvgwgrdqepnld
MKRKNYMAPRGDNMSPLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPGCTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMGDSEMNGSVKQESVPTSHTDNEISQTKSQTEVAESSSAQIGLDLNCYPNREDMQPEESRVSIMTLDRVASMPLEDCHPNGLASLSSCMLQQVNNGDHERLLSDEGFLASVGWGRDQEPNLD
****************LWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENL********************************AAVLG***********************PGCTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLML*********************************************************************************************************L*SCMLQ**********L**EGFLASVGW*********
****************LWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIR**************************************************************************************************************************************************************************************************************ASMP*************************************************
***********DNMSPLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGRESK****************SGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPGCTCIVCIQPP********TCTCNVCMTVKRRFKTLML***********************************************************AQIGLDLNCYPNREDMQPEESRVSIMTLDRVASMPLEDCHPNGLASLSSCMLQQVNNGDHERLLSDEGFLASVGWGRDQEPNLD
*************MSPLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGRESKK*K*************SGLDALATAAVLGD*********************RPGCTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRR**********************************************************L***CYPNRE*******RVSIMT*DRVASMPLEDCHPNGLAS********************EGFLASVGWGR*******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKRKNYMAPRGDNMSPLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPGCTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMGDSEMNGSVKQESVPTSHTDNEISQTKSQTEVAESSSAQIGLDLNCYPNREDMQPEESRVSIMTLDRVASMPLEDCHPNGLASLSSCMLQQVNNGDHERLLSDEGFLASVGWGRDQEPNLD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query315 2.2.26 [Sep-21-2011]
Q8W4L5790 B3 domain-containing tran yes no 0.612 0.244 0.628 2e-64
Q6Z3U3949 B3 domain-containing prot yes no 0.793 0.263 0.488 2e-55
Q0D5G4955 B3 domain-containing prot no no 0.739 0.243 0.460 1e-49
Q5CCK4780 B3 domain-containing tran no no 0.685 0.276 0.430 8e-47
O65420713 B3 domain-containing tran no no 0.336 0.148 0.367 3e-10
Q8TE76 937 MORC family CW-type zinc yes no 0.152 0.051 0.403 0.0008
>sp|Q8W4L5|VAL1_ARATH B3 domain-containing transcription repressor VAL1 OS=Arabidopsis thaliana GN=VAL1 PE=1 SV=1 Back     alignment and function desciption
 Score =  246 bits (627), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 125/199 (62%), Positives = 147/199 (73%), Gaps = 6/199 (3%)

Query: 16  PLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLR 75
           P  EQE+WA CDDCSKWR+LP DALL  KWTC DNVWD  RCSCSA EE + K+LEN+L+
Sbjct: 537 PSGEQERWATCDDCSKWRRLPVDALLSFKWTCIDNVWDVSRCSCSAPEESL-KELENVLK 595

Query: 76  VGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPG 135
           VGRE KKR+  ESQ    Q EP GLDALA+AAVLGD     GEP    ATTTRHPRHR G
Sbjct: 596 VGREHKKRRTGESQAAKSQQEPCGLDALASAAVLGD---TIGEPEV--ATTTRHPRHRAG 650

Query: 136 CTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMG 195
           C+CIVCIQPPSGKG+HKPTC C VC TVKRRFKTLM+R+K++Q ER+V  A+      M 
Sbjct: 651 CSCIVCIQPPSGKGRHKPTCGCTVCSTVKRRFKTLMMRRKKKQLERDVTAAEDKKKKDME 710

Query: 196 DSEMNGSVKQESVPTSHTD 214
            +E + S +++ V T+  D
Sbjct: 711 LAESDKSKEEKEVNTARID 729




Transcriptional repressor of gene expression involved in embryonic pathways, such as LEC1, ABI3, and FUS3. Repressor of the sugar-inducible genes involved in the seed maturation program in seedlings. Plays an essential role in regulating the transition from seed maturation to seedling growth. Functionally redundant with VAL2/HSL1.
Arabidopsis thaliana (taxid: 3702)
>sp|Q6Z3U3|Y7797_ORYSJ B3 domain-containing protein Os07g0679700 OS=Oryza sativa subsp. japonica GN=Os07g0679700 PE=2 SV=1 Back     alignment and function description
>sp|Q0D5G4|Y7633_ORYSJ B3 domain-containing protein Os07g0563300 OS=Oryza sativa subsp. japonica GN=Os07g0563300 PE=2 SV=2 Back     alignment and function description
>sp|Q5CCK4|VAL2_ARATH B3 domain-containing transcription repressor VAL2 OS=Arabidopsis thaliana GN=VAL2 PE=2 SV=1 Back     alignment and function description
>sp|O65420|VAL3_ARATH B3 domain-containing transcription factor VAL3 OS=Arabidopsis thaliana GN=VAL3 PE=4 SV=3 Back     alignment and function description
>sp|Q8TE76|MORC4_HUMAN MORC family CW-type zinc finger protein 4 OS=Homo sapiens GN=MORC4 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query315
255565913 861 transcription factor, putative [Ricinus 0.907 0.332 0.603 2e-93
224145431 786 predicted protein [Populus trichocarpa] 0.828 0.332 0.610 2e-88
449519482 605 PREDICTED: B3 domain-containing transcri 0.784 0.408 0.624 1e-85
449439577 1195 PREDICTED: B3 domain-containing protein 0.784 0.206 0.624 3e-85
356569441 895 PREDICTED: B3 domain-containing transcri 0.901 0.317 0.536 1e-79
356537702 898 PREDICTED: B3 domain-containing transcri 0.730 0.256 0.637 1e-79
224140599 917 predicted protein [Populus trichocarpa] 0.841 0.288 0.571 6e-79
255575574 891 transcription factor, putative [Ricinus 0.819 0.289 0.571 5e-76
359478914 881 PREDICTED: B3 domain-containing protein 0.895 0.320 0.535 2e-75
356495129 876 PREDICTED: B3 domain-containing transcri 0.841 0.302 0.548 2e-74
>gi|255565913|ref|XP_002523945.1| transcription factor, putative [Ricinus communis] gi|223536792|gb|EEF38432.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 181/300 (60%), Positives = 221/300 (73%), Gaps = 14/300 (4%)

Query: 16  PLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLR 75
           P   QEQWA CDDCSKWRKLP DALLPPKWTC DNVWDS RC+CSA EEM  KDL+ LLR
Sbjct: 560 PSGGQEQWAQCDDCSKWRKLPEDALLPPKWTCLDNVWDSSRCTCSAPEEMNPKDLDTLLR 619

Query: 76  VGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPG 135
             ++ K+R++ +  +   + EPSGLDALA+AAVLGD  GD GEP SVGA TT+HPRHRPG
Sbjct: 620 GSKDFKRRRMADRHKPSSECEPSGLDALASAAVLGDNIGDLGEP-SVGA-TTKHPRHRPG 677

Query: 136 CTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMG 195
           CTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKK+RQSERE E++QK+ +D+  
Sbjct: 678 CTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKKRQSEREAEISQKDDNDRKD 737

Query: 196 DSEMNGSVKQESVPTSHTDNEISQTKSQTEVAESSSAQIGLDLNCYPNREDMQPEESRVS 255
           +  M G +    +  ++++NE +  + QTE+ E+SS QI  DLN +PNREDMQ +   +S
Sbjct: 738 EFAMIGRLSHAVLNLNNSENEGNYNRKQTEIPETSSGQI--DLNSHPNREDMQLDIQGLS 795

Query: 256 IMTLDRVASMPLED-CHPNGL--------ASLSSCMLQQVNNGDHERLLSDEGFLASVGW 306
           +M L   A++P ++    NGL        AS+ SC+  Q NNG+    LSDE FLASVGW
Sbjct: 796 MMNLANAANLPFDNYVKQNGLADFLCERQASMGSCLHSQ-NNGESLIRLSDEAFLASVGW 854




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224145431|ref|XP_002325640.1| predicted protein [Populus trichocarpa] gi|222862515|gb|EEF00022.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449519482|ref|XP_004166764.1| PREDICTED: B3 domain-containing transcription repressor VAL1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449439577|ref|XP_004137562.1| PREDICTED: B3 domain-containing protein Os07g0679700-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|356569441|ref|XP_003552909.1| PREDICTED: B3 domain-containing transcription repressor VAL1-like [Glycine max] Back     alignment and taxonomy information
>gi|356537702|ref|XP_003537364.1| PREDICTED: B3 domain-containing transcription repressor VAL1-like [Glycine max] Back     alignment and taxonomy information
>gi|224140599|ref|XP_002323669.1| predicted protein [Populus trichocarpa] gi|222868299|gb|EEF05430.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255575574|ref|XP_002528687.1| transcription factor, putative [Ricinus communis] gi|223531859|gb|EEF33676.1| transcription factor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|359478914|ref|XP_002276326.2| PREDICTED: B3 domain-containing protein Os07g0679700-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|356495129|ref|XP_003516433.1| PREDICTED: B3 domain-containing transcription repressor VAL2-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query315
TAIR|locus:2064417790 HSI2 "high-level expression of 0.647 0.258 0.606 7.6e-64
TAIR|locus:2116592780 HSL1 "HSI2-like 1" [Arabidopsi 0.574 0.232 0.489 5.1e-46
UNIPROTKB|F1SGX2 940 MORC3 "Uncharacterized protein 0.625 0.209 0.260 0.00011
RGD|1307924 679 Morc3 "MORC family CW-type zin 0.225 0.104 0.384 0.00019
RGD|1559905 928 Morc4 "MORC family CW-type zin 0.590 0.200 0.25 0.00023
MGI|MGI:1922996 928 Morc4 "microrchidia 4" [Mus mu 0.288 0.098 0.336 0.00033
UNIPROTKB|F1PFJ7 939 MORC3 "Uncharacterized protein 0.619 0.207 0.270 0.00039
UNIPROTKB|F1MFY1 960 MORC3 "Uncharacterized protein 0.152 0.05 0.442 0.00045
UNIPROTKB|Q8TE76 937 MORC4 "MORC family CW-type zin 0.250 0.084 0.321 0.00064
UNIPROTKB|F1P705 898 MORC4 "Uncharacterized protein 0.339 0.119 0.303 0.00079
TAIR|locus:2064417 HSI2 "high-level expression of sugar-inducible gene 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 128/211 (60%), Positives = 150/211 (71%)

Query:    16 PLWEQEQWAPCDDCSKWRKLPTDALLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLR 75
             P  EQE+WA CDDCSKWR+LP DALL  KWTC DNVWD  RCSCSA EE + K+LEN+L+
Sbjct:   537 PSGEQERWATCDDCSKWRRLPVDALLSFKWTCIDNVWDVSRCSCSAPEESL-KELENVLK 595

Query:    76 VGRESKKRKIVESQRRDQQNEPSGLDALATAAVLGDYAGDAGEPSSVGATTTRHPRHRPG 135
             VGRE KKR+  ESQ    Q EP GLDALA+AAVLGD     GEP    ATTTRHPRHR G
Sbjct:   596 VGREHKKRRTGESQAAKSQQEPCGLDALASAAVLGD---TIGEPEV--ATTTRHPRHRAG 650

Query:   136 CTCIVCIQPPSGKGKHKPTCTCNVCMTVKRRFKTLMLRKKRRQSEREVEVAQKNHSDQMG 195
             C+CIVCIQPPSGKG+HKPTC C VC TVKRRFKTLM+R+K++Q ER+V  A+      M 
Sbjct:   651 CSCIVCIQPPSGKGRHKPTCGCTVCSTVKRRFKTLMMRRKKKQLERDVTAAEDKKKKDME 710

Query:   196 DSEMNGSVKQESVPTSHTD-NEISQTKSQTE 225
              +E + S +++ V T+  D N     K   E
Sbjct:   711 LAESDKSKEEKEVNTARIDLNSDPYNKEDVE 741




GO:0003677 "DNA binding" evidence=IEA
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS
GO:0005739 "mitochondrion" evidence=IDA
GO:0009737 "response to abscisic acid stimulus" evidence=TAS
GO:0009744 "response to sucrose stimulus" evidence=TAS
GO:0045892 "negative regulation of transcription, DNA-dependent" evidence=IDA
GO:0010030 "positive regulation of seed germination" evidence=IGI
GO:2000034 "regulation of seed maturation" evidence=IGI
GO:0005515 "protein binding" evidence=IPI
TAIR|locus:2116592 HSL1 "HSI2-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|F1SGX2 MORC3 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
RGD|1307924 Morc3 "MORC family CW-type zinc finger 3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
RGD|1559905 Morc4 "MORC family CW-type zinc finger 4" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
MGI|MGI:1922996 Morc4 "microrchidia 4" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1PFJ7 MORC3 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1MFY1 MORC3 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q8TE76 MORC4 "MORC family CW-type zinc finger protein 4" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1P705 MORC4 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00161312
hypothetical protein (917 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query315
pfam0749650 pfam07496, zf-CW, CW-type Zinc Finger 1e-09
>gnl|CDD|219431 pfam07496, zf-CW, CW-type Zinc Finger Back     alignment and domain information
 Score = 53.2 bits (128), Expect = 1e-09
 Identities = 20/50 (40%), Positives = 24/50 (48%), Gaps = 6/50 (12%)

Query: 21 EQWAPCDDCSKWRKLP-----TDALLPPKWTCSDNVWDSIRCSCSAAEEM 65
          + W  CD C KWRKLP       + LP  W CS N   +   SC   EE+
Sbjct: 1  DYWVQCDKCLKWRKLPEEYEEIRSKLPDPWFCSMNPDPA-YNSCDDPEEI 49


This domain appears to be a zinc finger. The alignment shows four conserved cysteine residues and a conserved tryptophan. It was first identified by, and is predicted to be a "highly specialised mononuclear four-cysteine zinc finger...that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including...chromatin methylation status and early embryonic development." Weak homology to pfam00628 further evidences these predictions (personal obs: C Yeats). Twelve different CW-domain-containing protein subfamilies are described, with different subfamilies being characteristic of vertebrates, higher plants and other animals in which these domain is found. Length = 50

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 315
PF0749650 zf-CW: CW-type Zinc Finger; InterPro: IPR011124 Zi 99.68
smart0024947 PHD PHD zinc finger. The plant homeodomain (PHD) f 88.07
>PF07496 zf-CW: CW-type Zinc Finger; InterPro: IPR011124 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
Probab=99.68  E-value=7.9e-18  Score=121.85  Aligned_cols=44  Identities=48%  Similarity=1.036  Sum_probs=30.2

Q ss_pred             CCeEecccCccccCCCCCC-----CCCCCceeecCCcCCCCCCCCccccc
Q 041598           21 EQWAPCDDCSKWRKLPTDA-----LLPPKWTCSDNVWDSIRCSCSAAEEM   65 (315)
Q Consensus        21 ~~WVQCD~C~KWRrLP~~~-----~lPd~W~C~mN~WD~~~~sCsaPEE~   65 (315)
                      +.|||||.|+|||+||.++     .+|+.|||+||+ |+.+++|++|||.
T Consensus         1 ~~WVQCd~C~KWR~lp~~~~~~~~~~~d~W~C~~n~-~~~~~sC~~pee~   49 (50)
T PF07496_consen    1 DYWVQCDSCLKWRRLPEEVDPIREELPDPWYCSMNP-DPPFNSCDAPEEI   49 (50)
T ss_dssp             -EEEE-TTT--EEEE-CCHHCTSCCSSTT--GGGSS--CCC-STTS--SS
T ss_pred             CeEEECCCCCceeeCChhhCcccccCCCeEEcCCCC-CCCCCCCCCcccC
Confidence            4799999999999999986     468899999999 8899999999985



Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a CW-type zinc finger motif, named for its conserved cysteine and tryptophan residues. It is predicted to be a highly specialised mononuclear four-cysteine (C4) zinc finger that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including chromatin methylation status and early embryonic development. Weak homology to members of IPR001965 from INTERPRO further evidences these predictions. The domain is found exclusively in vertebrates, vertebrate-infecting parasites and higher plants []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2RR4_A 2E61_A 2L7P_A.

>smart00249 PHD PHD zinc finger Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query315
2l7p_A100 Histone-lysine N-methyltransferase ASHH2; CW-domai 7e-18
2e61_A69 Zinc finger CW-type PWWP domain protein 1; ZF-CW d 5e-11
>2l7p_A Histone-lysine N-methyltransferase ASHH2; CW-domain; NMR {Arabidopsis thaliana} Length = 100 Back     alignment and structure
 Score = 76.8 bits (188), Expect = 7e-18
 Identities = 21/94 (22%), Positives = 39/94 (41%), Gaps = 5/94 (5%)

Query: 2  KRKNYMAPRGDNMSPLWEQEQWAPCDDCSKWRKLPTDAL----LPPKWTCSDNVWDSIRC 57
              +M            +  W  CDDC KWR++P   +       +W C +N  D    
Sbjct: 7  VGSEFMVVDVTIEDSYSTESAWVRCDDCFKWRRIPASVVGSIDESSRWICMNN-SDKRFA 65

Query: 58 SCSAAEEMIQKDLENLLRVGRESKKRKIVESQRR 91
           CS ++EM  +++   L +G++       ++ +R
Sbjct: 66 DCSKSQEMSNEEINEELGIGQDEADAYDCDAAKR 99


>2e61_A Zinc finger CW-type PWWP domain protein 1; ZF-CW domain, structural genomics, NPPSFA, national project protein structural and functional analyses; NMR {Homo sapiens} PDB: 2rr4_A* Length = 69 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query315
2l7p_A100 Histone-lysine N-methyltransferase ASHH2; CW-domai 99.81
2e61_A69 Zinc finger CW-type PWWP domain protein 1; ZF-CW d 99.75
4gut_A 776 Lysine-specific histone demethylase 1B; histone de 94.72
1we9_A64 PHD finger family protein; structural genomics, PH 91.93
1wee_A72 PHD finger family protein; structural genomics, PH 90.91
2lv9_A98 Histone-lysine N-methyltransferase MLL5; zinc fing 89.05
3o70_A68 PHD finger protein 13; PHF13, structural genomics 88.97
3kqi_A75 GRC5, PHD finger protein 2; metal-binding, zinc-fi 88.56
3o7a_A52 PHD finger protein 13 variant; PHF13, zinc finger, 87.4
2k16_A75 Transcription initiation factor TFIID subunit 3; p 86.8
2kgg_A52 Histone demethylase jarid1A; PHD finger, histone m 86.25
>2l7p_A Histone-lysine N-methyltransferase ASHH2; CW-domain; NMR {Arabidopsis thaliana} Back     alignment and structure
Probab=99.81  E-value=3.5e-21  Score=157.10  Aligned_cols=72  Identities=28%  Similarity=0.678  Sum_probs=60.9

Q ss_pred             ccCCCCccCCCCCCCCCeEecccCccccCCCCCC----CCCCCceeecCCcCCCCCCCCcccccChHHHHHHHHhhH
Q 041598            6 YMAPRGDNMSPLWEQEQWAPCDDCSKWRKLPTDA----LLPPKWTCSDNVWDSIRCSCSAAEEMIQKDLENLLRVGR   78 (315)
Q Consensus         6 ~~~~~~f~~r~~~e~~~WVQCD~C~KWRrLP~~~----~lPd~W~C~mN~WD~~~~sCsaPEE~t~~eid~ll~~~~   78 (315)
                      |++--.....+.....+|||||.|+|||+||.++    .+|++|||+||+ |+.|++|++|||++++||+.+|+++.
T Consensus        11 ~~~~d~~~~~~~~~~~~WVQCD~C~KWRrLP~~~~~~~~~pd~W~C~mN~-D~~~nsCs~PEE~~~~ei~~~l~~~~   86 (100)
T 2l7p_A           11 FMVVDVTIEDSYSTESAWVRCDDCFKWRRIPASVVGSIDESSRWICMNNS-DKRFADCSKSQEMSNEEINEELGIGQ   86 (100)
T ss_dssp             CCCCCSSCSSCCSSSSEEEECTTTCCEEEECHHHHTTSTTSSCCCGGGSS-CSSSCSTTSCCSSCHHHHHHHHTCCC
T ss_pred             eeeeeeccCCCCCCCCeEEeeCCCCccccCChhHccccCCCCCceeCCCC-CCCCCCCCCccCCCHHHHHHHhcccc
Confidence            3444444444555699999999999999999864    479999999998 99999999999999999999999864



>2e61_A Zinc finger CW-type PWWP domain protein 1; ZF-CW domain, structural genomics, NPPSFA, national project protein structural and functional analyses; NMR {Homo sapiens} PDB: 2rr4_A* Back     alignment and structure
>4gut_A Lysine-specific histone demethylase 1B; histone demethylase; HET: FAD PGE; 2.00A {Homo sapiens} PDB: 4gur_A* 4gus_A* 4guu_A* 4fwe_A* 4fwf_A* 4fwj_A* 4gu1_A* Back     alignment and structure
>1we9_A PHD finger family protein; structural genomics, PHD domain, riken structural genomics/proteomics initiative, RSGI, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: g.50.1.2 Back     alignment and structure
>1wee_A PHD finger family protein; structural genomics, PHD domain, riken structural genomics/proteomics initiative, RSGI, DNA binding protein; NMR {Arabidopsis thaliana} SCOP: g.50.1.2 Back     alignment and structure
>2lv9_A Histone-lysine N-methyltransferase MLL5; zinc finger, transcription, protein binding, NESG, northeast structural genomics consortium, SGC; NMR {Homo sapiens} Back     alignment and structure
>3o70_A PHD finger protein 13; PHF13, structural genomics consortium, SGC, structural genom type zinc finger, protein binding, zinc ION binding; 1.85A {Homo sapiens} Back     alignment and structure
>3kqi_A GRC5, PHD finger protein 2; metal-binding, zinc-finger, histone-binding, NUC protein; HET: M3L; 1.78A {Homo sapiens} SCOP: g.50.1.2 Back     alignment and structure
>3o7a_A PHD finger protein 13 variant; PHF13, zinc finger, PHD domain, nuclear protein, structural structural genomics consortium, SGC, protein binding; HET: M3L; 1.67A {Homo sapiens} Back     alignment and structure
>2k16_A Transcription initiation factor TFIID subunit 3; protein, alternative splicing, metal-binding, nucleus, phosphoprotein, transcription regulation; NMR {Mus musculus} PDB: 2k17_A* Back     alignment and structure
>2kgg_A Histone demethylase jarid1A; PHD finger, histone modification, leukemia, alternative splicing, chromatin regulator, developmental protein; NMR {Homo sapiens} PDB: 2kgi_A* 3gl6_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query315
d1weea_72 PHD finger protein At1g33420 {Thale cress (Arabido 88.13
d1we9a_64 PHD finger protein At5g26210 {Thale cress (Arabido 84.09
d1wema_76 Death associated transcription factor 1, Datf1 (DI 81.82
d1wepa_79 PHD finger protein 8 {Mouse (Mus musculus) [TaxId: 81.77
>d1weea_ g.50.1.2 (A:) PHD finger protein At1g33420 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: Small proteins
fold: FYVE/PHD zinc finger
superfamily: FYVE/PHD zinc finger
family: PHD domain
domain: PHD finger protein At1g33420
species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=88.13  E-value=0.16  Score=35.66  Aligned_cols=31  Identities=29%  Similarity=0.681  Sum_probs=22.5

Q ss_pred             CCCCCeEecccCccccCCCC-----CCCCCCCceee
Q 041598           18 WEQEQWAPCDDCSKWRKLPT-----DALLPPKWTCS   48 (315)
Q Consensus        18 ~e~~~WVQCD~C~KWRrLP~-----~~~lPd~W~C~   48 (315)
                      .....|||||.|.+|-.+.=     .-..++.|+|.
T Consensus        26 ~~~~~mv~Cd~C~~w~H~~C~g~~~~~~~~~~~~C~   61 (72)
T d1weea_          26 DDGERMLACDGCGVWHHTRCIGINNADALPSKFLCF   61 (72)
T ss_dssp             CCSSCEEECSSSCEEEETTTTTCCTTSCCCSCCCCH
T ss_pred             CCCCeEEEeCCCCCcCchhhcCCccccCCCCcEECc
Confidence            34567999999999986632     12456889985



>d1we9a_ g.50.1.2 (A:) PHD finger protein At5g26210 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
>d1wema_ g.50.1.2 (A:) Death associated transcription factor 1, Datf1 (DIO-1) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1wepa_ g.50.1.2 (A:) PHD finger protein 8 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure