Citrus Sinensis ID: 029898


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-----
MDEDEMEELDNKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
ccHHHHHHHccccccccccccccccccccEEEEEEccEEEEcccccccEEEEEccccccEEEEEEEccEEEEccccccccccccEEEEccccccccccccccccccccccHHcccccccEEEccccccccccccccHHHHHHHHHEEEEccccccccccccEEEccEEEEEEccccEEEEEEEEc
ccHHHHHHHccccccccccccccccccccEEEEEcccEEEEccccccEEEEEEEcHHHHEEEEEEcccEEEEcccccccccccHHEEEcccccccEEEccHHccccccccEEEccccccEEEccccccccccccccccHHHHHHHHHHHHHHHHccccccEEEEHcEEEEEEccccEEEEEEEEc
MDEDEMeeldnkkdydveydplvaggdeditmvdsksfvytqgwdsetvvdyridedefhkislmecdffirkppdpdndvydfremyvtppdtdiyaipkvlapmpqkyircavsdygcynvteppidaprdpmykseREIHKVFLVKHyrnrrlgdpefvlDFEEIYVIDSKTKSITRAKVLV
mdedemeeldnkkdydveydplvaggdeditmVDSKSFvytqgwdseTVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFlvkhyrnrrlgdpefvldfeeiyvidsktksitrakvlv
MDEDEMEELDNKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
****************VEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPID******YKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSK***********
**********NKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
**********NKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
*****MEELDNKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDEDEMEELDNKKDYDVEYDPLVAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCAVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSKTKSITRAKVLV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query185
317106688 332 JMS09K11.8 [Jatropha curcas] 1.0 0.557 0.858 2e-88
255558568 327 conserved hypothetical protein [Ricinus 0.945 0.535 0.853 4e-84
297850594 328 hypothetical protein ARALYDRAFT_472396 [ 1.0 0.564 0.812 5e-84
18395008 328 plastid transcriptionally active 6 [Arab 1.0 0.564 0.802 8e-83
224063005 325 predicted protein [Populus trichocarpa] 1.0 0.569 0.832 3e-82
225459665 333 PREDICTED: uncharacterized protein LOC10 0.945 0.525 0.844 8e-82
449500699 337 PREDICTED: uncharacterized protein LOC10 0.945 0.519 0.8 1e-81
356510055 328 PREDICTED: uncharacterized protein LOC10 0.956 0.539 0.790 3e-80
356515939 329 PREDICTED: uncharacterized protein LOC10 0.956 0.537 0.8 9e-77
302141763221 unnamed protein product [Vitis vinifera] 0.832 0.696 0.889 2e-75
>gi|317106688|dbj|BAJ53190.1| JMS09K11.8 [Jatropha curcas] Back     alignment and taxonomy information
 Score =  330 bits (845), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 164/191 (85%), Positives = 176/191 (92%), Gaps = 6/191 (3%)

Query: 1   MDEDEMEELDNKKDYDVEYDP--LVA----GGDEDITMVDSKSFVYTQGWDSETVVDYRI 54
           MDE+E EE+DNKKDYD++YDP  +VA    GGDEDI MV SKSFV TQGWDSE +VDYRI
Sbjct: 75  MDEEEAEEVDNKKDYDLDYDPSAVVAAANVGGDEDIAMVISKSFVSTQGWDSEKIVDYRI 134

Query: 55  DEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCA 114
           DE+EFHKISL++CDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCA
Sbjct: 135 DEEEFHKISLLDCDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRCA 194

Query: 115 VSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDSK 174
           +SDYG YNVTEPPIDAPRDP+YKSEREI KVFL KHYRNRR GDPEFVLDFEEIYVIDSK
Sbjct: 195 MSDYGRYNVTEPPIDAPRDPLYKSEREILKVFLTKHYRNRRFGDPEFVLDFEEIYVIDSK 254

Query: 175 TKSITRAKVLV 185
           TKSITRAKV+V
Sbjct: 255 TKSITRAKVVV 265




Source: Jatropha curcas

Species: Jatropha curcas

Genus: Jatropha

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255558568|ref|XP_002520309.1| conserved hypothetical protein [Ricinus communis] gi|223540528|gb|EEF42095.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|297850594|ref|XP_002893178.1| hypothetical protein ARALYDRAFT_472396 [Arabidopsis lyrata subsp. lyrata] gi|297339020|gb|EFH69437.1| hypothetical protein ARALYDRAFT_472396 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18395008|ref|NP_564144.1| plastid transcriptionally active 6 [Arabidopsis thaliana] gi|79318316|ref|NP_001031076.1| plastid transcriptionally active 6 [Arabidopsis thaliana] gi|5263310|gb|AAD41412.1|AC007727_1 EST gb|N95925 comes from this gene [Arabidopsis thaliana] gi|11692926|gb|AAG40066.1|AF324715_1 At1g21600 [Arabidopsis thaliana] gi|11908094|gb|AAG41476.1|AF326894_1 unknown protein [Arabidopsis thaliana] gi|17386146|gb|AAL38619.1|AF446886_1 At1g21600/F24J8.10 [Arabidopsis thaliana] gi|15450597|gb|AAK96570.1| At1g21600/F24J8.10 [Arabidopsis thaliana] gi|21280837|gb|AAM44921.1| unknown protein [Arabidopsis thaliana] gi|222423451|dbj|BAH19696.1| AT1G21600 [Arabidopsis thaliana] gi|332192001|gb|AEE30122.1| plastid transcriptionally active 6 [Arabidopsis thaliana] gi|332192002|gb|AEE30123.1| plastid transcriptionally active 6 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|224063005|ref|XP_002300963.1| predicted protein [Populus trichocarpa] gi|222842689|gb|EEE80236.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225459665|ref|XP_002284617.1| PREDICTED: uncharacterized protein LOC100250482 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449500699|ref|XP_004161171.1| PREDICTED: uncharacterized protein LOC101225304 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356510055|ref|XP_003523756.1| PREDICTED: uncharacterized protein LOC100782614 [Glycine max] Back     alignment and taxonomy information
>gi|356515939|ref|XP_003526654.1| PREDICTED: uncharacterized protein LOC100792016 [Glycine max] Back     alignment and taxonomy information
>gi|302141763|emb|CBI18966.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query185
TAIR|locus:2026972328 PTAC6 "AT1G21600" [Arabidopsis 1.0 0.564 0.828 3.3e-86
TAIR|locus:2026972 PTAC6 "AT1G21600" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 862 (308.5 bits), Expect = 3.3e-86, P = 3.3e-86
 Identities = 159/192 (82%), Positives = 173/192 (90%)

Query:     1 MDEDEMEELDNKKDYDVEYDPLVA-------GGDEDITMVDSKSFVYTQGWDSETVVDYR 53
             MD++E+EE+DNKKDYDVEYDPL A       GGD DI  V SKSF+ TQGWDS+ VVDYR
Sbjct:    70 MDDEEVEEVDNKKDYDVEYDPLAAAIAAAGGGGDGDIAFVQSKSFISTQGWDSDMVVDYR 129

Query:    54 IDEDEFHKISLMECDFFIRKPPDPDNDVYDFREMYVTPPDTDIYAIPKVLAPMPQKYIRC 113
             I+EDEFHK+SLM+CDFFIRKPPDPDNDVYDFREMYVTPPDTDIY+IP+VLAPMPQKYIRC
Sbjct:   130 INEDEFHKLSLMDCDFFIRKPPDPDNDVYDFREMYVTPPDTDIYSIPRVLAPMPQKYIRC 189

Query:   114 AVSDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYVIDS 173
             A+SDYGCYNVTEPPIDAPRDP+YKSEREI KVFL KHYRNRR  DPEFVLD EEIYVIDS
Sbjct:   190 AMSDYGCYNVTEPPIDAPRDPLYKSEREISKVFLTKHYRNRRTNDPEFVLDLEEIYVIDS 249

Query:   174 KTKSITRAKVLV 185
             KTKSITRA+VLV
Sbjct:   250 KTKSITRARVLV 261


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.140   0.423    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      185       185   0.00080  110 3  11 22  0.46    32
                                                     31  0.41    35


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  589 (63 KB)
  Total size of DFA:  172 KB (2100 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  19.33u 0.13s 19.46t   Elapsed:  00:00:01
  Total cpu time:  19.33u 0.13s 19.46t   Elapsed:  00:00:01
  Start:  Tue May 21 01:18:46 2013   End:  Tue May 21 01:18:47 2013


GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009295 "nucleoid" evidence=IDA
GO:0009508 "plastid chromosome" evidence=IDA
GO:0042793 "transcription from plastid promoter" evidence=RCA;IMP
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA;IMP
GO:0006364 "rRNA processing" evidence=RCA
GO:0006417 "regulation of translation" evidence=RCA
GO:0009657 "plastid organization" evidence=RCA
GO:0009902 "chloroplast relocation" evidence=RCA
GO:0009965 "leaf morphogenesis" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0030154 "cell differentiation" evidence=RCA
GO:0034660 "ncRNA metabolic process" evidence=RCA
GO:0035304 "regulation of protein dephosphorylation" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query185
pfam0787964 pfam07879, PHB_acc_N, PHB/PHA accumulation regulat 0.003
>gnl|CDD|149123 pfam07879, PHB_acc_N, PHB/PHA accumulation regulator DNA-binding domain Back     alignment and domain information
 Score = 34.5 bits (80), Expect = 0.003
 Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 13/48 (27%)

Query: 147 LVKHYRNRRLGDPE------------FVLDFEEIYVIDSKTKS-ITRA 181
           L+K Y NRRL D E             V D E+  VID+KT   ITR+
Sbjct: 2   LIKKYANRRLYDTETSQYITLEDIAQLVRDGEDFKVIDAKTGEDITRS 49


This domain is found at the N-terminus of the Polyhydroxyalkanoate (PHA) synthesis regulators. These regulators have been shown to directly bind DNA and PHA. The invariant nature of this domain compared to the C-terminal pfam05233 domain(s) suggests that it contains the DNA-binding function. Length = 64

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 185
TIGR01848107 PHA_reg_PhaR polyhydroxyalkanoate synthesis repres 90.37
PF0787964 PHB_acc_N: PHB/PHA accumulation regulator DNA-bind 89.82
>TIGR01848 PHA_reg_PhaR polyhydroxyalkanoate synthesis repressor PhaR Back     alignment and domain information
Probab=90.37  E-value=0.17  Score=39.68  Aligned_cols=35  Identities=46%  Similarity=0.674  Sum_probs=29.8

Q ss_pred             eehhcccccCCCCc------------ceeecceEEEeecCcce-eeee
Q 029898          147 LVKHYRNRRLGDPE------------FVLDFEEIYVIDSKTKS-ITRA  181 (185)
Q Consensus       147 l~khyrnrr~~dp~------------fvldfeeiyvids~tks-itra  181 (185)
                      ++|-|-|||+=|.+            +|.+=+++-|+|+||+. |||+
T Consensus         2 ~IKKY~NRRLYDT~tS~YITLedi~~lV~~g~~f~V~DakTgeDiT~~   49 (107)
T TIGR01848         2 IIKKYPNRRLYDTETSSYVTLEDIRDLVREGREFQVVDSKSGDDLTRS   49 (107)
T ss_pred             cccccCCCcccCCCccceeeHHHHHHHHHCCCeEEEEECCCCchhHHH
Confidence            68999999998875            78999999999999975 5553



Poly-B-hydroxyalkanoates are lipidlike carbon/energy storage polymers found in granular inclusions. PhaR is a regulatory protein found in general near other proteins associated with polyhydroxyalkanoate (PHA) granule biosynthesis and utilization. It is found to be a DNA-binding homotetramer that is also capable of binding short chain hydroxyalkanoic acids and PHA granules. PhaR may regulate the expression of itself, of the phasins that coat granules, and of enzymes that direct carbon flux into polymers stored in granules.

>PF07879 PHB_acc_N: PHB/PHA accumulation regulator DNA-binding domain; InterPro: IPR012909 This domain is found at the N terminus of the polyhydroxyalkanoate (PHA) synthesis regulators Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query185
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 6e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 38.7 bits (89), Expect = 6e-04
 Identities = 26/190 (13%), Positives = 60/190 (31%), Gaps = 51/190 (26%)

Query: 1   MDEDEMEELDNKKDYDVEYDPL-VAGGDEDITMVDSKSFVYTQGWDSETVVDYRIDEDEF 59
           ++  E  ++         +D L V      I  +     + +  W      D  +  ++ 
Sbjct: 366 LEPAEYRKM---------FDRLSVFPPSAHIPTI-----LLSLIWFDVIKSDVMVVVNKL 411

Query: 60  HKISLMECDFFIRKPPDPDNDVYDFREMY--VTPPDTDIYAIPKVLAPMPQKY--IRCAV 115
           HK SL+E          P         +Y  +     + YA+ + +      Y   +   
Sbjct: 412 HKYSLVEK--------QPKESTISIPSIYLELKVKLENEYALHRSIV---DHYNIPKT-- 458

Query: 116 SDYGCYNVTEPPIDAPRDPMYKSEREIHKVFLVKHYRNRRLGDPEFVLDFEEIYV----I 171
             +   ++  P +D      Y      +   +  H +N    +    L F  +++    +
Sbjct: 459 --FDSDDLIPPYLD-----QY-----FYS-HIGHHLKNIEHPE-RMTL-FRMVFLDFRFL 503

Query: 172 DSKTKSITRA 181
           + K +  + A
Sbjct: 504 EQKIRHDSTA 513


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00