Citrus Sinensis ID: 034472


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90---
MYRNHHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFMSKTSSCFINLI
ccccccccccccccHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHcccHHHHHHHHHHHHHHcHHHHHHHHHcccccccHHHHHHHHHHHc
ccccccccccccccHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHcccccHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHcc
myrnhhffpclhchphsYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYfhsisprpfmsktssCFINLI
MYRNHHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFhsisprpfmsktsscfinli
MYRNhhffpclhchphSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFMSKTSSCFINLI
****HHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFM*****CFI***
******FFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSI**********SCFINL*
MYRNHHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFMSKTSSCFINLI
****HHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFMSKTSSCFI*LI
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooo
oooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYRNHHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSISPRPFMSKTSSCFINLI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
449534373103 PREDICTED: uncharacterized LOC101211262 0.935 0.844 0.764 2e-34
44945063495 PREDICTED: uncharacterized protein LOC10 0.892 0.873 0.788 4e-34
224102463116 predicted protein [Populus trichocarpa] 0.870 0.698 0.814 4e-33
22411094290 predicted protein [Populus trichocarpa] 0.935 0.966 0.770 2e-32
29773907099 unnamed protein product [Vitis vinifera] 0.870 0.818 0.827 1e-31
255568842122 conserved hypothetical protein [Ricinus 0.913 0.696 0.758 1e-29
356567646102 PREDICTED: uncharacterized protein LOC10 0.827 0.754 0.743 2e-28
186478340100 uncharacterized protein [Arabidopsis tha 0.870 0.81 0.716 3e-28
351734538102 uncharacterized protein LOC100306139 [Gl 0.827 0.754 0.743 3e-28
116830729102 unknown [Arabidopsis thaliana] 0.860 0.784 0.712 9e-28
>gi|449534373|ref|XP_004174137.1| PREDICTED: uncharacterized LOC101211262 [Cucumis sativus] Back     alignment and taxonomy information
 Score =  149 bits (377), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 68/89 (76%), Positives = 79/89 (88%), Gaps = 2/89 (2%)

Query: 1  MYR--NHHFFPCLHCHPHSYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVW 58
          MYR  +HHF PC HC PH+YIRMVQHLIERCLLLHMSRD+C+KALA+HA IRPL+TLTVW
Sbjct: 1  MYRPNSHHFLPCFHCQPHAYIRMVQHLIERCLLLHMSRDECVKALADHANIRPLITLTVW 60

Query: 59 RELQKENKDFFRAYFHSISPRPFMSKTSS 87
          +ELQKEN DFFRAYFH+ISP PF++K + 
Sbjct: 61 KELQKENSDFFRAYFHTISPNPFLAKFTG 89




Source: Cucumis sativus

Species: Cucumis sativus

Genus: Cucumis

Family: Cucurbitaceae

Order: Cucurbitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449450634|ref|XP_004143067.1| PREDICTED: uncharacterized protein LOC101211262 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224102463|ref|XP_002312687.1| predicted protein [Populus trichocarpa] gi|222852507|gb|EEE90054.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224110942|ref|XP_002315690.1| predicted protein [Populus trichocarpa] gi|222864730|gb|EEF01861.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297739070|emb|CBI28559.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255568842|ref|XP_002525392.1| conserved hypothetical protein [Ricinus communis] gi|223535355|gb|EEF37030.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356567646|ref|XP_003552028.1| PREDICTED: uncharacterized protein LOC100784875 [Glycine max] Back     alignment and taxonomy information
>gi|186478340|ref|NP_001117262.1| uncharacterized protein [Arabidopsis thaliana] gi|332190497|gb|AEE28618.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|351734538|ref|NP_001238177.1| uncharacterized protein LOC100306139 [Glycine max] gi|255627659|gb|ACU14174.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|116830729|gb|ABK28322.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query93
TAIR|locus:210067695 AT3G55240 "AT3G55240" [Arabido 0.731 0.715 0.558 1.4e-18
TAIR|locus:208699288 AT3G28990 "AT3G28990" [Arabido 0.817 0.863 0.474 3.7e-16
TAIR|locus:218183891 AT5G02580 "AT5G02580" [Arabido 0.602 0.615 0.553 4.2e-15
TAIR|locus:4010713425125 AT1G10657 "AT1G10657" [Arabido 0.677 0.504 0.546 1.2e-12
TAIR|locus:2076805 364 AT3G61700 "AT3G61700" [Arabido 0.591 0.151 0.545 4.6e-11
TAIR|locus:2039069 363 AT2G46420 "AT2G46420" [Arabido 0.591 0.151 0.545 5.9e-11
TAIR|locus:2146728 305 AT5G04090 "AT5G04090" [Arabido 0.591 0.180 0.472 1e-09
TAIR|locus:2076299 324 AT3G10250 "AT3G10250" [Arabido 0.591 0.169 0.472 1.2e-09
TAIR|locus:2077187 590 AT3G45800 "AT3G45800" [Arabido 0.602 0.094 0.385 4.9e-06
TAIR|locus:504956077110 AT2G11623 "AT2G11623" [Arabido 0.559 0.472 0.326 0.00013
TAIR|locus:2100676 AT3G55240 "AT3G55240" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 224 (83.9 bits), Expect = 1.4e-18, P = 1.4e-18
 Identities = 38/68 (55%), Positives = 56/68 (82%)

Query:    17 SYIRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHSI 76
             SYI MVQH+IE+CL+ HMS+++C++AL++HA I P++T TVW+EL+KENK+FF+AY    
Sbjct:     8 SYIHMVQHMIEKCLIFHMSKEECVEALSKHANITPVITSTVWKELEKENKEFFKAYEERQ 67

Query:    77 SPRPFMSK 84
             S +  MS+
Sbjct:    68 SKQEQMSE 75




GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM
GO:0032502 "developmental process" evidence=IMP
TAIR|locus:2086992 AT3G28990 "AT3G28990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2181838 AT5G02580 "AT5G02580" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4010713425 AT1G10657 "AT1G10657" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2076805 AT3G61700 "AT3G61700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2039069 AT2G46420 "AT2G46420" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2146728 AT5G04090 "AT5G04090" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2076299 AT3G10250 "AT3G10250" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077187 AT3G45800 "AT3G45800" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504956077 AT2G11623 "AT2G11623" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00081782
hypothetical protein (116 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query93
pfam0971354 pfam09713, A_thal_3526, Plant protein 1589 of unkn 6e-23
TIGR0158957 TIGR01589, A_thal_3526, uncharacterized plant-spec 9e-19
>gnl|CDD|192339 pfam09713, A_thal_3526, Plant protein 1589 of unknown function (A_thal_3526) Back     alignment and domain information
 Score = 82.8 bits (205), Expect = 6e-23
 Identities = 30/53 (56%), Positives = 43/53 (81%)

Query: 22 VQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFH 74
          VQ+LIERC+ L+MS+++C+ AL +HA I P  T TVW++L+KEN +FFRAY+ 
Sbjct: 1  VQNLIERCIQLYMSKEECVNALQKHANISPKFTSTVWKQLEKENPEFFRAYYT 53


This plant-specific family of proteins is defined by an uncharacterized region 57 residues in length. It is found toward the N terminus of most proteins that contain it. Examples include at least several proteins from Arabidopsis thaliana and Oryza sativa. The function of the proteins are unknown. Length = 54

>gnl|CDD|130650 TIGR01589, A_thal_3526, uncharacterized plant-specific domain TIGR01589 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 93
TIGR0158957 A_thal_3526 uncharacterized plant-specific domain 100.0
PF0971354 A_thal_3526: Plant protein 1589 of unknown functio 100.0
smart0041190 BHL bacterial (prokaryotic) histone like domain. 86.72
PF03918148 CcmH: Cytochrome C biogenesis protein; InterPro: I 81.75
>TIGR01589 A_thal_3526 uncharacterized plant-specific domain TIGR01589 Back     alignment and domain information
Probab=100.00  E-value=2.6e-37  Score=196.80  Aligned_cols=57  Identities=46%  Similarity=0.695  Sum_probs=55.7

Q ss_pred             HHHHHHHHHHHHHhcCCHHHHHHHHHHhcCCChhhHHHHHHHHHHhCHHHHHHHhhc
Q 034472           19 IRMVQHLIERCLLLHMSRDQCIKALAEHAGIRPLVTLTVWRELQKENKDFFRAYFHS   75 (93)
Q Consensus        19 I~~VQ~LIErCLqlyMsk~Evv~~L~~~a~I~P~fT~~VW~~LEkeNpeFFkaY~~r   75 (93)
                      |++|||||||||++|||++|||++|+++|||+|+||++||++||+||||||+|||++
T Consensus         1 i~~Vq~lIE~Cl~~yMsk~E~v~~L~~~a~I~P~~T~~VW~~LekeN~eFF~aY~~~   57 (57)
T TIGR01589         1 IDLVQNRIETCIQGYMSKEETVSFLFENAGISPKFTRFVWYLLEKENADFFRCYKTH   57 (57)
T ss_pred             CHHHHHHHHHHHHHHCCHHHHHHHHHHHcCCCchhHHHHHHHHHHHHHHHHHHHhcC
Confidence            689999999999999999999999999999999999999999999999999999975



This model represents an uncharacterized plant-specific domain 57 residues in length. It is found toward the N-terminus of most proteins that contain it. Examples include at least 10 proteins from Arabidopsis thaliana and at least one from Oryza sativa.

>PF09713 A_thal_3526: Plant protein 1589 of unknown function (A_thal_3526); InterPro: IPR006476 This plant-specific family of proteins are defined by an uncharacterised region 57 residues in length Back     alignment and domain information
>smart00411 BHL bacterial (prokaryotic) histone like domain Back     alignment and domain information
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query93
1b8z_A90 Protein (histonelike protein HU); thermostable DNA 82.46
2es9_A115 Putative cytoplasmic protein; structural genomics, 81.36
1mul_A90 NS2, HU-2, DNA binding protein HU-alpha; histone-l 80.37
2di0_A71 Activating signal cointegrator 1 complex subunit 2 80.19
>1b8z_A Protein (histonelike protein HU); thermostable DNA binding protein; 1.60A {Thermotoga maritima} SCOP: a.55.1.1 PDB: 1riy_A Back     alignment and structure
Probab=82.46  E-value=2.3  Score=26.46  Aligned_cols=30  Identities=17%  Similarity=0.277  Sum_probs=25.6

Q ss_pred             CCHHHHHHHHHHhcCCChhhHHHHHHHHHH
Q 034472           34 MSRDQCIKALAEHAGIRPLVTLTVWRELQK   63 (93)
Q Consensus        34 Msk~Evv~~L~~~a~I~P~fT~~VW~~LEk   63 (93)
                      ||++|+++.++++.++...-...|++.+-+
T Consensus         1 mtk~eli~~ia~~~~ls~~~~~~~l~~~~~   30 (90)
T 1b8z_A            1 MNKKELIDRVAKKAGAKKKDVKLILDTILE   30 (90)
T ss_dssp             CCHHHHHHHHHHHHTCCHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHhCcCHHHHHHHHHHHHH
Confidence            899999999999999999888777766554



>2es9_A Putative cytoplasmic protein; structural genomics, PSI, protein structure initiative, NORT structural genomics consortium, NESG; 2.00A {Salmonella typhimurium} SCOP: a.247.1.1 PDB: 2jn8_A Back     alignment and structure
>1mul_A NS2, HU-2, DNA binding protein HU-alpha; histone-like; HET: DNA; 2.30A {Escherichia coli} SCOP: a.55.1.1 PDB: 2o97_A Back     alignment and structure
>2di0_A Activating signal cointegrator 1 complex subunit 2; ASCC2, CUE domain, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.5.2.4 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query93
d1b8za_90 HU protein {Thermotoga maritima [TaxId: 2336]} 87.43
d1mula_90 HU protein {Escherichia coli [TaxId: 562]} 86.15
d1exea_99 Transcription factor 1, TF1 {Bacteriophage SPO1 [T 85.88
d1huua_90 HU protein {Bacillus stearothermophilus [TaxId: 14 83.79
d2es9a197 Hypothetical protein YoaC {Salmonella typhimurium 82.43
>d1b8za_ a.55.1.1 (A:) HU protein {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure
class: All alpha proteins
fold: IHF-like DNA-binding proteins
superfamily: IHF-like DNA-binding proteins
family: Prokaryotic DNA-bending protein
domain: HU protein
species: Thermotoga maritima [TaxId: 2336]
Probab=87.43  E-value=0.42  Score=28.81  Aligned_cols=29  Identities=17%  Similarity=0.273  Sum_probs=25.6

Q ss_pred             CCHHHHHHHHHHhcCCChhhHHHHHHHHH
Q 034472           34 MSRDQCIKALAEHAGIRPLVTLTVWRELQ   62 (93)
Q Consensus        34 Msk~Evv~~L~~~a~I~P~fT~~VW~~LE   62 (93)
                      |||.|+++.|+++.|+...-...+++.+.
T Consensus         1 MnK~eli~~ia~~~~~s~~~~~~~~~~~~   29 (90)
T d1b8za_           1 MNKKELIDRVAKKAGAKKKDVKLILDTIL   29 (90)
T ss_dssp             CCHHHHHHHHHHHHTCCHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHhhCCCHHHHHHHHHHHH
Confidence            99999999999999999988888876554



>d1mula_ a.55.1.1 (A:) HU protein {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1exea_ a.55.1.1 (A:) Transcription factor 1, TF1 {Bacteriophage SPO1 [TaxId: 10685]} Back     information, alignment and structure
>d1huua_ a.55.1.1 (A:) HU protein {Bacillus stearothermophilus [TaxId: 1422]} Back     information, alignment and structure
>d2es9a1 a.247.1.1 (A:11-107) Hypothetical protein YoaC {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure