Citrus Sinensis ID: 041202
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 166 | ||||||
| 194320315 | 155 | oleosin 2 [Jatropha curcas] gi|399105971 | 0.915 | 0.980 | 0.616 | 5e-45 | |
| 148283921 | 155 | oleosin H-isoform [Ficus pumila var. awk | 0.897 | 0.961 | 0.616 | 3e-44 | |
| 9963897 | 175 | oleosin [Perilla frutescens] | 0.939 | 0.891 | 0.558 | 2e-42 | |
| 224140875 | 149 | predicted protein [Populus trichocarpa] | 0.873 | 0.973 | 0.590 | 2e-42 | |
| 259453 | 180 | lipid body membrane protein [Daucus caro | 0.963 | 0.888 | 0.575 | 3e-42 | |
| 449446694 | 158 | PREDICTED: oleosin 5-like [Cucumis sativ | 0.933 | 0.981 | 0.593 | 2e-40 | |
| 10834827 | 166 | oleosin [Sesamum indicum] | 0.957 | 0.957 | 0.543 | 2e-40 | |
| 255560890 | 153 | Oleosin1 [Ricinus communis] gi|38259656| | 0.897 | 0.973 | 0.558 | 2e-40 | |
| 266690 | 168 | RecName: Full=Oleosin 18.2 kDa gi|167363 | 0.987 | 0.976 | 0.629 | 3e-40 | |
| 21311553 | 160 | 16.9 kDa oleosin [Theobroma cacao] | 0.963 | 1.0 | 0.613 | 3e-40 |
| >gi|194320315|gb|ABW90149.2| oleosin 2 [Jatropha curcas] gi|399105971|gb|AFP19884.1| 16.6 kDa oleosin [Jatropha curcas] | Back alignment and taxonomy information |
|---|
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/159 (61%), Positives = 120/159 (75%), Gaps = 7/159 (4%)
Query: 1 MAERDRPQPHQLQVHPQ-QHGKSLMGQRGAAGGGPSASKVLAVLAMLPLGGTFLALAGVA 59
MAER QPHQ+QVHPQ ++ + GQ+ GPSA KVLAV+ +LP+GG LALAG+
Sbjct: 1 MAERS--QPHQVQVHPQHRYEAAFKGQQK----GPSAQKVLAVITLLPVGGGLLALAGIT 54
Query: 60 LTGTIIGLCVTTPLFIIFSPVIVPAAIVLALAVTGFLTSGAFGLTALSSLSWVLSILRQK 119
L GT+IGL +TTPLF+IFSPV+VPAA+V+ L+V FL SGA GLT LSSLSWVL L++
Sbjct: 55 LVGTLIGLAITTPLFVIFSPVLVPAALVIGLSVMAFLASGAMGLTGLSSLSWVLKYLQEV 114
Query: 120 TGSVPEMADQAKKRVAGMADYVGQKTKEVGQDIQSKVHE 158
T +PE D AKKR+ MA +VGQKTKEVGQ+IQ K HE
Sbjct: 115 TRRMPEQLDIAKKRMQDMAGFVGQKTKEVGQEIQRKAHE 153
|
Source: Jatropha curcas Species: Jatropha curcas Genus: Jatropha Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|148283921|gb|ABQ57396.1| oleosin H-isoform [Ficus pumila var. awkeotsang] | Back alignment and taxonomy information |
|---|
| >gi|9963897|gb|AAG09751.1|AF237625_1 oleosin [Perilla frutescens] | Back alignment and taxonomy information |
|---|
| >gi|224140875|ref|XP_002323804.1| predicted protein [Populus trichocarpa] gi|222866806|gb|EEF03937.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|259453|gb|AAB24078.1| lipid body membrane protein [Daucus carota] | Back alignment and taxonomy information |
|---|
| >gi|449446694|ref|XP_004141106.1| PREDICTED: oleosin 5-like [Cucumis sativus] gi|449524042|ref|XP_004169032.1| PREDICTED: oleosin 5-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|10834827|gb|AAG23840.1|AF302807_1 oleosin [Sesamum indicum] | Back alignment and taxonomy information |
|---|
| >gi|255560890|ref|XP_002521458.1| Oleosin1 [Ricinus communis] gi|38259656|gb|AAR15171.1| oleosin [Ricinus communis] gi|223539357|gb|EEF40948.1| Oleosin1 [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|266690|sp|P29527.1|OLEO6_GOSHI RecName: Full=Oleosin 18.2 kDa gi|167363|gb|AAA18524.1| 18.2 kDa oleosin [Gossypium hirsutum] gi|167365|gb|AAA18525.1| 18.2 kDa oleosin [Gossypium hirsutum] | Back alignment and taxonomy information |
|---|
| >gi|21311553|gb|AAM46777.1|AF466102_1 16.9 kDa oleosin [Theobroma cacao] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 166 | ||||||
| TAIR|locus:2084223 | 183 | AT3G01570 "AT3G01570" [Arabido | 0.759 | 0.688 | 0.492 | 4.4e-29 | |
| TAIR|locus:2089189 | 191 | OLEO4 "AT3G27660" [Arabidopsis | 0.765 | 0.664 | 0.496 | 7.1e-29 | |
| TAIR|locus:2170558 | 199 | OLEO2 "AT5G40420" [Arabidopsis | 0.807 | 0.673 | 0.447 | 1.9e-26 | |
| TAIR|locus:2176212 | 141 | OLEO3 "AT5G51210" [Arabidopsis | 0.590 | 0.695 | 0.356 | 5.6e-13 | |
| TAIR|locus:2043470 | 149 | AT2G25890 "AT2G25890" [Arabido | 0.674 | 0.751 | 0.321 | 2.4e-12 | |
| TAIR|locus:2117328 | 173 | OLEO1 "AT4G25140" [Arabidopsis | 0.783 | 0.751 | 0.313 | 2.4e-12 | |
| TAIR|locus:2160319 | 230 | AT5G07600 [Arabidopsis thalian | 0.391 | 0.282 | 0.434 | 1.5e-08 | |
| TAIR|locus:2183414 | 106 | GRP19 "glycine-rich protein 19 | 0.427 | 0.669 | 0.346 | 1.6e-08 | |
| TAIR|locus:2183384 | 543 | GRP17 "AT5G07530" [Arabidopsis | 0.427 | 0.130 | 0.351 | 1.6e-07 | |
| TAIR|locus:2183354 | 193 | GRP14 "glycine-rich protein 14 | 0.487 | 0.419 | 0.388 | 2.3e-07 |
| TAIR|locus:2084223 AT3G01570 "AT3G01570" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
Identities = 62/126 (49%), Positives = 86/126 (68%)
Query: 33 GPSASKVLAVLAMLPLGGTFLALAGVALTGTIIGLCVTTPLFIIFSXXXXXXXXXXXXXX 92
GPS+++VLAV +P+GGT L +AG+ L G++IGL + PLF+IFS
Sbjct: 31 GPSSTQVLAVFVGVPIGGTLLTIAGLTLAGSVIGLMLAFPLFLIFSPVIVPAAFVIGLAM 90
Query: 93 TGFLTSGAFGLTALSSLSWVLSILRQKTGSVPEMADQAKKRVAGMADYVGQKTKEVGQDI 152
TGFL SGA GLT LSS+SWVL+ +R+ +PE ++AK R+A MA+YVGQ+TK+ GQ I
Sbjct: 91 TGFLASGAIGLTGLSSMSWVLNYIRRAGQHIPEELEEAKHRLADMAEYVGQRTKDAGQTI 150
Query: 153 QSKVHE 158
+ K H+
Sbjct: 151 EDKAHD 156
|
|
| TAIR|locus:2089189 OLEO4 "AT3G27660" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2170558 OLEO2 "AT5G40420" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2176212 OLEO3 "AT5G51210" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2043470 AT2G25890 "AT2G25890" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2117328 OLEO1 "AT4G25140" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2160319 AT5G07600 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2183414 GRP19 "glycine-rich protein 19" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2183384 GRP17 "AT5G07530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2183354 GRP14 "glycine-rich protein 14" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 166 | |||
| pfam01277 | 118 | pfam01277, Oleosin, Oleosin | 6e-47 | |
| PRK06814 | 1140 | PRK06814, PRK06814, acylglycerophosphoethanolamine | 0.003 |
| >gnl|CDD|110290 pfam01277, Oleosin, Oleosin | Back alignment and domain information |
|---|
Score = 148 bits (375), Expect = 6e-47
Identities = 75/118 (63%), Positives = 90/118 (76%)
Query: 35 SASKVLAVLAMLPLGGTFLALAGVALTGTIIGLCVTTPLFIIFSPVIVPAAIVLALAVTG 94
S S+VLAV+ LP GG+ L LAG+ L GT+IGL V TPLF+IFSPV+VPAAI + LAVTG
Sbjct: 1 STSQVLAVVTALPAGGSLLVLAGLTLAGTVIGLAVATPLFVIFSPVLVPAAIAIGLAVTG 60
Query: 95 FLTSGAFGLTALSSLSWVLSILRQKTGSVPEMADQAKKRVAGMADYVGQKTKEVGQDI 152
FL SGAFGLT LSSLSW+L +R K VP+ D AK+R+ A+Y GQKTKE+GQ I
Sbjct: 61 FLASGAFGLTGLSSLSWLLKYVRGKHPPVPDQLDYAKRRLQEAAEYAGQKTKEMGQRI 118
|
Length = 118 |
| >gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase; Provisional | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 166 | |||
| PF01277 | 118 | Oleosin: Oleosin; InterPro: IPR000136 Oleosins [] | 100.0 | |
| PF01277 | 118 | Oleosin: Oleosin; InterPro: IPR000136 Oleosins [] | 96.76 | |
| PF08006 | 181 | DUF1700: Protein of unknown function (DUF1700); In | 93.35 | |
| PF04156 | 191 | IncA: IncA protein; InterPro: IPR007285 Chlamydia | 92.24 | |
| PF11990 | 121 | DUF3487: Protein of unknown function (DUF3487); In | 91.53 | |
| PF02987 | 44 | LEA_4: Late embryogenesis abundant protein; InterP | 88.75 | |
| PF07332 | 121 | DUF1469: Protein of unknown function (DUF1469); In | 87.84 | |
| PRK09459 | 76 | pspG phage shock protein G; Reviewed | 86.48 | |
| PF05957 | 94 | DUF883: Bacterial protein of unknown function (DUF | 85.63 | |
| TIGR02797 | 211 | exbB tonB-system energizer ExbB. This model descri | 83.84 | |
| PRK10801 | 227 | colicin uptake protein TolQ; Provisional | 83.67 | |
| PRK10414 | 244 | biopolymer transport protein ExbB; Provisional | 83.23 | |
| COG0811 | 216 | TolQ Biopolymer transport proteins [Intracellular | 82.11 | |
| PF07178 | 95 | TraL: TraL protein; InterPro: IPR009838 This entry | 81.19 | |
| PF01618 | 139 | MotA_ExbB: MotA/TolQ/ExbB proton channel family Mo | 80.84 | |
| PF05957 | 94 | DUF883: Bacterial protein of unknown function (DUF | 80.08 |
| >PF01277 Oleosin: Oleosin; InterPro: IPR000136 Oleosins [] are the proteinaceous components of plants' lipid storage bodies called oil bodies | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.5e-52 Score=320.25 Aligned_cols=118 Identities=64% Similarity=1.020 Sum_probs=116.5
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhchhhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHH
Q 041202 35 SASKVLAVLAMLPLGGTFLALAGVALTGTIIGLCVTTPLFIIFSPVIVPAAIVLALAVTGFLTSGAFGLTALSSLSWVLS 114 (166)
Q Consensus 35 sssqvl~~~tll~~gg~LL~LaGlTL~gtvigL~vatPL~iifSPVLVPaai~~~l~~~gfl~sg~~g~~~ls~lsW~~~ 114 (166)
|++|++++++++++|++||+|+|+||+||++||+++|||||||||||||++|+++|+++||++||+||+++++++||+||
T Consensus 1 s~~qvl~~~~~~~~gg~LL~LaGlTL~gtvigL~vatPLfvifSPVlVPaai~~~l~~~Gfl~sg~~g~~~ls~lsW~~~ 80 (118)
T PF01277_consen 1 STSQVLAVVTLLPAGGTLLVLAGLTLAGTVIGLAVATPLFVIFSPVLVPAAIAIGLAVAGFLTSGAFGLTALSSLSWMYN 80 (118)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhhHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHhhCCCCCchhHHHHHHHHhhhhHHHhhhhhHhhHHH
Q 041202 115 ILRQKTGSVPEMADQAKKRVAGMADYVGQKTKEVGQDI 152 (166)
Q Consensus 115 y~rg~~p~g~dqld~Ak~Ri~d~A~~vg~kake~Gq~i 152 (166)
|+||+||+++||+|+||+|++|+|+||+||+||+||+|
T Consensus 81 y~rg~~~~~~~q~d~Ak~ri~d~a~~v~~kake~gq~~ 118 (118)
T PF01277_consen 81 YFRGRHPPGPDQLDYAKRRIADTASYVGQKAKEVGQKI 118 (118)
T ss_pred HhccCCCCCCccHHHHHHHHHHHHHHHHHHHHHhCccC
Confidence 99999999999999999999999999999999999975
|
Oil bodies are small droplets (0.2 to 1.5 mu-m in diameter) containing mostly triacylglycerol that are surrounded by a phospholipid/ oleosin annulus. Oleosins may have a structural role in stabilising the lipid body during dessication of the seed, by preventing coalescence of the oil. They may also provide recognition signals for specific lipase anchorage in lipolysis during seedling growth. Oleosins are found in the monolayer lipid/ water interface of oil bodies and probably interact with both the lipid and phospholipid moieties. Oleosins are proteins of 16 Kd to 24 Kd and are composed of three domains: an N-terminal hydrophilic region of variable length (from 30 to 60 residues); a central hydrophobic domain of about 70 residues and a C-terminal amphipathic region of variable length (from 60 to 100 residues). The central hydrophobic domain is proposed to be made up of beta-strand structure and to interact with the lipids []. It is the only domain whose sequence is conserved.; GO: 0012511 monolayer-surrounded lipid storage body, 0016021 integral to membrane |
| >PF01277 Oleosin: Oleosin; InterPro: IPR000136 Oleosins [] are the proteinaceous components of plants' lipid storage bodies called oil bodies | Back alignment and domain information |
|---|
| >PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT) | Back alignment and domain information |
|---|
| >PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion | Back alignment and domain information |
|---|
| >PF11990 DUF3487: Protein of unknown function (DUF3487); InterPro: IPR021877 This family of proteins is functionally uncharacterised | Back alignment and domain information |
|---|
| >PF02987 LEA_4: Late embryogenesis abundant protein; InterPro: IPR004238 Different types of late embryogenesis abundant (LEA) proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress | Back alignment and domain information |
|---|
| >PF07332 DUF1469: Protein of unknown function (DUF1469); InterPro: IPR009937 This entry represents proteins found in hypothetical bacterial proteins where is is annotated as ycf49 or ycf49-like | Back alignment and domain information |
|---|
| >PRK09459 pspG phage shock protein G; Reviewed | Back alignment and domain information |
|---|
| >PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD | Back alignment and domain information |
|---|
| >TIGR02797 exbB tonB-system energizer ExbB | Back alignment and domain information |
|---|
| >PRK10801 colicin uptake protein TolQ; Provisional | Back alignment and domain information |
|---|
| >PRK10414 biopolymer transport protein ExbB; Provisional | Back alignment and domain information |
|---|
| >COG0811 TolQ Biopolymer transport proteins [Intracellular trafficking and secretion] | Back alignment and domain information |
|---|
| >PF07178 TraL: TraL protein; InterPro: IPR009838 This entry represents bacterial TraL proteins | Back alignment and domain information |
|---|
| >PF01618 MotA_ExbB: MotA/TolQ/ExbB proton channel family MotA family only; InterPro: IPR002898 This family groups together integral membrane proteins that appear to be involved in translocation of proteins across a membrane | Back alignment and domain information |
|---|
| >PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00