Citrus Sinensis ID: 020017
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 332 | ||||||
| 255551066 | 379 | Dehydration-responsive protein RD22 prec | 0.984 | 0.862 | 0.679 | 1e-133 | |
| 118486691 | 421 | unknown [Populus trichocarpa] | 0.981 | 0.774 | 0.590 | 1e-129 | |
| 224089066 | 401 | predicted protein [Populus trichocarpa] | 0.981 | 0.812 | 0.611 | 1e-126 | |
| 224141799 | 331 | predicted protein [Populus trichocarpa] | 0.954 | 0.957 | 0.707 | 1e-121 | |
| 225430045 | 337 | PREDICTED: dehydration-responsive protei | 0.975 | 0.961 | 0.686 | 1e-121 | |
| 49615303 | 376 | dehydration-induced protein RD22-like pr | 0.969 | 0.856 | 0.637 | 1e-120 | |
| 225430043 | 364 | PREDICTED: dehydration-responsive protei | 0.981 | 0.895 | 0.652 | 1e-117 | |
| 18447761 | 335 | dehydration-induced protein RD22-like pr | 0.969 | 0.961 | 0.694 | 1e-117 | |
| 54645911 | 364 | RD22-like protein [Vitis vinifera] | 0.981 | 0.895 | 0.650 | 1e-116 | |
| 255639828 | 367 | unknown [Glycine max] | 0.978 | 0.885 | 0.613 | 1e-116 |
| >gi|255551066|ref|XP_002516581.1| Dehydration-responsive protein RD22 precursor, putative [Ricinus communis] gi|223544401|gb|EEF45922.1| Dehydration-responsive protein RD22 precursor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 480 bits (1235), Expect = e-133, Method: Compositional matrix adjust.
Identities = 254/374 (67%), Positives = 283/374 (75%), Gaps = 47/374 (12%)
Query: 1 MEFHLLPILAFLSLALVASHADISPELYWKTVLPNSPMPKAVKDLLQPDVLEDKSTSVNV 60
MEF L IL FL++AL SHA + PE+YWK+VLPN+ MPKAV DLLQ ++DKSTSV+V
Sbjct: 6 MEFSLPCILVFLTIALATSHAALPPEVYWKSVLPNTQMPKAVTDLLQSGFMDDKSTSVSV 65
Query: 61 GKGGVNVDAGKGKPGG-------------------GTHVNVGGKGVGVNTGKPDKRTSVG 101
GKG VNV+AGKGKPGG GT VNVG KGVGVNTGKP K T+VG
Sbjct: 66 GKGSVNVNAGKGKPGGTSVNVGKGGVNVNTRKGKPGTTVNVGRKGVGVNTGKPGKGTNVG 125
Query: 102 VGKGGVSVSTGHKGKPVYVG----------------------------DLHPGMKMNLHF 133
VGKGGVSV+TGHKGKPV+V D++P MNLHF
Sbjct: 126 VGKGGVSVNTGHKGKPVHVNVAPFIYNYAATETQLHHDPNVALFFLEKDMYPRKTMNLHF 185
Query: 134 TQTSNGATFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQNTIKECEDPGIKGEQ 193
T+ N A FL RQ AKS PFSS +LPEI+NQFSVKPGS+EAE+M+NTIKECE PGI+GE+
Sbjct: 186 TENPNTAMFLPRQVAKSIPFSSKELPEIYNQFSVKPGSMEAELMKNTIKECEAPGIEGEE 245
Query: 194 KYCATSLESMIDFSTSKLGKSVQAISTEVKKGTKMQTYTIAAGVKQMAADKSVVCHKQNY 253
K CATSLESMIDFSTS LGK+VQAISTEV+ T+MQ YTI AG K+MA DKSVVCHKQNY
Sbjct: 246 KLCATSLESMIDFSTSVLGKNVQAISTEVENQTQMQKYTITAGAKEMAGDKSVVCHKQNY 305
Query: 254 PYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTDTSAWNPKHLAFQVLKVKPGTVPI 313
YAVFYCHATQTTRAYMV LEGADGTKAKA AVCHTDTS WN KHLAFQVLKVKPGTVP+
Sbjct: 306 AYAVFYCHATQTTRAYMVSLEGADGTKAKAVAVCHTDTSTWNTKHLAFQVLKVKPGTVPV 365
Query: 314 CHFLPEDHIVWVPN 327
CHFLP+DHIVWVPN
Sbjct: 366 CHFLPQDHIVWVPN 379
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|118486691|gb|ABK95182.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224089066|ref|XP_002308621.1| predicted protein [Populus trichocarpa] gi|222854597|gb|EEE92144.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224141799|ref|XP_002324250.1| predicted protein [Populus trichocarpa] gi|222865684|gb|EEF02815.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|225430045|ref|XP_002284286.1| PREDICTED: dehydration-responsive protein RD22 [Vitis vinifera] gi|227464475|gb|ACP40551.1| rd22-c [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|49615303|gb|AAT66913.1| dehydration-induced protein RD22-like protein 2 [Gossypium arboreum] | Back alignment and taxonomy information |
|---|
| >gi|225430043|ref|XP_002284269.1| PREDICTED: dehydration-responsive protein RD22 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|18447761|gb|AAL67991.1| dehydration-induced protein RD22-like protein [Gossypium hirsutum] | Back alignment and taxonomy information |
|---|
| >gi|54645911|gb|AAV36561.1| RD22-like protein [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255639828|gb|ACU20207.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 332 | ||||||
| TAIR|locus:2179424 | 392 | RD22 "RESPONSIVE TO DESSICATIO | 0.611 | 0.517 | 0.617 | 2.5e-79 | |
| UNIPROTKB|Q70KG3 | 362 | RAFTIN1B "Protein RAFTIN 1B" [ | 0.614 | 0.563 | 0.423 | 4.2e-49 | |
| UNIPROTKB|Q7F8U7 | 412 | BURP13 "BURP domain-containing | 0.614 | 0.495 | 0.415 | 6e-48 | |
| UNIPROTKB|Q70KG5 | 389 | RAFTIN1A "Protein RAFTIN 1A" [ | 0.608 | 0.519 | 0.412 | 3.3e-47 | |
| TAIR|locus:2010237 | 280 | USPL1 "AT1G49320" [Arabidopsis | 0.620 | 0.735 | 0.357 | 3.6e-34 | |
| TAIR|locus:2195593 | 624 | PG1 "AT1G60390" [Arabidopsis t | 0.554 | 0.294 | 0.374 | 7.2e-30 | |
| TAIR|locus:2034823 | 622 | JP630 "AT1G23760" [Arabidopsis | 0.551 | 0.294 | 0.377 | 8.8e-29 | |
| TAIR|locus:2016194 | 626 | PG2 "AT1G70370" [Arabidopsis t | 0.554 | 0.293 | 0.379 | 9e-29 |
| TAIR|locus:2179424 RD22 "RESPONSIVE TO DESSICATION 22" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 656 (236.0 bits), Expect = 2.5e-79, Sum P(2) = 2.5e-79
Identities = 129/209 (61%), Positives = 154/209 (73%)
Query: 122 DLHPGMKMNLHFTQTSN--GAT-FLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQ 178
DL G +MN+ F G T FL R A++ PF S+K E +FSV+ GS EAE+M+
Sbjct: 181 DLVRGKEMNVRFNAEDGYGGKTAFLPRGEAETVPFGSEKFSETLKRFSVEAGSEEAEMMK 240
Query: 179 NTIKECEDPGIKGEQKYCATSLESMIDFSTSKLGK-SVQAISTEV-KKGTKMQTYTIAA- 235
TI+ECE + GE+KYCATSLESM+DFS SKLGK V+A+STEV KK MQ Y IAA
Sbjct: 241 KTIEECEARKVSGEEKYCATSLESMVDFSVSKLGKYHVRAVSTEVAKKNAPMQKYKIAAA 300
Query: 236 GVKQMAADKSVVCHKQNYPYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTDTSAWN 295
GVK+++ DKSVVCHKQ YP+AVFYCH T Y VPLEG +G +AKA AVCH +TSAWN
Sbjct: 301 GVKKLSDDKSVVCHKQKYPFAVFYCHKAMMTTVYAVPLEGENGMRAKAVAVCHKNTSAWN 360
Query: 296 PKHLAFQVLKVKPGTVPICHFLPEDHIVW 324
P HLAF+VLKVKPGTVP+CHFLPE H+VW
Sbjct: 361 PNHLAFKVLKVKPGTVPVCHFLPETHVVW 389
|
|
| UNIPROTKB|Q70KG3 RAFTIN1B "Protein RAFTIN 1B" [Triticum aestivum (taxid:4565)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q7F8U7 BURP13 "BURP domain-containing protein 13" [Oryza sativa Japonica Group (taxid:39947)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q70KG5 RAFTIN1A "Protein RAFTIN 1A" [Triticum aestivum (taxid:4565)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2010237 USPL1 "AT1G49320" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2195593 PG1 "AT1G60390" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2034823 JP630 "AT1G23760" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2016194 PG2 "AT1G70370" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| eugene3.00061755 | hypothetical protein (401 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 332 | |||
| pfam03181 | 216 | pfam03181, BURP, BURP domain | 1e-120 | |
| smart01045 | 222 | smart01045, BURP, The BURP domain is found at the | 4e-79 |
| >gnl|CDD|202568 pfam03181, BURP, BURP domain | Back alignment and domain information |
|---|
Score = 344 bits (885), Expect = e-120
Identities = 140/210 (66%), Positives = 163/210 (77%), Gaps = 5/210 (2%)
Query: 122 DLHPGMKMNLHFTQTSNGA--TFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQN 179
DL PG KM LHF + ++GA FL RQ A S PFSS+KLPEI FSV PGSVEA+IM++
Sbjct: 8 DLKPGKKMPLHFPKITDGAKRPFLPRQIADSIPFSSEKLPEILAMFSVPPGSVEAKIMKS 67
Query: 180 TIKECEDPGIKGEQKYCATSLESMIDFSTSKLGKS-VQAISTEVKKGTKMQTYTIAAGVK 238
T++ECE P IKGE+K+CATSLESM+DF+TSKLG ++A+STEV+ G +Q YT+ GVK
Sbjct: 68 TLRECEAPAIKGEEKFCATSLESMVDFATSKLGTRDIRAVSTEVEGGGPLQKYTVE-GVK 126
Query: 239 QMAADKSVV-CHKQNYPYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTDTSAWNPK 297
+A VV CH YPYAVFYCH TRAY V L GADGTK KA AVCHTDTSAWNPK
Sbjct: 127 PVAGGGKVVACHPMLYPYAVFYCHTVPKTRAYEVDLVGADGTKVKAVAVCHTDTSAWNPK 186
Query: 298 HLAFQVLKVKPGTVPICHFLPEDHIVWVPN 327
H+AFQVL VKPGTVP+CHFLPE H+VWVPN
Sbjct: 187 HVAFQVLGVKPGTVPVCHFLPEGHVVWVPN 216
|
The BURP domain is found at the C-terminus of several different plant proteins. It was named after the proteins in which it was first identified: the BNM2 clone-derived protein from Brassica napus; USPs and USP-like proteins; RD22 from Arabidopsis thaliana; and PG1beta from Lycopersicon esculentum. This domain is around 230 amino acid residues long. It possesses the following conserved features: two phenylalanine residues at its N-terminus; two cysteine residues; and four repeated cysteine-histidine motifs, arranged as: CH-X(10)-CH-X(25-27)-CH-X(25-26)-CH, where X can be any amino acid. The function of this domain is unknown. Length = 216 |
| >gnl|CDD|214992 smart01045, BURP, The BURP domain is found at the C-terminus of several different plant proteins | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 332 | |||
| PF03181 | 216 | BURP: BURP domain; InterPro: IPR004873 The BURP do | 100.0 | |
| PF10950 | 108 | DUF2775: Protein of unknown function (DUF2775); In | 96.06 |
| >PF03181 BURP: BURP domain; InterPro: IPR004873 The BURP domain is a ~230-residue module, which has been named for the four members of the group initially identified, BNM2, USP, RD22, and PG1beta | Back alignment and domain information |
|---|
Probab=100.00 E-value=4.9e-88 Score=621.46 Aligned_cols=209 Identities=66% Similarity=1.124 Sum_probs=201.3
Q ss_pred eee-cCCCCCceeeecccCCCC--CCCccchhhhccCCCCCCChhHHhhhcCCCCCCHHHHHHHHHHHHccCCCCCCccc
Q 020017 118 VYV-GDLHPGMKMNLHFTQTSN--GATFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQNTIKECEDPGIKGEQK 194 (332)
Q Consensus 118 ~~~-~dL~pG~km~l~f~~~~~--~~~FLPR~vAdsIPfSs~kl~~iL~~Fsi~~~S~~A~~m~~Tl~~Ce~p~~~GE~K 194 (332)
||+ +||++|++|+|+|++..+ .++||||++||+||||+++|++||++|+|+++|+||++|++||++||.++++||+|
T Consensus 3 fF~e~dL~~G~~m~l~f~~~~~~~~~~fLpr~~A~siPfss~~l~~iL~~Fsi~~~S~~A~~m~~Tl~~Ce~~~~~GE~k 82 (216)
T PF03181_consen 3 FFLEKDLHPGKKMPLYFPKSDNSAKRPFLPRQVADSIPFSSSKLPEILQMFSIPPGSPMAKAMKNTLEECESPPIKGETK 82 (216)
T ss_pred ccCHHHCCCCceeeecCCCCCCCcccccCCHHHhccCCcCHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhhcCCCCCcCc
Confidence 344 999999999999998764 68999999999999999999999999999999999999999999999999999999
Q ss_pred ccccchhHHHHHhhhhcCC-ccEEEeecccCCCcceeeEEEeeeeeccC-CcceeeccccCCceeeeccccCceeEEEEe
Q 020017 195 YCATSLESMIDFSTSKLGK-SVQAISTEVKKGTKMQTYTIAAGVKQMAA-DKSVVCHKQNYPYAVFYCHATQTTRAYMVP 272 (332)
Q Consensus 195 ~CaTSLESMvdFa~S~LG~-~v~a~sT~~~~~~~~q~Ytv~~~Vk~i~g-~k~V~CH~~~yPyaVfyCH~~~~T~vY~V~ 272 (332)
+||||||||+||++|+||+ |++++||++..+.+.|+|+|. +|++++| +++|+||+|+|||+|||||.++.||+|+|+
T Consensus 83 ~CaTSLESMvdF~~s~LG~~~v~a~st~~~~~~~~~~y~V~-~v~~i~~~~~~V~CH~~~yPYaVyyCH~~~~t~~y~V~ 161 (216)
T PF03181_consen 83 YCATSLESMVDFAVSKLGTRNVRALSTEVPKSTPLQNYTVE-GVKKIGGGDKSVVCHKMPYPYAVYYCHSIPPTRVYMVP 161 (216)
T ss_pred cCcCCHHHHHHHHHHhcCCCccEEEeccccCCCCCccEEEE-eeeeecCCCceEEEcccCCceeEEEeeecCceeEEEEE
Confidence 9999999999999999997 899999999777889999996 8999987 899999999999999999999999999999
Q ss_pred eecCCCcceeeEeeeeccCCCCCCCchhhhhhccCCCCcceeeeecCCcEEEeeC
Q 020017 273 LEGADGTKAKAAAVCHTDTSAWNPKHLAFQVLKVKPGTVPICHFLPEDHIVWVPN 327 (332)
Q Consensus 273 L~g~dG~~~~AvAVCH~DTS~Wnp~H~aF~~L~vkPG~~pVCHfl~~~~ivWvp~ 327 (332)
|+|+||++++||||||+|||.|||+|+||++||+|||++|||||+++|+|+||||
T Consensus 162 l~g~dg~~~~avavCH~DTS~W~p~h~aF~~L~vkPG~~~VCHf~~~~~ivWv~~ 216 (216)
T PF03181_consen 162 LVGEDGTKVEAVAVCHLDTSGWNPDHPAFQVLGVKPGTVPVCHFLPNDHIVWVPN 216 (216)
T ss_pred EeecCCceEEEEEEEecCCCCCCcchHHHHHhCCCCCCcceEEEeeCCeEEEccC
Confidence 9999999999999999999999999999999999999999999999999999997
|
It is found in the C-terminal part of a number of plant cell wall proteins, which are defined not only by the BURP domain, but also by the overall similarity in their modular construction. The BURP domain proteins consists of either three or four modules: (i) an N-terminal hydrophobic domain - a presumptive transit peptide, joined to (ii) a short conserved segment or other short segment, (iii) an optional segment consisting of repeated units which is unique to each member, and (iv) the C-terminal BURP domain. Although the BURP domain proteins share primary structural features, their expression patterns and the conditions under which they are expressed differ. The presence of the conserved BURP domain in diverse plant proteins suggests an important and fundamental functional role for this domain []. It is possible that the BURP domain represents a general motif for localization of proteins within the cell wall matrix. The other structural domains associated with the BURP domain may specify other target sites for intermolecular interactions []. Some proteins known to contain a BURP domain are listed below [, , ]: Brassica protein BNM2, which is expressed during the induction of microspore embryogenesis. Field bean USPs, abundant non-storage seed proteins with unknown function. Soybean USP-like proteins ADR6 (or SALI5-4A), an auxin-repressible, aluminium-inducible protein and SALI3-2, a protein that is up-regulated by aluminium. Soybean seed coat BURP-domain protein 1 (SCB1). It might play a role in the differentiation of the seed coat parenchyma cells. Arabidopsis RD22 drought induced protein. Maize ZRP2, a protein of unknown function in cortex parenchyma. Tomato PG1beta, the beta-subunit of polygalacturonase isozyme 1 (PG1), which is expressed in ripening fruits. Cereal RAFTIN. It is essential specifically for the maturation phase of pollen development. |
| >PF10950 DUF2775: Protein of unknown function (DUF2775); InterPro: IPR024489 This eukaryotic family includes a number of plant organ-specific proteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00