Citrus Sinensis ID: 018131
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 360 | ||||||
| 255551066 | 379 | Dehydration-responsive protein RD22 prec | 0.986 | 0.936 | 0.735 | 1e-150 | |
| 224089066 | 401 | predicted protein [Populus trichocarpa] | 0.983 | 0.882 | 0.679 | 1e-150 | |
| 118486691 | 421 | unknown [Populus trichocarpa] | 0.983 | 0.840 | 0.639 | 1e-145 | |
| 49615303 | 376 | dehydration-induced protein RD22-like pr | 0.972 | 0.930 | 0.696 | 1e-139 | |
| 225430045 | 337 | PREDICTED: dehydration-responsive protei | 0.922 | 0.985 | 0.709 | 1e-137 | |
| 225430043 | 364 | PREDICTED: dehydration-responsive protei | 0.983 | 0.972 | 0.713 | 1e-135 | |
| 54645911 | 364 | RD22-like protein [Vitis vinifera] | 0.983 | 0.972 | 0.710 | 1e-134 | |
| 224141799 | 331 | predicted protein [Populus trichocarpa] | 0.913 | 0.993 | 0.729 | 1e-133 | |
| 255639828 | 367 | unknown [Glycine max] | 0.980 | 0.961 | 0.673 | 1e-133 | |
| 351723823 | 367 | BURP domain-containing protein precursor | 0.980 | 0.961 | 0.673 | 1e-133 |
| >gi|255551066|ref|XP_002516581.1| Dehydration-responsive protein RD22 precursor, putative [Ricinus communis] gi|223544401|gb|EEF45922.1| Dehydration-responsive protein RD22 precursor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 275/374 (73%), Positives = 305/374 (81%), Gaps = 19/374 (5%)
Query: 1 MEFHLLPILAFLSLALVASHADISPELYWKTVLPNSPMPKAVKDLLQPDVLEDKSTSVNV 60
MEF L IL FL++AL SHA + PE+YWK+VLPN+ MPKAV DLLQ ++DKSTSV+V
Sbjct: 6 MEFSLPCILVFLTIALATSHAALPPEVYWKSVLPNTQMPKAVTDLLQSGFMDDKSTSVSV 65
Query: 61 GKGGVNVDAGKGKPGG-------------------GTHVNVGGKGVGVNTGKPDKRTSVG 101
GKG VNV+AGKGKPGG GT VNVG KGVGVNTGKP K T+VG
Sbjct: 66 GKGSVNVNAGKGKPGGTSVNVGKGGVNVNTRKGKPGTTVNVGRKGVGVNTGKPGKGTNVG 125
Query: 102 VGKGGVSVSTGHKGKPVYVGVSPFNYVYAANENQLHDDPNTALFFLEKDLHPGMKMNLHF 161
VGKGGVSV+TGHKGKPV+V V+PF Y YAA E QLH DPN ALFFLEKD++P MNLHF
Sbjct: 126 VGKGGVSVNTGHKGKPVHVNVAPFIYNYAATETQLHHDPNVALFFLEKDMYPRKTMNLHF 185
Query: 162 TQTSNGATFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQNTIKECEDPGIKGEQ 221
T+ N A FL RQ AKS PFSS +LPEI+NQFSVKPGS+EAE+M+NTIKECE PGI+GE+
Sbjct: 186 TENPNTAMFLPRQVAKSIPFSSKELPEIYNQFSVKPGSMEAELMKNTIKECEAPGIEGEE 245
Query: 222 KYCATSLESMIDFSTSKLGKSVQAISTEVKKGTKMQTYTIAAGVKQMAADKSVVCHKQNY 281
K CATSLESMIDFSTS LGK+VQAISTEV+ T+MQ YTI AG K+MA DKSVVCHKQNY
Sbjct: 246 KLCATSLESMIDFSTSVLGKNVQAISTEVENQTQMQKYTITAGAKEMAGDKSVVCHKQNY 305
Query: 282 PYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTDTSAWNPKHLAFQVLKVKPGTVPI 341
YAVFYCHATQTTRAYMV LEGADGTKAKA AVCHTDTS WN KHLAFQVLKVKPGTVP+
Sbjct: 306 AYAVFYCHATQTTRAYMVSLEGADGTKAKAVAVCHTDTSTWNTKHLAFQVLKVKPGTVPV 365
Query: 342 CHFLPEDHIVWVPN 355
CHFLP+DHIVWVPN
Sbjct: 366 CHFLPQDHIVWVPN 379
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224089066|ref|XP_002308621.1| predicted protein [Populus trichocarpa] gi|222854597|gb|EEE92144.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118486691|gb|ABK95182.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|49615303|gb|AAT66913.1| dehydration-induced protein RD22-like protein 2 [Gossypium arboreum] | Back alignment and taxonomy information |
|---|
| >gi|225430045|ref|XP_002284286.1| PREDICTED: dehydration-responsive protein RD22 [Vitis vinifera] gi|227464475|gb|ACP40551.1| rd22-c [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|225430043|ref|XP_002284269.1| PREDICTED: dehydration-responsive protein RD22 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|54645911|gb|AAV36561.1| RD22-like protein [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224141799|ref|XP_002324250.1| predicted protein [Populus trichocarpa] gi|222865684|gb|EEF02815.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255639828|gb|ACU20207.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|351723823|ref|NP_001238316.1| BURP domain-containing protein precursor [Glycine max] gi|188474279|gb|ACD49739.1| BURP domain-containing protein [Glycine max] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 360 | ||||||
| TAIR|locus:2179424 | 392 | RD22 "RESPONSIVE TO DESSICATIO | 0.825 | 0.757 | 0.544 | 2.3e-95 | |
| UNIPROTKB|Q70KG3 | 362 | RAFTIN1B "Protein RAFTIN 1B" [ | 0.6 | 0.596 | 0.418 | 5.4e-51 | |
| UNIPROTKB|Q7F8U7 | 412 | BURP13 "BURP domain-containing | 0.6 | 0.524 | 0.410 | 7.7e-50 | |
| UNIPROTKB|Q70KG5 | 389 | RAFTIN1A "Protein RAFTIN 1A" [ | 0.594 | 0.550 | 0.408 | 4.2e-49 | |
| TAIR|locus:2010237 | 280 | USPL1 "AT1G49320" [Arabidopsis | 0.6 | 0.771 | 0.359 | 1.3e-36 | |
| TAIR|locus:2195593 | 624 | PG1 "AT1G60390" [Arabidopsis t | 0.577 | 0.333 | 0.358 | 2e-30 | |
| TAIR|locus:2016194 | 626 | PG2 "AT1G70370" [Arabidopsis t | 0.577 | 0.332 | 0.358 | 4.2e-29 | |
| TAIR|locus:2034823 | 622 | JP630 "AT1G23760" [Arabidopsis | 0.508 | 0.294 | 0.377 | 8.8e-29 |
| TAIR|locus:2179424 RD22 "RESPONSIVE TO DESSICATION 22" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 808 (289.5 bits), Expect = 2.3e-95, Sum P(2) = 2.3e-95
Identities = 170/312 (54%), Positives = 198/312 (63%)
Query: 56 TSVNVGKGGVNVDAGKX-----XXXXXXXXXXXXXXXXXXXXXXDKRXXXXXXXXXXXXX 110
T+VNVGKGGV VD GK KR
Sbjct: 78 TAVNVGKGGVRVDTGKGKPGGGTHVSVGSGKGHGGGVAVHTGKPGKRTDVGVGKGGVTVH 137
Query: 111 XXXXXXPVYVGV----SPFNYVYAANENQLHDDPNTALFFLEKDLHPGMKMNLHFTQTSN 166
P+YVGV +PF Y YAA E QLHDDPN ALFFLEKDL G +MN+ F
Sbjct: 138 TRHKGRPIYVGVKPGANPFVYNYAAKETQLHDDPNAALFFLEKDLVRGKEMNVRFNAEDG 197
Query: 167 --GAT-FLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQNTIKECEDPGIKGEQKY 223
G T FL R A++ PF S+K E +FSV+ GS EAE+M+ TI+ECE + GE+KY
Sbjct: 198 YGGKTAFLPRGEAETVPFGSEKFSETLKRFSVEAGSEEAEMMKKTIEECEARKVSGEEKY 257
Query: 224 CATSLESMIDFSTSKLGK-SVQAISTEV-KKGTKMQTYTIAA-GVKQMAADKSVVCHKQN 280
CATSLESM+DFS SKLGK V+A+STEV KK MQ Y IAA GVK+++ DKSVVCHKQ
Sbjct: 258 CATSLESMVDFSVSKLGKYHVRAVSTEVAKKNAPMQKYKIAAAGVKKLSDDKSVVCHKQK 317
Query: 281 YPYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTDTSAWNPKHLAFQVLKVKPGTVP 340
YP+AVFYCH T Y VPLEG +G +AKA AVCH +TSAWNP HLAF+VLKVKPGTVP
Sbjct: 318 YPFAVFYCHKAMMTTVYAVPLEGENGMRAKAVAVCHKNTSAWNPNHLAFKVLKVKPGTVP 377
Query: 341 ICHFLPEDHIVW 352
+CHFLPE H+VW
Sbjct: 378 VCHFLPETHVVW 389
|
|
| UNIPROTKB|Q70KG3 RAFTIN1B "Protein RAFTIN 1B" [Triticum aestivum (taxid:4565)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q7F8U7 BURP13 "BURP domain-containing protein 13" [Oryza sativa Japonica Group (taxid:39947)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q70KG5 RAFTIN1A "Protein RAFTIN 1A" [Triticum aestivum (taxid:4565)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2010237 USPL1 "AT1G49320" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2195593 PG1 "AT1G60390" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2016194 PG2 "AT1G70370" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2034823 JP630 "AT1G23760" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| eugene3.00061755 | hypothetical protein (401 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 360 | |||
| pfam03181 | 216 | pfam03181, BURP, BURP domain | 1e-126 | |
| smart01045 | 222 | smart01045, BURP, The BURP domain is found at the | 2e-83 |
| >gnl|CDD|202568 pfam03181, BURP, BURP domain | Back alignment and domain information |
|---|
Score = 360 bits (927), Expect = e-126
Identities = 146/217 (67%), Positives = 169/217 (77%), Gaps = 5/217 (2%)
Query: 143 ALFFLEKDLHPGMKMNLHFTQTSNGA--TFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSV 200
LFFLEKDL PG KM LHF + ++GA FL RQ A S PFSS+KLPEI FSV PGSV
Sbjct: 1 GLFFLEKDLKPGKKMPLHFPKITDGAKRPFLPRQIADSIPFSSEKLPEILAMFSVPPGSV 60
Query: 201 EAEIMQNTIKECEDPGIKGEQKYCATSLESMIDFSTSKLGKS-VQAISTEVKKGTKMQTY 259
EA+IM++T++ECE P IKGE+K+CATSLESM+DF+TSKLG ++A+STEV+ G +Q Y
Sbjct: 61 EAKIMKSTLRECEAPAIKGEEKFCATSLESMVDFATSKLGTRDIRAVSTEVEGGGPLQKY 120
Query: 260 TIAAGVKQMAADKSVV-CHKQNYPYAVFYCHATQTTRAYMVPLEGADGTKAKAAAVCHTD 318
T+ GVK +A VV CH YPYAVFYCH TRAY V L GADGTK KA AVCHTD
Sbjct: 121 TVE-GVKPVAGGGKVVACHPMLYPYAVFYCHTVPKTRAYEVDLVGADGTKVKAVAVCHTD 179
Query: 319 TSAWNPKHLAFQVLKVKPGTVPICHFLPEDHIVWVPN 355
TSAWNPKH+AFQVL VKPGTVP+CHFLPE H+VWVPN
Sbjct: 180 TSAWNPKHVAFQVLGVKPGTVPVCHFLPEGHVVWVPN 216
|
The BURP domain is found at the C-terminus of several different plant proteins. It was named after the proteins in which it was first identified: the BNM2 clone-derived protein from Brassica napus; USPs and USP-like proteins; RD22 from Arabidopsis thaliana; and PG1beta from Lycopersicon esculentum. This domain is around 230 amino acid residues long. It possesses the following conserved features: two phenylalanine residues at its N-terminus; two cysteine residues; and four repeated cysteine-histidine motifs, arranged as: CH-X(10)-CH-X(25-27)-CH-X(25-26)-CH, where X can be any amino acid. The function of this domain is unknown. Length = 216 |
| >gnl|CDD|214992 smart01045, BURP, The BURP domain is found at the C-terminus of several different plant proteins | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 360 | |||
| PF03181 | 216 | BURP: BURP domain; InterPro: IPR004873 The BURP do | 100.0 | |
| PF10950 | 108 | DUF2775: Protein of unknown function (DUF2775); In | 95.17 |
| >PF03181 BURP: BURP domain; InterPro: IPR004873 The BURP domain is a ~230-residue module, which has been named for the four members of the group initially identified, BNM2, USP, RD22, and PG1beta | Back alignment and domain information |
|---|
Probab=100.00 E-value=6e-90 Score=640.58 Aligned_cols=212 Identities=68% Similarity=1.147 Sum_probs=205.4
Q ss_pred ccccccCCCCCCceeeecCCCCCC--CCcccchhhccccCCCCCChHHHhhhcCCCCCCHHHHHHHHHHHHccCCCCCCc
Q 018131 143 ALFFLEKDLHPGMKMNLHFTQTSN--GATFLSRQAAKSTPFSSDKLPEIFNQFSVKPGSVEAEIMQNTIKECEDPGIKGE 220 (360)
Q Consensus 143 ~~FF~e~dL~pG~~m~l~f~~~~~--~~~FLPR~vAdsIPFSs~kl~~iL~~FsI~~~S~~A~~m~~TL~~CE~p~~~GE 220 (360)
++||+|+||++|++|+++|++..+ .++||||++||+||||+++|++||++|+|+++|+||++|++||++||.++++||
T Consensus 1 ~~fF~e~dL~~G~~m~l~f~~~~~~~~~~fLpr~~A~siPfss~~l~~iL~~Fsi~~~S~~A~~m~~Tl~~Ce~~~~~GE 80 (216)
T PF03181_consen 1 ALFFLEKDLHPGKKMPLYFPKSDNSAKRPFLPRQVADSIPFSSSKLPEILQMFSIPPGSPMAKAMKNTLEECESPPIKGE 80 (216)
T ss_pred CcccCHHHCCCCceeeecCCCCCCCcccccCCHHHhccCCcCHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhhcCCCCCc
Confidence 489999999999999999998754 689999999999999999999999999999999999999999999999999999
Q ss_pred cccccCchhhHHHHHhhhcCC-ccEEeeeeeccCCcceeeEEEeeeeeecC-CcceeeecccCCceEEeecccCceeEEE
Q 018131 221 QKYCATSLESMIDFSTSKLGK-SVQAISTEVKKGTKMQTYTIAAGVKQMAA-DKSVVCHKQNYPYAVFYCHATQTTRAYM 298 (360)
Q Consensus 221 ~K~CaTSLESMvdFa~S~LG~-nv~a~sT~~~~~~~~q~YtV~~~Vk~i~g-~k~V~CH~~~yPYaVfYCH~~~~T~~Y~ 298 (360)
+|+||||||||+||++|+||+ |++++||++..+.+.|+|+|. +|+++.| +++|+||+|+|||+|||||.++.||+|+
T Consensus 81 ~k~CaTSLESMvdF~~s~LG~~~v~a~st~~~~~~~~~~y~V~-~v~~i~~~~~~V~CH~~~yPYaVyyCH~~~~t~~y~ 159 (216)
T PF03181_consen 81 TKYCATSLESMVDFAVSKLGTRNVRALSTEVPKSTPLQNYTVE-GVKKIGGGDKSVVCHKMPYPYAVYYCHSIPPTRVYM 159 (216)
T ss_pred CccCcCCHHHHHHHHHHhcCCCccEEEeccccCCCCCccEEEE-eeeeecCCCceEEEcccCCceeEEEeeecCceeEEE
Confidence 999999999999999999997 899999999877889999996 9999987 8999999999999999999999999999
Q ss_pred EeeecCCCCceeEEEeeecCCCCCCCCchhhhhhccCCCCcceeeeecCCcEEEeeC
Q 018131 299 VPLEGADGTKAKAAAVCHTDTSAWNPKHLAFQVLKVKPGTVPICHFLPEDHIVWVPN 355 (360)
Q Consensus 299 V~L~g~dG~~v~AvAVCH~DTS~Wnp~H~aF~~L~vKPG~~pVCHflp~~~ivWvp~ 355 (360)
|+|+|+||++++||||||+|||.|||+|+||++||+|||++|||||+++|+|+||||
T Consensus 160 V~l~g~dg~~~~avavCH~DTS~W~p~h~aF~~L~vkPG~~~VCHf~~~~~ivWv~~ 216 (216)
T PF03181_consen 160 VPLVGEDGTKVEAVAVCHLDTSGWNPDHPAFQVLGVKPGTVPVCHFLPNDHIVWVPN 216 (216)
T ss_pred EEEeecCCceEEEEEEEecCCCCCCcchHHHHHhCCCCCCcceEEEeeCCeEEEccC
Confidence 999999999999999999999999999999999999999999999999999999997
|
It is found in the C-terminal part of a number of plant cell wall proteins, which are defined not only by the BURP domain, but also by the overall similarity in their modular construction. The BURP domain proteins consists of either three or four modules: (i) an N-terminal hydrophobic domain - a presumptive transit peptide, joined to (ii) a short conserved segment or other short segment, (iii) an optional segment consisting of repeated units which is unique to each member, and (iv) the C-terminal BURP domain. Although the BURP domain proteins share primary structural features, their expression patterns and the conditions under which they are expressed differ. The presence of the conserved BURP domain in diverse plant proteins suggests an important and fundamental functional role for this domain []. It is possible that the BURP domain represents a general motif for localization of proteins within the cell wall matrix. The other structural domains associated with the BURP domain may specify other target sites for intermolecular interactions []. Some proteins known to contain a BURP domain are listed below [, , ]: Brassica protein BNM2, which is expressed during the induction of microspore embryogenesis. Field bean USPs, abundant non-storage seed proteins with unknown function. Soybean USP-like proteins ADR6 (or SALI5-4A), an auxin-repressible, aluminium-inducible protein and SALI3-2, a protein that is up-regulated by aluminium. Soybean seed coat BURP-domain protein 1 (SCB1). It might play a role in the differentiation of the seed coat parenchyma cells. Arabidopsis RD22 drought induced protein. Maize ZRP2, a protein of unknown function in cortex parenchyma. Tomato PG1beta, the beta-subunit of polygalacturonase isozyme 1 (PG1), which is expressed in ripening fruits. Cereal RAFTIN. It is essential specifically for the maturation phase of pollen development. |
| >PF10950 DUF2775: Protein of unknown function (DUF2775); InterPro: IPR024489 This eukaryotic family includes a number of plant organ-specific proteins | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00