Citrus Sinensis ID: 029048
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 200 | ||||||
| 21593052 | 232 | DAG protein, putative [Arabidopsis thali | 0.87 | 0.75 | 0.658 | 1e-67 | |
| 15220382 | 232 | putative plastid developmental protein D | 0.87 | 0.75 | 0.658 | 1e-67 | |
| 225461197 | 229 | PREDICTED: DAG protein, chloroplastic is | 0.945 | 0.825 | 0.697 | 6e-67 | |
| 297843972 | 232 | hypothetical protein ARALYDRAFT_471280 [ | 0.865 | 0.745 | 0.671 | 3e-62 | |
| 357446239 | 221 | DAG protein [Medicago truncatula] gi|355 | 0.985 | 0.891 | 0.598 | 3e-61 | |
| 449468532 | 230 | PREDICTED: DAG protein, chloroplastic-li | 0.89 | 0.773 | 0.669 | 9e-61 | |
| 255625841 | 221 | unknown [Glycine max] | 0.74 | 0.669 | 0.686 | 5e-60 | |
| 255555105 | 226 | DAG protein, chloroplast precursor, puta | 0.66 | 0.584 | 0.754 | 6e-60 | |
| 255629093 | 225 | unknown [Glycine max] | 0.83 | 0.737 | 0.647 | 2e-59 | |
| 343172728 | 171 | putative plastid developmental protein, | 0.705 | 0.824 | 0.706 | 3e-59 |
| >gi|21593052|gb|AAM65001.1| DAG protein, putative [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Score = 261 bits (667), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/202 (65%), Positives = 151/202 (74%), Gaps = 28/202 (13%)
Query: 25 GLRVG---SPTLRLPSRAHSRS-ILTVRAGATDSEYSSKRSSSNEPRETIMLPGCDYNHW 80
G+RVG +P LR S A SR + V+A DS+YSSKRS+SNE RETIMLPGCDYNHW
Sbjct: 31 GIRVGDSWTPLLRSISTAGSRRRVAIVKAATVDSDYSSKRSNSNEQRETIMLPGCDYNHW 90
Query: 81 LIVMEFPKDPAPTREQMIETYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTVSEETSE 140
LIVMEFPKDPAP+R+QMI+TYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCT+ EETSE
Sbjct: 91 LIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSE 150
Query: 141 KFK----------------------GDKYVNGEIIPCTYPTYQPNKRKESKYVSKRYERR 178
KFK GDKY+NGEIIPCTYPTYQP +R +KY SKRYER+
Sbjct: 151 KFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPCTYPTYQPKQRNNTKYQSKRYERK 210
Query: 179 RDG--PPAERRTRQAAGQSESA 198
RDG PP +R+ RQ S+S+
Sbjct: 211 RDGPPPPEQRKPRQEPAASDSS 232
|
Source: Arabidopsis thaliana Species: Arabidopsis thaliana Genus: Arabidopsis Family: Brassicaceae Order: Brassicales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|15220382|ref|NP_172610.1| putative plastid developmental protein DAG [Arabidopsis thaliana] gi|6554182|gb|AAF16628.1|AC011661_6 T23J18.10 [Arabidopsis thaliana] gi|26450103|dbj|BAC42171.1| unknown protein [Arabidopsis thaliana] gi|28827520|gb|AAO50604.1| putative DAG protein [Arabidopsis thaliana] gi|332190614|gb|AEE28735.1| putative plastid developmental protein DAG [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|225461197|ref|XP_002283211.1| PREDICTED: DAG protein, chloroplastic isoform 1 [Vitis vinifera] gi|359493924|ref|XP_003634693.1| PREDICTED: DAG protein, chloroplastic isoform 2 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|297843972|ref|XP_002889867.1| hypothetical protein ARALYDRAFT_471280 [Arabidopsis lyrata subsp. lyrata] gi|297335709|gb|EFH66126.1| hypothetical protein ARALYDRAFT_471280 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|357446239|ref|XP_003593397.1| DAG protein [Medicago truncatula] gi|355482445|gb|AES63648.1| DAG protein [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449468532|ref|XP_004151975.1| PREDICTED: DAG protein, chloroplastic-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|255625841|gb|ACU13265.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|255555105|ref|XP_002518590.1| DAG protein, chloroplast precursor, putative [Ricinus communis] gi|223542435|gb|EEF43977.1| DAG protein, chloroplast precursor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|255629093|gb|ACU14891.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|343172728|gb|AEL99067.1| putative plastid developmental protein, partial [Silene latifolia] gi|343172730|gb|AEL99068.1| putative plastid developmental protein, partial [Silene latifolia] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 200 | ||||||
| TAIR|locus:2200131 | 232 | MORF9 "multiple organellar RNA | 0.735 | 0.633 | 0.618 | 1.5e-42 | |
| UNIPROTKB|Q2R8U1 | 374 | Os11g0216400 "Os11g0216400 pro | 0.67 | 0.358 | 0.507 | 4.5e-27 | |
| TAIR|locus:2083348 | 244 | MORF3 "multiple organellar RNA | 0.88 | 0.721 | 0.385 | 6.1e-23 | |
| TAIR|locus:2063389 | 232 | MORF6 "multiple organellar RNA | 0.68 | 0.586 | 0.390 | 2.8e-20 | |
| TAIR|locus:2086310 | 395 | RIP1 "RNA-editing factor inter | 0.765 | 0.387 | 0.327 | 3.7e-20 | |
| TAIR|locus:2051003 | 219 | DAL1 "differentiation and gree | 0.93 | 0.849 | 0.329 | 9.2e-20 | |
| TAIR|locus:2206639 | 229 | AT1G32580 "AT1G32580" [Arabido | 0.56 | 0.489 | 0.432 | 3.6e-18 | |
| TAIR|locus:2119782 | 419 | MORF1 "multiple organellar RNA | 0.51 | 0.243 | 0.476 | 4.7e-18 | |
| TAIR|locus:2030200 | 192 | AT1G72530 "AT1G72530" [Arabido | 0.36 | 0.375 | 0.410 | 6e-17 | |
| TAIR|locus:2156344 | 723 | MORF4 "AT5G44780" [Arabidopsis | 0.415 | 0.114 | 0.517 | 1.5e-15 |
| TAIR|locus:2200131 MORF9 "multiple organellar RNA editing factor 9" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 450 (163.5 bits), Expect = 1.5e-42, P = 1.5e-42
Identities = 94/152 (61%), Positives = 108/152 (71%)
Query: 25 GLRVG---SPTLRLPSRAHSRS-ILTVRAGATDXXXXXXXXXXNEPRETIMLPGCDYNHW 80
G+RVG +P LR S A SR + V+A D NE RETIMLPGCDYNHW
Sbjct: 31 GIRVGDSWTPLLRNISTAGSRRRVAIVKAATVDSDYSSKRSNSNEQRETIMLPGCDYNHW 90
Query: 81 LIVMEFPKDPAPTREQMIETYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTVSEETSE 140
LIVMEFPKDPAP+R+QMI+TYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCT+ EETSE
Sbjct: 91 LIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSE 150
Query: 141 KFKGDKYVNGEIIPCTYPTYQPNKRKESKYVS 172
KFKG V ++P +Y + KY++
Sbjct: 151 KFKGLPGVLW-VLPDSYIDVKNKDYGGDKYIN 181
|
|
| UNIPROTKB|Q2R8U1 Os11g0216400 "Os11g0216400 protein" [Oryza sativa Japonica Group (taxid:39947)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2083348 MORF3 "multiple organellar RNA editing factor 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2063389 MORF6 "multiple organellar RNA editing factor 6" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2086310 RIP1 "RNA-editing factor interacting protein 1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2051003 DAL1 "differentiation and greening-like 1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2206639 AT1G32580 "AT1G32580" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2119782 MORF1 "multiple organellar RNA editing factor 1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2030200 AT1G72530 "AT1G72530" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2156344 MORF4 "AT5G44780" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| AT1G11430 | plastid developmental protein DAG, putative; plastid developmental protein DAG, putative; LOCATED IN- chloroplast stroma, chloroplast, chloroplast envelope; EXPRESSED IN- 22 plant structures; EXPRESSED DURING- 14 growth stages; BEST Arabidopsis thaliana protein match is- plastid developmental protein DAG, putative (TAIR-AT3G06790.2); Has 147 Blast hits to 135 proteins in 9 species- Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 147; Viruses - 0; Other Eukaryotes - 0 (source- NCBI BLink). (232 aa) | ||||||||||
(Arabidopsis thaliana) | |||||||||||
| AT3G08740 | • | • | 0.896 | ||||||||
| AT5G55220 | • | • | 0.894 | ||||||||
| CHL-CPN10 | • | • | 0.883 | ||||||||
| AT1G36390 | • | • | 0.875 | ||||||||
| AT3G12930 | • | 0.843 | |||||||||
| AT5G58250 | • | 0.841 | |||||||||
| AT3G18680 | • | 0.841 | |||||||||
| CP33 | • | 0.835 | |||||||||
| emb2184 | • | • | 0.832 | ||||||||
| EMB1241 | • | • | 0.824 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 200 | |||
| PF05922 | 82 | Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP | 95.39 |
| >PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively | Back alignment and domain information |
|---|
Probab=95.39 E-value=0.022 Score=39.11 Aligned_cols=69 Identities=22% Similarity=0.412 Sum_probs=42.6
Q ss_pred eEEEEcCCCCCCCCHHHHHHHHHHHHHHHhCCHH----HHhhc-cceeeecceeeeeeeeCHHHHHhhhCCCcccceEec
Q 029048 80 WLIVMEFPKDPAPTREQMIETYLNTLATVLGSME----EAKKN-MYAFSTTTYTGFQCTVSEETSEKFKGDKYVNGEIIP 154 (200)
Q Consensus 80 WLVvMd~P~~~~pSr~EmId~Yv~TLAkVLGSEE----EAKkk-IYsvSt~tyfGF~c~IdEE~S~KLkGd~~VnG~IvP 154 (200)
.+|+|+.+ .+.++.++...+-++..|.+.. ..+-+ +|.+.. .--||.+.+++++.++|+.+|.|. -|.|
T Consensus 2 YIV~~k~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~-~~~Gfs~~l~~~~i~~L~~~p~V~-~Ve~ 75 (82)
T PF05922_consen 2 YIVVFKDD----ASAASSFSSHKSWQASILKSALKSASSINAKVLYSYDN-AFNGFSAKLSEEEIEKLRKDPGVK-SVEP 75 (82)
T ss_dssp EEEEE-TT----STHHCHHHHHHHHHH----HHHHTH-TTT-EEEEEESS-TSSEEEEEE-HHHHHHHHTSTTEE-EEEE
T ss_pred EEEEECCC----CCcchhHHHHHHHHHHHHhhhhhhhcccCCceEEEEee-eEEEEEEEeCHHHHHHHHcCCCeE-EEEe
Confidence 47888866 3334456666666665554321 12223 676766 789999999999999999999888 4444
|
In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Limited proteolysis of most large protein precursors is carried out in vivo by the subtilisin-like pro-protein convertases. Many important biological processes such as peptide hormone synthesis, viral protein processing and receptor maturation involve proteolytic processing by these enzymes []. The subtilisin-serine protease (SRSP) family hormone and pro-protein convertases (furin, PC1/3, PC2, PC4, PACE4, PC5/6, and PC7/7/LPC) act within the secretory pathway to cleave polypeptide precursors at specific basic sites, generating their biologically active forms. Serum proteins, pro-hormones, receptors, zymogens, viral surface glycoproteins, bacterial toxins, amongst others, are activated by this route []. The SRSPs share the same domain structure, including a signal peptide, the pro-peptide, the catalytic domain, the P/middle or homo B domain, and the C terminus. Proteinase propeptide inhibitors (sometimes refered to as activation peptides) are responsible for the modulation of folding and activity of the pro-enzyme or zymogen. The pro-segment docks into the enzyme moiety shielding the substrate binding site, thereby promoting inhibition of the enzyme. Several such propeptides share a similar topology [], despite often low sequence identities []. The propeptide region has an open-sandwich antiparallel-alpha/antiparallel-beta fold, with two alpha-helices and four beta-strands with a (beta/alpha/beta)x2 topology. This group of sequences contain the propeptide domain at the N terminus of peptidases belonging to MEROPS family S8A, subtilisins. A number of the members of this group of sequences belong to MEROPS inhibitor family I9, clan I-. The propeptide is removed by proteolytic cleavage; removal activating the enzyme.; GO: 0004252 serine-type endopeptidase activity, 0042802 identical protein binding, 0043086 negative regulation of catalytic activity; PDB: 3CNQ_P 1SPB_P 3CO0_P 1ITP_A 1V5I_B 1SCJ_B 3P5B_P 2XTJ_P 2W2M_P 2P4E_P .... |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 200 | |||
| 2w2n_P | 114 | Proprotein convertase subtilisin/kexin type 9; hyd | 94.49 | |
| 3cnq_P | 80 | Subtilisin BPN'; uncleaved, proenzyme, substrate c | 88.21 | |
| 2qtw_A | 124 | Proprotein convertase subtilisin/kexin type 9 Pro; | 80.03 |
| >2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P | Back alignment and structure |
|---|
Probab=94.49 E-value=0.035 Score=41.46 Aligned_cols=65 Identities=14% Similarity=0.102 Sum_probs=46.7
Q ss_pred ceeEEEEcCCCCCCCCHHHHHHHHHHHHHHHhCCHHHHhhc-cceeeecceeeeeeeeCHHHHHhhhCCCccc
Q 029048 78 NHWLIVMEFPKDPAPTREQMIETYLNTLATVLGSMEEAKKN-MYAFSTTTYTGFQCTVSEETSEKFKGDKYVN 149 (200)
Q Consensus 78 ~HWLVvMd~P~~~~pSr~EmId~Yv~TLAkVLGSEEEAKkk-IYsvSt~tyfGF~c~IdEE~S~KLkGd~~Vn 149 (200)
+.|+|+|..- ++ .+.++.+.+.|+.+|++ +.+.-. +|.+. ..--||.+.++|++.++|+.+|.|.
T Consensus 38 ~~YIV~lk~~----~~-~~~~~~h~~~l~s~~~~-~~~~~~i~~sY~-~~~~GFaa~Lt~~~~~~L~~~P~V~ 103 (114)
T 2w2n_P 38 GTYVVVLKEE----TH-LSQSERTARRLQAQAAR-RGYLTKILHVFH-GLLPGFLVKMSGDLLELALKLPHVD 103 (114)
T ss_dssp EEEEEEECTT----CC-HHHHHHHHHHHHHHHHH-TTCCCEEEEEEC-SSSSEEEEECCGGGHHHHHTSTTEE
T ss_pred CcEEEEECCC----CC-HHHHHHHHHHHHHHhhh-cccCCceEEEec-ccceEEEEEcCHHHHHHHHcCCCcc
Confidence 4699999742 22 34566788888888754 223333 56664 4678999999999999999998775
|
| >3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B | Back alignment and structure |
|---|
| >2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00