Citrus Sinensis ID: 029233

Local Sequence Feature Prediction

Prediction and (Method)	Result

Residue Number Marker

Protein Sequence

Secondary Structure (PSIPRED)

Secondary Structure Prediction (SSPRO)

Coil and Loop (DISEMBL)

Flexible Loop (DISEMBL)

Low Complexity Region (SEG)

Disordered region (IsUnstruct)

Disordered Region (DISOPRED)

Disordered Region (DISEMBL)

Disordered Region (DISPRO)

Transmembrane Helix (TMHMM)

Transmembrane Helix (HMMTOP)

Transmembrane Helix (MEMSAT)

TM Helix, Signal Peptide (MEMSAT_SVM)

TM Helix, Signal Peptide (Phobius)

Signal Peptide (SignalP HMM Mode)

Signal Peptide (SignalP NN Mode)

Coiled Coils (COILS)

Positional Conservation

--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------

MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ

ccEEEEEccccEEEEEEEcHHHHHHHcccccEEEEcccccccccccccccccccccccccccccccccccccccEEEEEEEEccccccccHHHHHHHHHHHHHHHHccHHHHHHHHEEEEEEEEEEEEEEEcHHHHHHHccccccEEEcccccccccccccccccccccccccccccccccccccccEEEEEccccc

cEEEEEccccccccEccccHHHHHHHcccccEEEEEcccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccHHHHHHHHHHHHHHHHccHHHHHHHHHHEcccccccEEEEEcHHHHHHHcccccEEEEEccccccccccccccccccccEEccccccccccccccccEEEEEccccc

mciydascdthfgfccnvdeETSRELArlpgvltvrpdpdynsvkkdyscldiqmsnpwnsqigttllfplgttkhwlvrmdkpgvgvvtKAQMVDRYAQILTKVLGNEKDAQMCIYHVSwqsnfgfccelDDECAQeltgvpgvlsvqpdenfgsenkdyggnnlqnsmvpsdsseasptqiktKKLFVTGIQWTQ

mciydascdthFGFCCNVDEETSRELarlpgvltvrpdpdynsVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPsdsseasptqiktkklfvtgiqwtq

MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ

*CIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVL*****************************************LFVTGI****

*CIYDASCDTHFGFCCNVDEE*******************************************TTLLFPLGTTKHWLVRMDKPG***VTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDEC****TGVPGVLSVQPDENFGSENKDYGGNNLQNSM******************FVTGIQWT*

MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNS************QIKTKKLFVTGIQWTQ

MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDP***********************IGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ

ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ

no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

Original result of BLAST against SWISS-PROT Database

ID	Alignment graph	Length	Definition	RBH(Q2H)	RBH(H2Q)	Q cover	H cover	Identity	E-value
Query		197	2.2.26 [Sep-21-2011]
Q38732		230	DAG protein, chloroplasti	N/A	no	0.619	0.530	0.341	7e-15
Q9LKA5		395	Uncharacterized protein A	no	no	0.446	0.222	0.337	1e-08

>sp\|Q38732\|DAG_ANTMA DAG protein, chloroplastic OS=Antirrhinum majus GN=DAG PE=2 SV=1	Back alignment and function desciption

 Score = 80.5 bits (197), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 66/123 (53%), Gaps = 1/123 (0%)

Query: 66  TLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNF 125
           T++ P     HWL+ M+ P     T+ QM+D Y   L  VLG+ ++A+  +Y  S  +  
Sbjct: 78  TIMLPGCDYNHWLIVMEFPKDPAPTREQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYT 137

Query: 126 GFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQN-SMVPSDSSEASPTQIK 184
           GF C + +E +++  G+PGVL V PD     +NKDYGG+   N  ++P       P Q +
Sbjct: 138 GFQCTVTEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYVNGEIIPCQYPTYQPKQSR 197

Query: 185 TKK 187
           + K
Sbjct: 198 SSK 200

Acts very early in chloroplast development, being required for expression of RNA polymerase beta subunit gene, and hence indirectly for subsequent expression of CAB and RBCS genes.
Antirrhinum majus (taxid: 4151)

>sp\|Q9LKA5\|UMP1_ARATH Uncharacterized protein At3g15000, mitochondrial OS=Arabidopsis thaliana GN=At3g15000 PE=1 SV=1	Back alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST

Original result of BLAST against Nonredundant Database

GI	Alignment Graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		197
224110196		311	predicted protein [Populus trichocarpa]	0.984	0.623	0.75	7e-82
224097630		333	predicted protein [Populus trichocarpa]	0.954	0.564	0.740	3e-79
225464121		409	PREDICTED: uncharacterized protein LOC10	0.989	0.476	0.719	9e-79
296087958		315	unnamed protein product [Vitis vinifera]	0.989	0.619	0.719	3e-78
147867424		408	hypothetical protein VITISV_001132 [Viti	0.989	0.477	0.591	5e-73
449525960		394	PREDICTED: uncharacterized LOC101208465	0.989	0.494	0.653	8e-72
449446458		394	PREDICTED: uncharacterized protein LOC10	0.989	0.494	0.658	1e-71
255568010		371	RNA binding protein, putative [Ricinus c	0.822	0.436	0.773	4e-70
356576299		386	PREDICTED: uncharacterized protein LOC10	0.969	0.494	0.663	2e-68
9293999		396	unnamed protein product [Arabidopsis tha	0.923	0.459	0.594	2e-60

>gi\|224110196\|ref\|XP_002315444.1\| predicted protein [Populus trichocarpa] gi\|222864484\|gb\|EEF01615.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

 Score =  308 bits (790), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 147/196 (75%), Positives = 168/196 (85%), Gaps = 2/196 (1%)

Query: 1   MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWN 60
           MCIYDA  DTHFGFCC++DE+ S ELARLPGVL+VRPDPDYNSV+KDYS   +++S   N
Sbjct: 34  MCIYDACYDTHFGFCCDIDEDASLELARLPGVLSVRPDPDYNSVEKDYSS-GVKLSTLSN 92

Query: 61  SQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVS 120
            QIG+ LLFP G TKHWLVR+DKPGVGVVTKAQMVD YAQILTKV+G EKDAQMCIYHVS
Sbjct: 93  PQIGSKLLFPSGNTKHWLVRIDKPGVGVVTKAQMVDYYAQILTKVMGYEKDAQMCIYHVS 152

Query: 121 WQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASP 180
           WQSNFGFCCELD+ECAQEL GVPGVLSV PD++F SENKDY G++L NS  P DSSEAS 
Sbjct: 153 WQSNFGFCCELDEECAQELAGVPGVLSVLPDKDFESENKDYRGDSLINSANPPDSSEASQ 212

Query: 181 -TQIKTKKLFVTGIQW 195
            T ++TKKLF+TG+ +
Sbjct: 213 ITHVRTKKLFITGLSF 228

Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi\|224097630\|ref\|XP_002311019.1\| predicted protein [Populus trichocarpa] gi\|222850839\|gb\|EEE88386.1\| predicted protein [Populus trichocarpa]	Back alignment and taxonomy information

>gi\|225464121\|ref\|XP_002264164.1\| PREDICTED: uncharacterized protein LOC100265361 [Vitis vinifera]	Back alignment and taxonomy information

>gi\|296087958\|emb\|CBI35241.3\| unnamed protein product [Vitis vinifera]	Back alignment and taxonomy information

>gi\|147867424\|emb\|CAN83272.1\| hypothetical protein VITISV_001132 [Vitis vinifera]	Back alignment and taxonomy information

>gi\|449525960\|ref\|XP_004169984.1\| PREDICTED: uncharacterized LOC101208465 [Cucumis sativus]	Back alignment and taxonomy information

>gi\|449446458\|ref\|XP_004140988.1\| PREDICTED: uncharacterized protein LOC101208465 [Cucumis sativus]	Back alignment and taxonomy information

>gi\|255568010\|ref\|XP_002524982.1\| RNA binding protein, putative [Ricinus communis] gi\|223535726\|gb\|EEF37389.1\| RNA binding protein, putative [Ricinus communis]	Back alignment and taxonomy information

>gi\|356576299\|ref\|XP_003556270.1\| PREDICTED: uncharacterized protein LOC100791652 [Glycine max]	Back alignment and taxonomy information

>gi\|9293999\|dbj\|BAB01902.1\| unnamed protein product [Arabidopsis thaliana]	Back alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

Original result of BLAST against Gene Ontology (AMIGO)

ID	Alignment graph	Length	Definition	Q cover	H cover	Identity	E-value
Query		197
TAIR\|locus:2088756		374	AT3G20930 [Arabidopsis thalian	0.913	0.481	0.601	1.5e-58
TAIR\|locus:2206639		229	AT1G32580 "AT1G32580" [Arabido	0.527	0.454	0.471	5.5e-22
TAIR\|locus:2051003		219	DAL1 "differentiation and gree	0.558	0.502	0.472	1.5e-21
TAIR\|locus:2063389		232	MORF6 "multiple organellar RNA	0.532	0.452	0.462	4.4e-20
TAIR\|locus:2200131		232	MORF9 "multiple organellar RNA	0.710	0.603	0.342	6e-16
UNIPROTKB\|Q2R8U1		374	Os11g0216400 "Os11g0216400 pro	0.527	0.278	0.314	8.4e-12
TAIR\|locus:2083348		244	MORF3 "multiple organellar RNA	0.761	0.614	0.282	6.4e-11
TAIR\|locus:2086310		395	RIP1 "RNA-editing factor inter	0.593	0.296	0.322	2e-10
TAIR\|locus:2030200		192	AT1G72530 "AT1G72530" [Arabido	0.522	0.536	0.307	1.1e-07
TAIR\|locus:2119782		419	MORF1 "multiple organellar RNA	0.583	0.274	0.270	1.5e-05

TAIR\|locus:2088756 AT3G20930 [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 116/193 (60%), Positives = 146/193 (75%)

Query:     3 IYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQ 62
             IYDAS DTHFGFCC++DE+ SR+LA LPGV+++RP+ DY+S KK+Y            S 
Sbjct:   112 IYDASFDTHFGFCCHIDEDASRQLASLPGVVSIRPEQDYSSEKKNYGI---------GSH 162

Query:    63 IGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQ 122
              G +L F  GT KHW+VR+DKPGVG+VTKAQMVD   Q+L+KVL NEKDAQMC+YHVSWQ
Sbjct:   163 KGVSL-FDHGTVKHWMVRIDKPGVGIVTKAQMVDHCVQLLSKVLWNEKDAQMCLYHVSWQ 221

Query:   123 SNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQ 182
             S+FGFCC+LD+  A EL GVPGVL+V PD +F S NKDY G++ Q+S    D SE+ P  
Sbjct:   222 SDFGFCCDLDERSAVELAGVPGVLAVVPDNSFESLNKDYEGDSTQDSR-DQDDSESPP-- 278

Query:   183 IKTKKLFVTGIQW 195
             +KTKKLF+TG+ +
Sbjct:   279 VKTKKLFITGLSF 291

GO:0000166 "nucleotide binding" evidence=IEA

GO:0003676 "nucleic acid binding" evidence=IEA

GO:0003723 "RNA binding" evidence=ISS

GO:0008150 "biological_process" evidence=ND

GO:0000023 "maltose metabolic process" evidence=RCA

GO:0010027 "thylakoid membrane organization" evidence=RCA

GO:0019252 "starch biosynthetic process" evidence=RCA

TAIR\|locus:2206639 AT1G32580 "AT1G32580" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2051003 DAL1 "differentiation and greening-like 1" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2063389 MORF6 "multiple organellar RNA editing factor 6" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2200131 MORF9 "multiple organellar RNA editing factor 9" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

UNIPROTKB\|Q2R8U1 Os11g0216400 "Os11g0216400 protein" [Oryza sativa Japonica Group (taxid:39947)]	Back alignment and assigned GO terms

TAIR\|locus:2083348 MORF3 "multiple organellar RNA editing factor 3" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2086310 RIP1 "RNA-editing factor interacting protein 1" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2030200 AT1G72530 "AT1G72530" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

TAIR\|locus:2119782 MORF1 "multiple organellar RNA editing factor 1" [Arabidopsis thaliana (taxid:3702)]	Back alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

Original result of BLAST against SWISS-PROT

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server

Original result from Ezypred Server

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software

No EC number assignment, probably not an enzyme!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING

Original result from the STRING server

Fail to connect to STRING server

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

Original result of RPS-BLAST against CDD database part I

Conserved Domains Detected by HHsearch

Original result of HHsearch against CDD database

ID	Alignment Graph	Length	Definition	Probability
Query		197
PF05922		82	Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP	99.4
PF05922		82	Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP	95.89

>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively	Back alignment and domain information

Probab=99.40  E-value=3.9e-13  Score=95.32  Aligned_cols=77  Identities=18%  Similarity=0.242  Sum_probs=57.8

Q ss_pred             eEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCc----hhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCC
Q 029233           76 HWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNE----KDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPD  151 (197)
Q Consensus        76 ~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~----e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD  151 (197)
                      .|||.|+.+....    ...+.|.+|+++++.+.    ......++|.|...++||+|+|+++++++|+++|+|.+|.||
T Consensus         1 ~YIV~~k~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~Gfs~~l~~~~i~~L~~~p~V~~Ve~D   76 (82)
T PF05922_consen    1 RYIVVFKDDASAA----SSFSSHKSWQASILKSALKSASSINAKVLYSYDNAFNGFSAKLSEEEIEKLRKDPGVKSVEPD   76 (82)
T ss_dssp             EEEEEE-TTSTHH----CHHHHHHHHHH----HHHHTH-TTT-EEEEEESSTSSEEEEEE-HHHHHHHHTSTTEEEEEEE
T ss_pred             CEEEEECCCCCcc----hhHHHHHHHHHHHHhhhhhhhcccCCceEEEEeeeEEEEEEEeCHHHHHHHHcCCCeEEEEeC
Confidence            5999999875433    25778888888776542    234567999999999999999999999999999999999999


Q ss_pred             CCcCC
Q 029233          152 ENFGS  156 (197)
Q Consensus       152 ~~~~~  156 (197)
                      ..+++
T Consensus        77 ~~v~l   81 (82)
T PF05922_consen   77 QVVSL   81 (82)
T ss_dssp             CEEEE
T ss_pred             ceEec
Confidence            88754

In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Limited proteolysis of most large protein precursors is carried out in vivo by the subtilisin-like pro-protein convertases. Many important biological processes such as peptide hormone synthesis, viral protein processing and receptor maturation involve proteolytic processing by these enzymes []. The subtilisin-serine protease (SRSP) family hormone and pro-protein convertases (furin, PC1/3, PC2, PC4, PACE4, PC5/6, and PC7/7/LPC) act within the secretory pathway to cleave polypeptide precursors at specific basic sites, generating their biologically active forms. Serum proteins, pro-hormones, receptors, zymogens, viral surface glycoproteins, bacterial toxins, amongst others, are activated by this route []. The SRSPs share the same domain structure, including a signal peptide, the pro-peptide, the catalytic domain, the P/middle or homo B domain, and the C terminus. Proteinase propeptide inhibitors (sometimes refered to as activation peptides) are responsible for the modulation of folding and activity of the pro-enzyme or zymogen. The pro-segment docks into the enzyme moiety shielding the substrate binding site, thereby promoting inhibition of the enzyme. Several such propeptides share a similar topology [], despite often low sequence identities []. The propeptide region has an open-sandwich antiparallel-alpha/antiparallel-beta fold, with two alpha-helices and four beta-strands with a (beta/alpha/beta)x2 topology. This group of sequences contain the propeptide domain at the N terminus of peptidases belonging to MEROPS family S8A, subtilisins. A number of the members of this group of sequences belong to MEROPS inhibitor family I9, clan I-. The propeptide is removed by proteolytic cleavage; removal activating the enzyme.; GO: 0004252 serine-type endopeptidase activity, 0042802 identical protein binding, 0043086 negative regulation of catalytic activity; PDB: 3CNQ_P 1SPB_P 3CO0_P 1ITP_A 1V5I_B 1SCJ_B 3P5B_P 2XTJ_P 2W2M_P 2P4E_P ....

>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively	Back alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST

Original result of BLAST against Protein Data Bank

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST

Original result of RPS-BLAST against PDB70 database

No hit with e-value below 0.005

Structure Templates Detected by HHsearch

Original result of HHsearch against PDB70 database

ID	Alignment Graph	Length	Definition	Probability
Query		197
2w2n_P		114	Proprotein convertase subtilisin/kexin type 9; hyd	99.59
2qtw_A		124	Proprotein convertase subtilisin/kexin type 9 Pro;	99.18
3cnq_P		80	Subtilisin BPN'; uncleaved, proenzyme, substrate c	99.11
1v5i_B		76	POIA1, IA-1=serine proteinase inhibitor; protease-	98.46
2p4e_P		692	Proprotein convertase subtilisin/kexin type 9; pro	97.94
3t41_A		471	Epidermin leader peptide processing serine protea;	96.3
3afg_A		539	Subtilisin-like serine protease; propeptide, therm	96.22
3cnq_P		80	Subtilisin BPN'; uncleaved, proenzyme, substrate c	95.73
2z30_B		65	TK-subtilisin; thermococcus kodakaraensis, hydrola	95.56
2z2z_A		395	TK-subtilisin precursor; thermococcus kodakaraensi	94.59
2w2n_P		114	Proprotein convertase subtilisin/kexin type 9; hyd	94.57
2z30_B		65	TK-subtilisin; thermococcus kodakaraensis, hydrola	93.67
2qtw_A		124	Proprotein convertase subtilisin/kexin type 9 Pro;	93.08
1v5i_B		76	POIA1, IA-1=serine proteinase inhibitor; protease-	86.82

>2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P	Back alignment and structure

Probab=99.59  E-value=7e-16  Score=118.49  Aligned_cols=76  Identities=16%  Similarity=0.137  Sum_probs=64.7

Q ss_pred             eeEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCchhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCCCCc
Q 029233           75 KHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENF  154 (197)
Q Consensus        75 ~~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD~~~  154 (197)
                      +.|||+|+....    + ..++.|++||++++++ +.+...|.|+|++.|+||+|+||++|+++|+++|+|++|+||+.+
T Consensus        38 ~~YIV~lk~~~~----~-~~~~~h~~~l~s~~~~-~~~~~~i~~sY~~~~~GFaa~Lt~~~~~~L~~~P~V~~VE~D~~v  111 (114)
T 2w2n_P           38 GTYVVVLKEETH----L-SQSERTARRLQAQAAR-RGYLTKILHVFHGLLPGFLVKMSGDLLELALKLPHVDYIEEDSSV  111 (114)
T ss_dssp             EEEEEEECTTCC----H-HHHHHHHHHHHHHHHH-TTCCCEEEEEECSSSSEEEEECCGGGHHHHHTSTTEEEEEEEEEE
T ss_pred             CcEEEEECCCCC----H-HHHHHHHHHHHHHhhh-cccCCceEEEecccceEEEEEcCHHHHHHHHcCCCccEEEeCceE
Confidence            589999986442    2 2567999999999865 335668999999999999999999999999999999999999876


Q ss_pred             CC
Q 029233          155 GS  156 (197)
Q Consensus       155 ~~  156 (197)
                      +.
T Consensus       112 ~~  113 (114)
T 2w2n_P          112 FA  113 (114)
T ss_dssp             EE
T ss_pred             ec
Confidence            53

PyMOL of 2w2n

>2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P	Back alignment and structure

>3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B	Back alignment and structure

>1v5i_B POIA1, IA-1=serine proteinase inhibitor; protease-inhibitor complex, subtilisin, hydrolase-Pro binding complex; 1.50A {Pleurotus ostreatus} SCOP: d.58.3.2 PDB: 1itp_A	Back alignment and structure

>2p4e_P Proprotein convertase subtilisin/kexin type 9; protease, LDL receptor, LDL, endocytosis, hydrol; 1.98A {Homo sapiens}	Back alignment and structure

>3t41_A Epidermin leader peptide processing serine protea; structural genomics, center for structural genomics of infec diseases, csgid; 1.95A {Staphylococcus aureus} PDB: 3qfh_A	Back alignment and structure

>3afg_A Subtilisin-like serine protease; propeptide, thermococcus kodakaraensis, hydrolas protease; 2.00A {Thermococcus kodakarensis}	Back alignment and structure

>3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B	Back alignment and structure

>2z30_B TK-subtilisin; thermococcus kodakaraensis, hydrolase; 1.65A {Thermococcus kodakarensis} PDB: 2z2y_B 3a3p_B 2z56_B 2z58_B 2z57_B 3a3n_B 3a3o_B	Back alignment and structure

>2z2z_A TK-subtilisin precursor; thermococcus kodakaraensis, hydrolase; 1.87A {Thermococcus kodakarensis} PDB: 2e1p_A 2zwp_A 2zwo_A	Back alignment and structure

>2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P	Back alignment and structure

>2z30_B TK-subtilisin; thermococcus kodakaraensis, hydrolase; 1.65A {Thermococcus kodakarensis} PDB: 2z2y_B 3a3p_B 2z56_B 2z58_B 2z57_B 3a3n_B 3a3o_B	Back alignment and structure

>2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P	Back alignment and structure

>1v5i_B POIA1, IA-1=serine proteinase inhibitor; protease-inhibitor complex, subtilisin, hydrolase-Pro binding complex; 1.50A {Pleurotus ostreatus} SCOP: d.58.3.2 PDB: 1itp_A	Back alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

Original result of RPS-BLAST against SCOP70(version1.75) database

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch

Original result of HHsearch against SCOP70(version1.75) database

ID	Alignment Graph	Length	Definition	Probability
Query		197
d1scjb_		71	Subtilisin prosegment {Bacillus subtilis [TaxId: 1	98.41
d1v5ib1		72	Proteinase A inhibitor 1, POIA1 {Oyster mushroom (	98.39
d1scjb_		71	Subtilisin prosegment {Bacillus subtilis [TaxId: 1	92.55
d1v5ib1		72	Proteinase A inhibitor 1, POIA1 {Oyster mushroom (	87.45

>d1scjb_ d.58.3.2 (B:) Subtilisin prosegment {Bacillus subtilis [TaxId: 1423]}	Back information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: Ferredoxin-like
superfamily: Protease propeptides/inhibitors
family: Subtilase propeptides/inhibitors
domain: Subtilisin prosegment
species: Bacillus subtilis [TaxId: 1423]

Probab=98.41  E-value=3e-07  Score=63.61  Aligned_cols=68  Identities=15%  Similarity=0.138  Sum_probs=51.9

Q ss_pred             eeEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCchhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCCCCc
Q 029233           75 KHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENF  154 (197)
Q Consensus        75 ~~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD~~~  154 (197)
                      +.|||.+.......     ..+.+.+++.+.       -.++.+.|. .++||+|+|+++++++|++.|+|..|-||...
T Consensus         2 ~~YIV~fK~~~~~~-----~~~~~~~~v~~~-------gg~v~~~~~-~i~gfs~~l~~~~~~~L~~~p~V~yVE~D~v~   68 (71)
T d1scjb_           2 KKYIVGFKQTMSAM-----SSAKKKDVISQK-------GGKVEKQFK-YVNAAAATLDEKAVKELKKDPSVAYVEEDHIA   68 (71)
T ss_dssp             EEEEEEECSSSSCC-----SHHHHHHHHHTT-------TCEEEEECS-SSSEEEEEECHHHHHHHHTSTTEEEEEECCEE
T ss_pred             CcEEEEECCCCChH-----HHHHHHHHHHHc-------CCeEEEEEe-ecceEEEEeCHHHHHHHHcCCCceEEeCCcEE
Confidence            57999998765433     233455444322       235888886 69999999999999999999999999999865


Q ss_pred             C
Q 029233          155 G  155 (197)
Q Consensus       155 ~  155 (197)
                      +
T Consensus        69 ~   69 (71)
T d1scjb_          69 H   69 (71)
T ss_dssp             E
T ss_pred             E
Confidence            4

PyMOL of d1scjb_

>d1v5ib1 d.58.3.2 (B:1-72) Proteinase A inhibitor 1, POIA1 {Oyster mushroom (Pleurotus ostreatus) [TaxId: 5322]}	Back information, alignment and structure

>d1scjb_ d.58.3.2 (B:) Subtilisin prosegment {Bacillus subtilis [TaxId: 1423]}	Back information, alignment and structure

>d1v5ib1 d.58.3.2 (B:1-72) Proteinase A inhibitor 1, POIA1 {Oyster mushroom (Pleurotus ostreatus) [TaxId: 5322]}	Back information, alignment and structure