Citrus Sinensis ID: 029233


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------
MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ
ccEEEEEccccEEEEEEEcHHHHHHHcccccEEEEcccccccccccccccccccccccccccccccccccccccEEEEEEEEccccccccHHHHHHHHHHHHHHHHccHHHHHHHHEEEEEEEEEEEEEEEcHHHHHHHccccccEEEcccccccccccccccccccccccccccccccccccccccEEEEEccccc
cEEEEEccccccccEccccHHHHHHHcccccEEEEEcccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccHHHHHHHHHHHHHHHHccHHHHHHHHHHEcccccccEEEEEcHHHHHHHcccccEEEEEccccccccccccccccccccEEccccccccccccccccEEEEEccccc
mciydascdthfgfccnvdeETSRELArlpgvltvrpdpdynsvkkdyscldiqmsnpwnsqigttllfplgttkhwlvrmdkpgvgvvtKAQMVDRYAQILTKVLGNEKDAQMCIYHVSwqsnfgfccelDDECAQeltgvpgvlsvqpdenfgsenkdyggnnlqnsmvpsdsseasptqiktKKLFVTGIQWTQ
mciydascdthFGFCCNVDEETSRELarlpgvltvrpdpdynsVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPsdsseasptqiktkklfvtgiqwtq
MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ
*CIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVL*****************************************LFVTGI****
*CIYDASCDTHFGFCCNVDEE*******************************************TTLLFPLGTTKHWLVRMDKPG***VTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDEC****TGVPGVLSVQPDENFGSENKDYGGNNLQNSM******************FVTGIQWT*
MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNS************QIKTKKLFVTGIQWTQ
MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDP***********************IGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQIKTKKLFVTGIQWTQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query197 2.2.26 [Sep-21-2011]
Q38732230 DAG protein, chloroplasti N/A no 0.619 0.530 0.341 7e-15
Q9LKA5 395 Uncharacterized protein A no no 0.446 0.222 0.337 1e-08
>sp|Q38732|DAG_ANTMA DAG protein, chloroplastic OS=Antirrhinum majus GN=DAG PE=2 SV=1 Back     alignment and function desciption
 Score = 80.5 bits (197), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 66/123 (53%), Gaps = 1/123 (0%)

Query: 66  TLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNF 125
           T++ P     HWL+ M+ P     T+ QM+D Y   L  VLG+ ++A+  +Y  S  +  
Sbjct: 78  TIMLPGCDYNHWLIVMEFPKDPAPTREQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYT 137

Query: 126 GFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQN-SMVPSDSSEASPTQIK 184
           GF C + +E +++  G+PGVL V PD     +NKDYGG+   N  ++P       P Q +
Sbjct: 138 GFQCTVTEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYVNGEIIPCQYPTYQPKQSR 197

Query: 185 TKK 187
           + K
Sbjct: 198 SSK 200




Acts very early in chloroplast development, being required for expression of RNA polymerase beta subunit gene, and hence indirectly for subsequent expression of CAB and RBCS genes.
Antirrhinum majus (taxid: 4151)
>sp|Q9LKA5|UMP1_ARATH Uncharacterized protein At3g15000, mitochondrial OS=Arabidopsis thaliana GN=At3g15000 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query197
224110196 311 predicted protein [Populus trichocarpa] 0.984 0.623 0.75 7e-82
224097630 333 predicted protein [Populus trichocarpa] 0.954 0.564 0.740 3e-79
225464121 409 PREDICTED: uncharacterized protein LOC10 0.989 0.476 0.719 9e-79
296087958 315 unnamed protein product [Vitis vinifera] 0.989 0.619 0.719 3e-78
147867424 408 hypothetical protein VITISV_001132 [Viti 0.989 0.477 0.591 5e-73
449525960 394 PREDICTED: uncharacterized LOC101208465 0.989 0.494 0.653 8e-72
449446458 394 PREDICTED: uncharacterized protein LOC10 0.989 0.494 0.658 1e-71
255568010 371 RNA binding protein, putative [Ricinus c 0.822 0.436 0.773 4e-70
356576299 386 PREDICTED: uncharacterized protein LOC10 0.969 0.494 0.663 2e-68
9293999 396 unnamed protein product [Arabidopsis tha 0.923 0.459 0.594 2e-60
>gi|224110196|ref|XP_002315444.1| predicted protein [Populus trichocarpa] gi|222864484|gb|EEF01615.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  308 bits (790), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 147/196 (75%), Positives = 168/196 (85%), Gaps = 2/196 (1%)

Query: 1   MCIYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWN 60
           MCIYDA  DTHFGFCC++DE+ S ELARLPGVL+VRPDPDYNSV+KDYS   +++S   N
Sbjct: 34  MCIYDACYDTHFGFCCDIDEDASLELARLPGVLSVRPDPDYNSVEKDYSS-GVKLSTLSN 92

Query: 61  SQIGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVS 120
            QIG+ LLFP G TKHWLVR+DKPGVGVVTKAQMVD YAQILTKV+G EKDAQMCIYHVS
Sbjct: 93  PQIGSKLLFPSGNTKHWLVRIDKPGVGVVTKAQMVDYYAQILTKVMGYEKDAQMCIYHVS 152

Query: 121 WQSNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASP 180
           WQSNFGFCCELD+ECAQEL GVPGVLSV PD++F SENKDY G++L NS  P DSSEAS 
Sbjct: 153 WQSNFGFCCELDEECAQELAGVPGVLSVLPDKDFESENKDYRGDSLINSANPPDSSEASQ 212

Query: 181 -TQIKTKKLFVTGIQW 195
            T ++TKKLF+TG+ +
Sbjct: 213 ITHVRTKKLFITGLSF 228




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224097630|ref|XP_002311019.1| predicted protein [Populus trichocarpa] gi|222850839|gb|EEE88386.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225464121|ref|XP_002264164.1| PREDICTED: uncharacterized protein LOC100265361 [Vitis vinifera] Back     alignment and taxonomy information
>gi|296087958|emb|CBI35241.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|147867424|emb|CAN83272.1| hypothetical protein VITISV_001132 [Vitis vinifera] Back     alignment and taxonomy information
>gi|449525960|ref|XP_004169984.1| PREDICTED: uncharacterized LOC101208465 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449446458|ref|XP_004140988.1| PREDICTED: uncharacterized protein LOC101208465 [Cucumis sativus] Back     alignment and taxonomy information
>gi|255568010|ref|XP_002524982.1| RNA binding protein, putative [Ricinus communis] gi|223535726|gb|EEF37389.1| RNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|356576299|ref|XP_003556270.1| PREDICTED: uncharacterized protein LOC100791652 [Glycine max] Back     alignment and taxonomy information
>gi|9293999|dbj|BAB01902.1| unnamed protein product [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query197
TAIR|locus:2088756374 AT3G20930 [Arabidopsis thalian 0.913 0.481 0.601 1.5e-58
TAIR|locus:2206639229 AT1G32580 "AT1G32580" [Arabido 0.527 0.454 0.471 5.5e-22
TAIR|locus:2051003219 DAL1 "differentiation and gree 0.558 0.502 0.472 1.5e-21
TAIR|locus:2063389232 MORF6 "multiple organellar RNA 0.532 0.452 0.462 4.4e-20
TAIR|locus:2200131232 MORF9 "multiple organellar RNA 0.710 0.603 0.342 6e-16
UNIPROTKB|Q2R8U1 374 Os11g0216400 "Os11g0216400 pro 0.527 0.278 0.314 8.4e-12
TAIR|locus:2083348244 MORF3 "multiple organellar RNA 0.761 0.614 0.282 6.4e-11
TAIR|locus:2086310 395 RIP1 "RNA-editing factor inter 0.593 0.296 0.322 2e-10
TAIR|locus:2030200192 AT1G72530 "AT1G72530" [Arabido 0.522 0.536 0.307 1.1e-07
TAIR|locus:2119782 419 MORF1 "multiple organellar RNA 0.583 0.274 0.270 1.5e-05
TAIR|locus:2088756 AT3G20930 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 116/193 (60%), Positives = 146/193 (75%)

Query:     3 IYDASCDTHFGFCCNVDEETSRELARLPGVLTVRPDPDYNSVKKDYSCLDIQMSNPWNSQ 62
             IYDAS DTHFGFCC++DE+ SR+LA LPGV+++RP+ DY+S KK+Y            S 
Sbjct:   112 IYDASFDTHFGFCCHIDEDASRQLASLPGVVSIRPEQDYSSEKKNYGI---------GSH 162

Query:    63 IGTTLLFPLGTTKHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQ 122
              G +L F  GT KHW+VR+DKPGVG+VTKAQMVD   Q+L+KVL NEKDAQMC+YHVSWQ
Sbjct:   163 KGVSL-FDHGTVKHWMVRIDKPGVGIVTKAQMVDHCVQLLSKVLWNEKDAQMCLYHVSWQ 221

Query:   123 SNFGFCCELDDECAQELTGVPGVLSVQPDENFGSENKDYGGNNLQNSMVPSDSSEASPTQ 182
             S+FGFCC+LD+  A EL GVPGVL+V PD +F S NKDY G++ Q+S    D SE+ P  
Sbjct:   222 SDFGFCCDLDERSAVELAGVPGVLAVVPDNSFESLNKDYEGDSTQDSR-DQDDSESPP-- 278

Query:   183 IKTKKLFVTGIQW 195
             +KTKKLF+TG+ +
Sbjct:   279 VKTKKLFITGLSF 291


GO:0000166 "nucleotide binding" evidence=IEA
GO:0003676 "nucleic acid binding" evidence=IEA
GO:0003723 "RNA binding" evidence=ISS
GO:0008150 "biological_process" evidence=ND
GO:0000023 "maltose metabolic process" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0019252 "starch biosynthetic process" evidence=RCA
TAIR|locus:2206639 AT1G32580 "AT1G32580" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051003 DAL1 "differentiation and greening-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2063389 MORF6 "multiple organellar RNA editing factor 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2200131 MORF9 "multiple organellar RNA editing factor 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q2R8U1 Os11g0216400 "Os11g0216400 protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2083348 MORF3 "multiple organellar RNA editing factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2086310 RIP1 "RNA-editing factor interacting protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030200 AT1G72530 "AT1G72530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2119782 MORF1 "multiple organellar RNA editing factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 197
PF0592282 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP 99.4
PF0592282 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP 95.89
>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively Back     alignment and domain information
Probab=99.40  E-value=3.9e-13  Score=95.32  Aligned_cols=77  Identities=18%  Similarity=0.242  Sum_probs=57.8

Q ss_pred             eEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCc----hhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCC
Q 029233           76 HWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNE----KDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPD  151 (197)
Q Consensus        76 ~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~----e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD  151 (197)
                      .|||.|+.+....    ...+.|.+|+++++.+.    ......++|.|...++||+|+|+++++++|+++|+|.+|.||
T Consensus         1 ~YIV~~k~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~Gfs~~l~~~~i~~L~~~p~V~~Ve~D   76 (82)
T PF05922_consen    1 RYIVVFKDDASAA----SSFSSHKSWQASILKSALKSASSINAKVLYSYDNAFNGFSAKLSEEEIEKLRKDPGVKSVEPD   76 (82)
T ss_dssp             EEEEEE-TTSTHH----CHHHHHHHHHH----HHHHTH-TTT-EEEEEESSTSSEEEEEE-HHHHHHHHTSTTEEEEEEE
T ss_pred             CEEEEECCCCCcc----hhHHHHHHHHHHHHhhhhhhhcccCCceEEEEeeeEEEEEEEeCHHHHHHHHcCCCeEEEEeC
Confidence            5999999875433    25778888888776542    234567999999999999999999999999999999999999


Q ss_pred             CCcCC
Q 029233          152 ENFGS  156 (197)
Q Consensus       152 ~~~~~  156 (197)
                      ..+++
T Consensus        77 ~~v~l   81 (82)
T PF05922_consen   77 QVVSL   81 (82)
T ss_dssp             CEEEE
T ss_pred             ceEec
Confidence            88754



In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Limited proteolysis of most large protein precursors is carried out in vivo by the subtilisin-like pro-protein convertases. Many important biological processes such as peptide hormone synthesis, viral protein processing and receptor maturation involve proteolytic processing by these enzymes []. The subtilisin-serine protease (SRSP) family hormone and pro-protein convertases (furin, PC1/3, PC2, PC4, PACE4, PC5/6, and PC7/7/LPC) act within the secretory pathway to cleave polypeptide precursors at specific basic sites, generating their biologically active forms. Serum proteins, pro-hormones, receptors, zymogens, viral surface glycoproteins, bacterial toxins, amongst others, are activated by this route []. The SRSPs share the same domain structure, including a signal peptide, the pro-peptide, the catalytic domain, the P/middle or homo B domain, and the C terminus. Proteinase propeptide inhibitors (sometimes refered to as activation peptides) are responsible for the modulation of folding and activity of the pro-enzyme or zymogen. The pro-segment docks into the enzyme moiety shielding the substrate binding site, thereby promoting inhibition of the enzyme. Several such propeptides share a similar topology [], despite often low sequence identities []. The propeptide region has an open-sandwich antiparallel-alpha/antiparallel-beta fold, with two alpha-helices and four beta-strands with a (beta/alpha/beta)x2 topology. This group of sequences contain the propeptide domain at the N terminus of peptidases belonging to MEROPS family S8A, subtilisins. A number of the members of this group of sequences belong to MEROPS inhibitor family I9, clan I-. The propeptide is removed by proteolytic cleavage; removal activating the enzyme.; GO: 0004252 serine-type endopeptidase activity, 0042802 identical protein binding, 0043086 negative regulation of catalytic activity; PDB: 3CNQ_P 1SPB_P 3CO0_P 1ITP_A 1V5I_B 1SCJ_B 3P5B_P 2XTJ_P 2W2M_P 2P4E_P ....

>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query197
2w2n_P114 Proprotein convertase subtilisin/kexin type 9; hyd 99.59
2qtw_A124 Proprotein convertase subtilisin/kexin type 9 Pro; 99.18
3cnq_P80 Subtilisin BPN'; uncleaved, proenzyme, substrate c 99.11
1v5i_B76 POIA1, IA-1=serine proteinase inhibitor; protease- 98.46
2p4e_P 692 Proprotein convertase subtilisin/kexin type 9; pro 97.94
3t41_A 471 Epidermin leader peptide processing serine protea; 96.3
3afg_A 539 Subtilisin-like serine protease; propeptide, therm 96.22
3cnq_P80 Subtilisin BPN'; uncleaved, proenzyme, substrate c 95.73
2z30_B65 TK-subtilisin; thermococcus kodakaraensis, hydrola 95.56
2z2z_A 395 TK-subtilisin precursor; thermococcus kodakaraensi 94.59
2w2n_P114 Proprotein convertase subtilisin/kexin type 9; hyd 94.57
2z30_B65 TK-subtilisin; thermococcus kodakaraensis, hydrola 93.67
2qtw_A124 Proprotein convertase subtilisin/kexin type 9 Pro; 93.08
1v5i_B76 POIA1, IA-1=serine proteinase inhibitor; protease- 86.82
>2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P Back     alignment and structure
Probab=99.59  E-value=7e-16  Score=118.49  Aligned_cols=76  Identities=16%  Similarity=0.137  Sum_probs=64.7

Q ss_pred             eeEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCchhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCCCCc
Q 029233           75 KHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENF  154 (197)
Q Consensus        75 ~~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD~~~  154 (197)
                      +.|||+|+....    + ..++.|++||++++++ +.+...|.|+|++.|+||+|+||++|+++|+++|+|++|+||+.+
T Consensus        38 ~~YIV~lk~~~~----~-~~~~~h~~~l~s~~~~-~~~~~~i~~sY~~~~~GFaa~Lt~~~~~~L~~~P~V~~VE~D~~v  111 (114)
T 2w2n_P           38 GTYVVVLKEETH----L-SQSERTARRLQAQAAR-RGYLTKILHVFHGLLPGFLVKMSGDLLELALKLPHVDYIEEDSSV  111 (114)
T ss_dssp             EEEEEEECTTCC----H-HHHHHHHHHHHHHHHH-TTCCCEEEEEECSSSSEEEEECCGGGHHHHHTSTTEEEEEEEEEE
T ss_pred             CcEEEEECCCCC----H-HHHHHHHHHHHHHhhh-cccCCceEEEecccceEEEEEcCHHHHHHHHcCCCccEEEeCceE
Confidence            589999986442    2 2567999999999865 335668999999999999999999999999999999999999876


Q ss_pred             CC
Q 029233          155 GS  156 (197)
Q Consensus       155 ~~  156 (197)
                      +.
T Consensus       112 ~~  113 (114)
T 2w2n_P          112 FA  113 (114)
T ss_dssp             EE
T ss_pred             ec
Confidence            53



>2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P Back     alignment and structure
>3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B Back     alignment and structure
>1v5i_B POIA1, IA-1=serine proteinase inhibitor; protease-inhibitor complex, subtilisin, hydrolase-Pro binding complex; 1.50A {Pleurotus ostreatus} SCOP: d.58.3.2 PDB: 1itp_A Back     alignment and structure
>2p4e_P Proprotein convertase subtilisin/kexin type 9; protease, LDL receptor, LDL, endocytosis, hydrol; 1.98A {Homo sapiens} Back     alignment and structure
>3t41_A Epidermin leader peptide processing serine protea; structural genomics, center for structural genomics of infec diseases, csgid; 1.95A {Staphylococcus aureus} PDB: 3qfh_A Back     alignment and structure
>3afg_A Subtilisin-like serine protease; propeptide, thermococcus kodakaraensis, hydrolas protease; 2.00A {Thermococcus kodakarensis} Back     alignment and structure
>3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B Back     alignment and structure
>2z30_B TK-subtilisin; thermococcus kodakaraensis, hydrolase; 1.65A {Thermococcus kodakarensis} PDB: 2z2y_B 3a3p_B 2z56_B 2z58_B 2z57_B 3a3n_B 3a3o_B Back     alignment and structure
>2z2z_A TK-subtilisin precursor; thermococcus kodakaraensis, hydrolase; 1.87A {Thermococcus kodakarensis} PDB: 2e1p_A 2zwp_A 2zwo_A Back     alignment and structure
>2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P Back     alignment and structure
>2z30_B TK-subtilisin; thermococcus kodakaraensis, hydrolase; 1.65A {Thermococcus kodakarensis} PDB: 2z2y_B 3a3p_B 2z56_B 2z58_B 2z57_B 3a3n_B 3a3o_B Back     alignment and structure
>2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P Back     alignment and structure
>1v5i_B POIA1, IA-1=serine proteinase inhibitor; protease-inhibitor complex, subtilisin, hydrolase-Pro binding complex; 1.50A {Pleurotus ostreatus} SCOP: d.58.3.2 PDB: 1itp_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query197
d1scjb_71 Subtilisin prosegment {Bacillus subtilis [TaxId: 1 98.41
d1v5ib172 Proteinase A inhibitor 1, POIA1 {Oyster mushroom ( 98.39
d1scjb_71 Subtilisin prosegment {Bacillus subtilis [TaxId: 1 92.55
d1v5ib172 Proteinase A inhibitor 1, POIA1 {Oyster mushroom ( 87.45
>d1scjb_ d.58.3.2 (B:) Subtilisin prosegment {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Ferredoxin-like
superfamily: Protease propeptides/inhibitors
family: Subtilase propeptides/inhibitors
domain: Subtilisin prosegment
species: Bacillus subtilis [TaxId: 1423]
Probab=98.41  E-value=3e-07  Score=63.61  Aligned_cols=68  Identities=15%  Similarity=0.138  Sum_probs=51.9

Q ss_pred             eeEEEEecCCCCCCcchhhHHHHHHHHHHHHhCCchhcccceEEEeecceeeEEeEeCHHHHHHhcCCCCeEEEcCCCCc
Q 029233           75 KHWLVRMDKPGVGVVTKAQMVDRYAQILTKVLGNEKDAQMCIYHVSWQSNFGFCCELDDECAQELTGVPGVLSVQPDENF  154 (197)
Q Consensus        75 ~~YIV~M~~p~~~~~~~~~~~~sH~q~LasVLgS~e~Ak~sI~ysYt~~f~GFAA~LTeeeA~kLs~lPGVVSVfPD~~~  154 (197)
                      +.|||.+.......     ..+.+.+++.+.       -.++.+.|. .++||+|+|+++++++|++.|+|..|-||...
T Consensus         2 ~~YIV~fK~~~~~~-----~~~~~~~~v~~~-------gg~v~~~~~-~i~gfs~~l~~~~~~~L~~~p~V~yVE~D~v~   68 (71)
T d1scjb_           2 KKYIVGFKQTMSAM-----SSAKKKDVISQK-------GGKVEKQFK-YVNAAAATLDEKAVKELKKDPSVAYVEEDHIA   68 (71)
T ss_dssp             EEEEEEECSSSSCC-----SHHHHHHHHHTT-------TCEEEEECS-SSSEEEEEECHHHHHHHHTSTTEEEEEECCEE
T ss_pred             CcEEEEECCCCChH-----HHHHHHHHHHHc-------CCeEEEEEe-ecceEEEEeCHHHHHHHHcCCCceEEeCCcEE
Confidence            57999998765433     233455444322       235888886 69999999999999999999999999999865


Q ss_pred             C
Q 029233          155 G  155 (197)
Q Consensus       155 ~  155 (197)
                      +
T Consensus        69 ~   69 (71)
T d1scjb_          69 H   69 (71)
T ss_dssp             E
T ss_pred             E
Confidence            4



>d1v5ib1 d.58.3.2 (B:1-72) Proteinase A inhibitor 1, POIA1 {Oyster mushroom (Pleurotus ostreatus) [TaxId: 5322]} Back     information, alignment and structure
>d1scjb_ d.58.3.2 (B:) Subtilisin prosegment {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
>d1v5ib1 d.58.3.2 (B:1-72) Proteinase A inhibitor 1, POIA1 {Oyster mushroom (Pleurotus ostreatus) [TaxId: 5322]} Back     information, alignment and structure