Citrus Sinensis ID: 025802


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------25
MAHLAARRTVASIITRTLTSPRSRLAIPILNKQQPQIGPDPICNPARFKTSGSSYSPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQYRFTERQQRPRTRRRETTQADRRRLWAQNQSAPSQQPTSMSNQNPAQAGGTNFSINQGQNNQKSA
cHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEcccccccccHHHHHHHHHHHHHHHHccHHHHHccEEEEEEEEEEEEcEEEcHHHHHHHcccccEEEEcccccccccccccccccccccEEEccccccHHHHccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccc
cHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEcccccccEEEEEEcccccccccHHHHHHHHHHHHHHHHccHHHHHHHHHHEcccccccEEEEEcHHHHHHHcccccEEEEEccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc
MAHLAARRTVASIITRtltsprsrlaipilnkqqpqigpdpicnparfktsgssysplndpspnwsnrppketimldgcdyqHWLIVmefpnpselseEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEElsykvkgqpgvlwvlpdsyidvpnkdyggdlfvdgkvihrpqyrfterqqrprtrrretTQADRRRLWaqnqsapsqqptsmsnqnpaqaggtnfsinqgqnnqksa
mahlaarrtvasiitrtltsprsrlaipilnkqqpqigpdPICNPARFKTSGssysplndpspnwsNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVgseeeakkkiysVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFvdgkvihrpqyrfterqqrprtrrrettqadrrrlwaqnqsapsqqptsmSNQNPAQAGGtnfsinqgqnnqksa
MAHLAARRTVASIITRTLTSPRSRLAIPILNKQQPQIGPDPICNPARFKTSGSSYSPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQYrfterqqrprtrrrettqadrrrLWAQNQSAPSQQPTSMSNQNPAQAGGTNFSINQGQNNQKSA
**********ASIITRTL*******AIPI*******************************************TIMLDGCDYQHWLIVMEFPNP****EEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQYRF**************************************************************
**************************************************************************MLDGCDYQHWLIVMEFPN****SEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGK***********************************************************************
********TVASIITRTLTSPRSRLAIPILNKQQPQIGPDPICNPARFKTS*************WSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQYRFTE*****************************************QAGGTNFSINQ********
********TVASIITRTL*****RLAI*******PQIGP****************************RPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQY****************************************************************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAHLAARRTVASIITRTLTSPRSRLAIPILNKQQPQIGPDPICNPARFKTSGSSYSPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVDGKVIHRPQYRFTERQQRPRTRRRETTQADRRRLWAQNQSAPSQQPTSMSNQNPAQAGGTNFSINQGQNNQKSA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query248 2.2.26 [Sep-21-2011]
Q9LKA5 395 Uncharacterized protein A no no 0.520 0.326 0.610 3e-43
Q38732230 DAG protein, chloroplasti N/A no 0.544 0.586 0.549 4e-38
>sp|Q9LKA5|UMP1_ARATH Uncharacterized protein At3g15000, mitochondrial OS=Arabidopsis thaliana GN=At3g15000 PE=1 SV=1 Back     alignment and function desciption
 Score =  175 bits (443), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 80/131 (61%), Positives = 105/131 (80%), Gaps = 2/131 (1%)

Query: 56  SPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGS 115
           S LNDP+PNWSNRPPKETI+LDGCD++HWL+V+E P   E + +E+I++Y+KTLA +VGS
Sbjct: 72  SSLNDPNPNWSNRPPKETILLDGCDFEHWLVVVEPPQ-GEPTRDEIIDSYIKTLAQIVGS 130

Query: 116 EEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGGDLFVD 175
           E+EA+ KIYSV T  Y  FGAL+ E+LS+K+K    V WVLPDSY+DV NKDYGG+ F+D
Sbjct: 131 EDEARMKIYSVSTRCYYAFGALVSEDLSHKLKELSNVRWVLPDSYLDVRNKDYGGEPFID 190

Query: 176 GKVI-HRPQYR 185
           GK + + P+Y 
Sbjct: 191 GKAVPYDPKYH 201





Arabidopsis thaliana (taxid: 3702)
>sp|Q38732|DAG_ANTMA DAG protein, chloroplastic OS=Antirrhinum majus GN=DAG PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query248
225431796260 PREDICTED: uncharacterized protein At3g1 0.963 0.919 0.642 1e-84
255551847262 DAG protein, chloroplast precursor, puta 0.987 0.935 0.647 3e-83
449516421277 PREDICTED: uncharacterized protein At3g1 0.983 0.880 0.624 3e-83
449465561277 PREDICTED: uncharacterized protein At3g1 0.983 0.880 0.620 8e-83
224110350261 predicted protein [Populus trichocarpa] 0.983 0.934 0.617 5e-82
118483610261 unknown [Populus trichocarpa] 0.983 0.934 0.613 4e-81
388493544263 unknown [Medicago truncatula] 0.963 0.908 0.6 5e-76
357134159250 PREDICTED: DAG protein, chloroplastic-li 0.927 0.92 0.625 3e-75
388498556263 unknown [Medicago truncatula] 0.963 0.908 0.596 7e-75
356564316247 PREDICTED: uncharacterized protein At3g1 0.935 0.939 0.604 3e-74
>gi|225431796|ref|XP_002272388.1| PREDICTED: uncharacterized protein At3g15000, mitochondrial [Vitis vinifera] gi|147819172|emb|CAN69219.1| hypothetical protein VITISV_012015 [Vitis vinifera] gi|296083326|emb|CBI22962.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  318 bits (816), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 167/260 (64%), Positives = 191/260 (73%), Gaps = 21/260 (8%)

Query: 1   MAHLAARRTVASIITRTLTSPRS----------RLAIPILNKQQPQIGPDPICNPARFKT 50
           MA    RRT+A  ++RTL+   S          R A  +++K  P +  +    P R KT
Sbjct: 1   MALFTGRRTLARFLSRTLSQSFSSSSLLASSRSRFAFALIDKHSPPLVSNSARVPTRLKT 60

Query: 51  SGSSYSPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAYVKTLA 110
           SGS YSPLNDPSPNWSNRPPKETI+LDGCDY+HWLIVMEFPN S+ SE+EMI AYVKTLA
Sbjct: 61  SGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEFPNDSKPSEDEMIAAYVKTLA 120

Query: 111 AVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPNKDYGG 170
           AVVGSEEEAKKKIYSVCTTTYTGFGALI EELSYKVK  PGVLWVLPDSY+DVPNKDYGG
Sbjct: 121 AVVGSEEEAKKKIYSVCTTTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGG 180

Query: 171 DLFVDGKVIHRPQYRFTERQ------QRPRTRRRETTQADRRRL-----WAQNQSAPSQQ 219
           DLF+DGKVIHRPQYR+ ERQ      +    RRRET Q +RR       WAQ++  P  Q
Sbjct: 181 DLFIDGKVIHRPQYRYNERQPTRSRPRPRYDRRRETMQVERREPMQRDNWAQDRREPMHQ 240

Query: 220 PTSMSNQNPAQAGGTNFSIN 239
           PTSM++QN  Q GG + S+N
Sbjct: 241 PTSMNDQNSPQGGGRDPSLN 260




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255551847|ref|XP_002516969.1| DAG protein, chloroplast precursor, putative [Ricinus communis] gi|223544057|gb|EEF45583.1| DAG protein, chloroplast precursor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449516421|ref|XP_004165245.1| PREDICTED: uncharacterized protein At3g15000, mitochondrial-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449465561|ref|XP_004150496.1| PREDICTED: uncharacterized protein At3g15000, mitochondrial-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224110350|ref|XP_002315492.1| predicted protein [Populus trichocarpa] gi|222864532|gb|EEF01663.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118483610|gb|ABK93700.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388493544|gb|AFK34838.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|357134159|ref|XP_003568685.1| PREDICTED: DAG protein, chloroplastic-like [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|388498556|gb|AFK37344.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|356564316|ref|XP_003550401.1| PREDICTED: uncharacterized protein At3g15000, mitochondrial-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query248
TAIR|locus:2083348244 MORF3 "multiple organellar RNA 0.729 0.741 0.658 6e-64
TAIR|locus:2086310 395 RIP1 "RNA-editing factor inter 0.536 0.336 0.602 6.7e-45
TAIR|locus:2051003219 DAL1 "differentiation and gree 0.524 0.593 0.590 7e-38
TAIR|locus:2206639229 AT1G32580 "AT1G32580" [Arabido 0.729 0.790 0.461 8.9e-38
TAIR|locus:2063389232 MORF6 "multiple organellar RNA 0.544 0.581 0.583 1.9e-37
TAIR|locus:2200131232 MORF9 "multiple organellar RNA 0.487 0.521 0.622 1.3e-36
UNIPROTKB|Q2R8U1 374 Os11g0216400 "Os11g0216400 pro 0.701 0.465 0.424 2.3e-30
TAIR|locus:2119782 419 MORF1 "multiple organellar RNA 0.451 0.267 0.5 6.8e-30
TAIR|locus:2156344 723 MORF4 "AT5G44780" [Arabidopsis 0.528 0.181 0.439 1.9e-25
TAIR|locus:2030200192 AT1G72530 "AT1G72530" [Arabido 0.495 0.640 0.455 2e-24
TAIR|locus:2083348 MORF3 "multiple organellar RNA editing factor 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 652 (234.6 bits), Expect = 6.0e-64, P = 6.0e-64
 Identities = 131/199 (65%), Positives = 157/199 (78%)

Query:     1 MAHLAARRTVASIITRTLTS--------P----RSRLAIPILNK---QQPQIGPDPICNP 45
             MA ++ RRT+++++ +TL+S        P    RSR A+P++ K    +  +GP  I   
Sbjct:     1 MALISTRRTLSTLLNKTLSSSTSYSSSFPTLSSRSRFAMPLIEKVSSSRTSLGPCYIST- 59

Query:    46 ARFKTSGSSYSPLNDPSPNWSNRPPKETIMLDGCDYQHWLIVMEFPNPSELSEEEMINAY 105
              R KTSGS YSPLNDPSPNWSNRPPKETI+LDGCDY+HWLIVMEF +P   +EEEMIN+Y
Sbjct:    60 -RPKTSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEFTDPKP-TEEEMINSY 117

Query:   106 VKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYIDVPN 165
             VKTL +V+G +EEAKKKIYSVCT+TYTGFGALI EELS KVK  PGVLWVLPDSY+DVPN
Sbjct:   118 VKTLTSVLGWQEEAKKKIYSVCTSTYTGFGALISEELSCKVKALPGVLWVLPDSYLDVPN 177

Query:   166 KDYGGDLFVDGKVIHRPQY 184
             KDYGGDL+V+GKVI RPQY
Sbjct:   178 KDYGGDLYVEGKVIPRPQY 196




GO:0009507 "chloroplast" evidence=ISM
GO:0005739 "mitochondrion" evidence=IDA
GO:0050897 "cobalt ion binding" evidence=IDA
GO:1900864 "mitochondrial RNA modification" evidence=IMP
TAIR|locus:2086310 RIP1 "RNA-editing factor interacting protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051003 DAL1 "differentiation and greening-like 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2206639 AT1G32580 "AT1G32580" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2063389 MORF6 "multiple organellar RNA editing factor 6" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2200131 MORF9 "multiple organellar RNA editing factor 9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q2R8U1 Os11g0216400 "Os11g0216400 protein" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
TAIR|locus:2119782 MORF1 "multiple organellar RNA editing factor 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2156344 MORF4 "AT5G44780" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2030200 AT1G72530 "AT1G72530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00033046001
SubName- Full=Putative uncharacterized protein (Chromosome chr5 scaffold_64, whole genome shotgun sequence); (260 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 248
PF0592282 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IP 99.46
>PF05922 Inhibitor_I9: Peptidase inhibitor I9; InterPro: IPR010259 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively Back     alignment and domain information
Probab=99.46  E-value=8.5e-14  Score=101.78  Aligned_cols=78  Identities=21%  Similarity=0.331  Sum_probs=55.3

Q ss_pred             eEEEEecCCCCCCCChHHHHHHHHHHHHHhhCC----hhhhhcceeEEecCcceeeeeecCHHHHHHhhCCCCeEEEeCC
Q 025802           83 HWLIVMEFPNPSELSEEEMINAYVKTLAAVVGS----EEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPD  158 (248)
Q Consensus        83 tYIV~M~~p~~p~~s~~~~~~sh~s~LaSVLgS----ee~Ak~~ILYSYt~afnGFAArLTeEEAekLk~lPGVVSVfPD  158 (248)
                      +|||.|..+..    .....++|.+|+.+++.+    .......++|+|+.+|+||+|+||++++++|+++|+|.+|.||
T Consensus         1 ~YIV~~k~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~Gfs~~l~~~~i~~L~~~p~V~~Ve~D   76 (82)
T PF05922_consen    1 RYIVVFKDDAS----AASSFSSHKSWQASILKSALKSASSINAKVLYSYDNAFNGFSAKLSEEEIEKLRKDPGVKSVEPD   76 (82)
T ss_dssp             EEEEEE-TTST----HHCHHHHHHHHHH----HHHHTH-TTT-EEEEEESSTSSEEEEEE-HHHHHHHHTSTTEEEEEEE
T ss_pred             CEEEEECCCCC----cchhHHHHHHHHHHHHhhhhhhhcccCCceEEEEeeeEEEEEEEeCHHHHHHHHcCCCeEEEEeC
Confidence            59999997642    122345555555544332    1234567999999999999999999999999999999999999


Q ss_pred             CCcCCC
Q 025802          159 SYIDVP  164 (248)
Q Consensus       159 s~~qLH  164 (248)
                      ..++||
T Consensus        77 ~~v~l~   82 (82)
T PF05922_consen   77 QVVSLH   82 (82)
T ss_dssp             CEEEE-
T ss_pred             ceEecC
Confidence            999886



In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Limited proteolysis of most large protein precursors is carried out in vivo by the subtilisin-like pro-protein convertases. Many important biological processes such as peptide hormone synthesis, viral protein processing and receptor maturation involve proteolytic processing by these enzymes []. The subtilisin-serine protease (SRSP) family hormone and pro-protein convertases (furin, PC1/3, PC2, PC4, PACE4, PC5/6, and PC7/7/LPC) act within the secretory pathway to cleave polypeptide precursors at specific basic sites, generating their biologically active forms. Serum proteins, pro-hormones, receptors, zymogens, viral surface glycoproteins, bacterial toxins, amongst others, are activated by this route []. The SRSPs share the same domain structure, including a signal peptide, the pro-peptide, the catalytic domain, the P/middle or homo B domain, and the C terminus. Proteinase propeptide inhibitors (sometimes refered to as activation peptides) are responsible for the modulation of folding and activity of the pro-enzyme or zymogen. The pro-segment docks into the enzyme moiety shielding the substrate binding site, thereby promoting inhibition of the enzyme. Several such propeptides share a similar topology [], despite often low sequence identities []. The propeptide region has an open-sandwich antiparallel-alpha/antiparallel-beta fold, with two alpha-helices and four beta-strands with a (beta/alpha/beta)x2 topology. This group of sequences contain the propeptide domain at the N terminus of peptidases belonging to MEROPS family S8A, subtilisins. A number of the members of this group of sequences belong to MEROPS inhibitor family I9, clan I-. The propeptide is removed by proteolytic cleavage; removal activating the enzyme.; GO: 0004252 serine-type endopeptidase activity, 0042802 identical protein binding, 0043086 negative regulation of catalytic activity; PDB: 3CNQ_P 1SPB_P 3CO0_P 1ITP_A 1V5I_B 1SCJ_B 3P5B_P 2XTJ_P 2W2M_P 2P4E_P ....


Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query248
2w2n_P114 Proprotein convertase subtilisin/kexin type 9; hyd 99.51
3cnq_P80 Subtilisin BPN'; uncleaved, proenzyme, substrate c 99.17
2qtw_A124 Proprotein convertase subtilisin/kexin type 9 Pro; 99.11
1v5i_B76 POIA1, IA-1=serine proteinase inhibitor; protease- 98.29
2p4e_P 692 Proprotein convertase subtilisin/kexin type 9; pro 97.79
3t41_A 471 Epidermin leader peptide processing serine protea; 96.88
3afg_A 539 Subtilisin-like serine protease; propeptide, therm 96.23
2z2z_A 395 TK-subtilisin precursor; thermococcus kodakaraensi 95.45
2z30_B65 TK-subtilisin; thermococcus kodakaraensis, hydrola 95.29
>2w2n_P Proprotein convertase subtilisin/kexin type 9; hydrolase-receptor complex, PCSK9, proprotein converta low-density lipoprotein receptor, EGF; 2.30A {Homo sapiens} PDB: 2w2m_P 2w2o_P 2w2p_P 2w2q_P 2xtj_P Back     alignment and structure
Probab=99.51  E-value=1.6e-14  Score=114.49  Aligned_cols=77  Identities=17%  Similarity=0.122  Sum_probs=63.3

Q ss_pred             ceeEEEEecCCCCCCCChHHHHHHHHHHHHHhhCChhhhhcceeEEecCcceeeeeecCHHHHHHhhCCCCeEEEeCCCC
Q 025802           81 YQHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSY  160 (248)
Q Consensus        81 ~ktYIV~M~~p~~p~~s~~~~~~sh~s~LaSVLgSee~Ak~~ILYSYt~afnGFAArLTeEEAekLk~lPGVVSVfPDs~  160 (248)
                      .+.|||+|.....     +.....|++||.+++++ +.+...|+|+|+++|+||+|+||++++++|+++|+|+.|.||+.
T Consensus        37 p~~YIV~lk~~~~-----~~~~~~h~~~l~s~~~~-~~~~~~i~~sY~~~~~GFaa~Lt~~~~~~L~~~P~V~~VE~D~~  110 (114)
T 2w2n_P           37 PGTYVVVLKEETH-----LSQSERTARRLQAQAAR-RGYLTKILHVFHGLLPGFLVKMSGDLLELALKLPHVDYIEEDSS  110 (114)
T ss_dssp             EEEEEEEECTTCC-----HHHHHHHHHHHHHHHHH-TTCCCEEEEEECSSSSEEEEECCGGGHHHHHTSTTEEEEEEEEE
T ss_pred             CCcEEEEECCCCC-----HHHHHHHHHHHHHHhhh-cccCCceEEEecccceEEEEEcCHHHHHHHHcCCCccEEEeCce
Confidence            3789999985431     34566788899888753 22456799999999999999999999999999999999999997


Q ss_pred             cCC
Q 025802          161 IDV  163 (248)
Q Consensus       161 ~qL  163 (248)
                      +++
T Consensus       111 v~~  113 (114)
T 2w2n_P          111 VFA  113 (114)
T ss_dssp             EEE
T ss_pred             Eec
Confidence            654



>3cnq_P Subtilisin BPN'; uncleaved, proenzyme, substrate complex, hydrolase, metal- binding, protease, secreted, serine protease, sporulation; 1.71A {Bacillus amyloliquefaciens} PDB: 3bgo_P 3co0_P 1spb_P 1scj_B Back     alignment and structure
>2qtw_A Proprotein convertase subtilisin/kexin type 9 Pro; coronary heart disease, hypercholest low density lipoprotein receptor, autocatalytic cleavage; HET: NAG; 1.90A {Homo sapiens} PDB: 3m0c_A 2pmw_A 3h42_A 3bps_P 3gcw_P 3gcx_P 3p5b_P 3p5c_P Back     alignment and structure
>1v5i_B POIA1, IA-1=serine proteinase inhibitor; protease-inhibitor complex, subtilisin, hydrolase-Pro binding complex; 1.50A {Pleurotus ostreatus} SCOP: d.58.3.2 PDB: 1itp_A Back     alignment and structure
>2p4e_P Proprotein convertase subtilisin/kexin type 9; protease, LDL receptor, LDL, endocytosis, hydrol; 1.98A {Homo sapiens} Back     alignment and structure
>3t41_A Epidermin leader peptide processing serine protea; structural genomics, center for structural genomics of infec diseases, csgid; 1.95A {Staphylococcus aureus} PDB: 3qfh_A Back     alignment and structure
>3afg_A Subtilisin-like serine protease; propeptide, thermococcus kodakaraensis, hydrolas protease; 2.00A {Thermococcus kodakarensis} Back     alignment and structure
>2z2z_A TK-subtilisin precursor; thermococcus kodakaraensis, hydrolase; 1.87A {Thermococcus kodakarensis} PDB: 2e1p_A 2zwp_A 2zwo_A Back     alignment and structure
>2z30_B TK-subtilisin; thermococcus kodakaraensis, hydrolase; 1.65A {Thermococcus kodakarensis} PDB: 2z2y_B 3a3p_B 2z56_B 2z58_B 2z57_B 3a3n_B 3a3o_B Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query248
d1scjb_71 Subtilisin prosegment {Bacillus subtilis [TaxId: 1 98.25
d1v5ib172 Proteinase A inhibitor 1, POIA1 {Oyster mushroom ( 98.2
>d1scjb_ d.58.3.2 (B:) Subtilisin prosegment {Bacillus subtilis [TaxId: 1423]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Ferredoxin-like
superfamily: Protease propeptides/inhibitors
family: Subtilase propeptides/inhibitors
domain: Subtilisin prosegment
species: Bacillus subtilis [TaxId: 1423]
Probab=98.25  E-value=1.9e-06  Score=61.52  Aligned_cols=69  Identities=13%  Similarity=0.142  Sum_probs=50.9

Q ss_pred             eeEEEEecCCCCCCCChHHHHHHHHHHHHHhhCChhhhhcceeEEecCcceeeeeecCHHHHHHhhCCCCeEEEeCCCCc
Q 025802           82 QHWLIVMEFPNPSELSEEEMINAYVKTLAAVVGSEEEAKKKIYSVCTTTYTGFGALIDEELSYKVKGQPGVLWVLPDSYI  161 (248)
Q Consensus        82 ktYIV~M~~p~~p~~s~~~~~~sh~s~LaSVLgSee~Ak~~ILYSYt~afnGFAArLTeEEAekLk~lPGVVSVfPDs~~  161 (248)
                      +.|||.+.......     ....+.+++.+       ...++.+.|+ .|+||+|.|++++++.|+..|+|..|=+|...
T Consensus         2 ~~YIV~fK~~~~~~-----~~~~~~~~v~~-------~gg~v~~~~~-~i~gfs~~l~~~~~~~L~~~p~V~yVE~D~v~   68 (71)
T d1scjb_           2 KKYIVGFKQTMSAM-----SSAKKKDVISQ-------KGGKVEKQFK-YVNAAAATLDEKAVKELKKDPSVAYVEEDHIA   68 (71)
T ss_dssp             EEEEEEECSSSSCC-----SHHHHHHHHHT-------TTCEEEEECS-SSSEEEEEECHHHHHHHHTSTTEEEEEECCEE
T ss_pred             CcEEEEECCCCChH-----HHHHHHHHHHH-------cCCeEEEEEe-ecceEEEEeCHHHHHHHHcCCCceEEeCCcEE
Confidence            67999988543211     12223333322       2346899997 69999999999999999999999999999876


Q ss_pred             CC
Q 025802          162 DV  163 (248)
Q Consensus       162 qL  163 (248)
                      +.
T Consensus        69 ~a   70 (71)
T d1scjb_          69 HE   70 (71)
T ss_dssp             EE
T ss_pred             Ec
Confidence            54



>d1v5ib1 d.58.3.2 (B:1-72) Proteinase A inhibitor 1, POIA1 {Oyster mushroom (Pleurotus ostreatus) [TaxId: 5322]} Back     information, alignment and structure