Citrus Sinensis ID: 047400


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQFNWC
cccccccHHHHHHHHHHHHHHHHHccccccccHHHHcccccccccccccccHHHHHHHHHHccccccccccccccccccccccccccc
ccccEcHHHHHHHHHHHHHHHHHHcccccEcHHHHHHHcccHcccccEcccHHHHHHHHHHHccEEEccccccccccccccccccccc
PHPLGSCWIFAVVGAIEGISKIVTnnlvdistqqlvdcdnqgesrscvgGFIETIYQYVIQnrginterdypnvgvmdnckvfqfnwc
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGinterdypnvgVMDNCKVFQFNWC
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQFNWC
****GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQFNW*
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVF*FNW*
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQFNWC
*HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQ***C
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
PHPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCKVFQFNWC
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query88 2.2.26 [Sep-21-2011]
Q94B08 376 Germination-specific cyst no no 0.886 0.207 0.470 4e-16
P43297 462 Cysteine proteinase RD21a no no 0.920 0.175 0.439 5e-16
Q9STL4 361 KDEL-tailed cysteine endo no no 0.920 0.224 0.451 1e-15
P25251 328 Cysteine proteinase COT44 N/A no 0.852 0.228 0.473 1e-15
Q9STL5 364 KDEL-tailed cysteine endo no no 0.761 0.184 0.514 2e-15
P60994 215 Ervatamin-B OS=Tabernaemo N/A no 0.897 0.367 0.444 4e-15
Q86GF7 323 Crustapain OS=Pandalus bo N/A no 0.875 0.238 0.467 4e-15
Q7XR52 490 Cysteine protease 1 OS=Or no no 0.886 0.159 0.423 5e-15
P25776 458 Oryzain alpha chain OS=Or no no 0.920 0.176 0.439 7e-15
P25803 362 Vignain OS=Phaseolus vulg N/A no 0.920 0.223 0.439 8e-15
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2 Back     alignment and function desciption
 Score = 83.6 bits (205), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCD   NQG    C GG ++  +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+NTE+DYP  G    C  F  N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247





Arabidopsis thaliana (taxid: 3702)
EC: 3EC: .EC: 4EC: .EC: 2EC: 2EC: .EC: -
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1 SV=1 Back     alignment and function description
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana GN=CEP2 PE=2 SV=1 Back     alignment and function description
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1 Back     alignment and function description
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana GN=CEP3 PE=2 SV=1 Back     alignment and function description
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1 Back     alignment and function description
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1 Back     alignment and function description
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2 SV=2 Back     alignment and function description
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000 PE=1 SV=2 Back     alignment and function description
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query88
242093994 341 hypothetical protein SORBIDRAFT_10g02798 0.840 0.217 0.513 4e-16
357474523 345 Cysteine proteinase [Medicago truncatula 0.886 0.226 0.474 4e-16
144905112 341 cysteine proteinase [Lotus japonicus] 0.875 0.225 0.493 5e-16
224099295 342 predicted protein [Populus trichocarpa] 0.772 0.198 0.529 5e-16
159479072 469 cysteine endopeptidase [Chlamydomonas re 0.875 0.164 0.487 8e-16
413922306 490 hypothetical protein ZEAMMB73_802227 [Ze 0.840 0.151 0.469 1e-15
414879924 336 TPA: hypothetical protein ZEAMMB73_17557 0.920 0.241 0.454 1e-15
356553978 496 PREDICTED: cysteine proteinase RD21a-lik 0.920 0.163 0.463 1e-15
356517350 343 PREDICTED: KDEL-tailed cysteine endopept 0.886 0.227 0.461 1e-15
224135841 342 predicted protein [Populus trichocarpa] 0.772 0.198 0.514 2e-15
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor] gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor] Back     alignment and taxonomy information
 Score = 88.6 bits (218), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ V A+EGI+KIVTNNL+ +S Q+L+DCD   E   C GG ++  +Q+VI N G
Sbjct: 162 GGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDT--EDYGCQGGEMQKAFQFVIDNGG 219

Query: 65  INTERDYPNVGVMDNC 80
           I+TE DYP +G    C
Sbjct: 220 IDTEADYPFIGTNGTC 235




Source: Sorghum bicolor

Species: Sorghum bicolor

Genus: Sorghum

Family: Poaceae

Order: Poales

Class: Liliopsida

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula] gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula] gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula] gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula] Back     alignment and taxonomy information
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus] Back     alignment and taxonomy information
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa] gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii] gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii] Back     alignment and taxonomy information
>gi|413922306|gb|AFW62238.1| hypothetical protein ZEAMMB73_802227 [Zea mays] Back     alignment and taxonomy information
>gi|414879924|tpg|DAA57055.1| TPA: hypothetical protein ZEAMMB73_175573 [Zea mays] Back     alignment and taxonomy information
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max] Back     alignment and taxonomy information
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine max] gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine max] Back     alignment and taxonomy information
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa] gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query88
TAIR|locus:2038515 343 AT1G06260 [Arabidopsis thalian 0.863 0.221 0.486 5.4e-17
TAIR|locus:2825832 462 RD21A "responsive to dehydrati 0.920 0.175 0.439 2.2e-16
TAIR|locus:2167821 463 RD21B "esponsive to dehydratio 0.852 0.161 0.460 7.7e-16
UNIPROTKB|Q86GF7 323 Cys "Crustapain" [Pandalus bor 0.875 0.238 0.467 1.2e-15
TAIR|locus:2055440 345 AT2G34080 [Arabidopsis thalian 0.863 0.220 0.467 1.4e-15
TAIR|locus:505006391 364 CEP3 "cysteine endopeptidase 3 0.761 0.184 0.514 1.7e-15
ZFIN|ZDB-GENE-050522-559 330 ctssb.1 "cathepsin S, b.1" [Da 0.863 0.230 0.486 2.4e-15
TAIR|locus:2029924 355 AT1G29090 [Arabidopsis thalian 0.863 0.214 0.454 2.6e-15
TAIR|locus:2128243 364 AT4G11310 [Arabidopsis thalian 0.829 0.200 0.453 6.2e-15
TAIR|locus:2024362 437 XBCP3 "xylem bark cysteine pep 0.863 0.173 0.441 6.4e-15
TAIR|locus:2038515 AT1G06260 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 211 (79.3 bits), Expect = 5.4e-17, P = 5.4e-17
 Identities = 37/76 (48%), Positives = 50/76 (65%)

Query:     5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
             G CW F+ V AIEGI+KI T NLV +S QQL+DCD    ++ C GG +ET ++++  N G
Sbjct:   149 GGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGG 208

Query:    65 INTERDYPNVGVMDNC 80
             + TE DYP  G+   C
Sbjct:   209 LATETDYPYTGIEGTC 224




GO:0005576 "extracellular region" evidence=ISM
GO:0006508 "proteolysis" evidence=IEA;ISS
GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS
GO:0048046 "apoplast" evidence=IDA
TAIR|locus:2825832 RD21A "responsive to dehydration 21A" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2167821 RD21B "esponsive to dehydration 21B" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q86GF7 Cys "Crustapain" [Pandalus borealis (taxid:6703)] Back     alignment and assigned GO terms
TAIR|locus:2055440 AT2G34080 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:505006391 CEP3 "cysteine endopeptidase 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050522-559 ctssb.1 "cathepsin S, b.1" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
TAIR|locus:2029924 AT1G29090 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2128243 AT4G11310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2024362 XBCP3 "xylem bark cysteine peptidase 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query88
pfam00112 213 pfam00112, Peptidase_C1, Papain family cysteine pr 3e-24
smart00645175 smart00645, Pept_C1, Papain family cysteine protea 5e-24
cd02248 210 cd02248, Peptidase_C1A, Peptidase C1A subfamily (M 4e-23
PTZ00200 448 PTZ00200, PTZ00200, cysteine proteinase; Provision 2e-20
PTZ00021 489 PTZ00021, PTZ00021, falcipain-2; Provisional 4e-14
PTZ00203 348 PTZ00203, PTZ00203, cathepsin L protease; Provisio 5e-11
cd02619 223 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS 3e-10
cd02698 239 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; th 8e-07
cd02621 243 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; al 1e-05
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease Back     alignment and domain information
 Score = 90.3 bits (225), Expect = 3e-24
 Identities = 39/84 (46%), Positives = 48/84 (57%), Gaps = 6/84 (7%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD--NQGESRSCVGGFIETIYQYVIQN 62
           GSCW F+ VGA+EG   I T  LV +S QQLVDCD  N G    C GG  +  ++Y+ +N
Sbjct: 23  GSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGNNG----CNGGLPDNAFEYIKKN 78

Query: 63  RGINTERDYPNVGVMDNCKVFQFN 86
            GI TE DYP       CK  + N
Sbjct: 79  GGIVTESDYPYTAHDGTCKFKKSN 102


Length = 213

>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease Back     alignment and domain information
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W) Back     alignment and domain information
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional Back     alignment and domain information
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional Back     alignment and domain information
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional Back     alignment and domain information
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase) Back     alignment and domain information
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity Back     alignment and domain information
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 88
KOG1542 372 consensus Cysteine proteinase Cathepsin F [Posttra 99.97
KOG1543 325 consensus Cysteine proteinase Cathepsin L [Posttra 99.95
PTZ00200 448 cysteine proteinase; Provisional 99.94
PTZ00203 348 cathepsin L protease; Provisional 99.94
PTZ00021 489 falcipain-2; Provisional 99.94
cd02620 236 Peptidase_C1A_CathepsinB Cathepsin B group; compos 99.93
cd02698 239 Peptidase_C1A_CathepsinX Cathepsin X; the only pap 99.93
cd02621 243 Peptidase_C1A_CathepsinC Cathepsin C; also known a 99.93
PTZ00364 548 dipeptidyl-peptidase I precursor; Provisional 99.92
PTZ00049 693 cathepsin C-like protein; Provisional 99.92
cd02248 210 Peptidase_C1A Peptidase C1A subfamily (MEROPS data 99.91
smart00645174 Pept_C1 Papain family cysteine protease. 99.9
PTZ00462 1004 Serine-repeat antigen protein; Provisional 99.88
PF00112 219 Peptidase_C1: Papain family cysteine protease This 99.87
cd02619 223 Peptidase_C1 C1 Peptidase family (MEROPS database 99.84
KOG1544 470 consensus Predicted cysteine proteinase TIN-ag [Ge 99.58
COG4870 372 Cysteine protease [Posttranslational modification, 97.24
cd00585 437 Peptidase_C1B Peptidase C1B subfamily (MEROPS data 96.82
PF03051 438 Peptidase_C1_2: Peptidase C1-like family This fami 96.56
>KOG1542 consensus Cysteine proteinase Cathepsin F [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
Probab=99.97  E-value=2.3e-32  Score=193.67  Aligned_cols=82  Identities=38%  Similarity=0.818  Sum_probs=78.0

Q ss_pred             CCCchHHHHHHHHHHHHHHHHHhCCCcccchhhhhcccCCCCCCCCCCCcHHHHHHHHHhhCCccCCCCcccccCCC-Cc
Q 047400            2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMD-NC   80 (88)
Q Consensus         2 g~CgsCwAfa~~~~ie~~~~i~~~~~~~lS~Q~lidC~~~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~-~C   80 (88)
                      |.||||||||++++||+++.|++++++.|||||||||+.. ++ ||+||.+..||+|+++.+||..|.+|||+++++ .|
T Consensus       176 G~CGSCWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~-d~-gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C  253 (372)
T KOG1542|consen  176 GMCGSCWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSC-DN-GCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGNQC  253 (372)
T ss_pred             CcCcchhhhhhhhhhhhHHHhhcCcccccchhhhhcccCc-CC-cCCCCChhHHHHHHHHhCCccccccCCccccCCCcc
Confidence            7899999999999999999999999999999999999987 77 999999999999988888999999999999998 89


Q ss_pred             ccCCC
Q 047400           81 KVFQF   85 (88)
Q Consensus        81 ~~~~~   85 (88)
                      +.++.
T Consensus       254 ~~~~~  258 (372)
T KOG1542|consen  254 HFDKS  258 (372)
T ss_pred             ccchh
Confidence            98763



>KOG1543 consensus Cysteine proteinase Cathepsin L [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PTZ00200 cysteine proteinase; Provisional Back     alignment and domain information
>PTZ00203 cathepsin L protease; Provisional Back     alignment and domain information
>PTZ00021 falcipain-2; Provisional Back     alignment and domain information
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag) Back     alignment and domain information
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity Back     alignment and domain information
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access Back     alignment and domain information
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional Back     alignment and domain information
>PTZ00049 cathepsin C-like protein; Provisional Back     alignment and domain information
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W) Back     alignment and domain information
>smart00645 Pept_C1 Papain family cysteine protease Back     alignment and domain information
>PTZ00462 Serine-repeat antigen protein; Provisional Back     alignment and domain information
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification Back     alignment and domain information
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase) Back     alignment and domain information
>KOG1544 consensus Predicted cysteine proteinase TIN-ag [General function prediction only] Back     alignment and domain information
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC) Back     alignment and domain information
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query88
3p5w_A 220 Actinidin From Actinidia Arguta Planch (Sarusashi) 3e-16
1iwd_A 215 Proposed Amino Acid Sequence And The 1.63 Angstrom 3e-16
3p5u_A 220 Actinidin From Actinidia Arguta Planch (Sarusashi) 3e-15
2bdz_A 214 Mexicain From Jacaratia Mexicana Length = 214 4e-15
1aec_A 218 Crystal Structure Of Actinidin-E-64 Complex+ Length 4e-15
3bcn_A 209 Crystal Structure Of A Papain-Like Cysteine Proteas 1e-14
7pck_A 314 Crystal Structure Of Wild Type Human Procathepsin K 2e-14
3h7d_A 215 The Crystal Structure Of The Cathepsin K Variant M5 2e-14
1mem_A 215 Crystal Structure Of Cathepsin K Complexed With A P 2e-14
1u9v_A 217 Crystal Structure Of The Cysteine Protease Human Ca 2e-14
1snk_A 214 Cathepsin K Complexed With Carbamate Derivatized No 3e-14
3ovz_A 213 Cathepsin K In Complex With A Covalent Inhibitor Wi 3e-14
2pns_A 208 1.9 Angstrom Resolution Crystal Structure Of A Plan 4e-14
2f7d_A 215 A Mutant Rabbit Cathepsin K With A Nitrile Inhibito 4e-14
1ms6_A 222 Dipeptide Nitrile Inhibitor Bound To Cathepsin S. L 5e-14
1o0e_A 208 1.9 Angstrom Crystal Structure Of A Plant Cysteine 5e-14
3ovx_A 218 Cathepsin S In Complex With A Covalent Inhibitor Wi 6e-14
3n3g_A 217 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitri 6e-14
2g6d_A 217 Human Cathepsin S Mutant With Vinyl Sulfone Inhibit 6e-14
2fye_A 217 Mutant Human Cathepsin S With Irreversible Inhibito 6e-14
2f1g_A 220 Cathepsin S In Complex With Non-Covalent 2-(Benzoxa 6e-14
1cqd_A 221 The 2.1 Angstrom Structure Of A Cysteine Protease W 6e-14
2fq9_A 225 Cathepsin S With Nitrile Inhibitor Length = 225 6e-14
1vsn_A 215 Crystal Structure Of A Potent Small Molecule Inhibi 6e-14
1fh0_A 221 Crystal Structure Of Human Cathepsin V Complexed Wi 7e-14
1npz_A 217 Crystal Structures Of Cathepsin S Inhibitor Complex 7e-14
1s4v_A 229 The 2.0 A Crystal Structure Of The Kdel-Tailed Cyst 1e-13
2act_A 220 Crystallographic Refinement Of The Structure Of Act 2e-13
2c0y_A 315 The Crystal Structure Of A Cys25ala Mutant Of Human 3e-13
3kwn_A 219 Cathepsin S In Complex With Thioether Acetamide P3 5e-13
3iej_A 222 Pyrazole-Based Cathepsin S Inhibitors With Arylalky 5e-13
3mpe_A 220 Crystal Structure Of Human Cathepsin-S C25s Mutant 5e-13
1gec_E 216 Glycyl Endopeptidase-complex With Benzyloxycarbonyl 5e-13
3h6s_A 221 Strucure Of Clitocypin - Cathepsin V Complex Length 8e-13
1glo_A 217 Crystal Structure Of Cys25ser Mutant Of Human Cathe 1e-12
1yal_A 218 Carica Papaya Chymopapain At 1.7 Angstroms Resoluti 1e-12
8pch_A 220 Crystal Structure Of Porcine Cathepsin H Determined 1e-12
1meg_A 216 Crystal Structure Of A Caricain D158e Mutant In Com 2e-12
1ppo_A 216 Determination Of The Structure Of Papaya Protease O 2e-12
3hwn_A 258 Cathepsin L With Az13010160 Length = 258 2e-12
3of8_A 221 Structural Basis For Reversible And Irreversible In 2e-12
3hha_A 220 Crystal Structure Of Cathepsin L In Complex With Az 2e-12
3f75_A 224 Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In 2e-12
3h89_A 220 A Combined Crystallographic And Molecular Dynamics 2e-12
1icf_A175 Crystal Structure Of Mhc Class Ii Associated P41 Ii 2e-12
2fo5_A 262 Crystal Structure Of Recombinant Barley Cysteine En 2e-12
1pci_A 322 Procaricain Length = 322 6e-12
3u8e_A 222 Crystal Structure Of Cysteine Protease From Bulbs O 1e-11
2nqd_B 221 Crystal Structure Of Cysteine Protease Inhibitor, C 2e-11
3iv2_A 220 Crystal Structure Of Mature Apo-Cathepsin L C25a Mu 2e-11
3kse_A 220 Unreduced Cathepsin L In Complex With Stefin A Leng 3e-11
3bc3_A 220 Exploring Inhibitor Binding At The S Subsites Of Ca 3e-11
1mhw_A175 Design Of Non-covalent Inhibitors Of Human Cathepsi 3e-11
1cjl_A 312 Crystal Structure Of A Cysteine Protease Proform Le 3e-11
1cs8_A 316 Crystal Structure Of Procathepsin L Length = 316 4e-11
2b1m_A 246 Crystal Structure Of A Papain-Fold Protein Without 5e-11
2vhs_A 217 Cathsilicatein, A Chimera Length = 217 4e-10
1khp_A 212 Monoclinic Form Of Papain/zlfg-dam Covalent Complex 4e-10
1ppp_A 212 Crystal Structure Of Papain-E64-C Complex. Binding 6e-10
1pip_A 212 Crystal Structure Of Papain-Succinyl-Gln-Val-Val-Al 7e-10
3ioq_A 213 Crystal Structure Of The Carica Candamarcensis Cyst 3e-09
3ima_A 212 Complex Strcuture Of Tarocystatin And Papain Length 4e-09
3tnx_A 363 Structure Of The Precursor Of A Thermostable Varian 6e-09
2cio_A 212 The High Resolution X-Ray Structure Of Papain Compl 8e-09
1stf_E 212 The Refined 2.4 Angstroms X-Ray Crystal Structure O 8e-09
2o6x_A 310 Crystal Structure Of Procathepsin L1 From Fasciola 1e-08
3qj3_A 331 Structure Of Digestive Procathepsin L2 Proteinase F 3e-08
3qt4_A 329 Structure Of Digestive Procathepsin L 3 Of Tenebrio 4e-08
1yvb_A 241 The Plasmodium Falciparum Cysteine Protease Falcipa 4e-08
2p7u_A 215 The Crystal Structure Of Rhodesain, The Major Cyste 5e-08
3d6s_A 223 Crystal Structure Of Mite Allergen Der F 1 Length = 8e-08
3pnr_A 240 Structure Of Pbicp-C In Complex With Falcipain-2 Le 3e-07
3bpm_A 243 Crystal Structure Of Falcipain-3 With Its Inhibitor 8e-07
1aim_A 215 Cruzain Inhibited By Benzoyl-Tyrosine-Alanine-Fluor 3e-06
1ewp_A 215 Cruzain Bound To Mor-Leu-Hpq Length = 215 3e-06
1m6d_A 214 Crystal Structure Of Human Cathepsin F Length = 214 6e-06
3iut_A 221 The Crystal Structure Of Cruzain In Complex With A 6e-06
3hd3_A 215 High Resolution Crystal Structure Of Cruzain Bound 6e-06
1k3b_B164 Crystal Structure Of Human Dipeptidyl Peptidase I ( 2e-05
3pdf_A 441 Discovery Of Novel Cyanamide-Based Inhibitors Of Ca 4e-05
3rvw_A 222 Crystal Structure Of Der P 1 Complexed With Fab 4c1 4e-05
2as8_A 222 Crystal Structure Of Mature And Fully Active Der P 2e-04
1jqp_A 438 Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric 5e-04
1xkg_A 312 Crystal Structure Of The Major House Dust Mite Alle 7e-04
>pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi) Length = 220 Back     alignment and structure

Iteration: 1

Score = 80.1 bits (196), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 35/78 (44%), Positives = 50/78 (64%) Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64 GSCW F+ + A+EGI+KI T +L+ +S Q+LVDC +R C GGF+ +Q++I N G Sbjct: 23 GSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGG 82 Query: 65 INTERDYPNVGVMDNCKV 82 INTE +YP C + Sbjct: 83 INTEANYPYTAEEGQCNL 100
>pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray Crystal Structure Of A Plant Cysteine Protease Ervatamin B: Insight Into The Structural Basis Of Its Stability And Substrate Specificity Length = 215 Back     alignment and structure
>pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi) Length = 220 Back     alignment and structure
>pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana Length = 214 Back     alignment and structure
>pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+ Length = 218 Back     alignment and structure
>pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease Ervatamin-A Complexed With Irreversible Inhibitor E-64 Length = 209 Back     alignment and structure
>pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K Length = 314 Back     alignment and structure
>pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In Compl Chondroitin-4-Sulfate Length = 215 Back     alignment and structure
>pdb|1MEM|A Chain A, Crystal Structure Of Cathepsin K Complexed With A Potent Vinyl Sulfone Inhibitor Length = 215 Back     alignment and structure
>pdb|1U9V|A Chain A, Crystal Structure Of The Cysteine Protease Human Cathepsin K In Complex With The Covalent Inhibitor Nvp-Abe854 Length = 217 Back     alignment and structure
>pdb|1SNK|A Chain A, Cathepsin K Complexed With Carbamate Derivatized Norleucine Aldehyde Length = 214 Back     alignment and structure
>pdb|3OVZ|A Chain A, Cathepsin K In Complex With A Covalent Inhibitor With A Ketoamide Warhead Length = 213 Back     alignment and structure
>pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant Cysteine Protease Ervatamin-C Refinement With Cdna Derived Amino Acid Sequence Length = 208 Back     alignment and structure
>pdb|2F7D|A Chain A, A Mutant Rabbit Cathepsin K With A Nitrile Inhibitor Length = 215 Back     alignment and structure
>pdb|1MS6|A Chain A, Dipeptide Nitrile Inhibitor Bound To Cathepsin S. Length = 222 Back     alignment and structure
>pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine Protease Ervatamin C Length = 208 Back     alignment and structure
>pdb|3OVX|A Chain A, Cathepsin S In Complex With A Covalent Inhibitor With An Aldehyde Warhead Length = 218 Back     alignment and structure
>pdb|3N3G|A Chain A, 4-(3-Trifluoromethylphenyl)-Pyrimidine-2-Carbonitrile As Cathepsin S Inhibitors: N3, Not N1 Is Critically Important Length = 217 Back     alignment and structure
>pdb|2G6D|A Chain A, Human Cathepsin S Mutant With Vinyl Sulfone Inhibitor Cra- 14009 Length = 217 Back     alignment and structure
>pdb|2FYE|A Chain A, Mutant Human Cathepsin S With Irreversible Inhibitor Cra- 14013 Length = 217 Back     alignment and structure
>pdb|2F1G|A Chain A, Cathepsin S In Complex With Non-Covalent 2-(Benzoxazol-2-Ylamino)- Acetamide Length = 220 Back     alignment and structure
>pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With Proline Specificity From Ginger Rhizome, Zingiber Officinale Length = 221 Back     alignment and structure
>pdb|2FQ9|A Chain A, Cathepsin S With Nitrile Inhibitor Length = 225 Back     alignment and structure
>pdb|1VSN|A Chain A, Crystal Structure Of A Potent Small Molecule Inhibitor Bound To Cathepsin K Length = 215 Back     alignment and structure
>pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An Irreversible Vinyl Sulfone Inhibitor Length = 221 Back     alignment and structure
>pdb|1NPZ|A Chain A, Crystal Structures Of Cathepsin S Inhibitor Complexes Length = 217 Back     alignment and structure
>pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine Endopeptidase Functioning In Programmed Cell Death Of Ricinus Communis Endosperm Length = 229 Back     alignment and structure
>pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin At 1.7 Angstroms Resolution By Fast Fourier Least-Squares Methods Length = 220 Back     alignment and structure
>pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human Procathepsin S Length = 315 Back     alignment and structure
>pdb|3KWN|A Chain A, Cathepsin S In Complex With Thioether Acetamide P3 Inhibitor Length = 219 Back     alignment and structure
>pdb|3IEJ|A Chain A, Pyrazole-Based Cathepsin S Inhibitors With Arylalkynes As P1 Binding Elements Length = 222 Back     alignment and structure
>pdb|3MPE|A Chain A, Crystal Structure Of Human Cathepsin-S C25s Mutant With Bound Drug Length = 220 Back     alignment and structure
>pdb|1GEC|E Chain E, Glycyl Endopeptidase-complex With Benzyloxycarbonyl-leucine-valine- Glycine-methylene Covalently Bound To Cysteine 25 Length = 216 Back     alignment and structure
>pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex Length = 221 Back     alignment and structure
>pdb|1GLO|A Chain A, Crystal Structure Of Cys25ser Mutant Of Human Cathepsin S Length = 217 Back     alignment and structure
>pdb|1YAL|A Chain A, Carica Papaya Chymopapain At 1.7 Angstroms Resolution Length = 218 Back     alignment and structure
>pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1 Angstrom Resolution: Location Of The Mini-Chain C-Terminal Carboxyl Group Defines Cathepsin H Aminopeptidase Function Length = 220 Back     alignment and structure
>pdb|1MEG|A Chain A, Crystal Structure Of A Caricain D158e Mutant In Complex With E-64 Length = 216 Back     alignment and structure
>pdb|1PPO|A Chain A, Determination Of The Structure Of Papaya Protease Omega Length = 216 Back     alignment and structure
>pdb|3HWN|A Chain A, Cathepsin L With Az13010160 Length = 258 Back     alignment and structure
>pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible Inhibition Of Human Cathepsin L By Their Respective Dipeptidyl Glyoxal And Diazomethylketone Inhibitors Length = 221 Back     alignment and structure
>pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With Az12878478 Length = 220 Back     alignment and structure
>pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex With Its Propeptide Length = 224 Back     alignment and structure
>pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study Of Cathepsin-L Retro-Binding Inhibitors(Compound 4) Length = 220 Back     alignment and structure
>pdb|1ICF|A Chain A, Crystal Structure Of Mhc Class Ii Associated P41 Ii Fragment In Complex With Cathepsin L Length = 175 Back     alignment and structure
>pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine Endoprotease B Isoform 2 (Ep-B2) In Complex With Leupeptin Length = 262 Back     alignment and structure
>pdb|1PCI|A Chain A, Procaricain Length = 322 Back     alignment and structure
>pdb|3U8E|A Chain A, Crystal Structure Of Cysteine Protease From Bulbs Of Crocus Sativus At 1.3 A Resolution Length = 222 Back     alignment and structure
>pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor, Chagasin, In Complex With Human Cathepsin L Length = 221 Back     alignment and structure
>pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant Length = 220 Back     alignment and structure
>pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A Length = 220 Back     alignment and structure
>pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin L Length = 220 Back     alignment and structure
>pdb|1MHW|A Chain A, Design Of Non-covalent Inhibitors Of Human Cathepsin L. From The 96- Residue Proregion To Optimized Tripeptides Length = 175 Back     alignment and structure
>pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform Length = 312 Back     alignment and structure
>pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L Length = 316 Back     alignment and structure
>pdb|2B1M|A Chain A, Crystal Structure Of A Papain-Fold Protein Without The Catalytic Cysteine From Seeds Of Pachyrhizus Erosus Length = 246 Back     alignment and structure
>pdb|2VHS|A Chain A, Cathsilicatein, A Chimera Length = 217 Back     alignment and structure
>pdb|1KHP|A Chain A, Monoclinic Form Of Papain/zlfg-dam Covalent Complex Length = 212 Back     alignment and structure
>pdb|1PPP|A Chain A, Crystal Structure Of Papain-E64-C Complex. Binding Diversity Of E64-C To Papain S2 And S3 Subsites Length = 212 Back     alignment and structure
>pdb|1PIP|A Chain A, Crystal Structure Of Papain-Succinyl-Gln-Val-Val-Ala-Ala-P- Nitroanilide Complex At 1.7 Angstroms Resolution: Noncovalent Binding Mode Of A Common Sequence Of Endogenous Thiol Protease Inhibitors Length = 212 Back     alignment and structure
>pdb|3IOQ|A Chain A, Crystal Structure Of The Carica Candamarcensis Cysteine Protease Cms1ms2 In Complex With E-64 Length = 213 Back     alignment and structure
>pdb|3IMA|A Chain A, Complex Strcuture Of Tarocystatin And Papain Length = 212 Back     alignment and structure
>pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of Papain At 2.6 Angstroem Resolution Length = 363 Back     alignment and structure
>pdb|2CIO|A Chain A, The High Resolution X-Ray Structure Of Papain Complexed With Fragments Of The Trypanosoma Brucei Cysteine Protease Inhibitor Icp Length = 212 Back     alignment and structure
>pdb|1STF|E Chain E, The Refined 2.4 Angstroms X-Ray Crystal Structure Of Recombinant Human Stefin B In Complex With The Cysteine Proteinase Papain: A Novel Type Of Proteinase Inhibitor Interaction Length = 212 Back     alignment and structure
>pdb|2O6X|A Chain A, Crystal Structure Of Procathepsin L1 From Fasciola Hepatica Length = 310 Back     alignment and structure
>pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From Tenebrio Molitor Larval Midgut Length = 331 Back     alignment and structure
>pdb|3QT4|A Chain A, Structure Of Digestive Procathepsin L 3 Of Tenebrio Molitor Larval Midgut Length = 329 Back     alignment and structure
>pdb|1YVB|A Chain A, The Plasmodium Falciparum Cysteine Protease Falcipain-2 Length = 241 Back     alignment and structure
>pdb|2P7U|A Chain A, The Crystal Structure Of Rhodesain, The Major Cysteine Protease Of T. Brucei Rhodesiense, Bound To Inhibitor K777 Length = 215 Back     alignment and structure
>pdb|3D6S|A Chain A, Crystal Structure Of Mite Allergen Der F 1 Length = 223 Back     alignment and structure
>pdb|3PNR|A Chain A, Structure Of Pbicp-C In Complex With Falcipain-2 Length = 240 Back     alignment and structure
>pdb|3BPM|A Chain A, Crystal Structure Of Falcipain-3 With Its Inhibitor, Leupeptin Length = 243 Back     alignment and structure
>pdb|1AIM|A Chain A, Cruzain Inhibited By Benzoyl-Tyrosine-Alanine-Fluoromethylketone Length = 215 Back     alignment and structure
>pdb|1EWP|A Chain A, Cruzain Bound To Mor-Leu-Hpq Length = 215 Back     alignment and structure
>pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F Length = 214 Back     alignment and structure
>pdb|3IUT|A Chain A, The Crystal Structure Of Cruzain In Complex With A Tetrafluorophenoxymethyl Ketone Inhibitor Length = 221 Back     alignment and structure
>pdb|3HD3|A Chain A, High Resolution Crystal Structure Of Cruzain Bound To The Vinyl Sulfone Inhibitor Smdc-256047 Length = 215 Back     alignment and structure
>pdb|1K3B|B Chain B, Crystal Structure Of Human Dipeptidyl Peptidase I (Cathepsin C): Exclusion Domain Added To An Endopeptidase Framework Creates The Machine For Activation Of Granular Serine Proteases Length = 164 Back     alignment and structure
>pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin C Length = 441 Back     alignment and structure
>pdb|3RVW|A Chain A, Crystal Structure Of Der P 1 Complexed With Fab 4c1 Length = 222 Back     alignment and structure
>pdb|2AS8|A Chain A, Crystal Structure Of Mature And Fully Active Der P 1 Allergen Length = 222 Back     alignment and structure
>pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric Cysteine Protease Of The Papain Family Length = 438 Back     alignment and structure
>pdb|1XKG|A Chain A, Crystal Structure Of The Major House Dust Mite Allergen Der P 1 In Its Pro Form At 1.61 A Resolution Length = 312 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query88
3p5u_A 220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 1e-30
1o0e_A 208 Ervatamin C; plant cysteine protease, two domain, 5e-30
1iwd_A 215 Ervatamin B; cysteine protease, alpha-beta protein 6e-30
1by8_A 314 Protein (procathepsin K); hydrolase(sulfhydryl pro 7e-30
1cqd_A 221 Protein (protease II); cysteine protease, glycopro 1e-29
2fo5_A 262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 1e-29
3f75_A 224 Toxopain-2, cathepsin L protease; medical structur 1e-29
1s4v_A 229 Cysteine endopeptidase; KDEL ER retention signal, 1e-29
1cs8_A 316 Human procathepsin L; prosegment, propeptide, inhi 3e-29
3kwz_A 215 Cathepsin K; enzyme inhibitor, covalent reversible 4e-29
2b1m_A 246 SPE31; papain-like, sugar binding protein; HET: NA 5e-29
2c0y_A 315 Procathepsin S; proenzyme, proteinase, hydrolase, 5e-29
8pch_A 220 Cathepsin H; hydrolase, protease, cysteine protein 6e-29
2xu3_A 220 Cathepsin L1; hydrolase, drug design, thiol protea 6e-29
1m6d_A 214 Cathepsin F, catsf; papain family cysteine proteas 7e-29
3ovx_A 218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 9e-29
2oul_A 241 Falcipain 2; cysteine protease, inhibitor, macromo 1e-28
3qj3_A 331 Cathepsin L-like protein; hydrolase, proteinase, l 2e-28
3bwk_A 243 Cysteine protease falcipain-3; malaria, hydrolase; 2e-28
1pci_A 322 Procaricain; zymogen, hydrolase, thiol protease; 3 3e-28
2bdz_A 214 Mexicain; cysteine protease, peptidase_C1, papain- 4e-28
1ppo_A 216 Protease omega; hydrolase(thiol protease); 1.80A { 4e-28
1yal_A 218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 5e-28
2cio_A 212 Papain; hydrolase/inhibitor, complex hydrolase/inh 1e-27
2o6x_A 310 Procathepsin L1, secreted cathepsin L 1; hydrolase 1e-27
3ioq_A 213 CMS1MS2; caricaceae, cysteine protease, papain fam 2e-27
3qt4_A 329 Cathepsin-L-like midgut cysteine proteinase; hydro 2e-27
1xkg_A 312 DER P I, major mite fecal allergen DER P 1; major 2e-27
3f5v_A 222 DER P 1 allergen; allergy, asthma, DUST mites, gly 8e-27
3i06_A 215 Cruzipain; autocatalytic cleavage, glycoprotein, p 4e-26
3u8e_A 222 Papain-like cysteine protease; papain-like cystein 5e-25
2wbf_X 265 Serine-repeat antigen protein; SERA, malaria, vacu 5e-22
3pdf_A 441 Cathepsin C, dipeptidyl peptidase 1; two domains, 1e-21
1deu_A 277 Procathepsin X; cysteine protease, proregion, pros 1e-18
3pbh_A 317 Procathepsin B; thiol protease, cysteine protease, 2e-16
3hhi_A 325 Cathepsin B-like cysteine protease; occluding loop 3e-16
3cbj_A 266 Cathepsin B; cathepsin B, occluding loop, chagas d 5e-16
3qsd_A 254 Cathepsin B-like peptidase (C01 family); cysteine 1e-15
3ois_A 291 Cysteine protease; alpha and beta, hydrolase; HET: 2e-13
3pw3_A 383 Aminopeptidase C; bleomycin, cysteine proteinase f 1e-04
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Length = 208 Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Length = 215 Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Length = 314 Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Length = 221 Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Length = 262 Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} Length = 224 Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Length = 229 Back     alignment and structure
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Length = 316 Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Length = 215 Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Length = 246 Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Length = 315 Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Length = 220 Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Length = 220 Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Length = 214 Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Length = 218 Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Length = 241 Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} Length = 331 Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Length = 243 Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Length = 322 Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Length = 214 Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Length = 216 Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Length = 218 Back     alignment and structure
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Length = 212 Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Length = 310 Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} Length = 213 Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Length = 329 Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Length = 312 Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Length = 215 Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} Length = 222 Back     alignment and structure
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Length = 265 Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Length = 441 Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Length = 277 Back     alignment and structure
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Length = 317 Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} PDB: 3mor_A* Length = 325 Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Length = 266 Back     alignment and structure
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A* Length = 254 Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Length = 291 Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Length = 383 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query88
1m6d_A 214 Cathepsin F, catsf; papain family cysteine proteas 99.95
2cio_A 212 Papain; hydrolase/inhibitor, complex hydrolase/inh 99.95
3kwz_A 215 Cathepsin K; enzyme inhibitor, covalent reversible 99.95
3ioq_A 213 CMS1MS2; caricaceae, cysteine protease, papain fam 99.95
1iwd_A 215 Ervatamin B; cysteine protease, alpha-beta protein 99.95
1cqd_A 221 Protein (protease II); cysteine protease, glycopro 99.95
2bdz_A 214 Mexicain; cysteine protease, peptidase_C1, papain- 99.95
3u8e_A 222 Papain-like cysteine protease; papain-like cystein 99.95
3qt4_A 329 Cathepsin-L-like midgut cysteine proteinase; hydro 99.95
1ppo_A 216 Protease omega; hydrolase(thiol protease); 1.80A { 99.95
3p5u_A 220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 99.95
1o0e_A 208 Ervatamin C; plant cysteine protease, two domain, 99.95
3qj3_A 331 Cathepsin L-like protein; hydrolase, proteinase, l 99.95
1yal_A 218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 99.95
2oul_A 241 Falcipain 2; cysteine protease, inhibitor, macromo 99.95
2xu3_A 220 Cathepsin L1; hydrolase, drug design, thiol protea 99.95
1pci_A 322 Procaricain; zymogen, hydrolase, thiol protease; 3 99.95
1by8_A 314 Protein (procathepsin K); hydrolase(sulfhydryl pro 99.95
2b1m_A 246 SPE31; papain-like, sugar binding protein; HET: NA 99.95
8pch_A 220 Cathepsin H; hydrolase, protease, cysteine protein 99.95
3tnx_A 363 Papain; hydrolase, cytoplasm for recombinant expre 99.95
1s4v_A 229 Cysteine endopeptidase; KDEL ER retention signal, 99.95
1cs8_A 316 Human procathepsin L; prosegment, propeptide, inhi 99.95
2fo5_A 262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 99.95
2c0y_A 315 Procathepsin S; proenzyme, proteinase, hydrolase, 99.95
3ovx_A 218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 99.95
3bwk_A 243 Cysteine protease falcipain-3; malaria, hydrolase; 99.95
3f75_A 224 Toxopain-2, cathepsin L protease; medical structur 99.95
3f5v_A 222 DER P 1 allergen; allergy, asthma, DUST mites, gly 99.94
2o6x_A 310 Procathepsin L1, secreted cathepsin L 1; hydrolase 99.94
3i06_A 215 Cruzipain; autocatalytic cleavage, glycoprotein, p 99.94
1xkg_A 312 DER P I, major mite fecal allergen DER P 1; major 99.94
3hhi_A 325 Cathepsin B-like cysteine protease; occluding loop 99.94
1deu_A 277 Procathepsin X; cysteine protease, proregion, pros 99.94
3pdf_A 441 Cathepsin C, dipeptidyl peptidase 1; two domains, 99.94
2wbf_X 265 Serine-repeat antigen protein; SERA, malaria, vacu 99.94
3qsd_A 254 Cathepsin B-like peptidase (C01 family); cysteine 99.94
3pbh_A 317 Procathepsin B; thiol protease, cysteine protease, 99.93
3cbj_A 266 Cathepsin B; cathepsin B, occluding loop, chagas d 99.93
3ois_A 291 Cysteine protease; alpha and beta, hydrolase; HET: 99.87
3pw3_A 383 Aminopeptidase C; bleomycin, cysteine proteinase f 99.83
2e01_A 457 Cysteine proteinase 1; bleomycin hydrolase, thiol 99.79
2cb5_A 453 Protein (bleomycin hydrolase); aminopeptidase, cys 99.79
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Back     alignment and structure
Probab=99.95  E-value=8.2e-29  Score=166.02  Aligned_cols=81  Identities=35%  Similarity=0.718  Sum_probs=75.2

Q ss_pred             CCCchHHHHHHHHHHHHHHHHHhCCCcccchhhhhcccCCCCCCCCCCCcHHHHHHHHHhhCCccCCCCcccccCCCCcc
Q 047400            2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCK   81 (88)
Q Consensus         2 g~CgsCwAfa~~~~ie~~~~i~~~~~~~lS~Q~lidC~~~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~C~   81 (88)
                      |.||||||||++++||++++|+++.++.||+|+|+||+.. +. ||+||++..||+|+++++||++|++|||.+.+++|+
T Consensus        20 g~cGsCWAfa~~~~le~~~~i~~g~~~~lS~q~l~dC~~~-~~-gC~GG~~~~a~~~i~~~~Gi~~e~~yPY~~~~~~C~   97 (214)
T 1m6d_A           20 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM-DK-ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCQ   97 (214)
T ss_dssp             CSSBCHHHHHHHHHHHHHHHHHHSCCCCBCHHHHHHHCSS-SC-TTBCCCHHHHHHHHHHHTCBCBTTTSCCCSSCCCCC
T ss_pred             CCCchHHHHHHHHHHHHHHHHHcCCCcccCHHHHHHhcCC-CC-CCCCCCHHHHHHHHHHhcCcccccCcCcCCCCCcCC
Confidence            6899999999999999999999999999999999999875 55 999999999999999877899999999999999998


Q ss_pred             cCC
Q 047400           82 VFQ   84 (88)
Q Consensus        82 ~~~   84 (88)
                      .+.
T Consensus        98 ~~~  100 (214)
T 1m6d_A           98 FSA  100 (214)
T ss_dssp             CCG
T ss_pred             CCc
Confidence            653



>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} SCOP: d.3.1.1 Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} SCOP: d.3.1.0 Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Back     alignment and structure
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} SCOP: d.3.1.0 Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Back     alignment and structure
>3tnx_A Papain; hydrolase, cytoplasm for recombinant expression; 2.62A {Carica papaya} Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Back     alignment and structure
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} SCOP: d.3.1.1 PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} SCOP: d.3.1.0 Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} SCOP: d.3.1.1 PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} SCOP: d.3.1.0 PDB: 4hwy_A* 3mor_A* Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Back     alignment and structure
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Back     alignment and structure
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} SCOP: d.3.1.0 PDB: 3s3q_A* 3s3r_A* Back     alignment and structure
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Back     alignment and structure
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1 protease, hydrolase; 1.73A {Saccharomyces cerevisiae} PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A 1gcb_A Back     alignment and structure
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease, SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cb5_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 88
d1aeca_ 218 d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwi 8e-13
g8pch.1 228 d.3.1.1 (P:,A:) Cathepsin H {Pig (Sus scrofa) [Tax 2e-12
d1o0ea_ 208 d.3.1.1 (A:) Ervatamin C {East indian rosebay (Erv 4e-12
d1cqda_ 216 d.3.1.1 (A:) Proline-specific cysteine protease {G 5e-12
d2oula1 241 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falcip 3e-11
d1m6da_ 214 d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [Ta 4e-11
d1iwda_ 215 d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia 8e-11
d2h7ja1 217 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sa 2e-10
d1fh0a_ 221 d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens 4e-10
d2r6na1 215 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sa 4e-10
d1me4a_ 215 d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 56 2e-09
d1gmya_ 254 d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens 2e-09
d1ppoa_ 216 d.3.1.1 (A:) Caricain (protease omega) {Papaya (Ca 2e-08
d1s4va_ 224 d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor 3e-08
d1yala_ 218 d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [ 4e-08
d1khqa_ 212 d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId 8e-07
g1k3b.1 233 d.3.1.1 (B:,C:) Cathepsin C (dipeptidyl peptidase 3e-06
d1deua_ 275 d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens 2e-04
d1cs8a_ 316 d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens 5e-04
d1xkga1 302 d.3.1.1 (A:4-305) Major mite fecal allergen der p 6e-04
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Length = 218 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: Actinidin
species: Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]
 Score = 58.9 bits (141), Expect = 8e-13
 Identities = 35/84 (41%), Positives = 49/84 (58%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 23  GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 82

Query: 65  INTERDYPNVGVMDNCKVFQFNWC 88
           INTE +YP       C V   N  
Sbjct: 83  INTEENYPYTAQDGECNVDLQNEK 106


>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]} Length = 208 Back     information, alignment and structure
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Length = 216 Back     information, alignment and structure
>d2oula1 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} Length = 241 Back     information, alignment and structure
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Length = 214 Back     information, alignment and structure
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Length = 215 Back     information, alignment and structure
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Length = 217 Back     information, alignment and structure
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Length = 221 Back     information, alignment and structure
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Length = 215 Back     information, alignment and structure
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Length = 215 Back     information, alignment and structure
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Length = 254 Back     information, alignment and structure
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Length = 216 Back     information, alignment and structure
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Length = 224 Back     information, alignment and structure
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Length = 218 Back     information, alignment and structure
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Length = 212 Back     information, alignment and structure
>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Length = 275 Back     information, alignment and structure
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Length = 316 Back     information, alignment and structure
>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Length = 302 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query88
d1xkga1 302 Major mite fecal allergen der p 1 {House-dust mite 99.93
d1ppoa_ 216 Caricain (protease omega) {Papaya (Carica papaya) 99.89
d1cs8a_ 316 (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 960 99.89
d1cqda_ 216 Proline-specific cysteine protease {Ginger rhizome 99.89
d1m6da_ 214 Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} 99.87
d1yala_ 218 Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} 99.86
d2oula1 241 Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} 99.85
d1khqa_ 212 Papain {Papaya (Carica papaya) [TaxId: 3649]} 99.85
d1fh0a_ 221 (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 960 99.85
d1o0ea_ 208 Ervatamin C {East indian rosebay (Ervatamia corona 99.85
d2r6na1 215 (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 960 99.85
d1deua_ 275 (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 960 99.85
d1iwda_ 215 Ervatamin B {Adam's apple (Ervatamia coronaria) [T 99.83
g8pch.1 228 Cathepsin H {Pig (Sus scrofa) [TaxId: 9823]} 99.83
d2h7ja1 217 (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 960 99.83
g1k3b.1 233 Cathepsin C (dipeptidyl peptidase I), catalytic do 99.83
d1aeca_ 218 Actinidin {Chinese gooseberry or kiwifruit (Actini 99.83
d1s4va_ 224 Vignain (bean endopeptidase) {Castor bean (Ricinus 99.81
d1me4a_ 215 Cruzain {Trypanosoma cruzi [TaxId: 5693]} 99.79
d1gmya_ 254 (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 960 99.79
d3gcba_ 458 Bleomycin hydrolase {Baker's yeast (Saccharomyces 96.77
d2cb5a_ 453 Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 96.47
>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: Major mite fecal allergen der p 1
species: House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]
Probab=99.93  E-value=2.1e-26  Score=159.16  Aligned_cols=80  Identities=29%  Similarity=0.490  Sum_probs=74.6

Q ss_pred             CCCchHHHHHHHHHHHHHHHHHhCCCcccchhhhhcccCCCCCCCCCCCcHHHHHHHHHhhCCccCCCCcccccCCCCcc
Q 047400            2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGINTERDYPNVGVMDNCK   81 (88)
Q Consensus         2 g~CgsCwAfa~~~~ie~~~~i~~~~~~~lS~Q~lidC~~~~~~~gC~GG~~~~a~~~~~~~~Gi~~e~~yPY~~~~~~C~   81 (88)
                      |.||||||||++++||+++.|++++++.||+||||||+.  +. +|.||++..+++|+.++ ||++|++|||.+.++.|+
T Consensus       106 G~CGsCwAfa~~~~lE~~~~i~~~~~~~lS~q~lvdC~~--~~-~C~gG~~~~a~~~~~~~-Gi~~e~~yPY~~~~~~C~  181 (302)
T d1xkga1         106 GGCGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCAS--QH-GCHGDTIPRGIEYIQHN-GVVQESYYRYVAREQSCR  181 (302)
T ss_dssp             CSSBCHHHHHHHHHHHHHHHHHHCCCCCBCHHHHHHHTC--SS-TTBCCCHHHHHHHHHHH-CEEBGGGSCCCSSCCCCC
T ss_pred             cccceehhhhhHHhHHHHHHHhcCCcccchhHHHHhccc--cC-ccCCCccchhhhhhccC-cccchhhccccccccccc
Confidence            689999999999999999999999999999999999985  44 99999999999999998 999999999999999998


Q ss_pred             cCCC
Q 047400           82 VFQF   85 (88)
Q Consensus        82 ~~~~   85 (88)
                      ....
T Consensus       182 ~~~~  185 (302)
T d1xkga1         182 RPNA  185 (302)
T ss_dssp             CCSS
T ss_pred             cccc
Confidence            6543



>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Back     information, alignment and structure
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Back     information, alignment and structure
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Back     information, alignment and structure
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Back     information, alignment and structure
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d3gcba_ d.3.1.1 (A:) Bleomycin hydrolase {Baker's yeast (Saccharomyces cerevisiae), Gal6 [TaxId: 4932]} Back     information, alignment and structure
>d2cb5a_ d.3.1.1 (A:) Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure