Psyllid ID: psy15346


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280
VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENGEEE
cccccccHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccHHHHHHHHHHcccEEEEEEEEccHHcccccccccccEEEEEEEccccccccccEEEEcccccccccEEEEEEEEcccccccccEEEccccccccccHHccEEEEEcccccccccEEEEEEccccccccccEEEEEEccccccccccccEEEEccccccccccccccccccccccccccccHHHHHccccc
ccccccHHHHHHHHHHcccEEccccccccccccccccccccccccccccccccccccccccccEccccccccccccccEcEcccEEccccHHHHHHHHHHHccEEEEEEEEHHHHHcccccEHHccEEEEEEEEHHHHHcccccEEEEEccccccccEEEEEEccccccccEEEEEEcccccccccEEEEEEEEEEcccccccccEEEEEEccccccccccEEEEEEccccccccHHEEcccccccccccccccccccccHHcccEEcccHHHHcccccc
vcssgissstWVWVHKrglvtggahhsntgcqpvsfppcnhanyttsepecktlatpqpkchtrctndnygrgffqdkyrfkryywvnDEVADIQQEIMKNGPVVANMYLYSDifsyksgkygngpvvANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGwgeengrpyWTIVRVYAVSASAEIVAYATVKLIgwgeengrpYWTIVSTfgeqfgdkgTIKILRGRNEAIIESLVngalpkdnygvefgeesgeRLSEEfgvraesseefrengeee
vcssgissstWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLatpqpkchtrctndnygrgfFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTfgeqfgdkGTIKILRGRNEAIIESlvngalpkdnYGVEFGEESgerlseefgvraesseefrengeee
VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSEEFGVRAesseefrengeee
********STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT****CKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGV*******************************
VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEF*****************************
********STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG****************************
VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY***********L***F*****************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSEEFGVRAESSEEFRENGEEE
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query280 2.2.26 [Sep-21-2011]
P43510379 Cathepsin B-like cysteine yes N/A 0.667 0.493 0.285 2e-23
P43509344 Cathepsin B-like cysteine no N/A 0.657 0.534 0.304 3e-23
P43508335 Cathepsin B-like cysteine no N/A 0.660 0.552 0.288 1e-22
P07858339 Cathepsin B OS=Homo sapie yes N/A 0.657 0.542 0.302 4e-22
Q5R6D1339 Cathepsin B OS=Pongo abel yes N/A 0.657 0.542 0.302 5e-22
Q4R5M2339 Cathepsin B OS=Macaca fas N/A N/A 0.657 0.542 0.302 5e-22
P00787339 Cathepsin B OS=Rattus nor yes N/A 0.65 0.536 0.288 3e-21
P25792340 Cathepsin B-like cysteine N/A N/A 0.632 0.520 0.291 7e-21
P10605339 Cathepsin B OS=Mus muscul yes N/A 0.675 0.557 0.286 8e-21
P43157342 Cathepsin B-like cysteine N/A N/A 0.653 0.535 0.292 2e-20
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans GN=cpr-6 PE=1 SV=1 Back     alignment and function desciption
 Score =  110 bits (274), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 234 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 293

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG ++G PYWT+   +  
Sbjct: 294 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 329

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 330 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 355

Query: 242 LPKDN 246
           +PK N
Sbjct: 356 IPKLN 360





Caenorhabditis elegans (taxid: 6239)
EC: 3EC: .EC: 4EC: .EC: 2EC: 2EC: .EC: -
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans GN=cpr-5 PE=2 SV=1 Back     alignment and function description
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans GN=cpr-4 PE=2 SV=1 Back     alignment and function description
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3 Back     alignment and function description
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1 Back     alignment and function description
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1 Back     alignment and function description
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2 Back     alignment and function description
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2 SV=1 Back     alignment and function description
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2 Back     alignment and function description
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum GN=CATB PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query280
256086863271 cathepsin B-like peptidase (C01 family) 0.657 0.678 0.318 1e-27
242001640223 cathepsin B endopeptidase, putative [Ixo 0.65 0.816 0.316 7e-26
240992699337 cathepsin B endopeptidase, putative [Ixo 0.653 0.543 0.303 9e-25
156255405337 cathepsin B [Fasciola hepatica] 0.653 0.543 0.314 2e-24
29374027337 cathepsin B [Fasciola gigantica] 0.653 0.543 0.309 3e-24
240992702337 cathepsin B endopeptidase, putative [Ixo 0.653 0.543 0.307 3e-24
56753605346 SJCHGC02852 protein [Schistosoma japonic 0.664 0.537 0.296 5e-24
126681075337 cathepsin B-like cysteine protease form 0.653 0.543 0.303 8e-24
187105118302 cathepsin B-5880 precursor [Acyrthosipho 0.642 0.596 0.301 1e-23
22535408347 cathepsin B-like protease [Nilaparvata l 0.653 0.527 0.319 5e-23
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni] gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni] Back     alignment and taxonomy information
 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/242 (31%), Positives = 113/242 (46%), Gaps = 58/242 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +    G+VTGG++ ++TGCQP  FP CNH + + S P C++   P P+C
Sbjct: 84  CRGGIPGMAWDYWKYEGIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPEC 143

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H  C  D+YG+ + +DK+  K  Y V  E   I +EI+ NGPV    Y+Y D  +YKSG 
Sbjct: 144 HETC-QDDYGKPYKKDKFYGKSSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSG- 201

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + +G Y        +    ++I+GWG                
Sbjct: 202 ---------------VYKHITGSY--------LGGHAIRIIGWGI--------------- 223

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                             ++N  PYW   +++  Q+GD+G  KILRG NE  IES+V   
Sbjct: 224 ------------------QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAG 265

Query: 242 LP 243
           LP
Sbjct: 266 LP 267




Source: Schistosoma mansoni

Species: Schistosoma mansoni

Genus: Schistosoma

Family: Schistosomatidae

Order: Strigeidida

Class: Trematoda

Phylum: Platyhelminthes

Superkingdom: Eukaryota

>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis] gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis] Back     alignment and taxonomy information
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis] gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis] Back     alignment and taxonomy information
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica] Back     alignment and taxonomy information
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica] Back     alignment and taxonomy information
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis] gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis] Back     alignment and taxonomy information
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum] Back     alignment and taxonomy information
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus] Back     alignment and taxonomy information
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum] gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum] gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum] gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum] Back     alignment and taxonomy information
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query280
UNIPROTKB|P07858339 CTSB "Cathepsin B" [Homo sapie 0.421 0.348 0.396 4.7e-32
UNIPROTKB|E2R6Q7339 CTSB "Uncharacterized protein" 0.421 0.348 0.404 8.8e-32
MGI|MGI:88561339 Ctsb "cathepsin B" [Mus muscul 0.421 0.348 0.380 1.2e-31
UNIPROTKB|P07688335 CTSB "Cathepsin B" [Bos taurus 0.421 0.352 0.396 1.4e-31
WB|WBGene00000786379 cpr-6 [Caenorhabditis elegans 0.428 0.316 0.360 1.5e-31
RGD|621509339 Ctsb "cathepsin B" [Rattus nor 0.421 0.348 0.380 1.6e-31
UNIPROTKB|Q6IN22339 Ctsb "Cathepsin B" [Rattus nor 0.421 0.348 0.380 1.6e-31
WB|WBGene00009347356 F32H5.1 [Caenorhabditis elegan 0.382 0.300 0.424 2.7e-30
UNIPROTKB|A1E295335 CTSB "Cathepsin B" [Sus scrofa 0.421 0.352 0.388 4.5e-30
UNIPROTKB|F1N9D7340 CTSB "Cathepsin B" [Gallus gal 0.425 0.35 0.363 6.8e-30
UNIPROTKB|P07858 CTSB "Cathepsin B" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
 Score = 224 (83.9 bits), Expect = 4.7e-32, Sum P(2) = 4.7e-32
 Identities = 48/121 (39%), Positives = 63/121 (52%)

Query:     2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
             C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct:   150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query:    62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
                C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct:   208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266

Query:   122 Y 122
             Y
Sbjct:   267 Y 267


GO:0050790 "regulation of catalytic activity" evidence=IEA
GO:0005739 "mitochondrion" evidence=IEA
GO:0042470 "melanosome" evidence=IEA
GO:0005515 "protein binding" evidence=IPI
GO:0042981 "regulation of apoptotic process" evidence=TAS
GO:0006508 "proteolysis" evidence=IDA
GO:0005764 "lysosome" evidence=IDA
GO:0097067 "cellular response to thyroid hormone stimulus" evidence=IEP
GO:0008234 "cysteine-type peptidase activity" evidence=IDA
GO:0048471 "perinuclear region of cytoplasm" evidence=IDA
GO:0005622 "intracellular" evidence=TAS
GO:0036021 "endolysosome lumen" evidence=TAS
GO:0045087 "innate immune response" evidence=TAS
GO:0008233 "peptidase activity" evidence=IDA
GO:0004197 "cysteine-type endopeptidase activity" evidence=IDA
GO:0005615 "extracellular space" evidence=ISS
GO:0005730 "nucleolus" evidence=IDA
GO:0043231 "intracellular membrane-bounded organelle" evidence=IDA
UNIPROTKB|E2R6Q7 CTSB "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
MGI|MGI:88561 Ctsb "cathepsin B" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|P07688 CTSB "Cathepsin B" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
WB|WBGene00000786 cpr-6 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
RGD|621509 Ctsb "cathepsin B" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q6IN22 Ctsb "Cathepsin B" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
WB|WBGene00009347 F32H5.1 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
UNIPROTKB|A1E295 CTSB "Cathepsin B" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|F1N9D7 CTSB "Cathepsin B" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer3.4.22LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query280
cd02620236 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B gro 6e-41
cd02621243 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; al 7e-16
pfam00112213 pfam00112, Peptidase_C1, Papain family cysteine pr 1e-12
PTZ00049693 PTZ00049, PTZ00049, cathepsin C-like protein; Prov 3e-10
PTZ00364548 PTZ00364, PTZ00364, dipeptidyl-peptidase I precurs 4e-08
smart00645175 smart00645, Pept_C1, Papain family cysteine protea 2e-07
cd02248210 cd02248, Peptidase_C1A, Peptidase C1A subfamily (M 2e-06
cd02698239 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; th 3e-06
PTZ00203348 PTZ00203, PTZ00203, cathepsin L protease; Provisio 6e-04
cd02619223 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS 0.002
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag) Back     alignment and domain information
 Score =  140 bits (356), Expect = 6e-41
 Identities = 68/237 (28%), Positives = 91/237 (38%), Gaps = 72/237 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W ++   G+VTGG       CQP + PPC H                 PKC
Sbjct: 69  CNGGYPDAAWKYLTTTGVVTGG-------CQPYTIPPCGHHPEGPPPCCGTP--YCTPKC 119

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C        + +DK++ K  Y V  +  DI +EIM NGPV A   +Y D   YKSG 
Sbjct: 120 QDGCEKT-----YEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSG- 173

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + SG          +    VKI                    
Sbjct: 174 ---------------VYQHTSGKQ--------LGGHAVKI-------------------- 190

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                         IGWG ENG PYW   +++G  +G+ G  +ILRG NE  IES V
Sbjct: 191 --------------IGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEV 233


Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues. . Length = 236

>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access Back     alignment and domain information
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease Back     alignment and domain information
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional Back     alignment and domain information
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional Back     alignment and domain information
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease Back     alignment and domain information
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W) Back     alignment and domain information
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity Back     alignment and domain information
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional Back     alignment and domain information
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 280
cd02620236 Peptidase_C1A_CathepsinB Cathepsin B group; compos 100.0
cd02698239 Peptidase_C1A_CathepsinX Cathepsin X; the only pap 100.0
cd02621243 Peptidase_C1A_CathepsinC Cathepsin C; also known a 100.0
PTZ00049693 cathepsin C-like protein; Provisional 100.0
KOG1543|consensus325 100.0
PTZ00364548 dipeptidyl-peptidase I precursor; Provisional 100.0
cd02248210 Peptidase_C1A Peptidase C1A subfamily (MEROPS data 100.0
KOG1542|consensus372 100.0
PTZ00203348 cathepsin L protease; Provisional 100.0
PF00112219 Peptidase_C1: Papain family cysteine protease This 100.0
PTZ00200448 cysteine proteinase; Provisional 100.0
PTZ00021489 falcipain-2; Provisional 100.0
KOG1544|consensus470 99.98
PTZ00462 1004 Serine-repeat antigen protein; Provisional 99.95
cd02619223 Peptidase_C1 C1 Peptidase family (MEROPS database 99.94
smart00645174 Pept_C1 Papain family cysteine protease. 99.94
cd00585437 Peptidase_C1B Peptidase C1B subfamily (MEROPS data 99.48
COG4870372 Cysteine protease [Posttranslational modification, 99.38
PF03051438 Peptidase_C1_2: Peptidase C1-like family This fami 98.3
PTZ00203348 cathepsin L protease; Provisional 95.75
cd02698239 Peptidase_C1A_CathepsinX Cathepsin X; the only pap 95.7
cd02621243 Peptidase_C1A_CathepsinC Cathepsin C; also known a 95.46
KOG1543|consensus325 95.44
smart00645174 Pept_C1 Papain family cysteine protease. 95.26
cd02620236 Peptidase_C1A_CathepsinB Cathepsin B group; compos 94.88
COG3579444 PepC Aminopeptidase C [Amino acid transport and me 94.8
KOG1542|consensus372 94.52
cd02248210 Peptidase_C1A Peptidase C1A subfamily (MEROPS data 94.44
PTZ00200448 cysteine proteinase; Provisional 93.61
PTZ00364548 dipeptidyl-peptidase I precursor; Provisional 93.4
KOG1544|consensus470 92.86
PTZ00049693 cathepsin C-like protein; Provisional 92.52
PF13529144 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3 92.26
PTZ00021489 falcipain-2; Provisional 92.23
PF00112219 Peptidase_C1: Papain family cysteine protease This 92.03
cd02619223 Peptidase_C1 C1 Peptidase family (MEROPS database 91.65
PTZ00462 1004 Serine-repeat antigen protein; Provisional 90.52
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag) Back     alignment and domain information
Probab=100.00  E-value=3e-38  Score=284.89  Aligned_cols=168  Identities=39%  Similarity=0.788  Sum_probs=130.8

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++..||+||+++|+++|.+|       ||+...|..+...  ...|..    ++.|...|.. .....+....++
T Consensus        68 gC~GG~~~~a~~~i~~~G~~~e~~y-------PY~~~~~~~~~~~--~~~~~~----~~~~~~~C~~-~~~~~~~~~~~~  133 (236)
T cd02620          68 GCNGGYPDAAWKYLTTTGVVTGGCQ-------PYTIPPCGHHPEG--PPPCCG----TPYCTPKCQD-GCEKTYEEDKHK  133 (236)
T ss_pred             CCCCCCHHHHHHHHHhcCCCcCCEe-------cCcCCCCccCCCC--CCCCCC----CCCCCCCCCc-CCccccceeeee
Confidence            7999999999999999999997666       9996544331111  122322    1233344541 111123445566


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK  160 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~  160 (280)
                      +..++.+..++++||++|+++|||+++|.++++|+.                       |++|||+.....         
T Consensus       134 ~~~~~~~~~~~~~ik~~l~~~GPv~v~i~~~~~f~~-----------------------Y~~Giy~~~~~~---------  181 (236)
T cd02620         134 GKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-----------------------YKSGVYQHTSGK---------  181 (236)
T ss_pred             ecceeeeCCHHHHHHHHHHHCCCeEEEEEechhhhh-----------------------cCCcEEeecCCC---------
Confidence            777787766789999999999999999999888999                       999999865322         


Q ss_pred             eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeee
Q psy15346        161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG  240 (280)
Q Consensus       161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~  240 (280)
                      .                          .++|||+|||||++++++|||||||||++||++|||||+||.|.|||+++++.
T Consensus       182 ~--------------------------~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~~~  235 (236)
T cd02620         182 Q--------------------------LGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVA  235 (236)
T ss_pred             C--------------------------cCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccceec
Confidence            1                          56899999999999999999999999999999999999999999999998875



Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane

>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity Back     alignment and domain information
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access Back     alignment and domain information
>PTZ00049 cathepsin C-like protein; Provisional Back     alignment and domain information
>KOG1543|consensus Back     alignment and domain information
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional Back     alignment and domain information
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W) Back     alignment and domain information
>KOG1542|consensus Back     alignment and domain information
>PTZ00203 cathepsin L protease; Provisional Back     alignment and domain information
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification Back     alignment and domain information
>PTZ00200 cysteine proteinase; Provisional Back     alignment and domain information
>PTZ00021 falcipain-2; Provisional Back     alignment and domain information
>KOG1544|consensus Back     alignment and domain information
>PTZ00462 Serine-repeat antigen protein; Provisional Back     alignment and domain information
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase) Back     alignment and domain information
>smart00645 Pept_C1 Papain family cysteine protease Back     alignment and domain information
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC) Back     alignment and domain information
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families Back     alignment and domain information
>PTZ00203 cathepsin L protease; Provisional Back     alignment and domain information
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity Back     alignment and domain information
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access Back     alignment and domain information
>KOG1543|consensus Back     alignment and domain information
>smart00645 Pept_C1 Papain family cysteine protease Back     alignment and domain information
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag) Back     alignment and domain information
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism] Back     alignment and domain information
>KOG1542|consensus Back     alignment and domain information
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W) Back     alignment and domain information
>PTZ00200 cysteine proteinase; Provisional Back     alignment and domain information
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional Back     alignment and domain information
>KOG1544|consensus Back     alignment and domain information
>PTZ00049 cathepsin C-like protein; Provisional Back     alignment and domain information
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A Back     alignment and domain information
>PTZ00021 falcipain-2; Provisional Back     alignment and domain information
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification Back     alignment and domain information
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase) Back     alignment and domain information
>PTZ00462 Serine-repeat antigen protein; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query280
1gmy_A261 Cathepsin B Complexed With Dipeptidyl Nitrile Inhib 3e-23
3cbj_A266 Chagasin-cathepsin B Complex Length = 266 1e-22
1huc_B205 The Refined 2.15 Angstroms X-Ray Crystal Structure 1e-22
3ai8_B256 Cathepsin B In Complex With The Nitroxoline Length 1e-22
3k9m_A254 Cathepsin B In Complex With Stefin A Length = 254 2e-22
1pbh_A317 Crystal Structure Of Human Recombinant Procathepsin 2e-22
3qsd_A254 Structure Of Cathepsin B1 From Schistosoma Mansoni 4e-22
1cpj_A260 Crystal Structures Of Recombinant Rat Cathepsin B A 6e-22
1cte_A254 Crystal Structures Of Recombinant Rat Cathepsin B A 6e-22
1mir_A322 Rat Procathepsin B Length = 322 7e-22
1sp4_B205 Crystal Structure Of Ns-134 In Complex With Bovine 2e-21
1qdq_A253 X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074 3e-21
1ito_A256 Crystal Structure Analysis Of Bovine Spleen Catheps 4e-21
3mor_A317 Crystal Structure Of Cathepsin B From Trypanosoma B 3e-08
4hwy_A340 Trypanosoma Brucei Procathepsin B Solved From 40 Fs 4e-08
3hhi_A325 Crystal Structure Of Cathepsin B From T. Brucei In 4e-08
1jqp_A438 Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric 1e-04
8pch_A220 Crystal Structure Of Porcine Cathepsin H Determined 2e-04
>pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor Length = 261 Back     alignment and structure

Iteration: 1

Score = 105 bits (262), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%) Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61 C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC Sbjct: 72 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129 Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121 C Y + QDK+ Y V++ DI EI KNGPV FS Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 176 Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181 +YSD YKSGVY Sbjct: 177 -----------VYSDFLLYKSGVYQ----------------------------------- 190 Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241 + E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V Sbjct: 191 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 250 Query: 242 LPKDN 246 +P+ + Sbjct: 251 IPRTD 255
>pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex Length = 266 Back     alignment and structure
>pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of Human Liver Cathepsin B: The Structural Basis For Its Specificity Length = 205 Back     alignment and structure
>pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline Length = 256 Back     alignment and structure
>pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A Length = 254 Back     alignment and structure
>pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At 3.2 Angstrom Resolution Length = 317 Back     alignment and structure
>pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In Complex With Ca074 Inhibitor Length = 254 Back     alignment and structure
>pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A Cathepsin B-Inhibitor Complex: Implications For Structure- Based Inhibitor Design Length = 260 Back     alignment and structure
>pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A Cathepsin B-Inhibitor Complex: Implications For Structure- Based Inhibitor Design Length = 254 Back     alignment and structure
>pdb|1MIR|A Chain A, Rat Procathepsin B Length = 322 Back     alignment and structure
>pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor Extends Along The Whole Active Site Cleft Length = 205 Back     alignment and structure
>pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074 Complex Length = 253 Back     alignment and structure
>pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B- E64c Complex Length = 256 Back     alignment and structure
>pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei Length = 317 Back     alignment and structure
>pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs Free-electron Laser Pulse Data By Serial Femtosecond X-ray Crystallography Length = 340 Back     alignment and structure
>pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex With Ca074 Length = 325 Back     alignment and structure
>pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric Cysteine Protease Of The Papain Family Length = 438 Back     alignment and structure
>pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1 Angstrom Resolution: Location Of The Mini-Chain C-Terminal Carboxyl Group Defines Cathepsin H Aminopeptidase Function Length = 220 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query280
3pbh_A317 Procathepsin B; thiol protease, cysteine protease, 3e-44
3qsd_A254 Cathepsin B-like peptidase (C01 family); cysteine 3e-44
3cbj_A266 Cathepsin B; cathepsin B, occluding loop, chagas d 9e-43
3hhi_A325 Cathepsin B-like cysteine protease; occluding loop 4e-39
1deu_A277 Procathepsin X; cysteine protease, proregion, pros 6e-23
3pdf_A441 Cathepsin C, dipeptidyl peptidase 1; two domains, 3e-20
3ois_A291 Cysteine protease; alpha and beta, hydrolase; HET: 8e-10
3qt4_A329 Cathepsin-L-like midgut cysteine proteinase; hydro 3e-05
1m6d_A214 Cathepsin F, catsf; papain family cysteine proteas 3e-05
3ovx_A218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 5e-05
3kwz_A215 Cathepsin K; enzyme inhibitor, covalent reversible 5e-05
1by8_A314 Protein (procathepsin K); hydrolase(sulfhydryl pro 5e-05
2c0y_A315 Procathepsin S; proenzyme, proteinase, hydrolase, 6e-05
2o6x_A310 Procathepsin L1, secreted cathepsin L 1; hydrolase 8e-05
1iwd_A215 Ervatamin B; cysteine protease, alpha-beta protein 8e-05
1pci_A322 Procaricain; zymogen, hydrolase, thiol protease; 3 8e-05
3i06_A215 Cruzipain; autocatalytic cleavage, glycoprotein, p 9e-05
3p5u_A220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 9e-05
3f5v_A222 DER P 1 allergen; allergy, asthma, DUST mites, gly 1e-04
1xkg_A312 DER P I, major mite fecal allergen DER P 1; major 1e-04
1cqd_A221 Protein (protease II); cysteine protease, glycopro 1e-04
8pch_A220 Cathepsin H; hydrolase, protease, cysteine protein 1e-04
1yal_A218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 2e-04
2b1m_A246 SPE31; papain-like, sugar binding protein; HET: NA 2e-04
3qj3_A331 Cathepsin L-like protein; hydrolase, proteinase, l 3e-04
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Length = 317 Back     alignment and structure
 Score =  151 bits (383), Expect = 3e-44
 Identities = 73/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD F      
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD-F------ 243

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YKSGVY       +  +A                        
Sbjct: 244 ----------------LLYKSGVYQHVTGEMMGGHA------------------------ 263

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 264 -----------IRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 312

Query: 242 LPKDN 246
           +P+ +
Sbjct: 313 IPRTD 317


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A* Length = 254 Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Length = 266 Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} PDB: 3mor_A* Length = 325 Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Length = 277 Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Length = 441 Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Length = 291 Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Length = 329 Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Length = 214 Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Length = 218 Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Length = 215 Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Length = 314 Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Length = 315 Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Length = 310 Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Length = 215 Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Length = 322 Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Length = 215 Back     alignment and structure
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A Length = 220 Back     alignment and structure
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein, hydrola protease, secreted, thiol protease; HET: P6G; 1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A 3rvw_A* 3rvx_A 3rvv_A* 3d6s_A* Length = 222 Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Length = 312 Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Length = 221 Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Length = 220 Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Length = 218 Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Length = 246 Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} Length = 331 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query280
3pbh_A317 Procathepsin B; thiol protease, cysteine protease, 100.0
3qsd_A254 Cathepsin B-like peptidase (C01 family); cysteine 100.0
3cbj_A266 Cathepsin B; cathepsin B, occluding loop, chagas d 100.0
3hhi_A325 Cathepsin B-like cysteine protease; occluding loop 100.0
8pch_A220 Cathepsin H; hydrolase, protease, cysteine protein 100.0
3i06_A215 Cruzipain; autocatalytic cleavage, glycoprotein, p 100.0
1m6d_A214 Cathepsin F, catsf; papain family cysteine proteas 100.0
3kwz_A215 Cathepsin K; enzyme inhibitor, covalent reversible 100.0
1deu_A277 Procathepsin X; cysteine protease, proregion, pros 100.0
2xu3_A220 Cathepsin L1; hydrolase, drug design, thiol protea 100.0
3qt4_A329 Cathepsin-L-like midgut cysteine proteinase; hydro 100.0
3ovx_A218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 100.0
2o6x_A310 Procathepsin L1, secreted cathepsin L 1; hydrolase 100.0
1xkg_A312 DER P I, major mite fecal allergen DER P 1; major 100.0
2b1m_A246 SPE31; papain-like, sugar binding protein; HET: NA 100.0
3f5v_A222 DER P 1 allergen; allergy, asthma, DUST mites, gly 100.0
1iwd_A215 Ervatamin B; cysteine protease, alpha-beta protein 100.0
1ppo_A216 Protease omega; hydrolase(thiol protease); 1.80A { 100.0
2c0y_A315 Procathepsin S; proenzyme, proteinase, hydrolase, 100.0
3p5u_A220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 100.0
3qj3_A331 Cathepsin L-like protein; hydrolase, proteinase, l 100.0
3pdf_A441 Cathepsin C, dipeptidyl peptidase 1; two domains, 100.0
3u8e_A222 Papain-like cysteine protease; papain-like cystein 100.0
1yal_A218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 100.0
1cqd_A221 Protein (protease II); cysteine protease, glycopro 100.0
1by8_A314 Protein (procathepsin K); hydrolase(sulfhydryl pro 100.0
1cs8_A316 Human procathepsin L; prosegment, propeptide, inhi 100.0
1pci_A322 Procaricain; zymogen, hydrolase, thiol protease; 3 100.0
3f75_A224 Toxopain-2, cathepsin L protease; medical structur 100.0
3ioq_A213 CMS1MS2; caricaceae, cysteine protease, papain fam 100.0
2oul_A241 Falcipain 2; cysteine protease, inhibitor, macromo 100.0
2cio_A212 Papain; hydrolase/inhibitor, complex hydrolase/inh 100.0
2bdz_A214 Mexicain; cysteine protease, peptidase_C1, papain- 100.0
1s4v_A229 Cysteine endopeptidase; KDEL ER retention signal, 100.0
1o0e_A208 Ervatamin C; plant cysteine protease, two domain, 100.0
3bwk_A243 Cysteine protease falcipain-3; malaria, hydrolase; 100.0
2fo5_A262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 100.0
3tnx_A363 Papain; hydrolase, cytoplasm for recombinant expre 100.0
2wbf_X265 Serine-repeat antigen protein; SERA, malaria, vacu 100.0
3ois_A291 Cysteine protease; alpha and beta, hydrolase; HET: 99.97
2cb5_A453 Protein (bleomycin hydrolase); aminopeptidase, cys 99.84
2e01_A457 Cysteine proteinase 1; bleomycin hydrolase, thiol 99.82
3pw3_A383 Aminopeptidase C; bleomycin, cysteine proteinase f 99.52
3pbh_A317 Procathepsin B; thiol protease, cysteine protease, 96.41
3i06_A215 Cruzipain; autocatalytic cleavage, glycoprotein, p 96.18
3hhi_A325 Cathepsin B-like cysteine protease; occluding loop 96.18
1m6d_A214 Cathepsin F, catsf; papain family cysteine proteas 96.13
3cbj_A266 Cathepsin B; cathepsin B, occluding loop, chagas d 96.11
1deu_A277 Procathepsin X; cysteine protease, proregion, pros 96.01
2b1m_A246 SPE31; papain-like, sugar binding protein; HET: NA 95.93
3p5u_A220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 95.86
1cqd_A221 Protein (protease II); cysteine protease, glycopro 95.86
1iwd_A215 Ervatamin B; cysteine protease, alpha-beta protein 95.83
2o6x_A310 Procathepsin L1, secreted cathepsin L 1; hydrolase 95.83
1pci_A322 Procaricain; zymogen, hydrolase, thiol protease; 3 95.81
3f5v_A222 DER P 1 allergen; allergy, asthma, DUST mites, gly 95.8
1ppo_A216 Protease omega; hydrolase(thiol protease); 1.80A { 95.8
2c0y_A315 Procathepsin S; proenzyme, proteinase, hydrolase, 95.8
1yal_A218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 95.79
1xkg_A312 DER P I, major mite fecal allergen DER P 1; major 95.79
1by8_A314 Protein (procathepsin K); hydrolase(sulfhydryl pro 95.73
2xu3_A220 Cathepsin L1; hydrolase, drug design, thiol protea 95.73
8pch_A220 Cathepsin H; hydrolase, protease, cysteine protein 95.71
3kwz_A215 Cathepsin K; enzyme inhibitor, covalent reversible 95.69
3ovx_A218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 95.63
1s4v_A229 Cysteine endopeptidase; KDEL ER retention signal, 95.62
3u8e_A222 Papain-like cysteine protease; papain-like cystein 95.6
3qsd_A254 Cathepsin B-like peptidase (C01 family); cysteine 95.6
3qt4_A329 Cathepsin-L-like midgut cysteine proteinase; hydro 95.51
2fo5_A262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 95.49
3qj3_A331 Cathepsin L-like protein; hydrolase, proteinase, l 95.48
2wbf_X265 Serine-repeat antigen protein; SERA, malaria, vacu 95.33
1cs8_A316 Human procathepsin L; prosegment, propeptide, inhi 95.32
3f75_A224 Toxopain-2, cathepsin L protease; medical structur 95.2
2oul_A241 Falcipain 2; cysteine protease, inhibitor, macromo 94.87
3bwk_A243 Cysteine protease falcipain-3; malaria, hydrolase; 94.8
3pdf_A441 Cathepsin C, dipeptidyl peptidase 1; two domains, 94.79
3ois_A291 Cysteine protease; alpha and beta, hydrolase; HET: 93.95
2cio_A212 Papain; hydrolase/inhibitor, complex hydrolase/inh 92.52
1o0e_A208 Ervatamin C; plant cysteine protease, two domain, 92.49
2bdz_A214 Mexicain; cysteine protease, peptidase_C1, papain- 92.2
3ioq_A213 CMS1MS2; caricaceae, cysteine protease, papain fam 91.38
3tnx_A363 Papain; hydrolase, cytoplasm for recombinant expre 91.27
3pw3_A383 Aminopeptidase C; bleomycin, cysteine proteinase f 86.94
1pxv_A183 Cysteine protease; hydrolase; 1.80A {Staphylococcu 85.81
1cv8_A174 Staphopain; cysteine protease, thiol protease, pap 83.99
1x9y_A367 Cysteine proteinase; half-barrel, barrel-sandwich- 81.24
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Back     alignment and structure
Probab=100.00  E-value=9.6e-47  Score=352.95  Aligned_cols=184  Identities=38%  Similarity=0.771  Sum_probs=164.1

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||+||++..||+|++++||++|++|++.++|+||..++|.|+ ..+..+.|... ..++.|...|. ..+.+.+..+.++
T Consensus       133 GC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~c~PY~~~~c~~~-~~~~~~~C~~~-~~~~~c~~~c~-~~~~~~~~~~~~~  209 (317)
T 3pbh_A          133 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH-VNGSRPPCTGE-GDTPKCSKICE-PGYSPTYKQDKHY  209 (317)
T ss_dssp             GGGCBCHHHHHHHHHHTCEEBBCSTTCCCBSSCCCSCCCCCC-CTTCCSCCCSC-CCCCCCCCSCC-SSCCSCGGGSEEC
T ss_pred             CCCCCCHHHHHHHHHHhCCCcchhccCCCCCcCcccCccccc-ccCcCCCCCCc-CCCCccccccc-CCCccceeeeeee
Confidence            799999999999999999999999999999999999999985 45567889865 36788988887 5566667777777


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK  160 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~  160 (280)
                      ....|.++.++++||++|+++|||+++|.++++|+.                       |++|||.+....         
T Consensus       210 ~~~~~~v~~~e~~i~~~i~~~GPV~v~i~~~~~f~~-----------------------Y~~GVy~~~~~~---------  257 (317)
T 3pbh_A          210 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-----------------------YKSGVYQHVTGE---------  257 (317)
T ss_dssp             BCCCEEECSCHHHHHHHHHHHCCEEEEEEEEGGGGG-----------------------EEEEEECCCSCC---------
T ss_pred             eeecccCCcHHHHHHHHHHHCCCEEEEEEecccccC-----------------------CCCcEEccCCCC---------
Confidence            777778877899999999999999999999989999                       999999886543         


Q ss_pred             eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeee
Q psy15346        161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG  240 (280)
Q Consensus       161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~  240 (280)
                      .                          .++|||+|||||++++++|||||||||++||++|||||+||.|+||||+.+++
T Consensus       258 ~--------------------------~~~HaV~iVGyG~~~g~~YWivkNSWG~~WGe~GY~ri~rg~n~CgI~~~~~a  311 (317)
T 3pbh_A          258 M--------------------------MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA  311 (317)
T ss_dssp             E--------------------------EEEEEEEEEEEEEETTEEEEEEECSBCTTSTBTTEEEEECSSCGGGTTTSCEE
T ss_pred             C--------------------------CCCEEEEEEEEEEeCCeeEEEEEcCCCCCcCCCcEEEEEcCCCcccCCCceEe
Confidence            2                          56999999999999999999999999999999999999999999999999999


Q ss_pred             Eeecc
Q psy15346        241 ALPKD  245 (280)
Q Consensus       241 ~~p~~  245 (280)
                      ++|++
T Consensus       312 ~~p~~  316 (317)
T 3pbh_A          312 GIPRT  316 (317)
T ss_dssp             CCBCC
T ss_pred             eecCC
Confidence            99975



>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} SCOP: d.3.1.0 PDB: 3s3q_A* 3s3r_A* Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} SCOP: d.3.1.0 PDB: 4hwy_A* 3mor_A* Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} SCOP: d.3.1.1 PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} SCOP: d.3.1.1 PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Back     alignment and structure
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein, hydrola protease, secreted, thiol protease; HET: P6G; 1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A 3rvw_A* 3rvx_A 3rvv_A* 3d6s_A* Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Back     alignment and structure
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia arguta} SCOP: d.3.1.1 PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} SCOP: d.3.1.0 Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} SCOP: d.3.1.0 Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Back     alignment and structure
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} SCOP: d.3.1.0 Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} SCOP: d.3.1.1 Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Back     alignment and structure
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Back     alignment and structure
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Back     alignment and structure
>3tnx_A Papain; hydrolase, cytoplasm for recombinant expression; 2.62A {Carica papaya} Back     alignment and structure
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Back     alignment and structure
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease, SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cb5_A Back     alignment and structure
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1 protease, hydrolase; 1.73A {Saccharomyces cerevisiae} PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A 1gcb_A Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Back     alignment and structure
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} SCOP: d.3.1.1 PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} SCOP: d.3.1.0 PDB: 4hwy_A* 3mor_A* Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Back     alignment and structure
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia arguta} SCOP: d.3.1.1 PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Back     alignment and structure
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein, hydrola protease, secreted, thiol protease; HET: P6G; 1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A 3rvw_A* 3rvx_A 3rvv_A* 3d6s_A* Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} SCOP: d.3.1.1 PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} SCOP: d.3.1.0 Back     alignment and structure
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} SCOP: d.3.1.0 PDB: 3s3q_A* 3s3r_A* Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} SCOP: d.3.1.0 Back     alignment and structure
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Back     alignment and structure
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} SCOP: d.3.1.0 Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Back     alignment and structure
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Back     alignment and structure
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} SCOP: d.3.1.1 Back     alignment and structure
>3tnx_A Papain; hydrolase, cytoplasm for recombinant expression; 2.62A {Carica papaya} Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Back     alignment and structure
>1pxv_A Cysteine protease; hydrolase; 1.80A {Staphylococcus aureus} SCOP: d.3.1.1 PDB: 1y4h_A Back     alignment and structure
>1cv8_A Staphopain; cysteine protease, thiol protease, papain family; HET: E64; 1.75A {Staphylococcus aureus} SCOP: d.3.1.1 Back     alignment and structure
>1x9y_A Cysteine proteinase; half-barrel, barrel-sandwich-hybrid, hydrolase; 2.50A {Staphylococcus aureus} SCOP: d.3.1.1 d.17.1.4 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 280
d1gmya_254 d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens 3e-20
g1k3b.1233 d.3.1.1 (B:,C:) Cathepsin C (dipeptidyl peptidase 2e-10
d1xkga1302 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1e-09
g8pch.1228 d.3.1.1 (P:,A:) Cathepsin H {Pig (Sus scrofa) [Tax 1e-09
d1deua_275 d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens 3e-08
d1m6da_214 d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [Ta 2e-07
d1me4a_215 d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 56 7e-07
d1cs8a_316 d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens 1e-06
d1yala_218 d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [ 5e-06
d1ppoa_216 d.3.1.1 (A:) Caricain (protease omega) {Papaya (Ca 7e-06
d2h7ja1217 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sa 1e-05
d1aeca_218 d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwi 2e-05
d2oula1241 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falcip 2e-05
d1cqda_216 d.3.1.1 (A:) Proline-specific cysteine protease {G 3e-05
d2r6na1215 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sa 4e-05
d1s4va_224 d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor 4e-05
d1iwda_215 d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia 5e-05
d1fh0a_221 d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens 4e-04
d1khqa_212 d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId 0.004
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Length = 254 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: (Pro)cathepsin B
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 85.4 bits (210), Expect = 3e-20
 Identities = 72/243 (29%), Positives = 100/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 72  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 188

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                                  ++I+GWG ENG PYW +V     
Sbjct: 189 YQHVTGEMMGG------------------------HAIRILGWGVENGTPYW-LVA---- 219

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                        +++   +GD G  KILRG++   IES V   
Sbjct: 220 -----------------------------NSWNTDWGDNGFFKILRGQDHCGIESEVVAG 250

Query: 242 LPK 244
           +P+
Sbjct: 251 IPR 253


>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Length = 302 Back     information, alignment and structure
>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Length = 275 Back     information, alignment and structure
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Length = 214 Back     information, alignment and structure
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Length = 215 Back     information, alignment and structure
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Length = 316 Back     information, alignment and structure
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Length = 218 Back     information, alignment and structure
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Length = 216 Back     information, alignment and structure
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Length = 217 Back     information, alignment and structure
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Length = 218 Back     information, alignment and structure
>d2oula1 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} Length = 241 Back     information, alignment and structure
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Length = 216 Back     information, alignment and structure
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Length = 215 Back     information, alignment and structure
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Length = 224 Back     information, alignment and structure
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Length = 215 Back     information, alignment and structure
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Length = 221 Back     information, alignment and structure
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Length = 212 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query280
d1gmya_254 (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 960 100.0
d1deua_275 (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 960 100.0
g8pch.1228 Cathepsin H {Pig (Sus scrofa) [TaxId: 9823]} 100.0
g1k3b.1233 Cathepsin C (dipeptidyl peptidase I), catalytic do 100.0
d1m6da_214 Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} 100.0
d1yala_218 Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} 99.98
d1xkga1302 Major mite fecal allergen der p 1 {House-dust mite 99.98
d1cs8a_316 (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 960 99.98
d1me4a_215 Cruzain {Trypanosoma cruzi [TaxId: 5693]} 99.97
d1cqda_216 Proline-specific cysteine protease {Ginger rhizome 99.97
d2h7ja1217 (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 960 99.97
d1ppoa_216 Caricain (protease omega) {Papaya (Carica papaya) 99.97
d2r6na1215 (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 960 99.97
d2oula1241 Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} 99.97
d1aeca_218 Actinidin {Chinese gooseberry or kiwifruit (Actini 99.97
d1fh0a_221 (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 960 99.96
d1iwda_215 Ervatamin B {Adam's apple (Ervatamia coronaria) [T 99.96
d1khqa_212 Papain {Papaya (Carica papaya) [TaxId: 3649]} 99.96
d1s4va_224 Vignain (bean endopeptidase) {Castor bean (Ricinus 99.96
d1o0ea_208 Ervatamin C {East indian rosebay (Ervatamia corona 99.96
d3gcba_458 Bleomycin hydrolase {Baker's yeast (Saccharomyces 98.64
d2cb5a_453 Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 98.17
d1aeca_218 Actinidin {Chinese gooseberry or kiwifruit (Actini 96.37
d1me4a_215 Cruzain {Trypanosoma cruzi [TaxId: 5693]} 96.13
d1xkga1302 Major mite fecal allergen der p 1 {House-dust mite 95.5
d1yala_218 Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} 95.37
d1cqda_216 Proline-specific cysteine protease {Ginger rhizome 95.29
d2h7ja1217 (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 960 95.26
d1m6da_214 Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} 95.26
d1deua_275 (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 960 95.23
g8pch.1228 Cathepsin H {Pig (Sus scrofa) [TaxId: 9823]} 95.15
d1gmya_254 (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 960 94.92
d1cs8a_316 (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 960 94.86
d1ppoa_216 Caricain (protease omega) {Papaya (Carica papaya) 94.71
d1s4va_224 Vignain (bean endopeptidase) {Castor bean (Ricinus 94.62
d1iwda_215 Ervatamin B {Adam's apple (Ervatamia coronaria) [T 94.22
g1k3b.1233 Cathepsin C (dipeptidyl peptidase I), catalytic do 93.77
d2r6na1215 (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 960 93.75
d2oula1241 Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} 90.47
d1fh0a_221 (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 960 90.33
d1khqa_212 Papain {Papaya (Carica papaya) [TaxId: 3649]} 89.24
d1o0ea_208 Ervatamin C {East indian rosebay (Ervatamia corona 88.75
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: (Pro)cathepsin B
species: Human (Homo sapiens) [TaxId: 9606]
Probab=100.00  E-value=5.2e-38  Score=278.41  Aligned_cols=183  Identities=38%  Similarity=0.778  Sum_probs=142.7

Q ss_pred             CCCCCchHHHHHHHHhcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccCCCCCCCcccccccCCCCCcccccceee
Q psy15346          1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR   80 (280)
Q Consensus         1 gC~GG~~~~A~~yi~~~Gi~te~~Y~~~~~C~PY~~~~C~~~~~~~~~~~C~~~~~~~p~c~~~c~~~~~~~~~~~~~~~   80 (280)
                      ||.||++..|+++++++|++++..|.....|.+|..+.|... .++..+.|.... ..+.....|.. .....+....+.
T Consensus        71 gc~gg~~~~a~~~~~~~G~~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~  147 (254)
T d1gmya_          71 GCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHH-VNGSRPPCTGEG-DTPKCSKICEP-GYSPTYKQDKHY  147 (254)
T ss_dssp             GGGCBCHHHHHHHHHHTCBCBCCSTTCCCSSSCCCSCCCBSS-SCCSSCBCCSCC-CCCCCCCSCCT-TCCSCHHHHCBC
T ss_pred             CCCCCcHHHHHHHHHHcCcccccccccccccccccccccccc-ccCccCcccccc-CCccccccccC-Ccccceeeeeee
Confidence            699999999999999999999999988888888887776552 344445555432 22233333331 122222222222


Q ss_pred             eEEEEEcCchHHHHHHHHHhCCcEEEEEEeCccccccccCccCCCcccccccccccccccccceeecCCchhhhhhheee
Q psy15346         81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK  160 (280)
Q Consensus        81 i~~~y~~~~~~~~Ik~~I~~~GPV~v~~~v~~~f~~~~~g~~~~~p~~~~~~~~~~~~~Y~~GVy~~~~~~~~~~~~~~~  160 (280)
                      ....+.....++.||++|+++|||+++|.++++|..                       |++||+......         
T Consensus       148 ~~~~~~~~~~~~~ik~~l~~~gpv~~~~~~~~~f~~-----------------------y~~gi~~~~~~~---------  195 (254)
T d1gmya_         148 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-----------------------YKSGVYQHVTGE---------  195 (254)
T ss_dssp             BSCCEECCSCHHHHHHHHHHHCCEEEEEEEEGGGTT-----------------------CCSSEECCCSCC---------
T ss_pred             eeeeeccccHHHHHHHHHHHcCCEEEEEEeechhhh-----------------------ccCCcccccccc---------
Confidence            333344556789999999999999999999989999                       999999766433         


Q ss_pred             eeccCcCCCCCceeeeeeeecccCccccCCeEEEEEEeeccCCccEEEEEcCCCCCCCCCceEEEEccCCcccccceeee
Q psy15346        161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG  240 (280)
Q Consensus       161 ~~gwg~~~~~~~w~~~~~~~~~~~~~~~~~HaV~IVGwG~e~g~~YWiirNSWG~~WG~~Gy~kI~rg~n~cgIe~~~~~  240 (280)
                      .                          .++|||+|||||++++.+|||||||||++||++|||||+||.|.||||+++++
T Consensus       196 ~--------------------------~~~Hav~IVGyg~~~g~~ywIvkNSWG~~WGd~GYf~i~~~~n~cgi~~~~~~  249 (254)
T d1gmya_         196 M--------------------------MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA  249 (254)
T ss_dssp             E--------------------------EEEEEEEEEEEEEETTEEEEEEECSBCTTSTBTTEEEEECSSCGGGTTTSCEE
T ss_pred             c--------------------------cccEEEEEEEEeccCCceEEEEEcCCCCCcCCCceEEEEcCCCccCcCCceEe
Confidence            2                          56999999999999999999999999999999999999999999999999999


Q ss_pred             Eeec
Q psy15346        241 ALPK  244 (280)
Q Consensus       241 ~~p~  244 (280)
                      ++|+
T Consensus       250 ~~p~  253 (254)
T d1gmya_         250 GIPR  253 (254)
T ss_dssp             CCBC
T ss_pred             ccCC
Confidence            9996



>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Back     information, alignment and structure
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Back     information, alignment and structure
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Back     information, alignment and structure
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Back     information, alignment and structure
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Back     information, alignment and structure
>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure
>d3gcba_ d.3.1.1 (A:) Bleomycin hydrolase {Baker's yeast (Saccharomyces cerevisiae), Gal6 [TaxId: 4932]} Back     information, alignment and structure
>d2cb5a_ d.3.1.1 (A:) Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Back     information, alignment and structure
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Back     information, alignment and structure
>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Back     information, alignment and structure
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Back     information, alignment and structure
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Back     information, alignment and structure
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Back     information, alignment and structure
>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]} Back     information, alignment and structure