Psyllid ID: psy8713


Local Sequence Feature Prediction

Prediction (Method); the result rows below follow the same order
Residue Number Marker
Protein Sequence
Secondary Structure (PSIPRED)
Secondary Structure Prediction (SSPRO)
Coil and Loop (DISEMBL)
Flexible Loop (DISEMBL)
Low Complexity Region (SEG)
Disordered Region (IsUnstruct)
Disordered Region (DISOPRED)
Disordered Region (DISEMBL)
Disordered Region (DISPRO)
Transmembrane Helix (TMHMM)
Transmembrane Helix (HMMTOP)
Transmembrane Helix (MEMSAT)
TM Helix, Signal Peptide (MEMSAT_SVM)
TM Helix, Signal Peptide (Phobius)
Signal Peptide (SignalP HMM Mode)
Signal Peptide (SignalP NN Mode)
Coiled Coils (COILS)
Positional Conservation
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------31
MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD
ccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEccccccccHHccccccccccccccccccccccccccccccccccccccEEEEEEccccccccccccccccccccccccccccccccEEEcccccHHHHHHHHHHHccccccccccccEEEEccEEEEcccccccccEEEEEEEEccccccccEEEEEcccccccccccEEEEEccccccccccccEEEccccc
ccccEEEEcEcHHHcEcHHHHHHHHHHccEcEccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccccEEHHHHcccccccccccEccccccHHEcccccccccEccccccccEEEEcccccccccccccccccccHHHcEEcEcccEEEcccHHHHHHHHHHHccEEEEEEEEHHHHHEEEEEEEEcccccccccHHHHHHHHHEcccccEEEEEEEHHHHHEEEEcEcccEEEEEEEEEEccccccEEEEEEccEcccccEccEEEEEccccHHHcccccEEEEEccc
mytqqirlcgfgcnggfpgmAWRYWVKSGivsggaygskqaeknslsniprahlkswmgvhpdynlpanrlpeligysevdedlpanfdsrtkwpncptireirdqgscgscwgcrpyeiapcehhvngtrpscdaskghtpkcvrecqenydvpykkdlnfgaksysvssNEKSIMKEIYehgpvegaftvfddlilyksgrffvpgnetTAMSLIKWTIRdntsqlgaegaftvfddlILYKSgkalgghairilgwgedeksKEKYWLIANSwntdwgdnglfKILRgkdecgiessitagvpkld
mytqqirlCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRilgwgedekSKEKYWLIANswntdwgdnGLFKILRgkdecgiessitagvpkld
MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD
****QIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYG***********IPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT***********PKCVRECQENYDVPYKKDLNFGAKSYS******SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE***********
***QQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVH***************YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL*
MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNG************PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD
MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHP************IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD
no confident homologs detected
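
Each prediction row above assigns one character to each residue, so a row can be collapsed into segments with start and end positions. The following is a minimal sketch of that conversion. The function name `state_segments` and the toy track are hypothetical and are not taken from the report. It assumes the usual letter conventions (for example `H` helix, `E` strand and `c` coil in the secondary-structure rows).

```python
from itertools import groupby


def state_segments(track):
    """Collapse a per-residue state string into (state, start, end) runs.

    Positions are 1-based and inclusive, matching the residue number marker.
    """
    segments = []
    position = 1
    for state, run in groupby(track):
        length = len(list(run))
        segments.append((state, position, position + length - 1))
        position += length
    return segments


# Hypothetical toy track, used only for illustration (not copied from the rows above).
toy_track = "cccHHHHccEEc"
helices = [seg for seg in state_segments(toy_track) if seg[0] == "H"]
print(helices)  # [('H', 4, 7)]
```

The same function can be applied to any of the rows, for instance to list the `h` runs in a transmembrane row, provided the row is pasted in as a single string.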

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST

ID      Length  Definition                 RBH(Q2H)  RBH(H2Q)  Q cover  H cover  Identity  E-value
Query   309     2.2.26 [Sep-21-2011]
P07858  339     Cathepsin B OS=Homo sapie  yes       N/A       0.504    0.460    0.494     2e-48
P07688  335     Cathepsin B OS=Bos taurus  yes       N/A       0.495    0.456    0.507     2e-48
Q4R5M2  339     Cathepsin B OS=Macaca fas  N/A       N/A       0.504    0.460    0.494     3e-48
Q5R6D1  339     Cathepsin B OS=Pongo abel  yes       N/A       0.504    0.460    0.489     4e-48
A1E295  335     Cathepsin B OS=Sus scrofa  yes       N/A       0.495    0.456    0.502     2e-47
P43233  340     Cathepsin B OS=Gallus gal  yes       N/A       0.508    0.461    0.474     5e-46
P00787  339     Cathepsin B OS=Rattus nor  yes       N/A       0.498    0.454    0.489     5e-46
P10605  339     Cathepsin B OS=Mus muscul  yes       N/A       0.504    0.460    0.484     3e-45
P25807  329     Gut-specific cysteine pro  yes       N/A       0.614    0.577    0.336     3e-37
P25792  340     Cathepsin B-like cysteine  N/A       N/A       0.469    0.426    0.409     6e-36
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
 Score =  192 bits (489), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333




Thiol protease which is believed to participate in intracellular degradation and turnover of proteins. Has also been implicated in tumor invasion and metastasis.
Homo sapiens (taxid: 9606)
EC: 3.4.22.1
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1 PE=1 SV=2
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2 SV=1

Close Homologs in the Non-Redundant Database Detected by BLAST

GI         Length  Definition                                Q cover  H cover  Identity  E-value
Query      309
194384502  273     unnamed protein product [Homo sapiens]    0.715    0.809    0.447     4e-55
496317     344     Sarcophaga pro-cathepsin B [Sarcophaga p  0.553    0.497    0.538     9e-54
347972086  337     AGAP004533-PA [Anopheles gambiae str. PE  0.527    0.483    0.544     3e-53
47217183   351     unnamed protein product [Tetraodon nigro  0.673    0.592    0.414     4e-53
125981197  338     GA10694 [Drosophila pseudoobscura pseudo  0.553    0.505    0.547     7e-53
312374701  335     hypothetical protein AND_15621 [Anophele  0.527    0.486    0.534     1e-51
195438776  340     GK16352 [Drosophila willistoni] gi|19416  0.553    0.502    0.542     1e-51
195058549  340     GH17748 [Drosophila grimshawi] gi|193896  0.734    0.667    0.395     2e-51
14141821   340     probable cathepsin B-like cysteine prote  0.540    0.491    0.524     4e-51
91078958   334     PREDICTED: similar to cathepsin b [Tribo  0.537    0.497    0.521     5e-51
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 120/268 (44%), Positives = 160/268 (59%), Gaps = 47/268 (17%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  ++LK   G      L   + P+ + ++E D  LPA+FD+R +WP CPTI+EI
Sbjct: 45  HNFYNVDMSYLKRLCGTF----LGGPKPPQRVMFTE-DLKLPASFDAREQWPQCPTIKEI 99

Query: 104 RDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
           RDQGSCGSCW     E        HVNG+RP C   +G TPKC + C+  Y   YK+D +
Sbjct: 100 RDQGSCGSCWAFGAVEAISDRICIHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKH 158

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +G  SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG +                 
Sbjct: 159 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------- 201

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                                + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWG
Sbjct: 202 --------------------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWG 239

Query: 282 DNGLFKILRGKDECGIESSITAGVPKLD 309
           DNG FKILRG+D CGIES + AG+P+ D
Sbjct: 240 DNGFFKILRGQDHCGIESEVVAGIPRTD 267




Source: Homo sapiens

Species: Homo sapiens

Genus: Homo

Family: Hominidae

Order: Primates

Class: Mammalia

Phylum: Chordata

Superkingdom: Eukaryota

>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST] gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura] gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni] gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi] gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina morsitans morsitans] gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina morsitans morsitans]
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum] gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST

ID                         Length  Definition                      Q cover  H cover  Identity  E-value
Query                      309
FB|FBgn0030521             340     CtsB1 "Cathepsin B1" [Drosophi  0.394    0.358    0.539     3.8e-30
UNIPROTKB|F1N9D7           340     CTSB "Cathepsin B" [Gallus gal  0.411    0.373    0.461     2e-45
UNIPROTKB|E2R6Q7           339     CTSB "Uncharacterized protein"  0.359    0.327    0.530     8.6e-45
UNIPROTKB|A1E295           335     CTSB "Cathepsin B" [Sus scrofa  0.359    0.331    0.521     1.4e-44
UNIPROTKB|P07858           339     CTSB "Cathepsin B" [Homo sapie  0.359    0.327    0.513     1.8e-44
UNIPROTKB|P43233           340     CTSB "Cathepsin B" [Gallus gal  0.411    0.373    0.453     2.9e-44
ZFIN|ZDB-GENE-040426-2650  330     ctsba "cathepsin B, a" [Danio   0.288    0.269    0.573     5.2e-43
UNIPROTKB|Q6IN22           339     Ctsb "Cathepsin B" [Rattus nor  0.414    0.377    0.466     2.2e-27
RGD|621509                 339     Ctsb "cathepsin B" [Rattus nor  0.414    0.377    0.466     2.2e-27
ZFIN|ZDB-GENE-070323-1     326     ctsbb "capthepsin B, b" [Danio  0.359    0.340    0.504     2.5e-39
FB|FBgn0030521 CtsB1 "Cathepsin B1" [Drosophila melanogaster (taxid:7227)]
 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 68/126 (53%), Positives = 82/126 (65%)

Query:    99 TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
             T + I   G  GS  GCRPYEI+PCEHHVNGTRP C A  G TPKC   CQ  Y V Y K
Sbjct:   169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAK 227

Query:   159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETT--AMS 215
             D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +    G E    A+ 
Sbjct:   228 DKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIR 287

Query:   216 LIKWTI 221
             ++ W +
Sbjct:   288 ILGWGV 293


GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS
GO:0035071 "salivary gland cell autophagic cell death" evidence=IEP
GO:0048102 "autophagic cell death" evidence=IEP
GO:0006508 "proteolysis" evidence=IEA
GO:0050790 "regulation of catalytic activity" evidence=IEA
UNIPROTKB|F1N9D7 CTSB "Cathepsin B" [Gallus gallus (taxid:9031)]
UNIPROTKB|E2R6Q7 CTSB "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)]
UNIPROTKB|A1E295 CTSB "Cathepsin B" [Sus scrofa (taxid:9823)]
UNIPROTKB|P07858 CTSB "Cathepsin B" [Homo sapiens (taxid:9606)]
UNIPROTKB|P43233 CTSB "Cathepsin B" [Gallus gallus (taxid:9031)]
ZFIN|ZDB-GENE-040426-2650 ctsba "cathepsin B, a" [Danio rerio (taxid:7955)]
UNIPROTKB|Q6IN22 Ctsb "Cathepsin B" [Rattus norvegicus (taxid:10116)]
RGD|621509 Ctsb "cathepsin B" [Rattus norvegicus (taxid:10116)]
ZFIN|ZDB-GENE-070323-1 ctsbb "capthepsin B, b" [Danio rerio (taxid:7955)]

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries

ID      Name        Annotated EC number  Identity  Query coverage  Hit coverage  RBH(Q2H)  RBH(H2Q)
A1E295  CATB_PIG    3.4.22.1             0.5025    0.4951          0.4567        yes       N/A
P07688  CATB_BOVIN  3.4.22.1             0.5077    0.4951          0.4567        yes       N/A

EC Number Prediction by EFICAz Software

Prediction Level  EC number  Confidence of Prediction
3rd Layer         3.4.22.1   0.824
3rd Layer         3.4.22     0.766

Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST

ID          Length  Definition                                          E-value
Query       309
cd02620     236     cd02620, Peptidase_C1A_CathepsinB, Cathepsin B gro  2e-71
cd02621     243     cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; al  1e-29
pfam00112   213     pfam00112, Peptidase_C1, Papain family cysteine pr  2e-29
cd02248     210     cd02248, Peptidase_C1A, Peptidase C1A subfamily (M  1e-21
smart00645  175     smart00645, Pept_C1, Papain family cysteine protea  3e-21
PTZ00049    693     PTZ00049, PTZ00049, cathepsin C-like protein; Prov  1e-17
cd02619     223     cd02619, Peptidase_C1, C1 Peptidase family (MEROPS  3e-16
PTZ00200    448     PTZ00200, PTZ00200, cysteine proteinase; Provision  3e-13
cd02698     239     cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; th  7e-10
PTZ00364    548     PTZ00364, PTZ00364, dipeptidyl-peptidase I precurs  1e-08
cd02620     236     cd02620, Peptidase_C1A_CathepsinB, Cathepsin B gro  2e-08
PTZ00203    348     PTZ00203, PTZ00203, cathepsin L protease; Provisio  6e-07
smart00645  175     smart00645, Pept_C1, Papain family cysteine protea  7e-07
COG4870     372     COG4870, COG4870, Cysteine protease [Posttranslati  1e-06
PTZ00462    1004    PTZ00462, PTZ00462, Serine-repeat antigen protein;  2e-06
pfam00112   213     pfam00112, Peptidase_C1, Papain family cysteine pr  6e-06
PTZ00021    489     PTZ00021, PTZ00021, falcipain-2; Provisional        3e-04
pfam00112   213     pfam00112, Peptidase_C1, Papain family cysteine pr  5e-04
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag)
 Score =  220 bits (563), Expect = 2e-71
 Identities = 100/282 (35%), Positives = 128/282 (45%), Gaps = 108/282 (38%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGS------------------------------------ 108
           P +FD+R KWPNC +I EIRDQG+                                    
Sbjct: 1   PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60

Query: 109 -CGSC------------W-----------GCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
            C  C            W           GC+PY I PC HH  G  P C      TP C
Sbjct: 61  CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCG-----TPYC 115

Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
             +CQ+  +  Y++D + G  +YSV S+E  IMKEI  +GPV+ AFTV++D + YKSG  
Sbjct: 116 TPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSG-- 173

Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK--SGKALGGHAIRILGWGED 262
                                                +Y+  SGK LGGHA++I+GWG +
Sbjct: 174 -------------------------------------VYQHTSGKQLGGHAVKIIGWGVE 196

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
                 YWL ANSW TDWG+NG F+ILRG +ECGIES + AG
Sbjct: 197 NG--VPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVAG 236


Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues. Length = 236

>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W)
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase)
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag)
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease
>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational modification, protein turnover, chaperones]
>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease

Conserved Domains Detected by HHsearch

ID                 Length  Definition                                          Probability
Query              309
KOG1542|consensus  372                                                         100.0
PTZ00203           348     cathepsin L protease; Provisional                   100.0
KOG1543|consensus  325                                                         100.0
PTZ00021           489     falcipain-2; Provisional                            100.0
PTZ00200           448     cysteine proteinase; Provisional                    100.0
cd02621            243     Peptidase_C1A_CathepsinC Cathepsin C; also known a  100.0
cd02620            236     Peptidase_C1A_CathepsinB Cathepsin B group; compos  100.0
cd02698            239     Peptidase_C1A_CathepsinX Cathepsin X; the only pap  100.0
PTZ00049           693     cathepsin C-like protein; Provisional               100.0
cd02248            210     Peptidase_C1A Peptidase C1A subfamily (MEROPS data  100.0
PTZ00364           548     dipeptidyl-peptidase I precursor; Provisional       100.0
PF00112            219     Peptidase_C1: Papain family cysteine protease This  100.0
KOG1544|consensus  470                                                         100.0
PTZ00462           1004    Serine-repeat antigen protein; Provisional          100.0
smart00645         174     Pept_C1 Papain family cysteine protease.            100.0
cd02619            223     Peptidase_C1 C1 Peptidase family (MEROPS database   99.97
COG4870            372     Cysteine protease [Posttranslational modification,  99.86
cd00585            437     Peptidase_C1B Peptidase C1B subfamily (MEROPS data  99.55
PF03051            438     Peptidase_C1_2: Peptidase C1-like family This fami  98.64
COG3579            444     PepC Aminopeptidase C [Amino acid transport and me  96.4
PF08246            58      Inhibitor_I29: Cathepsin propeptide inhibitor doma  95.85
KOG1543|consensus  325                                                         94.22
PF08127            41      Propeptide_C1: Peptidase family C1 propeptide; Int  93.63
PF13529            144     Peptidase_C39_2: Peptidase_C39 like family; PDB: 3  93.08
smart00848         57      Inhibitor_I29 Cathepsin propeptide inhibitor domai  90.55
PTZ00364           548     dipeptidyl-peptidase I precursor; Provisional       87.46
cd02698            239     Peptidase_C1A_CathepsinX Cathepsin X; the only pap  84.23
cd02620            236     Peptidase_C1A_CathepsinB Cathepsin B group; compos  81.65
cd02621            243     Peptidase_C1A_CathepsinC Cathepsin C; also known a  80.78
PTZ00049           693     cathepsin C-like protein; Provisional               80.21
>KOG1542|consensus
Probab=100.00  E-value=1.6e-51  Score=378.72  Aligned_cols=219  Identities=25%  Similarity=0.512  Sum_probs=167.1

Q ss_pred             CCCCCcccccccCCCCcHHHHHHHhCCCCCCCCCCCCCCcc-ccc-CCCCCCCCCceecCCCCCCCCCCccccCccCCCC
Q psy8713          34 GAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGY-SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS  111 (309)
Q Consensus        34 ~~~~s~~~g~Nf~~~~~~e~~~~~lg~~~~~~~~~~~~~~~-~~~-~~~~~~lP~~~DwR~~~~~cg~vtpVkdQg~CGS  111 (309)
                      ++.+|-..|+|+|+|+++|||+++++......  . +.+.. ... .....+||++||||++    |+||||||||+|||
T Consensus       108 ~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~~~--~-~~~~~~~~~~~~~~~~lP~~fDWR~k----gaVTpVKnQG~CGS  180 (372)
T KOG1542|consen  108 NDPGSAEYGVTQFSDLTEEEFKKIYLGVKRRG--S-KLPGDAAEAPIEPGESLPESFDWRDK----GAVTPVKNQGMCGS  180 (372)
T ss_pred             cCccccccCccchhhcCHHHHHHHhhcccccc--c-cCccccccCcCCCCCCCCcccchhcc----CCccccccCCcCcc
Confidence            44579999999999999999888765543321  0 11111 111 2347789999999999    99999999999999


Q ss_pred             Cc--------------------CCCccccCCCCCCCCCCCCCCCCCCC-------------------CCccccc-ccccC
Q psy8713         112 CW--------------------GCRPYEIAPCEHHVNGTRPSCDASKG-------------------HTPKCVR-ECQEN  151 (309)
Q Consensus       112 CW--------------------~cs~~~~~~C~~~~~g~~~~C~~~~~-------------------~~~~~~~-~c~~~  151 (309)
                      ||                    +.|.|++.+|+.-+    ..|+++..                   |+..-.+ .|...
T Consensus       181 CWAFS~tG~vEga~~i~~g~LvsLSEQeLvDCD~~d----~gC~GGl~~nA~~~~~~~gGL~~E~dYPY~g~~~~~C~~~  256 (372)
T KOG1542|consen  181 CWAFSTTGAVEGAWAIATGKLVSLSEQELVDCDSCD----NGCNGGLMDNAFKYIKKAGGLEKEKDYPYTGKKGNQCHFD  256 (372)
T ss_pred             hhhhhhhhhhhhHHHhhcCcccccchhhhhcccCcC----CcCCCCChhHHHHHHHHhCCccccccCCccccCCCccccc
Confidence            99                    66788899997643    34777653                   3332222 33211


Q ss_pred             cccccccccccee-eeeeecchHHHHHHHHHhcCCEEEEEecccccccCCCceEeCCCCcchhhhhhhhhhcccCcccCc
Q psy8713         152 YDVPYKKDLNFGA-KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA  230 (309)
Q Consensus       152 ~~~~~~~~~~~~~-~~~~v~~~~~~ik~~l~~~GPv~v~~~~~~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~  230 (309)
                      .     .+....+ ..+.++.||++|.+.|.++|||+|+|++ ..++.|.+||...-                       
T Consensus       257 ~-----~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa-~~mQ~YrgGV~~P~-----------------------  307 (372)
T KOG1542|consen  257 K-----SKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINA-KPMQFYRGGVSCPS-----------------------  307 (372)
T ss_pred             h-----hhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEch-HHHHHhcccccCCC-----------------------
Confidence            1     1112222 3345677999999999999999999997 57899999999831                       


Q ss_pred             CCcceeccccccccCCCccCCceEEEEEeeccCCC-CccEEEEEcCCCCCCCCCceEEEEccCCccccCcceEEEe
Q psy8713         231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS-KEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV  305 (309)
Q Consensus       231 ~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~-g~~YWiikNSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~~  305 (309)
                                 ...|++..++|||+|||||..  . .++|||||||||++|||+|||||.||.|.|||++++++++
T Consensus       308 -----------~~~Cs~~~~~HaVLlvGyG~~--g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mvss~~  370 (372)
T KOG1542|consen  308 -----------KYICSPKLLNHAVLLVGYGSS--GYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMVSSAA  370 (372)
T ss_pred             -----------cccCCccccCceEEEEeecCC--CCCCceEEEECCccccccccceEEEeccccccccccchhhhh
Confidence                       047988889999999999988  5 8999999999999999999999999999999999998765
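
The summary line that opens the KOG1542 alignment above packs its statistics into whitespace-separated `key=value` tokens. A minimal sketch of reading such a line into a dictionary follows. The function name `parse_hhsearch_summary` is hypothetical, and the handling of the percent sign (converting `25%` to the fraction 0.25) is a choice made here for illustration, not something the report specifies.

```python
def parse_hhsearch_summary(line):
    """Parse 'key=value' tokens from an HHsearch-style summary line.

    All values are returned as floats; a trailing '%' is converted to a fraction.
    """
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        if value.endswith("%"):
            fields[key] = float(value[:-1]) / 100.0
        else:
            fields[key] = float(value)
    return fields


summary = (
    "Probab=100.00  E-value=1.6e-51  Score=378.72  Aligned_cols=219  "
    "Identities=25%  Similarity=0.512  Sum_probs=167.1"
)
stats = parse_hhsearch_summary(summary)
print(stats["Probab"], stats["E-value"], stats["Aligned_cols"])
```

Keeping every value as a float keeps the sketch short; a stricter parser might store `Aligned_cols` as an integer.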



>PTZ00203 cathepsin L protease; Provisional
>KOG1543|consensus
>PTZ00021 falcipain-2; Provisional
>PTZ00200 cysteine proteinase; Provisional
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag)
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity
>PTZ00049 cathepsin C-like protein; Provisional
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W)
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification
>KOG1544|consensus
>PTZ00462 Serine-repeat antigen protein; Provisional
>smart00645 Pept_C1 Papain family cysteine protease
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase)
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC)
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively
>KOG1543|consensus
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29)
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag)
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access
>PTZ00049 cathepsin C-like protein; Provisional

Homologous Structure Templates

Structure Templates Detected by BLAST

ID      Length  Definition                                           E-value
Query   309
1qdq_A  253     X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074  9e-50
1pbh_A  317     Crystal Structure Of Human Recombinant Procathepsin  1e-49
3k9m_A  254     Cathepsin B In Complex With Stefin A                 2e-49
1huc_B  205     The Refined 2.15 Angstroms X-Ray Crystal Structure   2e-49
3ai8_B  256     Cathepsin B In Complex With The Nitroxoline          2e-49
1gmy_A  261     Cathepsin B Complexed With Dipeptidyl Nitrile Inhib  2e-49
1ito_A  256     Crystal Structure Analysis Of Bovine Spleen Catheps  3e-49
1sp4_B  205     Crystal Structure Of Ns-134 In Complex With Bovine   4e-49
3cbj_A  266     Chagasin-cathepsin B Complex                         3e-48
1mir_A  322     Rat Procathepsin B                                   5e-47
1cpj_A  260     Crystal Structures Of Recombinant Rat Cathepsin B A  6e-47
1cte_A  254     Crystal Structures Of Recombinant Rat Cathepsin B A  6e-47
3qsd_A  254     Structure Of Cathepsin B1 From Schistosoma Mansoni   9e-37
3hhi_A  325     Crystal Structure Of Cathepsin B From T. Brucei In   7e-26
4hwy_A  340     Trypanosoma Brucei Procathepsin B Solved From 40 Fs  7e-26
3mor_A  317     Crystal Structure Of Cathepsin B From Trypanosoma B  8e-26
3pdf_A  441     Discovery Of Novel Cyanamide-Based Inhibitors Of Ca  2e-17
1jqp_A  438     Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric   2e-15
1k3b_C  69      Crystal Structure Of Human Dipeptidyl Peptidase I (  6e-12
1huc_A  47      The Refined 2.15 Angstroms X-Ray Crystal Structure   6e-10
1sp4_A  48      Crystal Structure Of Ns-134 In Complex With Bovine   7e-10
2o6x_A  310     Crystal Structure Of Procathepsin L1 From Fasciola   3e-08
8pch_A  220     Crystal Structure Of Porcine Cathepsin H Determined  7e-08
3qj3_A  331     Structure Of Digestive Procathepsin L2 Proteinase F  1e-07
1aim_A  215     Cruzain Inhibited By Benzoyl-Tyrosine-Alanine-Fluor  2e-07
1ewp_A  215     Cruzain Bound To Mor-Leu-Hpq                         2e-07
3iut_A  221     The Crystal Structure Of Cruzain In Complex With A   6e-07
3hd3_A  215     High Resolution Crystal Structure Of Cruzain Bound   6e-07
1m6d_A  214     Crystal Structure Of Human Cathepsin F               3e-06
3d6s_A  223     Crystal Structure Of Mite Allergen Der F 1           6e-06
2p7u_A  215     The Crystal Structure Of Rhodesain, The Major Cyste  8e-06
1fh0_A  221     Crystal Structure Of Human Cathepsin V Complexed Wi  8e-06
3h6s_A  221     Strucure Of Clitocypin - Cathepsin V Complex         9e-06
2fo5_A  262     Crystal Structure Of Recombinant Barley Cysteine En  1e-05
3hha_A  220     Crystal Structure Of Cathepsin L In Complex With Az  2e-05
3h89_A  220     A Combined Crystallographic And Molecular Dynamics   2e-05
3of8_A  221     Structural Basis For Reversible And Irreversible In  2e-05
3iv2_A  220     Crystal Structure Of Mature Apo-Cathepsin L C25a Mu  2e-05
3bc3_A  220     Exploring Inhibitor Binding At The S Subsites Of Ca  2e-05
3hwn_A  258     Cathepsin L With Az13010160                          2e-05
2nqd_B  221     Crystal Structure Of Cysteine Protease Inhibitor, C  2e-05
3kse_A  220     Unreduced Cathepsin L In Complex With Stefin A       2e-05
3qt4_A  329     Structure Of Digestive Procathepsin L 3 Of Tenebrio  3e-05
1aec_A  218     Crystal Structure Of Actinidin-E-64 Complex+         3e-05
2act_A  220     Crystallographic Refinement Of The Structure Of Act  3e-05
1cs8_A  316     Crystal Structure Of Procathepsin L                  4e-05
3tnx_A  363     Structure Of The Precursor Of A Thermostable Varian  5e-05
1khp_A  212     Monoclinic Form Of Papain/zlfg-dam Covalent Complex  5e-05
1cjl_A  312     Crystal Structure Of A Cysteine Protease Proform     6e-05
2wbf_X  265     Crystal Structure Analysis Of Sera5e From Plasmodiu  9e-05
3f75_A  224     Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In   1e-04
3p5w_A  220     Actinidin From Actinidia Arguta Planch (Sarusashi)   2e-04
3h7d_A  215     The Crystal Structure Of The Cathepsin K Variant M5  2e-04
2vhs_A  217     Cathsilicatein, A Chimera                            2e-04
1pip_A  212     Crystal Structure Of Papain-Succinyl-Gln-Val-Val-Al  2e-04
1s4v_A  229     The 2.0 A Crystal Structure Of The Kdel-Tailed Cyst  2e-04
3ch2_X  265     Crystal Structure Analysis Of Sera5e From Plasmodiu  2e-04
1ppp_A  212     Crystal Structure Of Papain-E64-C Complex. Binding   2e-04
2as8_A  222     Crystal Structure Of Mature And Fully Active Der P   2e-04
1ef7_A  242     Crystal Structure Of Human Cathepsin X               3e-04
1deu_A  277     Crystal Structure Of Human Procathepsin X: A Cystei  3e-04
3rvw_A  222     Crystal Structure Of Der P 1 Complexed With Fab 4c1  3e-04
3p5u_A  220     Actinidin From Actinidia Arguta Planch (Sarusashi)   3e-04
2fye_A  217     Mutant Human Cathepsin S With Irreversible Inhibito  4e-04
2cio_A  212     The High Resolution X-Ray Structure Of Papain Compl  5e-04
3ima_A  212     Complex Strcuture Of Tarocystatin And Papain         7e-04
1stf_E  212     The Refined 2.4 Angstroms X-Ray Crystal Structure O  7e-04
1o0e_A  208     1.9 Angstrom Crystal Structure Of A Plant Cysteine   8e-04
>pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074 Complex Length = 253

Iteration: 1

 Score = 193 bits (491), Expect = 9e-50, Method: Compositional matrix adjust.
 Identities = 99/193 (51%), Positives = 121/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 99  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 157

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 158 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 238

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 239 HCGIESEIVAGMP 251
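The identity and gap figures reported for the 1QDQ chain A alignment can be cross-checked from the aligned Query and Sbjct rows. The following is a minimal Python sketch under stated assumptions, not the procedure BLAST itself runs: the function name `alignment_stats` is hypothetical, and whole-number percentages are truncated rather than rounded, an assumption chosen because it reproduces the 51% and 20% shown in the report. Positives are left out because counting them requires a substitution matrix.

```python
# Aligned rows of the 1QDQ chain A hit (query 114-306, subject 99-251).
# Gap runs are written as "-" * n so the counts are explicit.
QUERY = (
    "GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE"
    "KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA"
    "FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD"
    "ECGIESSITAGVP"
)
SBJCT = (
    "GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE"
    "KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY" + "-" * 29
    + "-" * 8 + "QHVSGEIMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD"
    + "HCGIESEIVAGMP"
)


def alignment_stats(query: str, sbjct: str) -> dict:
    """Count identical columns and gap columns in a pairwise alignment."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    length = len(query)
    # An identity is a column where both rows carry the same residue.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # A gap column has a dash in either row.
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, sbjct))
    return {
        "length": length,
        "identities": identities,
        "gaps": gaps,
        # Assumption: percentages are truncated to whole numbers.
        "identity_pct": 100 * identities // length,
        "gap_pct": 100 * gaps // length,
    }


stats = alignment_stats(QUERY, SBJCT)
print(stats)
```

Run as written, the sketch reports 99 identities and 40 gap columns over 193 alignment columns, matching the Identities and Gaps fields of this hit.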
>pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At 3.2 Angstrom Resolution Length = 317
>pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A Length = 254
>pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of Human Liver Cathepsin B: The Structural Basis For Its Specificity Length = 205
>pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline Length = 256
>pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor Length = 261
>pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B- E64c Complex Length = 256
>pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor Extends Along The Whole Active Site Cleft Length = 205
>pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex Length = 266
>pdb|1MIR|A Chain A, Rat Procathepsin B Length = 322
>pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A Cathepsin B-Inhibitor Complex: Implications For Structure- Based Inhibitor Design Length = 260
>pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A Cathepsin B-Inhibitor Complex: Implications For Structure- Based Inhibitor Design Length = 254
>pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In Complex With Ca074 Inhibitor Length = 254
>pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex With Ca074 Length = 325
>pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs Free-electron Laser Pulse Data By Serial Femtosecond X-ray Crystallography Length = 340
>pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei Length = 317
>pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin C Length = 441
>pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric Cysteine Protease Of The Papain Family Length = 438
>pdb|1K3B|C Chain C, Crystal Structure Of Human Dipeptidyl Peptidase I (Cathepsin C): Exclusion Domain Added To An Endopeptidase Framework Creates The Machine For Activation Of Granular Serine Proteases Length = 69
>pdb|1HUC|A Chain A, The Refined 2.15 Angstroms X-Ray Crystal Structure Of Human Liver Cathepsin B: The Structural Basis For Its Specificity Length = 47
>pdb|1SP4|A Chain A, Crystal Structure Of Ns-134 In Complex With Bovine Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor Extends Along The Whole Active Site Cleft Length = 48
>pdb|2O6X|A Chain A, Crystal Structure Of Procathepsin L1 From Fasciola Hepatica Length = 310
>pdb|8PCH|A Chain A, Crystal Structure Of Porcine Cathepsin H Determined At 2.1 Angstrom Resolution: Location Of The Mini-Chain C-Terminal Carboxyl Group Defines Cathepsin H Aminopeptidase Function Length = 220
>pdb|3QJ3|A Chain A, Structure Of Digestive Procathepsin L2 Proteinase From Tenebrio Molitor Larval Midgut Length = 331
>pdb|1AIM|A Chain A, Cruzain Inhibited By Benzoyl-Tyrosine-Alanine-Fluoromethylketone Length = 215
>pdb|1EWP|A Chain A, Cruzain Bound To Mor-Leu-Hpq Length = 215
>pdb|3IUT|A Chain A, The Crystal Structure Of Cruzain In Complex With A Tetrafluorophenoxymethyl Ketone Inhibitor Length = 221
>pdb|3HD3|A Chain A, High Resolution Crystal Structure Of Cruzain Bound To The Vinyl Sulfone Inhibitor Smdc-256047 Length = 215
>pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F Length = 214
>pdb|3D6S|A Chain A, Crystal Structure Of Mite Allergen Der F 1 Length = 223
>pdb|2P7U|A Chain A, The Crystal Structure Of Rhodesain, The Major Cysteine Protease Of T. Brucei Rhodesiense, Bound To Inhibitor K777 Length = 215
>pdb|1FH0|A Chain A, Crystal Structure Of Human Cathepsin V Complexed With An Irreversible Vinyl Sulfone Inhibitor Length = 221
>pdb|3H6S|A Chain A, Strucure Of Clitocypin - Cathepsin V Complex Length = 221
>pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine Endoprotease B Isoform 2 (Ep-B2) In Complex With Leupeptin Length = 262
>pdb|3HHA|A Chain A, Crystal Structure Of Cathepsin L In Complex With Az12878478 Length = 220
>pdb|3H89|A Chain A, A Combined Crystallographic And Molecular Dynamics Study Of Cathepsin-L Retro-Binding Inhibitors(Compound 4) Length = 220
>pdb|3OF8|A Chain A, Structural Basis For Reversible And Irreversible Inhibition Of Human Cathepsin L By Their Respective Dipeptidyl Glyoxal And Diazomethylketone Inhibitors Length = 221
>pdb|3IV2|A Chain A, Crystal Structure Of Mature Apo-Cathepsin L C25a Mutant Length = 220
>pdb|3BC3|A Chain A, Exploring Inhibitor Binding At The S Subsites Of Cathepsin L Length = 220
>pdb|3HWN|A Chain A, Cathepsin L With Az13010160 Length = 258
>pdb|2NQD|B Chain B, Crystal Structure Of Cysteine Protease Inhibitor, Chagasin, In Complex With Human Cathepsin L Length = 221
>pdb|3KSE|A Chain A, Unreduced Cathepsin L In Complex With Stefin A Length = 220
>pdb|3QT4|A Chain A, Structure Of Digestive Procathepsin L 3 Of Tenebrio Molitor Larval Midgut Length = 329
>pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+ Length = 218
>pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin At 1.7 Angstroms Resolution By Fast Fourier Least-Squares Methods Length = 220
>pdb|1CS8|A Chain A, Crystal Structure Of Procathepsin L Length = 316
>pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of Papain At 2.6 Angstroem Resolution Length = 363
>pdb|1KHP|A Chain A, Monoclinic Form Of Papain/zlfg-dam Covalent Complex Length = 212
>pdb|1CJL|A Chain A, Crystal Structure Of A Cysteine Protease Proform Length = 312
>pdb|2WBF|X Chain X, Crystal Structure Analysis Of Sera5e From Plasmodium Falciparum With Loop 690-700 Ordered Length = 265
>pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex With Its Propeptide Length = 224
>pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi) Length = 220
>pdb|3H7D|A Chain A, The Crystal Structure Of The Cathepsin K Variant M5 In Compl Chondroitin-4-Sulfate Length = 215
>pdb|2VHS|A Chain A, Cathsilicatein, A Chimera Length = 217
>pdb|1PIP|A Chain A, Crystal Structure Of Papain-Succinyl-Gln-Val-Val-Ala-Ala-P- Nitroanilide Complex At 1.7 Angstroms Resolution: Noncovalent Binding Mode Of A Common Sequence Of Endogenous Thiol Protease Inhibitors Length = 212
>pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine Endopeptidase Functioning In Programmed Cell Death Of Ricinus Communis Endosperm Length = 229
>pdb|3CH2|X Chain X, Crystal Structure Analysis Of Sera5e From Plasmodium Falciparum Length = 265
>pdb|1PPP|A Chain A, Crystal Structure Of Papain-E64-C Complex. Binding Diversity Of E64-C To Papain S2 And S3 Subsites Length = 212
>pdb|2AS8|A Chain A, Crystal Structure Of Mature And Fully Active Der P 1 Allergen Length = 222
>pdb|1EF7|A Chain A, Crystal Structure Of Human Cathepsin X Length = 242
>pdb|1DEU|A Chain A, Crystal Structure Of Human Procathepsin X: A Cysteine Protease With The Proregion Covalently Linked To The Active Site Cysteine Length = 277
>pdb|3RVW|A Chain A, Crystal Structure Of Der P 1 Complexed With Fab 4c1 Length = 222
>pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi) Length = 220
>pdb|2FYE|A Chain A, Mutant Human Cathepsin S With Irreversible Inhibitor Cra- 14013 Length = 217
>pdb|2CIO|A Chain A, The High Resolution X-Ray Structure Of Papain Complexed With Fragments Of The Trypanosoma Brucei Cysteine Protease Inhibitor Icp Length = 212
>pdb|3IMA|A Chain A, Complex Strcuture Of Tarocystatin And Papain Length = 212
>pdb|1STF|E Chain E, The Refined 2.4 Angstroms X-Ray Crystal Structure Of Recombinant Human Stefin B In Complex With The Cysteine Proteinase Papain: A Novel Type Of Proteinase Inhibitor Interaction Length = 212
>pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine Protease Ervatamin C Length = 208

Structure Templates Detected by RPS-BLAST

ID      Length  Definition                                          E-value
Query   309
3hhi_A  325     Cathepsin B-like cysteine protease; occluding loop  1e-80
3hhi_A  325     Cathepsin B-like cysteine protease; occluding loop  1e-08
3pbh_A  317     Procathepsin B; thiol protease, cysteine protease,  7e-72
3pbh_A  317     Procathepsin B; thiol protease, cysteine protease,  1e-16
3pbh_A  317     Procathepsin B; thiol protease, cysteine protease,  1e-09
3cbj_A  266     Cathepsin B; cathepsin B, occluding loop, chagas d  4e-71
3cbj_A  266     Cathepsin B; cathepsin B, occluding loop, chagas d  4e-15
3cbj_A  266     Cathepsin B; cathepsin B, occluding loop, chagas d  9e-10
3qsd_A  254     Cathepsin B-like peptidase (C01 family); cysteine   6e-70
3qsd_A  254     Cathepsin B-like peptidase (C01 family); cysteine   6e-15
3qsd_A  254     Cathepsin B-like peptidase (C01 family); cysteine   5e-10
1deu_A  277     Procathepsin X; cysteine protease, proregion, pros  5e-58
1deu_A  277     Procathepsin X; cysteine protease, proregion, pros  1e-07
3pdf_A  441     Cathepsin C, dipeptidyl peptidase 1; two domains,   5e-55
3pdf_A  441     Cathepsin C, dipeptidyl peptidase 1; two domains,   3e-05
3ois_A  291     Cysteine protease; alpha and beta, hydrolase; HET:  2e-29
2wbf_X  265     Serine-repeat antigen protein; SERA, malaria, vacu  2e-15
8pch_A  220     Cathepsin H; hydrolase, protease, cysteine protein  7e-14
1m6d_A  214     Cathepsin F, catsf; papain family cysteine proteas  1e-12
3i06_A  215     Cruzipain; autocatalytic cleavage, glycoprotein, p  3e-12
1xkg_A  312     DER P I, major mite fecal allergen DER P 1; major   7e-12
1xkg_A  312     DER P I, major mite fecal allergen DER P 1; major   7e-04
3f5v_A  222     DER P 1 allergen; allergy, asthma, DUST mites, gly  8e-12
3f5v_A  222     DER P 1 allergen; allergy, asthma, DUST mites, gly  2e-04
3qj3_A  331     Cathepsin L-like protein; hydrolase, proteinase, l  6e-11
3f75_A  224     Toxopain-2, cathepsin L protease; medical structur  2e-10
3u8e_A  222     Papain-like cysteine protease; papain-like cystein  4e-10
2xu3_A  220     Cathepsin L1; hydrolase, drug design, thiol protea  8e-10
2b1m_A  246     SPE31; papain-like, sugar binding protein; HET: NA  1e-09
1ppo_A  216     Protease omega; hydrolase(thiol protease); 1.80A {  1e-09
3qt4_A  329     Cathepsin-L-like midgut cysteine proteinase; hydro  1e-09
1cs8_A  316     Human procathepsin L; prosegment, propeptide, inhi  2e-09
3bwk_A  243     Cysteine protease falcipain-3; malaria, hydrolase;  2e-09
2o6x_A  310     Procathepsin L1, secreted cathepsin L 1; hydrolase  3e-09
2o6x_A  310     Procathepsin L1, secreted cathepsin L 1; hydrolase  3e-04
2cio_A  212     Papain; hydrolase/inhibitor, complex hydrolase/inh  4e-09
2oul_A  241     Falcipain 2; cysteine protease, inhibitor, macromo  4e-09
2oul_A  241     Falcipain 2; cysteine protease, inhibitor, macromo  7e-04
2fo5_A  262     Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst  5e-09
3ioq_A  213     CMS1MS2; caricaceae, cysteine protease, papain fam  6e-09
1by8_A  314     Protein (procathepsin K); hydrolase(sulfhydryl pro  6e-09
3ovx_A  218     Cathepsin S; hydrolase, covalent inhibitor, aldehy  9e-09
3p5u_A  220     Actinidin; SAD, cysteine proteinases, hydrolase; 1  9e-09
2c0y_A  315     Procathepsin S; proenzyme, proteinase, hydrolase,   1e-08
2c0y_A  315     Procathepsin S; proenzyme, proteinase, hydrolase,   2e-04
3kwz_A  215     Cathepsin K; enzyme inhibitor, covalent reversible  2e-08
2bdz_A  214     Mexicain; cysteine protease, peptidase_C1, papain-  2e-08
1pci_A  322     Procaricain; zymogen, hydrolase, thiol protease; 3  2e-08
1o0e_A  208     Ervatamin C; plant cysteine protease, two domain,   3e-08
1cqd_A  221     Protein (protease II); cysteine protease, glycopro  4e-08
1cqd_A  221     Protein (protease II); cysteine protease, glycopro  6e-04
1iwd_A  215     Ervatamin B; cysteine protease, alpha-beta protein  4e-08
1s4v_A  229     Cysteine endopeptidase; KDEL ER retention signal,   5e-08
1yal_A  218     Chymopapain; hydrolase, thiol protease; 1.70A {Car  7e-08
3pw3_A  383     Aminopeptidase C; bleomycin, cysteine proteinase f  7e-05
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} PDB: 3mor_A* Length = 325
 Score =  246 bits (630), Expect = 1e-80
 Identities = 94/330 (28%), Positives = 121/330 (36%), Gaps = 105/330 (31%)

Query: 40  QAEKNS-LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           +A+ +  + NI     K   GV    N  +          E    LP++FDS   WPNCP
Sbjct: 27  KAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCP 86

Query: 99  TIREIRDQGSCGSCW-------------------------------------GC------ 115
           TI +I DQ +CGSCW                                     GC      
Sbjct: 87  TIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPD 146

Query: 116 ----------------RPYEIAPCEHHVNGTR--PSCDASKGHTPKCVRECQENYDVPYK 157
                           +PY    C HH       P C      TPKC   C +       
Sbjct: 147 RAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT---IP 203

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
                   SY++   E   M+E++  GP E AF V++D I Y SG               
Sbjct: 204 VVNYRSWTSYAL-QGEDDYMRELFFRGPFEVAFDVYEDFIAYNSG--------------- 247

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                             V+       SG+ LGGHA+R++GWG        YW IANSWN
Sbjct: 248 ------------------VYHH----VSGQYLGGHAVRLVGWGTSN--GVPYWKIANSWN 283

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPK 307
           T+WG +G F I RG  ECGIE   +AG+P 
Sbjct: 284 TEWGMDGYFLIRRGSSECGIEDGGSAGIPL 313


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} PDB: 3mor_A* Length = 325
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Length = 317
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Length = 317
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Length = 317
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Length = 266
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Length = 266
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Length = 266
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A* Length = 254
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A* Length = 254
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A* Length = 254
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Length = 277
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Length = 277
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Length = 441
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Length = 441
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Length = 291
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Length = 265
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Length = 220
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Length = 214
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Length = 215
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Length = 312
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Length = 312
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein, hydrola protease, secreted, thiol protease; HET: P6G; 1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A 3rvw_A* 3rvx_A 3rvv_A* 3d6s_A* Length = 222
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} Length = 331
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} Length = 224
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} Length = 222
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Length = 220
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Length = 246
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Length = 216
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Length = 329
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Length = 316
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Length = 243
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Length = 310
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Length = 310
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Length = 212
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Length = 241
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Length = 241
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Length = 262
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} Length = 213
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Length = 314
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Length = 218
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A Length = 220
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Length = 315
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Length = 315
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Length = 215
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Length = 214
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Length = 322
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Length = 208
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Length = 221
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Length = 221
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Length = 215
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Length = 229
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Length = 218
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Length = 383 Back     alignment and structure

Structure Templates Detected by HHsearch

ID  Length  Definition  Probability
Query 309
3pbh_A 317 Procathepsin B; thiol protease, cysteine protease, 100.0
3hhi_A 325 Cathepsin B-like cysteine protease; occluding loop 100.0
3qj3_A 331 Cathepsin L-like protein; hydrolase, proteinase, l 100.0
3qt4_A 329 Cathepsin-L-like midgut cysteine proteinase; hydro 100.0
1by8_A 314 Protein (procathepsin K); hydrolase(sulfhydryl pro 100.0
2o6x_A 310 Procathepsin L1, secreted cathepsin L 1; hydrolase 100.0
2c0y_A 315 Procathepsin S; proenzyme, proteinase, hydrolase, 100.0
1cs8_A 316 Human procathepsin L; prosegment, propeptide, inhi 100.0
3tnx_A 363 Papain; hydrolase, cytoplasm for recombinant expre 100.0
3qsd_A 254 Cathepsin B-like peptidase (C01 family); cysteine 100.0
3cbj_A 266 Cathepsin B; cathepsin B, occluding loop, chagas d 100.0
3pdf_A 441 Cathepsin C, dipeptidyl peptidase 1; two domains, 100.0
1pci_A 322 Procaricain; zymogen, hydrolase, thiol protease; 3 100.0
1xkg_A 312 DER P I, major mite fecal allergen DER P 1; major 100.0
1m6d_A 214 Cathepsin F, catsf; papain family cysteine proteas 100.0
3i06_A 215 Cruzipain; autocatalytic cleavage, glycoprotein, p 100.0
3kwz_A 215 Cathepsin K; enzyme inhibitor, covalent reversible 100.0
8pch_A 220 Cathepsin H; hydrolase, protease, cysteine protein 100.0
2xu3_A 220 Cathepsin L1; hydrolase, drug design, thiol protea 100.0
3p5u_A 220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 100.0
3f5v_A 222 DER P 1 allergen; allergy, asthma, DUST mites, gly 100.0
3ovx_A 218 Cathepsin S; hydrolase, covalent inhibitor, aldehy 100.0
2b1m_A 246 SPE31; papain-like, sugar binding protein; HET: NA 100.0
1ppo_A 216 Protease omega; hydrolase(thiol protease); 1.80A { 100.0
3u8e_A 222 Papain-like cysteine protease; papain-like cystein 100.0
1cqd_A 221 Protein (protease II); cysteine protease, glycopro 100.0
1deu_A 277 Procathepsin X; cysteine protease, proregion, pros 100.0
1yal_A 218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 100.0
1iwd_A 215 Ervatamin B; cysteine protease, alpha-beta protein 100.0
2oul_A 241 Falcipain 2; cysteine protease, inhibitor, macromo 100.0
3bwk_A 243 Cysteine protease falcipain-3; malaria, hydrolase; 100.0
3ioq_A 213 CMS1MS2; caricaceae, cysteine protease, papain fam 100.0
1s4v_A 229 Cysteine endopeptidase; KDEL ER retention signal, 100.0
2cio_A 212 Papain; hydrolase/inhibitor, complex hydrolase/inh 100.0
1o0e_A 208 Ervatamin C; plant cysteine protease, two domain, 100.0
2bdz_A 214 Mexicain; cysteine protease, peptidase_C1, papain- 100.0
3f75_A 224 Toxopain-2, cathepsin L protease; medical structur 100.0
2fo5_A 262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 100.0
2wbf_X 265 Serine-repeat antigen protein; SERA, malaria, vacu 100.0
3ois_A 291 Cysteine protease; alpha and beta, hydrolase; HET: 100.0
2cb5_A 453 Protein (bleomycin hydrolase); aminopeptidase, cys 99.92
2e01_A 457 Cysteine proteinase 1; bleomycin hydrolase, thiol 99.91
3pw3_A 383 Aminopeptidase C; bleomycin, cysteine proteinase f 99.66
2l95_A 80 Crammer, LP06209P; cysteine proteinase inhibitor, 97.65
3f75_P 106 Toxopain-2, cathepsin L propeptide; medical struct 97.08
3pw3_A 383 Aminopeptidase C; bleomycin, cysteine proteinase f 96.75
3pbh_A 317 Procathepsin B; thiol protease, cysteine protease, 93.52
3qsd_A 254 Cathepsin B-like peptidase (C01 family); cysteine 91.29
3cbj_A 266 Cathepsin B; cathepsin B, occluding loop, chagas d 90.71
3hhi_A 325 Cathepsin B-like cysteine protease; occluding loop 90.52
2cio_A 212 Papain; hydrolase/inhibitor, complex hydrolase/inh 88.68
1pci_A 322 Procaricain; zymogen, hydrolase, thiol protease; 3 87.7
3f5v_A 222 DER P 1 allergen; allergy, asthma, DUST mites, gly 87.69
2bdz_A 214 Mexicain; cysteine protease, peptidase_C1, papain- 87.66
1ppo_A 216 Protease omega; hydrolase(thiol protease); 1.80A { 87.25
1o0e_A 208 Ervatamin C; plant cysteine protease, two domain, 86.62
1yal_A 218 Chymopapain; hydrolase, thiol protease; 1.70A {Car 86.27
1iwd_A 215 Ervatamin B; cysteine protease, alpha-beta protein 86.23
3ioq_A 213 CMS1MS2; caricaceae, cysteine protease, papain fam 86.2
1m6d_A 214 Cathepsin F, catsf; papain family cysteine proteas 86.09
2b1m_A 246 SPE31; papain-like, sugar binding protein; HET: NA 85.69
3kwz_A 215 Cathepsin K; enzyme inhibitor, covalent reversible 85.25
2oul_A 241 Falcipain 2; cysteine protease, inhibitor, macromo 85.2
1cqd_A 221 Protein (protease II); cysteine protease, glycopro 85.16
3u8e_A 222 Papain-like cysteine protease; papain-like cystein 84.67
8pch_A 220 Cathepsin H; hydrolase, protease, cysteine protein 84.27
1xkg_A 312 DER P I, major mite fecal allergen DER P 1; major 84.24
1s4v_A 229 Cysteine endopeptidase; KDEL ER retention signal, 84.22
2xu3_A 220 Cathepsin L1; hydrolase, drug design, thiol protea 84.22
3f75_A 224 Toxopain-2, cathepsin L protease; medical structur 84.08
2o6x_A 310 Procathepsin L1, secreted cathepsin L 1; hydrolase 83.86
3bwk_A 243 Cysteine protease falcipain-3; malaria, hydrolase; 83.6
3p5u_A 220 Actinidin; SAD, cysteine proteinases, hydrolase; 1 83.13
3qj3_A 331 Cathepsin L-like protein; hydrolase, proteinase, l 82.81
3qt4_A 329 Cathepsin-L-like midgut cysteine proteinase; hydro 82.81
2fo5_A 262 Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cyst 82.76
1by8_A 314 Protein (procathepsin K); hydrolase(sulfhydryl pro 82.5
3i06_A 215 Cruzipain; autocatalytic cleavage, glycoprotein, p 82.03
3pdf_A 441 Cathepsin C, dipeptidyl peptidase 1; two domains, 81.23
2c0y_A 315 Procathepsin S; proenzyme, proteinase, hydrolase, 80.02
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Back     alignment and structure
Probab=100.00  E-value=6.8e-57  Score=423.57  Aligned_cols=240  Identities=48%  Similarity=0.933  Sum_probs=185.9

Q ss_pred             HHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHHhCCCCCCCCCCCCCCcccccCCCCCCCCCceecCCCCCCCCCC
Q psy8713          21 AWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI  100 (309)
Q Consensus        21 a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DwR~~~~~cg~v  100 (309)
                      .=++|.++|-.   + .+|++|+| |+++++||++++||.++...  .  .+..... ....+||++||||++||+|+.|
T Consensus        11 ~~~~i~~~N~~---~-~~~~~~~n-f~~~~~e~~~~~lg~~~~~~--~--~~~~~~~-~~~~~lP~s~DwR~~~~~c~~v   80 (317)
T 3pbh_A           11 SDELVNYVNKR---N-TTWQAGHN-FYNVDMSYLKRLCGTFLGGP--K--PPQRVMF-TEDLKLPASFDAREQWPQCPTI   80 (317)
T ss_dssp             CHHHHHHHHHH---T-CSEEECCC-CSSCCHHHHHHTCCBCTTCC--C--CSEEECC-CSCCCCCSSEEHHHHCTTCGGG
T ss_pred             cHHHHHHHHCC---C-CCeEEeec-cccCCHHHHHHHcCCCCCcc--c--cCccccc-ccccCCCCCEechhccCCCCCc
Confidence            34677777732   3 48999999 77999999999999876541  1  1222211 1246899999999999999999


Q ss_pred             ccccCccCCCCCc----------------------CCCccccCCC---------CCCC----------CC--------CC
Q psy8713         101 REIRDQGSCGSCW----------------------GCRPYEIAPC---------EHHV----------NG--------TR  131 (309)
Q Consensus       101 tpVkdQg~CGSCW----------------------~cs~~~~~~C---------~~~~----------~g--------~~  131 (309)
                      +||||||.|||||                      ..|++++.+|         .++.          +|        ..
T Consensus        81 tpVkdQg~CGSCWAFsa~~ale~~~~i~~~~~~~~~LSeq~LvdC~~~~~~~GC~GG~~~~A~~yi~~~Gi~te~~Y~~~  160 (317)
T 3pbh_A           81 KEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESH  160 (317)
T ss_dssp             TCCCBCCSSCCHHHHHHHHHHHHHHHHHTTTSCCCCBCHHHHHHHSCTTTBCGGGCBCHHHHHHHHHHTCEEBBCSTTCC
T ss_pred             CCccccCCccchHHHHHHHHHHHHHHHHhCCCCcccCCHHHHHHhccccCCCCCCCCCHHHHHHHHHHhCCCcchhccCC
Confidence            9999999999999                      2334444444         3321          00        01


Q ss_pred             CCCCCCC------------------CCCcccccccccCccccccccccceeeeeeecchHHHHHHHHHhcCCEEEEEecc
Q psy8713         132 PSCDASK------------------GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVF  193 (309)
Q Consensus       132 ~~C~~~~------------------~~~~~~~~~c~~~~~~~~~~~~~~~~~~~~v~~~~~~ik~~l~~~GPv~v~~~~~  193 (309)
                      ..|.++.                  .+.+.|...|...+...+..+..+....|.++.++++||++|+++|||+|+|.++
T Consensus       161 ~~c~PY~~~~c~~~~~~~~~~C~~~~~~~~c~~~c~~~~~~~~~~~~~~~~~~~~v~~~e~~i~~~i~~~GPV~v~i~~~  240 (317)
T 3pbh_A          161 VGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY  240 (317)
T ss_dssp             CBSSCCCSCCCCCCCTTCCSCCCSCCCCCCCCCSCCSSCCSCGGGSEECBCCCEEECSCHHHHHHHHHHHCCEEEEEEEE
T ss_pred             CCCcCcccCcccccccCcCCCCCCcCCCCcccccccCCCccceeeeeeeeeecccCCcHHHHHHHHHHHCCCEEEEEEec
Confidence            1222322                  1233444556655655666666676666788889999999999999999999999


Q ss_pred             cccccCCCceEeCCCCcchhhhhhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccCCCCccEEEEE
Q psy8713         194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA  273 (309)
Q Consensus       194 ~~f~~Y~sGiy~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiik  273 (309)
                      ++|++|++|||.                                     ..|+...++|||+|||||++  ++++|||||
T Consensus       241 ~~f~~Y~~GVy~-------------------------------------~~~~~~~~~HaV~iVGyG~~--~g~~YWivk  281 (317)
T 3pbh_A          241 SDFLLYKSGVYQ-------------------------------------HVTGEMMGGHAIRILGWGVE--NGTPYWLVA  281 (317)
T ss_dssp             GGGGGEEEEEEC-------------------------------------CCSCCEEEEEEEEEEEEEEE--TTEEEEEEE
T ss_pred             ccccCCCCcEEc-------------------------------------cCCCCCCCCEEEEEEEEEEe--CCeeEEEEE
Confidence            999999999998                                     44666677999999999998  889999999


Q ss_pred             cCCCCCCCCCceEEEEccCCccccCcceEEEeecCC
Q psy8713         274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD  309 (309)
Q Consensus       274 NSWG~~WG~~Gy~~i~~g~n~cgi~~~~~~~~~~~~  309 (309)
                      ||||++|||+|||||+||.|+|||++.+++++|++|
T Consensus       282 NSWG~~WGe~GY~ri~rg~n~CgI~~~~~a~~p~~d  317 (317)
T 3pbh_A          282 NSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD  317 (317)
T ss_dssp             CSBCTTSTBTTEEEEECSSCGGGTTTSCEECCBCCC
T ss_pred             cCCCCCcCCCcEEEEEcCCCcccCCCceEeeecCCC
Confidence            999999999999999999999999999999999998
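
The summary line that opens the 3pbh_A alignment (Probab, E-value, Score, and so on) is a flat list of key=value pairs. The following is a minimal sketch, not part of the report, of how such a line could be split into typed values; the function name `parse_hhsearch_summary` is hypothetical, and the choice to convert the percentage to a fraction is an assumption made for illustration.

```python
import re


def parse_hhsearch_summary(line):
    """Split a 'key=value  key=value ...' summary line into a dict of floats.

    Hypothetical helper: values ending in '%' are converted to fractions.
    """
    fields = {}
    for key, value in re.findall(r"(\S+)=(\S+)", line):
        if value.endswith("%"):
            fields[key] = float(value[:-1]) / 100.0
        else:
            fields[key] = float(value)
    return fields


# Summary line copied from the 3pbh_A hit in this report.
summary = (
    "Probab=100.00  E-value=6.8e-57  Score=423.57  Aligned_cols=240  "
    "Identities=48%  Similarity=0.933  Sum_probs=185.9"
)
hit = parse_hhsearch_summary(summary)
for name, value in hit.items():
    print(name, value)
```

Keeping every value as a float keeps the sketch short; a real parser would probably store Aligned_cols as an integer.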



>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} SCOP: d.3.1.0 PDB: 4hwy_A* 3mor_A* Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} SCOP: d.3.1.0 Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Back     alignment and structure
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition, hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cjl_A 3hwn_A* Back     alignment and structure
>3tnx_A Papain; hydrolase, cytoplasm for recombinant expression; 2.62A {Carica papaya} Back     alignment and structure
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} SCOP: d.3.1.0 PDB: 3s3q_A* 3s3r_A* Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} SCOP: d.3.1.1 PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Back     alignment and structure
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is covalently bound to Cys25, lysosomeal protein; HET: O64; 1.49A {Homo sapiens} SCOP: d.3.1.1 PDB: 2h7j_A* 2f1g_A* 2hh5_B* 2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A* 2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A* 3n4c_A* 3mpe_A* 1nqc_A* ... Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} SCOP: d.3.1.0 Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Back     alignment and structure
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} SCOP: d.3.1.1 Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Back     alignment and structure
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Back     alignment and structure
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} SCOP: d.3.1.0 Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Back     alignment and structure
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease, cathepsin, hydrolase, glycoprotein, thiol protease; HET: DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X Back     alignment and structure
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A {Xylella fastidiosa} Back     alignment and structure
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease, SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens} SCOP: d.3.1.1 PDB: 1cb5_A Back     alignment and structure
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1 protease, hydrolase; 1.73A {Saccharomyces cerevisiae} PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A 1gcb_A Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Back     alignment and structure
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic disorder P like protein, hydrolase; NMR {Drosophila melanogaster} Back     alignment and structure
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} Back     alignment and structure
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural genomics, JO center for structural genomics, JCSG; HET: MSE; 2.23A {Parabacteroides distasonis} Back     alignment and structure
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme, papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A 1pbh_A 1mir_A Back     alignment and structure
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase, digestive tract, hydrolase-hydrolase INH complex; HET: 074; 1.30A {Schistosoma mansoni} SCOP: d.3.1.0 PDB: 3s3q_A* 3s3r_A* Back     alignment and structure
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco hydrolase, lysosome, protease, thiol protease, zymogen, CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A* 3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A* 1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A* 1qdq_A* 1csb_B* 1huc_B 2ipp_B ... Back     alignment and structure
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO protease; HET: 074; 1.60A {Trypanosoma brucei} SCOP: d.3.1.0 PDB: 4hwy_A* 3mor_A* Back     alignment and structure
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP, cysteine protease, allergen, protease, thiol protease; 1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B 3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A* 1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A* 5pad_A* 6pad_A* ... Back     alignment and structure
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica papaya} SCOP: d.3.1.1 Back     alignment and structure
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein, hydrola protease, secreted, thiol protease; HET: P6G; 1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A 3rvw_A* 3rvx_A 3rvv_A* 3d6s_A* Back     alignment and structure
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET: E64; 2.10A {Jacaratia mexicana} Back     alignment and structure
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya} SCOP: d.3.1.1 PDB: 1meg_A* Back     alignment and structure
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH 2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP: d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A* Back     alignment and structure
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP: d.3.1.1 PDB: 1gec_E* Back     alignment and structure
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD, L-DOM domain., hydrolase; 1.63A {Tabernaemontana divaricata} SCOP: d.3.1.1 Back     alignment and structure
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase; HET: E64 SO4; 1.87A {Carica candamarcensis} SCOP: d.3.1.1 Back     alignment and structure
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase; HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1 Back     alignment and structure
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A {Pachyrhizus erosus} PDB: 2b1n_A* Back     alignment and structure
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor, disease mutation, disulfide bond, glycoprotein, hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A* 1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A* 1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A* 2bdl_A* ... Back     alignment and structure
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular interaction, HY hydrolase inhibitor complex; 2.20A {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A 3bpf_A* 3pnr_A Back     alignment and structure
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline specificity, carboh papain family, hydrolase; HET: NAG FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1 Back     alignment and structure
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase, peptidase_C1A, hydrolase, in form; 1.31A {Crocus sativus} SCOP: d.3.1.0 Back     alignment and structure
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase, aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP: d.3.1.1 PDB: 1nb3_A* 1nb5_A* Back     alignment and structure
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen, cysteine protease, house DUST mite, dermatop pteronyssinus; 1.61A {Dermatophagoides pteronyssinus} SCOP: d.3.1.1 Back     alignment and structure
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm, ricinosomes, SEED germi senescence, hydrolase-hydrolase inhibitor complex; 2.00A {Ricinus communis} SCOP: d.3.1.1 Back     alignment and structure
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB; 0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A* 2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A* 3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A* 2nqd_B* 3kse_A* 2vhs_A ... Back     alignment and structure
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of pathogenic protozoa, MSGPP, C protease, parasite, protozoa, hydrolase; 1.99A {Toxoplasma gondii} SCOP: d.3.1.0 Back     alignment and structure
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease, cysteine protease, zymogen, hydro; 1.40A {Fasciola hepatica} Back     alignment and structure
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A {Plasmodium falciparum} PDB: 3bpm_A* Back     alignment and structure
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia arguta} SCOP: d.3.1.1 PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A Back     alignment and structure
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut; 1.85A {Tenebrio molitor} SCOP: d.3.1.0 Back     alignment and structure
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen, intramolecular DISS bonds, insect larVal midgut; HET: PG4 PG6; 2.11A {Tenebrio molitor} Back     alignment and structure
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine endoprotease, endopeptidase, LEUP hydrolase; HET: AR7; 2.20A {Hordeum vulgare} Back     alignment and structure
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain; 2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A Back     alignment and structure
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi} SCOP: d.3.1.1 PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A* 1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A* 1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A* ... Back     alignment and structure
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease, hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A* 1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C* Back     alignment and structure
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease, prosegment binding loop, glycoprotein, lysosome, protease, zymogen; 2.1A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST

ID  Length  Definition  E-value
Query 309
d1gmya_ 254 d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens 1e-37
g1k3b.1 233 d.3.1.1 (B:,C:) Cathepsin C (dipeptidyl peptidase 3e-31
g8pch.1 228 d.3.1.1 (P:,A:) Cathepsin H {Pig (Sus scrofa) [Tax 6e-24
d1deua_ 275 d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens 1e-23
d1s4va_ 224 d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor 1e-22
d1m6da_ 214 d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [Ta 3e-22
d1me4a_ 215 d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 56 3e-21
d1cs8a_ 316 d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens 4e-21
d1xkga1 302 d.3.1.1 (A:4-305) Major mite fecal allergen der p 8e-21
d2oula1 241 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falcip 4e-20
d1aeca_ 218 d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwi 4e-20
d1fh0a_ 221 d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens 6e-20
d1ppoa_ 216 d.3.1.1 (A:) Caricain (protease omega) {Papaya (Ca 6e-20
d2h7ja1 217 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sa 7e-20
d1yala_ 218 d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [ 8e-19
d1cqda_ 216 d.3.1.1 (A:) Proline-specific cysteine protease {G 2e-18
d2r6na1 215 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sa 6e-18
d1iwda_ 215 d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia 7e-18
d1o0ea_ 208 d.3.1.1 (A:) Ervatamin C {East indian rosebay (Erv 8e-18
d1khqa_ 212 d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId 1e-17
d3gcba_ 458 d.3.1.1 (A:) Bleomycin hydrolase {Baker's yeast (S 9e-05
d2cb5a_ 453 d.3.1.1 (A:) Bleomycin hydrolase {Human (Homo sapi 0.002
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]} Length = 254 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: (Pro)cathepsin B
species: Human (Homo sapiens) [TaxId: 9606]
 Score =  132 bits (332), Expect = 1e-37
 Identities = 98/205 (47%), Positives = 124/205 (60%), Gaps = 40/205 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           +   G   S  GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++
Sbjct: 89  LVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHY 147

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG +                  
Sbjct: 148 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQ----------------- 190

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + +G+ +GGHAIRILGWG +      YWL+ANSWNTDWGD
Sbjct: 191 --------------------HVTGEMMGGHAIRILGWGVENG--TPYWLVANSWNTDWGD 228

Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
           NG FKILRG+D CGIES + AG+P+
Sbjct: 229 NGFFKILRGQDHCGIESEVVAGIPR 253
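
The statistics line of the d1gmya_ alignment can be cross-checked against the alignment coordinates. The sketch below is an illustration only: it parses the Identities/Positives/Gaps fractions, recomputes the percentages, and compares the alignment length with the query and subject ranges printed above (query 103-307, subject 89-253). The helper name `parse_blast_stats` is hypothetical. The use of floor division is an inference from the numbers shown here (98/205 is about 47.8%, printed as 47%), not a statement about how the tool is implemented.

```python
import re


def parse_blast_stats(line):
    """Return {name: (count, alignment_length, printed_percent)} for a stats line."""
    stats = {}
    pattern = r"(\w+) = (\d+)/(\d+) \((\d+)%\)"
    for name, count, length, percent in re.findall(pattern, line):
        stats[name] = (int(count), int(length), int(percent))
    return stats


# Statistics line copied from the d1gmya_ alignment in this report.
line = " Identities = 98/205 (47%), Positives = 124/205 (60%), Gaps = 40/205 (19%)"
stats = parse_blast_stats(line)

# Recompute each percentage with floor division and compare to the printed one.
floored = {name: 100 * count // length for name, (count, length, _) in stats.items()}

# Coordinates taken from the first Query line and the last Sbjct line.
query_span = 307 - 103 + 1   # residues of the query covered by the alignment
sbjct_span = 253 - 89 + 1    # residues of the subject covered by the alignment
gaps = stats["Gaps"][0]
alignment_length = stats["Gaps"][1]

print(floored)
print(query_span, sbjct_span + gaps, alignment_length)
```

Here the query range spans the full alignment length, so all 40 gap positions fall in the subject row, which matches the long run of dashes opposite query residues 205 to 241.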


>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]} Length = 275
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]} Length = 224
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]} Length = 214
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]} Length = 215
>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]} Length = 316
>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]} Length = 302
>d2oula1 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falciparum [TaxId: 5833]} Length = 241
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]} Length = 218
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]} Length = 221
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]} Length = 216
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]} Length = 217
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]} Length = 218
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]} Length = 216
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]} Length = 215
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]} Length = 215
>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]} Length = 208
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]} Length = 212
>d3gcba_ d.3.1.1 (A:) Bleomycin hydrolase {Baker's yeast (Saccharomyces cerevisiae), Gal6 [TaxId: 4932]} Length = 458
>d2cb5a_ d.3.1.1 (A:) Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 9606]} Length = 453

Homologous Domains Detected by HHsearch

ID       Length  Definition                                          Probability
Query    309
d1cs8a_  316     (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 960  100.0
d1xkga1  302     Major mite fecal allergen der p 1 {House-dust mite  100.0
d1gmya_  254     (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 960  100.0
g1k3b.1  233     Cathepsin C (dipeptidyl peptidase I), catalytic do  100.0
g8pch.1  228     Cathepsin H {Pig (Sus scrofa) [TaxId: 9823]}        100.0
d1me4a_  215     Cruzain {Trypanosoma cruzi [TaxId: 5693]}           100.0
d1m6da_  214     Cathepsin F {Human (Homo sapiens) [TaxId: 9606]}    100.0
d2oula1  241     Falcipain 2 {Plasmodium falciparum [TaxId: 5833]}   100.0
d2h7ja1  217     (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 960  100.0
d1yala_  218     Chymopapain {Papaya (Carica papaya) [TaxId: 3649]}  100.0
d1deua_  275     (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 960  100.0
d1cqda_  216     Proline-specific cysteine protease {Ginger rhizome  100.0
d2r6na1  215     (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 960  100.0
d1ppoa_  216     Caricain (protease omega) {Papaya (Carica papaya)   100.0
d1aeca_  218     Actinidin {Chinese gooseberry or kiwifruit (Actini  100.0
d1fh0a_  221     (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 960  100.0
d1khqa_  212     Papain {Papaya (Carica papaya) [TaxId: 3649]}       100.0
d1s4va_  224     Vignain (bean endopeptidase) {Castor bean (Ricinus  100.0
d1o0ea_  208     Ervatamin C {East indian rosebay (Ervatamia corona  100.0
d1iwda_  215     Ervatamin B {Adam's apple (Ervatamia coronaria) [T  100.0
d3gcba_  458     Bleomycin hydrolase {Baker's yeast (Saccharomyces   98.93
d2cb5a_  453     Bleomycin hydrolase {Human (Homo sapiens) [TaxId:   98.59

>d1cs8a_ d.3.1.1 (A:) (Pro)cathepsin L {Human (Homo sapiens) [TaxId: 9606]}
class: Alpha and beta proteins (a+b)
fold: Cysteine proteinases
superfamily: Cysteine proteinases
family: Papain-like
domain: (Pro)cathepsin L
species: Human (Homo sapiens) [TaxId: 9606]
Probab=100.00  E-value=9.3e-50  Score=372.42  Aligned_cols=238  Identities=20%  Similarity=0.386  Sum_probs=174.7
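
The summary line above is a run of whitespace-separated `key=value` fields. A minimal Python sketch for reading them into a dictionary, assuming every field follows that pattern; `parse_summary` is a hypothetical helper and not an HH-suite function.

```python
import re

def parse_summary(line):
    """Return the key=value fields of a hit summary line as a dict of strings."""
    # Keys may contain a hyphen (for example "E-value"), so allow it explicitly.
    return dict(re.findall(r"([\w-]+)=(\S+)", line))

line = ("Probab=100.00  E-value=9.3e-50  Score=372.42  Aligned_cols=238  "
        "Identities=20%  Similarity=0.386  Sum_probs=174.7")
fields = parse_summary(line)

# Values stay as strings; convert only the ones that are plain numbers.
print(fields["Aligned_cols"], float(fields["E-value"]))
```

Values such as `Identities=20%` are kept as text because the trailing percent sign would need separate handling before numeric conversion.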

Q ss_pred             cchHHHHHHHHhccccCCCCCCcccccccCCCCcHHHHHHHhCCCCCCCCCCCCCCcccccCCCCCCCCCceecCCCCCC
Q psy8713          17 FPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN   96 (309)
Q Consensus        17 ~~~~a~~~~~~~~~~~~~~~~s~~~g~Nf~~~~~~e~~~~~lg~~~~~~~~~~~~~~~~~~~~~~~~lP~~~DwR~~~~~   96 (309)
                      ++..+.++|.+||.....+-.+|++|+|+|+|++.|||++++.......    .............+||++||||++   
T Consensus        34 iF~~N~~~I~~~N~~~~~~~~~~~~g~N~fsDlt~eEf~~~~~~~~~~~----~~~~~~~~~~~~~~lP~s~Dwr~~---  106 (316)
T d1cs8a_          34 VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRK----PRKGKVFQEPLFYEAPRSVDWREK---  106 (316)
T ss_dssp             HHHHHHHHHHHHHHHHHTTCCSEEECCCTTTTCCHHHHHHHHCCBCCCC----CSCCEECCCCTTCCCCSCEEGGGG---
T ss_pred             HHHHHHHHHHHHHhHhhcCCCceEEeceeccccCcHHHHhhhccccccc----cccCccccCcccccCCCceECCcC---
Confidence            4667788899999987766679999999999999999988887554331    111111222346789999999999   


Q ss_pred             CCCCccccCccCCCCCc--------------------CCCccccCCCCCCCCCCCCCCCCCC------------------
Q psy8713          97 CPTIREIRDQGSCGSCW--------------------GCRPYEIAPCEHHVNGTRPSCDASK------------------  138 (309)
Q Consensus        97 cg~vtpVkdQg~CGSCW--------------------~cs~~~~~~C~~~~~g~~~~C~~~~------------------  138 (309)
                       |+|+||||||.|||||                    ..|++++.+|....  ....|.++.                  
T Consensus       107 -g~vtpVkdQG~CGsCwAfa~~~~~E~~~~i~~~~~~~lS~Q~lvdC~~~~--~~~~c~gg~~~~a~~y~~~~g~~~~e~  183 (316)
T d1cs8a_         107 -GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ--GNEGCNGGLMDYAFQYVQDNGGLDSEE  183 (316)
T ss_dssp             -TCCCCCCBCCSSSCHHHHHHHHHHHHHHHHHHSCCCCBCHHHHHHHCGGG--TCCGGGCBCHHHHHHHHHHHTCEEBTT
T ss_pred             -CcccccccCCCCceeeehhhhHHHHHHHHhhcCCcccchhhhhhhccccc--cCCCCCCCchHHHHHHHHhcCcccccc
Confidence             9999999999999999                    45667777775421  122233322                  


Q ss_pred             -CCCcccccccccCccccccccccceeee-eeecchHHHHHHHHHhcCCEEEEEecc-cccccCCCceEeCCCCcchhhh
Q psy8713         139 -GHTPKCVRECQENYDVPYKKDLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVF-DDLILYKSGRFFVPGNETTAMS  215 (309)
Q Consensus       139 -~~~~~~~~~c~~~~~~~~~~~~~~~~~~-~~v~~~~~~ik~~l~~~GPv~v~~~~~-~~f~~Y~sGiy~~~~~~~~~~~  215 (309)
                       +++..+...|......     ....... .....++++|+++|+++|||++++.+. .+|+.|.+|||..         
T Consensus       184 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~l~~~l~~~gpv~v~i~~~~~~f~~y~~Gi~~~---------  249 (316)
T d1cs8a_         184 SYPYEATEESCKYNPKY-----SVANDAGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFE---------  249 (316)
T ss_dssp             TSCCCSSCCCCCCCGGG-----EEECCCCEEECCSCHHHHHHHHHHHCCEEEEECCCSHHHHTEEEEEECC---------
T ss_pred             ccccccccccccccccc-----ccccccccccccCcHHHHHHHHHHhCCeEEEEEeccchhccccCCcccC---------
Confidence             2333333333221111     1111111 234568899999999999999999985 6799999999985         


Q ss_pred             hhhhhhcccCcccCcCCcceeccccccccCCCccCCceEEEEEeeccC--CCCccEEEEEcCCCCCCCCCceEEEEccC-
Q psy8713         216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE--KSKEKYWLIANSWNTDWGDNGLFKILRGK-  292 (309)
Q Consensus       216 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~HaV~iVGyg~~~--~~g~~YWiikNSWG~~WG~~Gy~~i~~g~-  292 (309)
                                                 ..|+...+||||+|||||.+.  .++++|||||||||++|||+|||||+|+. 
T Consensus       250 ---------------------------~~c~~~~~nHaV~iVGyG~d~~~~~g~~YWIikNSWG~~WGe~GY~ri~r~~~  302 (316)
T d1cs8a_         250 ---------------------------PDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR  302 (316)
T ss_dssp             ---------------------------TTCCSSCCCEEEEEEEEEEECCSSCCEEEEEEECSBCTTSTBTTEEEEECSSS
T ss_pred             ---------------------------CCCCCCcCCEEEEEEEEcccccCCCCCeEEEEEeCCCCCcccCCEEEEeeCCC
Confidence                                       346666789999999999653  36889999999999999999999999986 


Q ss_pred             CccccCcceEEEe
Q psy8713         293 DECGIESSITAGV  305 (309)
Q Consensus       293 n~cgi~~~~~~~~  305 (309)
                      |.|||++.+++++
T Consensus       303 n~CGI~~~~~yP~  315 (316)
T d1cs8a_         303 NHCGIASAASYPT  315 (316)
T ss_dssp             SGGGTTTSCEEEC
T ss_pred             CcCccCCeeeeee
Confidence            8999999987753



>d1xkga1 d.3.1.1 (A:4-305) Major mite fecal allergen der p 1 {House-dust mite (Dermatophagoides pteronyssinus) [TaxId: 6956]}
>d1gmya_ d.3.1.1 (A:) (Pro)cathepsin B {Human (Homo sapiens) [TaxId: 9606]}
>d1me4a_ d.3.1.1 (A:) Cruzain {Trypanosoma cruzi [TaxId: 5693]}
>d1m6da_ d.3.1.1 (A:) Cathepsin F {Human (Homo sapiens) [TaxId: 9606]}
>d2h7ja1 d.3.1.1 (A:1-217) (Pro)cathepsin S {Human (Homo sapiens) [TaxId: 9606]}
>d1yala_ d.3.1.1 (A:) Chymopapain {Papaya (Carica papaya) [TaxId: 3649]}
>d1deua_ d.3.1.1 (A:) (Pro)cathepsin X {Human (Homo sapiens) [TaxId: 9606]}
>d1cqda_ d.3.1.1 (A:) Proline-specific cysteine protease {Ginger rhizome (Zingiber officinale) [TaxId: 94328]}
>d2r6na1 d.3.1.1 (A:1-215) (Pro)cathepsin K {Human (Homo sapiens) [TaxId: 9606]}
>d1ppoa_ d.3.1.1 (A:) Caricain (protease omega) {Papaya (Carica papaya) [TaxId: 3649]}
>d1aeca_ d.3.1.1 (A:) Actinidin {Chinese gooseberry or kiwifruit (Actinidia chinensis) [TaxId: 3625]}
>d1fh0a_ d.3.1.1 (A:) (Pro)cathepsin V {Human (Homo sapiens) [TaxId: 9606]}
>d1khqa_ d.3.1.1 (A:) Papain {Papaya (Carica papaya) [TaxId: 3649]}
>d1s4va_ d.3.1.1 (A:) Vignain (bean endopeptidase) {Castor bean (Ricinus communis) [TaxId: 3988]}
>d1o0ea_ d.3.1.1 (A:) Ervatamin C {East indian rosebay (Ervatamia coronaria) [TaxId: 52861]}
>d1iwda_ d.3.1.1 (A:) Ervatamin B {Adam's apple (Ervatamia coronaria) [TaxId: 52861]}
>d3gcba_ d.3.1.1 (A:) Bleomycin hydrolase {Baker's yeast (Saccharomyces cerevisiae), Gal6 [TaxId: 4932]}
>d2cb5a_ d.3.1.1 (A:) Bleomycin hydrolase {Human (Homo sapiens) [TaxId: 9606]}
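
The hit headers in this listing share one layout: domain identifier, classification code, chain region in parentheses, domain name, then species and taxonomy ID in braces. A minimal Python sketch for splitting a header into those parts, assuming that layout holds for every line; the field names (`sid`, `sccs`, `region`, `name`, `species`, `taxid`) and the function `parse_header` are illustrative choices, not taken from the report.

```python
import re

# Pattern for lines such as:
# >d2oula1 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falciparum [TaxId: 5833]}
HEADER = re.compile(
    r"^>(?P<sid>\S+)\s+"                      # domain identifier
    r"(?P<sccs>[a-z]\.\d+\.\d+\.\d+)\s+"      # classification code, e.g. d.3.1.1
    r"\((?P<region>[^)]*)\)\s+"               # chain and residue range
    r"(?P<name>.+?)\s+"                       # domain name (may contain parentheses)
    r"\{(?P<species>.+)\s+\[TaxId:\s*(?P<taxid>\d+)\]\}"
)

def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER.match(line)
    return match.groupdict() if match else None

hit = parse_header(
    ">d2oula1 d.3.1.1 (A:-16-224) Falcipain 2 {Plasmodium falciparum [TaxId: 5833]}"
)
print(hit["sid"], hit["region"], hit["name"], hit["taxid"])
```

The pattern is not anchored at the end of the line, so any text that follows the closing brace, such as a `Length = ...` field, is ignored rather than rejected.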