RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy282
         (233 letters)



>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  123 bits (311), Expect = 4e-35
 Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 5/128 (3%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
           GLE+E DY Y+        C +   K K+   +D +  + +E  +   L K GP+SV +N
Sbjct: 80  GLETEDDYSYQGHMQ---SCQFSAEKAKV-YIQDSVELSQNEQKLAAWLAKRGPISVAIN 135

Query: 61  SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 120
           +  +  Y     R     CSP+ + HAVLLVGYG++ D+P+W ++NSWG    ++G++ +
Sbjct: 136 AFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWGEKGYYYL 195

Query: 121 ERGNNACG 128
            RG+ ACG
Sbjct: 196 HRGSGACG 203



 Score = 88.7 bits (221), Expect = 8e-22
 Identities = 30/79 (37%), Positives = 49/79 (62%)

Query: 138 ETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYW 197
           + +   L K GP+SV +N+  + FY     R     CSP+ + HAVLLVGYG++ D+P+W
Sbjct: 118 QKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFW 177

Query: 198 LVRNSWGPIGPDEGFFKIE 216
            ++NSWG    ++G++ + 
Sbjct: 178 AIKNSWGTDWGEKGYYYLH 196


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  117 bits (296), Expect = 6e-33
 Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 12/131 (9%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
            + +E  YPY +  G    C      V    TG   L  +    +   L   GP++V ++
Sbjct: 82  AVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD-EAQIAAWLAVNGPVAVAVD 140

Query: 61  SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
           +     Y G   T        C    L H VLLVGY     +PYW+++NSW     +EG+
Sbjct: 141 ASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGY 193

Query: 118 FKIERGNNACG 128
            +I +G+N C 
Sbjct: 194 IRIAKGSNQCL 204



 Score = 81.8 bits (203), Expect = 2e-19
 Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 10/84 (11%)

Query: 136 GSETMKKILYKYGPLSVGLNSHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQD 192
               +   L   GP++V +++     Y G   T        C    L H VLLVGY    
Sbjct: 121 DEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDSA 173

Query: 193 DIPYWLVRNSWGPIGPDEGFFKIE 216
            +PYW+++NSW     +EG+ +I 
Sbjct: 174 AVPYWIIKNSWTTQWGEEGYIRIA 197


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  107 bits (269), Expect = 7e-29
 Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 9/131 (6%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLN 60
           G+  E  Y Y         C    ++    +    ++   +  +++ L + +  ++V++ 
Sbjct: 87  GVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIG 143

Query: 61  SDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
              +     Y+G  I             HAV +VGY     + YW+VRNSW     D G+
Sbjct: 144 IKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGY 201

Query: 118 FKIERGNNACG 128
                  +   
Sbjct: 202 GYFAANIDLMM 212



 Score = 75.6 bits (187), Expect = 6e-17
 Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 7/86 (8%)

Query: 136 GSET--MKKILYKYGPLSVGLN-SHLI--HFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 190
            +     + +   +  ++V +    L     Y+G  I             HAV +VGY  
Sbjct: 122 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSN 179

Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
              + YW+VRNSW     D G+    
Sbjct: 180 AQGVDYWIVRNSWDTNWGDNGYGYFA 205


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  110 bits (277), Expect = 1e-28
 Identities = 40/136 (29%), Positives = 57/136 (41%), Gaps = 10/136 (7%)

Query: 2   GLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLL 59
           GL  E  +PY   +   K K    +     +      +   +E  MK  L  +GP++V  
Sbjct: 291 GLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAF 350

Query: 60  N--SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG--KQDDIPYWLVRNSWGPIG 112
               D +H Y           D         HAVLLVGYG      + YW+V+NSWG   
Sbjct: 351 EVYDDFLH-YKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGW 409

Query: 113 PDEGFFKIERGNNACG 128
            + G+F+I RG + C 
Sbjct: 410 GENGYFRIRRGTDECA 425



 Score = 76.8 bits (189), Expect = 2e-16
 Identities = 27/85 (31%), Positives = 37/85 (43%), Gaps = 6/85 (7%)

Query: 138 ETMKKILYKYGPLSVGLNSHL-IHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG--KQ 191
             MK  L  +GP++V    +     Y           D         HAVLLVGYG    
Sbjct: 334 ALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSA 393

Query: 192 DDIPYWLVRNSWGPIGPDEGFFKIE 216
             + YW+V+NSWG    + G+F+I 
Sbjct: 394 SGMDYWIVKNSWGTGWGENGYFRIR 418


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  105 bits (265), Expect = 3e-28
 Identities = 46/138 (33%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G+  E  YPYK  +     C +   K   F      +  N  E M + +  Y P+S    
Sbjct: 83  GIMGEDTYPYKGQDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE 139

Query: 61  SDLIHD--------YNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
               +D        Y+         +C  +P  + HAVL VGYG+++ IPYW+V+NSWGP
Sbjct: 140 VT--NDFLMYRKGIYS-------STSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGP 190

Query: 111 IGPDEGFFKIERGNNACG 128
                G+F IERG N CG
Sbjct: 191 QWGMNGYFLIERGKNMCG 208



 Score = 78.3 bits (194), Expect = 7e-18
 Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 12/88 (13%)

Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRK---NDETC--SPYDLGHAVLLVGY 188
             E  M + +  Y P+S     ++    Y     RK   +  +C  +P  + HAVL VGY
Sbjct: 119 NDEEAMVEAVALYNPVSFAFEVTNDFLMY-----RKGIYSSTSCHKTPDKVNHAVLAVGY 173

Query: 189 GKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           G+++ IPYW+V+NSWGP     G+F IE
Sbjct: 174 GEENGIPYWIVKNSWGPQWGMNGYFLIE 201


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  105 bits (264), Expect = 1e-27
 Identities = 37/155 (23%), Positives = 57/155 (36%), Gaps = 23/155 (14%)

Query: 2   GLESEKDYPYKNANGE-----------KFKCAYDKSKVKLFTGKDFLHFNGSETMKKILY 50
           G+  E    Y+  + E           +FK  +      L+   D+   +G E M   +Y
Sbjct: 122 GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIY 181

Query: 51  KYGPLSVLLN--SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 105
             GP+S  +     L + Y G          E      + H V + G+G  D   YW+VR
Sbjct: 182 ANGPISCGIMATERLAN-YTGGIYA------EYQDTTYINHVVSVAGWGISDGTEYWIVR 234

Query: 106 NSWGPIGPDEGFFKIERGNNACGKDFLHFNGSETM 140
           NSWG    + G+ +I       GK   +    E  
Sbjct: 235 NSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEH 269



 Score = 81.6 bits (202), Expect = 8e-19
 Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 10/92 (10%)

Query: 129 KDFLHFNGSETMKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVL 184
            D+   +G E M   +Y  GP+S G+  +  +  Y G          E      + H V 
Sbjct: 165 GDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYA------EYQDTTYINHVVS 218

Query: 185 LVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           + G+G  D   YW+VRNSWG    + G+ +I 
Sbjct: 219 VAGWGISDGTEYWIVRNSWGEPWGERGWLRIV 250


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  105 bits (265), Expect = 1e-27
 Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 9/131 (6%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLN 60
           G+  E  Y Y         C    ++    +    ++   +  +++ L + +  ++V++ 
Sbjct: 167 GVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIG 223

Query: 61  SDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
              +     Y+G  I             HAV +VGY     + YW+VRNSW     D G+
Sbjct: 224 IKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGY 281

Query: 118 FKIERGNNACG 128
                  +   
Sbjct: 282 GYFAANIDLMM 292



 Score = 74.6 bits (184), Expect = 5e-16
 Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 7/86 (8%)

Query: 136 GSET--MKKILYKYGPLSVGLN-SHLI--HFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 190
            +     + +   +  ++V +    L     Y+G  I             HAV +VGY  
Sbjct: 202 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSN 259

Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
              + YW+VRNSW     D G+    
Sbjct: 260 AQGVDYWIVRNSWDTNWGDNGYGYFA 285


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  105 bits (263), Expect = 4e-27
 Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 22/137 (16%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G++SE  YPY+ A+G    C YD ++V    +G  +L       +  ++   GP++V  +
Sbjct: 197 GIDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFD 253

Query: 61  SDLIHD--------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIG 112
           +D            Y          TC      HAVL+VGYG ++   YWLV+NSWG   
Sbjct: 254 AD--DPFGSYSGGVYYN-------PTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGW 304

Query: 113 PDEGFFKIERG-NNACG 128
             +G+FKI R  NN CG
Sbjct: 305 GLDGYFKIARNANNHCG 321



 Score = 81.1 bits (201), Expect = 2e-18
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 4/83 (4%)

Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
             E  +  ++   GP++V  +       Y+G      + TC      HAVL+VGYG ++ 
Sbjct: 233 PDENMLADMVATKGPVAVAFDADDPFGSYSGGVY--YNPTCETNKFTHAVLIVGYGNENG 290

Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
             YWLV+NSWG     +G+FKI 
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIA 313


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  103 bits (260), Expect = 9e-27
 Identities = 38/155 (24%), Positives = 55/155 (35%), Gaps = 36/155 (23%)

Query: 2   GLESEKDYPYKNANGE----------------------KFKCAYDKSKVKLFTGKDFLHF 39
           GL S+   PY   +                         + C      V  +        
Sbjct: 156 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYAL 215

Query: 40  NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 94
            G +   + L+  GP  V  +   D I  Y       +         Y  GHAV LVG+G
Sbjct: 216 QGEDDYMRELFFRGPFEVAFDVYEDFIA-Y------NSGVYHHVSGQYLGGHAVRLVGWG 268

Query: 95  KQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
             + +PYW + NSW    G  +G+F I RG++ CG
Sbjct: 269 TSNGVPYWKIANSWNTEWG-MDGYFLIRRGSSECG 302



 Score = 75.5 bits (186), Expect = 3e-16
 Identities = 25/94 (26%), Positives = 37/94 (39%), Gaps = 12/94 (12%)

Query: 128 GKDFLHFNGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAV 183
                   G +   + L+  GP  V  + +     Y       +         Y  GHAV
Sbjct: 209 SWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAY------NSGVYHHVSGQYLGGHAV 262

Query: 184 LLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIE 216
            LVG+G  + +PYW + NSW    G  +G+F I 
Sbjct: 263 RLVGWGTSNGVPYWKIANSWNTEWG-MDGYFLIR 295


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  101 bits (254), Expect = 2e-26
 Identities = 39/165 (23%), Positives = 68/165 (41%), Gaps = 36/165 (21%)

Query: 2   GLESEKDYPYK---------------NANGEKFKCAYDKSKVKLFTGKDFLHFNGS---- 42
            L +E +YPY                    +  K  ++K++     GK +  +       
Sbjct: 92  FLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHD 151

Query: 43  ------ETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 94
                 + +K  +   G +   +   + + ++++G   +K    C      HAV +VGYG
Sbjct: 152 NMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSG---KKVKNLCGDDTADHAVNIVGYG 208

Query: 95  KQDDI-----PYWLVRNSWGPIGPDEGFFKIER-GNNACGKDFLH 133
              +       YW+VRNSWGP   DEG+FK++  G   C  +F+H
Sbjct: 209 NYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIH 253



 Score = 78.1 bits (193), Expect = 1e-17
 Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 10/86 (11%)

Query: 138 ETMKKILYKYGPLSVGLN-SHLI-HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI- 194
           + +K  +   G +   +   +++ + ++G   +K    C      HAV +VGYG   +  
Sbjct: 158 KIIKTEVMNKGSVIAYIKAENVMGYEFSG---KKVKNLCGDDTADHAVNIVGYGNYVNSE 214

Query: 195 ----PYWLVRNSWGPIGPDEGFFKIE 216
                YW+VRNSWGP   DEG+FK++
Sbjct: 215 GEKKSYWIVRNSWGPYWGDEGYFKVD 240


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score = 99.1 bits (248), Expect = 8e-26
 Identities = 40/138 (28%), Positives = 65/138 (47%), Gaps = 23/138 (16%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G++SE  YPY         C Y+ +       G   +     + +K+ + + GP+SV ++
Sbjct: 80  GIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 136

Query: 61  SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI 111
           +              Y        DE+C+  +L HAVL VGYG Q    +W+++NSWG  
Sbjct: 137 AS--LTSFQFYSKGVY-------YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 187

Query: 112 GPDEGFFKIERG-NNACG 128
             ++G+  + R  NNACG
Sbjct: 188 WGNKGYILMARNKNNACG 205



 Score = 76.0 bits (188), Expect = 4e-17
 Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 15/89 (16%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
           G+E  +K+ + + GP+SV +++ L  F       Y        DE+C+  +L HAVL VG
Sbjct: 116 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY-------DESCNSDNLNHAVLAVG 168

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG Q    +W+++NSWG    ++G+  + 
Sbjct: 169 YGIQKGNKHWIIKNSWGENWGNKGYILMA 197


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  100 bits (252), Expect = 1e-25
 Identities = 45/137 (32%), Positives = 64/137 (46%), Gaps = 22/137 (16%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           GLE+E  YPY    G   +C Y+K       TG   +H      +K ++   GP +V ++
Sbjct: 172 GLETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVD 228

Query: 61  SDLIHD--------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIG 112
            +   D        Y         +TCSP  + HAVL VGYG Q    YW+V+NSWG   
Sbjct: 229 VE--SDFMMYRSGIYQS-------QTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSW 279

Query: 113 PDEGFFKIERG-NNACG 128
            + G+ ++ R   N CG
Sbjct: 280 GERGYIRMVRNRGNMCG 296



 Score = 77.7 bits (192), Expect = 3e-17
 Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 10/86 (11%)

Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRK---NDETCSPYDLGHAVLLVGYGK 190
           GSE  +K ++   GP +V ++       Y     R      +TCSP  + HAVL VGYG 
Sbjct: 208 GSEVELKNLVGAEGPAAVAVDVESDFMMY-----RSGIYQSQTCSPLRVNHAVLAVGYGT 262

Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
           Q    YW+V+NSWG    + G+ ++ 
Sbjct: 263 QGGTDYWIVKNSWGLSWGERGYIRMV 288


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  100 bits (252), Expect = 2e-25
 Identities = 40/130 (30%), Positives = 67/130 (51%), Gaps = 8/130 (6%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G+ SE  YPY+        C +D S+     +G   L      ++   + + GP++V ++
Sbjct: 195 GIMSESAYPYEAQGD---YCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAID 251

Query: 61  -SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
            +D +  Y+G      D+TC+  DL H VL+VGYG  +   YW+++NSWG    + G+++
Sbjct: 252 ATDELQFYSGGLF--YDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 309

Query: 120 IERG-NNACG 128
             R   N CG
Sbjct: 310 QVRNYGNNCG 319



 Score = 78.5 bits (194), Expect = 2e-17
 Identities = 27/83 (32%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
           G E  +   + + GP++V ++ +  + FY+G      D+TC+  DL H VL+VGYG  + 
Sbjct: 231 GDENSLADAVGQAGPVAVAIDATDELQFYSGGLF--YDQTCNQSDLNHGVLVVGYGSDNG 288

Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
             YW+++NSWG    + G+++  
Sbjct: 289 QDYWILKNSWGSGWGESGYWRQV 311


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score = 98.8 bits (247), Expect = 2e-25
 Identities = 42/139 (30%), Positives = 71/139 (51%), Gaps = 17/139 (12%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKL----FTGKDFLHFN---GSETMKKILYKYGP 54
           G+ SE DYPYK  +G   KC  ++ + K+    +  +   + +    +E+  +      P
Sbjct: 81  GIASEADYPYKARDG---KCKANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLEQP 137

Query: 55  LSVLLNSDLIHDYNGTPIRKNDETCS-PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGP 113
           +SV +++   H Y+G     +   CS PY + H VL+VGYG +D + YW+ +NSWG    
Sbjct: 138 ISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVGYGSEDGVDYWIAKNSWGEDWG 195

Query: 114 DEGFFKIERGNNA----CG 128
            +G+ +I+R        CG
Sbjct: 196 IDGYIRIQRNTGNLLGVCG 214



 Score = 76.5 bits (189), Expect = 5e-17
 Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 3/82 (3%)

Query: 136 GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCS-PYDLGHAVLLVGYGKQDDI 194
            +E+  +      P+SV +++   HFY+G     +   CS PY + H VL+VGYG +D +
Sbjct: 124 EAESSLQSFVLEQPISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVGYGSEDGV 181

Query: 195 PYWLVRNSWGPIGPDEGFFKIE 216
            YW+ +NSWG     +G+ +I+
Sbjct: 182 DYWIAKNSWGEDWGIDGYIRIQ 203


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score = 98.1 bits (245), Expect = 1e-24
 Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 15/134 (11%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G++SE  YPY         C Y+ +       G   +     + +K+ + + GP+SV ++
Sbjct: 179 GIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 235

Query: 61  SDLI--HDYNGTPIRK---NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDE 115
           + L     Y      K    DE+C+  +L HAVL VGYG Q    +W+++NSWG    ++
Sbjct: 236 ASLTSFQFY-----SKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 290

Query: 116 GFFKIERG-NNACG 128
           G+  + R  NNACG
Sbjct: 291 GYILMARNKNNACG 304



 Score = 75.0 bits (185), Expect = 4e-16
 Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 15/89 (16%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
           G+E  +K+ + + GP+SV +++ L  F       Y        DE+C+  +L HAVL VG
Sbjct: 215 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY-------DESCNSDNLNHAVLAVG 267

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG Q    +W+++NSWG    ++G+  + 
Sbjct: 268 YGIQKGNKHWIIKNSWGENWGNKGYILMA 296


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score = 93.7 bits (234), Expect = 1e-23
 Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 28/142 (19%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
           GL+SE+ YPY+        C Y+           F+     E  + K +   GP+SV ++
Sbjct: 82  GLDSEESYPYEATEE---SCKYNPKYSVA-NDTGFVDIPKQEKALMKAVATVGPISVAID 137

Query: 61  SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNS 107
           +   H+         Y        +  CS  D+ H VL+VGYG +    D+  YWLV+NS
Sbjct: 138 AG--HESFLFYKEGIY-------FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS 188

Query: 108 WGPIGPDEGFFKIERG-NNACG 128
           WG      G+ K+ +   N CG
Sbjct: 189 WGEEWGMGGYVKMAKDRRNHCG 210



 Score = 69.5 bits (171), Expect = 1e-14
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 18/93 (19%)

Query: 135 NGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
              + + K +   GP+SV +++    F       Y        +  CS  D+ H VL+VG
Sbjct: 117 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF-------EPDCSSEDMDHGVLVVG 169

Query: 188 YGKQ----DDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG +    D+  YWLV+NSWG      G+ K+ 
Sbjct: 170 YGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 202


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score = 93.3 bits (233), Expect = 1e-23
 Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 12/132 (9%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G++S+  YPYK  +    KC YD        +    L +   + +K+ +   GP+SV ++
Sbjct: 84  GIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVD 140

Query: 61  ---SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
                     +G      + +C+  ++ H VL+VGYG  +   YWLV+NSWG    +EG+
Sbjct: 141 ARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 196

Query: 118 FKIERG-NNACG 128
            ++ R   N CG
Sbjct: 197 IRMARNKGNHCG 208



 Score = 69.5 bits (171), Expect = 1e-14
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 8/85 (9%)

Query: 136 GSET-MKKILYKYGPLSVGLN-SHLI-HFY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQ 191
           G E  +K+ +   GP+SVG++  H     Y +G      + +C+  ++ H VL+VGYG  
Sbjct: 120 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDL 175

Query: 192 DDIPYWLVRNSWGPIGPDEGFFKIE 216
           +   YWLV+NSWG    +EG+ ++ 
Sbjct: 176 NGKEYWLVKNSWGHNFGEEGYIRMA 200


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score = 92.7 bits (231), Expect = 6e-23
 Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 14/109 (12%)

Query: 26  SKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCS 80
            + K +    +   N  + +   +YK GP+    +  SD +  Y      K+        
Sbjct: 147 KQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-Y------KSGVYQHVTG 199

Query: 81  PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
               GHA+ ++G+G ++  PYWLV NSW    G D GFFKI RG + CG
Sbjct: 200 EMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHCG 247



 Score = 75.8 bits (187), Expect = 9e-17
 Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 12/86 (13%)

Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
           N  + +   +YK GP+    + +     Y      K+            GHA+ ++G+G 
Sbjct: 161 NSEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGV 214

Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKI 215
           ++  PYWLV NSW    G D GFFKI
Sbjct: 215 ENGTPYWLVANSWNTDWG-DNGFFKI 239


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score = 92.7 bits (231), Expect = 1e-22
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 12/132 (9%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G++S+  YPYK  +    KC YD        +    L +   + +K+ +   GP+SV ++
Sbjct: 181 GIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVD 237

Query: 61  SDLI--HDY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
           +       Y +G      + +C+  ++ H VL+VGYG  +   YWLV+NSWG    +EG+
Sbjct: 238 ARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 293

Query: 118 FKIERG-NNACG 128
            ++ R   N CG
Sbjct: 294 IRMARNKGNHCG 305



 Score = 68.4 bits (168), Expect = 6e-14
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 16/89 (17%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
           G E  +K+ +   GP+SVG+++    F       Y        + +C+  ++ H VL+VG
Sbjct: 217 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYY-------EPSCTQ-NVNHGVLVVG 268

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG  +   YWLV+NSWG    +EG+ ++ 
Sbjct: 269 YGDLNGKEYWLVKNSWGHNFGEEGYIRMA 297


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score = 91.1 bits (227), Expect = 2e-22
 Identities = 30/115 (26%), Positives = 52/115 (45%), Gaps = 14/115 (12%)

Query: 20  KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-- 75
           K     ++ K      +   N  + ++K + KYGP+        D ++ Y      K+  
Sbjct: 137 KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLN-Y------KSGI 189

Query: 76  -DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
                     GHA+ ++G+G ++  PYWL+ NSW    G + G+F+I RG + C 
Sbjct: 190 YKHITGETLGGHAIRIIGWGVENKAPYWLIANSWNEDWG-ENGYFRIVRGRDECS 243



 Score = 76.9 bits (190), Expect = 3e-17
 Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 12/87 (13%)

Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
           N  + ++K + KYGP+  G   +     Y      K+            GHA+ ++G+G 
Sbjct: 157 NDEKAIQKEIMKYGPVEAGFTVYEDFLNY------KSGIYKHITGETLGGHAIRIIGWGV 210

Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKIE 216
           ++  PYWL+ NSW    G + G+F+I 
Sbjct: 211 ENKAPYWLIANSWNEDWG-ENGYFRIV 236


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score = 92.0 bits (229), Expect = 3e-22
 Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 14/109 (12%)

Query: 26  SKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCS 80
            + K +    +   N  + +   +YK GP+    +  SD +  Y      K+        
Sbjct: 204 KQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-Y------KSGVYQHVTG 256

Query: 81  PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
               GHA+ ++G+G ++  PYWLV NSW    G D GFFKI RG + CG
Sbjct: 257 EMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHCG 304



 Score = 74.7 bits (184), Expect = 4e-16
 Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 12/87 (13%)

Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
           N  + +   +YK GP+    + +     Y      K+            GHA+ ++G+G 
Sbjct: 218 NSEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGV 271

Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKIE 216
           ++  PYWLV NSW    G D GFFKI 
Sbjct: 272 ENGTPYWLVANSWNTDWG-DNGFFKIL 297


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score = 91.9 bits (229), Expect = 3e-22
 Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 28/142 (19%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
           GL+SE+ YPY+        C Y+           F+     E  + K +   GP+SV ++
Sbjct: 178 GLDSEESYPYEATEE---SCKYNPKYSVA-NDAGFVDIPKQEKALMKAVATVGPISVAID 233

Query: 61  SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNS 107
           +   H+         Y        +  CS  D+ H VL+VGYG    + D+  YWLV+NS
Sbjct: 234 AG--HESFLFYKEGIY-------FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS 284

Query: 108 WGPIGPDEGFFKIERG-NNACG 128
           WG      G+ K+ +   N CG
Sbjct: 285 WGEEWGMGGYVKMAKDRRNHCG 306



 Score = 68.8 bits (169), Expect = 6e-14
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 18/93 (19%)

Query: 135 NGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
              + + K +   GP+SV +++    F       Y        +  CS  D+ H VL+VG
Sbjct: 213 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF-------EPDCSSEDMDHGVLVVG 265

Query: 188 YG----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG    + D+  YWLV+NSWG      G+ K+ 
Sbjct: 266 YGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 298


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score = 89.8 bits (224), Expect = 3e-22
 Identities = 35/135 (25%), Positives = 53/135 (39%), Gaps = 16/135 (11%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+     YPYK   G    C   +    +        +  N    +   + K  P+SV++
Sbjct: 79  GIHLRSKYPYKAKQG---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVV 134

Query: 60  NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
            S       Y G      +  C    + HAV  VGYGK     Y L++NSWG    ++G+
Sbjct: 135 ESKGRPFQLYKGGIF---EGPCGT-KVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGY 190

Query: 118 FKIERGNNA----CG 128
            +I+R        CG
Sbjct: 191 IRIKRAPGNSPGVCG 205



 Score = 65.9 bits (162), Expect = 2e-13
 Identities = 25/89 (28%), Positives = 39/89 (43%), Gaps = 18/89 (20%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
            +E  +   + K  P+SV + S    F       + G         C    + HAV  VG
Sbjct: 116 NNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP--------CGT-KVDHAVTAVG 165

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YGK     Y L++NSWG    ++G+ +I+
Sbjct: 166 YGKSGGKGYILIKNSWGTAWGEKGYIRIK 194


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score = 89.5 bits (223), Expect = 3e-22
 Identities = 34/133 (25%), Positives = 58/133 (43%), Gaps = 14/133 (10%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
           G++++++YPY    G    C   + +V    G   +  N    ++  +    P+SV + +
Sbjct: 80  GIDTQQNYPYSAVQG---SCKPYRLRVVSINGFQRVTRNNESALQSAVAS-QPVSVTVEA 135

Query: 62  DLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
                  Y+          C      H V++VGYG Q    YW+VRNSWG    ++G+  
Sbjct: 136 AGAPFQHYSSGIF---TGPCGT-AQNHGVVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIW 191

Query: 120 IERGNNA----CG 128
           +ER   +    CG
Sbjct: 192 MERNVASSAGLCG 204



 Score = 63.3 bits (155), Expect = 2e-12
 Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 16/76 (21%)

Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
            P+SV + +    F       + G         C      H V++VGYG Q    YW+VR
Sbjct: 127 QPVSVTVEAAGAPFQHYSSGIFTGP--------CGT-AQNHGVVIVGYGTQSGKNYWIVR 177

Query: 201 NSWGPIGPDEGFFKIE 216
           NSWG    ++G+  +E
Sbjct: 178 NSWGQNWGNQGYIWME 193


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score = 90.7 bits (225), Expect = 5e-22
 Identities = 25/144 (17%), Positives = 51/144 (35%), Gaps = 19/144 (13%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKS-------------KVKLFTGKDFLHFNGSET-MKK 47
           G+  EK++PY +   +     +                  + +   ++         +K 
Sbjct: 137 GVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEYSRVAQDIDHLKA 196

Query: 48  ILYKYGPLSV--LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 105
            L    P      + +  + + +              + GHAVL VGY   D+I ++ +R
Sbjct: 197 CLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD--DEIRHFRIR 254

Query: 106 NSWGPIGPDEGFFKIERG-NNACG 128
           NSWG    ++G+F +     +   
Sbjct: 255 NSWGNNVGEDGYFWMPYEYISNTQ 278



 Score = 78.7 bits (194), Expect = 1e-17
 Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 4/80 (5%)

Query: 138 ETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP 195
           + +K  L    P   G + +   +   +              + GHAVL VGY   D+I 
Sbjct: 192 DHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD--DEIR 249

Query: 196 YWLVRNSWGPIGPDEGFFKI 215
           ++ +RNSWG    ++G+F +
Sbjct: 250 HFRIRNSWGNNVGEDGYFWM 269


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score = 88.4 bits (220), Expect = 2e-21
 Identities = 38/145 (26%), Positives = 59/145 (40%), Gaps = 29/145 (20%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN- 60
           GL S+ DYPY +   E   C   +   +  T K ++     +  K+ L   GP+S+ +  
Sbjct: 99  GLCSQDDYPYVSNLPET--CNLKRCNERY-TIKSYVSIP-DDKFKEALRYLGPISISIAA 154

Query: 61  SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG----------KQDDIPYWLVRNS 107
           SD    Y G            C      HAV+LVGYG          + +   Y++++NS
Sbjct: 155 SDDFAFYRGGFYDG------ECGA-APNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNS 207

Query: 108 WGPIGPDEGFFKIERGNNA----CG 128
           WG    + G+  +E   N     C 
Sbjct: 208 WGSDWGEGGYINLETDENGYKKTCS 232



 Score = 68.4 bits (168), Expect = 4e-14
 Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 21/91 (23%)

Query: 140 MKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG------ 189
            K+ L   GP+S+ +  S    FY G            C      HAV+LVGYG      
Sbjct: 138 FKEALRYLGPISISIAASDDFAFYRGGFYDG------ECGA-APNHAVILVGYGMKDIYN 190

Query: 190 ----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
               + +   Y++++NSWG    + G+  +E
Sbjct: 191 EDTGRMEKFYYYIIKNSWGSDWGEGGYINLE 221


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score = 87.6 bits (218), Expect = 2e-21
 Identities = 32/134 (23%), Positives = 62/134 (46%), Gaps = 15/134 (11%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G+ SE+ YPY+  +G    C    +   +     + +  +  ++++K +    P+SV ++
Sbjct: 82  GINSEETYPYRGQDG---ICNSTVNAPVVSIDSYENVPSHNEQSLQKAVAN-QPVSVTMD 137

Query: 61  SDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFF 118
           +       Y          +C+     HA+ +VGYG ++D  +W+V+NSWG    + G+ 
Sbjct: 138 AAGRDFQLYRSGIF---TGSCNI-SANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYI 193

Query: 119 KIERGNNA----CG 128
           + ER        CG
Sbjct: 194 RAERNIENPDGKCG 207



 Score = 65.6 bits (161), Expect = 3e-13
 Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 16/88 (18%)

Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
            +E   +      P+SV +++    F       + G+        C+     HA+ +VGY
Sbjct: 118 HNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGS--------CNI-SANHALTVVGY 168

Query: 189 GKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           G ++D  +W+V+NSWG    + G+ + E
Sbjct: 169 GTENDKDFWIVKNSWGKNWGESGYIRAE 196


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score = 86.8 bits (216), Expect = 5e-21
 Identities = 37/143 (25%), Positives = 60/143 (41%), Gaps = 31/143 (21%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+ +E +YPY+  +G    C   K      +      +  N    + K +    P+SV +
Sbjct: 82  GITTEANYPYEAYDG---TCDVSKENAPAVSIDGHENVPENDENALLKAVAN-QPVSVAI 137

Query: 60  NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWG 109
           ++              + G+        C   +L H V +VGYG   D   YW V+NSWG
Sbjct: 138 DAG--GSDFQFYSEGVFTGS--------CGT-ELDHGVAIVGYGTTIDGTKYWTVKNSWG 186

Query: 110 PIGPDEGFFKIERG----NNACG 128
           P   ++G+ ++ERG       CG
Sbjct: 187 PEWGEKGYIRMERGISDKEGLCG 209



 Score = 61.0 bits (149), Expect = 1e-11
 Identities = 25/89 (28%), Positives = 39/89 (43%), Gaps = 17/89 (19%)

Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
             E          P+SV +++    F       + G+        C   +L H V +VGY
Sbjct: 119 NDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGS--------CGT-ELDHGVAIVGY 169

Query: 189 GKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
           G   D   YW V+NSWGP   ++G+ ++E
Sbjct: 170 GTTIDGTKYWTVKNSWGPEWGEKGYIRME 198


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score = 86.5 bits (215), Expect = 9e-21
 Identities = 37/146 (25%), Positives = 58/146 (39%), Gaps = 31/146 (21%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
           G+  + DYPY +       C  D+   K        + +  +  +K+ L   GP+S+ + 
Sbjct: 97  GICPDGDYPYVSDAPNL--CNIDRCTEK---YGIKNYLSVPDNKLKEALRFLGPISISVA 151

Query: 61  -SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG----------KQDDIPYWLVRN 106
            SD    Y              C    L HAV+LVG+G          K +   Y++++N
Sbjct: 152 VSDDFAFYKEGIFDG------ECGD-QLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKN 204

Query: 107 SWGPIGPDEGFFKIERGNNA----CG 128
           SWG    + GF  IE   +     CG
Sbjct: 205 SWGQQWGERGFINIETDESGLMRKCG 230



 Score = 66.1 bits (162), Expect = 2e-13
 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 21/91 (23%)

Query: 140 MKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG------ 189
           +K+ L   GP+S+ +  S    FY              C    L HAV+LVG+G      
Sbjct: 136 LKEALRFLGPISISVAVSDDFAFYKEGIFDG------ECGD-QLNHAVMLVGFGMKEIVN 188

Query: 190 ----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
               K +   Y++++NSWG    + GF  IE
Sbjct: 189 PLTKKGEKHYYYIIKNSWGQQWGERGFINIE 219


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score = 85.2 bits (212), Expect = 1e-20
 Identities = 37/141 (26%), Positives = 61/141 (43%), Gaps = 29/141 (20%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+ +E +YPY    G   +C  D  + K  +   +  + +N    ++  +    P+SV L
Sbjct: 82  GINTEANYPYTAEEG---QCNLDLQQEKYVSIDTYENVPYNNEWALQTAVAY-QPVSVAL 137

Query: 60  NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
            +              + G         C    + HAV +VGYG +  I YW+V+NSWG 
Sbjct: 138 EAA--GYNFQHYSSGIFTG--------PCGT-AVDHAVTIVGYGTEGGIDYWIVKNSWGT 186

Query: 111 IGPDEGFFKIERG---NNACG 128
              +EG+ +I+R       CG
Sbjct: 187 TWGEEGYMRIQRNVGGVGQCG 207



 Score = 63.3 bits (155), Expect = 2e-12
 Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 6/71 (8%)

Query: 148 GPLSVGLN-SHLI-HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 205
            P+SV L  +      Y+          C    + HAV +VGYG +  I YW+V+NSWG 
Sbjct: 131 QPVSVALEAAGYNFQHYSSGIF---TGPCGT-AVDHAVTIVGYGTEGGIDYWIVKNSWGT 186

Query: 206 IGPDEGFFKIE 216
              +EG+ +I+
Sbjct: 187 TWGEEGYMRIQ 197


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score = 83.3 bits (207), Expect = 7e-20
 Identities = 31/135 (22%), Positives = 56/135 (41%), Gaps = 20/135 (14%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+   + YPY+       +C   ++K           +  N  + + + +    P+S+++
Sbjct: 79  GIHLRQYYPYEGVQR---QCRASQAKGPKVKTDGVGRVPRNNEQALIQRIAI-QPVSIVV 134

Query: 60  NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
            +      +Y G         C    + HAV  VGYG      Y L++NSWG    + G+
Sbjct: 135 EAKGRAFQNYRGGIF---AGPCGT-SIDHAVAAVGYGN----DYILIKNSWGTGWGEGGY 186

Query: 118 FKIERGNNA----CG 128
            +I+RG+      CG
Sbjct: 187 IRIKRGSGNPQGACG 201



 Score = 59.0 bits (144), Expect = 5e-11
 Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 20/76 (26%)

Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
            P+S+ + +    F       + G         C    + HAV  VGYG      Y L++
Sbjct: 128 QPVSIVVEAKGRAFQNYRGGIFAGP--------CGT-SIDHAVAAVGYGN----DYILIK 174

Query: 201 NSWGPIGPDEGFFKIE 216
           NSWG    + G+ +I+
Sbjct: 175 NSWGTGWGEGGYIRIK 190


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score = 83.3 bits (207), Expect = 7e-20
 Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 16/135 (11%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+ + K YPY+       KC              +  +  N   +    L    PLSVL+
Sbjct: 79  GVHTSKVYPYQAKQY---KCRATDKPGPKVKITGYKRVPSNCETSFLGALAN-QPLSVLV 134

Query: 60  NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
            +       Y        D  C    L HAV  VGYG  D   Y +++NSWGP   ++G+
Sbjct: 135 EAGGKPFQLYKSGVF---DGPCGT-KLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGY 190

Query: 118 FKIERGNNA----CG 128
            +++R +      CG
Sbjct: 191 MRLKRQSGNSQGTCG 205



 Score = 63.3 bits (155), Expect = 2e-12
 Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 18/89 (20%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
             ET     L    PLSV + +    F       ++G         C    L HAV  VG
Sbjct: 116 NCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP--------CGT-KLDHAVTAVG 165

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG  D   Y +++NSWGP   ++G+ +++
Sbjct: 166 YGTSDGKNYIIIKNSWGPNWGEKGYMRLK 194


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score = 83.3 bits (207), Expect = 8e-20
 Identities = 36/135 (26%), Positives = 60/135 (44%), Gaps = 20/135 (14%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+ +E++YPY+   G   +C     K        +  +  N   ++ + +    P+SV+ 
Sbjct: 79  GVHTEREYPYEKKQG---RCRAKDKKGPKVYITGYKYVPANDEISLIQAIAN-QPVSVVT 134

Query: 60  NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
           +S       Y G      +  C   +  HAV  VGYGK     Y L++NSWGP   ++G+
Sbjct: 135 DSRGRGFQFYKGGIY---EGPCGT-NTDHAVTAVGYGK----TYLLLKNSWGPNWGEKGY 186

Query: 118 FKIERG----NNACG 128
            +I+R        CG
Sbjct: 187 IRIKRASGRSKGTCG 201



 Score = 59.4 bits (145), Expect = 5e-11
 Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 22/89 (24%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
             E  + + +    P+SV  +S    F       Y G         C   +  HAV  VG
Sbjct: 116 NDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP--------CGT-NTDHAVTAVG 165

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YGK     Y L++NSWGP   ++G+ +I+
Sbjct: 166 YGK----TYLLLKNSWGPNWGEKGYIRIK 190


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score = 84.2 bits (209), Expect = 2e-19
 Identities = 33/142 (23%), Positives = 51/142 (35%), Gaps = 30/142 (21%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+     YPYK   G    C   +    +        +  N    +   + K  P+SV++
Sbjct: 185 GIHLRSKYPYKAKQG---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVV 240

Query: 60  NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
            S              + G         C    +  AV  VGYGK     Y L++NSWG 
Sbjct: 241 ESK--GRPFQLYKGGIFEG--------PCGT-KVDGAVTAVGYGKSGGKGYILIKNSWGT 289

Query: 111 IGPDEGFFKIERGNNA----CG 128
              ++G+ +I+R        CG
Sbjct: 290 AWGEKGYIRIKRAPGNSPGVCG 311



 Score = 62.3 bits (152), Expect = 1e-11
 Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 16/76 (21%)

Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
            P+SV + S    F       + G         C    +  AV  VGYGK     Y L++
Sbjct: 234 QPVSVVVESKGRPFQLYKGGIFEGP--------CGT-KVDGAVTAVGYGKSGGKGYILIK 284

Query: 201 NSWGPIGPDEGFFKIE 216
           NSWG    ++G+ +I+
Sbjct: 285 NSWGTAWGEKGYIRIK 300


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score = 81.9 bits (203), Expect = 2e-19
 Identities = 34/131 (25%), Positives = 55/131 (41%), Gaps = 17/131 (12%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
           G++++ +YPYK   G    C      V +  G + + F  +E   K      P +V +++
Sbjct: 80  GIDTQANYPYKAVQG---PCQAASKVVSI-DGYNGVPFC-NEXALKQAVAVQPSTVAIDA 134

Query: 62  DL--IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
                  Y+               L H V +VGY       YW+VRNSWG    ++G+ +
Sbjct: 135 SSAQFQQYSSGIF----SGPCGTKLNHGVTIVGYQA----NYWIVRNSWGRYWGEKGYIR 186

Query: 120 IER--GNNACG 128
           + R  G   CG
Sbjct: 187 MLRVGGCGLCG 197



 Score = 52.3 bits (126), Expect = 1e-08
 Identities = 20/83 (24%), Positives = 33/83 (39%), Gaps = 10/83 (12%)

Query: 136 GSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
            +E   K      P +V +++       Y+               L H V +VGY     
Sbjct: 114 CNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIF----SGPCGTKLNHGVTIVGYQA--- 166

Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
             YW+VRNSWG    ++G+ ++ 
Sbjct: 167 -NYWIVRNSWGRYWGEKGYIRML 188


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score = 81.3 bits (202), Expect = 4e-19
 Identities = 31/135 (22%), Positives = 47/135 (34%), Gaps = 20/135 (14%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
           G+     YPY+        C   +             +       +   +    P+SV+L
Sbjct: 79  GIHYRNTYPYEGVQR---YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVL 134

Query: 60  NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
            +       Y G         C    + HAV  VGYG      Y L++NSWG    + G+
Sbjct: 135 EAAGKDFQLYRGGIF---VGPCGN-KVDHAVAAVGYGP----NYILIKNSWGTGWGENGY 186

Query: 118 FKIERGNNA----CG 128
            +I+RG       CG
Sbjct: 187 IRIKRGTGNSYGVCG 201



 Score = 59.0 bits (144), Expect = 5e-11
 Identities = 23/89 (25%), Positives = 36/89 (40%), Gaps = 22/89 (24%)

Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
            +E  +   +    P+SV L +    F       + G         C    + HAV  VG
Sbjct: 116 YNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP--------CGN-KVDHAVAAVG 165

Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
           YG      Y L++NSWG    + G+ +I+
Sbjct: 166 YGP----NYILIKNSWGTGWGENGYIRIK 190


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score = 81.4 bits (202), Expect = 5e-19
 Identities = 34/142 (23%), Positives = 55/142 (38%), Gaps = 30/142 (21%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETMKKILYKYGPLSVLLN 60
           G+ SE  YPY   +    +C     +  +   G   +       MK  L K  P+S+ + 
Sbjct: 88  GICSEDAYPYLARDE---ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIE 143

Query: 61  SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD--IPYWLVRNSWG 109
           +D             ++         +C   DL H VLLVGYG   +    +W+++NSWG
Sbjct: 144 AD--QMPFQFYHEGVFDA--------SCGT-DLDHGVLLVGYGTDKESKKDFWIMKNSWG 192

Query: 110 PIGPDEGFFKIERG---NNACG 128
                +G+  +         CG
Sbjct: 193 TGWGRDGYMYMAMHKGEEGQCG 214



 Score = 59.1 bits (144), Expect = 7e-11
 Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 18/78 (23%)

Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD--IPYWL 198
            P+S+ + +  + F       ++ +        C   DL H VLLVGYG   +    +W+
Sbjct: 136 SPVSIAIEADQMPFQFYHEGVFDAS--------CGT-DLDHGVLLVGYGTDKESKKDFWI 186

Query: 199 VRNSWGPIGPDEGFFKIE 216
           ++NSWG     +G+  + 
Sbjct: 187 MKNSWGTGWGRDGYMYMA 204


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score = 80.3 bits (198), Expect = 1e-18
 Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 13/135 (9%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
           G+ S+ +YPY   +G    C  +K       G  + +   S +         P+SV + +
Sbjct: 80  GIASDANYPYTGVDG---TCDLNKPIAARIDG--YTNVPNSSSALLDAVAKQPVSVNIYT 134

Query: 62  DL--IHDYNGTPIRKNDE-TCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGF 117
                  Y G  I      +  P  + H VL+VGYG       YW+V+NSWG     +G+
Sbjct: 135 SSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGY 194

Query: 118 FKIERGNNA----CG 128
             I R  N     C 
Sbjct: 195 ILIRRNTNRPDGVCA 209



 Score = 60.6 bits (147), Expect = 2e-11
 Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 4/111 (3%)

Query: 110 PIGPDEGFFKIERGNNACGKDFLHFNGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPI 167
           P    +G   + +   A    + +   S +         P+SV + +       Y G  I
Sbjct: 88  PYTGVDGTCDLNKPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGI 147

Query: 168 RKNDE-TCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
                 +  P  + H VL+VGYG       YW+V+NSWG     +G+  I 
Sbjct: 148 FAGSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIR 198


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score = 80.0 bits (198), Expect = 3e-18
 Identities = 38/146 (26%), Positives = 61/146 (41%), Gaps = 34/146 (23%)

Query: 2   GLESEKDYPYKNANGEKFKCAYDKSKVKL-----FTGKDFLHFNGSETMKKILYKYGPLS 56
           GL +E  YPY+ A G    C   ++           G   +  N  E + + +    P+S
Sbjct: 84  GLITEAAYPYRAARG---TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVS 139

Query: 57  VLLNSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRN 106
           V + +              + G         C   +L H V +VGYG  +D   YW V+N
Sbjct: 140 VAVEAS--GKAFMFYSEGVFTGE--------CGT-ELDHGVAVVGYGVAEDGKAYWTVKN 188

Query: 107 SWGPIGPDEGFFKIERGNNA----CG 128
           SWGP   ++G+ ++E+ + A    CG
Sbjct: 189 SWGPSWGEQGYIRVEKDSGASGGLCG 214



 Score = 61.5 bits (150), Expect = 1e-11
 Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 17/89 (19%)

Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
            SE          P+SV + +    F       + G         C   +L H V +VGY
Sbjct: 124 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGE--------CGT-ELDHGVAVVGY 174

Query: 189 GKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
           G  +D   YW V+NSWGP   ++G+ ++E
Sbjct: 175 GVAEDGKAYWTVKNSWGPSWGEQGYIRVE 203


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 38.7 bits (89), Expect = 9e-04
 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 86  HAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKI 120
           H + + G  K Q+   Y++V+NSWG      G +  
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYA 353



 Score = 38.7 bits (89), Expect = 9e-04
 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 181 HAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKI 215
           H + + G  K Q+   Y++V+NSWG      G +  
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYA 353


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 33.2 bits (75), Expect = 0.072
 Identities = 9/36 (25%), Positives = 15/36 (41%), Gaps = 3/36 (8%)

Query: 86  HAVLLVGYGKQDD---IPYWLVRNSWGPIGPDEGFF 118
            A+L+ G    +       + V NSWG     +G +
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408



 Score = 33.2 bits (75), Expect = 0.072
 Identities = 9/36 (25%), Positives = 15/36 (41%), Gaps = 3/36 (8%)

Query: 181 HAVLLVGYGKQDD---IPYWLVRNSWGPIGPDEGFF 213
            A+L+ G    +       + V NSWG     +G +
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 31.3 bits (70), Expect = 0.26
 Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 4/37 (10%)

Query: 86  HAVLLVGYGKQDDIP----YWLVRNSWGPIGPDEGFF 118
           HA+      ++DD       W V NSWG     +G+ 
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407



 Score = 31.3 bits (70), Expect = 0.26
 Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 4/37 (10%)

Query: 181 HAVLLVGYGKQDDIP----YWLVRNSWGPIGPDEGFF 213
           HA+      ++DD       W V NSWG     +G+ 
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407


>1atg_A MODA, periplasmic molybdate-binding protein; tungstate, ABC
           transporter; 1.20A {Azotobacter vinelandii} SCOP:
           c.94.1.1
          Length = 231

 Score = 27.2 bits (61), Expect = 3.8
 Identities = 4/25 (16%), Positives = 8/25 (32%)

Query: 124 NNACGKDFLHFNGSETMKKILYKYG 148
             A  + F+ +        I+   G
Sbjct: 202 EKANAEQFMSWMKGPKAVAIIKAAG 226


>1x9y_A Cysteine proteinase; half-barrel, barrel-sandwich-hybrid,
           hydrolase; 2.50A {Staphylococcus aureus} SCOP: d.3.1.1
           d.17.1.4
          Length = 367

 Score = 27.3 bits (59), Expect = 4.3
 Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 20/108 (18%)

Query: 8   DYPYKNANGEKFK-CAYDKSKVKLFT---GKDFLHFNGSET---MKKILYKYGPLSVLLN 60
              Y   + +    CA   +++  +    G+D  +  G  +   + ++      + +L  
Sbjct: 234 RTLYPEVSEQDLPNCATFPNQMIEYGKSQGRDIHYQEGVPSYNQVDQLTKDNVGIMILAQ 293

Query: 61  SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 108
           S               +  +   LGHA+ +VG  K +D    +  N W
Sbjct: 294 S-------------VSQNPNDPHLGHALAVVGNAKINDQEKLIYWNPW 328



 Score = 26.9 bits (58), Expect = 6.2
 Identities = 10/33 (30%), Positives = 16/33 (48%)

Query: 171 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 203
            +  +   LGHA+ +VG  K +D    +  N W
Sbjct: 296 SQNPNDPHLGHALAVVGNAKINDQEKLIYWNPW 328


>2b9s_A Topoisomerase I-like protein; vanadate complex, isomerase/DNA
           complex; HET: DNA; 2.27A {Leishmania donovani}
          Length = 432

 Score = 27.3 bits (60), Expect = 5.5
 Identities = 9/47 (19%), Positives = 19/47 (40%)

Query: 22  AYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYN 68
            + KS +   + K F  +N S T+ +   +         +D +  +N
Sbjct: 371 DHLKSFMDGLSAKVFRTYNASITLDRWFKEKPVDPKWSTADKLAYFN 417


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.319    0.142    0.459 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,732,767
Number of extensions: 217514
Number of successful extensions: 546
Number of sequences better than 10.0: 1
Number of HSP's gapped: 446
Number of HSP's successfully gapped: 94
Length of query: 233
Length of database: 6,701,793
Length adjustment: 90
Effective length of query: 143
Effective length of database: 4,188,903
Effective search space: 599013129
Effective search space used: 599013129
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.1 bits)