RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy4960
         (341 letters)



>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  210 bits (537), Expect = 2e-66
 Identities = 74/308 (24%), Positives = 118/308 (38%), Gaps = 25/308 (8%)

Query: 40  VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-YGTSGSSDRSPQEILQR-TGL 97
           +  F+ Y   +N++Y    + +   + F +  K         +  SD S  E   R    
Sbjct: 5   IKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGGAINHLSDLSLDEFKNRFLMS 64

Query: 98  RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
               +  E L+   +   +       G  P  +D RQ +   + P+  QG CGS WAF+ 
Sbjct: 65  A---EAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRT--VTPIRMQGGCGSAWAFSG 119

Query: 158 TAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRN 217
            A  ES     +     L++ +LV+C      C+G  I    EY++  G+  ++ Y Y  
Sbjct: 120 VAATESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYVA 178

Query: 218 KENITFRCTYEKEKA---KVFVQDTWVTSG-VDHMMHLL--QSGPIGVYLNHRLIES--- 268
           +E     C     +      + Q   +     + +   L      I V +  + +++   
Sbjct: 179 REQ---SCRRPNAQRFGISNYCQ---IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRH 232

Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
           YDG  I             HAV IVGY    G+  WIVRNSW     D+GY       + 
Sbjct: 233 YDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDL 290

Query: 329 CGIESYAY 336
             IE Y Y
Sbjct: 291 MMIEEYPY 298


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  203 bits (518), Expect = 1e-64
 Identities = 77/212 (36%), Positives = 115/212 (54%), Gaps = 8/212 (3%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P   DWR  K  V   V+ QG CGSCWAF+ T  +E Q  L + TL  LS+ +L++CD  
Sbjct: 2   PPEWDWRS-KGAV-TKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 59

Query: 187 NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
           +  C GG    A+  +K   GLE++ DY Y+        C +  EKAKV++QD   ++  
Sbjct: 60  DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQ---SCQFSAEKAKVYIQDSVELSQN 116

Query: 245 VDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
              +   L   GPI V +N   ++ Y     R     C+P  +DHAV +VGYG+++ +  
Sbjct: 117 EQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPF 176

Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
           W ++NSWG    + GY+ + RG+ ACG+ + A
Sbjct: 177 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMA 208


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  200 bits (511), Expect = 1e-63
 Identities = 62/222 (27%), Positives = 92/222 (41%), Gaps = 20/222 (9%)

Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
           G  P  +D RQ    V  P+  QG CGS WAF+  A  ES     ++    L++ +LV+C
Sbjct: 8   GNAPAEIDLRQ-MRTV-TPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELVDC 65

Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEK-AKV--FVQDTW 240
                 C+G  I    EY++  G+  ++ Y Y  +E     C     +   +  + Q   
Sbjct: 66  A-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQ--- 118

Query: 241 VTSG-VDHMMHLL--QSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVG 294
           +     + +   L      I V +  + +++   YDG  I             HAV IVG
Sbjct: 119 IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVG 176

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
           Y    G+  WIVRNSW     D+GY       +   IE Y Y
Sbjct: 177 YSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPY 218


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  202 bits (517), Expect = 4e-63
 Identities = 85/329 (25%), Positives = 143/329 (43%), Gaps = 63/329 (19%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
           + +  + +   ++Y+   E   R   FK +  +  E+   +     +         D S 
Sbjct: 25  EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84

Query: 89  QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           +E L      +    +            +      K PL  S+DWR + V   + V+ QG
Sbjct: 85  EEFLAYVNRGKAQKPK-------HPENLRMPYVSSKKPLAASVDWRSNAV---SEVKDQG 134

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQY 205
           +CGS W+F+TT  +E Q+AL +  L  LS+  L++C    GN  C+GG +D AF Y+  Y
Sbjct: 135 QCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDY 194

Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLL-QSGPI----- 257
           G+ S++ YPY  + +    C ++  ++   +     + SG  + +   + Q+GP+     
Sbjct: 195 GIMSESAYPYEAQGD---YCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAID 251

Query: 258 ----------GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVR 307
                     G++                 D  CN   L+H V +VGYG  NG   WI++
Sbjct: 252 ATDELQFYSGGLF----------------YDQTCNQSDLNHGVLVVGYGSDNGQDYWILK 295

Query: 308 NSWGDIGPDHGYFQIERG-ANACGIESYA 335
           NSWG    + GY++  R   N CGI + A
Sbjct: 296 NSWGSGWGESGYWRQVRNYGNNCGIATAA 324


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  202 bits (515), Expect = 4e-63
 Identities = 84/315 (26%), Positives = 140/315 (44%), Gaps = 34/315 (10%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
             ++ +     + Y +  +  +R   ++++ K    +         +         D + 
Sbjct: 9   THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68

Query: 89  QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           +E++Q  TGL++                       +G  P S+D+R+ K  V  PV++QG
Sbjct: 69  EEVVQKMTGLKVPLSH-------SRSNDTLYIPEWEGRAPDSVDYRK-KGYV-TPVKNQG 119

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YG 206
           +CGSCWAF++   LE Q+      L  LS   LV+C   N  C GG +  AF+YV++  G
Sbjct: 120 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRG 179

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMH-LLQSGPIGVYLN- 262
           ++S+  YPY  +E     C Y         +    +  G    +   + + GP+ V ++ 
Sbjct: 180 IDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDA 236

Query: 263 -HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
                + Y        D +CN   L+HAV  VGYG + G   WI++NSWG+   + GY  
Sbjct: 237 SLTSFQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYIL 294

Query: 322 IERG-ANACGIESYA 335
           + R   NACGI + A
Sbjct: 295 MARNKNNACGIANLA 309


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  200 bits (510), Expect = 4e-62
 Identities = 84/319 (26%), Positives = 139/319 (43%), Gaps = 44/319 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--------SDRSPQEIL 92
             F ++++  N+ Y + +E   RFE FK +    DE    + S        +D S  E  
Sbjct: 20  QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFN 79

Query: 93  Q-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
           +   G  +            +   +         LP+++DWR+ K  V  PV  QG CGS
Sbjct: 80  EKYVGSLIDATI-------EQSYDEEFINEDIVNLPENVDWRK-KGAV-TPVRHQGSCGS 130

Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
           CWAF+  A +E    +    L  LS+ +LV+C+  +  C GG    A EYV + G+  ++
Sbjct: 131 CWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRS 190

Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT---WVTSG-VDHMMHLLQSGPIGVYLN--HRL 265
            YPY+ K+     C  ++    + V+ +    V      ++++ +   P+ V +    R 
Sbjct: 191 KYPYKAKQG---TCRAKQVGGPI-VKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 246

Query: 266 IESYDG----NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
            + Y G     P       C   K+D AV  VGYG+  G    +++NSWG    + GY +
Sbjct: 247 FQLYKGGIFEGP-------CGT-KVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIR 298

Query: 322 IERGANA----CGIESYAY 336
           I+R        CG+   +Y
Sbjct: 299 IKRAPGNSPGVCGLYKSSY 317


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  200 bits (510), Expect = 5e-62
 Identities = 91/316 (28%), Positives = 148/316 (46%), Gaps = 30/316 (9%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
           + ++ +   + R+Y +  E   R + F++  +  +E+   Y     S         D +P
Sbjct: 20  EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79

Query: 89  QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           +E+     GL               + ++ L        P S DWR  +  V +PV++QG
Sbjct: 80  EEMKAYTHGLI--MPADLHKNGIPIKTREDLGLNASVRYPASFDWRD-QGMV-SPVKNQG 135

Query: 148 RCGSCWAFATTAILESQVALLKKTL--YPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
            CGS WAF++T  +ESQ+ +         +S+ QLV+C    L C+GG ++ AF YV Q 
Sbjct: 136 SCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQN 195

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVYL 261
            G++S+  YPY   +     C Y+  +    +    +++    + +  ++ + GP+ V  
Sbjct: 196 GGIDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAF 252

Query: 262 N-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
           +      SY G      +  C  +K  HAV IVGYG +NG   W+V+NSWGD     GYF
Sbjct: 253 DADDPFGSYSGGVY--YNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDGYF 310

Query: 321 QIERGA-NACGIESYA 335
           +I R A N CGI   A
Sbjct: 311 KIARNANNHCGIAGVA 326


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  194 bits (496), Expect = 2e-61
 Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 17/217 (7%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P ++DWR  +  V   V+ QG+CGSCWAF+    +E Q  L    L  LS+  LV CD  
Sbjct: 2   PAAVDWRA-RGAV-TAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKT 59

Query: 187 NLNCNGGNIDVAFEYVKQY---GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVT 242
           +  C+GG ++ AFE++ Q     + ++  YPY + E I+  CT         +     + 
Sbjct: 60  DSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELP 119

Query: 243 SGVDHMMH-LLQSGPIGVYLNHRLIESYDG---NPIRRNDWACNPHKLDHAVAIVGYGEK 298
                +   L  +GP+ V ++     +Y G            C   +LDH V +VGY + 
Sbjct: 120 QDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDS 172

Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
             +  WI++NSW     + GY +I +G+N C ++  A
Sbjct: 173 AAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEA 209


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  197 bits (503), Expect = 3e-61
 Identities = 88/314 (28%), Positives = 143/314 (45%), Gaps = 34/314 (10%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
           D +  +   +N+ Y   ++ + R   ++++ K   E+   +     +         D + 
Sbjct: 3   DLWHQWKRMYNKEYNGADD-QHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           +E   +       +      A          E     +P  +DWR+S    +  V+ QG 
Sbjct: 62  EEFKAK----YLTE---MSRASDILSHGVPYEANNRAVPDKIDWRESGY--VTEVKDQGN 112

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQYG 206
           CGS WAF+TT  +E Q    ++T    S+ QLV+C    GN  C GG ++ A++Y+KQ+G
Sbjct: 113 CGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQFG 172

Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVYLN- 262
           LE+++ YPY   E    +C Y K+     V     V SG    + +L+ + GP  V ++ 
Sbjct: 173 LETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDV 229

Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
                 Y           C+P +++HAV  VGYG + G   WIV+NSWG    + GY ++
Sbjct: 230 ESDFMMYRSGIY--QSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRM 287

Query: 323 ERG-ANACGIESYA 335
            R   N CGI S A
Sbjct: 288 VRNRGNMCGIASLA 301


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  189 bits (484), Expect = 1e-59
 Identities = 67/225 (29%), Positives = 109/225 (48%), Gaps = 28/225 (12%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+++DWR+     + PV  QG CGSCWAF+  A +E    +    L  LS+ +LV+C+ 
Sbjct: 1   LPENVDWRKKGA--VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER 58

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
            +  C GG    A EYV + G+  ++ YPY+ K+     C  ++    + V+ +    V 
Sbjct: 59  RSHGCKGGYPPYALEYVAKNGIHLRSKYPYKAKQG---TCRAKQVGGPI-VKTSGVGRVQ 114

Query: 243 SGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
                ++++ +   P+ V +    R  + Y G     P       C   K+DHAV  VGY
Sbjct: 115 PNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGP-------CGT-KVDHAVTAVGY 166

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
           G+  G    +++NSWG    + GY +I+R        CG+   +Y
Sbjct: 167 GKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSY 211


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  190 bits (485), Expect = 2e-59
 Identities = 66/225 (29%), Positives = 113/225 (50%), Gaps = 22/225 (9%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
            P+S DW + K  +   V+ QG+CGS WAF+ T  +E+  A+    L  LS+ +L++C  
Sbjct: 2   APESWDWSK-KGVI-TKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVD 59

Query: 186 GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
            +  C  G    +FE+V ++ G+ S+ADYPY+ ++    +C   + + KV + +  V   
Sbjct: 60  ESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDG---KCKANEIQDKVTIDNYGVQIL 116

Query: 245 V---------DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWAC-NPHKLDHAVAIVG 294
                       +   +   PI V ++ +    Y G     +   C +P+ ++H V IVG
Sbjct: 117 SNESTESEAESSLQSFVLEQPISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVG 174

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
           YG ++G+  WI +NSWG+     GY +I+R        CG+  +A
Sbjct: 175 YGSEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMNYFA 219


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  188 bits (479), Expect = 7e-59
 Identities = 74/216 (34%), Positives = 110/216 (50%), Gaps = 14/216 (6%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P S+D+R+ K  V  PV++QG+CGSCWAF++   LE Q+      L  LS   LV+C   
Sbjct: 2   PDSVDYRK-KGYV-TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE 59

Query: 187 NLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
           N  C GG +  AF+YV++  G++S+  YPY  +E     C Y         +    +  G
Sbjct: 60  NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEG 116

Query: 245 -VDHMMHLL-QSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
               +   + + GP+ V ++      + Y        D +CN   L+HAV  VGYG + G
Sbjct: 117 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQKG 174

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
              WI++NSWG+   + GY  + R   NACGI + A
Sbjct: 175 NKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 210


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  187 bits (477), Expect = 2e-58
 Identities = 83/232 (35%), Positives = 122/232 (52%), Gaps = 43/232 (18%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
           P S+DWR+    V +PV++QG CGSCW F+TT  LES VA+    +  L++ QLV+C   
Sbjct: 2   PPSMDWRKKGNFV-SPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQN 60

Query: 186 -GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
             N  C GG    AFEY++   G+  +  YPY+ +++    C ++ +KA  FV+D   +T
Sbjct: 61  FNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDD---HCKFQPDKAIAFVKDVANIT 117

Query: 243 SG-VDHMMHLLQS-GPI---------------GVYLNHRLIESYDGNPIRRNDWACN--P 283
               + M+  +    P+               G+Y                +  +C+  P
Sbjct: 118 MNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIY----------------SSTSCHKTP 161

Query: 284 HKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
            K++HAV  VGYGE+NGI  WIV+NSWG     +GYF IERG N CG+ + A
Sbjct: 162 DKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACA 213


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  186 bits (475), Expect = 3e-58
 Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 40/230 (17%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P+S+DWR      + PV++QG CGSCWAF+T A +E    ++   L  LS+ +LV+CD  
Sbjct: 2   PQSIDWRAKGA--VTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH 59

Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVTS 243
           +  C GG    + +YV   G+ +   YPY+ K+    +C    +     V+ T    V S
Sbjct: 60  SYGCKGGYQTTSLQYVANNGVHTSKVYPYQAKQY---KCRATDKPGPK-VKITGYKRVPS 115

Query: 244 GV-DHMMHLLQSGPIGVYLNHRLIES------------YDGNPIRRNDWACNPHKLDHAV 290
                 +  L + P+ V      +E+            +DG         C   KLDHAV
Sbjct: 116 NCETSFLGALANQPLSVL-----VEAGGKPFQLYKSGVFDGP--------CGT-KLDHAV 161

Query: 291 AIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
             VGYG  +G    I++NSWG    + GY +++R +      CG+   +Y
Sbjct: 162 TAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSY 211


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  185 bits (472), Expect = 8e-58
 Identities = 73/225 (32%), Positives = 110/225 (48%), Gaps = 28/225 (12%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP S+DWR+    V  PV++QG CGSCWAF+T A +E    ++   L  LS+ QLV+C  
Sbjct: 3   LPDSIDWRE-NGAV-VPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT 60

Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTS 243
            N  C GG ++ AF+++    G+ S+  YPYR ++     C        V +     V S
Sbjct: 61  ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDG---ICNSTVNAPVVSIDSYENVPS 117

Query: 244 GV-DHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAIVGY 295
                +   + + P+ V ++      +L  S  + G+        CN    +HA+ +VGY
Sbjct: 118 HNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGS--------CNI-SANHALTVVGY 168

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
           G +N    WIV+NSWG    + GY + ER        CGI  +A 
Sbjct: 169 GTENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFAS 213


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  188 bits (480), Expect = 9e-58
 Identities = 94/320 (29%), Positives = 139/320 (43%), Gaps = 43/320 (13%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
             +  +    NR Y   NE   R   ++++ K  + +   Y     S         D + 
Sbjct: 10  AQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68

Query: 89  QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           +E  Q   G +            + R  K   E      P+S+DWR+ K  V  PV++QG
Sbjct: 69  EEFRQVMNGFQ----------NRKPRKGKVFQEPLFYEAPRSVDWRE-KGYV-TPVKNQG 116

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQY 205
           +CGSCWAF+ T  LE Q+      L  LS+  LV+C    GN  CNGG +D AF+YV+  
Sbjct: 117 QCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDN 176

Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH-LLQSGPIGVYLN 262
            GL+S+  YPY   E     C Y  + +         +      +M  +   GPI V ++
Sbjct: 177 GGLDSEESYPYEATEE---SCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAID 233

Query: 263 --HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPD 316
             H     Y        +  C+   +DH V +VGYG    E +    W+V+NSWG+    
Sbjct: 234 AGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGM 291

Query: 317 HGYFQIERGA-NACGIESYA 335
            GY ++ +   N CGI S A
Sbjct: 292 GGYVKMAKDRRNHCGIASAA 311


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  187 bits (476), Expect = 4e-57
 Identities = 82/318 (25%), Positives = 140/318 (44%), Gaps = 40/318 (12%)

Query: 41  DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
             +  +   + + Y + NE   R   ++++ K    +   +     S         D + 
Sbjct: 10  HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69

Query: 89  QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
           +E++   + LR+  + +  +                  LP S+DWR+     +  V+ QG
Sbjct: 70  EEVMSLMSSLRVPSQWQRNITYKSNPN---------RILPDSVDWREK--GCVTEVKYQG 118

Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---GNLNCNGGNIDVAFEYVKQ 204
            CG+ WAF+    LE+Q+ L    L  LS   LV+C     GN  CNGG +  AF+Y+  
Sbjct: 119 SCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIID 178

Query: 205 -YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVY 260
             G++S A YPY+  +    +C Y+ +         T +  G  D +   + + GP+ V 
Sbjct: 179 NKGIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVG 235

Query: 261 LN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
           ++  H     Y        + +C    ++H V +VGYG+ NG   W+V+NSWG    + G
Sbjct: 236 VDARHPSFFLYRSGVY--YEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEG 292

Query: 319 YFQIERG-ANACGIESYA 335
           Y ++ R   N CGI S+ 
Sbjct: 293 YIRMARNKGNHCGIASFP 310


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  182 bits (465), Expect = 8e-57
 Identities = 66/225 (29%), Positives = 105/225 (46%), Gaps = 32/225 (14%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWRQ K  V  PV +QG CGSCW F++ A +E    ++   L  LS+ +L++C+ 
Sbjct: 1   IPTSIDWRQ-KGAV-TPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCER 58

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
            +  C GG    A +YV   G+  +  YPY   +    +C   + K    V+      V 
Sbjct: 59  RSYGCRGGFPLYALQYVANSGIHLRQYYPYEGVQR---QCRASQAKGPK-VKTDGVGRVP 114

Query: 243 SGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
                 ++  +   P+ + +    R  ++Y G     P       C    +DHAVA VGY
Sbjct: 115 RNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGP-------CGT-SIDHAVAAVGY 166

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
           G        +++NSWG    + GY +I+RG+      CG+ S + 
Sbjct: 167 GNDY----ILIKNSWGTGWGEGGYIRIKRGSGNPQGACGVLSDSV 207


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  182 bits (465), Expect = 9e-57
 Identities = 65/220 (29%), Positives = 112/220 (50%), Gaps = 24/220 (10%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P+S+DWR+ K  V  PV++Q  CGSCWAF+T A +E    ++   L  LS+ +L++C+  
Sbjct: 2   PESIDWRE-KGAV-TPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR 59

Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVTS 243
           +  C+GG    + +YV   G+ ++ +YPY  K+    RC  + +K    V  T   +V +
Sbjct: 60  SHGCDGGYQTTSLQYVVDNGVHTEREYPYEKKQG---RCRAKDKKGPK-VYITGYKYVPA 115

Query: 244 GV-DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
                ++  + + P+ V  +   R  + Y G      +  C     DHAV  VGYG+   
Sbjct: 116 NDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIY---EGPCG-TNTDHAVTAVGYGKT-- 169

Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYAY 336
               +++NSWG    + GY +I+R +      CG+ + ++
Sbjct: 170 --YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSF 207


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  181 bits (463), Expect = 1e-56
 Identities = 66/225 (29%), Positives = 107/225 (47%), Gaps = 32/225 (14%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P+ +DWRQ K  V  PV++QG CGSCWAF+    +E  + +    L   S+ +L++CD 
Sbjct: 1   IPEYVDWRQ-KGAV-TPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDR 58

Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
            +  CNGG    A + V QYG+  +  YPY   +     C   ++      +      V 
Sbjct: 59  RSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQR---YCRSREKGPYA-AKTDGVRQVQ 114

Query: 243 SG-VDHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
                 +++ + + P+ V L    +  + Y G     P       C   K+DHAVA VGY
Sbjct: 115 PYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGP-------CGN-KVDHAVAAVGY 166

Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
           G        +++NSWG    ++GY +I+RG       CG+ + ++
Sbjct: 167 GPN----YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSF 207


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  181 bits (462), Expect = 3e-56
 Identities = 71/225 (31%), Positives = 105/225 (46%), Gaps = 31/225 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR  K  V N +++Q +CGSCWAF+  A +ES   +    L  LS+ +LV+CD 
Sbjct: 1   LPSFVDWRS-KGAV-NSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT 58

Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WV 241
            +  CNGG ++ AF+Y+    G+++Q +YPY   +     C   + +    V       V
Sbjct: 59  ASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQG---SCKPYRLRV---VSINGFQRV 112

Query: 242 TSGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVG 294
           T      +   + S P+ V +       + Y       P       C     +H V IVG
Sbjct: 113 TRNNESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGP-------CGT-AQNHGVVIVG 164

Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
           YG ++G   WIVRNSWG    + GY  +ER   +    CGI    
Sbjct: 165 YGTQSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLP 209


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  182 bits (463), Expect = 3e-56
 Identities = 77/254 (30%), Positives = 114/254 (44%), Gaps = 32/254 (12%)

Query: 108 EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVAL 167
           EA+ E V K            + DWR     V  PV+ Q  CGSCWAF++   +ESQ A+
Sbjct: 2   EANYEDVIKKYKPADAKLDRIAYDWRL-HGGV-TPVKDQALCGSCWAFSSVGSVESQYAI 59

Query: 168 LKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
            KK L+  S+ +LV+C   N  C GG I  AF+ +    GL SQ DYPY +    T  C 
Sbjct: 60  RKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET--CN 117

Query: 227 YEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLN-HRLIESYDG---NPIRRNDWAC 281
            ++   +  +  ++V+   D     L+  GPI + +        Y G   +        C
Sbjct: 118 LKRCNERYTI-KSYVSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDG------EC 170

Query: 282 NPHKLDHAVAIVGYGEKNGILT----------WIVRNSWGDIGPDHGYFQIERGANA--- 328
                +HAV +VGYG K+              +I++NSWG    + GY  +E   N    
Sbjct: 171 GA-APNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKK 229

Query: 329 -CGIESYAYLASVK 341
            C I + AY+  ++
Sbjct: 230 TCSIGTEAYVPLLE 243


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  180 bits (460), Expect = 5e-56
 Identities = 78/221 (35%), Positives = 112/221 (50%), Gaps = 19/221 (8%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
           P+S+DWR+ K  V  PV++QG+CGSCWAF+ T  LE Q+      L  LS+  LV+C   
Sbjct: 2   PRSVDWRE-KGYV-TPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 59

Query: 186 -GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVT 242
            GN  CNGG +D AF+YV+   GL+S+  YPY   E     C Y  + +         + 
Sbjct: 60  QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE---SCKYNPKYSVANDTGFVDIP 116

Query: 243 SGVDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG--- 296
                +M  + + GPI V ++  H     Y        +  C+   +DH V +VGYG   
Sbjct: 117 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFES 174

Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
            E +    W+V+NSWG+     GY ++ +   N CGI S A
Sbjct: 175 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAA 215


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  180 bits (460), Expect = 1e-55
 Identities = 70/250 (28%), Positives = 116/250 (46%), Gaps = 34/250 (13%)

Query: 112 ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
           E +KK+  E  +     + DWR     V  PV+ Q  CGSCWAF++   +ESQ A+ K  
Sbjct: 6   EVIKKYRGE--ENFDHAAYDWRL-HSGV-TPVKDQKNCGSCWAFSSIGSVESQYAIRKNK 61

Query: 172 LYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKE 230
           L  LS+ +LV+C   N  CNGG I+ AFE + +  G+    DYPY +       C  ++ 
Sbjct: 62  LITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDAPNL--CNIDRC 119

Query: 231 KAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLN-HRLIESYDG---NPIRRNDWACNPHK 285
             K  +   +++   + +   L+  GPI + +        Y     +        C   +
Sbjct: 120 TEKYGI-KNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDG------ECGD-Q 171

Query: 286 LDHAVAIVGYGEKNGILT----------WIVRNSWGDIGPDHGYFQIERGANA----CGI 331
           L+HAV +VG+G K  +            +I++NSWG    + G+  IE   +     CG+
Sbjct: 172 LNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGL 231

Query: 332 ESYAYLASVK 341
            + A++  ++
Sbjct: 232 GTDAFIPLIE 241


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  185 bits (472), Expect = 2e-55
 Identities = 74/315 (23%), Positives = 130/315 (41%), Gaps = 36/315 (11%)

Query: 48  VKWNRTYTDDNEIKTRFEYFKQDGKETDE-----YYGTSGS----SDRSPQEILQRTGLR 98
           V  N  +  +++ K     +K D              T+ +       +  ++++R    
Sbjct: 125 VYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRR---- 180

Query: 99  LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR-QSKVKVLNPVESQGRCGSCWAFAT 157
             G  ++        +   + + K   LP S DWR    +  ++PV +Q  CGSC++FA+
Sbjct: 181 SGGHSRKIPRPKPAPLTAEIQQ-KILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFAS 239

Query: 158 TAILESQVALL--KKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYP 214
             +LE+++ +L        LS  ++V C      C GG   +      Q +GL  +A +P
Sbjct: 240 MGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFP 299

Query: 215 YRNKENITFRCTYEKEKAKVFVQD------TWVTSGVDHMMH-LLQSGPIGVYLN-HRLI 266
           Y   ++    C  +++  + +  +       +       M   L+  GP+ V    +   
Sbjct: 300 YTGTDS---PCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDF 356

Query: 267 ESYDG---NPIRRNDWACNPHKLDHAVAIVGYGEKNGILT--WIVRNSWG-DIGPDHGYF 320
             Y     +     D        +HAV +VGYG  +      WIV+NSWG   G ++GYF
Sbjct: 357 LHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWG-ENGYF 415

Query: 321 QIERGANACGIESYA 335
           +I RG + C IES A
Sbjct: 416 RIRRGTDECAIESIA 430


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  179 bits (457), Expect = 2e-55
 Identities = 78/229 (34%), Positives = 114/229 (49%), Gaps = 33/229 (14%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           +P S+DWR+ K  V   V+ QG+CGSCWAF+T   +E    +    L  LS+ +LV+CD 
Sbjct: 2   VPASVDWRK-KGAV-TSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT 59

Query: 186 -GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---W 240
             N  CNGG +D AFE++KQ  G+ ++A+YPY   +     C   KE A   V       
Sbjct: 60  DQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDG---TCDVSKENAPA-VSIDGHEN 115

Query: 241 VTSGV-DHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAI 292
           V     + ++  + + P+ V ++      +      + G+        C   +LDH VAI
Sbjct: 116 VPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGS--------CGT-ELDHGVAI 166

Query: 293 VGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYAY 336
           VGYG   +G   W V+NSWG    + GY ++ERG       CGI   A 
Sbjct: 167 VGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEAS 215


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  178 bits (454), Expect = 4e-55
 Identities = 69/227 (30%), Positives = 104/227 (45%), Gaps = 30/227 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR S    +  ++ QG+CGS WAF+T A +E    +    L  LS+ +LV+C  
Sbjct: 1   LPDYVDWRSSGA--VVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGR 58

Query: 186 --GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--- 239
                 C+GG +   F+++    G+ ++A+YPY  +E    +C  + ++ K  V      
Sbjct: 59  TQNTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEG---QCNLDLQQEKY-VSIDTYE 114

Query: 240 WVTSG-VDHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAI 292
            V       +   +   P+ V L       + Y       P       C    +DHAV I
Sbjct: 115 NVPYNNEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGP-------CGT-AVDHAVTI 166

Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG---ANACGIESYAY 336
           VGYG + GI  WIV+NSWG    + GY +I+R       CGI   A 
Sbjct: 167 VGYGTEGGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKAS 213


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  178 bits (453), Expect = 6e-55
 Identities = 73/220 (33%), Positives = 111/220 (50%), Gaps = 18/220 (8%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP S+DWR+     +  V+ QG CG+CWAF+    LE+Q+ L    L  LS   LV+C  
Sbjct: 2   LPDSVDWREKGC--VTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 59

Query: 186 ---GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TW 240
              GN  CNGG +  AF+Y+    G++S A YPY+  +    +C Y+ +         T 
Sbjct: 60  EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTE 116

Query: 241 VTSG-VDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
           +  G  D +   + + GP+ V ++  H     Y        + +C    ++H V +VGYG
Sbjct: 117 LPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVY--YEPSCTQ-NVNHGVLVVGYG 173

Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
           + NG   W+V+NSWG    + GY ++ R   N CGI S+ 
Sbjct: 174 DLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFP 213


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  174 bits (443), Expect = 1e-53
 Identities = 69/219 (31%), Positives = 103/219 (47%), Gaps = 24/219 (10%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP+ +DWR+ K  V  PV++QG CGSCWAF+T + +ES   +    L  LS+ +LV+CD 
Sbjct: 1   LPEQIDWRK-KGAV-TPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK 58

Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKV--FVQDTWVT 242
            N  C GG    A++Y+    G+++QA+YPY+  +     C    +   +  +     V 
Sbjct: 59  KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQG---PCQAASKVVSIDGYNG---VP 112

Query: 243 SGVDH-MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
              +  +   +   P  V ++      + Y               KL+H V IVGY    
Sbjct: 113 FCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFS----GPCGTKLNHGVTIVGYQAN- 167

Query: 300 GILTWIVRNSWGDIGPDHGYFQIER--GANACGIESYAY 336
               WIVRNSWG    + GY ++ R  G   CGI    Y
Sbjct: 168 ---YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPY 203


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  175 bits (447), Expect = 1e-53
 Identities = 77/229 (33%), Positives = 112/229 (48%), Gaps = 30/229 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
           LP S+DWRQ K  V   V+ QG+CGSCWAF+T   +E   A+   +L  LS+ +L++CD 
Sbjct: 4   LPPSVDWRQ-KGAV-TGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61

Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKE--KAKVFVQDT-- 239
             N  C GG +D AFEY+K   GL ++A YPYR        C   +    + V V     
Sbjct: 62  ADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARG---TCNVARAAQNSPVVVHIDGH 118

Query: 240 -WVTSGV-DHMMHLLQSGPIGVYLN--HRLIESYDG---NPIRRNDWACNPHKLDHAVAI 292
             V +   + +   + + P+ V +    +    Y              C   +LDH VA+
Sbjct: 119 QDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTG------ECGT-ELDHGVAV 171

Query: 293 VGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
           VGYG  ++G   W V+NSWG    + GY ++E+ + A    CGI   A 
Sbjct: 172 VGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEAS 220


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  174 bits (443), Expect = 2e-53
 Identities = 66/227 (29%), Positives = 100/227 (44%), Gaps = 31/227 (13%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
           LP  +DWR  +  V  PV+ Q  CGSCWAF+TT  LE         L  LS+ +L++C  
Sbjct: 7   LPAGVDWRS-RGCV-TPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR 64

Query: 186 --GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWV 241
             GN +C+GG ++ AF+YV    G+ S+  YPY  ++     C  +  +  V +     V
Sbjct: 65  AEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE---ECRAQSCEKVVKILGFKDV 121

Query: 242 TSG-VDHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAIV 293
                  M   L   P+ + +       +      +D +        C    LDH V +V
Sbjct: 122 PRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDAS--------CGT-DLDHGVLLV 172

Query: 294 GYG--EKNGILTWIVRNSWGDIGPDHGYFQIERG---ANACGIESYA 335
           GYG  +++    WI++NSWG      GY  +         CG+   A
Sbjct: 173 GYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDA 219


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  170 bits (433), Expect = 9e-51
 Identities = 69/297 (23%), Positives = 102/297 (34%), Gaps = 67/297 (22%)

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ--SKVKVLN 141
            + + +E  +  G+         L       ++F  E  + PLP S D  +       + 
Sbjct: 35  QNITLREAKRLNGVIKKNNNASIL-----PKRRFTEEEARAPLPSSFDSAEAWPNCPTIP 89

Query: 142 PVESQGRCGSCWAFATTAILESQVALLK-KTLYPLSKSQLVECDHGNLN-CNGGNIDVAF 199
            +  Q  CGSCWA A  + +  +   +       +S   L+ C     + CNGG+ D A+
Sbjct: 90  QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 149

Query: 200 EYVKQYGLESQADYPYR------------------NKENITFRCTYEKEK-----AKVFV 236
            Y    GL S    PY                        T +C Y  +           
Sbjct: 150 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRS 209

Query: 237 QDTWVTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYDGNPIRRNDWA 280
             ++   G D  M  L   GP                GVY +H                 
Sbjct: 210 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVY-HHV---------------- 252

Query: 281 CNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANACGIESYAY 336
              +   HAV +VG+G  NG+  W + NSW  + G   GYF I RG++ CGIE    
Sbjct: 253 SGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWG-MDGYFLIRRGSSECGIEDGGS 308


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  161 bits (410), Expect = 1e-48
 Identities = 70/219 (31%), Positives = 98/219 (44%), Gaps = 14/219 (6%)

Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
           P S+DWR+ K  V   V+ QG CG CWAF  T  +E   A+    L  +S+ Q+V+CD  
Sbjct: 2   PASIDWRK-KGAV-TSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTX 59

Query: 187 NLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
                GG+ D AF +V    G+ S A+YPY   +     C   K  A      T V +  
Sbjct: 60  XXXXXGGDADDAFRWVITNGGIASDANYPYTGVDG---TCDLNKPIAARIDGYTNVPNSS 116

Query: 246 DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDW-ACNPHKLDHAVAIVGYG-EKNGI 301
             ++  +   P+ V +       + Y G  I      + +P  +DH V IVGYG      
Sbjct: 117 SALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNA 176

Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
             WIV+NSWG      GY  I R  N     C I+++  
Sbjct: 177 DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAIDAWGS 215


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  162 bits (413), Expect = 2e-48
 Identities = 62/255 (24%), Positives = 99/255 (38%), Gaps = 63/255 (24%)

Query: 126 LPKSLDWR-QSKVKVLNPVESQ---GRCGSCWAFATTAILESQVALLKKTLYP---LSKS 178
           LPKS DWR    V   +   +Q     CGSCWA A+T+ +  ++ + +K  +P   LS  
Sbjct: 36  LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 95

Query: 179 QLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPY------------RNKENITFRCT 226
            +++C +   +C GGN    ++Y  Q+G+  +    Y                N    C 
Sbjct: 96  NVIDCGNAG-SCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH 154

Query: 227 YEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYD 270
             +      V D    SG + MM  +  +GPI               G+Y          
Sbjct: 155 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIY---------- 204

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANAC 329
                  ++      ++H V++ G+G  +G   WIVRNSWG   G + G+ +I       
Sbjct: 205 ------AEYQDTT-YINHVVSVAGWGISDGTEYWIVRNSWGEPWG-ERGWLRIVTSTYKD 256

Query: 330 G--------IESYAY 336
           G        IE +  
Sbjct: 257 GKGARYNLAIEEHCT 271


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  162 bits (412), Expect = 9e-48
 Identities = 68/307 (22%), Positives = 111/307 (36%), Gaps = 82/307 (26%)

Query: 84  SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ--SKVKVLN 141
            +     + +  G  L G +  +     E +K          LP S D R+   +   + 
Sbjct: 32  YNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLK----------LPASFDAREQWPQCPTIK 81

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLY--PLSKSQLVEC--DHGNLNCNGGNIDV 197
            +  QG CGSCWAF     +  ++ +         +S   L+ C        CNGG    
Sbjct: 82  EIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAE 141

Query: 198 AFEYVKQYGLESQADYPYR----------------------NKENITFRC--------TY 227
           A+ +  + GL S   Y                           E  T +C        + 
Sbjct: 142 AWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSP 201

Query: 228 EKEKAKVFVQDTW-VTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYD 270
             ++ K +  +++ V++    +M  + ++GP+               GVY  H       
Sbjct: 202 TYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY-QHV------ 254

Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANAC 329
                            HA+ I+G+G +NG   W+V NSW  D G D+G+F+I RG + C
Sbjct: 255 ----------TGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHC 303

Query: 330 GIESYAY 336
           GIES   
Sbjct: 304 GIESEVV 310


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  159 bits (405), Expect = 2e-47
 Identities = 62/265 (23%), Positives = 99/265 (37%), Gaps = 72/265 (27%)

Query: 126 LPKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQVALLK--KTLYPLSKSQLV 181
           +P S D R+   + K +  +  Q RCGSCWAF     +  +  +    K    LS   L+
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62

Query: 182 ECDH-GNLNCNGGNIDVAFEYVKQYGLESQADYPYR-----------------------N 217
            C     L C GG +  A++Y  + G+ + +                            +
Sbjct: 63  SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122

Query: 218 KENITFRC--TYEKEKAKVFVQDT-------WVTSGVDHMMH-LLQSGPI---------- 257
           K   T RC  T +K+    + QD         V +    +   +++ GP+          
Sbjct: 123 KIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDF 182

Query: 258 -----GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG- 311
                G+Y  H                        HA+ I+G+G +N    W++ NSW  
Sbjct: 183 LNYKSGIY-KHI----------------TGETLGGHAIRIIGWGVENKAPYWLIANSWNE 225

Query: 312 DIGPDHGYFQIERGANACGIESYAY 336
           D G ++GYF+I RG + C IES   
Sbjct: 226 DWG-ENGYFRIVRGRDECSIESEVT 249


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  158 bits (403), Expect = 6e-47
 Identities = 64/265 (24%), Positives = 100/265 (37%), Gaps = 72/265 (27%)

Query: 126 LPKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLY--PLSKSQLV 181
           LP S D R+   +   +  +  QG CGS WAF     +  ++ +         +S   L+
Sbjct: 7   LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66

Query: 182 EC--DHGNLNCNGGNIDVAFEYVKQYGLESQADY-------PYR---------------N 217
            C        CNGG    A+ +  + GL S   Y       PY                 
Sbjct: 67  TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126

Query: 218 KENITFRC--------TYEKEKAKVFVQDT-WVTSGVDHMMH-LLQSGPI---------- 257
            E  T +C        +   ++ K +  ++  V++    +M  + ++GP+          
Sbjct: 127 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 186

Query: 258 -----GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG- 311
                GVY  H                        HA+ I+G+G +NG   W+V NSW  
Sbjct: 187 LLYKSGVY-QHV----------------TGEMMGGHAIRILGWGVENGTPYWLVANSWNT 229

Query: 312 DIGPDHGYFQIERGANACGIESYAY 336
           D G D+G+F+I RG + CGIES   
Sbjct: 230 DWG-DNGFFKILRGQDHCGIESEVV 253


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  157 bits (398), Expect = 4e-46
 Identities = 50/249 (20%), Positives = 85/249 (34%), Gaps = 44/249 (17%)

Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
             +  D     +     VE QG C + W FA+   LE+   +       +S   +  C  
Sbjct: 10  CNRLKDENN-CISN-LQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYK 67

Query: 186 --GNLNCNGGNIDVAF-EYVKQYG-LESQADYPYRNKENITF---------------RCT 226
                 C+ G+  + F + ++ YG L ++++YPY   +                   +  
Sbjct: 68  GEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKIL 127

Query: 227 YEKEKAKVFVQDTWVTSGVDHMMHLLQS------------GPIGVYLN--HRLIESYDGN 272
           + K +        +     +     + +            G +  Y+   + +   + G 
Sbjct: 128 HNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGK 187

Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDIGPDHGYFQIER-GA 326
            +      C     DHAV IVGYG            WIVRNSWG    D GYF+++  G 
Sbjct: 188 KV---KNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGP 244

Query: 327 NACGIESYA 335
             C      
Sbjct: 245 THCHFNFIH 253


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  145 bits (366), Expect = 3e-41
 Identities = 51/274 (18%), Positives = 100/274 (36%), Gaps = 37/274 (13%)

Query: 89  QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
           Q +L+R      G   +  +  R+       +     LP  +D           V  QGR
Sbjct: 22  QTVLKRRKKSGYGYIPDIADI-RDFSYTP-EKSVIAALPPKVDLTPP-----FQVYDQGR 74

Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-----DHGNLNCNGG-NIDVAFEYV 202
            GSC A A  A ++ +    K++      S+L          G++N + G  I    + +
Sbjct: 75  IGSCTANALAAAIQFERIHDKQSP-EFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVL 133

Query: 203 KQYGLESQADYPYR--------NKENITFRCTYEK-----EKAKVFVQDTW--VTSGVDH 247
            + G+  + ++PY          +       + +      + A+ +    +  V   +DH
Sbjct: 134 HKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEYSRVAQDIDH 193

Query: 248 MMHLL-QSGPIGVYLN-HRLIESYDGNPIRRNDW-ACNPHKLDHAVAIVGYGEKNGILTW 304
           +   L    P     + +      +  P+R       +  +  HAV  VGY ++     +
Sbjct: 194 LKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYDDEIRH--F 251

Query: 305 IVRNSWG-DIGPDHGYFQIERGANA-CGIESYAY 336
            +RNSWG ++G + GYF +     +   +    +
Sbjct: 252 RIRNSWGNNVG-EDGYFWMPYEYISNTQLADDFW 284


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 61.8 bits (149), Expect = 8e-11
 Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 12/86 (13%)

Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGN------------LN 189
            V++Q R G+CW +++ + LES++  + K  Y LS+   V   + +              
Sbjct: 24  SVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDVSF 83

Query: 190 CNGGNIDVAFEYVKQYGLESQADYPY 215
             GG+   A   ++ +GL  + +   
Sbjct: 84  SQGGSFYDALYGMETFGLVPEEEMRP 109



 Score = 42.1 bits (98), Expect = 2e-04
 Identities = 16/86 (18%), Positives = 29/86 (33%), Gaps = 1/86 (1%)

Query: 236 VQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
            +     SG D    L               + +     R+  +       DH + I G 
Sbjct: 266 DEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIYGI 325

Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYF 320
             ++ G   ++V+NSWG     +G +
Sbjct: 326 AKDQEGNEYYMVKNSWGTNSKYNGIW 351


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 49.5 bits (117), Expect = 1e-06
 Identities = 54/422 (12%), Positives = 114/422 (27%), Gaps = 153/422 (36%)

Query: 6   CDHQETNTEQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIK---- 61
              QE   ++    V   +  ++   +  +  +     + YI + +R Y D+        
Sbjct: 72  LSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNV 131

Query: 62  TRFEYFKQ---------------------DGKET-------DE----------YYGTSGS 83
           +R + + +                      GK                     ++    +
Sbjct: 132 SRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKN 191

Query: 84  --SDRSPQEILQRTGLRLTGK----------EKERLEADRERVKKFLNERKKGPLPKSL- 130
             S  +  E+LQ+   ++              K R+ + +  +++ L  +   P    L 
Sbjct: 192 CNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSK---PYENCLL 248

Query: 131 --D--WRQSKVKVLNPVESQGRCGSCWAFATT----------AILESQVAL--LKKTLYP 174
                         N         SC    TT          A   + ++L     TL P
Sbjct: 249 VLLNVQNAKAWNAFN--------LSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTP 300

Query: 175 -LSKSQL---VECDHGNL---NCNGGNIDVA------------FEYVKQYGLESQA---D 212
              KS L   ++C   +L           ++            ++  K    +      +
Sbjct: 301 DEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIE 360

Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV-------------DHMMH------LLQ 253
                 E   +R  +++    VF     + + +               +++      L++
Sbjct: 361 SSLNVLEPAEYRKMFDR--LSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSLVE 418

Query: 254 SGP------I-GVYLN-----------HR-LIESYDGNPIRRND-WACNPHKLDHAVAIV 293
             P      I  +YL            HR +++ Y  N  +  D     P  LD      
Sbjct: 419 KQPKESTISIPSIYLELKVKLENEYALHRSIVDHY--NIPKTFDSDDLIPPYLD------ 470

Query: 294 GY 295
            Y
Sbjct: 471 QY 472


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 41.6 bits (97), Expect = 4e-04
 Identities = 51/287 (17%), Positives = 92/287 (32%), Gaps = 100/287 (34%)

Query: 69  QDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKK----- 123
           +D  E +E  G       SP        L ++   +E+++    +    L   K+     
Sbjct: 325 EDSLENNE--GVP-----SPM-------LSISNLTQEQVQDYVNKTNSHLPAGKQVEISL 370

Query: 124 ----------GPLPKSL---DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
                     GP P+SL   +    K K                 A + + +S++     
Sbjct: 371 VNGAKNLVVSGP-PQSLYGLNLTLRKAK-----------------APSGLDQSRI----- 407

Query: 171 TLYPLSKSQLVECDHGNLNCNGGNIDVAF--EYVKQYGLESQADYPYRNKENITFRCTYE 228
              P S+ +L    +  L      +   F    +         D     K N++F     
Sbjct: 408 ---PFSERKLK-FSNRFL-----PVASPFHSHLLVPASDLINKDLV---KNNVSFN---- 451

Query: 229 KEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW-ACNPHKLD 287
            +  ++ V DT+   G D     L+   +   ++ R+++     P+    W      K  
Sbjct: 452 AKDIQIPVYDTF--DGSD-----LRV--LSGSISERIVDCIIRLPV---KWETTTQFKAT 499

Query: 288 HAVAIVGYGEKNGILTWIVRNSWG-----------DIGP--DHGYFQ 321
           H +   G G  +G+     RN  G           DI P  D+G+ Q
Sbjct: 500 HILDF-GPGGASGLGVLTHRNKDGTGVRVIVAGTLDINPDDDYGFKQ 545



 Score = 32.3 bits (73), Expect = 0.30
 Identities = 56/342 (16%), Positives = 97/342 (28%), Gaps = 115/342 (33%)

Query: 15  QVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKT------RFEYFK 68
           +    V T S            ++  + F   + +    +  D+E  T      +F  + 
Sbjct: 17  EHVLLVPTASFF------IASQLQ--EQFNKILPEPTEGFAADDEPTTPAELVGKFLGYV 68

Query: 69  QDGKETDEYYGTSGSSDRSPQEILQRTGLR------LTGKEKERLEADRERVKKFLNERK 122
               E  +     G  D    ++L    L       L G +   L A        L +  
Sbjct: 69  SSLVEPSK----VGQFD----QVL-NLCLTEFENCYLEGNDIHALAAK-------LLQEN 112

Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
              L K+ +  ++ +                             + K+     S S L  
Sbjct: 113 DTTLVKTKELIKNYITAR-------------------------IMAKRPFDKKSNSALFR 147

Query: 183 -CDHGNLNC----NG-GNIDVAF-EYVKQYGLESQADYPYRNKENITF-------RCTYE 228
               GN        G GN D  F E    Y       Y     + I F            
Sbjct: 148 AVGEGNAQLVAIFGGQGNTDDYFEELRDLY-----QTYHVLVGDLIKFSAETLSELIRTT 202

Query: 229 KEKAKVFVQ--D--TWVTS-----GVDHMMHLLQSGP-IGVY-LNHRLIESYDGNPIRRN 277
            +  KVF Q  +   W+ +       D+++ +  S P IGV  L H ++           
Sbjct: 203 LDAEKVFTQGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYVV----------- 251

Query: 278 DWAC-----NPHKL-DHAVAIVGYGEKNGILTWIV---RNSW 310
                     P +L  +     G+ +  G++T +     +SW
Sbjct: 252 --TAKLLGFTPGELRSYLKGATGHSQ--GLVTAVAIAETDSW 289



 Score = 31.6 bits (71), Expect = 0.50
 Identities = 56/294 (19%), Positives = 86/294 (29%), Gaps = 109/294 (37%)

Query: 18   YNVNTDSAIYVWRDLAYDSIKQVDAFK-TYIVKWNRT------------YTDDNEIKTRF 64
            Y   + +A  VW + A +  K    F    IV  N                 +N     F
Sbjct: 1636 YK-TSKAAQDVW-NRADNHFKDTYGFSILDIVINNPVNLTIHFGGEKGKRIRENYSAMIF 1693

Query: 65   EYFKQDG--------KETDEYYGTSGSSDRSPQEILQRT-----GLRLTGKEKERLEADR 111
            E    DG        KE +E+  ++  + RS + +L  T      L L  K      A  
Sbjct: 1694 ETIV-DGKLKTEKIFKEINEH--STSYTFRSEKGLLSATQFTQPALTLMEK------AAF 1744

Query: 112  ERVKKFLNERKKGPLPK-------SL----------------D------WRQSKVKVLNP 142
            E +K       KG +P        SL                       +R   ++V  P
Sbjct: 1745 EDLK------SKGLIPADATFAGHSLGEYAALASLADVMSIESLVEVVFYRGMTMQVAVP 1798

Query: 143  VESQGRCGSCWAFATTAILESQVAL------LKKTLYPLSKS--QLVECDHGNLNCNGGN 194
             +  GR      +   AI   +VA       L+  +  + K    LVE    N N     
Sbjct: 1799 RDELGRSN----YGMIAINPGRVAASFSQEALQYVVERVGKRTGWLVEI--VNYNVENQ- 1851

Query: 195  IDVAFEYVKQY-------GLESQADYPYRNKE-NITFR-----CTYEKEKAKVF 235
                     QY        L++  +     K   I         + E+ +  +F
Sbjct: 1852 ---------QYVAAGDLRALDTVTNVLNFIKLQKIDIIELQKSLSLEEVEGHLF 1896


>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
           vesicle, membrane, Ca structural protein; 7.94A {Bos
           taurus}
          Length = 190

 Score = 37.1 bits (85), Expect = 0.004
 Identities = 14/82 (17%), Positives = 24/82 (29%), Gaps = 24/82 (29%)

Query: 47  IVKWNRTYT------DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
           I KW           D        E+ ++  K+ +E+        +S Q           
Sbjct: 87  IRKWREEQRKRLQELDAASKVMEQEWREKAKKDLEEWN-----QRQSEQ----------- 130

Query: 101 GKEKERLEADRERVKKFLNERK 122
             EK +   +R   K F  +  
Sbjct: 131 -VEKNK-INNRIADKAFYQQPD 150


>2pff_A Fatty acid synthase subunit alpha, 3-oxoacyl-[acyl-carrier-PR;
           fatty acid synthase, acyl-carrier-protein, beta-ketoacyl
           RED beta-ketoacyl synthase, dehydratase; 4.00A
           {Saccharomyces cerevisiae}
          Length = 1688

 Score = 35.2 bits (81), Expect = 0.031
 Identities = 16/90 (17%), Positives = 27/90 (30%), Gaps = 31/90 (34%)

Query: 37  IKQVD--AFKTYIVKWNRTYT----DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQE 90
           I   +          W  + T    DD ++K ++E                         
Sbjct: 861 ISYHNGNLKGRPYTGWVDSKTKEPVDDKDVKAKYE-----------------------TS 897

Query: 91  ILQRTGLRLTGKEKERLEADRERVKKFLNE 120
           IL+ +G+RL   E E         K+ + E
Sbjct: 898 ILEHSGIRLI--EPELFNGYNPEKKEMIQE 925


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 33.2 bits (75), Expect = 0.11
 Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 3/37 (8%)

Query: 287 DHAVAIVGYG---EKNGILTWIVRNSWGDIGPDHGYF 320
             A+ I G          L + V NSWG      G +
Sbjct: 372 TAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408



 Score = 32.8 bits (74), Expect = 0.14
 Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 141 NPVESQGRCGSCWAFATTAILESQVA-LLKKTLYPLSKSQLVECD 184
            PV +Q   G CW FA T  L   V   L    + LS++ L   D
Sbjct: 66  TPVTNQKSSGRCWLFAATNQLRLNVLSELNLKEFELSQAYLFFYD 110


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 33.2 bits (75), Expect = 0.13
 Identities = 10/38 (26%), Positives = 13/38 (34%), Gaps = 4/38 (10%)

Query: 287 DHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYF 320
            HA+          +      W V NSWG+     GY 
Sbjct: 370 THAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407



 Score = 32.4 bits (73), Expect = 0.19
 Identities = 10/46 (21%), Positives = 17/46 (36%), Gaps = 1/46 (2%)

Query: 141 NPVESQGRCGSCWAFATTAILESQVA-LLKKTLYPLSKSQLVECDH 185
            P+ +Q   G  W F+   ++       L    +  S+S L   D 
Sbjct: 61  KPITNQKSSGRSWIFSCLNVMRLPFMKKLNIEEFEFSQSYLFFWDK 106


>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics
          of pathogenic protozoa, MSGPP, C protease, parasite,
          protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 106

 Score = 29.3 bits (66), Expect = 0.59
 Identities = 10/53 (18%), Positives = 27/53 (50%), Gaps = 5/53 (9%)

Query: 17 TYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ 69
          +++ +   +I+ W++  +      DAF ++   + ++Y  + E + R+  FK 
Sbjct: 4  SHHHHHHGSIWEWKEAHFQ-----DAFSSFQAMYAKSYATEEEKQRRYAIFKN 51


>3b21_A ORF169B, OSPI; bacterial protein, effector, type 3 secretion SYST
           unknown function; 2.01A {Shigella flexneri}
          Length = 220

 Score = 30.1 bits (67), Expect = 0.86
 Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 3/78 (3%)

Query: 87  SPQEILQRTGLRLTGKEKERLEADRERVKKFL---NERKKGPLPKSLDWRQSKVKVLNPV 143
           SP+ ++    L+ T   +   E     VKK L   N +  G + K  +   +  K++NP 
Sbjct: 5   SPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPS 64

Query: 144 ESQGRCGSCWAFATTAIL 161
              G C  C   A  A+L
Sbjct: 65  GGDGNCSGCALHACMAML 82


>1x9y_A Cysteine proteinase; half-barrel, barrel-sandwich-hybrid,
           hydrolase; 2.50A {Staphylococcus aureus} SCOP: d.3.1.1
           d.17.1.4
          Length = 367

 Score = 29.2 bits (64), Expect = 2.2
 Identities = 30/176 (17%), Positives = 49/176 (27%), Gaps = 39/176 (22%)

Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
           + +K     E Q     C  F+  A+L +         + + ++   E    +L      
Sbjct: 192 NTLKNFKIREQQFDNSWCAGFSMAALLNATKNTDTYNAHDIMRTLYPEVSEQDLPNCATF 251

Query: 195 IDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS 254
            +   EY K  G +        +   +                D      V  M+     
Sbjct: 252 PNQMIEYGKSQGRDIHYQEGVPSYNQV----------------DQLTKDNVGIMILA--- 292

Query: 255 GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
                       +S   NP        N   L HA+A+VG  + N     I  N W
Sbjct: 293 ------------QSVSQNP--------NDPHLGHALAVVGNAKINDQEKLIYWNPW 328


>1m65_A Hypothetical protein YCDX; structural genomics, beta-alpha-barrel,
           metallo-enzyme, STRU function project, S2F, unknown
           function; 1.57A {Escherichia coli} SCOP: c.6.3.1 PDB:
           1m68_A 1pb0_A
          Length = 245

 Score = 27.8 bits (62), Expect = 4.7
 Identities = 10/39 (25%), Positives = 13/39 (33%), Gaps = 6/39 (15%)

Query: 90  EILQRTGLRLTGKEKER-LEADRERVKKFLNERKKGPLP 127
           +IL            ER L     R+  FL  R   P+ 
Sbjct: 207 KILDAVDF-----PPERILNVSPRRLLNFLESRGMAPIA 240


>1qht_A Protein (DNA polymerase); archaea, hyperthermostable, family B
           polymer alpha family polymerase, transferase; 2.10A
           {Thermococcus SP} SCOP: c.55.3.5 e.8.1.1 PDB: 1tgo_A
           2xhb_A* 2vwj_A* 2vwk_A* 1wns_A* 1wn7_A 1qqc_A* 4ahc_A*
           4ail_C* 3a2f_A* 2jgu_A* 1d5a_A
          Length = 775

 Score = 28.2 bits (63), Expect = 5.6
 Identities = 10/36 (27%), Positives = 19/36 (52%)

Query: 105 ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
             L  +R+++K+ +         K LD+RQ  +K+L
Sbjct: 454 GDLLEERQKIKRKMKATVDPLEKKLLDYRQRAIKIL 489


>2edd_A Netrin receptor DCC; tumor suppressor protein DCC, colorectal
          cancer suppressor, structural genomics, NPPSFA; NMR
          {Homo sapiens}
          Length = 123

 Score = 26.4 bits (58), Expect = 6.7
 Identities = 10/49 (20%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 15 QVTYNVNTDSAIYV-WRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKT 62
           V     T  A+ V W D +    ++    + Y V+W  +++   + K+
Sbjct: 24 GVQAVALTHDAVRVSWADNSVPKNQKTSEVRLYTVRWRTSFSASAKYKS 72


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.316    0.133    0.410 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,199,069
Number of extensions: 301791
Number of successful extensions: 926
Number of sequences better than 10.0: 1
Number of HSP's gapped: 752
Number of HSP's successfully gapped: 63
Length of query: 341
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 247
Effective length of database: 4,077,219
Effective search space: 1007073093
Effective search space used: 1007073093
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (25.9 bits)