RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy1664
         (524 letters)



>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  342 bits (879), Expect = e-115
 Identities = 133/276 (48%), Positives = 183/276 (66%), Gaps = 15/276 (5%)

Query: 62  LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
           + +S L+   G       P  R+     +     +LP  FDAR  WP CPTI+EIRDQGS
Sbjct: 34  VDMSYLKRLCGTFLGGPKPPQRV-----MFTEDLKLPASFDAREQWPQCPTIKEIRDQGS 88

Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVT 180
           CGS WA GAVEA+SDR+CI +     V +S++DL++CC   CG+GC GG+  +AW +W  
Sbjct: 89  CGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 148

Query: 181 TGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
            G+VSGG Y S  GCRPY I PCE ++NGS   C      TP+C + C+PGY  +Y+ D 
Sbjct: 149 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGD-TPKCSKICEPGYSPTYKQDK 207

Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
           ++G  +YS+  +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G  +G HAIRI+
Sbjct: 208 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 267

Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           GWG E    GT     YWLVANS+NT+WG+NG F+I
Sbjct: 268 GWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 296



 Score =  217 bits (555), Expect = 7e-67
 Identities = 84/171 (49%), Positives = 122/171 (71%), Gaps = 9/171 (5%)

Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
           G      +GCRPY I PCE ++NGSR  C      TP+C + C+PGY  +Y+ D ++G  
Sbjct: 154 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGD-TPKCSKICEPGYSPTYKQDKHYGYN 212

Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +YS+  +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G  +G HAIRI+GWG E
Sbjct: 213 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE 272

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
               GT     YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+ 
Sbjct: 273 ---NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  335 bits (861), Expect = e-113
 Identities = 120/241 (49%), Positives = 152/241 (63%), Gaps = 8/241 (3%)

Query: 96  ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
           E+P  FD+R  WP C +I  IRDQ  CGS WA GAVEAMSDR CI S GK++V LS+ DL
Sbjct: 2   EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61

Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
           +SCC+ CG GC+GG  G AW YWV  GIV+G +  +  GC PY    CE +  G +  C 
Sbjct: 62  LSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121

Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
                TP C + CQ  Y   Y  D + G+ +Y++  +E+ I +EI ++GPVE   T+Y D
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED 181

Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
            + YK+GIYKH+ G  LG HAIRIIGWG E           YWL+ANS+N +WGENG FR
Sbjct: 182 FLNYKSGIYKHITGETLGGHAIRIIGWGVE---NKA----PYWLIANSWNEDWGENGYFR 234

Query: 335 I 335
           I
Sbjct: 235 I 235



 Score =  211 bits (540), Expect = 2e-65
 Identities = 75/169 (44%), Positives = 101/169 (59%), Gaps = 8/169 (4%)

Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
           G +     GC PY    CE +  G    C +    TP C + CQ  Y   Y  D + G+ 
Sbjct: 92  GSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKS 151

Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +Y++  +E+ I +EI ++GPVE   T+Y D + YK+GIYKH+ G  LG HAIRIIGWG E
Sbjct: 152 SYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVE 211

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
                      YWL+ANS+N +WGENG FRIVRG++EC IE+++TAG  
Sbjct: 212 ---NKA----PYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAGRI 253


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  333 bits (856), Expect = e-112
 Identities = 127/243 (52%), Positives = 173/243 (71%), Gaps = 10/243 (4%)

Query: 95  EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
            +LP  FDAR  WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI +     V +S++D
Sbjct: 5   LKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAED 64

Query: 155 LVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
           L++CC   CG+GC GG+  +AW +W   G+VSGG Y S  GCRPY I PCE ++NG+   
Sbjct: 65  LLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPP 124

Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
           C      TP+C + C+PGY  +Y+ D ++G  +YS+  +E+ IM EI+++GPVEG+ ++Y
Sbjct: 125 CTGEGD-TPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY 183

Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
           +D +LYK+G+Y+HV G  +G HAIRI+GWG E    GT     YWLVANS+NT+WG+NG 
Sbjct: 184 SDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGF 236

Query: 333 FRI 335
           F+I
Sbjct: 237 FKI 239



 Score =  216 bits (553), Expect = 4e-67
 Identities = 83/171 (48%), Positives = 122/171 (71%), Gaps = 9/171 (5%)

Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
           G      +GCRPY I PCE ++NG+R  C      TP+C + C+PGY  +Y+ D ++G  
Sbjct: 97  GGLYESHVGCRPYSIPPCEAHVNGARPPCTGEGD-TPKCSKICEPGYSPTYKQDKHYGYN 155

Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +YS+  +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G  +G HAIRI+GWG E
Sbjct: 156 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE 215

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
               GT     YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+ 
Sbjct: 216 ---NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 259


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  312 bits (802), Expect = e-103
 Identities = 101/286 (35%), Positives = 142/286 (49%), Gaps = 23/286 (8%)

Query: 54  AEKNA-LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT 112
           A+ +  +  +TL E +   GV   +              +    LP  FD+   WP CPT
Sbjct: 28  AKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPT 87

Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
           I +I DQ +CGS WA+ A  AMSDR C    G + V +S+ DL++CC DCG+GC GG   
Sbjct: 88  IPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISAGDLLACCSDCGDGCNGGDPD 146

Query: 173 KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS--SCQDNEPNTPECIRKCQP 229
           +AW Y+ +TG+VS         C+PY    C  +    +    C     +TP+C   C  
Sbjct: 147 RAWAYFSSTGLVSD-------YCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDD 199

Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
                          +Y+L   E+  MRE+F  GP E +  +Y D I Y +G+Y HV+G 
Sbjct: 200 PT---IPVVNYRSWTSYALQ-GEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQ 255

Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            LG HA+R++GWG      G      YW +ANS+NT WG +G F I
Sbjct: 256 YLGGHAVRLVGWGTS---NGV----PYWKIANSWNTEWGMDGYFLI 294



 Score =  193 bits (492), Expect = 2e-57
 Identities = 61/174 (35%), Positives = 84/174 (48%), Gaps = 14/174 (8%)

Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSRS--SCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
           +   GL    C+PY    C  +         C     +TP+C   C              
Sbjct: 152 FSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT---IPVVNYR 208

Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
              +Y+L   E+  MRE+F  GP E +  +Y D I Y +G+Y HV+G  LG HA+R++GW
Sbjct: 209 SWTSYALQ-GEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW 267

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
           G      G      YW +ANS+NT WG +G F I RG +ECGIE   +AG+P  
Sbjct: 268 GTS---NGV----PYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLA 314


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  253 bits (649), Expect = 3e-81
 Identities = 64/263 (24%), Positives = 101/263 (38%), Gaps = 34/263 (12%)

Query: 77  SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ---GSCGSGWALGAVEA 133
           + L +   P   +   P  +LP+ +D R N          R+Q     CGS WA  +  A
Sbjct: 17  APLGRTTYPRPHEYLSP-ADLPKSWDWR-NVDGVNYASITRNQHIPQYCGSCWAHASTSA 74

Query: 134 MSDRVCIASRGKRH-VRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
           M+DR+ I  +G      LS  +++ C       C+GG     W Y    GI         
Sbjct: 75  MADRINIKRKGAWPSTLLSVQNVIDCG--NAGSCEGGNDLSVWDYAHQHGIPD------- 125

Query: 193 QGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
           + C  Y+       +      + N+  T    ++C    + +           Y   +  
Sbjct: 126 ETCNNYQAK-----DQEC--DKFNQCGTCNEFKECHAIRNYTLWRV-----GDYGSLSGR 173

Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
           E +M EI+ +GP+   +     +  Y  GIY          H + + GWG     +GT  
Sbjct: 174 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGIS---DGT-- 228

Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
             +YW+V NS+   WGE G  RI
Sbjct: 229 --EYWIVRNSWGEPWGERGWLRI 249



 Score =  164 bits (417), Expect = 4e-47
 Identities = 42/182 (23%), Positives = 69/182 (37%), Gaps = 33/182 (18%)

Query: 327 WGENGLFRIGCRPYEI---PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
             ++G+    C  Y+     C+++          N+  T    ++C    + +       
Sbjct: 118 AHQHGIPDETCNNYQAKDQECDKF----------NQCGTCNEFKECHAIRNYTLWRV--- 164

Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
               Y   +  E +M EI+ +GP+   +     +  Y  GIY          H + + GW
Sbjct: 165 --GDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGW 222

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG--------IEADITAGLP 495
           G     +GT    +YW+V NS+   WGE G  RIV    + G        IE   T G P
Sbjct: 223 GIS---DGT----EYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDP 275

Query: 496 KI 497
            +
Sbjct: 276 IV 277


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  244 bits (624), Expect = 1e-75
 Identities = 78/286 (27%), Positives = 121/286 (42%), Gaps = 42/286 (14%)

Query: 57  NALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
                LTL ++  R G H          PL  ++   +  LP  +D R        +  +
Sbjct: 167 MEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHG-INFVSPV 225

Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAW 175
           R+Q SCGS ++  ++  +  R+ I +   +   LS  ++VSC +    GC+GGF +  A 
Sbjct: 226 RNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQ-YAQGCEGGFPYLIAG 284

Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
           KY    G+V        + C PY         G+ S C+            C   Y   Y
Sbjct: 285 KYAQDFGLVE-------EACFPYT--------GTDSPCK--------MKEDCFRYYSSEY 321

Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP----- 290
                     +    NE  +  E+  HGP+  +  +Y D + YK GIY H          
Sbjct: 322 HYVG-----GFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPF 376

Query: 291 -LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            L  HA+ ++G+G +     ++S + YW+V NS+ T WGENG FRI
Sbjct: 377 ELTNHAVLLVGYGTD-----SASGMDYWIVKNSWGTGWGENGYFRI 417



 Score =  156 bits (396), Expect = 2e-42
 Identities = 53/175 (30%), Positives = 75/175 (42%), Gaps = 32/175 (18%)

Query: 327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
             + GL    C PY         G+ S C+            C   Y   Y         
Sbjct: 287 AQDFGLVEEACFPYT--------GTDSPCK--------MKEDCFRYYSSEYHYVG----- 325

Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP------LGEHAIRI 440
            +    NE  +  E+  HGP+  +  +Y D + YK GIY H           L  HA+ +
Sbjct: 326 GFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLL 385

Query: 441 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
           +G+G +     ++S + YW+V NS+ T WGENG FRI RG +EC IE+   A  P
Sbjct: 386 VGYGTD-----SASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  162 bits (412), Expect = 2e-46
 Identities = 47/279 (16%), Positives = 81/279 (29%), Gaps = 45/279 (16%)

Query: 70  RMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
             G  PD  +   R          +  LP   D    +        + DQG  GS  A  
Sbjct: 32  GYGYIPD--IADIRDFSYTPEKSVIAALPPKVDLTPPFQ-------VYDQGRIGSCTANA 82

Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCK--DCGNGCQGGFHGKAWKYWVTTGIVSGG 187
              A+        +    +        +  K     N   G       K     G+    
Sbjct: 83  LAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEK 142

Query: 188 TYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
            +       PY                  E   P      +P  D  Y+D  N+    YS
Sbjct: 143 EW-------PYGDTPA---------DPRTEEFPPGAPASKKPS-DQCYKDAQNYKITEYS 185

Query: 248 -LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK----HVAGGPLGEHAIRIIGWG 302
            +  + + +   +    P     ++Y   +   +   +           G HA+  +G+ 
Sbjct: 186 RVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD 245

Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE 341
            E         ++++ + NS+  N GE+G F +   PYE
Sbjct: 246 DE---------IRHFRIRNSWGNNVGEDGYFWM---PYE 272



 Score = 94.5 bits (235), Expect = 9e-22
 Identities = 23/169 (13%), Positives = 54/169 (31%), Gaps = 25/169 (14%)

Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
             + G+      PY   P +           A++  + +C +  Q      Y        
Sbjct: 133 LHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEY-------- 184

Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK----HVAGGPLGEHAIRII 441
               +  + + +   +    P     ++Y   +   +   +           G HA+  +
Sbjct: 185 --SRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCV 242

Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGIEAD 489
           G+  E         ++++ + NS+  N GE+G F +     +   +  D
Sbjct: 243 GYDDE---------IRHFRIRNSWGNNVGEDGYFWMPYEYISNTQLADD 282


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  120 bits (303), Expect = 4e-31
 Identities = 47/249 (18%), Positives = 87/249 (34%), Gaps = 28/249 (11%)

Query: 106 NWPYCP---------TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
           N  YC          +  ++ DQG+C + W   +   +    C+  +G    ++S+  + 
Sbjct: 6   NKEYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCM--KGYEPTKISALYVA 63

Query: 157 SCCKDCGN-GCQGGFHGKAWKYWV--TTGIVSGGTY---ASKQGCRPYEIPCERYMNGSH 210
           +C K      C  G     +   +     + +   Y     K G +  ++         +
Sbjct: 64  NCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDN 123

Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
                N+        K    Y+     D         + A  + I  E+   G V   + 
Sbjct: 124 GKILHNKNEPNSLDGKGYTAYESERFHDN--------MDAFVKIIKTEVMNKGSVIAYIK 175

Query: 271 IYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
               M    +G   K++ G    +HA+ I+G+G     EG      YW+V NS+   WG+
Sbjct: 176 AENVMGYEFSGKKVKNLCGDDTADHAVNIVGYGNYVNSEGEKK--SYWIVRNSWGPYWGD 233

Query: 330 NGLFRIGCR 338
            G F++   
Sbjct: 234 EGYFKVDMY 242



 Score = 77.4 bits (191), Expect = 4e-16
 Identities = 30/152 (19%), Positives = 53/152 (34%), Gaps = 6/152 (3%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS--LPANEET 396
           PY                 N  +  + +        +  +    +    +   + A  + 
Sbjct: 100 PYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKI 159

Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
           I  E+   G V   +     M    +G   K++ G    +HA+ I+G+G     EG    
Sbjct: 160 IKTEVMNKGSVIAYIKAENVMGYEFSGKKVKNLCGDDTADHAVNIVGYGNYVNSEGEKK- 218

Query: 456 VKYWLVANSFNTNWGENGLFRIVR-GQNECGI 486
             YW+V NS+   WG+ G F++   G   C  
Sbjct: 219 -SYWIVRNSWGPYWGDEGYFKVDMYGPTHCHF 249


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score = 99.2 bits (248), Expect = 3e-23
 Identities = 70/286 (24%), Positives = 109/286 (38%), Gaps = 56/286 (19%)

Query: 57  NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
           NA   +T  E  ++  G                       E P   D    W     +  
Sbjct: 61  NAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ-----EPLFYEAPRSVD----WREKGYVTP 111

Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
           +++QG CGS WA  A  A+  ++   + G+  + LS  +LV C    GN GC GG    A
Sbjct: 112 VKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVDCSGPQGNEGCNGGLMDYA 169

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           ++Y     G+ S  +Y       PYE         +  SC+ N   +           D 
Sbjct: 170 FQYVQDNGGLDSEESY-------PYE--------ATEESCKYNPKYSV--------ANDA 206

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
            + D          +P  E+ +M+ +   GP+  S+ I A     + YK GIY       
Sbjct: 207 GFVD----------IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSS 254

Query: 291 LG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
              +H + ++G+G E      +   KYWLV NS+   WG  G  ++
Sbjct: 255 EDMDHGVLVVGYGFESTESDNN---KYWLVKNSWGEEWGMGGYVKM 297



 Score = 68.8 bits (169), Expect = 6e-13
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 10/104 (9%)

Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGW 443
             +P  E+ +M+ +   GP+  S+ I A     + YK GIY          +H + ++G+
Sbjct: 209 VDIPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 266

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
           G E      +   KYWLV NS+   WG  G  ++ + + N CGI
Sbjct: 267 GFESTESDNN---KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 307


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score = 95.8 bits (239), Expect = 4e-22
 Identities = 62/292 (21%), Positives = 102/292 (34%), Gaps = 71/292 (24%)

Query: 57  NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
           N  + +T  E     +          + L   V        +P+  D    W        
Sbjct: 54  NQFTDMTFEEFKAKYLTEMSR---ASDILSHGVPYEANNRAVPDKID----WRESGYVTE 106

Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQ 167
                  ++DQG+CGSGWA      M  +          +  S   LV C +  GN GC 
Sbjct: 107 -------VKDQGNCGSGWAFSTTGTMEGQYMKNE-RTS-ISFSEQQLVDCSRPWGNNGCG 157

Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
           GG    A++Y    G+ +  +Y       PY              C+ N+          
Sbjct: 158 GGLMENAYQYLKQFGLETESSY-------PYT--------AVEGQCRYNKQLGV------ 196

Query: 228 QPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA--DMILYKTGIYK 284
                  +            + +  E  +   +   GP   ++ +    D ++Y++GIY+
Sbjct: 197 --AKVTGFYT----------VHSGSEVELKNLVGAEGPA--AVAVDVESDFMMYRSGIYQ 242

Query: 285 HVAGGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
                PL   HA+  +G+G +    GT     YW+V NS+  +WGE G  R+
Sbjct: 243 SQTCSPLRVNHAVLAVGYGTQ---GGT----DYWIVKNSWGLSWGERGYIRM 287



 Score = 68.0 bits (167), Expect = 1e-12
 Identities = 32/98 (32%), Positives = 52/98 (53%), Gaps = 13/98 (13%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
           +E  +   +   GP   ++ +    D ++Y++GIY+     PL   HA+  +G+G +   
Sbjct: 209 SEVELKNLVGAEGPA--AVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQ--- 263

Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            GT     YW+V NS+  +WGE G  R+VR + N CGI
Sbjct: 264 GGT----DYWIVKNSWGLSWGERGYIRMVRNRGNMCGI 297


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score = 93.7 bits (234), Expect = 5e-22
 Identities = 61/227 (26%), Positives = 96/227 (42%), Gaps = 46/227 (20%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QG CGS WA  A  A+  ++   + G+  + LS  +LV C    GN GC GG    
Sbjct: 15  PVKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVDCSGPQGNEGCNGGLMDY 72

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
           A++Y     G+ S  +Y       PYE         +  SC+ N   +           D
Sbjct: 73  AFQYVQDNGGLDSEESY-------PYE--------ATEESCKYNPKYSV--------AND 109

Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
             + D          +P  E+ +M+ +   GP+  S+ I A     + YK GIY      
Sbjct: 110 TGFVD----------IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCS 157

Query: 290 PLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
               +H + ++G+G E      +   KYWLV NS+   WG  G  ++
Sbjct: 158 SEDMDHGVLVVGYGFESTESDNN---KYWLVKNSWGEEWGMGGYVKM 201



 Score = 68.3 bits (168), Expect = 3e-13
 Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 10/102 (9%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQ 445
           +P  E+ +M+ +   GP+  S+ I A     + YK GIY          +H + ++G+G 
Sbjct: 115 IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGF 172

Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
           E      +   KYWLV NS+   WG  G  ++ + + N CGI
Sbjct: 173 ESTESDNN---KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 211


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score = 96.2 bits (240), Expect = 5e-22
 Identities = 64/285 (22%), Positives = 110/285 (38%), Gaps = 57/285 (20%)

Query: 57  NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
           N    ++  E L            P++   L +      + L    D R N      + E
Sbjct: 77  NQFGDMSKEEFLAYVNRGKAQK--PKHPENLRMPYVSSKKPLAASVDWRSN-----AVSE 129

Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
           ++DQG CGS W+     A+  ++ +   G+    LS  +L+ C    GN GC GG+   A
Sbjct: 130 VKDQGQCGSSWSFSTTGAVEGQLALQR-GRL-TSLSEQNLIDCSSSYGNAGCDGGWMDSA 187

Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
           + Y    GI+S   Y       PYE             C+ +   +              
Sbjct: 188 FSYIHDYGIMSESAY-------PYE--------AQGDYCRFDSSQSV--------TTLSG 224

Query: 235 YEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPL 291
           Y D          LP+ +E ++   + + GPV  ++ I A  ++  Y  G++        
Sbjct: 225 YYD----------LPSGDENSLADAVGQAGPV--AVAIDATDELQFYSGGLFYDQTCNQS 272

Query: 292 G-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
              H + ++G+G +    G      YW++ NS+ + WGE+G +R 
Sbjct: 273 DLNHGVLVVGYGSD---NGQ----DYWILKNSWGSGWGESGYWRQ 310



 Score = 64.2 bits (157), Expect = 2e-11
 Identities = 27/98 (27%), Positives = 49/98 (50%), Gaps = 13/98 (13%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
           +E ++   + + GPV  ++ I A  ++  Y  G++           H + ++G+G +   
Sbjct: 232 DENSLADAVGQAGPV--AVAIDATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSD--- 286

Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            G      YW++ NS+ + WGE+G +R VR   N CGI
Sbjct: 287 NGQ----DYWILKNSWGSGWGESGYWRQVRNYGNNCGI 320


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score = 93.3 bits (233), Expect = 5e-22
 Identities = 58/228 (25%), Positives = 92/228 (40%), Gaps = 60/228 (26%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++Q  CGS WA   V  +     I + G+  + LS  +L+ C  +  + GC GG+   
Sbjct: 15  PVKNQNPCGSCWAFSTVATIEGINKIIT-GQL-ISLSEQELLDC--ERRSHGCDGGYQTT 70

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           + +Y V  G+ +   Y       PYE             C+  +   P+        Y  
Sbjct: 71  SLQYVVDNGVHTEREY-------PYE--------KKQGRCRAKDKKGPK-------VYIT 108

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
            Y+           +PAN+E  + +   + PV  S+   +       YK GIY+    GP
Sbjct: 109 GYKY----------VPANDEISLIQAIANQPV--SVVTDSRGRGFQFYKGGIYE----GP 152

Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            G    HA+  +G+G+            Y L+ NS+  NWGE G  RI
Sbjct: 153 CGTNTDHAVTAVGYGK-----------TYLLLKNSWGPNWGEKGYIRI 189



 Score = 65.6 bits (161), Expect = 2e-12
 Identities = 41/158 (25%), Positives = 62/158 (39%), Gaps = 52/158 (32%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PYE          +  C+A +   P+        Y   Y+           +PAN+E  +
Sbjct: 87  PYE--------KKQGRCRAKDKKGPK-------VYITGYKY----------VPANDEISL 121

Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
            +   + PV  S+   +       YK GIY+    GP G    HA+  +G+G+       
Sbjct: 122 IQAIANQPV--SVVTDSRGRGFQFYKGGIYE----GPCGTNTDHAVTAVGYGK------- 168

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
                Y L+ NS+  NWGE G  RI R     +  CG+
Sbjct: 169 ----TYLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGV 202


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score = 93.3 bits (233), Expect = 5e-22
 Identities = 62/228 (27%), Positives = 88/228 (38%), Gaps = 56/228 (24%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +R QGSCGS WA  AV  +     I + GK  V LS  +LV C  +  + GC+GG+   
Sbjct: 15  PVRHQGSCGSCWAFSAVATVEGINKIRT-GKL-VELSEQELVDC--ERRSHGCKGGYPPY 70

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           A +Y    GI     Y       PY+            +C+  +   P            
Sbjct: 71  ALEYVAKNGIHLRSKY-------PYK--------AKQGTCRAKQVGGP-------IVKTS 108

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
                         +  N E  +       PV  S+ + +      LYK GI++    GP
Sbjct: 109 GVGR----------VQPNNEGNLLNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GP 152

Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            G    HA+  +G+G+     G      Y L+ NS+ T WGE G  RI
Sbjct: 153 CGTKVDHAVTAVGYGKS---GGK----GYILIKNSWGTAWGEKGYIRI 193



 Score = 64.8 bits (159), Expect = 4e-12
 Identities = 37/158 (23%), Positives = 56/158 (35%), Gaps = 48/158 (30%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PY+          + +C+A +   P                          +  N E  +
Sbjct: 87  PYK--------AKQGTCRAKQVGGP-------IVKTSGVGR----------VQPNNEGNL 121

Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
                  PV  S+ + +      LYK GI++    GP G    HA+  +G+G+     G 
Sbjct: 122 LNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GPCGTKVDHAVTAVGYGKS---GGK 172

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
                Y L+ NS+ T WGE G  RI R        CG+
Sbjct: 173 ----GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 206


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score = 93.3 bits (233), Expect = 5e-22
 Identities = 61/229 (26%), Positives = 94/229 (41%), Gaps = 52/229 (22%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QGSCGS W      A+   V IA+ GK  + L+   LV C ++  N GCQGG   +
Sbjct: 16  PVKNQGSCGSCWTFSTTGALESAVAIAT-GKM-LSLAEQQLVDCAQNFNNHGCQGGLPSQ 73

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
           A++Y     GI+   TY       PY+        G    C+                + 
Sbjct: 74  AFEYIRYNKGIMGEDTY-------PYK--------GQDDHCKFQPDKAI--------AFV 110

Query: 233 VSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGG 289
               +          +  N EE ++  +  + PV  S       D ++Y+ GIY   +  
Sbjct: 111 KDVAN----------ITMNDEEAMVEAVALYNPV--SFAFEVTNDFLMYRKGIYSSTSCH 158

Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
              +   HA+  +G+G+E    G      YW+V NS+   WG NG F I
Sbjct: 159 KTPDKVNHAVLAVGYGEE---NGI----PYWIVKNSWGPQWGMNGYFLI 200



 Score = 74.0 bits (183), Expect = 2e-15
 Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 14/99 (14%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEP 447
           +EE ++  +  + PV  S       D ++Y+ GIY   +     +   HA+  +G+G+E 
Sbjct: 120 DEEAMVEAVALYNPV--SFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE- 176

Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
              G      YW+V NS+   WG NG F I RG+N CG+
Sbjct: 177 --NGI----PYWIVKNSWGPQWGMNGYFLIERGKNMCGL 209


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score = 91.8 bits (229), Expect = 2e-21
 Identities = 54/225 (24%), Positives = 87/225 (38%), Gaps = 47/225 (20%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            ++DQG CGS WA  A+  +  +  +A        LS   LVSC K   +GC GG    A
Sbjct: 15  AVKDQGQCGSCWAFSAIGNVECQWFLAG-HPL-TNLSEQMLVSCDKTD-SGCSGGLMNNA 71

Query: 175 WKYWVTT---GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
           +++ V      + +  +Y       PY         G    C  +               
Sbjct: 72  FEWIVQENNGAVYTEDSY-------PYA-----SGEGISPPCTTS--------GHTVGAT 111

Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGP 290
              + +          LP +E  I   +  +GPV  ++ + A   + Y  G+        
Sbjct: 112 ITGHVE----------LPQDEAQIAAWLAVNGPV--AVAVDASSWMTYTGGVMTSCVSEQ 159

Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           L +H + ++G+         S+ V YW++ NS+ T WGE G  RI
Sbjct: 160 L-DHGVLLVGYN-------DSAAVPYWIIKNSWTTQWGEEGYIRI 196



 Score = 78.3 bits (194), Expect = 7e-17
 Identities = 29/98 (29%), Positives = 49/98 (50%), Gaps = 11/98 (11%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
           LP +E  I   +  +GPV  ++ + A   + Y  G+        L +H + ++G+     
Sbjct: 118 LPQDEAQIAAWLAVNGPV--AVAVDASSWMTYTGGVMTSCVSEQL-DHGVLLVGYN---- 170

Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
               S+ V YW++ NS+ T WGE G  RI +G N+C +
Sbjct: 171 ---DSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLV 205


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score = 91.4 bits (228), Expect = 2e-21
 Identities = 60/228 (26%), Positives = 85/228 (37%), Gaps = 60/228 (26%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QGSCGS WA  AV  +   + I + G    + S  +L+ C  D  + GC GG+   
Sbjct: 15  PVKNQGSCGSCWAFSAVVTIEGIIKIRT-GNL-NQYSEQELLDC--DRRSYGCNGGYPWS 70

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           A +     GI    TY       PYE        G    C+  E                
Sbjct: 71  ALQLVAQYGIHYRNTY-------PYE--------GVQRYCRSREKGPYA-------AKTD 108

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
                         +    E  +     + PV  S+ + A   D  LY+ GI+     GP
Sbjct: 109 GVRQ----------VQPYNEGALLYSIANQPV--SVVLEAAGKDFQLYRGGIFV----GP 152

Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            G    HA+  +G+G             Y L+ NS+ T WGENG  RI
Sbjct: 153 CGNKVDHAVAAVGYGP-----------NYILIKNSWGTGWGENGYIRI 189



 Score = 63.2 bits (155), Expect = 1e-11
 Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 27/109 (24%)

Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRII 441
             +    E  +     + PV  S+ + A   D  LY+ GI+     GP G    HA+  +
Sbjct: 111 RQVQPYNEGALLYSIANQPV--SVVLEAAGKDFQLYRGGIFV----GPCGNKVDHAVAAV 164

Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
           G+G             Y L+ NS+ T WGENG  RI RG       CG+
Sbjct: 165 GYGP-----------NYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGL 202


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score = 93.9 bits (234), Expect = 3e-21
 Identities = 74/294 (25%), Positives = 110/294 (37%), Gaps = 77/294 (26%)

Query: 57  NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
           N  + L+  E  E  +G   D+ + Q+       +++ +  LPE  D    W       P
Sbjct: 68  NEFADLSNDEFNEKYVGSLIDATIEQSYDEEF--INEDIVNLPENVD----WRKKGAVTP 121

Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQ 167
                  +R QGSCGS WA  AV  +     I + GK  V LS  +LV C  +  + GC+
Sbjct: 122 -------VRHQGSCGSCWAFSAVATVEGINKIRT-GKL-VELSEQELVDC--ERRSHGCK 170

Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
           GG+   A +Y    GI     Y       PY+            +C+  +   P      
Sbjct: 171 GGYPPYALEYVAKNGIHLRSKY-------PYK--------AKQGTCRAKQVGGPI----- 210

Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYK 284
                               +  N E  +       PV  S+ + +      LYK GI++
Sbjct: 211 --VKTSGVGR----------VQPNNEGNLLNAIAKQPV--SVVVESKGRPFQLYKGGIFE 256

Query: 285 HVAGGPLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
               GP G     A+  +G+G+     G      Y L+ NS+ T WGE G  RI
Sbjct: 257 ----GPCGTKVDGAVTAVGYGKS---GGK----GYILIKNSWGTAWGEKGYIRI 299



 Score = 63.0 bits (154), Expect = 5e-11
 Identities = 36/158 (22%), Positives = 55/158 (34%), Gaps = 48/158 (30%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PY+          + +C+A +   P                          +  N E  +
Sbjct: 193 PYK--------AKQGTCRAKQVGGPI-------VKTSGVGR----------VQPNNEGNL 227

Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
                  PV  S+ + +      LYK GI++    GP G     A+  +G+G+     G 
Sbjct: 228 LNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GPCGTKVDGAVTAVGYGKS---GGK 278

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
                Y L+ NS+ T WGE G  RI R        CG+
Sbjct: 279 ----GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 312


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score = 91.0 bits (227), Expect = 3e-21
 Identities = 59/229 (25%), Positives = 88/229 (38%), Gaps = 62/229 (27%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +R+QG CGS W   +V A+     I + G+  + LS  +L+ C  +  + GC+GGF   
Sbjct: 15  PVRNQGGCGSCWTFSSVAAVEGINKIVT-GQL-LSLSEQELLDC--ERRSYGCRGGFPLY 70

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           A +Y   +GI     Y       PYE        G    C+ ++   P            
Sbjct: 71  ALQYVANSGIHLRQYY-------PYE--------GVQRQCRASQAKGP--------KVKT 107

Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
                          +P N E  + +     PV  S+ + A       Y+ GI+     G
Sbjct: 108 DGVGR----------VPRNNEQALIQRIAIQPV--SIVVEAKGRAFQNYRGGIFA----G 151

Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           P G    HA+  +G+G             Y L+ NS+ T WGE G  RI
Sbjct: 152 PCGTSIDHAVAAVGYGN-----------DYILIKNSWGTGWGEGGYIRI 189



 Score = 63.2 bits (155), Expect = 1e-11
 Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 27/109 (24%)

Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRII 441
             +P N E  + +     PV  S+ + A       Y+ GI+     GP G    HA+  +
Sbjct: 111 GRVPRNNEQALIQRIAIQPV--SIVVEAKGRAFQNYRGGIFA----GPCGTSIDHAVAAV 164

Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
           G+G             Y L+ NS+ T WGE G  RI RG    Q  CG+
Sbjct: 165 GYGN-----------DYILIKNSWGTGWGEGGYIRIKRGSGNPQGACGV 202


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score = 91.9 bits (229), Expect = 4e-21
 Identities = 55/225 (24%), Positives = 91/225 (40%), Gaps = 44/225 (19%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            ++DQG CGS WA   V ++     I + G   V LS  +L+ C     +GCQGG    A
Sbjct: 18  GVKDQGKCGSCWAFSTVVSVEGINAIRT-GSL-VSLSEQELIDCDTADNDGCQGGLMDNA 75

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           ++Y     G+++   Y       PY          +  +C          +      +  
Sbjct: 76  FEYIKNNGGLITEAAY-------PYR--------AARGTCNVARAAQNSPV----VVHID 116

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
            ++D          +PAN E  +     + PV  S+ + A     + Y  G++    G  
Sbjct: 117 GHQD----------VPANSEEDLARAVANQPV--SVAVEASGKAFMFYSEGVFTGECGTE 164

Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           L +H + ++G+G    G+       YW V NS+  +WGE G  R+
Sbjct: 165 L-DHGVAVVGYGVAEDGK------AYWTVKNSWGPSWGEQGYIRV 202



 Score = 64.2 bits (157), Expect = 1e-11
 Identities = 29/104 (27%), Positives = 47/104 (45%), Gaps = 16/104 (15%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +PAN E  +     + PV  S+ + A     + Y  G++    G  L +H + ++G+G  
Sbjct: 121 VPANSEEDLARAVANQPV--SVAVEASGKAFMFYSEGVFTGECGTEL-DHGVAVVGYGVA 177

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
             G+       YW V NS+  +WGE G  R+ +        CGI
Sbjct: 178 EDGK------AYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 215


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score = 91.1 bits (227), Expect = 5e-21
 Identities = 61/229 (26%), Positives = 91/229 (39%), Gaps = 55/229 (24%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            ++DQG CGS WA   + A+     I +  K  V LS  +LV C  D   GC GG    A
Sbjct: 16  SVKDQGQCGSCWAFSTIVAVEGINQIKT-NKL-VSLSEQELVDCDTDQNQGCNGGLMDYA 73

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           +++     GI +   Y       PYE            +C  +        ++  P   +
Sbjct: 74  FEFIKQRGGITTEANY-------PYE--------AYDGTCDVS--------KENAPAVSI 110

Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
             +E+          +P N+E  + +   + PV  S+ I A   D   Y  G++     G
Sbjct: 111 DGHEN----------VPENDENALLKAVANQPV--SVAIDAGGSDFQFYSEGVFT----G 154

Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
             G    H + I+G+G    G       KYW V NS+   WGE G  R+
Sbjct: 155 SCGTELDHGVAIVGYGTTIDGT------KYWTVKNSWGPEWGEKGYIRM 197



 Score = 64.1 bits (157), Expect = 8e-12
 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 22/107 (20%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGW 443
           +P N+E  + +   + PV  S+ I A   D   Y  G++     G  G    H + I+G+
Sbjct: 116 VPENDENALLKAVANQPV--SVAIDAGGSDFQFYSEGVFT----GSCGTELDHGVAIVGY 169

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
           G    G       KYW V NS+   WGE G  R+ RG    +  CGI
Sbjct: 170 GTTIDGT------KYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGI 210


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score = 90.7 bits (226), Expect = 5e-21
 Identities = 55/228 (24%), Positives = 93/228 (40%), Gaps = 56/228 (24%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QG+CGS WA   +  +     I + G   + LS  +LV C  D  + GC+GG+   
Sbjct: 15  PVKNQGACGSCWAFSTIATVEGINKIVT-GNL-LELSEQELVDC--DKHSYGCKGGYQTT 70

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           + +Y    G+ +   Y       PY+             C+  +   P+           
Sbjct: 71  SLQYVANNGVHTSKVY-------PYQ--------AKQYKCRATDKPGPK-------VKIT 108

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
            Y+           +P+N ET       + P+  S+ + A      LYK+G++     GP
Sbjct: 109 GYKR----------VPSNCETSFLGALANQPL--SVLVEAGGKPFQLYKSGVFD----GP 152

Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            G    HA+  +G+G     +G      Y ++ NS+  NWGE G  R+
Sbjct: 153 CGTKLDHAVTAVGYGTS---DGK----NYIIIKNSWGPNWGEKGYMRL 193



 Score = 61.8 bits (151), Expect = 4e-11
 Identities = 38/158 (24%), Positives = 62/158 (39%), Gaps = 48/158 (30%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PY+          +  C+A +   P+            Y+           +P+N ET  
Sbjct: 87  PYQ--------AKQYKCRATDKPGPK-------VKITGYKR----------VPSNCETSF 121

Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
                + P+  S+ + A      LYK+G++     GP G    HA+  +G+G     +G 
Sbjct: 122 LGALANQPL--SVLVEAGGKPFQLYKSGVFD----GPCGTKLDHAVTAVGYGTS---DGK 172

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
                Y ++ NS+  NWGE G  R+ R     Q  CG+
Sbjct: 173 ----NYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGV 206


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score = 89.8 bits (224), Expect = 8e-21
 Identities = 54/230 (23%), Positives = 88/230 (38%), Gaps = 58/230 (25%)

Query: 115 EIRDQGSCGSGWA---LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGF 170
           +++DQG CGS WA    G VE               + LS  +L+ C  D  +  C GG 
Sbjct: 15  KVKDQGMCGSCWAFSVTGNVEGQ-----WFLNQGTLLSLSEQELLDC--DKMDKACMGGL 67

Query: 171 HGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
              A+       G+ +   Y        Y+        G   SCQ +         +   
Sbjct: 68  PSNAYSAIKNLGGLETEDDY-------SYQ--------GHMQSCQFS--------AEKAK 104

Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG 288
            Y     +          L  NE+ +   + + GP+  S+ I A  M  Y+ GI + +  
Sbjct: 105 VYIQDSVE----------LSQNEQKLAAWLAKRGPI--SVAINAFGMQFYRHGISRPLRP 152

Query: 289 --GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
              P   +HA+ ++G+GQ            +W + NS+ T+WGE G + +
Sbjct: 153 LCSPWLIDHAVLLVGYGQR---SDV----PFWAIKNSWGTDWGEKGYYYL 195



 Score = 77.1 bits (191), Expect = 2e-16
 Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 13/103 (12%)

Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG--GPLG-EHAIRIIGW 443
             L  NE+ +   + + GP+  S+ I A  M  Y+ GI + +     P   +HA+ ++G+
Sbjct: 111 VELSQNEQKLAAWLAKRGPI--SVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGY 168

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
           GQ            +W + NS+ T+WGE G + + RG   CG+
Sbjct: 169 GQR---SDV----PFWAIKNSWGTDWGEKGYYYLHRGSGACGV 204


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score = 89.9 bits (224), Expect = 9e-21
 Identities = 63/229 (27%), Positives = 93/229 (40%), Gaps = 55/229 (24%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
           +I+DQG CGS WA   + A+     IA+ G   + LS  +LV C +     GC GGF   
Sbjct: 15  DIKDQGQCGSAWAFSTIAAVEGINKIAT-GDL-ISLSEQELVDCGRTQNTRGCDGGFMTD 72

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
            +++ +   GI +   Y       PY              C  +                
Sbjct: 73  GFQFIINNGGINTEANY-------PYT--------AEEGQCNLDLQQEKY-------VSI 110

Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
            +YE+          +P N E  ++    + PV  S+ + A   +   Y +GI+     G
Sbjct: 111 DTYEN----------VPYNNEWALQTAVAYQPV--SVALEAAGYNFQHYSSGIFT----G 154

Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           P G    HA+ I+G+G E    G      YW+V NS+ T WGE G  RI
Sbjct: 155 PCGTAVDHAVTIVGYGTE---GGI----DYWIVKNSWGTTWGEEGYMRI 196



 Score = 62.9 bits (154), Expect = 2e-11
 Identities = 41/157 (26%), Positives = 59/157 (37%), Gaps = 47/157 (29%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PY              C  +                 +YE+          +P N E  +
Sbjct: 90  PYT--------AEEGQCNLDLQQEKY-------VSIDTYEN----------VPYNNEWAL 124

Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
           +    + PV  S+ + A   +   Y +GI+     GP G    HA+ I+G+G E    G 
Sbjct: 125 QTAVAYQPV--SVALEAAGYNFQHYSSGIFT----GPCGTAVDHAVTIVGYGTE---GGI 175

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG---QNECGI 486
                YW+V NS+ T WGE G  RI R      +CGI
Sbjct: 176 ----DYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGI 208


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score = 90.4 bits (225), Expect = 1e-20
 Identities = 62/228 (27%), Positives = 95/228 (41%), Gaps = 48/228 (21%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
           +++ QG CGSGWA  A  A+     IA+ G   V LS  +L+ C  +   GC  G+H ++
Sbjct: 16  KVKFQGQCGSGWAFSATGAIEAAHAIAT-GNL-VSLSEQELIDCVDES-EGCYNGWHYQS 72

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           +++ V   GI S   Y       PY+             C+ NE      I      Y V
Sbjct: 73  FEWVVKHGGIASEADY-------PYK--------ARDGKCKANEIQDKVTID----NYGV 113

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLG 292
               + +         +  E+ ++      P+  S++I A D   Y  GIY    GG   
Sbjct: 114 QILSNES-------TESEAESSLQSFVLEQPI--SVSIDAKDFHFYSGGIYD---GGNCS 161

Query: 293 E-----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
                 H + I+G+G E   +G      YW+  NS+  +WG +G  RI
Sbjct: 162 SPYGINHFVLIVGYGSE---DGV----DYWIAKNSWGEDWGIDGYIRI 202



 Score = 68.4 bits (168), Expect = 3e-13
 Identities = 39/158 (24%), Positives = 59/158 (37%), Gaps = 41/158 (25%)

Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
           PY+             C+ANE      I      Y V    + +         +  E+ +
Sbjct: 89  PYK--------ARDGKCKANEIQDKVTID----NYGVQILSNES-------TESEAESSL 129

Query: 399 REIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGT 452
           +      P+  S++I A D   Y  GIY    GG         H + I+G+G E   +G 
Sbjct: 130 QSFVLEQPI--SVSIDAKDFHFYSGGIYD---GGNCSSPYGINHFVLIVGYGSE---DGV 181

Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
                YW+  NS+  +WG +G  RI R        CG+
Sbjct: 182 ----DYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGM 215


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score = 91.5 bits (228), Expect = 2e-20
 Identities = 70/295 (23%), Positives = 107/295 (36%), Gaps = 70/295 (23%)

Query: 57  NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ---LSDPLEELPEGFDARINW----- 107
           N  + +T  E      G+   + L +N +P+  +     +     P  FD    W     
Sbjct: 72  NLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGM 127

Query: 108 --PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG 165
             P       +++QGSCGS WA  +  A+  ++ IA+       +S   LV C  +   G
Sbjct: 128 VSP-------VKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA-LG 179

Query: 166 CQGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI 224
           C GG+   A+ Y     GI S G Y       PYE         +  +C  +        
Sbjct: 180 CSGGWMNDAFTYVAQNGGIDSEGAY-------PYE--------MADGNCHYDPNQVA--- 221

Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA--DMILYKTG 281
                     Y            L   +E  +   +   GPV  ++   A      Y  G
Sbjct: 222 -----ARLSGYVY----------LSGPDENMLADMVATKGPV--AVAFDADDPFGSYSGG 264

Query: 282 IYKHVAGGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           +Y +         HA+ I+G+G E    G      YWLV NS+   WG +G F+I
Sbjct: 265 VYYNPTCETNKFTHAVLIVGYGNE---NGQ----DYWLVKNSWGDGWGLDGYFKI 312



 Score = 69.2 bits (170), Expect = 5e-13
 Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 13/98 (13%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
           +E  +   +   GPV  ++   A      Y  G+Y +         HA+ I+G+G E   
Sbjct: 234 DENMLADMVATKGPV--AVAFDADDPFGSYSGGVYYNPTCETNKFTHAVLIVGYGNE--- 288

Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            G      YWLV NS+   WG +G F+I R   N CGI
Sbjct: 289 NGQ----DYWLVKNSWGDGWGLDGYFKIARNANNHCGI 322


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score = 88.3 bits (220), Expect = 4e-20
 Identities = 48/228 (21%), Positives = 71/228 (31%), Gaps = 54/228 (23%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            IR QG CGS WA   V A           +  + L+  +LV C     +GC G    + 
Sbjct: 24  PIRMQGGCGSAWAFSGVAATESAYLAYR-QQS-LDLAEQELVDCASQ--HGCHGDTIPRG 79

Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
            +Y    G+V    Y        Y             SC+                   +
Sbjct: 80  IEYIQHNGVVQESYY-------RYV--------AREQSCRRPNAQR---------FGISN 115

Query: 235 YEDDLNFGRIAYSLPANEETIMRE--IFRHGPVEGSMTIYA----DMILYKTGIYKHVAG 288
           Y            +       +RE     H  +  ++ I          Y          
Sbjct: 116 YCQ----------IYPPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDN 163

Query: 289 GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           G     HA+ I+G+         +  V YW+V NS++TNWG+NG    
Sbjct: 164 GYQPNYHAVNIVGYS-------NAQGVDYWIVRNSWDTNWGDNGYGYF 204



 Score = 64.5 bits (158), Expect = 5e-12
 Identities = 24/102 (23%), Positives = 38/102 (37%), Gaps = 15/102 (14%)

Query: 391 PANEETIMREIFR-HGPVEGSMTIYA----DMILYKTGIYKHVAGGPLG-EHAIRIIGWG 444
           P N   I   + + H  +  ++ I          Y          G     HA+ I+G+ 
Sbjct: 121 PPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 178

Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
                   +  V YW+V NS++TNWG+NG        +   I
Sbjct: 179 -------NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMI 213


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score = 86.9 bits (216), Expect = 7e-20
 Identities = 54/224 (24%), Positives = 80/224 (35%), Gaps = 54/224 (24%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QGSCGS WA   V  +     I + G   + LS  +LV C  D  N GC GG    
Sbjct: 15  PVKNQGSCGSCWAFSTVSTVESINQIRT-GNL-ISLSEQELVDC--DKKNHGCLGGAFVF 70

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
           A++Y +   GI +   Y       PY+             CQ                  
Sbjct: 71  AYQYIINNGGIDTQANY-------PYK--------AVQGPCQAASKVV----------SI 105

Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI-YADMILYKTGIYKHVAGGPL 291
             Y            +P   E  +++     P   ++    A    Y +GI+    G  L
Sbjct: 106 DGYNG----------VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKL 155

Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
             H + I+G+              YW+V NS+   WGE G  R+
Sbjct: 156 -NHGVTIVGYQA-----------NYWIVRNSWGRYWGEKGYIRM 187



 Score = 66.5 bits (163), Expect = 9e-13
 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 15/100 (15%)

Query: 390 LPANEETIMREIFRHGPVEGSMTI-YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
           +P   E  +++     P   ++    A    Y +GI+    G  L  H + I+G+     
Sbjct: 111 VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKL-NHGVTIVGYQA--- 166

Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVR--GQNECGI 486
                    YW+V NS+   WGE G  R++R  G   CGI
Sbjct: 167 --------NYWIVRNSWGRYWGEKGYIRMLRVGGCGLCGI 198


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score = 89.2 bits (222), Expect = 9e-20
 Identities = 60/289 (20%), Positives = 86/289 (29%), Gaps = 61/289 (21%)

Query: 57  NALSKLTLSEL-EMRMGVHPDSKL--PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
           N LS L+L E     +      +    Q  L             P   D R       T 
Sbjct: 47  NHLSDLSLDEFKNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMR--TVT- 103

Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
             IR QG CGS WA   V A           +  + L+  +LV C     +GC G    +
Sbjct: 104 -PIRMQGGCGSAWAFSGVAATESAYLAYR-DQ-SLDLAEQELVDCASQ--HGCHGDTIPR 158

Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
             +Y    G+V    Y        Y             SC+                   
Sbjct: 159 GIEYIQHNGVVQESYY-------RYV--------AREQSCRRPNAQR---------FGIS 194

Query: 234 SYEDDLNFGRIAYSLPANEETIMRE--IFRHGPVEGSMTIYA----DMILYKTGIYKHVA 287
           +Y            +       +RE     H  +  ++ I          Y         
Sbjct: 195 NYCQ----------IYPPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRD 242

Query: 288 GGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            G     HA+ I+G+         +  V YW+V NS++TNWG+NG    
Sbjct: 243 NGYQPNYHAVNIVGYS-------NAQGVDYWIVRNSWDTNWGDNGYGYF 284



 Score = 65.7 bits (161), Expect = 5e-12
 Identities = 24/102 (23%), Positives = 38/102 (37%), Gaps = 15/102 (14%)

Query: 391 PANEETIMREIFR-HGPVEGSMTIYA----DMILYKTGIYKHVAGGPLG-EHAIRIIGWG 444
           P N   I   + + H  +  ++ I          Y          G     HA+ I+G+ 
Sbjct: 201 PPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 258

Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
                   +  V YW+V NS++TNWG+NG        +   I
Sbjct: 259 -------NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMI 293


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score = 88.8 bits (221), Expect = 1e-19
 Identities = 68/286 (23%), Positives = 112/286 (39%), Gaps = 58/286 (20%)

Query: 57  NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
           N L  +T  E+ +   G+       ++   L +   +     P+  D    +     +  
Sbjct: 61  NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYI--PEWEGRAPDSVD----YRKKGYVTP 114

Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
           +++QG CGS WA  +V A+  ++   + GK  + LS  +LV C  +  +GC GG+   A+
Sbjct: 115 VKNQGQCGSCWAFSSVGALEGQLKKKT-GKL-LNLSPQNLVDCVSEN-DGCGGGYMTNAF 171

Query: 176 KYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
           +Y     GI S   Y       PY         G   SC  N                  
Sbjct: 172 QYVQKNRGIDSEDAY-------PYV--------GQEESCMYNPTGKA--------AKCRG 208

Query: 235 YEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
           Y +          +P  NE+ + R + R GPV  S+ I A       Y  G+Y   +   
Sbjct: 209 YRE----------IPEGNEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNS 256

Query: 291 LG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
               HA+  +G+G +   +G     K+W++ NS+  NWG  G   +
Sbjct: 257 DNLNHAVLAVGYGIQ---KGN----KHWIIKNSWGENWGNKGYILM 295



 Score = 63.4 bits (155), Expect = 3e-11
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 14/99 (14%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPL 448
           NE+ + R + R GPV  S+ I A       Y  G+Y   +       HA+  +G+G +  
Sbjct: 216 NEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-- 271

Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            +G     K+W++ NS+  NWG  G   + R + N CGI
Sbjct: 272 -KGN----KHWIIKNSWGENWGNKGYILMARNKNNACGI 305


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score = 86.0 bits (214), Expect = 2e-19
 Identities = 64/230 (27%), Positives = 96/230 (41%), Gaps = 60/230 (26%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            +++QG CGS WA   V A+     I + G   + LS   LV C     N GC+GG+   
Sbjct: 17  PVKNQGGCGSCWAFSTVAAVEGINQIVT-GDL-ISLSEQQLVDC--TTANHGCRGGWMNP 72

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG-Y 231
           A+++ V   GI S  TY       PY         G    C               P   
Sbjct: 73  AFQFIVNNGGINSEETY-------PYR--------GQDGICNST---------VNAPVVS 108

Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
             SYE+          +P++ E  +++   + PV  S+T+ A   D  LY++GI+     
Sbjct: 109 IDSYEN----------VPSHNEQSLQKAVANQPV--SVTMDAAGRDFQLYRSGIFT---- 152

Query: 289 GPLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           G       HA+ ++G+G E           +W+V NS+  NWGE+G  R 
Sbjct: 153 GSCNISANHALTVVGYGTE---NDK----DFWIVKNSWGKNWGESGYIRA 195



 Score = 62.2 bits (152), Expect = 3e-11
 Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 23/107 (21%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGW 443
           +P++ E  +++   + PV  S+T+ A   D  LY++GI+     G       HA+ ++G+
Sbjct: 115 VPSHNEQSLQKAVANQPV--SVTMDAAGRDFQLYRSGIFT----GSCNISANHALTVVGY 168

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
           G E           +W+V NS+  NWGE+G  R  R        CGI
Sbjct: 169 GTE---NDK----DFWIVKNSWGKNWGESGYIRAERNIENPDGKCGI 208


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score = 84.9 bits (211), Expect = 5e-19
 Identities = 56/225 (24%), Positives = 87/225 (38%), Gaps = 51/225 (22%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            I++Q  CGS WA  AV A+     I + G+  + LS  +LV C     +GC GG+   A
Sbjct: 15  SIKNQKQCGSCWAFSAVAAVESINKIRT-GQL-ISLSEQELVDCDTAS-HGCNGGWMNNA 71

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           ++Y +T  GI +   Y       PY             SC+                   
Sbjct: 72  FQYIITNGGIDTQQNY-------PYS--------AVQGSCKPYRLRV---------VSIN 107

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
            ++           +  N E+ ++      PV  S+T+ A       Y +GI+    G  
Sbjct: 108 GFQR----------VTRNNESALQSAVASQPV--SVTVEAAGAPFQHYSSGIFTGPCGTA 155

Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
              H + I+G+G +    G      YW+V NS+  NWG  G   +
Sbjct: 156 Q-NHGVVIVGYGTQ---SGK----NYWIVRNSWGQNWGNQGYIWM 192



 Score = 61.0 bits (149), Expect = 6e-11
 Identities = 29/104 (27%), Positives = 44/104 (42%), Gaps = 17/104 (16%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +  N E+ ++      PV  S+T+ A       Y +GI+    G     H + I+G+G +
Sbjct: 112 VTRNNESALQSAVASQPV--SVTVEAAGAPFQHYSSGIFTGPCGTAQ-NHGVVIVGYGTQ 168

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
               G      YW+V NS+  NWG  G   + R        CGI
Sbjct: 169 ---SGK----NYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGI 205


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score = 84.9 bits (211), Expect = 5e-19
 Identities = 55/227 (24%), Positives = 88/227 (38%), Gaps = 50/227 (22%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            ++DQ  CGS WA     A+    C  + GK  V LS  +L+ C +  GN  C GG    
Sbjct: 21  PVKDQRDCGSCWAFSTTGALEGAHCAKT-GKL-VSLSEQELMDCSRAEGNQSCSGGEMND 78

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
           A++Y + + GI S   Y       PY              C+            C+    
Sbjct: 79  AFQYVLDSGGICSEDAY-------PYL--------ARDEECRAQ---------SCEKVVK 114

Query: 233 VS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
           +  ++D          +P   E  M+      PV  S+ I A       Y  G++    G
Sbjct: 115 ILGFKD----------VPRRSEAAMKAALAKSPV--SIAIEADQMPFQFYHEGVFDASCG 162

Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
             L +H + ++G+G +       S   +W++ NS+ T WG +G   +
Sbjct: 163 TDL-DHGVLLVGYGTDK-----ESKKDFWIMKNSWGTGWGRDGYMYM 203



 Score = 60.3 bits (147), Expect = 1e-10
 Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 14/103 (13%)

Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           +P   E  M+      PV  S+ I A       Y  G++    G  L +H + ++G+G +
Sbjct: 121 VPRRSEAAMKAALAKSPV--SIAIEADQMPFQFYHEGVFDASCGTDL-DHGVLLVGYGTD 177

Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG---QNECGI 486
                  S   +W++ NS+ T WG +G   +      + +CG+
Sbjct: 178 K-----ESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 215


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score = 84.5 bits (210), Expect = 7e-19
 Identities = 59/228 (25%), Positives = 92/228 (40%), Gaps = 53/228 (23%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            +++QG CGS WA  +V A+  ++   + GK  + LS  +LV C  +  +GC GG+   A
Sbjct: 15  PVKNQGQCGSCWAFSSVGALEGQLKKKT-GKL-LNLSPQNLVDCVSEN-DGCGGGYMTNA 71

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG-YD 232
           ++Y     GI S   Y       PY         G   SC  N                 
Sbjct: 72  FQYVQKNRGIDSEDAY-------PYV--------GQEESCMYN---------PTGKAAKC 107

Query: 233 VSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
             Y +          +P   E+ + R + R GPV  S+ I A       Y  G+Y   + 
Sbjct: 108 RGYRE----------IPEGNEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESC 155

Query: 289 GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
                 HA+  +G+G +   +G     K+W++ NS+  NWG  G   +
Sbjct: 156 NSDNLNHAVLAVGYGIQ---KGN----KHWIIKNSWGENWGNKGYILM 196



 Score = 59.4 bits (145), Expect = 2e-10
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 14/99 (14%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPL 448
           NE+ + R + R GPV  S+ I A       Y  G+Y   +       HA+  +G+G +  
Sbjct: 117 NEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-- 172

Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            +G     K+W++ NS+  NWG  G   + R + N CGI
Sbjct: 173 -KGN----KHWIIKNSWGENWGNKGYILMARNKNNACGI 206


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score = 83.4 bits (207), Expect = 3e-18
 Identities = 54/229 (23%), Positives = 90/229 (39%), Gaps = 50/229 (21%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
            ++DQ +CGS WA  ++ ++  +  I    K  + LS  +LV C     N GC GG    
Sbjct: 32  PVKDQKNCGSCWAFSSIGSVESQYAIRK-NK-LITLSEQELVDC--SFKNYGCNGGLINN 87

Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
           A++  +   GI   G Y       PY        + + + C  +         +C   Y 
Sbjct: 88  AFEDMIELGGICPDGDY-------PYV-------SDAPNLCNID---------RCTEKYG 124

Query: 233 VS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGG 289
           +  Y            +P N+      +   GP+  S+++    D   YK GI+    G 
Sbjct: 125 IKNYLS----------VPDNKL--KEALRFLGPI--SISVAVSDDFAFYKEGIFDGECGD 170

Query: 290 PLGEHAIRIIGWGQE---PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
            L  HA+ ++G+G +              Y+++ NS+   WGE G   I
Sbjct: 171 QL-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINI 218



 Score = 64.9 bits (159), Expect = 5e-12
 Identities = 27/108 (25%), Positives = 47/108 (43%), Gaps = 14/108 (12%)

Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
            S+P N+      +   GP+  S+++    D   YK GI+    G  L  HA+ ++G+G 
Sbjct: 129 LSVPDNKL--KEALRFLGPI--SISVAVSDDFAFYKEGIFDGECGDQL-NHAVMLVGFGM 183

Query: 446 E---PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
           +              Y+++ NS+   WGE G   I   ++     CG+
Sbjct: 184 KEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGL 231


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score = 83.0 bits (206), Expect = 3e-18
 Identities = 50/228 (21%), Positives = 87/228 (38%), Gaps = 48/228 (21%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            ++DQ  CGS WA  +V ++  +  I          S  +LV C     NGC GG+   A
Sbjct: 34  PVKDQALCGSCWAFSSVGSVESQYAIRK-KA-LFLFSEQELVDCSVKN-NGCYGGYITNA 90

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           +   +   G+ S   Y       PY        +    +C            +C   Y +
Sbjct: 91  FDDMIDLGGLCSQDDY-------PYV-------SNLPETCNLK---------RCNERYTI 127

Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGP 290
             Y            +P ++      +   GP+  S++I A  D   Y+ G Y    G  
Sbjct: 128 KSYVS----------IPDDKF--KEALRYLGPI--SISIAASDDFAFYRGGFYDGECGAA 173

Query: 291 LGEHAIRIIGWGQEPLGEGTSSV---VKYWLVANSFNTNWGENGLFRI 335
              HA+ ++G+G + +    +       Y+++ NS+ ++WGE G   +
Sbjct: 174 P-NHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 220



 Score = 64.5 bits (158), Expect = 7e-12
 Identities = 26/107 (24%), Positives = 49/107 (45%), Gaps = 14/107 (13%)

Query: 389 SLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
           S+P ++      +   GP+  S++I A  D   Y+ G Y    G     HA+ ++G+G +
Sbjct: 132 SIPDDKF--KEALRYLGPI--SISIAASDDFAFYRGGFYDGECGAAP-NHAVILVGYGMK 186

Query: 447 PLGEGTSSV---VKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
            +    +       Y+++ NS+ ++WGE G   +   +N     C I
Sbjct: 187 DIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSI 233


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score = 83.8 bits (208), Expect = 5e-18
 Identities = 72/294 (24%), Positives = 115/294 (39%), Gaps = 74/294 (25%)

Query: 57  NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
           N L  +T  E + +        ++P      +   S+P   LP+  D    W        
Sbjct: 62  NHLGDMTSEEVMSLMSS----LRVPSQWQRNITYKSNPNRILPDSVD----WREKGCVTE 113

Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG--NGC 166
                  ++ QGSCG+ WA  AV A+  ++ + + GK  V LS+ +LV C  +     GC
Sbjct: 114 -------VKYQGSCGAAWAFSAVGALEAQLKLKT-GKL-VSLSAQNLVDCSTEKYGNKGC 164

Query: 167 QGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
            GGF   A++Y +   GI S  +Y       PY+             CQ +         
Sbjct: 165 NGGFMTTAFQYIIDNKGIDSDASY-------PYK--------AMDQKCQYDSKYRA---- 205

Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTG 281
                    Y +          LP   E+ +   +   GPV  S+ + A      LY++G
Sbjct: 206 ----ATCSKYTE----------LPYGREDVLKEAVANKGPV--SVGVDARHPSFFLYRSG 249

Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
           +Y   +      H + ++G+G      G     +YWLV NS+  N+GE G  R+
Sbjct: 250 VYYEPSCTQNVNHGVLVVGYGDL---NGK----EYWLVKNSWGHNFGEEGYIRM 296



 Score = 62.3 bits (152), Expect = 7e-11
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 13/98 (13%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
            E+ +   +   GPV  S+ + A      LY++G+Y   +      H + ++G+G     
Sbjct: 218 REDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL--- 272

Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            G     +YWLV NS+  N+GE G  R+ R + N CGI
Sbjct: 273 NGK----EYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 306


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score = 80.6 bits (200), Expect = 1e-17
 Identities = 65/228 (28%), Positives = 101/228 (44%), Gaps = 51/228 (22%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC-GN-GCQGGFHG 172
           E++ QGSCG+ WA  AV A+  ++ + + GK  V LS+ +LV C  +  GN GC GGF  
Sbjct: 16  EVKYQGSCGACWAFSAVGALEAQLKLKT-GKL-VSLSAQNLVDCSTEKYGNKGCNGGFMT 73

Query: 173 KAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
            A++Y +   GI S  +Y       PY+             CQ +         K +   
Sbjct: 74  TAFQYIIDNKGIDSDASY-------PYK--------AMDQKCQYD--------SKYRAAT 110

Query: 232 DVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVA 287
              Y +          LP   E+ +   +   GPV  S+ + A      LY++G+Y   +
Sbjct: 111 CSKYTE----------LPYGREDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPS 158

Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
                 H + ++G+G      G     +YWLV NS+  N+GE G  R+
Sbjct: 159 CTQNVNHGVLVVGYGDL---NGK----EYWLVKNSWGHNFGEEGYIRM 199



 Score = 60.2 bits (147), Expect = 1e-10
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 13/98 (13%)

Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
            E+ +   +   GPV  S+ + A      LY++G+Y   +      H + ++G+G     
Sbjct: 121 REDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL--- 175

Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
            G     +YWLV NS+  N+GE G  R+ R + N CGI
Sbjct: 176 NGK----EYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 209


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score = 68.7 bits (168), Expect = 2e-13
 Identities = 51/227 (22%), Positives = 83/227 (36%), Gaps = 50/227 (22%)

Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
            ++DQG+CG  WA GA  A+     I + G+  + +S   +V C         GG    A
Sbjct: 15  SVKDQGACGMCWAFGATGAIEGIDAITT-GRL-ISVSEQQIVDCDTXXXXXX-GGDADDA 71

Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
           +++ +T  GI S   Y       PY         G   +C  N+P               
Sbjct: 72  FRWVITNGGIASDANY-------PYT--------GVDGTCDLNKPIAARI---------D 107

Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI-YADMILYKT-GIYKHVAGGPL 291
            Y +          +P +   ++  +    PV  ++        LY   GI+   +    
Sbjct: 108 GYTN----------VPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDD 156

Query: 292 G---EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
               +H + I+G+G             YW+V NS+ T WG +G   I
Sbjct: 157 PATVDHTVLIVGYGSNGTNA------DYWIVKNSWGTEWGIDGYILI 197



 Score = 58.3 bits (141), Expect = 7e-10
 Identities = 25/107 (23%), Positives = 42/107 (39%), Gaps = 16/107 (14%)

Query: 389 SLPANEETIMREIFRHGPVEGSMTI-YADMILYKT-GIYKHVAGGPLG---EHAIRIIGW 443
           ++P +   ++  +    PV  ++        LY   GI+   +        +H + I+G+
Sbjct: 111 NVPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGY 169

Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
           G             YW+V NS+ T WG +G   I R  N     C I
Sbjct: 170 GSNGTNA------DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAI 210


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 42.7 bits (100), Expect = 3e-04
 Identities = 54/385 (14%), Positives = 107/385 (27%), Gaps = 143/385 (37%)

Query: 209 SHSSCQDNEPNTPE---------CIRKCQPGYDVSYEDDLN-----FGR----------I 244
           +     D+EP TP               +P     ++  LN     F            +
Sbjct: 45  TEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHAL 104

Query: 245 AYSLPANEETIM---REIFRHGPVEGSMTIY--ADMILYKTGIYKHVAGGPLGEHA---- 295
           A  L    +T +   +E+ ++         Y  A ++  +   +   +   L        
Sbjct: 105 AAKLLQENDTTLVKTKELIKN---------YITARIMAKRP--FDKKSNSALFRAVGEGN 153

Query: 296 IRII---GWGQEPLGEGTSSVVKYW--LVANSFNTNWGENGLFRIGCRPYEIPCERYMNG 350
            +++   G GQ     G +    Y+  L             L++     Y +     +  
Sbjct: 154 AQLVAIFG-GQ-----GNTDD--YFEELRD-----------LYQT----YHVLVGDLIKF 190

Query: 351 SRSSCQANEPNTPECIRKCQPGYDV--------SYEDDLNFGRIAYSLP-------ANEE 395
           S  +       T +  +    G ++        +  D      I  S P       A+  
Sbjct: 191 SAETLSELIRTTLDAEKVFTQGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYV 250

Query: 396 TIMREI-FRHGPV----EGSMTIYADMILYKTGIYKHVAGGPLGEH-------AIRI--- 440
              + + F  G +    +G+      ++   T +   +A     E        AI +   
Sbjct: 251 VTAKLLGFTPGELRSYLKGATGHSQGLV---TAVA--IAETDSWESFFVSVRKAITVLFF 305

Query: 441 IGW-GQE--PLGEGTSSVVKYWLVANSFNTNWGENG-------LFRIVRGQNECGIEADI 490
           IG    E  P      S+++  L          EN        L   +    +  ++  +
Sbjct: 306 IGVRCYEAYPNTSLPPSILEDSL----------ENNEGVPSPML--SISNLTQEQVQDYV 353

Query: 491 T---AGLPK-----IGLEIDSNEIN 507
               + LP      I L      +N
Sbjct: 354 NKTNSHLPAGKQVEISL------VN 372



 Score = 36.6 bits (84), Expect = 0.026
 Identities = 39/265 (14%), Positives = 68/265 (25%), Gaps = 108/265 (40%)

Query: 351 SRSSCQANEPNTPE---------CIRKCQPGYDVSYEDDLN-----FGR----------I 386
           +      +EP TP               +P     ++  LN     F            +
Sbjct: 45  TEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHAL 104

Query: 387 AYSLPANEETIM---REIFR---------HGPV-------------EGSMTIYADMI--- 418
           A  L    +T +   +E+ +           P              EG+  + A  I   
Sbjct: 105 AAKLLQENDTTLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEGNAQLVA--IFGG 162

Query: 419 -------------LYKTGIYK-------HVAGGPLGE-------------HAIRIIGWGQ 445
                        LY+T  Y          +   L E               + I+ W +
Sbjct: 163 QGNTDDYFEELRDLYQT--YHVLVGDLIKFSAETLSELIRTTLDAEKVFTQGLNILEWLE 220

Query: 446 EP--------LGEGTSSV--------VKYWLVANSFNTNWGENGLFRIVRGQNECGIEAD 489
            P        L     S           Y + A       GE  L   ++G      +  
Sbjct: 221 NPSNTPDKDYLLSIPISCPLIGVIQLAHYVVTAKLLGFTPGE--LRSYLKGATGHS-QGL 277

Query: 490 ITAGLPKIGLEIDSNEINLGKMMTL 514
           +TA         +S  +++ K +T+
Sbjct: 278 VTAVAIAETDSWESFFVSVRKAITV 302



 Score = 30.4 bits (68), Expect = 1.7
 Identities = 19/99 (19%), Positives = 30/99 (30%), Gaps = 26/99 (26%)

Query: 203  ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY-----EDDLNFGRIAYSLPANEETIMR 257
            E Y      +  D +  T E I K    +  SY     +  L+     ++ PA       
Sbjct: 1686 ENYSAMIFETIVDGKLKT-EKIFKEINEHSTSYTFRSEKGLLS--ATQFTQPA------- 1735

Query: 258  EIFRHGPVEGSMTIYADMILYKTGIY---KHVAGGPLGE 293
             +            + D+     G+       AG  LGE
Sbjct: 1736 -LTLM-----EKAAFEDLK--SKGLIPADATFAGHSLGE 1766



 Score = 30.4 bits (68), Expect = 1.9
 Identities = 11/83 (13%), Positives = 22/83 (26%), Gaps = 32/83 (38%)

Query: 382  NFGRIAYSLPANEETIMREIFR-----------HGPVEG---------------SMTIYA 415
            N+  + +    + +    +IF+               +G                   + 
Sbjct: 1687 NYSAMIFETIVDGKLKTEKIFKEINEHSTSYTFRSE-KGLLSATQFTQPALTLMEKAAFE 1745

Query: 416  DMILYKTGIY---KHVAGGPLGE 435
            D+     G+       AG  LGE
Sbjct: 1746 DLK--SKGLIPADATFAGHSLGE 1766


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 42.1 bits (98), Expect = 5e-04
 Identities = 63/489 (12%), Positives = 115/489 (23%), Gaps = 174/489 (35%)

Query: 1   MGKSTADAVATFLKDLDLSQSSRNHSNGVF------CD-----------LSKAFD----- 38
            GK+             +          +F      C+           L    D     
Sbjct: 161 SGKTWV--ALDVCLSYKVQ---CKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTS 215

Query: 39  RVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLV--QLSDPLEE 96
           R DHS  +  L    + +  L +L      ++   + +         LLV   + +   +
Sbjct: 216 RSDHSSNIK-LRI-HSIQAELRRL------LKSKPYENC--------LLVLLNVQNA--K 257

Query: 97  LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
               F+       C  +   R +              ++D +           +S D   
Sbjct: 258 AWNAFNLS-----CKILLTTRFKQ-------------VTDFL----SAATTTHISLDHH- 294

Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH------ 210
                                                 CRP ++P E  +  ++      
Sbjct: 295 ----------SMTLTPDE----------VKSLLLKYLDCRPQDLPRE--VLTTNPRRLSI 332

Query: 211 --SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
              S +D    T +  +      +    D L    I  SL   E    R++F        
Sbjct: 333 IAESIRDG-LATWDNWKH----VNC---DKLT-TIIESSLNVLEPAEYRKMFD------R 377

Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
           ++++                  +    + +I W      +    V K  L   S      
Sbjct: 378 LSVFPP----------SA---HIPTILLSLI-WFDVIKSDVMVVVNK--LHKYSLVEKQP 421

Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPE---------CIRKCQPGYDVS--Y 377
           +     I      I    Y+       +    N             I K     D+   Y
Sbjct: 422 KESTISI----PSI----YL---ELKVKLE--NEYALHRSIVDHYNIPKTFDSDDLIPPY 468

Query: 378 EDDLNFGRIAYSLPANEET----IMREIF----------RH-----GPVEGSMTIYADMI 418
            D   +  I + L   E      + R +F          RH           +     + 
Sbjct: 469 LDQYFYSHIGHHLKNIEHPERMTLFRMVFLDFRFLEQKIRHDSTAWNASGSILNTLQQLK 528

Query: 419 LYKTGIYKH 427
            YK  I  +
Sbjct: 529 FYKPYICDN 537


>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
          photosynthetic reaction center, peripheral antenna;
          HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
          Length = 154

 Score = 36.9 bits (84), Expect = 0.004
 Identities = 12/41 (29%), Positives = 17/41 (41%), Gaps = 11/41 (26%)

Query: 55 EKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE 95
          EK AL KL  + L++      DS       P L  +   +E
Sbjct: 18 EKQALKKLQ-ASLKL---YADDSA------PALA-IKATME 47


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 33.3 bits (75), Expect = 0.17
 Identities = 50/392 (12%), Positives = 98/392 (25%), Gaps = 67/392 (17%)

Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
           F      P    I  +++Q   G+ W   +   +     +   GK    LS    V    
Sbjct: 14  FTTVKENP----ITSVKNQNRAGTCWCYSSYSFL--ESELLRMGKGEYDLSEMFTVYNTY 67

Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTY------ASKQGCRPYEIPCERYMNGSHSSCQ 214
                     HG             GG++          G  P E        G   +  
Sbjct: 68  LDRADAAVRTHGD-------VSFSQGGSFYDALYGMETFGLVPEEE----MRPGMMYADT 116

Query: 215 DNEPNTPE---CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
            +  N  E                   L        L       + +I+   P E     
Sbjct: 117 LS--NHTELSALTDAMVAAIAKGKLRKLQSDENNAMLWKKAVAAVHQIYLGVPPE---KF 171

Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
                 Y    +    G    ++ + +  +   P         ++ L       NW    
Sbjct: 172 TYKGKEYTPKSFFESTGLKASDY-VSLTSYTHHPFYT------QFPL---EIQDNW---- 217

Query: 332 LFRIGCRPYEIPCERYMNGSRSSCQANEP------NTPECIRKCQPGYDVSYEDDLNFGR 385
                   Y +P + +M    ++             +     +         E       
Sbjct: 218 ---RHGMSYNLPLDEFMEVFDNAINTGYTIAWGSDVSESGFTRDGVAVMPDDEKVQELSG 274

Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
              +     +   +++      +   T        +   Y +       +H ++I G  +
Sbjct: 275 SDMAHWLKLKPEEKKLNTKPQPQKWCTQ-----AERQLAYDNYETTD--DHGMQIYGIAK 327

Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 477
           +  G       +Y++V NS+ TN   NG++  
Sbjct: 328 DQEG------NEYYMVKNSWGTNSKYNGIWYA 353


>3r8s_0 50S ribosomal protein L32; protein biosynthesis, RNA, tRNA,
           transfer RNA, 23S ribosomal subunit, ribosome recycling
           factor, RRF, ribosome; 3.00A {Escherichia coli} PDB:
           1p85_Z 1p86_Z 2awb_0 2aw4_0 2i2v_0 2j28_0 2i2t_0*
           2qao_0* 2qba_0* 2qbc_0* 2qbe_0 2qbg_0 2qbi_0* 2qbk_0*
           2qov_0 2qox_0 2qoz_0* 2qp1_0* 2rdo_0 2vhm_0 ...
          Length = 56

 Score = 27.2 bits (61), Expect = 2.0
 Identities = 8/23 (34%), Positives = 11/23 (47%), Gaps = 2/23 (8%)

Query: 143 RGKR--HVRLSSDDLVSCCKDCG 163
           RG R  H  L++   +S  K  G
Sbjct: 12  RGMRRSHDALTAVTSLSVDKTSG 34


>1lr7_A Follistatin, FS1; heparin-binding, cystine-rich, sucrose
           octasulphate, hormone/growth factor complex; HET: SO4;
           1.50A {Rattus norvegicus} SCOP: g.3.11.3 g.68.1.1 PDB:
           1lr8_A* 1lr9_A
          Length = 74

 Score = 27.2 bits (60), Expect = 3.1
 Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 2/29 (6%)

Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
            CE    G    C+ N+ N P C+  C P
Sbjct: 3   TCENVDCGPGKKCRMNKKNKPRCV--CAP 29



 Score = 26.0 bits (57), Expect = 7.2
 Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 2/29 (6%)

Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQP 371
            CE    G    C+ N+ N P C+  C P
Sbjct: 3   TCENVDCGPGKKCRMNKKNKPRCV--CAP 29


>2rjq_A Adamts-5; metalloprotease domain, aggrecanase, cleavage on PAIR of
           BAS residues, extracellular matrix, glycoprotein,
           hydrolase, ME binding; HET: NAG BAT; 2.60A {Homo
           sapiens}
          Length = 378

 Score = 28.9 bits (64), Expect = 4.8
 Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 5/84 (5%)

Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG-CRPYEIPCERYMNGSHSSCQDNEPNTP 221
            +       G      + + I++    +     C    I    +++  H +C  + P   
Sbjct: 161 DSKFCEETFGSTEDKRLMSSILTSIDASKPWSKCTSATI--TEFLDDGHGNCLLDLPRKQ 218

Query: 222 ECIRKCQPG--YDVSYEDDLNFGR 243
               +  PG  YD + + +L FG 
Sbjct: 219 ILGPEELPGQTYDATQQCNLTFGP 242


>1igr_A Insulin-like growth factor receptor 1; hormone receptor, insulin
           receptor family; HET: NAG FUC BMA MAN; 2.60A {Homo
           sapiens} SCOP: c.10.2.5 c.10.2.5 g.3.9.1
          Length = 478

 Score = 28.6 bits (63), Expect = 5.2
 Identities = 15/92 (16%), Positives = 29/92 (31%), Gaps = 9/92 (9%)

Query: 158 CCKDCGNGCQGGFHGKA----WKYWVTTGIVS---GGTYASKQGCRPYEIPCERYMNGSH 210
           C  +C   C    +  A      Y+     V      TY  +         C   +  + 
Sbjct: 201 CHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFC-ANILSAE 259

Query: 211 SSCQDNE-PNTPECIRKCQPGYDVSYEDDLNF 241
           SS  +    +  EC+++C  G+  +    +  
Sbjct: 260 SSDSEGFVIHDGECMQECPSGFIRNGSQSMYC 291


>3hh2_C Follistatin; protein-protein complex, TB domain, cystine knot
           motif, TGF- fold, disulfide linked dimer, CLE PAIR of
           basic residues, cytokine; HET: CIT; 2.15A {Homo sapiens}
           PDB: 2b0u_C* 2p6a_D
          Length = 288

 Score = 28.4 bits (62), Expect = 6.1
 Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)

Query: 194 GCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
            C P +  CE    G    C+ N+ N P C+  C P         
Sbjct: 58  NCIPCKETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSNITWKG 100



 Score = 28.1 bits (61), Expect = 7.1
 Identities = 14/62 (22%), Positives = 20/62 (32%), Gaps = 2/62 (3%)

Query: 319 VANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
           V ++    W         C P +  CE    G    C+ N+ N P C+  C P       
Sbjct: 41  VNDNTLFKWMIFNGGAPNCIPCKETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSNITW 98

Query: 379 DD 380
             
Sbjct: 99  KG 100


>1n1i_A Merozoite surface protein-1; MSP1, malaria, surface antigen,
           glycoprotein, EGF domain, cell adhesion; HET: HIS; 2.40A
           {Plasmodium knowlesi strain H} SCOP: g.3.11.4 g.3.11.4
          Length = 105

 Score = 26.6 bits (58), Expect = 8.2
 Identities = 6/30 (20%), Positives = 11/30 (36%), Gaps = 2/30 (6%)

Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
           C       +++C      T E   +C  G+
Sbjct: 13  CIDTNVPENAACYRYLDGTEEW--RCLLGF 40


>1ob1_C Major merozoite surface protein; immune system,
           immunoglobulin/complex, immunoglobulin, antib fragment,
           MSP1-19, EGF-like domain; 2.90A {Plasmodium falciparum}
           SCOP: g.3.11.4 g.3.11.4 PDB: 1cej_A 2flg_A
          Length = 99

 Score = 26.3 bits (57), Expect = 9.0
 Identities = 8/30 (26%), Positives = 11/30 (36%), Gaps = 2/30 (6%)

Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
           C +     +S C  +     EC  KC   Y
Sbjct: 7   CVKKQCPQNSGCFRHLDEREEC--KCLLNY 34


>3nvx_A Protein A39; beta-propeller, viral protein; HET: NAG; 2.00A
           {Vaccinia virus} PDB: 3nvn_A*
          Length = 383

 Score = 27.8 bits (61), Expect = 9.7
 Identities = 13/142 (9%), Positives = 27/142 (19%), Gaps = 8/142 (5%)

Query: 134 MSDRVCIASRGKRHV-RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
           + D +     G   V   S++ L        N                   +  GT    
Sbjct: 19  LDDVLYTGVNG--AVYTFSNNKLNKTGLTNNNYI----TTSIKVEDADKDTLVCGTNNGN 72

Query: 193 QGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
             C   +   +    G                       D++   +       +  P   
Sbjct: 73  PKCWKIDGSDDPKHRG-RGYAPYQNSKVTIISYNECVLSDINISKEGIKRWRRFDGPCGY 131

Query: 253 ETIMREIFRHGPVEGSMTIYAD 274
           +    +            +  D
Sbjct: 132 DLYTADNVIPKDGLRGAFVDKD 153


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.318    0.137    0.435 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 8,240,073
Number of extensions: 511290
Number of successful extensions: 1330
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1201
Number of HSP's successfully gapped: 129
Length of query: 524
Length of database: 6,701,793
Length adjustment: 98
Effective length of query: 426
Effective length of database: 3,965,535
Effective search space: 1689317910
Effective search space used: 1689317910
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)