RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy11694
         (655 letters)



>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larval midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  260 bits (666), Expect = 1e-81
 Identities = 89/317 (28%), Positives = 145/317 (45%), Gaps = 43/317 (13%)

Query: 185 HAIQGNNLTELSVQH--------HDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTA 235
           H ++G+ L    V          + + Y + ++   R + F   +E  E++  +   G  
Sbjct: 6   HHLEGSALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLV 65

Query: 236 VF--GVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 293
            +  GVN F D++  +++  T              L  P   ++    ++  +   L   
Sbjct: 66  SYTLGVNLFTDMTPEEMKAYTH------------GLIMPADLHKNGIPIKTREDLGLNAS 113

Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LSVQQLVDC 351
              P +FDWR +G++S VK QG C   WAFS+ G +E+   I   +  +  +S QQLVDC
Sbjct: 114 VRYPASFDWRDQGMVSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDC 173

Query: 352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRI 411
             +  GC+GG M+DA  Y+  NGG+ S+ AYPY+ ++    C   +      ++  Y  +
Sbjct: 174 VPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEMADGN--CHY-DPNQVAARLSGYVYL 230

Query: 412 PYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID----LNQRL--------YGTS-- 456
              +E  +   VAT+GP++V  +A+  F  YSGGV         +         YG    
Sbjct: 231 SGPDENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFTHAVLIVGYGNENG 290

Query: 457 IPYWIVKNSWGSDWGEK 473
             YW+VKNSWG  WG  
Sbjct: 291 QDYWLVKNSWGDGWGLD 307



 Score =  177 bits (451), Expect = 2e-50
 Identities = 56/148 (37%), Positives = 80/148 (54%), Gaps = 11/148 (7%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           LVDC  +  GC+GG M+DA  Y+  NGG+ S+ AYPY+ ++    C   +      ++  
Sbjct: 170 LVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEMADGN--CHY-DPNQVAARLSG 226

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVG 624
           Y  +   +E  +   VAT+GP++V  +A+  F  YSGGV       C      HA++IVG
Sbjct: 227 YVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGGV--YYNPTCETNKFTHAVLIVG 284

Query: 625 YGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           YG E  +D     YW+VKNSWG  WG  
Sbjct: 285 YGNENGQD-----YWLVKNSWGDGWGLD 307



 Score =  114 bits (288), Expect = 4e-28
 Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 17/161 (10%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
            ++ NF   + + Y + ++   R + F   +E  E++  +   G   +   VN F D++ 
Sbjct: 20  EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79

Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
            +++  T              L  P   ++    ++  +   L      P +FDWR +G+
Sbjct: 80  EEMKAYTH------------GLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGM 127

Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE--LSVQ 198
           +S VK QG C   WAFS+ G +E+   I      +  +S Q
Sbjct: 128 VSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQ 168


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house dust mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  244 bits (625), Expect = 8e-76
 Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 47/295 (15%)

Query: 198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGL 256
           +  +K Y++ ED     +NF+ +V+  +             +N   DLS  + +      
Sbjct: 13  KAFNKSYATFEDEEAARKNFLESVKYVQSNGG--------AINHLSDLSLDEFKNRFLMS 64

Query: 257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
                    Q  L A              + N+     + P   D R    ++ ++ QG 
Sbjct: 65  AEAFEHLKTQFDLNA--------------ETNACSINGNAPAEIDLRQMRTVTPIRMQGG 110

Query: 317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
           C   WAFS V   E+ +    +   +L+ Q+LVDC  S  GC+G  +   ++YI  N GV
Sbjct: 111 CGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHN-GV 168

Query: 377 VSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNA 435
           V +  Y Y A E    C     +     +  Y +I      ++++ +A T   ++V +  
Sbjct: 169 VQESYYRYVAREQS--CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGI 224

Query: 436 NGLF---YYSGGVIDL----NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
             L    +Y G  I       Q          Y  +  + YWIV+NSW ++WG+ 
Sbjct: 225 KDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDN 279



 Score =  158 bits (402), Expect = 8e-44
 Identities = 48/166 (28%), Positives = 74/166 (44%), Gaps = 21/166 (12%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
                     LA ++LVDC  S  GC+G  +   ++YI  NG VV +  Y Y A E    
Sbjct: 131 RDQSLD----LAEQELVDCA-SQHGCHGDTIPRGIEYIQHNG-VVQESYYRYVAREQS-- 182

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL 606
           C     +     +  Y +I      ++++ +A T   ++V +    L    +Y G  I  
Sbjct: 183 CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTII- 239

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
            QR    +   HA+ IVGY   +  D     YWIV+NSW ++WG+ 
Sbjct: 240 -QRDNGYQPNYHAVNIVGYSNAQGVD-----YWIVRNSWDTNWGDN 279



 Score =  118 bits (298), Expect = 1e-29
 Identities = 31/160 (19%), Positives = 57/160 (35%), Gaps = 23/160 (14%)

Query: 40  SPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSD 99
           S +  F  + +  +K Y++ ED     +NF+ +V+  +             +N   DLS 
Sbjct: 3   SSIKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGGA--------INHLSDLSL 54

Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
            + +               Q  L A              + N+     + P   D R   
Sbjct: 55  DEFKNRFLMSAEAFEHLKTQFDLNA--------------ETNACSINGNAPAEIDLRQMR 100

Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            ++ ++ QG C   WAFS V   E+ +    +   +L+ Q
Sbjct: 101 TVTPIRMQGGCGSAWAFSGVAATESAYLAYRDQSLDLAEQ 140


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  242 bits (620), Expect = 4e-75
 Identities = 91/295 (30%), Positives = 136/295 (46%), Gaps = 41/295 (13%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQ-LT 254
            H K Y++  D + R   +  N++    +  E   G   +   +N   D++  ++ Q +T
Sbjct: 17  THRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMT 76

Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
           GL +  +      +L  P                        P++ D+R +G ++ VK Q
Sbjct: 77  GLKVPLSHSRSNDTLYIP------------------EWEGRAPDSVDYRKKGYVTPVKNQ 118

Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNG 374
           G+C  CWAFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N 
Sbjct: 119 GQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNR 178

Query: 375 GVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMN 434
           G+ S+ AYPY   E    C+     G   K + Y  IP G E+ +K+ VA  GP+SV ++
Sbjct: 179 GIDSEDAYPYVGQEES--CMY-NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 235

Query: 435 ANG--LFYYSGGVID----LNQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
           A+     +YS GV       +  L        YG      +WI+KNSWG +WG K
Sbjct: 236 ASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 290



 Score =  165 bits (420), Expect = 3e-46
 Identities = 64/164 (39%), Positives = 88/164 (53%), Gaps = 16/164 (9%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
           TG L +    L+ + LVDC   N GC GG M +A QY+  N G+ S+ AYPY   E    
Sbjct: 141 TGKLLN----LSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES-- 194

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQ 608
           C+     G   K + Y  IP G E+ +K+ VA  GP+SV ++A+     +YS GV     
Sbjct: 195 CMY-NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY--D 251

Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             CN    NHA++ VGYG ++        +WI+KNSWG +WG K
Sbjct: 252 ESCNSDNLNHAVLAVGYGIQKGNK-----HWIIKNSWGENWGNK 290



 Score =  111 bits (279), Expect = 4e-27
 Identities = 38/160 (23%), Positives = 68/160 (42%), Gaps = 22/160 (13%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
           T +  + + H K Y++  D + R   +  N++    +      G   +E  +N   D++ 
Sbjct: 9   THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68

Query: 100 SDLQQ-LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
            ++ Q +TGL +  +      +L  P                        P++ D+R +G
Sbjct: 69  EEVVQKMTGLKVPLSHSRSNDTLYIP------------------EWEGRAPDSVDYRKKG 110

Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            ++ VK QG+C  CWAFS+VG +E     +   L  LS Q
Sbjct: 111 YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 150


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  240 bits (615), Expect = 3e-74
 Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 38/290 (13%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESD-LQQLTGLN 257
           +H+K Y +V++ L R E F  N+   ++   +++   + G+N+F DLS  +  ++  G  
Sbjct: 28  NHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWL-GLNEFADLSNDEFNEKYVGSL 86

Query: 258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
           +D+T+E                               +LPE  DWR +G ++ V+ QG C
Sbjct: 87  IDATIEQSYDEEFIN------------------EDIVNLPENVDWRKKGAVTPVRHQGSC 128

Query: 318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
             CWAFSAV  VE ++ I+   L ELS Q+LVDC+  + GC GG    AL+Y+  N G+ 
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIH 187

Query: 378 SDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
               YPYKA +    C   +  G  VK     R+    E  +   +A   P+SV + + G
Sbjct: 188 LRSKYPYKAKQGT--CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKG 244

Query: 438 L-F-YYSGGVID------LNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
             F  Y GG+ +      ++  +    YG S    Y ++KNSWG+ WGEK
Sbjct: 245 RPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEK 294



 Score =  153 bits (389), Expect = 7e-42
 Identities = 54/158 (34%), Positives = 78/158 (49%), Gaps = 15/158 (9%)

Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEE 556
           KL  L+ ++LVDC+  + GC GG    AL+Y+  N G+     YPYKA +    C   + 
Sbjct: 150 KLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT--CRAKQV 206

Query: 557 EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPK 614
            G  VK     R+    E  +   +A   P+SV + + G  F  Y GG+ +     C  K
Sbjct: 207 GGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP---CGTK 262

Query: 615 AQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + A+  VGYG+   K      Y ++KNSWG+ WGEK
Sbjct: 263 V-DGAVTAVGYGKSGGKG-----YILIKNSWGTAWGEK 294



 Score =  113 bits (286), Expect = 5e-28
 Identities = 46/157 (29%), Positives = 78/157 (49%), Gaps = 20/157 (12%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD- 101
             F ++M +H+K Y +V++ L R E F  N+   ++  ++++   +  +N+F DLS+ + 
Sbjct: 20  QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWL-GLNEFADLSNDEF 78

Query: 102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
            ++  G  +D+T+E                               +LPE  DWR +G ++
Sbjct: 79  NEKYVGSLIDATIEQSYDEEFIN------------------EDIVNLPENVDWRKKGAVT 120

Query: 162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            V+ QG C  CWAFSAV  VE ++ I+   L ELS Q
Sbjct: 121 PVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQ 157


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  233 bits (596), Expect = 6e-73
 Identities = 68/193 (35%), Positives = 104/193 (53%), Gaps = 20/193 (10%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           P  +DWR++G ++KVK+QG C  CWAFS  G VE    +   +L  LS Q+L+DCD  + 
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61

Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
            C GG   +A   I + GG+ ++  Y Y+       C    E+  KV +++   +    E
Sbjct: 62  ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS--CQFSAEKA-KVYIQDSVELS-QNE 117

Query: 417 EEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL--------------YGTS--IPYW 460
           +++  W+A RGP+SV +NA G+ +Y  G+    + L              YG    +P+W
Sbjct: 118 QKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFW 177

Query: 461 IVKNSWGSDWGEK 473
            +KNSWG+DWGEK
Sbjct: 178 AIKNSWGTDWGEK 190



 Score =  182 bits (464), Expect = 9e-54
 Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 13/162 (8%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
            G L S    L+ ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+       
Sbjct: 42  QGTLLS----LSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS-- 95

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL 610
           C    E+  KV +++   +    E+++  W+A RGP+SV +NA G+ +Y  G+    + L
Sbjct: 96  CQFSAEKA-KVYIQDSVELS-QNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPL 153

Query: 611 CNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           C+P   +HA+++VGYG+          +W +KNSWG+DWGEK
Sbjct: 154 CSPWLIDHAVLLVGYGQRSDVP-----FWAIKNSWGTDWGEK 190



 Score = 82.5 bits (205), Expect = 4e-18
 Identities = 23/50 (46%), Positives = 30/50 (60%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P  +DWR++G ++KVK+QG C  CWAFS  G VE    +    L  LS Q
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 51


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  235 bits (603), Expect = 1e-72
 Identities = 97/297 (32%), Positives = 141/297 (47%), Gaps = 45/297 (15%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQL-T 254
            + K Y    +   R   +  N++    +  E S        G+N   D++  ++  L +
Sbjct: 18  TYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMS 77

Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
            L + S     Q      + SN                   LP++ DWR +G +++VK Q
Sbjct: 78  SLRVPS-----QWQRNITYKSN---------------PNRILPDSVDWREKGCVTEVKYQ 117

Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS---NGGCNGGRMDDALQYII 371
           G C   WAFSAVG +EA   ++   L  LS Q LVDC      N GCNGG M  A QYII
Sbjct: 118 GSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYII 177

Query: 372 DNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSV 431
           DN G+ SD +YPYKA + +  C   + +       +Y+ +PYG E+ +K+ VA +GP+SV
Sbjct: 178 DNKGIDSDASYPYKAMDQK--CQY-DSKYRAATCSKYTELPYGREDVLKEAVANKGPVSV 234

Query: 432 GMNANGL-F-YYSGGVIDL---NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
           G++A    F  Y  GV       Q +        YG      YW+VKNSWG ++GE+
Sbjct: 235 GVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEE 291



 Score =  155 bits (393), Expect = 2e-42
 Identities = 68/167 (40%), Positives = 96/167 (57%), Gaps = 20/167 (11%)

Query: 491 TGVLPSKLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
           TG L S    L+ + LVDC      N GCNGG M  A QYIIDN G+ SD +YPYKA + 
Sbjct: 140 TGKLVS----LSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ 195

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID 605
           +  C   + +       +Y+ +PYG E+ +K+ VA +GP+SVG++A    F  Y  GV  
Sbjct: 196 K--CQY-DSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV-- 250

Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  C     NH +++VGYG+   K+     YW+VKNSWG ++GE+
Sbjct: 251 YYEPSCTQN-VNHGVLVVGYGDLNGKE-----YWLVKNSWGHNFGEE 291



 Score =  110 bits (276), Expect = 1e-26
 Identities = 38/160 (23%), Positives = 66/160 (41%), Gaps = 24/160 (15%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
             +  + + + K Y    +   R   +  N++    +  E   G   ++  +N   D++ 
Sbjct: 10  HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69

Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
            ++  L + L + S     Q      + SN                   LP++ DWR +G
Sbjct: 70  EEVMSLMSSLRVPS-----QWQRNITYKSN---------------PNRILPDSVDWREKG 109

Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            +++VK QG C   WAFSAVG +EA   ++   L  LS Q
Sbjct: 110 CVTEVKYQGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQ 149


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  232 bits (594), Expect = 3e-72
 Identities = 68/198 (34%), Positives = 103/198 (52%), Gaps = 20/198 (10%)

Query: 295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
           D PE++DW  +GVI+KVK QG+C   WAFSA G +EA HAI   +L  LS Q+L+DC   
Sbjct: 1   DAPESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVDE 60

Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE----SERGCLVGEEEGFKVKVKEYSR 410
           + GC  G    + ++++ +GG+ S+  YPYKA +    +         + + V++     
Sbjct: 61  SEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNES 120

Query: 411 IPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVID-----LNQRL--------YGTS- 456
                E  ++ +V    P+SV ++A    +YSGG+ D         +        YG+  
Sbjct: 121 TESEAESSLQSFVLE-QPISVSIDAKDFHFYSGGIYDGGNCSSPYGINHFVLIVGYGSED 179

Query: 457 -IPYWIVKNSWGSDWGEK 473
            + YWI KNSWG DWG  
Sbjct: 180 GVDYWIAKNSWGEDWGID 197



 Score =  173 bits (440), Expect = 7e-50
 Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 15/166 (9%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE---- 546
           TG L S    L+ ++L+DC   + GC  G    + ++++ +GG+ S+  YPYKA +    
Sbjct: 43  TGNLVS----LSEQELIDCVDESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCK 98

Query: 547 SERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDL 606
           +         + + V++          E  ++ +V    P+SV ++A    +YSGG+ D 
Sbjct: 99  ANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLE-QPISVSIDAKDFHFYSGGIYD- 156

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                +P   NH ++IVGYG E+  D     YWI KNSWG DWG  
Sbjct: 157 GGNCSSPYGINHFVLIVGYGSEDGVD-----YWIAKNSWGEDWGID 197



 Score = 85.0 bits (211), Expect = 1e-18
 Identities = 30/52 (57%), Positives = 36/52 (69%)

Query: 147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           D PE++DW  +GVI+KVK QG+C   WAFSA G +EA HAI   NL  LS Q
Sbjct: 1   DAPESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQ 52


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  234 bits (600), Expect = 3e-72
 Identities = 81/295 (27%), Positives = 130/295 (44%), Gaps = 43/295 (14%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQLTG 255
            ++K Y+  +D  RR   +  NV+  +++        V    G+N+F D++  + +    
Sbjct: 11  MYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYL 69

Query: 256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
                             +     +++ +       +   +P+  DWR  G +++VK+QG
Sbjct: 70  ------------------TEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQG 111

Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
            C   WAFS  G +E  +     +    S QQLVDC     N GC GG M++A QY +  
Sbjct: 112 NCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQY-LKQ 170

Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
            G+ ++ +YPY A E +  C   ++ G   KV  +  +  G E E+K  V   GP +V +
Sbjct: 171 FGLETESSYPYTAVEGQ--CRYNKQLG-VAKVTGFYTVHSGSEVELKNLVGAEGPAAVAV 227

Query: 434 NANGLF-YYSGGVID----LNQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
           +    F  Y  G+         R+        YGT     YWIVKNSWG  WGE+
Sbjct: 228 DVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGER 282



 Score =  158 bits (401), Expect = 1e-43
 Identities = 59/166 (35%), Positives = 83/166 (50%), Gaps = 20/166 (12%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
                S  S    E+ LVDC     N GC GG M++A QY+    G+ ++ +YPY A E 
Sbjct: 133 ERTSIS-FS----EQQLVDCSRPWGNNGCGGGLMENAYQYL-KQFGLETESSYPYTAVEG 186

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
           +  C   ++ G   KV  +  +  G E E+K  V   GP +V ++    F  Y  G+   
Sbjct: 187 Q--CRYNKQLG-VAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQ- 242

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + C+P   NHA++ VGYG +   D     YWIVKNSWG  WGE+
Sbjct: 243 -SQTCSPLRVNHAVLAVGYGTQGGTD-----YWIVKNSWGLSWGER 282



 Score =  108 bits (272), Expect = 3e-26
 Identities = 31/159 (19%), Positives = 63/159 (39%), Gaps = 22/159 (13%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
             +  + R ++K Y+  +D  RR   +  NV+  +++    D G   +   +N+F D++ 
Sbjct: 3   DLWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61

Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
            + +                      +     +++ +       +   +P+  DWR  G 
Sbjct: 62  EEFKAKYL------------------TEMSRASDILSHGVPYEANNRAVPDKIDWRESGY 103

Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           +++VK+QG C   WAFS  G +E  +          S Q
Sbjct: 104 VTEVKDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQ 142


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  230 bits (589), Expect = 2e-71
 Identities = 71/206 (34%), Positives = 103/206 (50%), Gaps = 28/206 (13%)

Query: 291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
                   A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+  +L   S Q+LVD
Sbjct: 15  ADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVD 74

Query: 351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSR 410
           C + N GC GG + +A   +ID GG+ S   YPY ++  E  C +      +  +K Y  
Sbjct: 75  CSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-CNLKRCNE-RYTIKSYVS 132

Query: 411 IPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLN--QRL--------YGTS--- 456
           IP   +++ K+ +   GP+S+ + A+  F +Y GG  D               YG     
Sbjct: 133 IP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIY 189

Query: 457 ---------IPYWIVKNSWGSDWGEK 473
                      Y+I+KNSWGSDWGE 
Sbjct: 190 NEDTGRMEKFYYYIIKNSWGSDWGEG 215



 Score =  169 bits (431), Expect = 1e-48
 Identities = 56/168 (33%), Positives = 87/168 (51%), Gaps = 19/168 (11%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
              L       + ++LVDC + N GC GG + +A   +ID GG+ S   YPY ++  E  
Sbjct: 61  KKALFL----FSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET- 115

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQR 609
           C +      +  +K Y  IP   +++ K+ +   GP+S+ + A+  F +Y GG  D    
Sbjct: 116 CNLKRCNE-RYTIKSYVSIP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE-- 169

Query: 610 LCNPKAQNHALIIVGYGEEEKKDGTS-----IPYWIVKNSWGSDWGEK 652
            C     NHA+I+VGYG ++  +  +       Y+I+KNSWGSDWGE 
Sbjct: 170 -CGAA-PNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEG 215



 Score = 86.9 bits (216), Expect = 2e-19
 Identities = 23/56 (41%), Positives = 31/56 (55%)

Query: 143 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                   A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+   L   S Q
Sbjct: 15  ADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 70


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larval midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  233 bits (596), Expect = 2e-71
 Identities = 91/295 (30%), Positives = 133/295 (45%), Gaps = 42/295 (14%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQLTG 255
            H K YSS  + +RR   F  NV K  ++ ++ + G   +   +N+F D+S+ +      
Sbjct: 33  THKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVN 92

Query: 256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
                                Q        +   +     L  + DWR+  V S+VK+QG
Sbjct: 93  -----------------RGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNAV-SEVKDQG 134

Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
           +C   W+FS  G VE   A+Q   LT LS Q L+DC  S  N GC+GG MD A  YI  +
Sbjct: 135 QCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIH-D 193

Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
            G++S+ AYPY+A      C   +       +  Y  +P G+E  +   V   GP++V +
Sbjct: 194 YGIMSESAYPYEAQGDY--CRF-DSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAI 250

Query: 434 NANGLF-YYSGGVIDL----NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
           +A     +YSGG+          L        YG+     YWI+KNSWGS WGE 
Sbjct: 251 DATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGES 305



 Score =  157 bits (400), Expect = 2e-43
 Identities = 62/166 (37%), Positives = 88/166 (53%), Gaps = 20/166 (12%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
            G L S LS    E+ L+DC  S  N GC+GG MD A  YI  + G++S+ AYPY+A   
Sbjct: 156 RGRLTS-LS----EQNLIDCSSSYGNAGCDGGWMDSAFSYIH-DYGIMSESAYPYEAQGD 209

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
              C   +       +  Y  +P G+E  +   V   GP++V ++A     +YSGG+   
Sbjct: 210 Y--CRF-DSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGL--F 264

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + CN    NH +++VGYG +  +D     YWI+KNSWGS WGE 
Sbjct: 265 YDQTCNQSDLNHGVLVVGYGSDNGQD-----YWILKNSWGSGWGES 305



 Score =  109 bits (274), Expect = 2e-26
 Identities = 42/159 (26%), Positives = 64/159 (40%), Gaps = 21/159 (13%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
            ++  F   H K YSS  + +RR   F  NV K  ++  + + G   +   +N+F D+S 
Sbjct: 25  EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84

Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
            +                           Q        +   +     L  + DWR+  V
Sbjct: 85  EEFLAYVN-----------------RGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNAV 127

Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            S+VK+QG+C   W+FS  G VE   A+Q   LT LS Q
Sbjct: 128 -SEVKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQ 165


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  228 bits (583), Expect = 1e-70
 Identities = 65/204 (31%), Positives = 103/204 (50%), Gaps = 28/204 (13%)

Query: 293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
            +    A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q+LVDC 
Sbjct: 15  ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS 74

Query: 353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP 412
             N GCNGG +++A + +I+ GG+  D  YPY +          +    K  +K Y  +P
Sbjct: 75  FKNYGCNGGLINNAFEDMIELGGICPDGDYPYVS--DAPNLCNIDRCTEKYGIKNYLSVP 132

Query: 413 YGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLN--QRL--------YGTS----- 456
              + ++K+ +   GP+S+ +  +  F +Y  G+ D     +L        +G       
Sbjct: 133 ---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNP 189

Query: 457 -------IPYWIVKNSWGSDWGEK 473
                    Y+I+KNSWG  WGE+
Sbjct: 190 LTKKGEKHYYYIIKNSWGQQWGER 213



 Score =  167 bits (426), Expect = 5e-48
 Identities = 50/153 (32%), Positives = 81/153 (52%), Gaps = 15/153 (9%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           LVDC   N GCNGG +++A + +I+ GG+  D  YPY +          +    K  +K 
Sbjct: 70  LVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVS--DAPNLCNIDRCTEKYGIKN 127

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVG 624
           Y  +P   + ++K+ +   GP+S+ +  +  F +Y  G+ D     C  +  NHA+++VG
Sbjct: 128 YLSVP---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---CGDQL-NHAVMLVG 180

Query: 625 YGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
           +G +E      K G    Y+I+KNSWG  WGE+
Sbjct: 181 FGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 213



 Score = 85.7 bits (213), Expect = 5e-19
 Identities = 23/54 (42%), Positives = 33/54 (61%)

Query: 145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            +    A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q
Sbjct: 15  ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 68


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  226 bits (578), Expect = 3e-70
 Identities = 71/191 (37%), Positives = 112/191 (58%), Gaps = 15/191 (7%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           P A DWRA G ++ VK+QG+C  CWAFSA+G VE    + G+ LT LS Q LV CD ++ 
Sbjct: 2   PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 61

Query: 357 GCNGGRMDDALQYII--DNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
           GC+GG M++A ++I+  +NG V ++ +YPY + E                +  +  +P  
Sbjct: 62  GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 121

Query: 415 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVID--LNQRL--------YGTS--IPYWIV 462
            E ++  W+A  GP++V ++A+    Y+GGV+   ++++L        Y  S  +PYWI+
Sbjct: 122 -EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTSCVSEQLDHGVLLVGYNDSAAVPYWII 180

Query: 463 KNSWGSDWGEK 473
           KNSW + WGE+
Sbjct: 181 KNSWTTQWGEE 191



 Score =  169 bits (431), Expect = 6e-49
 Identities = 47/164 (28%), Positives = 85/164 (51%), Gaps = 16/164 (9%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYII--DNGGVVSDQAYPYKASESE 548
              L +    L+ + LV CD ++ GC+GG M++A ++I+  +NG V ++ +YPY + E  
Sbjct: 42  GHPLTN----LSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGI 97

Query: 549 RGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQ 608
                         +  +  +P   E ++  W+A  GP++V ++A+    Y+GGV+    
Sbjct: 98  SPPCTTSGHTVGATITGHVELPQD-EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT--- 153

Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             C  +  +H +++VGY +          YWI+KNSW + WGE+
Sbjct: 154 -SCVSEQLDHGVLLVGYNDSAAVP-----YWIIKNSWTTQWGEE 191



 Score = 80.6 bits (200), Expect = 2e-17
 Identities = 27/50 (54%), Positives = 34/50 (68%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P A DWRA G ++ VK+QG+C  CWAFSA+G VE    + G+ LT LS Q
Sbjct: 2   PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQ 51


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  229 bits (587), Expect = 3e-70
 Identities = 91/301 (30%), Positives = 138/301 (45%), Gaps = 52/301 (17%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQL-T 254
            H+++Y   E+  RR   +  N++  E +  E           +N F D++  + +Q+  
Sbjct: 18  MHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMN 76

Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
           G                    N+   + + FQ        + P + DWR +G ++ VK Q
Sbjct: 77  GFQ------------------NRKPRKGKVFQE---PLFYEAPRSVDWREKGYVTPVKNQ 115

Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIID 372
           G+C  CWAFSA G +E     +   L  LS Q LVDC    G  GCNGG MD A QY+ D
Sbjct: 116 GQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD 175

Query: 373 NGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVG 432
           NGG+ S+++YPY+A+E    C    +         +  IP  +E+ + K VAT GP+SV 
Sbjct: 176 NGGLDSEESYPYEATEES--CKYNPKYS-VANDAGFVDIP-KQEKALMKAVATVGPISVA 231

Query: 433 MNANGL-F-YYSGGVID----LNQRL--------YGTSI------PYWIVKNSWGSDWGE 472
           ++A    F +Y  G+       ++ +        YG          YW+VKNSWG +WG 
Sbjct: 232 IDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGM 291

Query: 473 K 473
            
Sbjct: 292 G 292



 Score =  156 bits (396), Expect = 6e-43
 Identities = 65/167 (38%), Positives = 94/167 (56%), Gaps = 17/167 (10%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
           TG L S LS    E+ LVDC    G  GCNGG MD A QY+ DNGG+ S+++YPY+A+E 
Sbjct: 138 TGRLIS-LS----EQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE 192

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID 605
              C    +         +  IP  +E+ + K VAT GP+SV ++A    F +Y  G+  
Sbjct: 193 S--CKYNPKYS-VANDAGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF 248

Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  C+ +  +H +++VGYG E  +   +  YW+VKNSWG +WG  
Sbjct: 249 --EPDCSSEDMDHGVLVVGYGFESTESDNN-KYWLVKNSWGEEWGMG 292



 Score =  107 bits (270), Expect = 6e-26
 Identities = 40/160 (25%), Positives = 69/160 (43%), Gaps = 26/160 (16%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
            ++  +   H+++Y   E+  RR   +  N++  E + +E   G   F   +N F D++ 
Sbjct: 10  AQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68

Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
            + +Q+  G                    N+   + + FQ        + P + DWR +G
Sbjct: 69  EEFRQVMNGFQ------------------NRKPRKGKVFQE---PLFYEAPRSVDWREKG 107

Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct: 108 YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 147


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  225 bits (576), Expect = 5e-70
 Identities = 66/190 (34%), Positives = 99/190 (52%), Gaps = 18/190 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
           +P + DWR +G ++ V+ QG C  CW FS+V  VE ++ I    L  LS Q+L+DC+  +
Sbjct: 1   IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCERRS 60

Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
            GC GG    ALQY+  N G+   Q YPY+  + +  C   + +G KVK     R+P   
Sbjct: 61  YGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQ--CRASQAKGPKVKTDGVGRVPRNN 117

Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTSIPYWIVK 463
           E+ + + +A   P+S+ + A G  F  Y GG+        +        YG    Y ++K
Sbjct: 118 EQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN--DYILIK 174

Query: 464 NSWGSDWGEK 473
           NSWG+ WGE 
Sbjct: 175 NSWGTGWGEG 184



 Score =  156 bits (397), Expect = 3e-44
 Identities = 50/149 (33%), Positives = 75/149 (50%), Gaps = 19/149 (12%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           L+DC+  + GC GG    ALQY+  N G+   Q YPY+  + +  C   + +G KVK   
Sbjct: 53  LLDCERRSYGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQ--CRASQAKGPKVKTDG 109

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
             R+P   E+ + + +A   P+S+ + A G  F  Y GG+       C     +HA+  V
Sbjct: 110 VGRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGP---CGTSI-DHAVAAV 164

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG +         Y ++KNSWG+ WGE 
Sbjct: 165 GYGND---------YILIKNSWGTGWGEG 184



 Score = 84.0 bits (209), Expect = 1e-18
 Identities = 21/51 (41%), Positives = 30/51 (58%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           +P + DWR +G ++ V+ QG C  CW FS+V  VE ++ I    L  LS Q
Sbjct: 1   IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQ 51


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  224 bits (574), Expect = 1e-69
 Identities = 72/192 (37%), Positives = 100/192 (52%), Gaps = 18/192 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
           LPE  DWR +G ++ V+ QG C  CWAFSAV  VE ++ I+   L ELS Q+LVDC+  +
Sbjct: 1   LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS 60

Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
            GC GG    AL+Y+  N G+     YPYKA +    C   +  G  VK     R+    
Sbjct: 61  HGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT--CRAKQVGGPIVKTSGVGRVQPNN 117

Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPYWI 461
           E  +   +A   P+SV + + G  F  Y GG+ +     ++        YG S    Y +
Sbjct: 118 EGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYIL 176

Query: 462 VKNSWGSDWGEK 473
           +KNSWG+ WGEK
Sbjct: 177 IKNSWGTAWGEK 188



 Score =  161 bits (411), Expect = 3e-46
 Identities = 56/164 (34%), Positives = 80/164 (48%), Gaps = 19/164 (11%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
           TG L      L+ ++LVDC+  + GC GG    AL+Y+  N G+     YPYKA +    
Sbjct: 42  TGKLVE----LSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT-- 94

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQ 608
           C   +  G  VK     R+    E  +   +A   P+SV + + G  F  Y GG+ +   
Sbjct: 95  CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP- 152

Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             C  K  +HA+  VGYG+   K      Y ++KNSWG+ WGEK
Sbjct: 153 --CGTKV-DHAVTAVGYGKSGGKG-----YILIKNSWGTAWGEK 188



 Score = 83.3 bits (207), Expect = 2e-18
 Identities = 26/51 (50%), Positives = 33/51 (64%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           LPE  DWR +G ++ V+ QG C  CWAFSAV  VE ++ I+   L ELS Q
Sbjct: 1   LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQ 51


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  224 bits (573), Expect = 2e-69
 Identities = 64/190 (33%), Positives = 94/190 (49%), Gaps = 18/190 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
           +PE  DWR +G ++ VK QG C  CWAFSAV  +E +  I+  +L + S Q+L+DCD  +
Sbjct: 1   IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRS 60

Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
            GCNGG    ALQ +    G+     YPY+  +    C   E+  +  K     ++    
Sbjct: 61  YGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRY--CRSREKGPYAAKTDGVRQVQPYN 117

Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTSIPYWIVK 463
           E  +   +A   P+SV + A G  F  Y GG+       ++        YG    Y ++K
Sbjct: 118 EGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGP--NYILIK 174

Query: 464 NSWGSDWGEK 473
           NSWG+ WGE 
Sbjct: 175 NSWGTGWGEN 184



 Score =  154 bits (392), Expect = 1e-43
 Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 19/149 (12%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           L+DCD  + GCNGG    ALQ +    G+     YPY+  +    C   E+  +  K   
Sbjct: 53  LLDCDRRSYGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRY--CRSREKGPYAAKTDG 109

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
             ++    E  +   +A   P+SV + A G  F  Y GG+       C  K  +HA+  V
Sbjct: 110 VRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP---CGNKV-DHAVAAV 164

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG           Y ++KNSWG+ WGE 
Sbjct: 165 GYGPN---------YILIKNSWGTGWGEN 184



 Score = 84.0 bits (209), Expect = 1e-18
 Identities = 24/51 (47%), Positives = 32/51 (62%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           +PE  DWR +G ++ VK QG C  CWAFSAV  +E +  I+  NL + S Q
Sbjct: 1   IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQ 51


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  224 bits (573), Expect = 2e-69
 Identities = 78/194 (40%), Positives = 104/194 (53%), Gaps = 18/194 (9%)

Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
           DDLP++ DWR  G +  VK QG C  CWAFS V  VE ++ I    L  LS QQLVDC  
Sbjct: 1   DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT 60

Query: 354 SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
           +N GC GG M+ A Q+I++NGG+ S++ YPY+  +    C         V +  Y  +P 
Sbjct: 61  ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGI--CNSTVNAP-VVSIDSYENVPS 117

Query: 414 GEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS--IPY 459
             E+ ++K VA   P+SV M+A G  F  Y  G+   +             YGT     +
Sbjct: 118 HNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDF 176

Query: 460 WIVKNSWGSDWGEK 473
           WIVKNSWG +WGE 
Sbjct: 177 WIVKNSWGKNWGES 190



 Score =  157 bits (400), Expect = 1e-44
 Identities = 62/149 (41%), Positives = 84/149 (56%), Gaps = 15/149 (10%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           LVDC  +N GC GG M+ A Q+I++NGG+ S++ YPY+  +    C         V +  
Sbjct: 55  LVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGI--CNSTVNAP-VVSIDS 111

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
           Y  +P   E+ ++K VA   P+SV M+A G  F  Y  G+   +   CN  A NHAL +V
Sbjct: 112 YENVPSHNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGS---CNISA-NHALTVV 166

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG E  KD     +WIVKNSWG +WGE 
Sbjct: 167 GYGTENDKD-----FWIVKNSWGKNWGES 190



 Score = 87.6 bits (218), Expect = 7e-20
 Identities = 26/53 (49%), Positives = 32/53 (60%)

Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           DDLP++ DWR  G +  VK QG C  CWAFS V  VE ++ I   +L  LS Q
Sbjct: 1   DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQ 53


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  223 bits (570), Expect = 4e-69
 Identities = 67/189 (35%), Positives = 105/189 (55%), Gaps = 18/189 (9%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           PE+ DWR +G ++ VK Q  C  CWAFS V  +E ++ I    L  LS Q+L+DC+  + 
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRSH 61

Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
           GC+GG    +LQY++DN GV +++ YPY+  +    C   +++G KV +  Y  +P  +E
Sbjct: 62  GCDGGYQTTSLQYVVDN-GVHTEREYPYEKKQGR--CRAKDKKGPKVYITGYKYVPANDE 118

Query: 417 EEMKKWVATRGPLSVGMNANGL-F-YYSGGVID--LNQRL--------YGTSIPYWIVKN 464
             + + +A   P+SV  ++ G  F +Y GG+ +               YG    Y ++KN
Sbjct: 119 ISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGK--TYLLLKN 175

Query: 465 SWGSDWGEK 473
           SWG +WGEK
Sbjct: 176 SWGPNWGEK 184



 Score =  156 bits (398), Expect = 3e-44
 Identities = 50/149 (33%), Positives = 83/149 (55%), Gaps = 19/149 (12%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           L+DC+  + GC+GG    +LQY++DN GV +++ YPY+  +    C   +++G KV +  
Sbjct: 53  LLDCERRSHGCDGGYQTTSLQYVVDN-GVHTEREYPYEKKQGR--CRAKDKKGPKVYITG 109

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
           Y  +P  +E  + + +A   P+SV  ++ G  F +Y GG+ +     C     +HA+  V
Sbjct: 110 YKYVPANDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP---CGTN-TDHAVTAV 164

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG+          Y ++KNSWG +WGEK
Sbjct: 165 GYGKT---------YLLLKNSWGPNWGEK 184



 Score = 82.1 bits (204), Expect = 6e-18
 Identities = 22/50 (44%), Positives = 29/50 (58%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           PE+ DWR +G ++ VK Q  C  CWAFS V  +E ++ I    L  LS Q
Sbjct: 2   PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQ 51


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  220 bits (563), Expect = 5e-68
 Identities = 69/197 (35%), Positives = 92/197 (46%), Gaps = 23/197 (11%)

Query: 297 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
           P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  L+ QQLVDC  + 
Sbjct: 2   PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61

Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
            N GC GG    A +YI  N G++ +  YPYK  +    C    ++     VK+ + I  
Sbjct: 62  NNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDH--CKFQPDKA-IAFVKDVANITM 118

Query: 414 GEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID------LNQRL--------YGTS-- 456
            +EE M + VA   P+S        F  Y  G+           ++        YG    
Sbjct: 119 NDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENG 178

Query: 457 IPYWIVKNSWGSDWGEK 473
           IPYWIVKNSWG  WG  
Sbjct: 179 IPYWIVKNSWGPQWGMN 195



 Score =  170 bits (434), Expect = 2e-49
 Identities = 58/166 (34%), Positives = 77/166 (46%), Gaps = 17/166 (10%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
           TG + S L+    E+ LVDC  +  N GC GG    A +YI  N G++ +  YPYK  + 
Sbjct: 43  TGKMLS-LA----EQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDD 97

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
              C    ++     VK+ + I   +EE M + VA   P+S        F  Y  G+   
Sbjct: 98  H--CKFQPDKA-IAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSS 154

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 P   NHA++ VGYGEE         YWIVKNSWG  WG  
Sbjct: 155 TSCHKTPDKVNHAVLAVGYGEENGIP-----YWIVKNSWGPQWGMN 195



 Score = 75.2 bits (186), Expect = 1e-15
 Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 1/51 (1%)

Query: 149 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  L+ Q
Sbjct: 2   PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 52


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  219 bits (561), Expect = 1e-67
 Identities = 74/193 (38%), Positives = 102/193 (52%), Gaps = 19/193 (9%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q LVDC   N 
Sbjct: 2   PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 61

Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
           GC GG M +A QY+  N G+ S+ AYPY   E    C+         K + Y  IP G E
Sbjct: 62  GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES--CMYNPTGK-AAKCRGYREIPEGNE 118

Query: 417 EEMKKWVATRGPLSVGMNANG--LFYYSGGVID----LNQRL--------YGTS--IPYW 460
           + +K+ VA  GP+SV ++A+     +YS GV       +  L        YG      +W
Sbjct: 119 KALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHW 178

Query: 461 IVKNSWGSDWGEK 473
           I+KNSWG +WG K
Sbjct: 179 IIKNSWGENWGNK 191



 Score =  165 bits (420), Expect = 2e-47
 Identities = 64/165 (38%), Positives = 87/165 (52%), Gaps = 18/165 (10%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
           TG L + LS     + LVDC   N GC GG M +A QY+  N G+ S+ AYPY   E   
Sbjct: 42  TGKLLN-LS----PQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES- 95

Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLN 607
            C+         K + Y  IP G E+ +K+ VA  GP+SV ++A+     +YS GV    
Sbjct: 96  -CMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGV--YY 151

Query: 608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              CN    NHA++ VGYG ++        +WI+KNSWG +WG K
Sbjct: 152 DESCNSDNLNHAVLAVGYGIQKGNK-----HWIIKNSWGENWGNK 191



 Score = 79.8 bits (198), Expect = 3e-17
 Identities = 21/50 (42%), Positives = 31/50 (62%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q
Sbjct: 2   PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 51


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  219 bits (561), Expect = 1e-67
 Identities = 79/194 (40%), Positives = 111/194 (57%), Gaps = 19/194 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
           +P + DWR +G ++ VK+QG+C  CWAFS +  VE ++ I+ N L  LS Q+LVDCD   
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61

Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
           N GCNGG MD A ++I   GG+ ++  YPY+A +    C V +E    V +  +  +P  
Sbjct: 62  NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGT--CDVSKENAPAVSIDGHENVPEN 119

Query: 415 EEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS---IPY 459
           +E  + K VA   P+SV ++A G  F +YS GV   +    L        YGT+     Y
Sbjct: 120 DENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKY 178

Query: 460 WIVKNSWGSDWGEK 473
           W VKNSWG +WGEK
Sbjct: 179 WTVKNSWGPEWGEK 192



 Score =  157 bits (399), Expect = 2e-44
 Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 14/150 (9%)

Query: 506 LVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVK 564
           LVDCD   N GCNGG MD A ++I   GG+ ++  YPY+A +    C V +E    V + 
Sbjct: 54  LVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGT--CDVSKENAPAVSID 111

Query: 565 EYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALII 622
            +  +P  +E  + K VA   P+SV ++A G  F +YS GV   +   C  +  +H + I
Sbjct: 112 GHENVPENDENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS---CGTE-LDHGVAI 166

Query: 623 VGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           VGYG     DGT   YW VKNSWG +WGEK
Sbjct: 167 VGYGTTI--DGT--KYWTVKNSWGPEWGEK 192



 Score = 84.1 bits (209), Expect = 1e-18
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           +P + DWR +G ++ VK+QG+C  CWAFS +  VE ++ I+ N L  LS Q
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQ 52


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  219 bits (560), Expect = 1e-67
 Identities = 76/192 (39%), Positives = 108/192 (56%), Gaps = 19/192 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
           LP   DWR++G ++ +K Q +C  CWAFSAV  VE+++ I+   L  LS Q+LVDCD ++
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60

Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
            GCNGG M++A QYII NGG+ + Q YPY A +    C         V +  + R+    
Sbjct: 61  HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGS--CKPYRLRV--VSINGFQRVTRNN 116

Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID------LNQRL----YGTS--IPYWI 461
           E  ++  VA+  P+SV + A G  F +YS G+         N  +    YGT     YWI
Sbjct: 117 ESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWI 175

Query: 462 VKNSWGSDWGEK 473
           V+NSWG +WG +
Sbjct: 176 VRNSWGQNWGNQ 187



 Score =  155 bits (394), Expect = 8e-44
 Identities = 60/149 (40%), Positives = 84/149 (56%), Gaps = 16/149 (10%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           LVDCD ++ GCNGG M++A QYII NGG+ + Q YPY A +    C         V +  
Sbjct: 53  LVDCDTASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGS--CKPYRLRV--VSING 108

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
           + R+    E  ++  VA+  P+SV + A G  F +YS G+       C   AQNH ++IV
Sbjct: 109 FQRVTRNNESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGP---CG-TAQNHGVVIV 163

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG +  K+     YWIV+NSWG +WG +
Sbjct: 164 GYGTQSGKN-----YWIVRNSWGQNWGNQ 187



 Score = 83.7 bits (208), Expect = 2e-18
 Identities = 23/51 (45%), Positives = 33/51 (64%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           LP   DWR++G ++ +K Q +C  CWAFSAV  VE+++ I+   L  LS Q
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQ 51


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  218 bits (557), Expect = 5e-67
 Identities = 54/204 (26%), Positives = 88/204 (43%), Gaps = 24/204 (11%)

Query: 288 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQ 347
           N+     + P   D R    ++ ++ QG C   WAFS V   E+ +        +L+ Q+
Sbjct: 2   NACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQE 61

Query: 348 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 407
           LVDC  S  GC+G  +   ++YI  N GVV +  Y Y A E    C     +     +  
Sbjct: 62  LVDCA-SQHGCHGDTIPRGIEYIQHN-GVVQESYYRYVAREQS--CRRPNAQR--FGISN 115

Query: 408 YSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL----NQRL------- 452
           Y +I      ++++ +A T   ++V +    L    +Y G  I       Q         
Sbjct: 116 YCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIV 175

Query: 453 -YGTS--IPYWIVKNSWGSDWGEK 473
            Y  +  + YWIV+NSW ++WG+ 
Sbjct: 176 GYSNAQGVDYWIVRNSWDTNWGDN 199



 Score =  157 bits (400), Expect = 2e-44
 Identities = 48/166 (28%), Positives = 74/166 (44%), Gaps = 21/166 (12%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
                     LA ++LVDC  S  GC+G  +   ++YI  NG VV +  Y Y A E    
Sbjct: 51  RQQSLD----LAEQELVDCA-SQHGCHGDTIPRGIEYIQHNG-VVQESYYRYVAREQS-- 102

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL 606
           C     +     +  Y +I      ++++ +A T   ++V +    L    +Y G  I  
Sbjct: 103 CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTII- 159

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
            QR    +   HA+ IVGY   +  D     YWIV+NSW ++WG+ 
Sbjct: 160 -QRDNGYQPNYHAVNIVGYSNAQGVD-----YWIVRNSWDTNWGDN 199



 Score = 86.4 bits (215), Expect = 2e-19
 Identities = 15/59 (25%), Positives = 25/59 (42%)

Query: 140 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           N+     + P   D R    ++ ++ QG C   WAFS V   E+ +        +L+ Q
Sbjct: 2   NACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQ 60


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  219 bits (559), Expect = 7e-67
 Identities = 80/196 (40%), Positives = 113/196 (57%), Gaps = 18/196 (9%)

Query: 295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
           DLP + DWR +G ++ VK+QGKC  CWAFS V  VE ++AI+  SL  LS Q+L+DCD +
Sbjct: 3   DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTA 62

Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-RGCLVGEEEGFKVKVKEYSRIP 412
            N GC GG MD+A +YI +NGG++++ AYPY+A+          +     V +  +  +P
Sbjct: 63  DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122

Query: 413 YGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS---I 457
              EE++ + VA   P+SV + A+G  F +YS GV        L        YG +    
Sbjct: 123 ANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGK 181

Query: 458 PYWIVKNSWGSDWGEK 473
            YW VKNSWG  WGE+
Sbjct: 182 AYWTVKNSWGPSWGEQ 197



 Score =  154 bits (392), Expect = 5e-43
 Identities = 58/151 (38%), Positives = 85/151 (56%), Gaps = 13/151 (8%)

Query: 506 LVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-RGCLVGEEEGFKVKV 563
           L+DCD + N GC GG MD+A +YI +NGG++++ AYPY+A+          +     V +
Sbjct: 56  LIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHI 115

Query: 564 KEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALI 621
             +  +P   EE++ + VA   P+SV + A+G  F +YS GV       C  +  +H + 
Sbjct: 116 DGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGE---CGTE-LDHGVA 170

Query: 622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           +VGYG  E  DG    YW VKNSWG  WGE+
Sbjct: 171 VVGYGVAE--DGK--AYWTVKNSWGPSWGEQ 197



 Score = 86.2 bits (214), Expect = 6e-19
 Identities = 27/52 (51%), Positives = 36/52 (69%)

Query: 147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           DLP + DWR +G ++ VK+QGKC  CWAFS V  VE ++AI+  +L  LS Q
Sbjct: 3   DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQ 54


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  216 bits (554), Expect = 1e-66
 Identities = 75/194 (38%), Positives = 107/194 (55%), Gaps = 19/194 (9%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
           LP+  DWR+ G +  +K+QG+C   WAFS +  VE ++ I    L  LS Q+LVDC  + 
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
              GC+GG M D  Q+II+NGG+ ++  YPY A E +  C +  ++   V +  Y  +PY
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ--CNLDLQQEKYVSIDTYENVPY 118

Query: 414 GEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPY 459
             E  ++  VA   P+SV + A G  F +YS G+        +        YGT   I Y
Sbjct: 119 NNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDY 177

Query: 460 WIVKNSWGSDWGEK 473
           WIVKNSWG+ WGE+
Sbjct: 178 WIVKNSWGTTWGEE 191



 Score =  153 bits (390), Expect = 4e-43
 Identities = 60/151 (39%), Positives = 83/151 (54%), Gaps = 16/151 (10%)

Query: 506 LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKV 563
           LVDC  +    GC+GG M D  Q+II+NGG+ ++  YPY A E +  C +  ++   V +
Sbjct: 53  LVDCGRTQNTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ--CNLDLQQEKYVSI 110

Query: 564 KEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALI 621
             Y  +PY  E  ++  VA   P+SV + A G  F +YS G+       C     +HA+ 
Sbjct: 111 DTYENVPYNNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGP---CGTAV-DHAVT 165

Query: 622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           IVGYG E   D     YWIVKNSWG+ WGE+
Sbjct: 166 IVGYGTEGGID-----YWIVKNSWGTTWGEE 191



 Score = 83.3 bits (207), Expect = 3e-18
 Identities = 21/51 (41%), Positives = 31/51 (60%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           LP+  DWR+ G +  +K+QG+C   WAFS +  VE ++ I   +L  LS Q
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQ 51


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  216 bits (553), Expect = 2e-66
 Identities = 83/196 (42%), Positives = 112/196 (57%), Gaps = 21/196 (10%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
           LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC    
Sbjct: 2   LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61

Query: 355 --NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP 412
             N GCNGG M  A QYIIDN G+ SD +YPYKA + +  C    +        +Y+ +P
Sbjct: 62  YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQK--CQYDSKYR-AATCSKYTELP 118

Query: 413 YGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL---NQRL--------YGTS--I 457
           YG E+ +K+ VA +GP+SVG++A    F  Y  GV       Q +        YG     
Sbjct: 119 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGK 178

Query: 458 PYWIVKNSWGSDWGEK 473
            YW+VKNSWG ++GE+
Sbjct: 179 EYWLVKNSWGHNFGEE 194



 Score =  157 bits (399), Expect = 2e-44
 Identities = 69/168 (41%), Positives = 95/168 (56%), Gaps = 22/168 (13%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
           TG L S LS     + LVDC      N GCNGG M  A QYIIDN G+ SD +YPYKA +
Sbjct: 43  TGKLVS-LS----AQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMD 97

Query: 547 SERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVI 604
            +  C    +        +Y+ +PYG E+ +K+ VA +GP+SVG++A    F  Y  GV 
Sbjct: 98  QK--CQYDSKYR-AATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV- 153

Query: 605 DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  C     NH +++VGYG+   K+     YW+VKNSWG ++GE+
Sbjct: 154 -YYEPSCTQNV-NHGVLVVGYGDLNGKE-----YWLVKNSWGHNFGEE 194



 Score = 81.4 bits (202), Expect = 1e-17
 Identities = 25/51 (49%), Positives = 34/51 (66%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q
Sbjct: 2   LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQ 52


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  216 bits (552), Expect = 2e-66
 Identities = 80/191 (41%), Positives = 107/191 (56%), Gaps = 18/191 (9%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           P++ DWRA+G ++ VK QG C  CWAFS +  VE ++ I   +L ELS Q+LVDCD  + 
Sbjct: 2   PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSY 61

Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
           GC GG    +LQY+  N GV + + YPY+A + +  C   ++ G KVK+  Y R+P   E
Sbjct: 62  GCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYK--CRATDKPGPKVKITGYKRVPSNCE 118

Query: 417 EEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPYWIV 462
                 +A   PLSV + A G  F  Y  GV D     +L        YGTS    Y I+
Sbjct: 119 TSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIII 177

Query: 463 KNSWGSDWGEK 473
           KNSWG +WGEK
Sbjct: 178 KNSWGPNWGEK 188



 Score =  155 bits (394), Expect = 1e-43
 Identities = 60/149 (40%), Positives = 80/149 (53%), Gaps = 15/149 (10%)

Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
           LVDCD  + GC GG    +LQY+  N GV + + YPY+A + +  C   ++ G KVK+  
Sbjct: 53  LVDCDKHSYGCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYK--CRATDKPGPKVKITG 109

Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
           Y R+P   E      +A   PLSV + A G  F  Y  GV D     C  K  +HA+  V
Sbjct: 110 YKRVPSNCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP---CGTKL-DHAVTAV 164

Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           GYG  + K+     Y I+KNSWG +WGEK
Sbjct: 165 GYGTSDGKN-----YIIIKNSWGPNWGEK 188



 Score = 81.8 bits (203), Expect = 8e-18
 Identities = 25/50 (50%), Positives = 33/50 (66%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P++ DWRA+G ++ VK QG C  CWAFS +  VE ++ I   NL ELS Q
Sbjct: 2   PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQ 51


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  214 bits (548), Expect = 9e-66
 Identities = 74/198 (37%), Positives = 106/198 (53%), Gaps = 22/198 (11%)

Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
            +LP   DWR+ G ++ VK+Q  C  CWAFS  G +E  H  +   L  LS Q+L+DC  
Sbjct: 5   SELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR 64

Query: 354 SNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRI 411
           + G   C+GG M+DA QY++D+GG+ S+ AYPY A + E  C     E   VK+  +  +
Sbjct: 65  AEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEE--CRAQSCEK-VVKILGFKDV 121

Query: 412 PYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS--- 456
           P   E  MK  +A   P+S+ + A+ + F +Y  GV D +    L        YGT    
Sbjct: 122 PRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKES 180

Query: 457 -IPYWIVKNSWGSDWGEK 473
              +WI+KNSWG+ WG  
Sbjct: 181 KKDFWIMKNSWGTGWGRD 198



 Score =  150 bits (380), Expect = 8e-42
 Identities = 59/166 (35%), Positives = 93/166 (56%), Gaps = 19/166 (11%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
           TG L S    L+ ++L+DC  + G   C+GG M+DA QY++D+GG+ S+ AYPY A + E
Sbjct: 48  TGKLVS----LSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEE 103

Query: 549 RGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL 606
             C     E   VK+  +  +P   E  MK  +A   P+S+ + A+ + F +Y  GV D 
Sbjct: 104 --CRAQSCEK-VVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDA 159

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           +   C     +H +++VGYG +++       +WI+KNSWG+ WG  
Sbjct: 160 S---CGTDL-DHGVLLVGYGTDKES---KKDFWIMKNSWGTGWGRD 198



 Score = 84.1 bits (209), Expect = 1e-18
 Identities = 22/53 (41%), Positives = 29/53 (54%)

Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            +LP   DWR+ G ++ VK+Q  C  CWAFS  G +E  H  +   L  LS Q
Sbjct: 5   SELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQ 57


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  213 bits (545), Expect = 1e-65
 Identities = 75/188 (39%), Positives = 105/188 (55%), Gaps = 16/188 (8%)

Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
           LPE  DWR +G ++ VK QG C  CWAFS V  VE+++ I+  +L  LS Q+LVDCD  N
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
            GC GG    A QYII+NGG+ +   YPYKA +    C    +    V +  Y+ +P+  
Sbjct: 61  HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP--CQAASK---VVSIDGYNGVPFCN 115

Query: 416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGV--------IDLNQRLYGTSIPYWIVKNS 465
           E  +K+ VA   P +V ++A+   +  YS G+        ++    + G    YWIV+NS
Sbjct: 116 EXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQANYWIVRNS 174

Query: 466 WGSDWGEK 473
           WG  WGEK
Sbjct: 175 WGRYWGEK 182



 Score =  142 bits (361), Expect = 2e-39
 Identities = 60/164 (36%), Positives = 82/164 (50%), Gaps = 25/164 (15%)

Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
           TG L S    L+ ++LVDCD  N GC GG    A QYII+NGG+ +   YPYKA +    
Sbjct: 42  TGNLIS----LSEQELVDCDKKNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP-- 95

Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQ 608
           C    +    V +  Y+ +P+  E  +K+ VA   P +V ++A+   +  YS G+     
Sbjct: 96  CQAASK---VVSIDGYNGVPFCNEXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFS--- 148

Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                   NH + IVGY            YWIV+NSWG  WGEK
Sbjct: 149 -GPCGTKLNHGVTIVGYQAN---------YWIVRNSWGRYWGEK 182



 Score = 83.5 bits (207), Expect = 2e-18
 Identities = 26/51 (50%), Positives = 33/51 (64%)

Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           LPE  DWR +G ++ VK QG C  CWAFS V  VE+++ I+  NL  LS Q
Sbjct: 1   LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQ 51


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  209 bits (534), Expect = 8e-64
 Identities = 74/199 (37%), Positives = 103/199 (51%), Gaps = 26/199 (13%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-- 354
           P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC     
Sbjct: 2   PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61

Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
           N GCNGG MD A QY+ DNGG+ S+++YPY+A+E    C    +         +  IP  
Sbjct: 62  NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES--CKYNPKYS-VANDTGFVDIP-K 117

Query: 415 EEEEMKKWVATRGPLSVGMNANGLF--YYSGGVID----LNQRL--------YGTS---- 456
           +E+ + K VAT GP+SV ++A      +Y  G+       ++ +        YG      
Sbjct: 118 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 177

Query: 457 --IPYWIVKNSWGSDWGEK 473
               YW+VKNSWG +WG  
Sbjct: 178 DNNKYWLVKNSWGEEWGMG 196



 Score =  156 bits (397), Expect = 3e-44
 Identities = 64/167 (38%), Positives = 93/167 (55%), Gaps = 17/167 (10%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
           TG L S LS    E+ LVDC     N GCNGG MD A QY+ DNGG+ S+++YPY+A+E 
Sbjct: 42  TGRLIS-LS----EQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE 96

Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF--YYSGGVID 605
              C    +         +  IP  +E+ + K VAT GP+SV ++A      +Y  G+  
Sbjct: 97  S--CKYNPKYS-VANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGI-- 150

Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  C+ +  +H +++VGYG E  +   +  YW+VKNSWG +WG  
Sbjct: 151 YFEPDCSSEDMDHGVLVVGYGFESTESDNN-KYWLVKNSWGEEWGMG 196



 Score = 79.5 bits (197), Expect = 5e-17
 Identities = 22/50 (44%), Positives = 29/50 (58%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct: 2   PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 51


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  215 bits (549), Expect = 3e-63
 Identities = 62/306 (20%), Positives = 113/306 (36%), Gaps = 50/306 (16%)

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL 258
           + +  +         +  +  +    +   +           ++  L+  D+ + +G   
Sbjct: 126 YVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG--- 182

Query: 259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR---AEGVISKVKEQG 315
                          S      +         +    LP ++DWR       +S V+ Q 
Sbjct: 183 -------------GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQA 229

Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
            C  C++F+++G++EA   I  N+     LS Q++V C     GC GG           +
Sbjct: 230 SCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQD 289

Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP----YGEEEEMKKWVATRGPL 429
            G+V +  +PY  ++S   C + +E+ F+    EY  +        E  MK  +   GP+
Sbjct: 290 FGLVEEACFPYTGTDSP--CKM-KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPM 346

Query: 430 SVGMNA-NGLFYYSGGV---------IDLNQRL--------YGTS----IPYWIVKNSWG 467
           +V     +   +Y  G+          +  +          YGT     + YWIVKNSWG
Sbjct: 347 AVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWG 406

Query: 468 SDWGEK 473
           + WGE 
Sbjct: 407 TGWGEN 412



 Score =  163 bits (414), Expect = 3e-44
 Identities = 46/164 (28%), Positives = 74/164 (45%), Gaps = 14/164 (8%)

Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEE 556
           +   L+ +++V C     GC GG           + G+V +  +PY  ++S   C + +E
Sbjct: 255 QTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSP--CKM-KE 311

Query: 557 EGFKVKVKEYSRIP----YGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV---IDLNQ 608
           + F+    EY  +        E  MK  +   GP++V     +   +Y  G+     L  
Sbjct: 312 DCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRD 371

Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                +  NHA+++VGYG +    G    YWIVKNSWG+ WGE 
Sbjct: 372 PFNPFELTNHAVLLVGYGTDS-ASGM--DYWIVKNSWGTGWGEN 412



 Score = 82.1 bits (203), Expect = 8e-17
 Identities = 24/153 (15%), Positives = 49/153 (32%), Gaps = 21/153 (13%)

Query: 52  HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLD 111
            +  +         +  +  +    +               ++  L+  D+ + +G    
Sbjct: 127 VNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG---- 182

Query: 112 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR---AEGVISKVKEQGK 168
                         S      +         +    LP ++DWR       +S V+ Q  
Sbjct: 183 ------------GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQAS 230

Query: 169 CACCWAFSAVGVVEAMHAIQGNNLT--ELSVQH 199
           C  C++F+++G++EA   I  NN     LS Q 
Sbjct: 231 CGSCYSFASMGMLEARIRILTNNSQTPILSPQE 263


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  201 bits (512), Expect = 5e-60
 Identities = 42/227 (18%), Positives = 78/227 (34%), Gaps = 47/227 (20%)

Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
           +      D        +V++QG C   W F++   +E +  ++G   T++S   + +C  
Sbjct: 8   EYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYK 67

Query: 354 S--NGGCNGGRMDDA-LQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGF--------- 401
                 C+ G      LQ I D G + ++  YPY   +    C   E+            
Sbjct: 68  GEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKIL 127

Query: 402 ---------------KVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL--FYYSGG 444
                            + + +        + +K  V  +G +   + A  +  + +SG 
Sbjct: 128 HNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGK 187

Query: 445 VIDLN---QRL--------YGTSI-------PYWIVKNSWGSDWGEK 473
            +                 YG  +        YWIV+NSWG  WG++
Sbjct: 188 KVKNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDE 234



 Score =  155 bits (395), Expect = 2e-43
 Identities = 40/175 (22%), Positives = 63/175 (36%), Gaps = 30/175 (17%)

Query: 506 LVDCDMS--NGGCNGGRMDDA-LQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGF--- 559
           + +C        C+ G      LQ I D G + ++  YPY   +    C   E+      
Sbjct: 62  VANCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLW 121

Query: 560 ---------------------KVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG-LF 597
                                  + + +        + +K  V  +G +   + A   + 
Sbjct: 122 DNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMG 181

Query: 598 YYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           Y   G    N  LC     +HA+ IVGYG     +G    YWIV+NSWG  WG++
Sbjct: 182 YEFSGKKVKN--LCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDE 234



 Score = 77.4 bits (191), Expect = 5e-16
 Identities = 11/53 (20%), Positives = 23/53 (43%)

Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           +      D        +V++QG C   W F++   +E +  ++G   T++S  
Sbjct: 8   EYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISAL 60


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  192 bits (489), Expect = 5e-56
 Identities = 60/278 (21%), Positives = 97/278 (34%), Gaps = 59/278 (21%)

Query: 238 GVNKFF-DLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDL 296
             +    +++  + ++L G+                   N   + +   +F        L
Sbjct: 29  KYDGVMQNITLREAKRLNGV----------------IKKNNNASILPKRRFTEEEARAPL 72

Query: 297 PEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ-GNSLTELSVQQLVDC 351
           P +FD    W     I ++ +Q  C  CWA +A   +        G     +S   L+ C
Sbjct: 73  PSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLAC 132

Query: 352 DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKA------SESERGCLVGEEEGFK-- 402
               G GCNGG  D A  Y     G+VSD   PY        S+S+ G     +  F   
Sbjct: 133 CSDCGDGCNGGDPDRAWAYFSST-GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTP 191

Query: 403 -------------VKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV--- 445
                        V  + ++      E++  + +  RGP  V  +       Y+ GV   
Sbjct: 192 KCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHH 251

Query: 446 ----IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
                      RL  +GT   +PYW + NSW ++WG  
Sbjct: 252 VSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMD 289



 Score =  139 bits (353), Expect = 7e-37
 Identities = 44/179 (24%), Positives = 69/179 (38%), Gaps = 32/179 (17%)

Query: 497 KLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKA------SESER 549
           +   ++   L+ C    G GCNGG  D A  Y    G +VSD   PY        S+S+ 
Sbjct: 120 QDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTG-LVSDYCQPYPFPHCSHHSKSKN 178

Query: 550 GCLVGEEEGFK---------------VKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA- 593
           G     +  F                V  + ++      E++  + +  RGP  V  +  
Sbjct: 179 GYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVY 238

Query: 594 NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                Y+ GV      +       HA+ +VG+G     +G   PYW + NSW ++WG  
Sbjct: 239 EDFIAYNSGVY---HHVSGQYLGGHAVRLVGWGTS---NGV--PYWKIANSWNTEWGMD 289



 Score = 77.8 bits (192), Expect = 8e-16
 Identities = 18/113 (15%), Positives = 35/113 (30%), Gaps = 21/113 (18%)

Query: 92  NKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEA 151
               +++  + ++L G+                   N   + +   +F        LP +
Sbjct: 32  GVMQNITLREAKRLNGV----------------IKKNNNASILPKRRFTEEEARAPLPSS 75

Query: 152 FD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN-LTELSVQH 199
           FD    W     I ++ +Q  C  CWA +A   +       G      +S   
Sbjct: 76  FDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGD 128


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  181 bits (461), Expect = 2e-52
 Identities = 55/214 (25%), Positives = 83/214 (38%), Gaps = 35/214 (16%)

Query: 293 GDDLPEAFDWRAEG---VISKVKEQ---GKCACCWAFSAVGVVEAMHAIQGNSLTE---L 343
             DLP+++DWR        S  + Q     C  CWA ++   +     I+         L
Sbjct: 33  PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLL 92

Query: 344 SVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-------RGCLVG 396
           SVQ ++DC  + G C GG       Y   +G +  +    Y+A + E         C   
Sbjct: 93  SVQNVIDCG-NAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCGTCNEF 150

Query: 397 EEEGFKVKVKEYSRIPYGE---EEEMKKWVATRGPLSVGMNANGLFY-YSGGV---IDLN 449
           +E         +    YG     E+M   +   GP+S G+ A      Y+GG+       
Sbjct: 151 KECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDT 210

Query: 450 QRL--------YGTS--IPYWIVKNSWGSDWGEK 473
             +        +G S    YWIV+NSWG  WGE+
Sbjct: 211 TYINHVVSVAGWGISDGTEYWIVRNSWGEPWGER 244



 Score =  142 bits (360), Expect = 2e-38
 Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 21/167 (12%)

Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-------R 549
             + L+ + ++DC  + G C GG       Y   +G +  +    Y+A + E        
Sbjct: 88  PSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCG 145

Query: 550 GCLVGEEEGFKVKVKEYSRIPYGE---EEEMKKWVATRGPLSVGMNANGLFY-YSGGVID 605
            C   +E         +    YG     E+M   +   GP+S G+ A      Y+GG+  
Sbjct: 146 TCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGI-- 203

Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +        NH + + G+G     DGT   YWIV+NSWG  WGE+
Sbjct: 204 YAEYQDTTYI-NHVVSVAGWGIS---DGT--EYWIVRNSWGEPWGER 244



 Score = 64.6 bits (158), Expect = 1e-11
 Identities = 15/73 (20%), Positives = 27/73 (36%), Gaps = 13/73 (17%)

Query: 145 GDDLPEAFDWRAEG---VISKVKEQ---GKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             DLP+++DWR        S  + Q     C  CWA ++   +     I+          
Sbjct: 33  PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPS--- 89

Query: 199 HHDKVYSSVEDLL 211
                  SV++++
Sbjct: 90  ----TLLSVQNVI 98


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  178 bits (455), Expect = 5e-52
 Identities = 56/230 (24%), Positives = 88/230 (38%), Gaps = 52/230 (22%)

Query: 295 DLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQL 348
           ++P +FD R +      I+ +++Q +C  CWAF AV  +     IQ  G    ELS   L
Sbjct: 2   EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61

Query: 349 VDCDMSNG-GCNGGRMDDALQYIIDNGGV--------VSDQAYPYKASES---------- 389
           + C  S G GC GG +  A  Y +  G V           + YP+   E           
Sbjct: 62  LSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121

Query: 390 ---------ERGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA-N 436
                    ++ C    +  +  + K   +  Y    +E+ ++K +   GP+  G     
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYT-QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYE 180

Query: 437 GLFYYSGGV-------IDLNQ--RL--YGTS--IPYWIVKNSWGSDWGEK 473
               Y  G+              R+  +G     PYW++ NSW  DWGE 
Sbjct: 181 DFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIANSWNEDWGEN 230



 Score =  135 bits (343), Expect = 3e-36
 Identities = 42/189 (22%), Positives = 70/189 (37%), Gaps = 41/189 (21%)

Query: 496 SKLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGV--------VSDQAYPYKASE 546
            +   L+   L+ C  S G GC GG +  A  Y +  G V           + YP+   E
Sbjct: 51  KQNVELSAVDLLSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCE 110

Query: 547 S-------------------ERGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATR 584
                               ++ C    +  +  + K   +  Y    +E+ ++K +   
Sbjct: 111 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT-QDKHRGKSSYNVKNDEKAIQKEIMKY 169

Query: 585 GPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKN 643
           GP+  G         Y  G+    + +       HA+ I+G+G E   +    PYW++ N
Sbjct: 170 GPVEAGFTVYEDFLNYKSGI---YKHITGETLGGHAIRIIGWGVE---NKA--PYWLIAN 221

Query: 644 SWGSDWGEK 652
           SW  DWGE 
Sbjct: 222 SWNEDWGEN 230



 Score = 73.0 bits (180), Expect = 1e-14
 Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 6/59 (10%)

Query: 147 DLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN--LTELSVQH 199
           ++P +FD R +      I+ +++Q +C  CWAF AV  +     IQ       ELS   
Sbjct: 2   EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  174 bits (443), Expect = 8e-51
 Identities = 64/197 (32%), Positives = 96/197 (48%), Gaps = 26/197 (13%)

Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
           P + DWR +G ++ VK+QG C  CWAF A G +E + AI    L  +S QQ+VDCD    
Sbjct: 2   PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTXXX 61

Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
              GG  DDA +++I NGG+ SD  YPY   +    C + +      ++  Y+ +P    
Sbjct: 62  XXXGGDADDAFRWVITNGGIASDANYPYTGVDGT--CDLNKPIA--ARIDGYTNVP-NSS 116

Query: 417 EEMKKWVATRGPLSVGMNANGL---FYYSGGVI----------DLNQRL----YGTS--- 456
             +   VA   P+SV +  +      Y   G+            ++  +    YG++   
Sbjct: 117 SALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTN 175

Query: 457 IPYWIVKNSWGSDWGEK 473
             YWIVKNSWG++WG  
Sbjct: 176 ADYWIVKNSWGTEWGID 192



 Score =  117 bits (295), Expect = 3e-30
 Identities = 54/166 (32%), Positives = 81/166 (48%), Gaps = 19/166 (11%)

Query: 491 TGVLPSKLSRLATEK-LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
           TG L S +S    E+ +VDCD       GG  DDA +++I NGG+ SD  YPY   +   
Sbjct: 42  TGRLIS-VS----EQQIVDCDTXXXXXXGGDADDAFRWVITNGGIASDANYPYTGVDGT- 95

Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDL 606
            C + +      ++  Y+ +P      +   VA   P+SV +  +      Y   G+   
Sbjct: 96  -CDLNKPIA--ARIDGYTNVP-NSSSALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAG 150

Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           +    +P   +H ++IVGYG      GT+  YWIVKNSWG++WG  
Sbjct: 151 SSCSDDPATVDHTVLIVGYGSN----GTNADYWIVKNSWGTEWGID 192



 Score = 81.4 bits (201), Expect = 1e-17
 Identities = 22/50 (44%), Positives = 30/50 (60%)

Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
           P + DWR +G ++ VK+QG C  CWAF A G +E + AI    L  +S Q
Sbjct: 2   PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQ 51


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  173 bits (441), Expect = 7e-50
 Identities = 56/232 (24%), Positives = 84/232 (36%), Gaps = 52/232 (22%)

Query: 293 GDDLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQ 346
              LP +FD R +      I ++++QG C   WAF AV  +     I  N+    E+S +
Sbjct: 4   DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63

Query: 347 QLVDCDMS--NGGCNGGRMDDALQYIIDNGGV------VSDQAYPYKASESE-------- 390
            L+ C  S    GCNGG   +A  +    G V            PY     E        
Sbjct: 64  DLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARP 123

Query: 391 ------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA 435
                       + C  G    +K + K Y    Y     E+++   +   GP+    + 
Sbjct: 124 PCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182

Query: 436 NG-LFYYSGGV-------IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
                 Y  GV       +      R+  +G     PYW+V NSW +DWG+ 
Sbjct: 183 YSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDN 234



 Score =  131 bits (332), Expect = 1e-34
 Identities = 46/185 (24%), Positives = 68/185 (36%), Gaps = 41/185 (22%)

Query: 500 RLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGV------VSDQAYPYKASESE--- 548
            ++ E L+ C  S    GCNGG   +A  +    G V            PY     E   
Sbjct: 59  EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHV 118

Query: 549 -----------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLS 588
                            + C  G    +K + K Y    Y     E+++   +   GP+ 
Sbjct: 119 NGARPPCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVE 177

Query: 589 VGMNANG-LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGS 647
              +       Y  GV    Q +       HA+ I+G+G E   +GT  PYW+V NSW +
Sbjct: 178 GAFSVYSDFLLYKSGV---YQHVTGEMMGGHAIRILGWGVE---NGT--PYWLVANSWNT 229

Query: 648 DWGEK 652
           DWG+ 
Sbjct: 230 DWGDN 234



 Score = 70.4 bits (173), Expect = 1e-13
 Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 6/61 (9%)

Query: 145 GDDLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLT--ELSVQ 198
              LP +FD R +      I ++++QG C   WAF AV  +     I  N     E+S +
Sbjct: 4   DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63

Query: 199 H 199
            
Sbjct: 64  D 64


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  173 bits (440), Expect = 4e-49
 Identities = 68/288 (23%), Positives = 104/288 (36%), Gaps = 76/288 (26%)

Query: 238 GVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLP 297
           G N F+++  S L++L G  L                        +  Q         LP
Sbjct: 28  GHN-FYNVDMSYLKRLCGTFLGGP---------------------KPPQRVMFTEDLKLP 65

Query: 298 EAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSL--TELSVQQLVDC 351
            +FD R +      I ++++QG C  CWAF AV  +     I  N+    E+S + L+ C
Sbjct: 66  ASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTC 125

Query: 352 DMS--NGGCNGGRMDDALQYIIDNGGVVSDQAY-------PYKASESE------------ 390
             S    GCNGG   +A  +     G+VS   Y       PY     E            
Sbjct: 126 CGSMCGDGCNGGYPAEAWNFWTRK-GLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTG 184

Query: 391 --------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA-NGL 438
                   + C  G    +K + K Y    Y     E+++   +   GP+    +  +  
Sbjct: 185 EGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 243

Query: 439 FYYSGGV-------IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
             Y  GV       +      R+  +G     PYW+V NSW +DWG+ 
Sbjct: 244 LLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDN 291



 Score =  129 bits (327), Expect = 2e-33
 Identities = 48/186 (25%), Positives = 72/186 (38%), Gaps = 43/186 (23%)

Query: 500 RLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAY-------PYKASESE-- 548
            ++ E L+ C  S    GCNGG   +A  +    G +VS   Y       PY     E  
Sbjct: 116 EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG-LVSGGLYESHVGCRPYSIPPCEHH 174

Query: 549 ------------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPL 587
                             + C  G    +K + K Y    Y     E+++   +   GP+
Sbjct: 175 VNGSRPPCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPV 233

Query: 588 SVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWG 646
               +  +    Y  GV    Q +       HA+ I+G+G E   +GT  PYW+V NSW 
Sbjct: 234 EGAFSVYSDFLLYKSGVY---QHVTGEMMGGHAIRILGWGVE---NGT--PYWLVANSWN 285

Query: 647 SDWGEK 652
           +DWG+ 
Sbjct: 286 TDWGDN 291



 Score = 70.5 bits (173), Expect = 2e-13
 Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 27/112 (24%)

Query: 94  FFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 153
           F+++  S L++L G  L                        +  Q         LP +FD
Sbjct: 31  FYNVDMSYLKRLCGTFLGGP---------------------KPPQRVMFTEDLKLPASFD 69

Query: 154 WRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNL--TELSVQH 199
            R +      I ++++QG C  CWAF AV  +     I  N     E+S + 
Sbjct: 70  AREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 121


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  170 bits (432), Expect = 2e-48
 Identities = 34/214 (15%), Positives = 69/214 (32%), Gaps = 38/214 (17%)

Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN--SLTELSVQQLVDC 351
             LP   D        +V +QG+   C A +    ++                +    + 
Sbjct: 55  AALPPKVDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNE 111

Query: 352 --DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE--------------SERGCLV 395
                +   + G M      ++   GV  ++ +PY  +               S++    
Sbjct: 112 RKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQ 171

Query: 396 GEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRL- 452
             ++    K+ EYSR+   + + +K  +A   P   G +   + +   S  V        
Sbjct: 172 CYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKN 230

Query: 453 -------------YGTSIPYWIVKNSWGSDWGEK 473
                        Y   I ++ ++NSWG++ GE 
Sbjct: 231 DTLEGGHAVLCVGYDDEIRHFRIRNSWGNNVGED 264



 Score =  137 bits (347), Expect = 2e-36
 Identities = 30/165 (18%), Positives = 63/165 (38%), Gaps = 26/165 (15%)

Query: 506 LVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE--------------SER 549
             +      +   + G M      ++   GV  ++ +PY  +               S++
Sbjct: 108 YYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKK 167

Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLN 607
                 ++    K+ EYSR+   + + +K  +A   P   G +   + +   S  V    
Sbjct: 168 PSDQCYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPL 226

Query: 608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
               +     HA++ VGY +E +       ++ ++NSWG++ GE 
Sbjct: 227 PTKNDTLEGGHAVLCVGYDDEIR-------HFRIRNSWGNNVGED 264



 Score = 61.4 bits (149), Expect = 2e-10
 Identities = 8/53 (15%), Positives = 17/53 (32%), Gaps = 3/53 (5%)

Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LP   D        +V +QG+   C A +    ++        +   +  +
Sbjct: 55  AALPPKVDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSR 104


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 82.0 bits (202), Expect = 3e-16
 Identities = 108/611 (17%), Positives = 168/611 (27%), Gaps = 212/611 (34%)

Query: 63  LRRHENFVTNVEKA-EDYQREDSGTAVFE-VNKFFDLSDSDLQQLTGLNLDSTLEDIQPS 120
           L+  E F   + +  E +  +D  T   E V KF     S ++       D  L      
Sbjct: 33  LQ--EQFNKILPEPTEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLC--- 87

Query: 121 LQAPFSSNQTDTEMRAFQFNSLRHGD--DLPEAFDWRAEGVISKVKEQGKC---ACCWA- 174
                        +  F+   L   D   L        +  + K KE  K    A   A 
Sbjct: 88  -------------LTEFENCYLEGNDIHALAAKLLQENDTTLVKTKELIKNYITARIMAK 134

Query: 175 -----------FSAVGVVEA-MHAI---QGNN---LTELSVQHHDKVYSS-VEDLLRRH- 214
                      F AVG   A + AI   QGN      EL   +  + Y   V DL++   
Sbjct: 135 RPFDKKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFEELRDLY--QTYHVLVGDLIKFSA 192

Query: 215 ---ENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLE--DIQPS- 268
                 +     AE   ++                        GLN+   LE     P  
Sbjct: 193 ETLSELIRTTLDAEKVFTQ------------------------GLNILEWLENPSNTPDK 228

Query: 269 ---LQAPFS------------------SNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 307
              L  P S                     T  E+R++   +  H   L  A        
Sbjct: 229 DYLLSIPISCPLIGVIQLAHYVVTAKLLGFTPGELRSYLKGATGHSQGLVTAV------A 282

Query: 308 ISKVK------EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG----- 356
           I++           + A    F  +GV     A    SL    ++  ++    N      
Sbjct: 283 IAETDSWESFFVSVRKAITVLFF-IGV-RCYEAYPNTSLPPSILEDSLE----NNEGVPS 336

Query: 357 ---GCNGGRMDDALQYIID-----------------NGG---VVSDQAYPYKASESERGC 393
                +     + +Q  ++                 NG    VVS    P    +S    
Sbjct: 337 PMLSISNLT-QEQVQDYVNKTNSHLPAGKQVEISLVNGAKNLVVS--GPP----QS---- 385

Query: 394 LVGEEEGF-KVKV---KEYSRIPYGEEEEMKKWVATR-GPLSVGMNANGLF---YYSGGV 445
           L G      K K     + SRIP+ E    K   + R  P++        F         
Sbjct: 386 LYGLNLTLRKAKAPSGLDQSRIPFSER---KLKFSNRFLPVASP------FHSHLLVPAS 436

Query: 446 IDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDL-ELTGVLPSKLSRLATE 504
             +N+ L   ++ +         D    V D     G    DL  L+G +  ++      
Sbjct: 437 DLINKDLVKNNVSF------NAKDIQIPVYD--TFDG---SDLRVLSGSISERIVDCIIR 485

Query: 505 KLVDCDMSNGGCNGGRMDDALQ----YIIDNG-GVVSDQAYPYKASESERGC---LVG-- 554
             V              +   Q    +I+D G G  S        ++   G    + G  
Sbjct: 486 LPVK------------WETTTQFKATHILDFGPGGASGLGVLTHRNKDGTGVRVIVAGTL 533

Query: 555 -----EEEGFK 560
                ++ GFK
Sbjct: 534 DINPDDDYGFK 544



 Score = 56.2 bits (135), Expect = 3e-08
 Identities = 90/533 (16%), Positives = 168/533 (31%), Gaps = 173/533 (32%)

Query: 113 TLE--DIQPSLQAPFSSNQTDTEMRAFQFNSLRH-------GDDLPEAFDWRAE------ 157
           TL    ++  L  P +S    ++++  QFN +          DD P      AE      
Sbjct: 10  TLSHGSLEHVLLVPTASFFIASQLQE-QFNKILPEPTEGFAADDEPTT---PAELVGKFL 65

Query: 158 GVISKVKEQGKCACCWAFSAV---GVVEAMHAI-QGNN---LTELSVQHHDKVYSSVEDL 210
           G +S + E  K      F  V    + E  +   +GN+   L    +Q +D      ++L
Sbjct: 66  GYVSSLVEPSKVG---QFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDTTLVKTKEL 122

Query: 211 LRR--HENFVTN--VEKAED---YQSEDSGT----AVFG----VNKFFDLSESDLQQL-- 253
           ++       +     +K  +   +++   G     A+FG     + +F+    +L+ L  
Sbjct: 123 IKNYITARIMAKRPFDKKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFE----ELRDLYQ 178

Query: 254 --TGLNLDSTLEDIQPSLQAPFSSNQTDTEM---RAFQFNS-LRHGDDLPEAFDWRAEGV 307
               L +   ++    +L         D E    +       L +  + P+  D+     
Sbjct: 179 TYHVL-VGDLIKFSAETLS-ELIRTTLDAEKVFTQGLNILEWLENPSNTPDK-DYLLSIP 235

Query: 308 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDAL 367
           IS                +G            + +L    +V   +   G   G +   L
Sbjct: 236 IS-------------CPLIG------------VIQL-AHYVVTAKLL--GFTPGELRSYL 267

Query: 368 QYIIDNG-GVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATR 426
           +    +  G+V+  A     ++S         E F V V++   + +        ++  R
Sbjct: 268 KGATGHSQGLVT--AVAIAETDSW--------ESFFVSVRKAITVLF--------FIGVR 309

Query: 427 GPLSVGMNANGLFYYSGGVIDLNQRLYG-TSIPYWIVKNSWGSDWGEKVEDKVGSSGNRT 485
                                     Y  TS+P  I+++S   +  E         G  +
Sbjct: 310 C----------------------YEAYPNTSLPPSILEDSL--ENNE---------GVPS 336

Query: 486 RDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYI-IDNGG---VVSDQAYP 541
             L ++      L++   +  V+        N          I + NG    VVS    P
Sbjct: 337 PMLSISN-----LTQEQVQDYVN------KTNSHLPAGKQVEISLVNGAKNLVVS--GPP 383

Query: 542 YKASESERGCLVGEEEGF-KVKV---KEYSRIPYGEEEEMKKWVATR-GPLSV 589
               +S    L G      K K     + SRIP+ E    K   + R  P++ 
Sbjct: 384 ----QS----LYGLNLTLRKAKAPSGLDQSRIPFSER---KLKFSNRFLPVAS 425



 Score = 45.8 bits (108), Expect = 4e-05
 Identities = 89/549 (16%), Positives = 172/549 (31%), Gaps = 189/549 (34%)

Query: 134 MRAFQFNSLRHGD-----DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ 188
            R     +L HG       +P A  + A    S+++EQ        F+ + + E      
Sbjct: 6   TRPL---TLSHGSLEHVLLVPTASFFIA----SQLQEQ--------FNKI-LPEP----- 44

Query: 189 GNNLTELSVQHHDKVYSSVEDLLRRHENFVTN-VEKAEDYQSEDSGTAVFGVNKFFDLSE 247
               TE      +   ++  +L+ +   +V++ VE ++  Q +          +F    E
Sbjct: 45  ----TEGFAADDEP--TTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCL--TEF----E 92

Query: 248 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 307
           +    L G        DI  +L A        T ++  +   ++        +   A  +
Sbjct: 93  NCY--LEG-------NDIH-ALAAKLLQENDTTLVKTKEL--IK-------NY-ITARIM 132

Query: 308 ISK-VKEQGKCACCWAFSAVGVVEA-MHAI---QGNS---LTELS---------VQQLVD 350
             +   ++   A    F AVG   A + AI   QGN+     EL          V  L+ 
Sbjct: 133 AKRPFDKKSNSAL---FRAVGEGNAQLVAIFGGQGNTDDYFEELRDLYQTYHVLVGDLIK 189

Query: 351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVK--VKEY 408
                         + L  +I    + +++ +                +G  +   ++  
Sbjct: 190 -----------FSAETLSELIRT-TLDAEKVFT---------------QGLNILEWLENP 222

Query: 409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGS 468
           S  P  ++     ++ +  P+S  +          GVI L          Y +     G 
Sbjct: 223 SNTP-DKD-----YLLS-IPISCPLI---------GVIQLAH--------YVVTAKLLGF 258

Query: 469 DWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLAT--EKLVDCDMSNGGCNGGRMDDALQ 526
             GE      G++G  ++ L +T V  +         E            +  +    L 
Sbjct: 259 TPGELRSYLKGATG-HSQGL-VTAVAIA----ETDSWESFFV--------SVRKAITVLF 304

Query: 527 YIIDNGGVVSDQAYPYKASESE--RGCLVGEEEGFK---VKVKEYSRIPYGEEEEMKKWV 581
           +I    GV   +AYP  +          +   EG     + +   ++      E+++ +V
Sbjct: 305 FI----GVRCYEAYPNTSLPPSILEDS-LENNEGVPSPMLSISNLTQ------EQVQDYV 353

Query: 582 A---TRGP----LSVGMNANGL--FYYSGGVIDL---NQRLCNPKAQNHALIIVGYGEEE 629
               +  P    + + +  NG      SG    L   N  L   KA +            
Sbjct: 354 NKTNSHLPAGKQVEISL-VNGAKNLVVSGPPQSLYGLNLTLRKAKAPS------------ 400

Query: 630 KKDGTSIPY 638
             D + IP+
Sbjct: 401 GLDQSRIPF 409


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 57.9 bits (139), Expect = 4e-09
 Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 13/116 (11%)

Query: 288 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQ 347
           ++ +   +    F    E  I+ VK Q +   CW +S+   +E+     G    +LS   
Sbjct: 2   DTEKKVSEEGFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMF 61

Query: 348 LVDCDMSNGG------------CNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 391
            V     +                GG   DAL Y ++  G+V ++        ++ 
Sbjct: 62  TVYNTYLDRADAAVRTHGDVSFSQGGSFYDAL-YGMETFGLVPEEEMRPGMMYADT 116



 Score = 44.8 bits (105), Expect = 6e-05
 Identities = 46/366 (12%), Positives = 100/366 (27%), Gaps = 51/366 (13%)

Query: 140 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
           ++ +   +    F    E  I+ VK Q +   CW +S+   +E+     G    +LS   
Sbjct: 2   DTEKKVSEEGFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMF 61

Query: 200 ------HDKVYSSVEDLLRRHEN-------FVTNVEK-----AEDYQSEDSGTAVFGVNK 241
                  D+  ++V        +        +  +E       E+ +           + 
Sbjct: 62  TVYNTYLDRADAAVRTHGDVSFSQGGSFYDALYGMETFGLVPEEEMRPGMMYADTLSNHT 121

Query: 242 FFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 301
                   +           L+            N      +A       +    PE F 
Sbjct: 122 ELSALTDAMVAAIAKGKLRKLQS---------DENNAMLWKKAVAAVHQIYLGVPPEKFT 172

Query: 302 WRAEGVISKVKEQGKCACCWAFSAVG-----------VVEAMHAIQGNSLTELSVQQL-- 348
           ++ +    K   +        + ++             +E     +      L + +   
Sbjct: 173 YKGKEYTPKSFFESTGLKASDYVSLTSYTHHPFYTQFPLEIQDNWRHGMSYNLPLDEFME 232

Query: 349 -VDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 407
             D  ++ G       D +      +G  V          +   G  +      K + K+
Sbjct: 233 VFDNAINTGYTIAWGSDVSESGFTRDGVAVMPDDEKV---QELSGSDMAHWLKLKPEEKK 289

Query: 408 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWG 467
            +  P  ++   +             + +G+  Y G   D           Y++VKNSWG
Sbjct: 290 LNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIY-GIAKDQEGN------EYYMVKNSWG 342

Query: 468 SDWGEK 473
           ++    
Sbjct: 343 TNSKYN 348



 Score = 39.8 bits (92), Expect = 0.002
 Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 4/36 (11%)

Query: 617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
           +H + I G  ++++       Y++VKNSWG++    
Sbjct: 317 DHGMQIYGIAKDQEG----NEYYMVKNSWGTNSKYN 348


>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 106

 Score = 52.4 bits (126), Expect = 9e-09
 Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 2/73 (2%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102
             F +F   + K Y++ E+  RR+  F  N+     + ++   +   ++N F DLS  + 
Sbjct: 23  DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEF 81

Query: 103 QQL-TGLNLDSTL 114
           ++   G      L
Sbjct: 82  RRKYLGFKKSRNL 94



 Score = 47.8 bits (114), Expect = 5e-07
 Identities = 14/66 (21%), Positives = 27/66 (40%), Gaps = 2/66 (3%)

Query: 198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGL 256
             + K Y++ E+  RR+  F  N+     +  +   +    +N F DLS  + ++   G 
Sbjct: 30  AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 88

Query: 257 NLDSTL 262
                L
Sbjct: 89  KKSRNL 94


>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
           disorder P like protein, hydrolase; NMR {Drosophila
           melanogaster}
          Length = 80

 Score = 42.0 bits (99), Expect = 3e-05
 Identities = 15/71 (21%), Positives = 35/71 (49%), Gaps = 4/71 (5%)

Query: 43  TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
             ++ +    DK Y + EDL+RR   +  +  + E++ ++ + G   ++  +N   DL+ 
Sbjct: 8   EEWVEYKSKFDKNYEAEEDLMRR-RIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTP 66

Query: 100 SDLQQLTGLNL 110
            +  Q +G  +
Sbjct: 67  EEFAQRSGKKV 77



 Score = 37.7 bits (88), Expect = 8e-04
 Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 4/62 (6%)

Query: 200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQLTGL 256
            DK Y + EDL+RR   +  +  + E++  + + G   +  G+N   DL+  +  Q +G 
Sbjct: 17  FDKNYEAEEDLMRR-RIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPEEFAQRSGK 75

Query: 257 NL 258
            +
Sbjct: 76  KV 77


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 44.5 bits (104), Expect = 1e-04
 Identities = 81/531 (15%), Positives = 139/531 (26%), Gaps = 150/531 (28%)

Query: 193 TELSVQHHDKVYSSVEDLLRRH-ENFVTN--VEKAEDY-----QSED------SGTAVFG 238
            E     +       +D+L    + FV N   +  +D        E+      S  AV G
Sbjct: 9   FETGEHQY-----QYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSG 63

Query: 239 VNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLR---HGDD 295
             + F    S  +++    ++  L      L +P  + Q    M    +   R   + D 
Sbjct: 64  TLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYND- 122

Query: 296 LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQG---NSLTELSVQQL 348
             + F      R +    K+++    A      A  V+  +  + G     +        
Sbjct: 123 -NQVFAKYNVSRLQ-PYLKLRQ----ALLELRPAKNVL--IDGVLGSGKTWVALDVCLSY 174

Query: 349 -VDCDMSN-------GGCNGGR----MDDALQYIID-NGGVVSDQAYPYK---------- 385
            V C M           CN       M   L Y ID N    SD +   K          
Sbjct: 175 KVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAEL 234

Query: 386 ----ASESERGCL-----VGEE---EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
                S+    CL     V        F +      +I           + TR       
Sbjct: 235 RRLLKSKPYENCLLVLLNVQNAKAWNAFNLS----CKI----------LLTTR------- 273

Query: 434 NANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVE--DKVGSSGNRTRDL--E 489
                      V D       T I   +  +S      E      K      R +DL  E
Sbjct: 274 --------FKQVTDFLSAATTTHIS--LDHHSMTLTPDEVKSLLLKY--LDCRPQDLPRE 321

Query: 490 LTGVLPSKLSRLATE-----------KLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQ 538
           +    P +LS +A             K V+CD         ++   ++  ++       +
Sbjct: 322 VLTTNPRRLSIIAESIRDGLATWDNWKHVNCD---------KLTTIIESSLNVLEPAEYR 372

Query: 539 AYPYKASESERGCLVGEEEGFKVKVKEYSRI----PYGEEEEMKKWVATRGPLSVGMNAN 594
              +         L        +     S I       +   +   +     L       
Sbjct: 373 KM-FDR-------LSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYS-LVEKQPKE 423

Query: 595 GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIP------YW 639
                    ++L  +L N +   H  I+  Y   +  D   +       Y+
Sbjct: 424 STISIPSIYLELKVKLEN-EYALHRSIVDHYNIPKTFDSDDLIPPYLDQYF 473



 Score = 36.8 bits (84), Expect = 0.025
 Identities = 60/449 (13%), Positives = 113/449 (25%), Gaps = 144/449 (32%)

Query: 9   LGEKGLGY---LHTFM---IK------VALLESNIFQTRGY---LNSP-----VTRFLNF 48
            GE    Y   L  F    +       V  +  +I         + S        R    
Sbjct: 11  TGEHQYQYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWT 70

Query: 49  MRDHDK--VYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFEVNKFFDLSDSD---- 101
           +    +  V   VE++LR +  F+ +  K E  Q    +   + + ++ +  +D+     
Sbjct: 71  LLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLY--NDNQVFAK 128

Query: 102 ---------------LQQL---TGLNLD------------STLEDIQPSLQAPFS----- 126
                          L +L     + +D                  +   +  F      
Sbjct: 129 YNVSRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLN 188

Query: 127 -SNQTDTEMRAFQFNSL------------RHGDDLPEAFDWRAEGVISKVKEQGKCAC-- 171
             N    E        L             H  ++          +   +K +    C  
Sbjct: 189 LKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLL 248

Query: 172 ---------CW-AFS----------AVGVVEAMHAIQGNNLTELSVQHHDKVYSSVE--D 209
                     W AF+             V +    +     T +S+ HH    +  E   
Sbjct: 249 VLLNVQNAKAWNAFNLSCKILLTTRFKQVTD---FLSAATTTHISLDHHSMTLTPDEVKS 305

Query: 210 LLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDL---QQLTGLNLDSTLEDIQ 266
           LL +      +  + +D   E   T    ++   +     L        +N D     I+
Sbjct: 306 LLLK----YLDC-RPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIE 360

Query: 267 PSLQAPFSSNQTDTEMRAFQFNSL---RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 323
            SL           E R   F+ L        +P         ++S +         W  
Sbjct: 361 SSLN-----VLEPAEYRK-MFDRLSVFPPSAHIPTI-------LLSLI---------WFD 398

Query: 324 SAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
                V  +       + +L    LV+  
Sbjct: 399 VIKSDVMVV-------VNKLHKYSLVEKQ 420


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 37.8 bits (87), Expect = 0.011
 Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
            HA+      E++ +DG     W V+NSWG D G K
Sbjct: 370 THAMTFTAVSEKDDQDGAFT-KWRVENSWGEDHGHK 404


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 37.0 bits (85), Expect = 0.017
 Identities = 16/68 (23%), Positives = 28/68 (41%), Gaps = 7/68 (10%)

Query: 590 GMNANGLFYYSGGVIDLN----QRLCNPKAQ-NHALIIVGYGEEEKKDGTSIPYWIVKNS 644
           G+    L+ Y     +L      R+   ++    A++I G   +E         + V+NS
Sbjct: 340 GVMDIELWNYPAIGYNLPQQKASRIRYHESLMTAAMLITGCHVDE--TSKLPLRYRVENS 397

Query: 645 WGSDWGEK 652
           WG D G+ 
Sbjct: 398 WGKDSGKD 405



 Score = 31.2 bits (70), Expect = 1.1
 Identities = 18/106 (16%), Positives = 30/106 (28%), Gaps = 4/106 (3%)

Query: 139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
            N  R        F+       + V  Q     CW F+A   +   + +   NL E  + 
Sbjct: 44  LNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAATNQLRL-NVLSELNLKEFELS 102

Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFD 244
                Y    D L +   F+  +  + D   +             D
Sbjct: 103 Q---AYLFFYDKLEKANYFLDQIVSSADQDIDSRLVQYLLAAPTED 145



 Score = 28.9 bits (64), Expect = 6.0
 Identities = 15/68 (22%), Positives = 20/68 (29%), Gaps = 1/68 (1%)

Query: 287 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT-ELSV 345
            N  R        F+       + V  Q     CW F+A   +      + N    ELS 
Sbjct: 44  LNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAATNQLRLNVLSELNLKEFELSQ 103

Query: 346 QQLVDCDM 353
             L   D 
Sbjct: 104 AYLFFYDK 111


>2spc_A Spectrin; cytoskeleton; 1.80A {Drosophila melanogaster} SCOP:
          a.7.1.1
          Length = 107

 Score = 29.4 bits (66), Expect = 1.0
 Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 9/62 (14%)

Query: 17 LHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
          L  +M    L ES +     +LN+          D      +VE L+++HE+F   +   
Sbjct: 5  LQLYMRDCELAESWMSAREAFLNA---------DDDANAGGNVEALIKKHEDFDKAINGH 55

Query: 77 ED 78
          E 
Sbjct: 56 EQ 57


>3tqc_A Pantothenate kinase; biosynthesis of cofactors, prosthetic groups,
           carriers, TRAN; HET: ADP; 2.30A {Coxiella burnetii}
          Length = 321

 Score = 29.2 bits (65), Expect = 3.8
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 244 DLSESDLQQLTGLNLDSTLEDIQ 266
            L+ESDL +L G     +L+++ 
Sbjct: 34  TLTESDLDKLQGQIEIVSLKEVT 56


>3vcz_A Endoribonuclease L-PSP; virulence, pathogenesis, infectious
           diseases, center for STR genomics of infectious
           diseases, csgid, translation; HET: GOL; 1.80A {Vibrio
           vulnificus}
          Length = 153

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 9/42 (21%), Positives = 15/42 (35%), Gaps = 6/42 (14%)

Query: 408 YSRIPYGEEEEMKKWVATR------GPLSVGMNANGLFYYSG 443
            +   Y +   M K + T       GP   G++   +   SG
Sbjct: 14  GTENLYFQSNAMTKVLHTDSAPAAIGPYIQGVDLGNMVLTSG 55



 Score = 28.0 bits (63), Expect = 6.2
 Identities = 9/42 (21%), Positives = 15/42 (35%), Gaps = 6/42 (14%)

Query: 566 YSRIPYGEEEEMKKWVATR------GPLSVGMNANGLFYYSG 601
            +   Y +   M K + T       GP   G++   +   SG
Sbjct: 14  GTENLYFQSNAMTKVLHTDSAPAAIGPYIQGVDLGNMVLTSG 55


>3fb2_A Spectrin alpha chain, brain spectrin; non-erythroid alpha chain
           alpha-II spectrin, fordrin alpha chain, sptan1,
           SPTA2_human, NESG, HR5563A; 2.30A {Homo sapiens}
          Length = 218

 Score = 28.3 bits (63), Expect = 6.5
 Identities = 14/62 (22%), Positives = 24/62 (38%), Gaps = 9/62 (14%)

Query: 17  LHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
           L  F       E+ +     +LN+          D      SVE L+++HE+F   +   
Sbjct: 121 LQLFHRDCEQAENWMAAREAFLNTE---------DKGDSLDSVEALIKKHEDFDKAINVQ 171

Query: 77  ED 78
           E+
Sbjct: 172 EE 173


>3aez_A Pantothenate kinase; transferase, homodimer, COA biosynthesis,
           nucleotide binding binding, cytoplasm,
           nucleotide-binding; HET: GDP PAZ; 2.20A {Mycobacterium
           tuberculosis} PDB: 2ges_A* 2geu_A* 2gev_A* 2zs7_A*
           2zs8_A* 2zs9_A* 2zsa_A* 2zsb_A* 2zsd_A* 2zse_A* 2zsf_A*
           2get_A* 3af0_A* 3af1_A* 3af2_A* 3af3_A* 3af4_A* 3avp_A*
           3avo_A* 3avq_A*
          Length = 312

 Score = 28.3 bits (63), Expect = 7.6
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 244 DLSESDLQQLTGLNLDSTLEDIQ 266
            L+E +L  L GL     L +++
Sbjct: 28  ALTEEELVGLRGLGEQIDLLEVE 50


>4ezg_A Putative uncharacterized protein; internalin-A, leucine-rich repeat
           protein, structural genomi center for structural
           genomics, JCSG; HET: MSE; 1.50A {Listeria monocytogenes}
          Length = 197

 Score = 28.0 bits (63), Expect = 8.6
 Identities = 6/31 (19%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 238 GVNKFFDLSESDLQQLTGLNLDST-LEDIQP 267
           G +   +++E+ +  LT + L +  + D+  
Sbjct: 31  GQSSTANITEAQMNSLTYITLANINVTDLTG 61


>1sq5_A Pantothenate kinase; P-loop, transferase; HET: PAU ADP; 2.20A
           {Escherichia coli} SCOP: c.37.1.6 PDB: 1esm_A* 1esn_A*
          Length = 308

 Score = 28.0 bits (62), Expect = 9.5
 Identities = 9/22 (40%), Positives = 16/22 (72%)

Query: 245 LSESDLQQLTGLNLDSTLEDIQ 266
           LSE ++ +L G+N D +LE++ 
Sbjct: 23  LSEDEIARLKGINEDLSLEEVA 44


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.315    0.133    0.400 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 10,031,962
Number of extensions: 605563
Number of successful extensions: 1604
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1276
Number of HSP's successfully gapped: 155
Length of query: 655
Length of database: 6,701,793
Length adjustment: 100
Effective length of query: 555
Effective length of database: 3,909,693
Effective search space: 2169879615
Effective search space used: 2169879615
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.7 bits)