RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy15353
         (344 letters)



>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  317 bits (814), Expect = e-108
 Identities = 122/329 (37%), Positives = 165/329 (50%), Gaps = 17/329 (5%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
           R   +  SD  ++ +N+   TW AG NF  N+   YL++        F    +P      
Sbjct: 4   RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGT----FLGGPKPPQRVMF 58

Query: 75  TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
           T D      +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +
Sbjct: 59  TED----LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 114

Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
             +S E + +CC          C+ G     WNF  ++G V+GG Y    GC+P +I PC
Sbjct: 115 VEVSAEDLLTCCGSMC---GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 171

Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
            HH +    P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI 
Sbjct: 172 EHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 228

Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
            +GP    F++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W  
Sbjct: 229 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNT 286

Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            WGD G  KILRG+  C  E  + AG P+
Sbjct: 287 DWGDNGFFKILRGQDHCGIESEVVAGIPR 315


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  293 bits (751), Expect = 2e-98
 Identities = 104/337 (30%), Positives = 145/337 (43%), Gaps = 31/337 (9%)

Query: 13  LVRGELYKFSDAYIDQINREAN-TWTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP 70
           LV  +    S A++D++NR     W A  +    N++    ++           ++  + 
Sbjct: 2   LVAEDAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGV---IKKNNNASIL 58

Query: 71  GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA---APHIFAAVGAFSDRRCI 127
             R+  + E  A +P  FD+ E WPNC TI  + D  AC    A    AA  A SDR C 
Sbjct: 59  PKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWA---VAAASAMSDRFCT 115

Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
              G Q+  +S   + +CC  C       C+ G   R W +    G V+         CQ
Sbjct: 116 MG-GVQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVSDY-------CQ 163

Query: 188 PSTISPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
           P     CSHH  +    P C        KC   C +PT         +R+  +Y +   E
Sbjct: 164 PYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIP----VVNYRSWTSYALQ-GE 218

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
           D   +E+   GP    F +Y+DF  Y SGVY H S   L    H+ +L+GWGT NG PYW
Sbjct: 219 DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYW 276

Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
            + N+W   WG  G   I RG  EC  E   +AG P 
Sbjct: 277 KIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPL 313


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  286 bits (733), Expect = 2e-96
 Identities = 104/260 (40%), Positives = 138/260 (53%), Gaps = 8/260 (3%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
           +P  FDAREQWP C TI  + D G+C +   F AV A SDR CI +    +  +S E + 
Sbjct: 7   LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66

Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
           +CC          C+ G     WNF  ++G V+GG Y    GC+P +I PC  H +    
Sbjct: 67  TCCGSMC---GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARP 123

Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
           P       PK  C   C  P Y   + QDKH    +Y V ++E  I  EI  +GP    F
Sbjct: 124 PCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 180

Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
           ++Y DF  YKSGVY+H +   +    H+ +++GWG ENGTPYWLV N+W   WGD G  K
Sbjct: 181 SVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 238

Query: 324 ILRGKYECAFEYLIAAGKPK 343
           ILRG+  C  E  + AG P+
Sbjct: 239 ILRGQDHCGIESEVVAGIPR 258


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  284 bits (728), Expect = 8e-96
 Identities = 101/262 (38%), Positives = 143/262 (54%), Gaps = 14/262 (5%)

Query: 84  VPDRFDAREQWPNCGTIGHVPDTGACA---APHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
           +P  FD+R++WP C +I  + D   C    A   F AV A SDR CI+S G+QN  LS  
Sbjct: 3   IPSSFDSRKKWPRCKSIATIRDQSRCGSCWA---FGAVEAMSDRSCIQSGGKQNVELSAV 59

Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
            + SCC+ C       C  G +   W++  K G VTG    +  GC+P     C HH   
Sbjct: 60  DLLSCCESC----GLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKG 115

Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
              P C ++     +C   C    Y   + QDKHR   +Y V ++E AI+KEI+ +GP  
Sbjct: 116 -KYPPCGSKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 173

Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
           A F +Y+DF +YKSG+YKH +   L    H+ ++IGWG EN  PYWL+ N+W   WG+ G
Sbjct: 174 AGFTVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKAPYWLIANSWNEDWGENG 231

Query: 321 TVKILRGKYECAFEYLIAAGKP 342
             +I+RG+ EC+ E  + AG+ 
Sbjct: 232 YFRIVRGRDECSIESEVTAGRI 253


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  210 bits (537), Expect = 6e-65
 Identities = 80/339 (23%), Positives = 127/339 (37%), Gaps = 54/339 (15%)

Query: 15  RGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
              LYK+   ++  IN    +WTA        L+   + +       +  +  RP P   
Sbjct: 140 SNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGG---HSRKIPRPKPAPL 196

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
                +    +P  +D R        +  V +  +C + + FA++G    R  I +   Q
Sbjct: 197 TAEIQQKILFLPTSWDWRNVHG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQ 255

Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDY---GDRTGCQPS 189
              LS + V SC +       + C  G  +     +    G V    +   G  + C+  
Sbjct: 256 TPILSPQEVVSCSQY-----AQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM- 309

Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
                                                R +  + H     ++   NE  +
Sbjct: 310 --------------------------------KEDCFRYYSSEYHYVG-GFYGGCNEALM 336

Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGTEN--GT 303
           K E++ HGP    F +YDDF HYK G+Y HT      N      H+  L+G+GT++  G 
Sbjct: 337 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM 396

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
            YW+V N+WG  WG+ G  +I RG  ECA E +  A  P
Sbjct: 397 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  191 bits (487), Expect = 3e-59
 Identities = 55/265 (20%), Positives = 90/265 (33%), Gaps = 41/265 (15%)

Query: 74  KTYDPEYSATVPDRFDAREQWPNCGTIGHVPD------TGACAAPHIFAAVGAFSDRRCI 127
           + ++    A +P  +D R             +       G+C A    A+  A +DR  I
Sbjct: 26  RPHEYLSPADLPKSWDWRNVDG-VNYASITRNQHIPQYCGSCWA---HASTSAMADRINI 81

Query: 128 KSKGQQNRP-LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
           K KG      LS + V  C       +  SC  G+    W++ H+ G            C
Sbjct: 82  KRKGAWPSTLLSVQNVIDCG------NAGSCEGGNDLSVWDYAHQHGIPDET-------C 128

Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
                                            C +       ++        Y      
Sbjct: 129 NNYQAKDQECDKFNQC---------GTCNEFKEC-HAIRNYTLWRVGD-----YGSLSGR 173

Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
           + +  EI A+GP +      +   +Y  G+Y    +    N  H   + GWG  +GT YW
Sbjct: 174 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN--HVVSVAGWGISDGTEYW 231

Query: 307 LVINTWGPHWGDRGTVKILRGKYEC 331
           +V N+WG  WG+RG ++I+   Y+ 
Sbjct: 232 IVRNSWGEPWGERGWLRIVTSTYKD 256


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  123 bits (311), Expect = 3e-33
 Identities = 42/267 (15%), Positives = 86/267 (32%), Gaps = 46/267 (17%)

Query: 68  PLPGDRKTYDPEYS--ATVPDRFDAREQWP-----NCGTIGHVPDTGACAAPHIFAAVGA 120
                  +Y PE S  A +P + D    +        G+         C A    A   A
Sbjct: 39  IADIRDFSYTPEKSVIAALPPKVDLTPPFQVYDQGRIGS---------CTA---NALAAA 86

Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGD 179
               R    +  +  P    ++    +  + + + +   G++ R     LHK G     +
Sbjct: 87  IQFERIHDKQSPEFIPSR-LFIYYNER--KIEGHVNYDSGAMIRDGIKVLHKLGVCPEKE 143

Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
           +       P   +P          P     K P  +C+                ++ T  
Sbjct: 144 W-------PYGDTPADPRTEEFP-PGAPASKKPSDQCYK-----------DAQNYKITEY 184

Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
             V  + D +K  +    P    F++Y+ +    S   +     K +     H+   +G+
Sbjct: 185 SRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGY 244

Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKI 324
             ++   ++ + N+WG + G+ G   +
Sbjct: 245 --DDEIRHFRIRNSWGNNVGEDGYFWM 269


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score = 94.3 bits (235), Expect = 1e-22
 Identities = 53/247 (21%), Positives = 81/247 (32%), Gaps = 31/247 (12%)

Query: 94  WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDD 153
             NC +   V D G C    IFA+       RC+K  G +   +S  YVA+C K    + 
Sbjct: 16  ENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMK--GYEPTKISALYVANCYKG---EH 70

Query: 154 NKSCSHGSVFRT-WNFLHKRGSV-TGGDY---GDRTGCQPSTISPCSHHGSAPTLPSCEN 208
              C  GS        +   G +    +Y     + G Q             P +     
Sbjct: 71  KDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQ------------CPKVEDHWM 118

Query: 209 QKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF-ALY 266
                 K       P +     +           +D     IK E++  G   A   A  
Sbjct: 119 NLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAEN 178

Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-----NGTPYWLVINTWGPHWGDRGT 321
              Y +     K+       +  H+  ++G+G           YW+V N+WGP+WGD G 
Sbjct: 179 VMGYEFSGKKVKNLCGDDTAD--HAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGY 236

Query: 322 VKILRGK 328
            K+    
Sbjct: 237 FKVDMYG 243


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score = 80.2 bits (199), Expect = 7e-18
 Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 6/91 (6%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
           +  +E  I   +  +GP      A    +  Y  GV     + +L+   H   L+G+   
Sbjct: 118 LPQDEAQIAAWLAVNGPVAVAVDA--SSWMTYTGGVMTSCVSEQLD---HGVLLVGYNDS 172

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
              PYW++ N+W   WG+ G ++I +G  +C
Sbjct: 173 AAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 203


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score = 79.8 bits (198), Expect = 1e-17
 Identities = 34/88 (38%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGTENGT 303
           +E+A+ + +  + P +  F + +DF  Y+ G+Y  TS  K  + + H+   +G+G ENG 
Sbjct: 120 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI 179

Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYEC 331
           PYW+V N+WGP WG  G   I RGK  C
Sbjct: 180 PYWIVKNSWGPQWGMNGYFLIERGKNMC 207


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score = 78.3 bits (194), Expect = 3e-17
 Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 4/92 (4%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGT 299
           +  NE  +   +   GP +    A       Y+ G+ +          + H+  L+G+G 
Sbjct: 113 LSQNEQKLAAWLAKRGPISVAINA--FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQ 170

Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
            +  P+W + N+WG  WG++G   + RG   C
Sbjct: 171 RSDVPFWAIKNSWGTDWGEKGYYYLHRGSGAC 202


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score = 78.8 bits (195), Expect = 1e-16
 Identities = 30/87 (34%), Positives = 40/87 (45%), Gaps = 5/87 (5%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
           +E+ +   +   GP    F   D F  Y  GVY +      K     H+  ++G+G ENG
Sbjct: 234 DENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFT---HAVLIVGYGNENG 290

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKY 329
             YWLV N+WG  WG  G  KI R   
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIARNAN 317



 Score = 33.0 bits (76), Expect = 0.12
 Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 9/101 (8%)

Query: 25  YIDQINREAN----TWTAGRNFPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
             ++ N +      ++T G N   +++ EE                +      R+     
Sbjct: 52  TFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLN 111

Query: 80  YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
            S   P  FD R+Q    G +  V + G+C +   F++ GA
Sbjct: 112 ASVRYPASFDWRDQ----GMVSPVKNQGSCGSSWAFSSTGA 148


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score = 76.0 bits (188), Expect = 2e-16
 Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 5/85 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE  +    +A  P +    +    F  YK G+++     K++   H+   +G+G   G 
Sbjct: 117 NEGNLLN-AIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD---HAVTAVGYGKSGGK 172

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            Y L+ N+WG  WG++G ++I R  
Sbjct: 173 GYILIKNSWGTAWGEKGYIRIKRAP 197


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score = 74.1 bits (183), Expect = 1e-15
 Identities = 28/85 (32%), Positives = 47/85 (55%), Gaps = 5/85 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE A++   +A+ P +    A   +F HY SG++       ++   H+  ++G+GTE G 
Sbjct: 120 NEWALQT-AVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVD---HAVTIVGYGTEGGI 175

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            YW+V N+WG  WG+ G ++I R  
Sbjct: 176 DYWIVKNSWGTTWGEEGYMRIQRNV 200


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score = 73.7 bits (182), Expect = 1e-15
 Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 8/103 (7%)

Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKL 285
                  ++  +     +NE A++   +A  P + T  A    F HY SG++        
Sbjct: 98  PYRLRVVSINGFQRVTRNNESALQS-AVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQ 156

Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
               H   ++G+GT++G  YW+V N+WG +WG++G + + R  
Sbjct: 157 N---HGVVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIWMERNV 196


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score = 73.7 bits (182), Expect = 1e-15
 Identities = 28/85 (32%), Positives = 49/85 (57%), Gaps = 5/85 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE +++K  +A+ P + T  A   DF  Y+SG++  + N       H+  ++G+GTEN  
Sbjct: 119 NEQSLQK-AVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISAN---HALTVVGYGTENDK 174

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            +W+V N+WG +WG+ G ++  R  
Sbjct: 175 DFWIVKNSWGKNWGESGYIRAERNI 199


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score = 73.7 bits (182), Expect = 1e-15
 Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTEN 301
           NE A+K+ +   GP +    A    F  Y  GVY     ++  L    H+   +G+G + 
Sbjct: 117 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLN---HAVLAVGYGIQK 173

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
           G  +W++ N+WG +WG++G + + R K
Sbjct: 174 GNKHWIIKNSWGENWGNKGYILMARNK 200


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score = 75.0 bits (185), Expect = 2e-15
 Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 5/86 (5%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
           +E  +K  + A GP      +  DF  Y+SG+Y+    S  ++    H+   +G+GT+ G
Sbjct: 209 SEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQSQTCSPLRVN---HAVLAVGYGTQGG 265

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
           T YW+V N+WG  WG+RG ++++R +
Sbjct: 266 TDYWIVKNSWGLSWGERGYIRMVRNR 291


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score = 73.3 bits (181), Expect = 2e-15
 Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
            E +     LA+ P +    A    F  YKSGV+      KL+   H+   +G+GT +G 
Sbjct: 117 CETSFLG-ALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLD---HAVTAVGYGTSDGK 172

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            Y ++ N+WGP+WG++G +++ R  
Sbjct: 173 NYIIIKNSWGPNWGEKGYMRLKRQS 197


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score = 73.3 bits (181), Expect = 2e-15
 Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 7/108 (6%)

Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEI-LAHGPTT--ATFALYDDFYHYKSGVYKHTSNA 283
             +  R  ++ +      N + I++ +   H            D F HY         N 
Sbjct: 105 RPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNG 164

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
              NY H+  ++G+    G  YW+V N+W  +WGD G           
Sbjct: 165 YQPNY-HAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLM 211


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score = 75.0 bits (185), Expect = 2e-15
 Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 5/85 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE  +   I A  P +    +    F  YK G+++     K++    +   +G+G   G 
Sbjct: 223 NEGNLLNAI-AKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD---GAVTAVGYGKSGGK 278

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            Y L+ N+WG  WG++G ++I R  
Sbjct: 279 GYILIKNSWGTAWGEKGYIRIKRAP 303


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score = 74.2 bits (183), Expect = 3e-15
 Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 6/87 (6%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWGTEN 301
           NE A+K+ +   GP +    A    F  Y  GVY  +  ++  L    H+   +G+G + 
Sbjct: 216 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLN---HAVLAVGYGIQK 272

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
           G  +W++ N+WG +WG++G + + R K
Sbjct: 273 GNKHWIIKNSWGENWGNKGYILMARNK 299


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score = 73.8 bits (182), Expect = 4e-15
 Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 5/86 (5%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
           +E+++   +   GP        D+   Y  G++     + + L    H   ++G+G++NG
Sbjct: 232 DENSLADAVGQAGPVAVAIDATDELQFYSGGLFYDQTCNQSDLN---HGVLVVGYGSDNG 288

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
             YW++ N+WG  WG+ G  + +R  
Sbjct: 289 QDYWILKNSWGSGWGESGYWRQVRNY 314


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score = 72.6 bits (179), Expect = 5e-15
 Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 4/86 (4%)

Query: 244 DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
           + E +++   +   P + +  A   DF+ Y  G+Y   + +      H   ++G+G+E+G
Sbjct: 124 EAESSLQS-FVLEQPISVSIDA--KDFHFYSGGIYDGGNCSSPYGINHFVLIVGYGSEDG 180

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
             YW+  N+WG  WG  G ++I R  
Sbjct: 181 VDYWIAKNSWGEDWGIDGYIRIQRNT 206


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score = 71.0 bits (175), Expect = 1e-14
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTENG 302
            ED +K+ +   GP +    A +  F+ Y+SGVY   S    +    H   ++G+G  NG
Sbjct: 121 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNG 177

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
             YWLV N+WG ++G+ G +++ R K
Sbjct: 178 KEYWLVKNSWGHNFGEEGYIRMARNK 203


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score = 71.9 bits (177), Expect = 2e-14
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTENG 302
            ED +K+ +   GP +    A +  F+ Y+SGVY   S    +    H   ++G+G  NG
Sbjct: 218 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNG 274

Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
             YWLV N+WG ++G+ G +++ R K
Sbjct: 275 KEYWLVKNSWGHNFGEEGYIRMARNK 300


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score = 71.5 bits (176), Expect = 3e-14
 Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 7/108 (6%)

Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEI-LAHGPTT--ATFALYDDFYHYKSGVYKHTSNA 283
             +  R  ++ +      N + I++ +   H            D F HY         N 
Sbjct: 185 RPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNG 244

Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
              NY H+  ++G+    G  YW+V N+W  +WGD G           
Sbjct: 245 YQPNY-HAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLM 291



 Score = 29.1 bits (66), Expect = 2.2
 Identities = 17/100 (17%), Positives = 33/100 (33%), Gaps = 19/100 (19%)

Query: 25  YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
           Y+       N      +  ++LS +  +   +  A+ F+        + +T     +   
Sbjct: 38  YVQSNGGAIN------HL-SDLSLDEFKNRFLMSAEAFEHLKTQFDLNAETNACSINGNA 90

Query: 85  PDRFDAREQWPNCGTIGHVPDTGAC----AAPHIFAAVGA 120
           P   D R+      T+  +   G C    A    F+ V A
Sbjct: 91  PAEIDLRQM----RTVTPIRMQGGCGSAWA----FSGVAA 122


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score = 70.3 bits (173), Expect = 3e-14
 Identities = 26/108 (24%), Positives = 48/108 (44%), Gaps = 13/108 (12%)

Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
           +   R T+  +V   +D  K+ +   GP + + A  DDF  Y+ G Y     A      H
Sbjct: 120 RCNERYTIKSYVSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPN---H 176

Query: 291 SGKLIGWGTEN----------GTPYWLVINTWGPHWGDRGTVKILRGK 328
           +  L+G+G ++             Y+++ N+WG  WG+ G + +   +
Sbjct: 177 AVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDE 224


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score = 69.9 bits (172), Expect = 4e-14
 Identities = 24/110 (21%), Positives = 51/110 (46%), Gaps = 13/110 (11%)

Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
             +   +  +  ++   ++ +K+ +   GP + + A+ DDF  YK G++      +L   
Sbjct: 116 IDRCTEKYGIKNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQLN-- 173

Query: 289 LHSGKLIGWGTENG----------TPYWLVINTWGPHWGDRGTVKILRGK 328
            H+  L+G+G +              Y+++ N+WG  WG+RG + I   +
Sbjct: 174 -HAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE 222


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score = 68.3 bits (168), Expect = 1e-13
 Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 10/94 (10%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWG 298
           +   E A+ K +   GP +    A ++ F  YK G+Y     S+  ++   H   ++G+G
Sbjct: 115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---HGVLVVGYG 171

Query: 299 TE----NGTPYWLVINTWGPHWGDRGTVKILRGK 328
            E    +   YWLV N+WG  WG  G VK+ + +
Sbjct: 172 FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 205


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score = 67.2 bits (165), Expect = 3e-13
 Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 6/85 (7%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NG 302
           +E+A+ K  +A+ P +    A   DF  Y  GV+  +   +L+   H   ++G+GT  +G
Sbjct: 120 DENALLK-AVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELD---HGVAIVGYGTTIDG 175

Query: 303 TPYWLVINTWGPHWGDRGTVKILRG 327
           T YW V N+WGP WG++G +++ RG
Sbjct: 176 TKYWTVKNSWGPEWGEKGYIRMERG 200


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score = 66.4 bits (163), Expect = 5e-13
 Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 7/87 (8%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--EN 301
           +E A+K   LA  P +    A    F  Y  GV+  +    L+   H   L+G+GT  E+
Sbjct: 125 SEAAMKA-ALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLD---HGVLLVGYGTDKES 180

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
              +W++ N+WG  WG  G + +   K
Sbjct: 181 KKDFWIMKNSWGTGWGRDGYMYMAMHK 207


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score = 67.7 bits (166), Expect = 5e-13
 Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 10/94 (10%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWG 298
           +   E A+ K +   GP +    A ++ F  YK G+Y     S+  ++   H   ++G+G
Sbjct: 211 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---HGVLVVGYG 267

Query: 299 TE----NGTPYWLVINTWGPHWGDRGTVKILRGK 328
            E    +   YWLV N+WG  WG  G VK+ + +
Sbjct: 268 FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 301


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score = 66.5 bits (163), Expect = 8e-13
 Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 6/87 (6%)

Query: 244 DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TEN 301
           ++E+ + +  +A+ P +    A    F  Y  GV+      +L+   H   ++G+G  E+
Sbjct: 124 NSEEDLAR-AVANQPVSVAVEASGKAFMFYSEGVFTGECGTELD---HGVAVVGYGVAED 179

Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
           G  YW V N+WGP WG++G +++ +  
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDS 206


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score = 64.0 bits (157), Expect = 3e-12
 Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 9/85 (10%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE A+    +A+ P +    A   DF  Y+ G++      K++   H+   +G+G     
Sbjct: 117 NEGALLY-SIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD---HAVAAVGYGPN--- 169

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            Y L+ N+WG  WG+ G ++I RG 
Sbjct: 170 -YILIKNSWGTGWGENGYIRIKRGT 193


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score = 62.9 bits (154), Expect = 7e-12
 Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 9/85 (10%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           NE A+ +  +A  P +    A    F +Y+ G++       ++   H+   +G+G +   
Sbjct: 117 NEQALIQ-RIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSID---HAVAAVGYGND--- 169

Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
            Y L+ N+WG  WG+ G ++I RG 
Sbjct: 170 -YILIKNSWGTGWGEGGYIRIKRGS 193


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score = 61.4 bits (149), Expect = 3e-11
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 5/91 (5%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKS-GVYKHTSNAKLENYL-HSGKLIGWG 298
           V ++  A+   + A  P +         F  Y   G++  +S +     + H+  ++G+G
Sbjct: 112 VPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYG 170

Query: 299 TE-NGTPYWLVINTWGPHWGDRGTVKILRGK 328
           +      YW+V N+WG  WG  G + I R  
Sbjct: 171 SNGTNADYWIVKNSWGTEWGIDGYILIRRNT 201


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score = 60.9 bits (149), Expect = 3e-11
 Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 9/84 (10%)

Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
           +E ++ +  +A+ P +    +    F  YK G+Y+       +   H+   +G+G     
Sbjct: 117 DEISLIQ-AIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTD---HAVTAVGYGKT--- 169

Query: 304 PYWLVINTWGPHWGDRGTVKILRG 327
            Y L+ N+WGP+WG++G ++I R 
Sbjct: 170 -YLLLKNSWGPNWGEKGYIRIKRA 192


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score = 58.4 bits (142), Expect = 3e-10
 Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
           V    +   K+ +A  P+T    A    F  Y SG++      KL    H   ++G+   
Sbjct: 111 VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLN---HGVTIVGYQAN 167

Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRG 327
               YW+V N+WG +WG++G +++LR 
Sbjct: 168 ----YWIVRNSWGRYWGEKGYIRMLRV 190


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
            acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
            synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 38.1 bits (88), Expect = 0.005
 Identities = 36/222 (16%), Positives = 58/222 (26%), Gaps = 81/222 (36%)

Query: 17   ELYKFSDAYIDQINREANTWT-AGRNFPANLSEEYLRQFLIAD---------AKYFD-QS 65
            +LYK S A      ++   W  A  +F           F I D           +F  + 
Sbjct: 1634 DLYKTSKAA-----QDV--WNRADNHFKDTYG------FSILDIVINNPVNLTIHFGGEK 1680

Query: 66   DRPLPGDRKTYDPEYSATVPDRFDAREQW-----PNCGTIGHVPDTGACAA-----PHIF 115
             + +   R+ Y      T+ D     E+       +  +     + G  +A     P + 
Sbjct: 1681 GKRI---RENYSAMIFETIVDGKLKTEKIFKEINEHSTSYTFRSEKGLLSATQFTQPALT 1737

Query: 116  AA-VGAFSDRRCIKSKGQQNRPLST--------EYVASCCKICRYDDNKSCSHGSVFRTW 166
                 AF     +KSKG    P           EY A                  V    
Sbjct: 1738 LMEKAAF---EDLKSKG--LIPADATFAGHSLGEYAALASL------------ADVM--- 1777

Query: 167  NF------LHKRGS-----VTGGDYGDRT----GCQPSTISP 193
            +       +  RG      V   + G          P  ++ 
Sbjct: 1778 SIESLVEVVFYRGMTMQVAVPRDELGRSNYGMIAINPGRVAA 1819



 Score = 33.5 bits (76), Expect = 0.12
 Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 32/95 (33%)

Query: 14  VRGELYKFSDAYIDQINREA-----------NTWTAGRNF-----PANLS--EEYLRQFL 55
           +     +    Y+++ N              N     +N      P +L      LR+  
Sbjct: 341 ISNLTQEQVQDYVNKTNSHLPAGKQVEISLVN---GAKNLVVSGPPQSLYGLNLTLRK-A 396

Query: 56  IADAKYFDQSDRPLPGDRKTYDPEYSA-----TVP 85
            A +   DQS  P   +RK    ++S        P
Sbjct: 397 KAPSG-LDQSRIPFS-ERK---LKFSNRFLPVASP 426



 Score = 33.5 bits (76), Expect = 0.13
 Identities = 51/307 (16%), Positives = 90/307 (29%), Gaps = 100/307 (32%)

Query: 43  PANLSEEYLRQ-FLIADAKYF------DQ--SDRPLPGDRKTYDPEYSATVP--DRFDAR 91
           P  LS   L    L+  A +F      +Q     P P +    D E +       +F   
Sbjct: 8   PLTLSHGSLEHVLLVPTASFFIASQLQEQFNKILPEPTEGFAADDEPTTPAELVGKF--- 64

Query: 92  EQWPNCGTIGHV-----PDTGACAAPHIFAAVGAFSDRRCIKSK----------GQQNRP 136
                   +G+V     P         +   +  F +   ++             + +  
Sbjct: 65  --------LGYVSSLVEPSKVGQFDQVLNLCLTEF-ENCYLEGNDIHALAAKLLQENDTT 115

Query: 137 LSTE------YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT-----GG-----DY 180
           L         Y+ +     R  D KS S  ++FR        G+       GG     DY
Sbjct: 116 LVKTKELIKNYITARIMAKRPFDKKSNS--ALFRA----VGEGNAQLVAIFGGQGNTDDY 169

Query: 181 GD------RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
            +      +T      +       SA TL          L   T      + +G   +  
Sbjct: 170 FEELRDLYQTY--HVLVGDLIKF-SAETLSE--------LIRTTLDAEKVFTQGL--N-- 214

Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
              +  W+++  +   K+ L   P   +  L         GV       +L +Y+ + KL
Sbjct: 215 ---ILEWLENPSNTPDKDYLLSIP--ISCPL--------IGVI------QLAHYVVTAKL 255

Query: 295 IGWGTEN 301
           +G+    
Sbjct: 256 LGFTPGE 262


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 34.8 bits (79), Expect = 0.041
 Identities = 18/102 (17%), Positives = 36/102 (35%), Gaps = 18/102 (17%)

Query: 3   HILVFLLGC-TLV--RGELYKFSDAYI-DQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
               F L C  L+  R          + D ++    T  +  +    L+ + ++  L   
Sbjct: 258 AWNAFNLSCKILLTTR-------FKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL--- 307

Query: 59  AKYFDQSDRPLPGDRKTYDPEY----SATVPDRFDAREQWPN 96
            KY D   + LP +  T +P      + ++ D     + W +
Sbjct: 308 LKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKH 349



 Score = 29.1 bits (64), Expect = 2.4
 Identities = 12/63 (19%), Positives = 26/63 (41%), Gaps = 5/63 (7%)

Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGP-TTATFA--LYDDFYHYKSGVYKHTSNAK 284
               D+ ++ L  ++D     + +E+L   P   +  A  + D    + +  +KH +  K
Sbjct: 297 TLTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDN--WKHVNCDK 354

Query: 285 LEN 287
           L  
Sbjct: 355 LTT 357


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 34.4 bits (78), Expect = 0.048
 Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 290 HSGKLIGWGT-ENGTPYWLVINTWGPHWGDRG 320
           H  ++ G    + G  Y++V N+WG +    G
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNG 349


>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
          photosynthetic reaction center, peripheral antenna;
          HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
          Length = 154

 Score = 29.9 bits (66), Expect = 0.64
 Identities = 7/22 (31%), Positives = 10/22 (45%), Gaps = 3/22 (13%)

Query: 64 QSDRPLPGDRKTYDPEYSATVP 85
          Q+ + L    K Y  + SA  P
Sbjct: 20 QALKKLQASLKLYADD-SA--P 38


>3s88_I GP1, GP, envelope glycoprotein; glycosylation, viral membrane,
           immune system-viral protein C; HET: NAG; 3.35A {Sudan
           ebolavirus} PDB: 3ve0_I*
          Length = 298

 Score = 29.4 bits (65), Expect = 1.4
 Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            P+ S  +P   D    +P C  +     TG C   + F   GAF
Sbjct: 100 KPDGSECLPPPPDGVRGFPRCRYVHKAQGTGPCPGDYAFHKDGAF 144


>3csy_I Envelope glycoprotein GP1; glycoprotein-antibody complex, immune
           system-viral protein C; HET: NAG BMA MAN; 3.40A {Zaire
           ebola virus}
          Length = 334

 Score = 29.5 bits (65), Expect = 1.7
 Identities = 15/45 (33%), Positives = 19/45 (42%)

Query: 77  DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
            P+ S  +P   D    +P C  +  V  TG CA    F   GAF
Sbjct: 100 KPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAF 144


>2e8b_A Probable molybdopterin-guanine dinucleotide biosy protein A;
           putative protein, molybdenum cofactor, structural G
           NPPSFA; 1.61A {Aquifex aeolicus}
          Length = 201

 Score = 28.0 bits (63), Expect = 3.7
 Identities = 7/55 (12%), Positives = 16/55 (29%), Gaps = 4/55 (7%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG--KLIGW 297
            ++ +    +             +  H   GVY      K+E  +  G  ++   
Sbjct: 111 KKETVLY--VLENFKEPVSVAKTEKLHTLVGVYSKKLLEKIEERIKKGDYRIWAL 163


>3ejf_A Non-structural protein 3; IBV, coronavirus, X-domain, macro domain,
           NSP3, ADRP, hydrolase, ribosomal frameshifting; 1.60A
           {Avian infectious bronchitis virus} PDB: 3eke_A* 3ewo_A
           3ewp_A*
          Length = 176

 Score = 27.6 bits (62), Expect = 4.9
 Identities = 8/28 (28%), Positives = 10/28 (35%)

Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEY 335
           V N  GP  GD    + L   Y+     
Sbjct: 94  VNNVVGPRHGDNNLHEKLVAAYKNVLVD 121


>1olr_A Endo-beta-1,4-glucanase; hydrolase, cellulase, cellulose
           degradation, endoglucanase, glycosyl hydrolase, GH
           family 12, humicola grisea CEL12A; HET: PCA; 1.2A
           {Humicola grisea} SCOP: b.29.1.11 PDB: 1uu4_A* 1uu5_A*
           1uu6_A* 1w2u_A*
          Length = 224

 Score = 27.6 bits (61), Expect = 5.8
 Identities = 8/28 (28%), Positives = 11/28 (39%)

Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKI 324
           +G  +G  Y L+ N WG      G    
Sbjct: 9   YGYWSGNGYELLNNLWGKDTATSGWQCT 36


>2xd3_A MALX, maltose/maltodextrin-binding protein; solute-binding protein,
           sugar binding protein, virulence, alpha-glucan, sugar
           transport; HET: GLC; 2.00A {Streptococcus pneumoniae}
           PDB: 2xd2_A*
          Length = 416

 Score = 27.4 bits (61), Expect = 6.9
 Identities = 5/20 (25%), Positives = 10/20 (50%)

Query: 236 TTLTYWVDDNEDAIKKEILA 255
             LT +VD+   +  +E+  
Sbjct: 35  KELTVYVDEGYKSYIEEVAK 54


>1e5k_A Molybdopterin-guanine dinucleotide biosynthesis protein A;
           molybdopterin nucleotidyl-transferase,; HET: CIT; 1.35A
           {Escherichia coli} SCOP: c.68.1.8 PDB: 1h4e_A* 1hjl_A*
           1hjj_A* 1h4c_A* 1h4d_A* 1fr9_A 1frw_A*
          Length = 201

 Score = 26.9 bits (60), Expect = 8.9
 Identities = 6/55 (10%), Positives = 15/55 (27%), Gaps = 2/55 (3%)

Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG--KLIGW 297
             D   +           +    +  H    +        L  YL +G  +++ +
Sbjct: 106 PPDLAARLNHQRKDAPVVWVHDGERDHPTIALVNRAIEPLLLEYLQAGERRVMVF 160


>1y08_A Hypothetical protein SPY0861; cysteine proteinase, papain-like fold
           with major insertions, hydrolase; 1.93A {Streptococcus
           pyogenes} SCOP: d.3.1.12 PDB: 2avw_A 2au1_A
          Length = 323

 Score = 26.9 bits (58), Expect = 9.7
 Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 3/79 (3%)

Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV--YKHTSNAKLENYLHSGKL 294
            L +W D N+D IK+ +  H          +  +  K  +    H  ++KL  Y      
Sbjct: 86  MLHWWFDQNKDQIKRYLEEHPEKQKINFNGEQMFDVKEAIDTKNHQLDSKLFEYFKEKAF 145

Query: 295 IGWGTENGTPY-WLVINTW 312
               T++   +   VI+ +
Sbjct: 146 PYLSTKHLGVFPDHVIDMF 164


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.320    0.137    0.455 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,571,439
Number of extensions: 329752
Number of successful extensions: 854
Number of sequences better than 10.0: 1
Number of HSP's gapped: 775
Number of HSP's successfully gapped: 62
Length of query: 344
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 250
Effective length of database: 4,077,219
Effective search space: 1019304750
Effective search space used: 1019304750
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (25.9 bits)