RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy3960
         (351 letters)



>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  316 bits (812), Expect = e-109
 Identities = 82/199 (41%), Positives = 119/199 (59%), Gaps = 3/199 (1%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F TTGA+E A  +   K+  L++Q L+DC+  + N+GC GG   +++++I
Sbjct: 19  NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 78

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + G+  +D Y PY GQD +C      A A +    N+T N E+A+  A+A + PVS A
Sbjct: 79  RYNKGIMGEDTY-PYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFA 137

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            + +   F  Y  G+Y    C+ +PD ++HAVLAVGYGE +G PYW VKNSW   WG  G
Sbjct: 138 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 196

Query: 330 YVLMSIKDNNCGVMTAPTY 348
           Y L+    N CG+    +Y
Sbjct: 197 YFLIERGKNMCGLAACASY 215


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  305 bits (784), Expect = e-103
 Identities = 85/202 (42%), Positives = 122/202 (60%), Gaps = 5/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGS WSF TTGAVEG   ++  +L  LS+Q LIDCS  YGN GCDGG    ++ +I
Sbjct: 132 DQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYI 191

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
             +G+ ++  Y PY  Q  YC   ++ +  T++G+ ++    E++L  A+ + GPV+VAI
Sbjct: 192 HDYGIMSESAY-PYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAI 250

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           DA+     FY  G++YD+ C  +   L+H VL VGYG  +G+ YW +KNSW + WG  GY
Sbjct: 251 DATD-ELQFYSGGLFYDQTC--NQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGESGY 307

Query: 331 VLMSI-KDNNCGVMTAPTYVTM 351
                   NNCG+ TA +Y  +
Sbjct: 308 WRQVRNYGNNCGIATAASYPAL 329


>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  299 bits (767), Expect = e-102
 Identities = 94/203 (46%), Positives = 126/203 (62%), Gaps = 7/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+GC GG    ++Q++
Sbjct: 18  NQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYV 75

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E ALK A+A+ GPVSVA
Sbjct: 76  QKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVA 134

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY  GVYYDE C  + D L+HAVLAVGYG   G  +W +KNSW   WGN+G
Sbjct: 135 IDASLTSFQFYSKGVYYDESC--NSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKG 192

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+LM+  K+N CG+    ++  M
Sbjct: 193 YILMARNKNNACGIANLASFPKM 215


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  302 bits (775), Expect = e-102
 Identities = 81/202 (40%), Positives = 109/202 (53%), Gaps = 5/202 (2%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGS W+F TTG +EG Y    +     S+Q L+DCS  +GNNGC GG    +YQ++
Sbjct: 109 DQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYL 168

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
            + GL T+  Y PY   +  C        A +TGF  V   SE  LK  +   GP +VA+
Sbjct: 169 KQFGLETESSY-PYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAV 227

Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
           D     F  Y +G+Y  + C  SP  ++HAVLAVGYG   G  YW VKNSW   WG +GY
Sbjct: 228 DVES-DFMMYRSGIYQSQTC--SPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGY 284

Query: 331 VLMSI-KDNNCGVMTAPTYVTM 351
           + M   + N CG+ +  +   +
Sbjct: 285 IRMVRNRGNMCGIASLASLPMV 306


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  297 bits (764), Expect = e-101
 Identities = 84/204 (41%), Positives = 119/204 (58%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQW 209
            Q  CG+CW+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+
Sbjct: 19  YQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQY 78

Query: 210 IMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           I+ + G+ +   Y PY   D  C   +    AT + +  +    ED LK A+A  GPVSV
Sbjct: 79  IIDNKGIDSDASY-PYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSV 137

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
            +DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +
Sbjct: 138 GVDARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEE 194

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  K N+CG+ + P+Y  +
Sbjct: 195 GYIRMARNKGNHCGIASFPSYPEI 218


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  300 bits (771), Expect = e-101
 Identities = 83/204 (40%), Positives = 118/204 (57%), Gaps = 7/204 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSW-GYGNNGCDGGEDFRSYQW 209
            Q  CG+ W+F   GA+E    +K  KL  LS Q L+DCS   YGN GC+GG    ++Q+
Sbjct: 116 YQGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQY 175

Query: 210 IMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           I+ + G+ +   Y PY   D  C   +    AT + +  +    ED LK A+A  GPVSV
Sbjct: 176 IIDNKGIDSDASY-PYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSV 234

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
            +DA   SF  Y +GVYY+  C  +   ++H VL VGYG+L+GK YW VKNSW   +G +
Sbjct: 235 GVDARHPSFFLYRSGVYYEPSCTQN---VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEE 291

Query: 329 GYVLMSI-KDNNCGVMTAPTYVTM 351
           GY+ M+  K N+CG+ + P+Y  +
Sbjct: 292 GYIRMARNKGNHCGIASFPSYPEI 315


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  298 bits (766), Expect = e-100
 Identities = 94/203 (46%), Positives = 126/203 (62%), Gaps = 7/203 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F + GA+EG    K  KL  LS Q L+DC     N+GC GG    ++Q++
Sbjct: 117 NQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYV 174

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            K+ G+ ++D Y PY+GQ+  C    T   A   G+  +   +E ALK A+A+ GPVSVA
Sbjct: 175 QKNRGIDSEDAY-PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVA 233

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS  SF FY  GVYYDE C  + D L+HAVLAVGYG   G  +W +KNSW   WGN+G
Sbjct: 234 IDASLTSFQFYSKGVYYDESC--NSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKG 291

Query: 330 YVLMSI-KDNNCGVMTAPTYVTM 351
           Y+LM+  K+N CG+    ++  M
Sbjct: 292 YILMARNKNNACGIANLASFPKM 314


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  292 bits (750), Expect = 1e-99
 Identities = 90/207 (43%), Positives = 125/207 (60%), Gaps = 10/207 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN GC+GG    ++Q++
Sbjct: 18  NQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV 77

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL +++ Y PY   +  C      + A  TGFV++ P  E AL  A+A  GP+SVA
Sbjct: 78  QDNGGLDSEESY-PYEATEESCKYNPKYSVANDTGFVDI-PKQEKALMKAVATVGPISVA 135

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           IDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D   YW VKNSW   W
Sbjct: 136 IDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEW 193

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           G  GYV M+  + N+CG+ +A +Y T+
Sbjct: 194 GMGGYVKMAKDRRNHCGIASAASYPTV 220


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  293 bits (753), Expect = 1e-98
 Identities = 89/207 (42%), Positives = 124/207 (59%), Gaps = 10/207 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F  TGA+EG  + K  +L  LS+Q L+DCS   GN GC+GG    ++Q++
Sbjct: 114 NQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV 173

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             + GL +++ Y PY   +  C      + A   GFV++ P  E AL  A+A  GP+SVA
Sbjct: 174 QDNGGLDSEESY-PYEATEESCKYNPKYSVANDAGFVDI-PKQEKALMKAVATVGPISVA 231

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----ELDGKPYWQVKNSWSTYW 325
           IDA  +SF FY  G+Y++  C  S + +DH VL VGYG    E D   YW VKNSW   W
Sbjct: 232 IDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEW 289

Query: 326 GNQGYVLMSI-KDNNCGVMTAPTYVTM 351
           G  GYV M+  + N+CG+ +A +Y T+
Sbjct: 290 GMGGYVKMAKDRRNHCGIASAASYPTV 316


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  292 bits (749), Expect = 7e-98
 Identities = 70/205 (34%), Positives = 108/205 (52%), Gaps = 10/205 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKL--AVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           +Q  CGS W+F +TGA+E    + +     + +S+Q L+DC       GC GG    ++ 
Sbjct: 133 NQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCV--PNALGCSGGWMNDAFT 190

Query: 209 WIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVS 267
           ++ ++ G+ ++  Y PY   D  CH       A ++G+V ++   E+ L   +A  GPV+
Sbjct: 191 YVAQNGGIDSEGAY-PYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVA 249

Query: 268 VAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGN 327
           VA DA    F  Y  GVYY+  C    +   HAVL VGYG  +G+ YW VKNSW   WG 
Sbjct: 250 VAFDADD-PFGSYSGGVYYNPTC--ETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGL 306

Query: 328 QGYVLMS-IKDNNCGVMTAPTYVTM 351
            GY  ++   +N+CG+    +  T+
Sbjct: 307 DGYFKIARNANNHCGIAGVASVPTL 331


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  287 bits (737), Expect = 3e-96
 Identities = 54/203 (26%), Positives = 87/203 (42%), Gaps = 9/203 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            Q  CGS W+F    A E AY     +   L++Q L+DC+     +GC G    R  ++I
Sbjct: 107 MQGGCGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCA---SQHGCHGDTIPRGIEYI 163

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK-HGPVSVA 269
             +G+  +  Y  Y+ ++  C   N      ++ +  + P + + ++ ALA+ H  ++V 
Sbjct: 164 QHNGVVQESYY-RYVAREQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVI 221

Query: 270 IDASQ-KSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           I      +F  Y                  HAV  VGY    G  YW V+NSW T WG+ 
Sbjct: 222 IGIKDLDAFRHYDGRTIIQRDN--GYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDN 279

Query: 329 GYVLMSIKDNNCGVMTAPTYVTM 351
           GY   +   +   +   P  V +
Sbjct: 280 GYGYFAANIDLMMIEEYPYVVIL 302


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  280 bits (718), Expect = 1e-94
 Identities = 54/203 (26%), Positives = 88/203 (43%), Gaps = 9/203 (4%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            Q  CGS W+F    A E AY    ++   L++Q L+DC+     +GC G    R  ++I
Sbjct: 27  MQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELVDCA---SQHGCHGDTIPRGIEYI 83

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAK-HGPVSVA 269
             +G+  +  Y  Y+ ++  C   N      ++ +  + P + + ++ ALA+ H  ++V 
Sbjct: 84  QHNGVVQESYY-RYVAREQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVI 141

Query: 270 IDASQ-KSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           I      +F  Y                  HAV  VGY    G  YW V+NSW T WG+ 
Sbjct: 142 IGIKDLDAFRHYDGRTIIQRDN--GYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDN 199

Query: 329 GYVLMSIKDNNCGVMTAPTYVTM 351
           GY   +   +   +   P  V +
Sbjct: 200 GYGYFAANIDLMMIEEYPYVVIL 222


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  278 bits (713), Expect = 7e-94
 Identities = 88/207 (42%), Positives = 120/207 (57%), Gaps = 12/207 (5%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F TTGA+EGA+  K  KL  LS+Q L+DCS   GN  C GGE   ++Q++
Sbjct: 24  DQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYV 83

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           +   G+ ++D Y PYL +D  C   +      + GF +V   SE A+K ALA   PVS+A
Sbjct: 84  LDSGGICSEDAY-PYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALA-KSPVSIA 141

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG--ELDGKPYWQVKNSWSTYWGN 327
           I+A Q  F FY  GV +D  C      LDH VL VGYG  +   K +W +KNSW T WG 
Sbjct: 142 IEADQMPFQFYHEGV-FDASCGTD---LDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGR 197

Query: 328 QGYVLMS---IKDNNCGVMTAPTYVTM 351
            GY+ M+    ++  CG++   ++  M
Sbjct: 198 DGYMYMAMHKGEEGQCGLLLDASFPVM 224


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  270 bits (693), Expect = 6e-91
 Identities = 74/206 (35%), Positives = 110/206 (53%), Gaps = 11/206 (5%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGS W+F T  AVEG   +    L  LS+Q L+DC       GCDGG     +Q+I
Sbjct: 18  DQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFI 77

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHI-ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           + + G+ T+ +Y PY  ++  C++        ++  + NV  N+E AL+ A+A + PVSV
Sbjct: 78  INNGGINTEANY-PYTAEEGQCNLDLQQEKYVSIDTYENVPYNNEWALQTAVA-YQPVSV 135

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
           A++A+  +F  Y +G++    C  +   +DHAV  VGYG   G  YW VKNSW T WG +
Sbjct: 136 ALEAAGYNFQHYSSGIFTGP-CGTA---VDHAVTIVGYGTEGGIDYWIVKNSWGTTWGEE 191

Query: 329 GYVLMS---IKDNNCGVMTAPTYVTM 351
           GY+ +         CG+    +Y   
Sbjct: 192 GYMRIQRNVGGVGQCGIAKKASYPVK 217


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  269 bits (690), Expect = 2e-90
 Identities = 77/202 (38%), Positives = 104/202 (51%), Gaps = 7/202 (3%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ +CGSCW+F  TG VEG +++    L  LS+Q L+DC     +  C GG    +Y  I
Sbjct: 18  DQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLPSNAYSAI 75

Query: 211 MK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
               GL T+DDY  Y G    C  +   A   +   V +   +E  L   LAK GP+SVA
Sbjct: 76  KNLGGLETEDDY-SYQGHMQSCQFSAEKAKVYIQDSVEL-SQNEQKLAAWLAKRGPISVA 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           I+A      FY +G+    +   SP  +DHAVL VGYG+    P+W +KNSW T WG +G
Sbjct: 134 INAF--GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWGEKG 191

Query: 330 YVLMSIKDNNCGVMTAPTYVTM 351
           Y  +      CGV T  +   +
Sbjct: 192 YYYLHRGSGACGVNTMASSAVV 213


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  269 bits (691), Expect = 4e-89
 Identities = 69/206 (33%), Positives = 101/206 (49%), Gaps = 13/206 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            Q  CGSCW+F     VEG   ++  KL  LS+Q L+DC     ++GC GG    + +++
Sbjct: 124 HQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYV 181

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            K+G+  +  Y PY  +   C            +G   V PN+E  L  A+A   PVSV 
Sbjct: 182 AKNGIHLRSKY-PYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIA-KQPVSVV 239

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +++  + F  Y  G++    C      +D AV AVGYG+  GK Y  +KNSW T WG +G
Sbjct: 240 VESKGRPFQLYKGGIFEGP-CGTK---VDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKG 295

Query: 330 YVLMSIKDNN----CGVMTAPTYVTM 351
           Y+ +     N    CG+  +  Y T 
Sbjct: 296 YIRIKRAPGNSPGVCGLYKSSYYPTK 321


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  260 bits (667), Expect = 5e-87
 Identities = 69/203 (33%), Positives = 101/203 (49%), Gaps = 13/203 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            Q  CGSCW+F     VEG   ++  KL  LS+Q L+DC     ++GC GG    + +++
Sbjct: 18  HQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            K+G+  +  Y PY  +   C            +G   V PN+E  L  A+A   PVSV 
Sbjct: 76  AKNGIHLRSKY-PYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIA-KQPVSVV 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +++  + F  Y  G++    C      +DHAV AVGYG+  GK Y  +KNSW T WG +G
Sbjct: 134 VESKGRPFQLYKGGIFEGP-CGTK---VDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKG 189

Query: 330 YVLMSIKDNN----CGVMTAPTY 348
           Y+ +     N    CG+  +  Y
Sbjct: 190 YIRIKRAPGNSPGVCGLYKSSYY 212


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  258 bits (663), Expect = 2e-86
 Identities = 76/206 (36%), Positives = 103/206 (50%), Gaps = 13/206 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F T   VEG   +    L  LS+Q L+DC     + GC GG    S Q++
Sbjct: 18  NQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCD--KHSYGCKGGYQTTSLQYV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
             +G+ T   Y PY  +   C   +       +TG+  V  N E +   ALA + P+SV 
Sbjct: 76  ANNGVHTSKVY-PYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALA-NQPLSVL 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           ++A  K F  Y +GV+ D  C      LDHAV AVGYG  DGK Y  +KNSW   WG +G
Sbjct: 134 VEAGGKPFQLYKSGVF-DGPCGTK---LDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKG 189

Query: 330 YVLMSIKDNN----CGVMTAPTYVTM 351
           Y+ +  +  N    CGV  +  Y   
Sbjct: 190 YMRLKRQSGNSQGTCGVYKSSYYPFK 215


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  258 bits (662), Expect = 2e-86
 Identities = 74/205 (36%), Positives = 109/205 (53%), Gaps = 14/205 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F    AVE    ++  +L  LS+Q L+DC     ++GC+GG    ++Q+I
Sbjct: 18  NQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCD--TASHGCNGGWMNNAFQYI 75

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ TQ +Y PY      C         ++ GF  VT N+E AL+ A+A   PVSV 
Sbjct: 76  ITNGGIDTQQNY-PYSAVQGSCK-PYRLRVVSINGFQRVTRNNESALQSAVA-SQPVSVT 132

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           ++A+   F  Y +G++            +H V+ VGYG   GK YW V+NSW   WGNQG
Sbjct: 133 VEAAGAPFQHYSSGIFTGPCGTA----QNHGVVIVGYGTQSGKNYWIVRNSWGQNWGNQG 188

Query: 330 YVLMSIKDNN----CGVMTAPTYVT 350
           Y+ M     +    CG+   P+Y T
Sbjct: 189 YIWMERNVASSAGLCGIAQLPSYPT 213


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  257 bits (660), Expect = 5e-86
 Identities = 73/203 (35%), Positives = 112/203 (55%), Gaps = 13/203 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F T  AVEG   +    L  LS+Q L+DC+    N+GC GG    ++Q+I
Sbjct: 20  NQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT--TANHGCRGGWMNPAFQFI 77

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ +++ Y PY GQD  C+        ++  + NV  ++E +L+ A+A + PVSV 
Sbjct: 78  VNNGGINSEETY-PYRGQDGICNSTVNAPVVSIDSYENVPSHNEQSLQKAVA-NQPVSVT 135

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           +DA+ + F  Y +G++    CN S    +HA+  VGYG  + K +W VKNSW   WG  G
Sbjct: 136 MDAAGRDFQLYRSGIFTGS-CNIS---ANHALTVVGYGTENDKDFWIVKNSWGKNWGESG 191

Query: 330 YVLMS----IKDNNCGVMTAPTY 348
           Y+         D  CG+    +Y
Sbjct: 192 YIRAERNIENPDGKCGITRFASY 214


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  255 bits (655), Expect = 3e-85
 Identities = 72/206 (34%), Positives = 105/206 (50%), Gaps = 17/206 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q+ CGSCW+F T   +EG   +   +L  LS+Q L+DC     ++GCDGG    S Q++
Sbjct: 18  NQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCE--RRSHGCDGGYQTTSLQYV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + +G+ T+ +Y PY  +   C   +       +TG+  V  N E +L  A+A + PVSV 
Sbjct: 76  VDNGVHTEREY-PYEKKQGRCRAKDKKGPKVYITGYKYVPANDEISLIQAIA-NQPVSVV 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
            D+  + F FY  G+Y      N     DHAV AVGY    GK Y  +KNSW   WG +G
Sbjct: 134 TDSRGRGFQFYKGGIYEGPCGTN----TDHAVTAVGY----GKTYLLLKNSWGPNWGEKG 185

Query: 330 YVLMS----IKDNNCGVMTAPTYVTM 351
           Y+ +          CGV T+  +   
Sbjct: 186 YIRIKRASGRSKGTCGVYTSSFFPIK 211


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  255 bits (654), Expect = 7e-85
 Identities = 86/208 (41%), Positives = 113/208 (54%), Gaps = 14/208 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T  AVEG   +K  KL  LS+Q L+DC     N GC+GG    ++++I
Sbjct: 19  DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ-NQGCNGGLMDYAFEFI 77

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATA-TMTGFVNVTPNSEDALKLALAKHGPVSV 268
            +  G+ T+ +Y PY   D  C ++   A A ++ G  NV  N E+AL  A+A + PVSV
Sbjct: 78  KQRGGITTEANY-PYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVA-NQPVSV 135

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           AIDA    F FY  GV+    C      LDH V  VGYG  +DG  YW VKNSW   WG 
Sbjct: 136 AIDAGGSDFQFYSEGVFTGS-CGTE---LDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGE 191

Query: 328 QGYVLMS----IKDNNCGVMTAPTYVTM 351
           +GY+ M      K+  CG+    +Y   
Sbjct: 192 KGYIRMERGISDKEGLCGIAMEASYPIK 219


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  251 bits (643), Expect = 2e-83
 Identities = 69/203 (33%), Positives = 101/203 (49%), Gaps = 17/203 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F     +EG   ++   L   S+Q L+DC     + GC+GG  + + Q +
Sbjct: 18  NQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCD--RRSYGCNGGYPWSALQLV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
            ++G+  ++ Y PY G   YC         A   G   V P +E AL  ++A + PVSV 
Sbjct: 76  AQYGIHYRNTY-PYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIA-NQPVSVV 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           ++A+ K F  Y  G++     N     +DHAV AVGY    G  Y  +KNSW T WG  G
Sbjct: 134 LEAAGKDFQLYRGGIFVGPCGNK----VDHAVAAVGY----GPNYILIKNSWGTGWGENG 185

Query: 330 YVLMSIKDNN----CGVMTAPTY 348
           Y+ +     N    CG+ T+  Y
Sbjct: 186 YIRIKRGTGNSYGVCGLYTSSFY 208


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  251 bits (644), Expect = 4e-83
 Identities = 73/213 (34%), Positives = 107/213 (50%), Gaps = 19/213 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
            Q  CGS W+F  TGA+E A+ +    L  LS+Q LIDC     + GC  G  ++S++W+
Sbjct: 19  FQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCV--DESEGCYNGWHYQSFEWV 76

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGF-------VNVTPNSEDALKLALAK 262
           +KH G+ ++ DY PY  +D  C         T+  +        +    +E +L+  +  
Sbjct: 77  VKHGGIASEADY-PYKARDGKCKANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVL- 134

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWS 322
             P+SV+IDA    F FY  G+Y    C+ SP G++H VL VGYG  DG  YW  KNSW 
Sbjct: 135 EQPISVSIDAK--DFHFYSGGIYDGGNCS-SPYGINHFVLIVGYGSEDGVDYWIAKNSWG 191

Query: 323 TYWGNQGYVLMS----IKDNNCGVMTAPTYVTM 351
             WG  GY+ +          CG+    +Y  +
Sbjct: 192 EDWGIDGYIRIQRNTGNLLGVCGMNYFASYPII 224


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  250 bits (640), Expect = 5e-83
 Identities = 67/203 (33%), Positives = 97/203 (47%), Gaps = 17/203 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F +  AVEG   +   +L  LS+Q L+DC     + GC GG    + Q++
Sbjct: 18  NQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCE--RRSYGCRGGFPLYALQYV 75

Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTAT-ATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
              G+  +  Y PY G    C  +          G   V  N+E AL   +A   PVS+ 
Sbjct: 76  ANSGIHLRQYY-PYEGVQRQCRASQAKGPKVKTDGVGRVPRNNEQALIQRIA-IQPVSIV 133

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           ++A  ++F  Y  G++    C  S   +DHAV AVGY    G  Y  +KNSW T WG  G
Sbjct: 134 VEAKGRAFQNYRGGIFAGP-CGTS---IDHAVAAVGY----GNDYILIKNSWGTGWGEGG 185

Query: 330 YVLMS----IKDNNCGVMTAPTY 348
           Y+ +          CGV++   +
Sbjct: 186 YIRIKRGSGNPQGACGVLSDSVF 208


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  251 bits (643), Expect = 7e-83
 Identities = 83/211 (39%), Positives = 110/211 (52%), Gaps = 17/211 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F T  +VEG   ++   L  LS+Q LIDC     N+GC GG    ++++I
Sbjct: 21  DQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD-TADNDGCQGGLMDNAFEYI 79

Query: 211 MKH-GLPTQDDYGPYLGQDAYCH----IANTTATATMTGFVNVTPNSEDALKLALAKHGP 265
             + GL T+  Y PY      C+      N+     + G  +V  NSE+ L  A+A + P
Sbjct: 80  KNNGGLITEAAY-PYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVA-NQP 137

Query: 266 VSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTY 324
           VSVA++AS K+F FY  GV+  E C      LDH V  VGYG   DGK YW VKNSW   
Sbjct: 138 VSVAVEASGKAFMFYSEGVFTGE-CGTE---LDHGVAVVGYGVAEDGKAYWTVKNSWGPS 193

Query: 325 WGNQGYVLMSIKDNN----CGVMTAPTYVTM 351
           WG QGY+ +          CG+    +Y   
Sbjct: 194 WGEQGYIRVEKDSGASGGLCGIAMEASYPVK 224


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  249 bits (638), Expect = 1e-82
 Identities = 73/206 (35%), Positives = 108/206 (52%), Gaps = 16/206 (7%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F   G VE  +++    L  LS+Q L+ C     ++GC GG    +++WI
Sbjct: 18  DQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD--KTDSGCSGGLMNNAFEWI 75

Query: 211 MKH---GLPTQDDYGPYLGQDAY---CHIANTTATATMTGFVNVTPNSEDALKLALAKHG 264
           ++     + T+D Y PY   +     C  +  T  AT+TG V +  + E  +   LA +G
Sbjct: 76  VQENNGAVYTEDSY-PYASGEGISPPCTTSGHTVGATITGHVELPQD-EAQIAAWLAVNG 133

Query: 265 PVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTY 324
           PV+VA+DAS  S+  Y  GV      +   + LDH VL VGY +    PYW +KNSW+T 
Sbjct: 134 PVAVAVDAS--SWMTYTGGVM----TSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQ 187

Query: 325 WGNQGYVLMSIKDNNCGVMTAPTYVT 350
           WG +GY+ ++   N C V    +   
Sbjct: 188 WGEEGYIRIAKGSNQCLVKEEASSAV 213


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  245 bits (629), Expect = 7e-81
 Identities = 72/217 (33%), Positives = 112/217 (51%), Gaps = 27/217 (12%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ++CGSCW+F + G+VE  Y ++ K L + S+Q L+DCS    NNGC GG    ++  +
Sbjct: 37  DQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCS--VKNNGCYGGYITNAFDDM 94

Query: 211 MKH-GLPTQDDYGPYLGQ-DAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           +   GL +QDDY PY+      C++       T+  +V++    +D  K AL   GP+S+
Sbjct: 95  IDLGGLCSQDDY-PYVSNLPETCNLKRCNERYTIKSYVSIP---DDKFKEALRYLGPISI 150

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----------ELDGKPYWQVK 318
           +I AS   F+FY  G  YD +C  +    +HAV+ VGYG           ++   Y+ +K
Sbjct: 151 SIAASD-DFAFYRGGF-YDGECGAA---PNHAVILVGYGMKDIYNEDTGRMEKFYYYIIK 205

Query: 319 NSWSTYWGNQGYVLMS----IKDNNCGVMTAPTYVTM 351
           NSW + WG  GY+ +          C + T      +
Sbjct: 206 NSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  244 bits (624), Expect = 5e-80
 Identities = 53/235 (22%), Positives = 80/235 (34%), Gaps = 39/235 (16%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  C + W F +   +E    MK  +   +S   + +C  G   + CD G     +  I
Sbjct: 27  DQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEHKDRCDEGSSPMEFLQI 86

Query: 211 MKH--GLPTQDDYGPYLGQDAYCHIA-------------------NTTATATMTGFVNVT 249
           ++    LP + +Y PY                             N   +    G+    
Sbjct: 87  IEDYGFLPAESNY-PYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYE 145

Query: 250 PNS--------EDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
                         +K  +   G V   I A      +  +G      C    D  DHAV
Sbjct: 146 SERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMG-YEFSGKKVKNLC--GDDTADHAV 202

Query: 302 LAVGYG-----ELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVT 350
             VGYG     E + K YW V+NSW  YWG++GY  + +    +C      + V 
Sbjct: 203 NIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIHSVVI 257


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  239 bits (612), Expect = 2e-78
 Identities = 67/217 (30%), Positives = 115/217 (52%), Gaps = 27/217 (12%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CGSCW+F + G+VE  Y ++  KL  LS+Q L+DCS  + N GC+GG    +++ +
Sbjct: 35  DQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS--FKNYGCNGGLINNAFEDM 92

Query: 211 MKH-GLPTQDDYGPYLG-QDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
           ++  G+    DY PY+      C+I   T    +  +++V    ++ LK AL   GP+S+
Sbjct: 93  IELGGICPDGDY-PYVSDAPNLCNIDRCTEKYGIKNYLSVP---DNKLKEALRFLGPISI 148

Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG----------ELDGKPYWQVK 318
           ++  S   F+FY  G+ +D +C +    L+HAV+ VG+G          + +   Y+ +K
Sbjct: 149 SVAVS-DDFAFYKEGI-FDGECGDQ---LNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIK 203

Query: 319 NSWSTYWGNQGYVLMSIKDNN----CGVMTAPTYVTM 351
           NSW   WG +G++ +   ++     CG+ T      +
Sbjct: 204 NSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPLI 240


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  215 bits (550), Expect = 1e-69
 Identities = 73/201 (36%), Positives = 101/201 (50%), Gaps = 17/201 (8%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           +Q  CGSCW+F T   VE    ++   L  LS+Q L+DC     N+GC GG    +YQ+I
Sbjct: 18  NQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCD--KKNHGCLGGAFVFAYQYI 75

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ TQ +Y PY      C  A +    ++ G+  V   +E ALK A+A   P +VA
Sbjct: 76  INNGGIDTQANY-PYKAVQGPCQ-AASKV-VSIDGYNGVPFCNEXALKQAVA-VQPSTVA 131

Query: 270 IDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
           IDAS   F  Y +G++           L+H V  VGY       YW V+NSW  YWG +G
Sbjct: 132 IDASSAQFQQYSSGIFSGPCGTK----LNHGVTIVGY----QANYWIVRNSWGRYWGEKG 183

Query: 330 YVLMS--IKDNNCGVMTAPTY 348
           Y+ M        CG+   P Y
Sbjct: 184 YIRMLRVGGCGLCGIARLPYY 204


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  217 bits (554), Expect = 3e-67
 Identities = 63/213 (29%), Positives = 94/213 (44%), Gaps = 16/213 (7%)

Query: 150 LDQSVCGSCWSFGTTGAVEGAYYMKHKKL--AVLSQQALIDCSWGYGNNGCDGGEDFRSY 207
            +Q+ CGSC+SF + G +E    +        +LS Q ++ CS      GC+GG  +   
Sbjct: 226 RNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIA 283

Query: 208 QWIMK-HGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP----NSEDALKLALAK 262
               +  GL  +  + PY G D+ C +         + +  V       +E  +KL L  
Sbjct: 284 GKYAQDFGLVEEACF-PYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVH 342

Query: 263 HGPVSVAIDASQKSFSFYVNGVYYDEKC---NNSPDGLDHAVLAVGYGE--LDGKPYWQV 317
           HGP++VA +     F  Y  G+Y+        N  +  +HAVL VGYG     G  YW V
Sbjct: 343 HGPMAVAFEVYD-DFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIV 401

Query: 318 KNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
           KNSW T WG  GY  +    + C + +     T
Sbjct: 402 KNSWGTGWGENGYFRIRRGTDECAIESIAVAAT 434


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  210 bits (536), Expect = 1e-66
 Identities = 53/225 (23%), Positives = 92/225 (40%), Gaps = 35/225 (15%)

Query: 151 DQSV---CGSCWSFGTTGAVEGAYYMKHK---KLAVLSQQALIDCSWGYGNNGCDGGEDF 204
           +Q +   CGSCW+  +T A+     +K K      +LS Q +IDC        C+GG D 
Sbjct: 56  NQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDL 112

Query: 205 RSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNV-------------TPN 251
             + +  +HG+P +     Y  +D  C   N   T       +              + +
Sbjct: 113 SVWDYAHQHGIPDETCN-NYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLS 171

Query: 252 SEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDG 311
             + +   +  +GP+S  I A++   + Y  G+Y + +       ++H V   G+G  DG
Sbjct: 172 GREKMMAEIYANGPISCGIMATE-RLANYTGGIYAEYQDTTY---INHVVSVAGWGISDG 227

Query: 312 KPYWQVKNSWSTYWGNQGYVLM--------SIKDNNCGVMTAPTY 348
             YW V+NSW   WG +G++ +             N  +    T+
Sbjct: 228 TEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTF 272


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  208 bits (532), Expect = 3e-65
 Identities = 60/223 (26%), Positives = 87/223 (39%), Gaps = 31/223 (13%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMK-HKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW 209
           DQS CGSCW+     A+   +      +   +S   L+ C    G+ GC+GG+  R++ +
Sbjct: 93  DQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGD-GCNGGDPDRAWAY 151

Query: 210 IMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP------------------- 250
               GL +     PY       H  +       + F   TP                   
Sbjct: 152 FSSTGLVSDYCQ-PYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSW 210

Query: 251 -----NSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVG 305
                  ED     L   GP  VA D  +  F  Y +GVY+            HAV  VG
Sbjct: 211 TSYALQGEDDYMRELFFRGPFEVAFDVYE-DFIAYNSGVYHHVSGQYL---GGHAVRLVG 266

Query: 306 YGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
           +G  +G PYW++ NSW+T WG  GY L+    + CG+    + 
Sbjct: 267 WGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSA 309


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  205 bits (524), Expect = 3e-64
 Identities = 56/231 (24%), Positives = 87/231 (37%), Gaps = 37/231 (16%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKL--AVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           DQ  CGSCW+FG   A+     +         +S + L+ C      +GC+GG    ++ 
Sbjct: 85  DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 144

Query: 209 WIMKHGLPTQDDYG------PYLGQDAYCHIANTTATATMTGFVNV-------------- 248
           +  + GL +   Y       PY       H+  +    T  G                  
Sbjct: 145 FWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYK 204

Query: 249 -----------TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
                        NSE  +   + K+GPV  A       F  Y +GVY            
Sbjct: 205 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS-DFLLYKSGVYQHVTGEMM---G 260

Query: 298 DHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
            HA+  +G+G  +G PYW V NSW+T WG+ G+  +    ++CG+ +    
Sbjct: 261 GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVA 311


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  202 bits (517), Expect = 8e-64
 Identities = 55/225 (24%), Positives = 84/225 (37%), Gaps = 37/225 (16%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKL--AVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           DQ  CGS W+FG   A+     +         +S + L+ C      +GC+GG    ++ 
Sbjct: 28  DQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 87

Query: 209 WIMKHGLPTQDDYG------PYLGQDAYCHIANTTATATMTGFVNV-------------- 248
           +  + GL +   Y       PY       H+       T  G                  
Sbjct: 88  FWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYK 147

Query: 249 -----------TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGL 297
                        NSE  +   + K+GPV  A       F  Y +GVY            
Sbjct: 148 QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS-DFLLYKSGVYQHVTGEMM---G 203

Query: 298 DHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGV 342
            HA+  +G+G  +G PYW V NSW+T WG+ G+  +    ++CG+
Sbjct: 204 GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGI 248


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  202 bits (515), Expect = 3e-63
 Identities = 46/236 (19%), Positives = 75/236 (31%), Gaps = 25/236 (10%)

Query: 130 NKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALI-- 187
            K+   A+P + ++     + DQ   GSC +     A++       +    +  +  I  
Sbjct: 50  EKSVIAALPPKVDLTPPFQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYY 109

Query: 188 DCSWGYGNNGCDGGEDFRSYQWIM-KHGLPTQDDYGPYLGQDAYCHIANTTATA------ 240
           +     G+   D G   R    ++ K G+  + ++ PY    A          A      
Sbjct: 110 NERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEW-PYGDTPADPRTEEFPPGAPASKKP 168

Query: 241 -----------TMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEK 289
                       +T +  V     D LK  LA   P                  V     
Sbjct: 169 SDQCYKDAQNYKITEYSRV-AQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLP 227

Query: 290 CNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMS-IKDNNCGVMT 344
             N      HAVL VGY   D   +++++NSW    G  GY  M     +N  +  
Sbjct: 228 TKNDTLEGGHAVLCVGYD--DEIRHFRIRNSWGNNVGEDGYFWMPYEYISNTQLAD 281


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  197 bits (503), Expect = 2e-62
 Identities = 77/208 (37%), Positives = 107/208 (51%), Gaps = 13/208 (6%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
           DQ  CG CW+FG TGA+EG   +   +L  +S+Q ++DC          GG+   +++W+
Sbjct: 18  DQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCD--TXXXXXXGGDADDAFRWV 75

Query: 211 MKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVA 269
           + + G+ +  +Y PY G D  C   N    A + G+ NV PNS  AL  A+A   PVSV 
Sbjct: 76  ITNGGIASDANY-PYTGVDGTCD-LNKPIAARIDGYTNV-PNSSSALLDAVA-KQPVSVN 131

Query: 270 IDASQKSFSFYVN-GVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGN 327
           I  S  SF  Y   G++    C++ P  +DH VL VGYG       YW VKNSW T WG 
Sbjct: 132 IYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGI 191

Query: 328 QGYVLMS----IKDNNCGVMTAPTYVTM 351
            GY+L+       D  C +    +Y T 
Sbjct: 192 DGYILIRRNTNRPDGVCAIDAWGSYPTK 219


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  191 bits (487), Expect = 2e-59
 Identities = 55/232 (23%), Positives = 86/232 (37%), Gaps = 39/232 (16%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKH--KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
           DQS CGSCW+FG   A+     ++   K+   LS   L+ C    G  GC+GG    ++ 
Sbjct: 24  DQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLLSCCESCGL-GCEGGILGPAWD 82

Query: 209 WIMKHGLPTQDDYGPYLGQDAY-----------------------------CHIANTTAT 239
           + +K G+ T      + G + Y                             C     T  
Sbjct: 83  YWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPY 142

Query: 240 ATMTGFVNV---TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
                         N E A++  + K+GPV       +  F  Y +G+Y           
Sbjct: 143 TQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYE-DFLNYKSGIYKHITGETL--- 198

Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 348
             HA+  +G+G  +  PYW + NSW+  WG  GY  +    + C + +  T 
Sbjct: 199 GGHAIRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTA 250


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 46.4 bits (109), Expect = 8e-06
 Identities = 13/39 (33%), Positives = 17/39 (43%), Gaps = 1/39 (2%)

Query: 297 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMS 334
            DH +   G   + +G  Y+ VKNSW T     G    S
Sbjct: 316 DDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYAS 354



 Score = 42.5 bits (99), Expect = 1e-04
 Identities = 13/81 (16%), Positives = 32/81 (39%), Gaps = 10/81 (12%)

Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWG---------YGN-NGCDG 200
           +Q+  G+CW + +   +E       K    LS+   +  ++          +G+ +   G
Sbjct: 27  NQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDVSFSQG 86

Query: 201 GEDFRSYQWIMKHGLPTQDDY 221
           G  + +   +   GL  +++ 
Sbjct: 87  GSFYDALYGMETFGLVPEEEM 107


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
           acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
           synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 44.3 bits (104), Expect = 5e-05
 Identities = 54/364 (14%), Positives = 101/364 (27%), Gaps = 121/364 (33%)

Query: 4   TFLPLLLLSVGMVKTYQLSKNGTNGLSLKVAPMTTETELNKISCFHAY-------GIPDA 56
            F  +L L +   +   L  N  + L+ K+      T +        Y         P  
Sbjct: 79  QFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDTTLVKTKELIKNYITARIMAKRPFD 138

Query: 57  TIEPQSVLPDVSDFKVNIYRLF---------F--LRPRFHENEKIRYNWTY--IGEELVN 103
                ++   V +    +  +F         F  LR  +          TY  +  +L+ 
Sbjct: 139 KKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFEELRDLYQ---------TYHVLVGDLIK 189

Query: 104 GIILEKWRLVTSEGEKVSKYSL------WVRYNKASK------DAIPV-----------R 140
                   L+ +  +    ++       W+  N ++        +IP+            
Sbjct: 190 FSAETLSELIRTTLDAEKVFTQGLNILEWLE-NPSNTPDKDYLLSIPISCPLIGVIQLAH 248

Query: 141 Y----EMKGYN-SLLDQSVCGSCWSFGTTGAVEG--------------AYYMKHKK-LAV 180
           Y    ++ G+    L   +       G TG  +G              ++++  +K + V
Sbjct: 249 YVVTAKLLGFTPGELRSYLK------GATGHSQGLVTAVAIAETDSWESFFVSVRKAITV 302

Query: 181 L------SQQA----------LIDCSWGYGNNGCDGGEDFRSYQWIMKH--GLPTQ---- 218
           L        +A          L D          +  E   S    M     L  +    
Sbjct: 303 LFFIGVRCYEAYPNTSLPPSILEDS--------LENNEGVPSP---MLSISNLTQEQVQD 351

Query: 219 --DDYGPYLGQDAYCHIA--NTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQ 274
             +    +L       I+  N      ++G     P S   L L L K       +D S+
Sbjct: 352 YVNKTNSHLPAGKQVEISLVNGAKNLVVSGP----PQSLYGLNLTLRKA-KAPSGLDQSR 406

Query: 275 KSFS 278
             FS
Sbjct: 407 IPFS 410


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 43.3 bits (101), Expect = 9e-05
 Identities = 42/372 (11%), Positives = 99/372 (26%), Gaps = 119/372 (31%)

Query: 11  LSVGMVKT------------YQLSKNGTNGL----SLKVAPMTTETELNKISCFHAYGIP 54
           L++    +            YQ+  N T+      ++K+   + + EL ++     Y   
Sbjct: 187 LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPY--E 244

Query: 55  DATIEPQSVLPDVSDFKVNIYRLFFLRPRFHENEKI----RY--NWTYIGEELVNGIILE 108
           +  +    VL +V +     +  F L        KI    R+     ++       I L+
Sbjct: 245 NCLL----VLLNVQN--AKAWNAFNLSC------KILLTTRFKQVTDFLSAATTTHISLD 292

Query: 109 KWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMKGYNSLLDQSVCGSCWSFGTTGAVE 168
              +  +  E  S    ++         +P   E+   N     S+              
Sbjct: 293 HHSMTLTPDEVKSLLLKYL---DCRPQDLPR--EVLTTNPRR-LSIIA------------ 334

Query: 169 GAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQW-----IMKHGLPTQDDYGP 223
                       +        +W            ++         I++  L   +   P
Sbjct: 335 ----------ESIRDGL---ATWDN----------WKHVNCDKLTTIIESSLNVLE---P 368

Query: 224 YLGQDAY--CHIANTTATATMTGF---VNVTPN----------SEDALKLA--LAKHGPV 266
              +  +    +           F    ++               D + +   L K+   
Sbjct: 369 AEYRKMFDRLSV-----------FPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYS-- 415

Query: 267 SVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQ-VKNSWSTYW 325
              ++   K  +  +  +Y + K     +   H  +   Y           +      Y+
Sbjct: 416 --LVEKQPKESTISIPSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYF 473

Query: 326 GNQ-GYVLMSIK 336
            +  G+ L +I+
Sbjct: 474 YSHIGHHLKNIE 485


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 35.9 bits (82), Expect = 0.017
 Identities = 20/85 (23%), Positives = 35/85 (41%), Gaps = 4/85 (4%)

Query: 254 DALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELD--- 310
           D  K   +K G   + +   +  F   +  +   E+       + HA+      E D   
Sbjct: 326 DVGKHFNSKLGLSDMNLYDHELVFGVSLKNMNKAERLTFGESLMTHAMTFTAVSEKDDQD 385

Query: 311 GKP-YWQVKNSWSTYWGNQGYVLMS 334
           G    W+V+NSW    G++GY+ M+
Sbjct: 386 GAFTKWRVENSWGEDHGHKGYLCMT 410


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 35.5 bits (81), Expect = 0.021
 Identities = 19/109 (17%), Positives = 33/109 (30%), Gaps = 19/109 (17%)

Query: 245 FVNVTPNS-EDALKLALAKHGPVSVAIDASQKSFS----FYVNGVYYDEKCNNSPD---- 295
           ++NV   +    +   L  +  V       +          +    Y     N P     
Sbjct: 303 YLNVDNETLSKLVVKRLQNNKAVFFGSHTPKFMDKKTGVMDIELWNYPAIGYNLPQQKAS 362

Query: 296 -------GLDHAVLAVGYG--ELDGKP-YWQVKNSWSTYWGNQGYVLMS 334
                   +  A+L  G    E    P  ++V+NSW    G  G  +M+
Sbjct: 363 RIRYHESLMTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVMT 411


>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
           photosynthetic reaction center, peripheral antenna; HET:
           CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
          Length = 154

 Score = 31.8 bits (71), Expect = 0.17
 Identities = 7/21 (33%), Positives = 12/21 (57%), Gaps = 1/21 (4%)

Query: 253 EDALKLALAKHGPVSVAIDAS 273
           + +LKL      P ++AI A+
Sbjct: 26  QASLKLYADDSAP-ALAIKAT 45


>2ebj_A Pyrrolidone carboxyl peptidase; TTHA08 degradation of proteins and
           peptides, structural genomics; 1.90A {Thermus
           thermophilus}
          Length = 192

 Score = 27.1 bits (59), Expect = 6.6
 Identities = 10/39 (25%), Positives = 15/39 (38%)

Query: 219 DDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALK 257
              G YL   A+             GF+++ P+   ALK
Sbjct: 132 LSAGSYLCNQAFYLSLYRLPEEVPVGFLHLPPDETLALK 170


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.318    0.134    0.422 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,454,553
Number of extensions: 320586
Number of successful extensions: 861
Number of sequences better than 10.0: 1
Number of HSP's gapped: 652
Number of HSP's successfully gapped: 55
Length of query: 351
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 257
Effective length of database: 4,077,219
Effective search space: 1047845283
Effective search space used: 1047845283
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (25.9 bits)
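
The length figures in the statistics footer above are consistent with the usual BLAST edge-effect correction, in which the reported length adjustment is subtracted once from the query and once per database sequence. The following is a minimal sketch that reproduces the reported values under that assumption; the function name `effective_search_space` is hypothetical and is not part of any BLAST tool.

```python
def effective_search_space(query_len, db_len, num_seqs, length_adjustment):
    """Recompute the effective lengths from the raw values in the report footer.

    Assumes the length adjustment is subtracted once from the query
    and once from every sequence in the database.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_len - num_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


# Raw values copied from the report: query length, letters in database,
# number of sequences, and length adjustment.
eff_query, eff_db, space = effective_search_space(351, 6_701_793, 27_921, 94)
print(eff_query, eff_db, space)
```

With the values from this report, the sketch yields 257, 4,077,219 and 1,047,845,283, matching the "Effective length of query", "Effective length of database" and "Effective search space" lines.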