RPS-BLAST 2.2.26 [Sep-21-2011]

Database: pdb70 
           27,921 sequences; 6,701,793 total letters

Searching..................................................done

Query= psy403
         (233 letters)



>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
           disease mutation, disulfide bond, glycoprotein,
           hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
           sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
           1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
           1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
           2bdl_A* ...
          Length = 215

 Score =  242 bits (619), Expect = 1e-81
 Identities = 84/233 (36%), Positives = 108/233 (46%), Gaps = 85/233 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M  AFQY++ N GID+E +YPY                                  +++C
Sbjct: 68  MTNAFQYVQKNRGIDSEDAYPYVGQ-------------------------------EESC 96

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
            Y      A  RGY +IPEG+E  LK AVA +GPVS+AIDAS  SFQFYS+GVYY+  CN
Sbjct: 97  MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCN 156

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           S  L+HAVL VGYG  + GN +W++KNSW                               
Sbjct: 157 SDNLNHAVLAVGYGI-QKGNKHWIIKNSWGE----------------------------- 186

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    WG++GYI MARN+ N CG+A+ ASFP +
Sbjct: 187 ------------------------NWGNKGYILMARNKNNACGIANLASFPKM 215


>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
           0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
           2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
           3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
           2nqd_B* 3kse_A* 2vhs_A ...
          Length = 220

 Score =  241 bits (618), Expect = 2e-81
 Identities = 88/233 (37%), Positives = 116/233 (49%), Gaps = 82/233 (35%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           MD AFQY++DN G+D+E SYPYEA                                +++C
Sbjct: 70  MDYAFQYVQDNGGLDSEESYPYEAT-------------------------------EESC 98

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +Y    S A D G+VDIP   E  L  AVAT+GP+S+AIDA H+SF FY EG+Y+EP+C+
Sbjct: 99  KYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS 157

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           S  +DH VLVVGYG +                                            
Sbjct: 158 SEDMDHGVLVVGYGFE-------------------------------------------- 173

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                  T+ + N YWLVKNSW   WG  GY+KMA++R N+CG+AS+AS+P V
Sbjct: 174 ------STESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220


>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
           aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
           d.3.1.1 PDB: 1nb3_A* 1nb5_A*
          Length = 220

 Score =  239 bits (613), Expect = 1e-80
 Identities = 63/232 (27%), Positives = 91/232 (39%), Gaps = 85/232 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
             QAF+YI+ N GI  E +YPY+                                 DD+C
Sbjct: 71  PSQAFEYIRYNKGIMGEDTYPYKGQ-------------------------------DDHC 99

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +++  K+ A  +   +I   DE  +  AVA   PVS A + ++  F  Y +G+Y    C+
Sbjct: 100 KFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTN-DFLMYRKGIYSSTSCH 158

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
            T                                                     +++HA
Sbjct: 159 KT---------------------------------------------------PDKVNHA 167

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
           VL VGYG  ENG  YW+VKNSW   WG  GY  + R + N CG+A+ AS+P+
Sbjct: 168 VLAVGYGE-ENGIPYWIVKNSWGPQWGMNGYFLIERGK-NMCGLAACASYPI 217


>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
           covalently bound to Cys25, lysosomeal protein; HET: O64;
           1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
           2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
           2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
           3n4c_A* 3mpe_A* 1nqc_A* ...
          Length = 218

 Score =  238 bits (609), Expect = 4e-80
 Identities = 81/233 (34%), Positives = 102/233 (43%), Gaps = 86/233 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M  AFQYI DN GID+++SYPY+AM                               D  C
Sbjct: 72  MTTAFQYIIDNKGIDSDASYPYKAM-------------------------------DQKC 100

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +Y      A    Y ++P G E  LK AVA  GPVS+ +DA H SF  Y  GVYYEP C 
Sbjct: 101 QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC- 159

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           +  ++H VLVVGYG   NG +YWLVKNSW                               
Sbjct: 160 TQNVNHGVLVVGYGD-LNGKEYWLVKNSWGH----------------------------- 189

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    +G+EGYI+MARN+ N+CG+AS  S+P +
Sbjct: 190 ------------------------NFGEEGYIRMARNKGNHCGIASFPSYPEI 218


>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
           1.85A {Tenebrio molitor}
          Length = 331

 Score =  240 bits (615), Expect = 2e-79
 Identities = 77/233 (33%), Positives = 93/233 (39%), Gaps = 86/233 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M+ AF Y+  N GID+E +YPYE                                 D NC
Sbjct: 185 MNDAFTYVAQNGGIDSEGAYPYEMA-------------------------------DGNC 213

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
            Y   +  A   GYV +   DE  L   VAT GPV++A DA    F  YS GVYY P C 
Sbjct: 214 HYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADD-PFGSYSGGVYYNPTCE 272

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           + +  HAVL+VGYG  ENG DYWLVKNSW                               
Sbjct: 273 TNKFTHAVLIVGYGN-ENGQDYWLVKNSWGD----------------------------- 302

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    WG +GY K+ARN  N+CG+A  AS P +
Sbjct: 303 ------------------------GWGLDGYFKIARNANNHCGIAGVASVPTL 331


>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
           hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
           PDB: 1cjl_A 3hwn_A*
          Length = 316

 Score =  239 bits (612), Expect = 3e-79
 Identities = 88/233 (37%), Positives = 116/233 (49%), Gaps = 82/233 (35%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           MD AFQY++DN G+D+E SYPYEA                                +++C
Sbjct: 166 MDYAFQYVQDNGGLDSEESYPYEAT-------------------------------EESC 194

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +Y    S A D G+VDIP   E  L  AVAT+GP+S+AIDA H+SF FY EG+Y+EP+C+
Sbjct: 195 KYNPKYSVANDAGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS 253

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           S  +DH VLVVGYG +                                            
Sbjct: 254 SEDMDHGVLVVGYGFE-------------------------------------------- 269

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                  T+ + N YWLVKNSW   WG  GY+KMA++R N+CG+AS+AS+P V
Sbjct: 270 ------STESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316


>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
           2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
          Length = 314

 Score =  239 bits (611), Expect = 4e-79
 Identities = 84/233 (36%), Positives = 108/233 (46%), Gaps = 85/233 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M  AFQY++ N GID+E +YPY                                  +++C
Sbjct: 167 MTNAFQYVQKNRGIDSEDAYPYVGQ-------------------------------EESC 195

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
            Y      A  RGY +IPEG+E  LK AVA +GPVS+AIDAS  SFQFYS+GVYY+  CN
Sbjct: 196 MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCN 255

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           S  L+HAVL VGYG  + GN +W++KNSW                               
Sbjct: 256 SDNLNHAVLAVGYGI-QKGNKHWIIKNSWGE----------------------------- 285

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    WG++GYI MARN+ N CG+A+ ASFP +
Sbjct: 286 ------------------------NWGNKGYILMARNKNNACGIANLASFPKM 314


>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
           prosegment binding loop, glycoprotein, lysosome,
           protease, zymogen; 2.1A {Homo sapiens}
          Length = 315

 Score =  234 bits (600), Expect = 2e-77
 Identities = 81/233 (34%), Positives = 102/233 (43%), Gaps = 86/233 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M  AFQYI DN GID+++SYPY+AM                               D  C
Sbjct: 169 MTTAFQYIIDNKGIDSDASYPYKAM-------------------------------DQKC 197

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +Y      A    Y ++P G E  LK AVA  GPVS+ +DA H SF  Y  GVYYEP C 
Sbjct: 198 QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC- 256

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
           +  ++H VLVVGYG   NG +YWLVKNSW                               
Sbjct: 257 TQNVNHGVLVVGYGD-LNGKEYWLVKNSWGH----------------------------- 286

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    +G+EGYI+MARN+ N+CG+AS  S+P +
Sbjct: 287 ------------------------NFGEEGYIRMARNKGNHCGIASFPSYPEI 315


>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
           intramolecular DISS bonds, insect larVal midgut; HET:
           PG4 PG6; 2.11A {Tenebrio molitor}
          Length = 329

 Score =  231 bits (591), Expect = 7e-76
 Identities = 76/233 (32%), Positives = 104/233 (44%), Gaps = 87/233 (37%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           MD AF YI  ++GI +ES+YPYEA                                 D C
Sbjct: 184 MDSAFSYIH-DYGIMSESAYPYEAQ-------------------------------GDYC 211

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           R+  ++S     GY D+P GDE  L  AV   GPV++AIDA+    QFYS G++Y+  CN
Sbjct: 212 RFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATD-ELQFYSGGLFYDQTCN 270

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
            + L+H VLVVGYG+ +NG DYW++KNSW +                             
Sbjct: 271 QSDLNHGVLVVGYGS-DNGQDYWILKNSWGS----------------------------- 300

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                    WG+ GY +  RN  NNCG+A++AS+P +
Sbjct: 301 ------------------------GWGESGYWRQVRNYGNNCGIATAASYPAL 329


>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
           ricinosomes, SEED germi senescence, hydrolase-hydrolase
           inhibitor complex; 2.00A {Ricinus communis} SCOP:
           d.3.1.1
          Length = 229

 Score =  225 bits (575), Expect = 9e-75
 Identities = 76/234 (32%), Positives = 98/234 (41%), Gaps = 85/234 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           MD AF++IK   GI TE++YPYEA                                D  C
Sbjct: 70  MDYAFEFIKQRGGITTEANYPYEAY-------------------------------DGTC 98

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
              +  + AV   G+ ++PE DE  L  AVA   PVS+AIDA    FQFYSEGV+    C
Sbjct: 99  DVSKENAPAVSIDGHENVPENDENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS-C 156

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
             T+LDH V +VGYGT  +G  YW VKNSW   WG++GYI+M R                
Sbjct: 157 -GTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERG--------------- 200

Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                                ++E  CG+A  AS+P+ 
Sbjct: 201 -----------------------------------ISDKEGLCGIAMEASYPIK 219


>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
           pathogenic protozoa, MSGPP, C protease, parasite,
           protozoa, hydrolase; 1.99A {Toxoplasma gondii}
          Length = 224

 Score =  224 bits (574), Expect = 1e-74
 Identities = 72/234 (30%), Positives = 99/234 (42%), Gaps = 86/234 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M+ AFQY+ D+ GI +E +YPY A                                D+ C
Sbjct: 76  MNDAFQYVLDSGGICSEDAYPYLAR-------------------------------DEEC 104

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           R +  +      G+ D+P   E  +KAA+A   PVSIAI+A    FQFY EGV ++  C 
Sbjct: 105 RAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGV-FDASC- 161

Query: 121 STQLDHAVLVVGYGTD-ENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
            T LDH VL+VGYGTD E+  D+W++KNSW T WG +GY+ MA +               
Sbjct: 162 GTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH--------------- 206

Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                               +  E  CG+   ASFP++
Sbjct: 207 ------------------------------------KGEEGQCGLLLDASFPVM 224


>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
           cysteine protease, zymogen, hydro; 1.40A {Fasciola
           hepatica}
          Length = 310

 Score =  227 bits (581), Expect = 1e-74
 Identities = 69/233 (29%), Positives = 95/233 (40%), Gaps = 87/233 (37%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M+ A+QY+K   G++TESSYPY A+                               +  C
Sbjct: 161 MENAYQYLK-QFGLETESSYPYTAV-------------------------------EGQC 188

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           RY +    A   G+  +  G E +LK  V   GP ++A+D     F  Y  G+Y    C+
Sbjct: 189 RYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVES-DFMMYRSGIYQSQTCS 247

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
             +++HAVL VGYGT + G DYW+VKNSW                               
Sbjct: 248 PLRVNHAVLAVGYGT-QGGTDYWIVKNSWGL----------------------------- 277

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                   +WG+ GYI+M RNR N CG+AS AS P+V
Sbjct: 278 ------------------------SWGERGYIRMVRNRGNMCGIASLASLPMV 306


>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
           HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
          Length = 214

 Score =  221 bits (567), Expect = 9e-74
 Identities = 54/233 (23%), Positives = 83/233 (35%), Gaps = 87/233 (37%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A+  IK+  G++TE  Y Y+                                   +C
Sbjct: 68  PSNAYSAIKNLGGLETEDDYSYQGH-------------------------------MQSC 96

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           ++   K+    +  V++   +E KL A +A  GP+S+AI+A     QFY  G+       
Sbjct: 97  QFSAEKAKVYIQDSVELS-QNEQKLAAWLAKRGPISVAINAF--GMQFYRHGISRP---- 149

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
                                                            P C+   +DHA
Sbjct: 150 -----------------------------------------------LRPLCSPWLIDHA 162

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
           VL+VGYG   +   +W +KNSW T WG++GY  + R     CGV + AS  +V
Sbjct: 163 VLLVGYGQ-RSDVPFWAIKNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVV 213


>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
           specificity, carboh papain family, hydrolase; HET: NAG
           FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
          Length = 221

 Score =  222 bits (567), Expect = 1e-73
 Identities = 56/172 (32%), Positives = 81/172 (47%), Gaps = 35/172 (20%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M+ AFQ+I +N GI++E +YPY                                  D  C
Sbjct: 70  MNPAFQFIVNNGGINSEETYPYRGQ-------------------------------DGIC 98

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
                        Y ++P  +E  L+ AVA   PVS+ +DA+ + FQ Y  G++    C 
Sbjct: 99  NSTVNAPVVSIDSYENVPSHNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGS-C- 155

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
           +   +HA+ VVGYGT EN  D+W+VKNSW   WG+ GYI+  RN    + +C
Sbjct: 156 NISANHALTVVGYGT-ENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKC 206


>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
           SCOP: d.3.1.1 PDB: 1meg_A*
          Length = 216

 Score =  221 bits (566), Expect = 2e-73
 Identities = 52/173 (30%), Positives = 73/173 (42%), Gaps = 37/173 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A +Y+  N GI   S YPY+A                                   C
Sbjct: 68  PPYALEYVAKN-GIHLRSKYPYKAK-------------------------------QGTC 95

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R K+     V   G   +   +E  L  A+A   PVS+ +++  + FQ Y  G++  P C
Sbjct: 96  RAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP-C 153

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
             T++DHAV  VGYG    G  Y L+KNSW T WG++GYI++ R        C
Sbjct: 154 -GTKVDHAVTAVGYGK-SGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVC 204


>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
           arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
          Length = 220

 Score =  221 bits (565), Expect = 2e-73
 Identities = 69/233 (29%), Positives = 98/233 (42%), Gaps = 87/233 (37%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M   FQ+I +N GI+TE++YPY A                                +  C
Sbjct: 70  MTDGFQFIINNGGINTEANYPYTAE-------------------------------EGQC 98

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
                +   V    Y ++P  +E+ L+ AVA   PVS+A++A+  +FQ YS G++  P C
Sbjct: 99  NLDLQQEKYVSIDTYENVPYNNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGP-C 156

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
             T +DHAV +VGYGT E G DYW+VKNSW TTWG+EGY+++ RN               
Sbjct: 157 -GTAVDHAVTIVGYGT-EGGIDYWIVKNSWGTTWGEEGYMRIQRN--------------- 199

Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
                                                     CG+A  AS+P+
Sbjct: 200 ------------------------------------VGGVGQCGIAKKASYPV 216


>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
           L-DOM domain., hydrolase; 1.63A {Tabernaemontana
           divaricata} SCOP: d.3.1.1
          Length = 215

 Score =  218 bits (557), Expect = 3e-72
 Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 36/172 (20%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M+ AFQYI  N GIDT+ +YPY A+                                 +C
Sbjct: 68  MNNAFQYIITNGGIDTQQNYPYSAV-------------------------------QGSC 96

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +  R +  +++ G+  +   +E  L++AVA+  PVS+ ++A+   FQ YS G++  P   
Sbjct: 97  KPYRLRVVSIN-GFQRVTRNNESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGP--C 152

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
            T  +H V++VGYGT ++G +YW+V+NSW   WG++GYI M RN       C
Sbjct: 153 GTAQNHGVVIVGYGT-QSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLC 203


>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
           endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
           2.20A {Hordeum vulgare}
          Length = 262

 Score =  219 bits (560), Expect = 5e-72
 Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 38/176 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           MD AF+YIK+N G+ TE++YPY A                                   C
Sbjct: 72  MDNAFEYIKNNGGLITEAAYPYRAA-------------------------------RGTC 100

Query: 61  RYKRAKS----GAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYE 116
              RA           G+ D+P   E  L  AVA   PVS+A++AS ++F FYSEGV+  
Sbjct: 101 NVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTG 159

Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
             C  T+LDH V VVGYG  E+G  YW VKNSW  +WG++GYI++ ++       C
Sbjct: 160 E-C-GTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLC 213



 Score =  120 bits (302), Expect = 2e-33
 Identities = 32/69 (46%), Positives = 46/69 (66%), Gaps = 4/69 (5%)

Query: 168 YEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENN---CGV 224
           +  EC  T+LDH V VVGYG  E+G  YW VKNSW  +WG++GYI++ ++   +   CG+
Sbjct: 157 FTGEC-GTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 215

Query: 225 ASSASFPLV 233
           A  AS+P+ 
Sbjct: 216 AMEASYPVK 224


>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
           hydrola protease, secreted, thiol protease; HET: P6G;
           1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
           3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
          Length = 222

 Score =  217 bits (556), Expect = 5e-72
 Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 36/167 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           + +  +YI+ N G+  ES Y Y A                                + +C
Sbjct: 76  IPRGIEYIQHN-GVVQESYYRYVAR-------------------------------EQSC 103

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVA-TIGPVSIAIDASH-QSFQFYSEGVYYEPE 118
           R   A+       Y  I   +  K++ A+A T   +++ I      +F+ Y      + +
Sbjct: 104 RRPNAQ-RFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD 162

Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
                  HAV +VGY     G DYW+V+NSW+T WGD GY   A N 
Sbjct: 163 NGYQPNYHAVNIVGYSN-AQGVDYWIVRNSWDTNWGDNGYGYFAANI 208


>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
           cysteine protease, house DUST mite, dermatop
           pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
           SCOP: d.3.1.1
          Length = 312

 Score =  219 bits (559), Expect = 3e-71
 Identities = 43/167 (25%), Positives = 67/167 (40%), Gaps = 36/167 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           + +  +YI+ N G+  ES Y Y A                                + +C
Sbjct: 156 IPRGIEYIQHN-GVVQESYYRYVAR-------------------------------EQSC 183

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVA-TIGPVSIAIDASH-QSFQFYSEGVYYEPE 118
           R   A+   +   Y  I   +  K++ A+A T   +++ I      +F+ Y      + +
Sbjct: 184 RRPNAQRFGIS-NYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD 242

Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
                  HAV +VGY     G DYW+V+NSW+T WGD GY   A N 
Sbjct: 243 NGYQPNYHAVNIVGYSN-AQGVDYWIVRNSWDTNWGDNGYGYFAANI 288


>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
           {Pachyrhizus erosus} PDB: 2b1n_A*
          Length = 246

 Score =  215 bits (550), Expect = 1e-70
 Identities = 57/241 (23%), Positives = 87/241 (36%), Gaps = 93/241 (38%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
             Q+F+++  + GI +E+ YPY+A                                D  C
Sbjct: 69  HYQSFEWVVKHGGIASEADYPYKAR-------------------------------DGKC 97

Query: 61  RYKRAKSGAVDRGY-------VDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGV 113
           +    +       Y              E  L++ V    P+S++IDA    F FYS G+
Sbjct: 98  KANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLE-QPISVSIDAK--DFHFYSGGI 154

Query: 114 YYEPECNSTQ-LDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
           Y    C+S   ++H VL+VGYG+ E+G DYW+ KNSW   WG +GYI++ RN        
Sbjct: 155 YDGGNCSSPYGINHFVLIVGYGS-EDGVDYWIAKNSWGEDWGIDGYIRIQRNT------- 206

Query: 173 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
                                                       N    CG+   AS+P+
Sbjct: 207 -------------------------------------------GNLLGVCGMNYFASYPI 223

Query: 233 V 233
           +
Sbjct: 224 I 224


>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
           {Plasmodium falciparum} PDB: 3bpm_A*
          Length = 243

 Score =  215 bits (550), Expect = 1e-70
 Identities = 52/236 (22%), Positives = 83/236 (35%), Gaps = 83/236 (35%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           +  AF  + D  G+ ++  YPY + +                               + C
Sbjct: 87  ITNAFDDMIDLGGLCSQDDYPYVSNL------------------------------PETC 116

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
             KR       + YV IP+      K A+  +GP+SI+I AS   F FY  G  Y+ EC 
Sbjct: 117 NLKRCNERYTIKSYVSIPDDK---FKEALRYLGPISISIAASD-DFAFYRGGF-YDGEC- 170

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
               +HAV++VGYG  +  N+                                       
Sbjct: 171 GAAPNHAVILVGYGMKDIYNE--------------------------------------- 191

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR---ENNCGVASSASFPLV 233
                         Y+++KNSW + WG+ GYI +  +    +  C + + A  PL+
Sbjct: 192 -----DTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242


>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
           interaction, HY hydrolase inhibitor complex; 2.20A
           {Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
           3bpf_A* 3pnr_A
          Length = 241

 Score =  213 bits (545), Expect = 5e-70
 Identities = 50/236 (21%), Positives = 85/236 (36%), Gaps = 83/236 (35%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           ++ AF+ + +  GI  +  YPY +                                 + C
Sbjct: 85  INNAFEDMIELGGICPDGDYPYVSDA------------------------------PNLC 114

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
              R       + Y+ +P+     LK A+  +GP+SI++  S   F FY EG+ ++ EC 
Sbjct: 115 NIDRCTEKYGIKNYLSVPDNK---LKEALRFLGPISISVAVSD-DFAFYKEGI-FDGEC- 168

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
             QL+HAV++VG+G  E  N                                        
Sbjct: 169 GDQLNHAVMLVGFGMKEIVN---------------------------------------- 188

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNREN---NCGVASSASFPLV 233
                       + Y+++KNSW   WG+ G+I +  +       CG+ + A  PL+
Sbjct: 189 ----PLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPLI 240


>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
           HET: E64 SO4; 1.87A {Carica candamarcensis}
          Length = 213

 Score =  211 bits (541), Expect = 9e-70
 Identities = 56/173 (32%), Positives = 74/173 (42%), Gaps = 41/173 (23%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A QY+  N GI     YPYE +                                  C
Sbjct: 68  PLYALQYVA-NSGIHLRQYYPYEGV-------------------------------QRQC 95

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R  +AK   V   G   +P  +E  L   +A I PVSI ++A  ++FQ Y  G++  P C
Sbjct: 96  RASQAKGPKVKTDGVGRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGP-C 153

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
             T +DHAV  VGYG D     Y L+KNSW T WG+ GYI++ R     +  C
Sbjct: 154 -GTSIDHAVAAVGYGND-----YILIKNSWGTGWGEGGYIRIKRGSGNPQGAC 200


>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
           cysteine protease, allergen, protease, thiol protease;
           1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
           3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
           1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
           5pad_A* 6pad_A* ...
          Length = 212

 Score =  211 bits (539), Expect = 2e-69
 Identities = 47/173 (27%), Positives = 71/173 (41%), Gaps = 41/173 (23%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A Q +   +GI   ++YPYE +                                  C
Sbjct: 68  PWSALQLVA-QYGIHYRNTYPYEGV-------------------------------QRYC 95

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R +     A    G   +   +E  L  ++A   PVS+ ++A+ + FQ Y  G++  P C
Sbjct: 96  RSREKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP-C 153

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
              ++DHAV  VGYG +     Y L+KNSW T WG+ GYI++ R        C
Sbjct: 154 -GNKVDHAVAAVGYGPN-----YILIKNSWGTGWGENGYIRIKRGTGNSYGVC 200


>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
           d.3.1.1 PDB: 1gec_E*
          Length = 218

 Score =  211 bits (539), Expect = 2e-69
 Identities = 50/173 (28%), Positives = 75/173 (43%), Gaps = 37/173 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              + QY+ +N G+ T   YPY+A                                   C
Sbjct: 68  QTTSLQYVANN-GVHTSKVYPYQAK-------------------------------QYKC 95

Query: 61  R-YKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R   +        GY  +P   E     A+A   P+S+ ++A  + FQ Y  GV+  P C
Sbjct: 96  RATDKPGPKVKITGYKRVPSNCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP-C 153

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
             T+LDHAV  VGYGT  +G +Y ++KNSW   WG++GY+++ R     +  C
Sbjct: 154 -GTKLDHAVTAVGYGT-SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTC 204


>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
           E64; 2.10A {Jacaratia mexicana}
          Length = 214

 Score =  210 bits (538), Expect = 3e-69
 Identities = 63/233 (27%), Positives = 82/233 (35%), Gaps = 91/233 (39%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              + QY+ DN G+ TE  YPYE                                    C
Sbjct: 68  QTTSLQYVVDN-GVHTEREYPYEKK-------------------------------QGRC 95

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R K  K   V   GY  +P  DE  L  A+A   PVS+  D+  + FQFY  G+Y  P C
Sbjct: 96  RAKDKKGPKVYITGYKYVPANDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP-C 153

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
             T  DHAV  VGYG       Y L+KNSW   WG++GYI++ R                
Sbjct: 154 -GTNTDHAVTAVGYGKT-----YLLLKNSWGPNWGEKGYIRIKRA--------------- 192

Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
                                              +   +  CGV +S+ FP+
Sbjct: 193 -----------------------------------SGRSKGTCGVYTSSFFPI 210


>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
           cathepsin, hydrolase, glycoprotein, thiol protease; HET:
           DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
          Length = 265

 Score =  212 bits (541), Expect = 4e-69
 Identities = 46/243 (18%), Positives = 71/243 (29%), Gaps = 73/243 (30%)

Query: 1   MDQAF-QYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDN 59
               F Q I+D   +  ES+YPY  +                  C K  +       +  
Sbjct: 79  SPMEFLQIIEDYGFLPAESNYPYNYVKVGEQ-------------CPKVEDHWMNLWDNGK 125

Query: 60  CRYKRAKSGAVD---------RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYS 110
             + + +  ++D           + D  +     +K  V   G V   I A +     +S
Sbjct: 126 ILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFS 185

Query: 111 EGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEP 170
            G   +  C     DHAV +VGYG   N                                
Sbjct: 186 -GKKVKNLCGDDTADHAVNIVGYGNYVN-------------------------------- 212

Query: 171 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASF 230
                            ++     YW+V+NSW   WGDEGY K+      +C      S 
Sbjct: 213 -----------------SEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIHSV 255

Query: 231 PLV 233
            + 
Sbjct: 256 VIF 258


>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
           papaya} SCOP: d.3.1.1
          Length = 322

 Score =  211 bits (540), Expect = 3e-68
 Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 37/173 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A +Y+  N GI   S YPY+A                                   C
Sbjct: 174 PPYALEYVAKN-GIHLRSKYPYKAK-------------------------------QGTC 201

Query: 61  RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
           R K+     V   G   +   +E  L  A+A   PVS+ +++  + FQ Y  G++  P C
Sbjct: 202 RAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP-C 259

Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
             T++D AV  VGYG    G  Y L+KNSW T WG++GYI++ R        C
Sbjct: 260 -GTKVDGAVTAVGYGK-SGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVC 310


>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
           protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
           PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
           1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
           1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
           ...
          Length = 215

 Score =  206 bits (528), Expect = 9e-68
 Identities = 57/167 (34%), Positives = 83/167 (49%), Gaps = 36/167 (21%)

Query: 1   MDQAFQYI--KDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDD 58
           M+ AF++I  ++N  + TE SYPY +                                  
Sbjct: 68  MNNAFEWIVQENNGAVYTEDSYPYAS----------------------------GEGISP 99

Query: 59  NCRYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPE 118
            C       GA   G+V++P+ DE ++ A +A  GPV++A+DAS  S+  Y+ GV     
Sbjct: 100 PCTTSGHTVGATITGHVELPQ-DEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVM--TS 154

Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
           C S QLDH VL+VGY        YW++KNSW T WG+EGYI++A+  
Sbjct: 155 CVSEQLDHGVLLVGYND-SAAVPYWIIKNSWTTQWGEEGYIRIAKGS 200


>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
           hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
           sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
           1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
          Length = 441

 Score =  190 bits (484), Expect = 1e-58
 Identities = 46/238 (19%), Positives = 73/238 (30%), Gaps = 86/238 (36%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
                     + G+  E+ +PY                                  D  C
Sbjct: 279 PYLIAGKYAQDFGLVEEACFPYTGT-------------------------------DSPC 307

Query: 61  RYKRAKSGAVDRGYVDIP----EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYE 116
           + K          Y  +       +E  +K  +   GP+++A +  +  F  Y +G+Y+ 
Sbjct: 308 KMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHH 366

Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQ 176
                                                               +P      
Sbjct: 367 TGLR------------------------------------------------DPFNPFEL 378

Query: 177 LDHAVLVVGYGTDE-NGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
            +HAVL+VGYGTD  +G DYW+VKNSW T WG+ GY ++ R   + C + S A     
Sbjct: 379 TNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATP 435


>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
           2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
           d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
          Length = 208

 Score =  172 bits (438), Expect = 3e-54
 Identities = 59/233 (25%), Positives = 85/233 (36%), Gaps = 93/233 (39%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
              A+QYI +N GIDT+++YPY+A+                                  C
Sbjct: 68  FVFAYQYIINNGGIDTQANYPYKAV-------------------------------QGPC 96

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
           +           GY  +P  +E  LK AVA + P ++AIDAS   FQ YS G++  P   
Sbjct: 97  QAA--SKVVSIDGYNGVPFCNEXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCG- 152

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
            T+L+H V +VGY       +YW+V+NSW   WG++GYI+M R                 
Sbjct: 153 -TKLNHGVTIVGYQA-----NYWIVRNSWGRYWGEKGYIRMLRV---------------- 190

Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
                                                    CG+A    +P  
Sbjct: 191 ------------------------------------GGCGLCGIARLPYYPTK 207


>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
           {Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
          Length = 277

 Score =  172 bits (438), Expect = 1e-53
 Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 22/165 (13%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
               + Y    HGI  E+   Y+A                   C K +   T     +  
Sbjct: 111 DLSVWDYAH-QHGIPDETCNNYQAKDQ---------------ECDKFNQCGTCNEFKECH 154

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
             +      V  G      G E K+ A +   GP+S  I A  +    Y+ G+Y E +  
Sbjct: 155 AIRNYTLWRV--GDYGSLSGRE-KMMAEIYANGPISCGIMA-TERLANYTGGIYAEYQD- 209

Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
           +T ++H V V G+G   +G +YW+V+NSW   WG+ G++++  + 
Sbjct: 210 TTYINHVVSVAGWGI-SDGTEYWIVRNSWGEPWGERGWLRIVTST 253


>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
           protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
           3mor_A*
          Length = 325

 Score =  170 bits (432), Expect = 5e-52
 Identities = 41/168 (24%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
            D+A+ Y   + G+ ++   PY           F    +     +              C
Sbjct: 145 PDRAWAYFS-STGLVSDYCQPYP----------FPHCSHHSKSKNGYPPCSQFNFDTPKC 193

Query: 61  RYKRAKSGAVD---RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
            Y            R +       E      +   GP  +A D  ++ F  Y+ GVY+  
Sbjct: 194 DYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDV-YEDFIAYNSGVYHHV 252

Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
                   HAV +VG+GT  NG  YW + NSWNT WG +GY  + R  
Sbjct: 253 SG-QYLGGHAVRLVGWGT-SNGVPYWKIANSWNTEWGMDGYFLIRRGS 298


>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
           papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
           1pbh_A 1mir_A
          Length = 317

 Score =  164 bits (417), Expect = 8e-50
 Identities = 40/168 (23%), Positives = 63/168 (37%), Gaps = 9/168 (5%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
             +A+ +     G+ +   Y         P  +     ++       + +         C
Sbjct: 139 PAEAWNFWTR-KGLVSGGLYESHVGC--RPYSIPPCEHHVNGSRPPCTGEGDTPKCSKIC 195

Query: 61  RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
               + +   D+ Y          E  + A +   GPV  A    +  F  Y  GVY   
Sbjct: 196 EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVYQHV 254

Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
                   HA+ ++G+G  ENG  YWLV NSWNT WGD G+ K+ R +
Sbjct: 255 -TGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQ 300


>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
           digestive tract, hydrolase-hydrolase INH complex; HET:
           074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
          Length = 254

 Score =  160 bits (406), Expect = 6e-49
 Identities = 42/168 (25%), Positives = 61/168 (36%), Gaps = 8/168 (4%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           +  A+ Y     GI T SS    A     P                     T       C
Sbjct: 77  LGPAWDYWVKE-GIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPR-CKQTC 134

Query: 61  RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
           + K       D+         + DE  ++  +   GPV       ++ F  Y  G+Y   
Sbjct: 135 QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTV-YEDFLNYKSGIYKHI 193

Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
               T   HA+ ++G+G  EN   YWL+ NSWN  WG+ GY ++ R R
Sbjct: 194 -TGETLGGHAIRIIGWGV-ENKAPYWLIANSWNEDWGENGYFRIVRGR 239


>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
           hydrolase, lysosome, protease, thiol protease, zymogen,
           CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
           3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
           1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
           1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
          Length = 266

 Score =  159 bits (405), Expect = 1e-48
 Identities = 40/168 (23%), Positives = 63/168 (37%), Gaps = 9/168 (5%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
             +A+ +     G+ +   Y         P  +     ++       + +         C
Sbjct: 82  PAEAWNFWTR-KGLVSGGLYESHV--GCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKIC 138

Query: 61  RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
               + +   D+ Y          E  + A +   GPV  A    +  F  Y  GVY   
Sbjct: 139 EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVYQHV 197

Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
                   HA+ ++G+G  ENG  YWLV NSWNT WGD G+ K+ R +
Sbjct: 198 -TGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQ 243


>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
           {Xylella fastidiosa}
          Length = 291

 Score =  155 bits (392), Expect = 2e-46
 Identities = 28/170 (16%), Positives = 52/170 (30%), Gaps = 24/170 (14%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
           M +    +    G+  E  +PY                                +     
Sbjct: 125 MIRDGIKVLHKLGVCPEKEWPYGDTP---------------ADPRTEEFPPGAPASKKPS 169

Query: 61  RYKRAKSGAVD-RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP-- 117
                 +       Y  +   D   LKA +A   P        + S+   +      P  
Sbjct: 170 DQCYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSV-YNSWVGNNSLPVRIPLP 227

Query: 118 -ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRV 166
            + ++ +  HAVL VGY   ++   ++ ++NSW    G++GY  M    +
Sbjct: 228 TKNDTLEGGHAVLCVGY---DDEIRHFRIRNSWGNNVGEDGYFWMPYEYI 274


>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
           peptidase_C1A, hydrolase, in form; 1.31A {Crocus
           sativus}
          Length = 222

 Score =  148 bits (374), Expect = 1e-44
 Identities = 55/175 (31%), Positives = 80/175 (45%), Gaps = 37/175 (21%)

Query: 1   MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
            D AF+++  N GI ++++YPY  +                               D  C
Sbjct: 68  ADDAFRWVITNGGIASDANYPYTGV-------------------------------DGTC 96

Query: 61  RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFY-SEGVYYEPEC 119
              +  +  +D GY ++P      L  AVA   PVS+ I  S  SFQ Y   G++    C
Sbjct: 97  DLNKPIAARID-GYTNVP-NSSSALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSC 153

Query: 120 NS--TQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
           +     +DH VL+VGYG++    DYW+VKNSW T WG +GYI + RN    +  C
Sbjct: 154 SDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIRRNTNRPDGVC 208


>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
           genomics, JO center for structural genomics, JCSG; HET:
           MSE; 2.23A {Parabacteroides distasonis}
          Length = 383

 Score = 59.1 bits (142), Expect = 2e-10
 Identities = 25/124 (20%), Positives = 42/124 (33%), Gaps = 10/124 (8%)

Query: 94  PVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTW 153
            ++   D S   F    +GV   P+      +    + G             K       
Sbjct: 243 TIAWGSDVSESGF--TRDGVAVMPD-----DEKVQELSGSDMAHWLKLKPEEKKLNTKPQ 295

Query: 154 GDEGYIKMARNRVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIK 213
             +   +  R   Y   +   T  DH + + G   D+ GN+Y++VKNSW T     G   
Sbjct: 296 PQKWCTQAERQLAY---DNYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWY 352

Query: 214 MARN 217
            ++ 
Sbjct: 353 ASKA 356



 Score = 59.1 bits (142), Expect = 2e-10
 Identities = 22/117 (18%), Positives = 38/117 (32%), Gaps = 7/117 (5%)

Query: 52  TVTSGDDNCRYKRAKSGAVDR----GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQ 107
           T+  G D       + G           ++   D             ++          Q
Sbjct: 243 TIAWGSDVSESGFTRDGVAVMPDDEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQ 302

Query: 108 FYSEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARN 164
              +  Y   +   T  DH + + G   D+ GN+Y++VKNSW T     G    ++ 
Sbjct: 303 AERQLAY---DNYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYASKA 356


>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
           protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
           PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
           1gcb_A
          Length = 457

 Score = 40.9 bits (95), Expect = 2e-04
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 2/45 (4%)

Query: 122 TQLDHAVLVVGYGTDENGND--YWLVKNSWNTTWGDEGYIKMARN 164
           + +  A+L+ G   DE       + V+NSW    G +G   M + 
Sbjct: 369 SLMTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVMTQK 413



 Score = 40.9 bits (95), Expect = 2e-04
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 2/45 (4%)

Query: 175 TQLDHAVLVVGYGTDENGND--YWLVKNSWNTTWGDEGYIKMARN 217
           + +  A+L+ G   DE       + V+NSW    G +G   M + 
Sbjct: 369 SLMTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVMTQK 413


>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
           SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
           SCOP: d.3.1.1 PDB: 1cb5_A
          Length = 453

 Score = 37.0 bits (85), Expect = 0.003
 Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 3/48 (6%)

Query: 120 NSTQLDHAVLVVGYGTDENGND---YWLVKNSWNTTWGDEGYIKMARN 164
             + + HA+        ++ +     W V+NSW    G +GY+ M   
Sbjct: 365 GESLMTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCMTDE 412



 Score = 37.0 bits (85), Expect = 0.003
 Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 3/48 (6%)

Query: 173 NSTQLDHAVLVVGYGTDENGND---YWLVKNSWNTTWGDEGYIKMARN 217
             + + HA+        ++ +     W V+NSW    G +GY+ M   
Sbjct: 365 GESLMTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCMTDE 412


>1e8u_A HN, hemagglutinin-neuraminidase; sialidase, hydrolase; HET: SLB
           NAG; 2.0A {Newcastle disease virus} SCOP: b.68.1.1 PDB:
           1e8t_A* 1e8v_A* 1usr_A* 1usx_A*
          Length = 454

 Score = 30.9 bits (69), Expect = 0.31
 Identities = 11/96 (11%), Positives = 24/96 (25%)

Query: 73  GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVG 132
           G +      +   +         +             ++  Y        ++  A+L + 
Sbjct: 195 GGLKPNSPSDTVQEGKYVIYKRYNDTCPDEQDYQIRMAKSSYKPGRFGGKRIQQAILSIK 254

Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYY 168
             T    +    V  +  T  G EG I       + 
Sbjct: 255 VSTSLGEDPVLTVPPNTVTLMGAEGRILTVGTSHFL 290


>3t1e_A Hemagglutinin-neuraminidase; beta-propeller, 4 helix bundle,
           neuraminidase membrane protein, ectodomain, hydrolase;
           3.30A {Newcastle disease virus}
          Length = 537

 Score = 30.1 bits (67), Expect = 0.72
 Identities = 9/96 (9%), Positives = 24/96 (25%)

Query: 73  GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVG 132
           G +      +   +         +             ++  Y        ++  A+L + 
Sbjct: 284 GGLKPSSPSDTAQEGRYVIYKRYNDTCPDEQDYQIRMAKSSYKPGRFGGKRVQQAILSIK 343

Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYY 168
             T    +    +  +  T  G EG +       + 
Sbjct: 344 VSTSLGEDPVLTIPPNTVTLMGAEGRVLTVGTSHFL 379


>1t9f_A Protein 1D10; structural genomics, PSI, protein structure
           initiative, southeast collaboratory for structural
           genomics, secsg; 2.00A {Caenorhabditis elegans} SCOP:
           b.42.6.1
          Length = 187

 Score = 29.1 bits (65), Expect = 0.89
 Identities = 14/109 (12%), Positives = 27/109 (24%), Gaps = 19/109 (17%)

Query: 103 HQSFQFY--SEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIK 160
           + +      S  V Y              V      ++ N +W +  + N        IK
Sbjct: 17  NANDGSRLHSHDVKYGSGSGQQS------VTAVKNSDDINSHWQIFPALNAKCNRGDAIK 70

Query: 161 ---------MARNRVYYEPECNSTQLDHAVLVVGYGTDENG--NDYWLV 198
                    +      +     +        V  +G++      D W V
Sbjct: 71  CGDKIRLKHLTTGTFLHSHHFTAPLSKQHQEVSAFGSEAESDTGDDWTV 119


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
           programmed cell death; HET: DTP; 6.90A {Drosophila
           melanogaster} PDB: 3iz8_A*
          Length = 1221

 Score = 29.8 bits (66), Expect = 1.0
 Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 11/68 (16%)

Query: 6   QYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVT-SGDDNCRYKR 64
            YI DN          YE +++ +  FL +       IC K ++ + +    +D   ++ 
Sbjct: 532 PYICDNDPK-------YERLVNAILDFLPK--IEENLICSKYTDLLRIALMAEDEAIFEE 582

Query: 65  AKSGAVDR 72
           A    V R
Sbjct: 583 AHK-QVQR 589


>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
            acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
            synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
          Length = 2006

 Score = 29.2 bits (65), Expect = 1.4
 Identities = 20/116 (17%), Positives = 36/116 (31%), Gaps = 31/116 (26%)

Query: 4    AFQYIKDNHGIDTESSYP------YEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGD 57
            AF+ +K    I  ++++       Y A+ S         L  ++     R        G 
Sbjct: 1743 AFEDLKSKGLIPADATFAGHSLGEYAALASLADVMSIESLVEVV---FYR--------G- 1790

Query: 58   DNCRYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGV 113
                       AV       P  +  +    +  I P  +A   S ++ Q+  E V
Sbjct: 1791 ------MTMQVAV-------PRDELGRSNYGMIAINPGRVAASFSQEALQYVVERV 1833


>1u14_A Hypothetical UPF0244 protein YJJX; structural genomics, protein
           structure initiative, PSI, midwest center for structural
           genomics, MCSG; 1.68A {Salmonella typhimurium} SCOP:
           c.51.4.3 PDB: 1u5w_A
          Length = 172

 Score = 27.9 bits (62), Expect = 1.8
 Identities = 19/84 (22%), Positives = 27/84 (32%), Gaps = 27/84 (32%)

Query: 153 WGDEGYIKMARNRVYY----EPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGD 208
           +G E     ARNRV       P+      D  V +   G D++    W            
Sbjct: 47  FGSEETRAGARNRVDNARRLHPQA-----DFWVAIEA-GIDDDATFSW------------ 88

Query: 209 EGYIKMARNRENNCGVASSASFPL 232
                +  +     G A SA+ PL
Sbjct: 89  -----VVIDNGVQRGEARSATLPL 107


>1cv8_A Staphopain; cysteine protease, thiol protease, papain family; HET:
           E64; 1.75A {Staphylococcus aureus} SCOP: d.3.1.1
          Length = 174

 Score = 27.5 bits (60), Expect = 3.0
 Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTT 152
              N     HA+ VVG     NG +  ++ N W+  
Sbjct: 111 ESRNGMHAGHAMAVVGNAKLNNGQEVIIIWNPWDNG 146



 Score = 27.5 bits (60), Expect = 3.0
 Identities = 10/36 (27%), Positives = 15/36 (41%)

Query: 170 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTT 205
              N     HA+ VVG     NG +  ++ N W+  
Sbjct: 111 ESRNGMHAGHAMAVVGNAKLNNGQEVIIIWNPWDNG 146


>3mal_A Stromal cell-derived factor 2-like protein; trefoil fold, MIR
           motifs, unfolded protein response, putativ binding
           protein, plant protein; 1.95A {Arabidopsis thaliana}
          Length = 199

 Score = 27.6 bits (61), Expect = 3.2
 Identities = 18/80 (22%), Positives = 28/80 (35%), Gaps = 12/80 (15%)

Query: 130 VVGYGTDENGNDYWLVKNSWNTTWGDEGYIK---------MARNRVYYEPECNSTQLDHA 180
           V G+    + N YW+VK    TT      +K         M   +  +     S    + 
Sbjct: 51  VTGFPGVVDSNSYWIVKPVPGTTEKQGDAVKSGATIRLQHMKTRKWLHSHLHASPISGNL 110

Query: 181 VLVVGYGTDENG--NDYWLV 198
             V  +G D N    D+W +
Sbjct: 111 E-VSCFGDDTNSDTGDHWKL 129


>1tk7_A CG4244-PB; WW domain, notch, signaling protein; NMR {Drosophila
           melanogaster} SCOP: b.72.1.1 b.72.1.1
          Length = 88

 Score = 26.0 bits (57), Expect = 4.3
 Identities = 12/76 (15%), Positives = 23/76 (30%), Gaps = 9/76 (11%)

Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHAVLVVGYGTDENG 192
                +   Y++   +  T W D       R +       N   L     +        G
Sbjct: 19  KKIQSDNRVYFVNHKNRTTQWED------PRTQGQEVSLINEGPLPPGWEIR---YTAAG 69

Query: 193 NDYWLVKNSWNTTWGD 208
             +++  N+  TT+ D
Sbjct: 70  ERFFVDHNTRRTTFED 85


>1zwy_A Hypothetical UPF0244 protein VC0702; hypothetical protein,
           structural genomics, PSI, protein STRU initiative; 1.90A
           {Vibrio cholerae} SCOP: c.51.4.3 PDB: 1zno_A
          Length = 185

 Score = 27.1 bits (60), Expect = 4.4
 Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 27/84 (32%)

Query: 153 WGDEGYIKMARNRVYY----EPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGD 208
             DE   + A NRV       P       ++ V +   G +EN    W++          
Sbjct: 57  MSDEETKQGALNRVRNAKQRHPGA-----EYYVGLEA-GIEENKTFAWMI---------- 100

Query: 209 EGYIKMARNRENNCGVASSASFPL 232
              +      +   G + SA   L
Sbjct: 101 ---V----ESDQQRGESRSACLML 117


>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
           PSI-2, protein structure initiative; 2.10A {Bacillus
           anthracis}
          Length = 236

 Score = 26.7 bits (58), Expect = 7.1
 Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 9/113 (7%)

Query: 74  YVDIPEGDEYKLKAAVATIGPV--SIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVV 131
            VD+      +L  +V    PV        +      ++       + + T  +H V+++
Sbjct: 124 AVDLTGKSIEELYKSVKAGQPVVIITNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLI 183

Query: 132 GYGTDENG---NDYWLVKNSWNTTWGD--EGYIKMARNRVYYEPECNSTQLDH 179
           GY  D+      D                + +++M    + Y          H
Sbjct: 184 GY--DQESVYIRDPLKDSLDVKVPREKFEQAWVQMGSQAISYVKRSKEGHHHH 234


  Database: pdb70
    Posted date:  Sep 4, 2012  3:40 AM
  Number of letters in database: 6,701,793
  Number of sequences in database:  27,921
  
Lambda     K      H
   0.317    0.133    0.421 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,675,998
Number of extensions: 208274
Number of successful extensions: 823
Number of sequences better than 10.0: 1
Number of HSP's gapped: 604
Number of HSP's successfully gapped: 117
Length of query: 233
Length of database: 6,701,793
Length adjustment: 90
Effective length of query: 143
Effective length of database: 4,188,903
Effective search space: 599013129
Effective search space used: 599013129
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.1 bits)