RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy403
(233 letters)
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 242 bits (619), Expect = 1e-81
Identities = 84/233 (36%), Positives = 108/233 (46%), Gaps = 85/233 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M AFQY++ N GID+E +YPY +++C
Sbjct: 68 MTNAFQYVQKNRGIDSEDAYPYVGQ-------------------------------EESC 96
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
Y A RGY +IPEG+E LK AVA +GPVS+AIDAS SFQFYS+GVYY+ CN
Sbjct: 97 MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCN 156
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
S L+HAVL VGYG + GN +W++KNSW
Sbjct: 157 SDNLNHAVLAVGYGI-QKGNKHWIIKNSWGE----------------------------- 186
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
WG++GYI MARN+ N CG+A+ ASFP +
Sbjct: 187 ------------------------NWGNKGYILMARNKNNACGIANLASFPKM 215
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 241 bits (618), Expect = 2e-81
Identities = 88/233 (37%), Positives = 116/233 (49%), Gaps = 82/233 (35%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
MD AFQY++DN G+D+E SYPYEA +++C
Sbjct: 70 MDYAFQYVQDNGGLDSEESYPYEAT-------------------------------EESC 98
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+Y S A D G+VDIP E L AVAT+GP+S+AIDA H+SF FY EG+Y+EP+C+
Sbjct: 99 KYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS 157
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
S +DH VLVVGYG +
Sbjct: 158 SEDMDHGVLVVGYGFE-------------------------------------------- 173
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
T+ + N YWLVKNSW WG GY+KMA++R N+CG+AS+AS+P V
Sbjct: 174 ------STESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 239 bits (613), Expect = 1e-80
Identities = 63/232 (27%), Positives = 91/232 (39%), Gaps = 85/232 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
QAF+YI+ N GI E +YPY+ DD+C
Sbjct: 71 PSQAFEYIRYNKGIMGEDTYPYKGQ-------------------------------DDHC 99
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+++ K+ A + +I DE + AVA PVS A + ++ F Y +G+Y C+
Sbjct: 100 KFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTN-DFLMYRKGIYSSTSCH 158
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
T +++HA
Sbjct: 159 KT---------------------------------------------------PDKVNHA 167
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
VL VGYG ENG YW+VKNSW WG GY + R + N CG+A+ AS+P+
Sbjct: 168 VLAVGYGE-ENGIPYWIVKNSWGPQWGMNGYFLIERGK-NMCGLAACASYPI 217
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 238 bits (609), Expect = 4e-80
Identities = 81/233 (34%), Positives = 102/233 (43%), Gaps = 86/233 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M AFQYI DN GID+++SYPY+AM D C
Sbjct: 72 MTTAFQYIIDNKGIDSDASYPYKAM-------------------------------DQKC 100
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+Y A Y ++P G E LK AVA GPVS+ +DA H SF Y GVYYEP C
Sbjct: 101 QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC- 159
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+ ++H VLVVGYG NG +YWLVKNSW
Sbjct: 160 TQNVNHGVLVVGYGD-LNGKEYWLVKNSWGH----------------------------- 189
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
+G+EGYI+MARN+ N+CG+AS S+P +
Sbjct: 190 ------------------------NFGEEGYIRMARNKGNHCGIASFPSYPEI 218
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 240 bits (615), Expect = 2e-79
Identities = 77/233 (33%), Positives = 93/233 (39%), Gaps = 86/233 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M+ AF Y+ N GID+E +YPYE D NC
Sbjct: 185 MNDAFTYVAQNGGIDSEGAYPYEMA-------------------------------DGNC 213
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
Y + A GYV + DE L VAT GPV++A DA F YS GVYY P C
Sbjct: 214 HYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADD-PFGSYSGGVYYNPTCE 272
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+ + HAVL+VGYG ENG DYWLVKNSW
Sbjct: 273 TNKFTHAVLIVGYGN-ENGQDYWLVKNSWGD----------------------------- 302
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
WG +GY K+ARN N+CG+A AS P +
Sbjct: 303 ------------------------GWGLDGYFKIARNANNHCGIAGVASVPTL 331
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 239 bits (612), Expect = 3e-79
Identities = 88/233 (37%), Positives = 116/233 (49%), Gaps = 82/233 (35%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
MD AFQY++DN G+D+E SYPYEA +++C
Sbjct: 166 MDYAFQYVQDNGGLDSEESYPYEAT-------------------------------EESC 194
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+Y S A D G+VDIP E L AVAT+GP+S+AIDA H+SF FY EG+Y+EP+C+
Sbjct: 195 KYNPKYSVANDAGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCS 253
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
S +DH VLVVGYG +
Sbjct: 254 SEDMDHGVLVVGYGFE-------------------------------------------- 269
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
T+ + N YWLVKNSW WG GY+KMA++R N+CG+AS+AS+P V
Sbjct: 270 ------STESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 316
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 239 bits (611), Expect = 4e-79
Identities = 84/233 (36%), Positives = 108/233 (46%), Gaps = 85/233 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M AFQY++ N GID+E +YPY +++C
Sbjct: 167 MTNAFQYVQKNRGIDSEDAYPYVGQ-------------------------------EESC 195
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
Y A RGY +IPEG+E LK AVA +GPVS+AIDAS SFQFYS+GVYY+ CN
Sbjct: 196 MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCN 255
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
S L+HAVL VGYG + GN +W++KNSW
Sbjct: 256 SDNLNHAVLAVGYGI-QKGNKHWIIKNSWGE----------------------------- 285
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
WG++GYI MARN+ N CG+A+ ASFP +
Sbjct: 286 ------------------------NWGNKGYILMARNKNNACGIANLASFPKM 314
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 234 bits (600), Expect = 2e-77
Identities = 81/233 (34%), Positives = 102/233 (43%), Gaps = 86/233 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M AFQYI DN GID+++SYPY+AM D C
Sbjct: 169 MTTAFQYIIDNKGIDSDASYPYKAM-------------------------------DQKC 197
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+Y A Y ++P G E LK AVA GPVS+ +DA H SF Y GVYYEP C
Sbjct: 198 QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC- 256
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+ ++H VLVVGYG NG +YWLVKNSW
Sbjct: 257 TQNVNHGVLVVGYGD-LNGKEYWLVKNSWGH----------------------------- 286
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
+G+EGYI+MARN+ N+CG+AS S+P +
Sbjct: 287 ------------------------NFGEEGYIRMARNKGNHCGIASFPSYPEI 315
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 231 bits (591), Expect = 7e-76
Identities = 76/233 (32%), Positives = 104/233 (44%), Gaps = 87/233 (37%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
MD AF YI ++GI +ES+YPYEA D C
Sbjct: 184 MDSAFSYIH-DYGIMSESAYPYEAQ-------------------------------GDYC 211
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
R+ ++S GY D+P GDE L AV GPV++AIDA+ QFYS G++Y+ CN
Sbjct: 212 RFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATD-ELQFYSGGLFYDQTCN 270
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+ L+H VLVVGYG+ +NG DYW++KNSW +
Sbjct: 271 QSDLNHGVLVVGYGS-DNGQDYWILKNSWGS----------------------------- 300
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
WG+ GY + RN NNCG+A++AS+P +
Sbjct: 301 ------------------------GWGESGYWRQVRNYGNNCGIATAASYPAL 329
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 225 bits (575), Expect = 9e-75
Identities = 76/234 (32%), Positives = 98/234 (41%), Gaps = 85/234 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
MD AF++IK GI TE++YPYEA D C
Sbjct: 70 MDYAFEFIKQRGGITTEANYPYEAY-------------------------------DGTC 98
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
+ + AV G+ ++PE DE L AVA PVS+AIDA FQFYSEGV+ C
Sbjct: 99 DVSKENAPAVSIDGHENVPENDENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS-C 156
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
T+LDH V +VGYGT +G YW VKNSW WG++GYI+M R
Sbjct: 157 -GTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERG--------------- 200
Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
++E CG+A AS+P+
Sbjct: 201 -----------------------------------ISDKEGLCGIAMEASYPIK 219
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 224 bits (574), Expect = 1e-74
Identities = 72/234 (30%), Positives = 99/234 (42%), Gaps = 86/234 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M+ AFQY+ D+ GI +E +YPY A D+ C
Sbjct: 76 MNDAFQYVLDSGGICSEDAYPYLAR-------------------------------DEEC 104
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
R + + G+ D+P E +KAA+A PVSIAI+A FQFY EGV ++ C
Sbjct: 105 RAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGV-FDASC- 161
Query: 121 STQLDHAVLVVGYGTD-ENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
T LDH VL+VGYGTD E+ D+W++KNSW T WG +GY+ MA +
Sbjct: 162 GTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH--------------- 206
Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
+ E CG+ ASFP++
Sbjct: 207 ------------------------------------KGEEGQCGLLLDASFPVM 224
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 227 bits (581), Expect = 1e-74
Identities = 69/233 (29%), Positives = 95/233 (40%), Gaps = 87/233 (37%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M+ A+QY+K G++TESSYPY A+ + C
Sbjct: 161 MENAYQYLK-QFGLETESSYPYTAV-------------------------------EGQC 188
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
RY + A G+ + G E +LK V GP ++A+D F Y G+Y C+
Sbjct: 189 RYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVES-DFMMYRSGIYQSQTCS 247
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+++HAVL VGYGT + G DYW+VKNSW
Sbjct: 248 PLRVNHAVLAVGYGT-QGGTDYWIVKNSWGL----------------------------- 277
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
+WG+ GYI+M RNR N CG+AS AS P+V
Sbjct: 278 ------------------------SWGERGYIRMVRNRGNMCGIASLASLPMV 306
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 221 bits (567), Expect = 9e-74
Identities = 54/233 (23%), Positives = 83/233 (35%), Gaps = 87/233 (37%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A+ IK+ G++TE Y Y+ +C
Sbjct: 68 PSNAYSAIKNLGGLETEDDYSYQGH-------------------------------MQSC 96
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
++ K+ + V++ +E KL A +A GP+S+AI+A QFY G+
Sbjct: 97 QFSAEKAKVYIQDSVELS-QNEQKLAAWLAKRGPISVAINAF--GMQFYRHGISRP---- 149
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
P C+ +DHA
Sbjct: 150 -----------------------------------------------LRPLCSPWLIDHA 162
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
VL+VGYG + +W +KNSW T WG++GY + R CGV + AS +V
Sbjct: 163 VLLVGYGQ-RSDVPFWAIKNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVV 213
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 222 bits (567), Expect = 1e-73
Identities = 56/172 (32%), Positives = 81/172 (47%), Gaps = 35/172 (20%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M+ AFQ+I +N GI++E +YPY D C
Sbjct: 70 MNPAFQFIVNNGGINSEETYPYRGQ-------------------------------DGIC 98
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
Y ++P +E L+ AVA PVS+ +DA+ + FQ Y G++ C
Sbjct: 99 NSTVNAPVVSIDSYENVPSHNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGS-C- 155
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
+ +HA+ VVGYGT EN D+W+VKNSW WG+ GYI+ RN + +C
Sbjct: 156 NISANHALTVVGYGT-ENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKC 206
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 221 bits (566), Expect = 2e-73
Identities = 52/173 (30%), Positives = 73/173 (42%), Gaps = 37/173 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A +Y+ N GI S YPY+A C
Sbjct: 68 PPYALEYVAKN-GIHLRSKYPYKAK-------------------------------QGTC 95
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R K+ V G + +E L A+A PVS+ +++ + FQ Y G++ P C
Sbjct: 96 RAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP-C 153
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
T++DHAV VGYG G Y L+KNSW T WG++GYI++ R C
Sbjct: 154 -GTKVDHAVTAVGYGK-SGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVC 204
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 221 bits (565), Expect = 2e-73
Identities = 69/233 (29%), Positives = 98/233 (42%), Gaps = 87/233 (37%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M FQ+I +N GI+TE++YPY A + C
Sbjct: 70 MTDGFQFIINNGGINTEANYPYTAE-------------------------------EGQC 98
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
+ V Y ++P +E+ L+ AVA PVS+A++A+ +FQ YS G++ P C
Sbjct: 99 NLDLQQEKYVSIDTYENVPYNNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGP-C 156
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
T +DHAV +VGYGT E G DYW+VKNSW TTWG+EGY+++ RN
Sbjct: 157 -GTAVDHAVTIVGYGT-EGGIDYWIVKNSWGTTWGEEGYMRIQRN--------------- 199
Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
CG+A AS+P+
Sbjct: 200 ------------------------------------VGGVGQCGIAKKASYPV 216
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 218 bits (557), Expect = 3e-72
Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 36/172 (20%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M+ AFQYI N GIDT+ +YPY A+ +C
Sbjct: 68 MNNAFQYIITNGGIDTQQNYPYSAV-------------------------------QGSC 96
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+ R + +++ G+ + +E L++AVA+ PVS+ ++A+ FQ YS G++ P
Sbjct: 97 KPYRLRVVSIN-GFQRVTRNNESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGP--C 152
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
T +H V++VGYGT ++G +YW+V+NSW WG++GYI M RN C
Sbjct: 153 GTAQNHGVVIVGYGT-QSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLC 203
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 219 bits (560), Expect = 5e-72
Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 38/176 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
MD AF+YIK+N G+ TE++YPY A C
Sbjct: 72 MDNAFEYIKNNGGLITEAAYPYRAA-------------------------------RGTC 100
Query: 61 RYKRAKS----GAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYE 116
RA G+ D+P E L AVA PVS+A++AS ++F FYSEGV+
Sbjct: 101 NVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTG 159
Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
C T+LDH V VVGYG E+G YW VKNSW +WG++GYI++ ++ C
Sbjct: 160 E-C-GTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLC 213
Score = 120 bits (302), Expect = 2e-33
Identities = 32/69 (46%), Positives = 46/69 (66%), Gaps = 4/69 (5%)
Query: 168 YEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENN---CGV 224
+ EC T+LDH V VVGYG E+G YW VKNSW +WG++GYI++ ++ + CG+
Sbjct: 157 FTGEC-GTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 215
Query: 225 ASSASFPLV 233
A AS+P+
Sbjct: 216 AMEASYPVK 224
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 217 bits (556), Expect = 5e-72
Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 36/167 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ + +YI+ N G+ ES Y Y A + +C
Sbjct: 76 IPRGIEYIQHN-GVVQESYYRYVAR-------------------------------EQSC 103
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVA-TIGPVSIAIDASH-QSFQFYSEGVYYEPE 118
R A+ Y I + K++ A+A T +++ I +F+ Y + +
Sbjct: 104 RRPNAQ-RFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD 162
Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
HAV +VGY G DYW+V+NSW+T WGD GY A N
Sbjct: 163 NGYQPNYHAVNIVGYSN-AQGVDYWIVRNSWDTNWGDNGYGYFAANI 208
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 219 bits (559), Expect = 3e-71
Identities = 43/167 (25%), Positives = 67/167 (40%), Gaps = 36/167 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ + +YI+ N G+ ES Y Y A + +C
Sbjct: 156 IPRGIEYIQHN-GVVQESYYRYVAR-------------------------------EQSC 183
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVA-TIGPVSIAIDASH-QSFQFYSEGVYYEPE 118
R A+ + Y I + K++ A+A T +++ I +F+ Y + +
Sbjct: 184 RRPNAQRFGIS-NYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD 242
Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
HAV +VGY G DYW+V+NSW+T WGD GY A N
Sbjct: 243 NGYQPNYHAVNIVGYSN-AQGVDYWIVRNSWDTNWGDNGYGYFAANI 288
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 215 bits (550), Expect = 1e-70
Identities = 57/241 (23%), Positives = 87/241 (36%), Gaps = 93/241 (38%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
Q+F+++ + GI +E+ YPY+A D C
Sbjct: 69 HYQSFEWVVKHGGIASEADYPYKAR-------------------------------DGKC 97
Query: 61 RYKRAKSGAVDRGY-------VDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGV 113
+ + Y E L++ V P+S++IDA F FYS G+
Sbjct: 98 KANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLE-QPISVSIDAK--DFHFYSGGI 154
Query: 114 YYEPECNSTQ-LDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
Y C+S ++H VL+VGYG+ E+G DYW+ KNSW WG +GYI++ RN
Sbjct: 155 YDGGNCSSPYGINHFVLIVGYGS-EDGVDYWIAKNSWGEDWGIDGYIRIQRNT------- 206
Query: 173 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
N CG+ AS+P+
Sbjct: 207 -------------------------------------------GNLLGVCGMNYFASYPI 223
Query: 233 V 233
+
Sbjct: 224 I 224
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 215 bits (550), Expect = 1e-70
Identities = 52/236 (22%), Positives = 83/236 (35%), Gaps = 83/236 (35%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ AF + D G+ ++ YPY + + + C
Sbjct: 87 ITNAFDDMIDLGGLCSQDDYPYVSNL------------------------------PETC 116
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
KR + YV IP+ K A+ +GP+SI+I AS F FY G Y+ EC
Sbjct: 117 NLKRCNERYTIKSYVSIPDDK---FKEALRYLGPISISIAASD-DFAFYRGGF-YDGEC- 170
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
+HAV++VGYG + N+
Sbjct: 171 GAAPNHAVILVGYGMKDIYNE--------------------------------------- 191
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR---ENNCGVASSASFPLV 233
Y+++KNSW + WG+ GYI + + + C + + A PL+
Sbjct: 192 -----DTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 213 bits (545), Expect = 5e-70
Identities = 50/236 (21%), Positives = 85/236 (36%), Gaps = 83/236 (35%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
++ AF+ + + GI + YPY + + C
Sbjct: 85 INNAFEDMIELGGICPDGDYPYVSDA------------------------------PNLC 114
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
R + Y+ +P+ LK A+ +GP+SI++ S F FY EG+ ++ EC
Sbjct: 115 NIDRCTEKYGIKNYLSVPDNK---LKEALRFLGPISISVAVSD-DFAFYKEGI-FDGEC- 168
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
QL+HAV++VG+G E N
Sbjct: 169 GDQLNHAVMLVGFGMKEIVN---------------------------------------- 188
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNREN---NCGVASSASFPLV 233
+ Y+++KNSW WG+ G+I + + CG+ + A PL+
Sbjct: 189 ----PLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPLI 240
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 211 bits (541), Expect = 9e-70
Identities = 56/173 (32%), Positives = 74/173 (42%), Gaps = 41/173 (23%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A QY+ N GI YPYE + C
Sbjct: 68 PLYALQYVA-NSGIHLRQYYPYEGV-------------------------------QRQC 95
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R +AK V G +P +E L +A I PVSI ++A ++FQ Y G++ P C
Sbjct: 96 RASQAKGPKVKTDGVGRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGP-C 153
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
T +DHAV VGYG D Y L+KNSW T WG+ GYI++ R + C
Sbjct: 154 -GTSIDHAVAAVGYGND-----YILIKNSWGTGWGEGGYIRIKRGSGNPQGAC 200
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 211 bits (539), Expect = 2e-69
Identities = 47/173 (27%), Positives = 71/173 (41%), Gaps = 41/173 (23%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A Q + +GI ++YPYE + C
Sbjct: 68 PWSALQLVA-QYGIHYRNTYPYEGV-------------------------------QRYC 95
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R + A G + +E L ++A PVS+ ++A+ + FQ Y G++ P C
Sbjct: 96 RSREKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP-C 153
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
++DHAV VGYG + Y L+KNSW T WG+ GYI++ R C
Sbjct: 154 -GNKVDHAVAAVGYGPN-----YILIKNSWGTGWGENGYIRIKRGTGNSYGVC 200
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 211 bits (539), Expect = 2e-69
Identities = 50/173 (28%), Positives = 75/173 (43%), Gaps = 37/173 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ QY+ +N G+ T YPY+A C
Sbjct: 68 QTTSLQYVANN-GVHTSKVYPYQAK-------------------------------QYKC 95
Query: 61 R-YKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R + GY +P E A+A P+S+ ++A + FQ Y GV+ P C
Sbjct: 96 RATDKPGPKVKITGYKRVPSNCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP-C 153
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
T+LDHAV VGYGT +G +Y ++KNSW WG++GY+++ R + C
Sbjct: 154 -GTKLDHAVTAVGYGT-SDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTC 204
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 210 bits (538), Expect = 3e-69
Identities = 63/233 (27%), Positives = 82/233 (35%), Gaps = 91/233 (39%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ QY+ DN G+ TE YPYE C
Sbjct: 68 QTTSLQYVVDN-GVHTEREYPYEKK-------------------------------QGRC 95
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R K K V GY +P DE L A+A PVS+ D+ + FQFY G+Y P C
Sbjct: 96 RAKDKKGPKVYITGYKYVPANDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP-C 153
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDH 179
T DHAV VGYG Y L+KNSW WG++GYI++ R
Sbjct: 154 -GTNTDHAVTAVGYGKT-----YLLLKNSWGPNWGEKGYIRIKRA--------------- 192
Query: 180 AVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPL 232
+ + CGV +S+ FP+
Sbjct: 193 -----------------------------------SGRSKGTCGVYTSSFFPI 210
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 212 bits (541), Expect = 4e-69
Identities = 46/243 (18%), Positives = 71/243 (29%), Gaps = 73/243 (30%)
Query: 1 MDQAF-QYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDN 59
F Q I+D + ES+YPY + C K + +
Sbjct: 79 SPMEFLQIIEDYGFLPAESNYPYNYVKVGEQ-------------CPKVEDHWMNLWDNGK 125
Query: 60 CRYKRAKSGAVD---------RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYS 110
+ + + ++D + D + +K V G V I A + +S
Sbjct: 126 ILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFS 185
Query: 111 EGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEP 170
G + C DHAV +VGYG N
Sbjct: 186 -GKKVKNLCGDDTADHAVNIVGYGNYVN-------------------------------- 212
Query: 171 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASF 230
++ YW+V+NSW WGDEGY K+ +C S
Sbjct: 213 -----------------SEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIHSV 255
Query: 231 PLV 233
+
Sbjct: 256 VIF 258
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 211 bits (540), Expect = 3e-68
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 37/173 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A +Y+ N GI S YPY+A C
Sbjct: 174 PPYALEYVAKN-GIHLRSKYPYKAK-------------------------------QGTC 201
Query: 61 RYKRAKSGAV-DRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPEC 119
R K+ V G + +E L A+A PVS+ +++ + FQ Y G++ P C
Sbjct: 202 RAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP-C 259
Query: 120 NSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
T++D AV VGYG G Y L+KNSW T WG++GYI++ R C
Sbjct: 260 -GTKVDGAVTAVGYGK-SGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVC 310
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 206 bits (528), Expect = 9e-68
Identities = 57/167 (34%), Positives = 83/167 (49%), Gaps = 36/167 (21%)
Query: 1 MDQAFQYI--KDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDD 58
M+ AF++I ++N + TE SYPY +
Sbjct: 68 MNNAFEWIVQENNGAVYTEDSYPYAS----------------------------GEGISP 99
Query: 59 NCRYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPE 118
C GA G+V++P+ DE ++ A +A GPV++A+DAS S+ Y+ GV
Sbjct: 100 PCTTSGHTVGATITGHVELPQ-DEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVM--TS 154
Query: 119 CNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
C S QLDH VL+VGY YW++KNSW T WG+EGYI++A+
Sbjct: 155 CVSEQLDHGVLLVGYND-SAAVPYWIIKNSWTTQWGEEGYIRIAKGS 200
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 190 bits (484), Expect = 1e-58
Identities = 46/238 (19%), Positives = 73/238 (30%), Gaps = 86/238 (36%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ G+ E+ +PY D C
Sbjct: 279 PYLIAGKYAQDFGLVEEACFPYTGT-------------------------------DSPC 307
Query: 61 RYKRAKSGAVDRGYVDIP----EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYE 116
+ K Y + +E +K + GP+++A + + F Y +G+Y+
Sbjct: 308 KMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHH 366
Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQ 176
+P
Sbjct: 367 TGLR------------------------------------------------DPFNPFEL 378
Query: 177 LDHAVLVVGYGTDE-NGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
+HAVL+VGYGTD +G DYW+VKNSW T WG+ GY ++ R + C + S A
Sbjct: 379 TNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATP 435
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 172 bits (438), Expect = 3e-54
Identities = 59/233 (25%), Positives = 85/233 (36%), Gaps = 93/233 (39%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
A+QYI +N GIDT+++YPY+A+ C
Sbjct: 68 FVFAYQYIINNGGIDTQANYPYKAV-------------------------------QGPC 96
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+ GY +P +E LK AVA + P ++AIDAS FQ YS G++ P
Sbjct: 97 QAA--SKVVSIDGYNGVPFCNEXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCG- 152
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHA 180
T+L+H V +VGY +YW+V+NSW WG++GYI+M R
Sbjct: 153 -TKLNHGVTIVGYQA-----NYWIVRNSWGRYWGEKGYIRMLRV---------------- 190
Query: 181 VLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRENNCGVASSASFPLV 233
CG+A +P
Sbjct: 191 ------------------------------------GGCGLCGIARLPYYPTK 207
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 172 bits (438), Expect = 1e-53
Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 22/165 (13%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ Y HGI E+ Y+A C K + T +
Sbjct: 111 DLSVWDYAH-QHGIPDETCNNYQAKDQ---------------ECDKFNQCGTCNEFKECH 154
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECN 120
+ V G G E K+ A + GP+S I A + Y+ G+Y E +
Sbjct: 155 AIRNYTLWRV--GDYGSLSGRE-KMMAEIYANGPISCGIMA-TERLANYTGGIYAEYQD- 209
Query: 121 STQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
+T ++H V V G+G +G +YW+V+NSW WG+ G++++ +
Sbjct: 210 TTYINHVVSVAGWGI-SDGTEYWIVRNSWGEPWGERGWLRIVTST 253
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 170 bits (432), Expect = 5e-52
Identities = 41/168 (24%), Positives = 61/168 (36%), Gaps = 17/168 (10%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
D+A+ Y + G+ ++ PY F + + C
Sbjct: 145 PDRAWAYFS-STGLVSDYCQPYP----------FPHCSHHSKSKNGYPPCSQFNFDTPKC 193
Query: 61 RYKRAKSGAVD---RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
Y R + E + GP +A D ++ F Y+ GVY+
Sbjct: 194 DYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDV-YEDFIAYNSGVYHHV 252
Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
HAV +VG+GT NG YW + NSWNT WG +GY + R
Sbjct: 253 SG-QYLGGHAVRLVGWGT-SNGVPYWKIANSWNTEWGMDGYFLIRRGS 298
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 164 bits (417), Expect = 8e-50
Identities = 40/168 (23%), Positives = 63/168 (37%), Gaps = 9/168 (5%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+A+ + G+ + Y P + ++ + + C
Sbjct: 139 PAEAWNFWTR-KGLVSGGLYESHVGC--RPYSIPPCEHHVNGSRPPCTGEGDTPKCSKIC 195
Query: 61 RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
+ + D+ Y E + A + GPV A + F Y GVY
Sbjct: 196 EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVYQHV 254
Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
HA+ ++G+G ENG YWLV NSWNT WGD G+ K+ R +
Sbjct: 255 -TGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQ 300
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 160 bits (406), Expect = 6e-49
Identities = 42/168 (25%), Positives = 61/168 (36%), Gaps = 8/168 (4%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+ A+ Y GI T SS A P T C
Sbjct: 77 LGPAWDYWVKE-GIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPR-CKQTC 134
Query: 61 RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
+ K D+ + DE ++ + GPV ++ F Y G+Y
Sbjct: 135 QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTV-YEDFLNYKSGIYKHI 193
Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
T HA+ ++G+G EN YWL+ NSWN WG+ GY ++ R R
Sbjct: 194 -TGETLGGHAIRIIGWGV-ENKAPYWLIANSWNEDWGENGYFRIVRGR 239
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 159 bits (405), Expect = 1e-48
Identities = 40/168 (23%), Positives = 63/168 (37%), Gaps = 9/168 (5%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
+A+ + G+ + Y P + ++ + + C
Sbjct: 82 PAEAWNFWTR-KGLVSGGLYESHV--GCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKIC 138
Query: 61 RYKRAKSGAVDRGYVDIP---EGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP 117
+ + D+ Y E + A + GPV A + F Y GVY
Sbjct: 139 EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVYQHV 197
Query: 118 ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNR 165
HA+ ++G+G ENG YWLV NSWNT WGD G+ K+ R +
Sbjct: 198 -TGEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQ 243
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 155 bits (392), Expect = 2e-46
Identities = 28/170 (16%), Positives = 52/170 (30%), Gaps = 24/170 (14%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
M + + G+ E +PY +
Sbjct: 125 MIRDGIKVLHKLGVCPEKEWPYGDTP---------------ADPRTEEFPPGAPASKKPS 169
Query: 61 RYKRAKSGAVD-RGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEP-- 117
+ Y + D LKA +A P + S+ + P
Sbjct: 170 DQCYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSV-YNSWVGNNSLPVRIPLP 227
Query: 118 -ECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRV 166
+ ++ + HAVL VGY ++ ++ ++NSW G++GY M +
Sbjct: 228 TKNDTLEGGHAVLCVGY---DDEIRHFRIRNSWGNNVGEDGYFWMPYEYI 274
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 148 bits (374), Expect = 1e-44
Identities = 55/175 (31%), Positives = 80/175 (45%), Gaps = 37/175 (21%)
Query: 1 MDQAFQYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGDDNC 60
D AF+++ N GI ++++YPY + D C
Sbjct: 68 ADDAFRWVITNGGIASDANYPYTGV-------------------------------DGTC 96
Query: 61 RYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFY-SEGVYYEPEC 119
+ + +D GY ++P L AVA PVS+ I S SFQ Y G++ C
Sbjct: 97 DLNKPIAARID-GYTNVP-NSSSALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSC 153
Query: 120 NS--TQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPEC 172
+ +DH VL+VGYG++ DYW+VKNSW T WG +GYI + RN + C
Sbjct: 154 SDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIRRNTNRPDGVC 208
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 59.1 bits (142), Expect = 2e-10
Identities = 25/124 (20%), Positives = 42/124 (33%), Gaps = 10/124 (8%)
Query: 94 PVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTW 153
++ D S F +GV P+ + + G K
Sbjct: 243 TIAWGSDVSESGF--TRDGVAVMPD-----DEKVQELSGSDMAHWLKLKPEEKKLNTKPQ 295
Query: 154 GDEGYIKMARNRVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIK 213
+ + R Y + T DH + + G D+ GN+Y++VKNSW T G
Sbjct: 296 PQKWCTQAERQLAY---DNYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWY 352
Query: 214 MARN 217
++
Sbjct: 353 ASKA 356
Score = 59.1 bits (142), Expect = 2e-10
Identities = 22/117 (18%), Positives = 38/117 (32%), Gaps = 7/117 (5%)
Query: 52 TVTSGDDNCRYKRAKSGAVDR----GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQ 107
T+ G D + G ++ D ++ Q
Sbjct: 243 TIAWGSDVSESGFTRDGVAVMPDDEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQ 302
Query: 108 FYSEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIKMARN 164
+ Y + T DH + + G D+ GN+Y++VKNSW T G ++
Sbjct: 303 AERQLAY---DNYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYASKA 356
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 40.9 bits (95), Expect = 2e-04
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 2/45 (4%)
Query: 122 TQLDHAVLVVGYGTDENGND--YWLVKNSWNTTWGDEGYIKMARN 164
+ + A+L+ G DE + V+NSW G +G M +
Sbjct: 369 SLMTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVMTQK 413
Score = 40.9 bits (95), Expect = 2e-04
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 2/45 (4%)
Query: 175 TQLDHAVLVVGYGTDENGND--YWLVKNSWNTTWGDEGYIKMARN 217
+ + A+L+ G DE + V+NSW G +G M +
Sbjct: 369 SLMTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVMTQK 413
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 37.0 bits (85), Expect = 0.003
Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 3/48 (6%)
Query: 120 NSTQLDHAVLVVGYGTDENGND---YWLVKNSWNTTWGDEGYIKMARN 164
+ + HA+ ++ + W V+NSW G +GY+ M
Sbjct: 365 GESLMTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCMTDE 412
Score = 37.0 bits (85), Expect = 0.003
Identities = 11/48 (22%), Positives = 20/48 (41%), Gaps = 3/48 (6%)
Query: 173 NSTQLDHAVLVVGYGTDENGND---YWLVKNSWNTTWGDEGYIKMARN 217
+ + HA+ ++ + W V+NSW G +GY+ M
Sbjct: 365 GESLMTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCMTDE 412
>1e8u_A HN, hemagglutinin-neuraminidase; sialidase, hydrolase; HET: SLB
NAG; 2.0A {Newcastle disease virus} SCOP: b.68.1.1 PDB:
1e8t_A* 1e8v_A* 1usr_A* 1usx_A*
Length = 454
Score = 30.9 bits (69), Expect = 0.31
Identities = 11/96 (11%), Positives = 24/96 (25%)
Query: 73 GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVG 132
G + + + + ++ Y ++ A+L +
Sbjct: 195 GGLKPNSPSDTVQEGKYVIYKRYNDTCPDEQDYQIRMAKSSYKPGRFGGKRIQQAILSIK 254
Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYY 168
T + V + T G EG I +
Sbjct: 255 VSTSLGEDPVLTVPPNTVTLMGAEGRILTVGTSHFL 290
>3t1e_A Hemagglutinin-neuraminidase; beta-propeller, 4 helix bundle,
neuraminidase membrane protein, ectodomain, hydrolase;
3.30A {Newcastle disease virus}
Length = 537
Score = 30.1 bits (67), Expect = 0.72
Identities = 9/96 (9%), Positives = 24/96 (25%)
Query: 73 GYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVVG 132
G + + + + ++ Y ++ A+L +
Sbjct: 284 GGLKPSSPSDTAQEGRYVIYKRYNDTCPDEQDYQIRMAKSSYKPGRFGGKRVQQAILSIK 343
Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYY 168
T + + + T G EG + +
Sbjct: 344 VSTSLGEDPVLTIPPNTVTLMGAEGRVLTVGTSHFL 379
>1t9f_A Protein 1D10; structural genomics, PSI, protein structure
initiative, southeast collaboratory for structural
genomics, secsg; 2.00A {Caenorhabditis elegans} SCOP:
b.42.6.1
Length = 187
Score = 29.1 bits (65), Expect = 0.89
Identities = 14/109 (12%), Positives = 27/109 (24%), Gaps = 19/109 (17%)
Query: 103 HQSFQFY--SEGVYYEPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGDEGYIK 160
+ + S V Y V ++ N +W + + N IK
Sbjct: 17 NANDGSRLHSHDVKYGSGSGQQS------VTAVKNSDDINSHWQIFPALNAKCNRGDAIK 70
Query: 161 ---------MARNRVYYEPECNSTQLDHAVLVVGYGTDENG--NDYWLV 198
+ + + V +G++ D W V
Sbjct: 71 CGDKIRLKHLTTGTFLHSHHFTAPLSKQHQEVSAFGSEAESDTGDDWTV 119
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 29.8 bits (66), Expect = 1.0
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 11/68 (16%)
Query: 6 QYIKDNHGIDTESSYPYEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVT-SGDDNCRYKR 64
YI DN YE +++ + FL + IC K ++ + + +D ++
Sbjct: 532 PYICDNDPK-------YERLVNAILDFLPK--IEENLICSKYTDLLRIALMAEDEAIFEE 582
Query: 65 AKSGAVDR 72
A V R
Sbjct: 583 AHK-QVQR 589
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 29.2 bits (65), Expect = 1.4
Identities = 20/116 (17%), Positives = 36/116 (31%), Gaps = 31/116 (26%)
Query: 4 AFQYIKDNHGIDTESSYP------YEAMMSFLPSFLFRRLCYLLTICHKRSNQVTVTSGD 57
AF+ +K I ++++ Y A+ S L ++ R G
Sbjct: 1743 AFEDLKSKGLIPADATFAGHSLGEYAALASLADVMSIESLVEVV---FYR--------G- 1790
Query: 58 DNCRYKRAKSGAVDRGYVDIPEGDEYKLKAAVATIGPVSIAIDASHQSFQFYSEGV 113
AV P + + + I P +A S ++ Q+ E V
Sbjct: 1791 ------MTMQVAV-------PRDELGRSNYGMIAINPGRVAASFSQEALQYVVERV 1833
>1u14_A Hypothetical UPF0244 protein YJJX; structural genomics, protein
structure initiative, PSI, midwest center for structural
genomics, MCSG; 1.68A {Salmonella typhimurium} SCOP:
c.51.4.3 PDB: 1u5w_A
Length = 172
Score = 27.9 bits (62), Expect = 1.8
Identities = 19/84 (22%), Positives = 27/84 (32%), Gaps = 27/84 (32%)
Query: 153 WGDEGYIKMARNRVYY----EPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGD 208
+G E ARNRV P+ D V + G D++ W
Sbjct: 47 FGSEETRAGARNRVDNARRLHPQA-----DFWVAIEA-GIDDDATFSW------------ 88
Query: 209 EGYIKMARNRENNCGVASSASFPL 232
+ + G A SA+ PL
Sbjct: 89 -----VVIDNGVQRGEARSATLPL 107
>1cv8_A Staphopain; cysteine protease, thiol protease, papain family; HET:
E64; 1.75A {Staphylococcus aureus} SCOP: d.3.1.1
Length = 174
Score = 27.5 bits (60), Expect = 3.0
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 117 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTT 152
N HA+ VVG NG + ++ N W+
Sbjct: 111 ESRNGMHAGHAMAVVGNAKLNNGQEVIIIWNPWDNG 146
Score = 27.5 bits (60), Expect = 3.0
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 170 PECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTT 205
N HA+ VVG NG + ++ N W+
Sbjct: 111 ESRNGMHAGHAMAVVGNAKLNNGQEVIIIWNPWDNG 146
>3mal_A Stromal cell-derived factor 2-like protein; trefoil fold, MIR
motifs, unfolded protein response, putativ binding
protein, plant protein; 1.95A {Arabidopsis thaliana}
Length = 199
Score = 27.6 bits (61), Expect = 3.2
Identities = 18/80 (22%), Positives = 28/80 (35%), Gaps = 12/80 (15%)
Query: 130 VVGYGTDENGNDYWLVKNSWNTTWGDEGYIK---------MARNRVYYEPECNSTQLDHA 180
V G+ + N YW+VK TT +K M + + S +
Sbjct: 51 VTGFPGVVDSNSYWIVKPVPGTTEKQGDAVKSGATIRLQHMKTRKWLHSHLHASPISGNL 110
Query: 181 VLVVGYGTDENG--NDYWLV 198
V +G D N D+W +
Sbjct: 111 E-VSCFGDDTNSDTGDHWKL 129
>1tk7_A CG4244-PB; WW domain, notch, signaling protein; NMR {Drosophila
melanogaster} SCOP: b.72.1.1 b.72.1.1
Length = 88
Score = 26.0 bits (57), Expect = 4.3
Identities = 12/76 (15%), Positives = 23/76 (30%), Gaps = 9/76 (11%)
Query: 133 YGTDENGNDYWLVKNSWNTTWGDEGYIKMARNRVYYEPECNSTQLDHAVLVVGYGTDENG 192
+ Y++ + T W D R + N L + G
Sbjct: 19 KKIQSDNRVYFVNHKNRTTQWED------PRTQGQEVSLINEGPLPPGWEIR---YTAAG 69
Query: 193 NDYWLVKNSWNTTWGD 208
+++ N+ TT+ D
Sbjct: 70 ERFFVDHNTRRTTFED 85
>1zwy_A Hypothetical UPF0244 protein VC0702; hypothetical protein,
structural genomics, PSI, protein STRU initiative; 1.90A
{Vibrio cholerae} SCOP: c.51.4.3 PDB: 1zno_A
Length = 185
Score = 27.1 bits (60), Expect = 4.4
Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 27/84 (32%)
Query: 153 WGDEGYIKMARNRVYY----EPECNSTQLDHAVLVVGYGTDENGNDYWLVKNSWNTTWGD 208
DE + A NRV P ++ V + G +EN W++
Sbjct: 57 MSDEETKQGALNRVRNAKQRHPGA-----EYYVGLEA-GIEENKTFAWMI---------- 100
Query: 209 EGYIKMARNRENNCGVASSASFPL 232
+ + G + SA L
Sbjct: 101 ---V----ESDQQRGESRSACLML 117
>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
PSI-2, protein structure initiative; 2.10A {Bacillus
anthracis}
Length = 236
Score = 26.7 bits (58), Expect = 7.1
Identities = 16/113 (14%), Positives = 34/113 (30%), Gaps = 9/113 (7%)
Query: 74 YVDIPEGDEYKLKAAVATIGPV--SIAIDASHQSFQFYSEGVYYEPECNSTQLDHAVLVV 131
VD+ +L +V PV + ++ + + T +H V+++
Sbjct: 124 AVDLTGKSIEELYKSVKAGQPVVIITNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLI 183
Query: 132 GYGTDENG---NDYWLVKNSWNTTWGD--EGYIKMARNRVYYEPECNSTQLDH 179
GY D+ D + +++M + Y H
Sbjct: 184 GY--DQESVYIRDPLKDSLDVKVPREKFEQAWVQMGSQAISYVKRSKEGHHHH 234
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.317 0.133 0.421
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,675,998
Number of extensions: 208274
Number of successful extensions: 823
Number of sequences better than 10.0: 1
Number of HSP's gapped: 604
Number of HSP's successfully gapped: 117
Length of query: 233
Length of database: 6,701,793
Length adjustment: 90
Effective length of query: 143
Effective length of database: 4,188,903
Effective search space: 599013129
Effective search space used: 599013129
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.1 bits)