RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy1664
(524 letters)
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 342 bits (879), Expect = e-115
Identities = 133/276 (48%), Positives = 183/276 (66%), Gaps = 15/276 (5%)
Query: 62 LTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGS 121
+ +S L+ G P R+ + +LP FDAR WP CPTI+EIRDQGS
Sbjct: 34 VDMSYLKRLCGTFLGGPKPPQRV-----MFTEDLKLPASFDAREQWPQCPTIKEIRDQGS 88
Query: 122 CGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKD-CGNGCQGGFHGKAWKYWVT 180
CGS WA GAVEA+SDR+CI + V +S++DL++CC CG+GC GG+ +AW +W
Sbjct: 89 CGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTR 148
Query: 181 TGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDL 239
G+VSGG Y S GCRPY I PCE ++NGS C TP+C + C+PGY +Y+ D
Sbjct: 149 KGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGD-TPKCSKICEPGYSPTYKQDK 207
Query: 240 NFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRII 299
++G +YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+
Sbjct: 208 HYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRIL 267
Query: 300 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GWG E GT YWLVANS+NT+WG+NG F+I
Sbjct: 268 GWGVE---NGT----PYWLVANSWNTDWGDNGFFKI 296
Score = 217 bits (555), Expect = 7e-67
Identities = 84/171 (49%), Positives = 122/171 (71%), Gaps = 9/171 (5%)
Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
G +GCRPY I PCE ++NGSR C TP+C + C+PGY +Y+ D ++G
Sbjct: 154 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGD-TPKCSKICEPGYSPTYKQDKHYGYN 212
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 213 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE 272
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 273 ---NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 316
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 335 bits (861), Expect = e-113
Identities = 120/241 (49%), Positives = 152/241 (63%), Gaps = 8/241 (3%)
Query: 96 ELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDL 155
E+P FD+R WP C +I IRDQ CGS WA GAVEAMSDR CI S GK++V LS+ DL
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61
Query: 156 VSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSSCQ 214
+SCC+ CG GC+GG G AW YWV GIV+G + + GC PY CE + G + C
Sbjct: 62 LSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121
Query: 215 DNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYAD 274
TP C + CQ Y Y D + G+ +Y++ +E+ I +EI ++GPVE T+Y D
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED 181
Query: 275 MILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFR 334
+ YK+GIYKH+ G LG HAIRIIGWG E YWL+ANS+N +WGENG FR
Sbjct: 182 FLNYKSGIYKHITGETLGGHAIRIIGWGVE---NKA----PYWLIANSWNEDWGENGYFR 234
Query: 335 I 335
I
Sbjct: 235 I 235
Score = 211 bits (540), Expect = 2e-65
Identities = 75/169 (44%), Positives = 101/169 (59%), Gaps = 8/169 (4%)
Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
G + GC PY CE + G C + TP C + CQ Y Y D + G+
Sbjct: 92 GSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKS 151
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+Y++ +E+ I +EI ++GPVE T+Y D + YK+GIYKH+ G LG HAIRIIGWG E
Sbjct: 152 SYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVE 211
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
YWL+ANS+N +WGENG FRIVRG++EC IE+++TAG
Sbjct: 212 ---NKA----PYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAGRI 253
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 333 bits (856), Expect = e-112
Identities = 127/243 (52%), Positives = 173/243 (71%), Gaps = 10/243 (4%)
Query: 95 EELPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDD 154
+LP FDAR WP CPTI+EIRDQGSCGS WA GAVEA+SDR+CI + V +S++D
Sbjct: 5 LKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAED 64
Query: 155 LVSCCKD-CGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHSS 212
L++CC CG+GC GG+ +AW +W G+VSGG Y S GCRPY I PCE ++NG+
Sbjct: 65 LLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPP 124
Query: 213 CQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIY 272
C TP+C + C+PGY +Y+ D ++G +YS+ +E+ IM EI+++GPVEG+ ++Y
Sbjct: 125 CTGEGD-TPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY 183
Query: 273 ADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGL 332
+D +LYK+G+Y+HV G +G HAIRI+GWG E GT YWLVANS+NT+WG+NG
Sbjct: 184 SDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE---NGT----PYWLVANSWNTDWGDNGF 236
Query: 333 FRI 335
F+I
Sbjct: 237 FKI 239
Score = 216 bits (553), Expect = 4e-67
Identities = 83/171 (48%), Positives = 122/171 (71%), Gaps = 9/171 (5%)
Query: 328 GENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
G +GCRPY I PCE ++NG+R C TP+C + C+PGY +Y+ D ++G
Sbjct: 97 GGLYESHVGCRPYSIPPCEAHVNGARPPCTGEGD-TPKCSKICEPGYSPTYKQDKHYGYN 155
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+YS+ +E+ IM EI+++GPVEG+ ++Y+D +LYK+G+Y+HV G +G HAIRI+GWG E
Sbjct: 156 SYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE 215
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
GT YWLVANS+NT+WG+NG F+I+RGQ+ CGIE+++ AG+P+
Sbjct: 216 ---NGT----PYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 259
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 312 bits (802), Expect = e-103
Identities = 101/286 (35%), Positives = 142/286 (49%), Gaps = 23/286 (8%)
Query: 54 AEKNA-LSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPT 112
A+ + + +TL E + GV + + LP FD+ WP CPT
Sbjct: 28 AKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPT 87
Query: 113 IQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHG 172
I +I DQ +CGS WA+ A AMSDR C G + V +S+ DL++CC DCG+GC GG
Sbjct: 88 IPQIADQSACGSCWAVAAASAMSDRFCTMG-GVQDVHISAGDLLACCSDCGDGCNGGDPD 146
Query: 173 KAWKYWVTTGIVSGGTYASKQGCRPYEI-PCERYMNGSHS--SCQDNEPNTPECIRKCQP 229
+AW Y+ +TG+VS C+PY C + + C +TP+C C
Sbjct: 147 RAWAYFSSTGLVSD-------YCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDD 199
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGG 289
+Y+L E+ MRE+F GP E + +Y D I Y +G+Y HV+G
Sbjct: 200 PT---IPVVNYRSWTSYALQ-GEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQ 255
Query: 290 PLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
LG HA+R++GWG G YW +ANS+NT WG +G F I
Sbjct: 256 YLGGHAVRLVGWGTS---NGV----PYWKIANSWNTEWGMDGYFLI 294
Score = 193 bits (492), Expect = 2e-57
Identities = 61/174 (35%), Positives = 84/174 (48%), Gaps = 14/174 (8%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSRS--SCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
+ GL C+PY C + C +TP+C C
Sbjct: 152 FSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT---IPVVNYR 208
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
+Y+L E+ MRE+F GP E + +Y D I Y +G+Y HV+G LG HA+R++GW
Sbjct: 209 SWTSYALQ-GEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGGHAVRLVGW 267
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLPKI 497
G G YW +ANS+NT WG +G F I RG +ECGIE +AG+P
Sbjct: 268 GTS---NGV----PYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPLA 314
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 253 bits (649), Expect = 3e-81
Identities = 64/263 (24%), Positives = 101/263 (38%), Gaps = 34/263 (12%)
Query: 77 SKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQ---GSCGSGWALGAVEA 133
+ L + P + P +LP+ +D R N R+Q CGS WA + A
Sbjct: 17 APLGRTTYPRPHEYLSP-ADLPKSWDWR-NVDGVNYASITRNQHIPQYCGSCWAHASTSA 74
Query: 134 MSDRVCIASRGKRH-VRLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
M+DR+ I +G LS +++ C C+GG W Y GI
Sbjct: 75 MADRINIKRKGAWPSTLLSVQNVIDCG--NAGSCEGGNDLSVWDYAHQHGIPD------- 125
Query: 193 QGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
+ C Y+ + + N+ T ++C + + Y +
Sbjct: 126 ETCNNYQAK-----DQEC--DKFNQCGTCNEFKECHAIRNYTLWRV-----GDYGSLSGR 173
Query: 253 ETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSS 312
E +M EI+ +GP+ + + Y GIY H + + GWG +GT
Sbjct: 174 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGIS---DGT-- 228
Query: 313 VVKYWLVANSFNTNWGENGLFRI 335
+YW+V NS+ WGE G RI
Sbjct: 229 --EYWIVRNSWGEPWGERGWLRI 249
Score = 164 bits (417), Expect = 4e-47
Identities = 42/182 (23%), Positives = 69/182 (37%), Gaps = 33/182 (18%)
Query: 327 WGENGLFRIGCRPYEI---PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNF 383
++G+ C Y+ C+++ N+ T ++C + +
Sbjct: 118 AHQHGIPDETCNNYQAKDQECDKF----------NQCGTCNEFKECHAIRNYTLWRV--- 164
Query: 384 GRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGW 443
Y + E +M EI+ +GP+ + + Y GIY H + + GW
Sbjct: 165 --GDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGW 222
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECG--------IEADITAGLP 495
G +GT +YW+V NS+ WGE G RIV + G IE T G P
Sbjct: 223 GIS---DGT----EYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDP 275
Query: 496 KI 497
+
Sbjct: 276 IV 277
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 244 bits (624), Expect = 1e-75
Identities = 78/286 (27%), Positives = 121/286 (42%), Gaps = 42/286 (14%)
Query: 57 NALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEI 116
LTL ++ R G H PL ++ + LP +D R + +
Sbjct: 167 MEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHG-INFVSPV 225
Query: 117 RDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGF-HGKAW 175
R+Q SCGS ++ ++ + R+ I + + LS ++VSC + GC+GGF + A
Sbjct: 226 RNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQ-YAQGCEGGFPYLIAG 284
Query: 176 KYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY 235
KY G+V + C PY G+ S C+ C Y Y
Sbjct: 285 KYAQDFGLVE-------EACFPYT--------GTDSPCK--------MKEDCFRYYSSEY 321
Query: 236 EDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP----- 290
+ NE + E+ HGP+ + +Y D + YK GIY H
Sbjct: 322 HYVG-----GFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPF 376
Query: 291 -LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+ ++G+G + ++S + YW+V NS+ T WGENG FRI
Sbjct: 377 ELTNHAVLLVGYGTD-----SASGMDYWIVKNSWGTGWGENGYFRI 417
Score = 156 bits (396), Expect = 2e-42
Identities = 53/175 (30%), Positives = 75/175 (42%), Gaps = 32/175 (18%)
Query: 327 WGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRI 386
+ GL C PY G+ S C+ C Y Y
Sbjct: 287 AQDFGLVEEACFPYT--------GTDSPCK--------MKEDCFRYYSSEYHYVG----- 325
Query: 387 AYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGP------LGEHAIRI 440
+ NE + E+ HGP+ + +Y D + YK GIY H L HA+ +
Sbjct: 326 GFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLL 385
Query: 441 IGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGIEADITAGLP 495
+G+G + ++S + YW+V NS+ T WGENG FRI RG +EC IE+ A P
Sbjct: 386 VGYGTD-----SASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 162 bits (412), Expect = 2e-46
Identities = 47/279 (16%), Positives = 81/279 (29%), Gaps = 45/279 (16%)
Query: 70 RMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQEIRDQGSCGSGWALG 129
G PD + R + LP D + + DQG GS A
Sbjct: 32 GYGYIPD--IADIRDFSYTPEKSVIAALPPKVDLTPPFQ-------VYDQGRIGSCTANA 82
Query: 130 AVEAMSDRVCIASRGKRHVRLSSDDLVSCCK--DCGNGCQGGFHGKAWKYWVTTGIVSGG 187
A+ + + + K N G K G+
Sbjct: 83 LAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEK 142
Query: 188 TYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS 247
+ PY E P +P D Y+D N+ YS
Sbjct: 143 EW-------PYGDTPA---------DPRTEEFPPGAPASKKPS-DQCYKDAQNYKITEYS 185
Query: 248 -LPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK----HVAGGPLGEHAIRIIGWG 302
+ + + + + P ++Y + + + G HA+ +G+
Sbjct: 186 RVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD 245
Query: 303 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIGCRPYE 341
E ++++ + NS+ N GE+G F + PYE
Sbjct: 246 DE---------IRHFRIRNSWGNNVGEDGYFWM---PYE 272
Score = 94.5 bits (235), Expect = 9e-22
Identities = 23/169 (13%), Positives = 54/169 (31%), Gaps = 25/169 (14%)
Query: 327 WGENGLFRIGCRPYEI-PCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGR 385
+ G+ PY P + A++ + +C + Q Y
Sbjct: 133 LHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEY-------- 184
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYK----HVAGGPLGEHAIRII 441
+ + + + + P ++Y + + + G HA+ +
Sbjct: 185 --SRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCV 242
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGIEAD 489
G+ E ++++ + NS+ N GE+G F + + + D
Sbjct: 243 GYDDE---------IRHFRIRNSWGNNVGEDGYFWMPYEYISNTQLADD 282
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 120 bits (303), Expect = 4e-31
Identities = 47/249 (18%), Positives = 87/249 (34%), Gaps = 28/249 (11%)
Query: 106 NWPYCP---------TIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
N YC + ++ DQG+C + W + + C+ +G ++S+ +
Sbjct: 6 NKEYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCM--KGYEPTKISALYVA 63
Query: 157 SCCKDCGN-GCQGGFHGKAWKYWV--TTGIVSGGTY---ASKQGCRPYEIPCERYMNGSH 210
+C K C G + + + + Y K G + ++ +
Sbjct: 64 NCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDN 123
Query: 211 SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMT 270
N+ K Y+ D + A + I E+ G V +
Sbjct: 124 GKILHNKNEPNSLDGKGYTAYESERFHDN--------MDAFVKIIKTEVMNKGSVIAYIK 175
Query: 271 IYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGE 329
M +G K++ G +HA+ I+G+G EG YW+V NS+ WG+
Sbjct: 176 AENVMGYEFSGKKVKNLCGDDTADHAVNIVGYGNYVNSEGEKK--SYWIVRNSWGPYWGD 233
Query: 330 NGLFRIGCR 338
G F++
Sbjct: 234 EGYFKVDMY 242
Score = 77.4 bits (191), Expect = 4e-16
Identities = 30/152 (19%), Positives = 53/152 (34%), Gaps = 6/152 (3%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYS--LPANEET 396
PY N + + + + + + + + A +
Sbjct: 100 PYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKI 159
Query: 397 IMREIFRHGPVEGSMTIYADMILYKTGIY-KHVAGGPLGEHAIRIIGWGQEPLGEGTSSV 455
I E+ G V + M +G K++ G +HA+ I+G+G EG
Sbjct: 160 IKTEVMNKGSVIAYIKAENVMGYEFSGKKVKNLCGDDTADHAVNIVGYGNYVNSEGEKK- 218
Query: 456 VKYWLVANSFNTNWGENGLFRIVR-GQNECGI 486
YW+V NS+ WG+ G F++ G C
Sbjct: 219 -SYWIVRNSWGPYWGDEGYFKVDMYGPTHCHF 249
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 99.2 bits (248), Expect = 3e-23
Identities = 70/286 (24%), Positives = 109/286 (38%), Gaps = 56/286 (19%)
Query: 57 NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
NA +T E ++ G E P D W +
Sbjct: 61 NAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ-----EPLFYEAPRSVD----WREKGYVTP 111
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
+++QG CGS WA A A+ ++ + G+ + LS +LV C GN GC GG A
Sbjct: 112 VKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVDCSGPQGNEGCNGGLMDYA 169
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
++Y G+ S +Y PYE + SC+ N + D
Sbjct: 170 FQYVQDNGGLDSEESY-------PYE--------ATEESCKYNPKYSV--------ANDA 206
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
+ D +P E+ +M+ + GP+ S+ I A + YK GIY
Sbjct: 207 GFVD----------IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSS 254
Query: 291 LG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+H + ++G+G E + KYWLV NS+ WG G ++
Sbjct: 255 EDMDHGVLVVGYGFESTESDNN---KYWLVKNSWGEEWGMGGYVKM 297
Score = 68.8 bits (169), Expect = 6e-13
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 10/104 (9%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGW 443
+P E+ +M+ + GP+ S+ I A + YK GIY +H + ++G+
Sbjct: 209 VDIPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY 266
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
G E + KYWLV NS+ WG G ++ + + N CGI
Sbjct: 267 GFESTESDNN---KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 307
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 95.8 bits (239), Expect = 4e-22
Identities = 62/292 (21%), Positives = 102/292 (34%), Gaps = 71/292 (24%)
Query: 57 NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
N + +T E + + L V +P+ D W
Sbjct: 54 NQFTDMTFEEFKAKYLTEMSR---ASDILSHGVPYEANNRAVPDKID----WRESGYVTE 106
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQ 167
++DQG+CGSGWA M + + S LV C + GN GC
Sbjct: 107 -------VKDQGNCGSGWAFSTTGTMEGQYMKNE-RTS-ISFSEQQLVDCSRPWGNNGCG 157
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
GG A++Y G+ + +Y PY C+ N+
Sbjct: 158 GGLMENAYQYLKQFGLETESSY-------PYT--------AVEGQCRYNKQLGV------ 196
Query: 228 QPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA--DMILYKTGIYK 284
+ + + E + + GP ++ + D ++Y++GIY+
Sbjct: 197 --AKVTGFYT----------VHSGSEVELKNLVGAEGPA--AVAVDVESDFMMYRSGIYQ 242
Query: 285 HVAGGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
PL HA+ +G+G + GT YW+V NS+ +WGE G R+
Sbjct: 243 SQTCSPLRVNHAVLAVGYGTQ---GGT----DYWIVKNSWGLSWGERGYIRM 287
Score = 68.0 bits (167), Expect = 1e-12
Identities = 32/98 (32%), Positives = 52/98 (53%), Gaps = 13/98 (13%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
+E + + GP ++ + D ++Y++GIY+ PL HA+ +G+G +
Sbjct: 209 SEVELKNLVGAEGPA--AVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQ--- 263
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
GT YW+V NS+ +WGE G R+VR + N CGI
Sbjct: 264 GGT----DYWIVKNSWGLSWGERGYIRMVRNRGNMCGI 297
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 93.7 bits (234), Expect = 5e-22
Identities = 61/227 (26%), Positives = 96/227 (42%), Gaps = 46/227 (20%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QG CGS WA A A+ ++ + G+ + LS +LV C GN GC GG
Sbjct: 15 PVKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVDCSGPQGNEGCNGGLMDY 72
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A++Y G+ S +Y PYE + SC+ N + D
Sbjct: 73 AFQYVQDNGGLDSEESY-------PYE--------ATEESCKYNPKYSV--------AND 109
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
+ D +P E+ +M+ + GP+ S+ I A + YK GIY
Sbjct: 110 TGFVD----------IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCS 157
Query: 290 PLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+H + ++G+G E + KYWLV NS+ WG G ++
Sbjct: 158 SEDMDHGVLVVGYGFESTESDNN---KYWLVKNSWGEEWGMGGYVKM 201
Score = 68.3 bits (168), Expect = 3e-13
Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 10/102 (9%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQ 445
+P E+ +M+ + GP+ S+ I A + YK GIY +H + ++G+G
Sbjct: 115 IPKQEKALMKAVATVGPI--SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGF 172
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
E + KYWLV NS+ WG G ++ + + N CGI
Sbjct: 173 ESTESDNN---KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI 211
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 96.2 bits (240), Expect = 5e-22
Identities = 64/285 (22%), Positives = 110/285 (38%), Gaps = 57/285 (20%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
N ++ E L P++ L + + L D R N + E
Sbjct: 77 NQFGDMSKEEFLAYVNRGKAQK--PKHPENLRMPYVSSKKPLAASVDWRSN-----AVSE 129
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGKA 174
++DQG CGS W+ A+ ++ + G+ LS +L+ C GN GC GG+ A
Sbjct: 130 VKDQGQCGSSWSFSTTGAVEGQLALQR-GRL-TSLSEQNLIDCSSSYGNAGCDGGWMDSA 187
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
+ Y GI+S Y PYE C+ + +
Sbjct: 188 FSYIHDYGIMSESAY-------PYE--------AQGDYCRFDSSQSV--------TTLSG 224
Query: 235 YEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPL 291
Y D LP+ +E ++ + + GPV ++ I A ++ Y G++
Sbjct: 225 YYD----------LPSGDENSLADAVGQAGPV--AVAIDATDELQFYSGGLFYDQTCNQS 272
Query: 292 G-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H + ++G+G + G YW++ NS+ + WGE+G +R
Sbjct: 273 DLNHGVLVVGYGSD---NGQ----DYWILKNSWGSGWGESGYWRQ 310
Score = 64.2 bits (157), Expect = 2e-11
Identities = 27/98 (27%), Positives = 49/98 (50%), Gaps = 13/98 (13%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
+E ++ + + GPV ++ I A ++ Y G++ H + ++G+G +
Sbjct: 232 DENSLADAVGQAGPV--AVAIDATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSD--- 286
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
G YW++ NS+ + WGE+G +R VR N CGI
Sbjct: 287 NGQ----DYWILKNSWGSGWGESGYWRQVRNYGNNCGI 320
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 93.3 bits (233), Expect = 5e-22
Identities = 58/228 (25%), Positives = 92/228 (40%), Gaps = 60/228 (26%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++Q CGS WA V + I + G+ + LS +L+ C + + GC GG+
Sbjct: 15 PVKNQNPCGSCWAFSTVATIEGINKIIT-GQL-ISLSEQELLDC--ERRSHGCDGGYQTT 70
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+ +Y V G+ + Y PYE C+ + P+ Y
Sbjct: 71 SLQYVVDNGVHTEREY-------PYE--------KKQGRCRAKDKKGPK-------VYIT 108
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
Y+ +PAN+E + + + PV S+ + YK GIY+ GP
Sbjct: 109 GYKY----------VPANDEISLIQAIANQPV--SVVTDSRGRGFQFYKGGIYE----GP 152
Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ +G+G+ Y L+ NS+ NWGE G RI
Sbjct: 153 CGTNTDHAVTAVGYGK-----------TYLLLKNSWGPNWGEKGYIRI 189
Score = 65.6 bits (161), Expect = 2e-12
Identities = 41/158 (25%), Positives = 62/158 (39%), Gaps = 52/158 (32%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PYE + C+A + P+ Y Y+ +PAN+E +
Sbjct: 87 PYE--------KKQGRCRAKDKKGPK-------VYITGYKY----------VPANDEISL 121
Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
+ + PV S+ + YK GIY+ GP G HA+ +G+G+
Sbjct: 122 IQAIANQPV--SVVTDSRGRGFQFYKGGIYE----GPCGTNTDHAVTAVGYGK------- 168
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
Y L+ NS+ NWGE G RI R + CG+
Sbjct: 169 ----TYLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGV 202
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 93.3 bits (233), Expect = 5e-22
Identities = 62/228 (27%), Positives = 88/228 (38%), Gaps = 56/228 (24%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+R QGSCGS WA AV + I + GK V LS +LV C + + GC+GG+
Sbjct: 15 PVRHQGSCGSCWAFSAVATVEGINKIRT-GKL-VELSEQELVDC--ERRSHGCKGGYPPY 70
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
A +Y GI Y PY+ +C+ + P
Sbjct: 71 ALEYVAKNGIHLRSKY-------PYK--------AKQGTCRAKQVGGP-------IVKTS 108
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
+ N E + PV S+ + + LYK GI++ GP
Sbjct: 109 GVGR----------VQPNNEGNLLNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GP 152
Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ +G+G+ G Y L+ NS+ T WGE G RI
Sbjct: 153 CGTKVDHAVTAVGYGKS---GGK----GYILIKNSWGTAWGEKGYIRI 193
Score = 64.8 bits (159), Expect = 4e-12
Identities = 37/158 (23%), Positives = 56/158 (35%), Gaps = 48/158 (30%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PY+ + +C+A + P + N E +
Sbjct: 87 PYK--------AKQGTCRAKQVGGP-------IVKTSGVGR----------VQPNNEGNL 121
Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
PV S+ + + LYK GI++ GP G HA+ +G+G+ G
Sbjct: 122 LNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GPCGTKVDHAVTAVGYGKS---GGK 172
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
Y L+ NS+ T WGE G RI R CG+
Sbjct: 173 ----GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 206
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 93.3 bits (233), Expect = 5e-22
Identities = 61/229 (26%), Positives = 94/229 (41%), Gaps = 52/229 (22%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QGSCGS W A+ V IA+ GK + L+ LV C ++ N GCQGG +
Sbjct: 16 PVKNQGSCGSCWTFSTTGALESAVAIAT-GKM-LSLAEQQLVDCAQNFNNHGCQGGLPSQ 73
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A++Y GI+ TY PY+ G C+ +
Sbjct: 74 AFEYIRYNKGIMGEDTY-------PYK--------GQDDHCKFQPDKAI--------AFV 110
Query: 233 VSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGG 289
+ + N EE ++ + + PV S D ++Y+ GIY +
Sbjct: 111 KDVAN----------ITMNDEEAMVEAVALYNPV--SFAFEVTNDFLMYRKGIYSSTSCH 158
Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+ HA+ +G+G+E G YW+V NS+ WG NG F I
Sbjct: 159 KTPDKVNHAVLAVGYGEE---NGI----PYWIVKNSWGPQWGMNGYFLI 200
Score = 74.0 bits (183), Expect = 2e-15
Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 14/99 (14%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEP 447
+EE ++ + + PV S D ++Y+ GIY + + HA+ +G+G+E
Sbjct: 120 DEEAMVEAVALYNPV--SFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE- 176
Query: 448 LGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
G YW+V NS+ WG NG F I RG+N CG+
Sbjct: 177 --NGI----PYWIVKNSWGPQWGMNGYFLIERGKNMCGL 209
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 91.8 bits (229), Expect = 2e-21
Identities = 54/225 (24%), Positives = 87/225 (38%), Gaps = 47/225 (20%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
++DQG CGS WA A+ + + +A LS LVSC K +GC GG A
Sbjct: 15 AVKDQGQCGSCWAFSAIGNVECQWFLAG-HPL-TNLSEQMLVSCDKTD-SGCSGGLMNNA 71
Query: 175 WKYWVTT---GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
+++ V + + +Y PY G C +
Sbjct: 72 FEWIVQENNGAVYTEDSY-------PYA-----SGEGISPPCTTS--------GHTVGAT 111
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGP 290
+ + LP +E I + +GPV ++ + A + Y G+
Sbjct: 112 ITGHVE----------LPQDEAQIAAWLAVNGPV--AVAVDASSWMTYTGGVMTSCVSEQ 159
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L +H + ++G+ S+ V YW++ NS+ T WGE G RI
Sbjct: 160 L-DHGVLLVGYN-------DSAAVPYWIIKNSWTTQWGEEGYIRI 196
Score = 78.3 bits (194), Expect = 7e-17
Identities = 29/98 (29%), Positives = 49/98 (50%), Gaps = 11/98 (11%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
LP +E I + +GPV ++ + A + Y G+ L +H + ++G+
Sbjct: 118 LPQDEAQIAAWLAVNGPV--AVAVDASSWMTYTGGVMTSCVSEQL-DHGVLLVGYN---- 170
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
S+ V YW++ NS+ T WGE G RI +G N+C +
Sbjct: 171 ---DSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLV 205
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 91.4 bits (228), Expect = 2e-21
Identities = 60/228 (26%), Positives = 85/228 (37%), Gaps = 60/228 (26%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QGSCGS WA AV + + I + G + S +L+ C D + GC GG+
Sbjct: 15 PVKNQGSCGSCWAFSAVVTIEGIIKIRT-GNL-NQYSEQELLDC--DRRSYGCNGGYPWS 70
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
A + GI TY PYE G C+ E
Sbjct: 71 ALQLVAQYGIHYRNTY-------PYE--------GVQRYCRSREKGPYA-------AKTD 108
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
+ E + + PV S+ + A D LY+ GI+ GP
Sbjct: 109 GVRQ----------VQPYNEGALLYSIANQPV--SVVLEAAGKDFQLYRGGIFV----GP 152
Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ +G+G Y L+ NS+ T WGENG RI
Sbjct: 153 CGNKVDHAVAAVGYGP-----------NYILIKNSWGTGWGENGYIRI 189
Score = 63.2 bits (155), Expect = 1e-11
Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 27/109 (24%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRII 441
+ E + + PV S+ + A D LY+ GI+ GP G HA+ +
Sbjct: 111 RQVQPYNEGALLYSIANQPV--SVVLEAAGKDFQLYRGGIFV----GPCGNKVDHAVAAV 164
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
G+G Y L+ NS+ T WGENG RI RG CG+
Sbjct: 165 GYGP-----------NYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGL 202
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 93.9 bits (234), Expect = 3e-21
Identities = 74/294 (25%), Positives = 110/294 (37%), Gaps = 77/294 (26%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
N + L+ E E +G D+ + Q+ +++ + LPE D W P
Sbjct: 68 NEFADLSNDEFNEKYVGSLIDATIEQSYDEEF--INEDIVNLPENVD----WRKKGAVTP 121
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQ 167
+R QGSCGS WA AV + I + GK V LS +LV C + + GC+
Sbjct: 122 -------VRHQGSCGSCWAFSAVATVEGINKIRT-GKL-VELSEQELVDC--ERRSHGCK 170
Query: 168 GGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKC 227
GG+ A +Y GI Y PY+ +C+ + P
Sbjct: 171 GGYPPYALEYVAKNGIHLRSKY-------PYK--------AKQGTCRAKQVGGPI----- 210
Query: 228 QPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYK 284
+ N E + PV S+ + + LYK GI++
Sbjct: 211 --VKTSGVGR----------VQPNNEGNLLNAIAKQPV--SVVVESKGRPFQLYKGGIFE 256
Query: 285 HVAGGPLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
GP G A+ +G+G+ G Y L+ NS+ T WGE G RI
Sbjct: 257 ----GPCGTKVDGAVTAVGYGKS---GGK----GYILIKNSWGTAWGEKGYIRI 299
Score = 63.0 bits (154), Expect = 5e-11
Identities = 36/158 (22%), Positives = 55/158 (34%), Gaps = 48/158 (30%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PY+ + +C+A + P + N E +
Sbjct: 193 PYK--------AKQGTCRAKQVGGPI-------VKTSGVGR----------VQPNNEGNL 227
Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
PV S+ + + LYK GI++ GP G A+ +G+G+ G
Sbjct: 228 LNAIAKQPV--SVVVESKGRPFQLYKGGIFE----GPCGTKVDGAVTAVGYGKS---GGK 278
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
Y L+ NS+ T WGE G RI R CG+
Sbjct: 279 ----GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 312
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 91.0 bits (227), Expect = 3e-21
Identities = 59/229 (25%), Positives = 88/229 (38%), Gaps = 62/229 (27%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+R+QG CGS W +V A+ I + G+ + LS +L+ C + + GC+GGF
Sbjct: 15 PVRNQGGCGSCWTFSSVAAVEGINKIVT-GQL-LSLSEQELLDC--ERRSYGCRGGFPLY 70
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
A +Y +GI Y PYE G C+ ++ P
Sbjct: 71 ALQYVANSGIHLRQYY-------PYE--------GVQRQCRASQAKGP--------KVKT 107
Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
+P N E + + PV S+ + A Y+ GI+ G
Sbjct: 108 DGVGR----------VPRNNEQALIQRIAIQPV--SIVVEAKGRAFQNYRGGIFA----G 151
Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
P G HA+ +G+G Y L+ NS+ T WGE G RI
Sbjct: 152 PCGTSIDHAVAAVGYGN-----------DYILIKNSWGTGWGEGGYIRI 189
Score = 63.2 bits (155), Expect = 1e-11
Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 27/109 (24%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRII 441
+P N E + + PV S+ + A Y+ GI+ GP G HA+ +
Sbjct: 111 GRVPRNNEQALIQRIAIQPV--SIVVEAKGRAFQNYRGGIFA----GPCGTSIDHAVAAV 164
Query: 442 GWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
G+G Y L+ NS+ T WGE G RI RG Q CG+
Sbjct: 165 GYGN-----------DYILIKNSWGTGWGEGGYIRIKRGSGNPQGACGV 202
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 91.9 bits (229), Expect = 4e-21
Identities = 55/225 (24%), Positives = 91/225 (40%), Gaps = 44/225 (19%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
++DQG CGS WA V ++ I + G V LS +L+ C +GCQGG A
Sbjct: 18 GVKDQGKCGSCWAFSTVVSVEGINAIRT-GSL-VSLSEQELIDCDTADNDGCQGGLMDNA 75
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
++Y G+++ Y PY + +C + +
Sbjct: 76 FEYIKNNGGLITEAAY-------PYR--------AARGTCNVARAAQNSPV----VVHID 116
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
++D +PAN E + + PV S+ + A + Y G++ G
Sbjct: 117 GHQD----------VPANSEEDLARAVANQPV--SVAVEASGKAFMFYSEGVFTGECGTE 164
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L +H + ++G+G G+ YW V NS+ +WGE G R+
Sbjct: 165 L-DHGVAVVGYGVAEDGK------AYWTVKNSWGPSWGEQGYIRV 202
Score = 64.2 bits (157), Expect = 1e-11
Identities = 29/104 (27%), Positives = 47/104 (45%), Gaps = 16/104 (15%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+PAN E + + PV S+ + A + Y G++ G L +H + ++G+G
Sbjct: 121 VPANSEEDLARAVANQPV--SVAVEASGKAFMFYSEGVFTGECGTEL-DHGVAVVGYGVA 177
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
G+ YW V NS+ +WGE G R+ + CGI
Sbjct: 178 EDGK------AYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGI 215
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 91.1 bits (227), Expect = 5e-21
Identities = 61/229 (26%), Positives = 91/229 (39%), Gaps = 55/229 (24%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
++DQG CGS WA + A+ I + K V LS +LV C D GC GG A
Sbjct: 16 SVKDQGQCGSCWAFSTIVAVEGINQIKT-NKL-VSLSEQELVDCDTDQNQGCNGGLMDYA 73
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+++ GI + Y PYE +C + ++ P +
Sbjct: 74 FEFIKQRGGITTEANY-------PYE--------AYDGTCDVS--------KENAPAVSI 110
Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
+E+ +P N+E + + + PV S+ I A D Y G++ G
Sbjct: 111 DGHEN----------VPENDENALLKAVANQPV--SVAIDAGGSDFQFYSEGVFT----G 154
Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G H + I+G+G G KYW V NS+ WGE G R+
Sbjct: 155 SCGTELDHGVAIVGYGTTIDGT------KYWTVKNSWGPEWGEKGYIRM 197
Score = 64.1 bits (157), Expect = 8e-12
Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 22/107 (20%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGW 443
+P N+E + + + PV S+ I A D Y G++ G G H + I+G+
Sbjct: 116 VPENDENALLKAVANQPV--SVAIDAGGSDFQFYSEGVFT----GSCGTELDHGVAIVGY 169
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
G G KYW V NS+ WGE G R+ RG + CGI
Sbjct: 170 GTTIDGT------KYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGI 210
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 90.7 bits (226), Expect = 5e-21
Identities = 55/228 (24%), Positives = 93/228 (40%), Gaps = 56/228 (24%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QG+CGS WA + + I + G + LS +LV C D + GC+GG+
Sbjct: 15 PVKNQGACGSCWAFSTIATVEGINKIVT-GNL-LELSEQELVDC--DKHSYGCKGGYQTT 70
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+ +Y G+ + Y PY+ C+ + P+
Sbjct: 71 SLQYVANNGVHTSKVY-------PYQ--------AKQYKCRATDKPGPK-------VKIT 108
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
Y+ +P+N ET + P+ S+ + A LYK+G++ GP
Sbjct: 109 GYKR----------VPSNCETSFLGALANQPL--SVLVEAGGKPFQLYKSGVFD----GP 152
Query: 291 LGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ +G+G +G Y ++ NS+ NWGE G R+
Sbjct: 153 CGTKLDHAVTAVGYGTS---DGK----NYIIIKNSWGPNWGEKGYMRL 193
Score = 61.8 bits (151), Expect = 4e-11
Identities = 38/158 (24%), Positives = 62/158 (39%), Gaps = 48/158 (30%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PY+ + C+A + P+ Y+ +P+N ET
Sbjct: 87 PYQ--------AKQYKCRATDKPGPK-------VKITGYKR----------VPSNCETSF 121
Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
+ P+ S+ + A LYK+G++ GP G HA+ +G+G +G
Sbjct: 122 LGALANQPL--SVLVEAGGKPFQLYKSGVFD----GPCGTKLDHAVTAVGYGTS---DGK 172
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG----QNECGI 486
Y ++ NS+ NWGE G R+ R Q CG+
Sbjct: 173 ----NYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGV 206
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 89.8 bits (224), Expect = 8e-21
Identities = 54/230 (23%), Positives = 88/230 (38%), Gaps = 58/230 (25%)
Query: 115 EIRDQGSCGSGWA---LGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGF 170
+++DQG CGS WA G VE + LS +L+ C D + C GG
Sbjct: 15 KVKDQGMCGSCWAFSVTGNVEGQ-----WFLNQGTLLSLSEQELLDC--DKMDKACMGGL 67
Query: 171 HGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQP 229
A+ G+ + Y Y+ G SCQ + +
Sbjct: 68 PSNAYSAIKNLGGLETEDDY-------SYQ--------GHMQSCQFS--------AEKAK 104
Query: 230 GYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG 288
Y + L NE+ + + + GP+ S+ I A M Y+ GI + +
Sbjct: 105 VYIQDSVE----------LSQNEQKLAAWLAKRGPI--SVAINAFGMQFYRHGISRPLRP 152
Query: 289 --GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
P +HA+ ++G+GQ +W + NS+ T+WGE G + +
Sbjct: 153 LCSPWLIDHAVLLVGYGQR---SDV----PFWAIKNSWGTDWGEKGYYYL 195
Score = 77.1 bits (191), Expect = 2e-16
Identities = 30/103 (29%), Positives = 51/103 (49%), Gaps = 13/103 (12%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAG--GPLG-EHAIRIIGW 443
L NE+ + + + GP+ S+ I A M Y+ GI + + P +HA+ ++G+
Sbjct: 111 VELSQNEQKLAAWLAKRGPI--SVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGY 168
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
GQ +W + NS+ T+WGE G + + RG CG+
Sbjct: 169 GQR---SDV----PFWAIKNSWGTDWGEKGYYYLHRGSGACGV 204
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 89.9 bits (224), Expect = 9e-21
Identities = 63/229 (27%), Positives = 93/229 (40%), Gaps = 55/229 (24%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+I+DQG CGS WA + A+ IA+ G + LS +LV C + GC GGF
Sbjct: 15 DIKDQGQCGSAWAFSTIAAVEGINKIAT-GDL-ISLSEQELVDCGRTQNTRGCDGGFMTD 72
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
+++ + GI + Y PY C +
Sbjct: 73 GFQFIINNGGINTEANY-------PYT--------AEEGQCNLDLQQEKY-------VSI 110
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGG 289
+YE+ +P N E ++ + PV S+ + A + Y +GI+ G
Sbjct: 111 DTYEN----------VPYNNEWALQTAVAYQPV--SVALEAAGYNFQHYSSGIFT----G 154
Query: 290 PLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
P G HA+ I+G+G E G YW+V NS+ T WGE G RI
Sbjct: 155 PCGTAVDHAVTIVGYGTE---GGI----DYWIVKNSWGTTWGEEGYMRI 196
Score = 62.9 bits (154), Expect = 2e-11
Identities = 41/157 (26%), Positives = 59/157 (37%), Gaps = 47/157 (29%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PY C + +YE+ +P N E +
Sbjct: 90 PYT--------AEEGQCNLDLQQEKY-------VSIDTYEN----------VPYNNEWAL 124
Query: 399 REIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGWGQEPLGEGT 452
+ + PV S+ + A + Y +GI+ GP G HA+ I+G+G E G
Sbjct: 125 QTAVAYQPV--SVALEAAGYNFQHYSSGIFT----GPCGTAVDHAVTIVGYGTE---GGI 175
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRG---QNECGI 486
YW+V NS+ T WGE G RI R +CGI
Sbjct: 176 ----DYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGI 208
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 90.4 bits (225), Expect = 1e-20
Identities = 62/228 (27%), Positives = 95/228 (41%), Gaps = 48/228 (21%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
+++ QG CGSGWA A A+ IA+ G V LS +L+ C + GC G+H ++
Sbjct: 16 KVKFQGQCGSGWAFSATGAIEAAHAIAT-GNL-VSLSEQELIDCVDES-EGCYNGWHYQS 72
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+++ V GI S Y PY+ C+ NE I Y V
Sbjct: 73 FEWVVKHGGIASEADY-------PYK--------ARDGKCKANEIQDKVTID----NYGV 113
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLG 292
+ + + E+ ++ P+ S++I A D Y GIY GG
Sbjct: 114 QILSNES-------TESEAESSLQSFVLEQPI--SVSIDAKDFHFYSGGIYD---GGNCS 161
Query: 293 E-----HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H + I+G+G E +G YW+ NS+ +WG +G RI
Sbjct: 162 SPYGINHFVLIVGYGSE---DGV----DYWIAKNSWGEDWGIDGYIRI 202
Score = 68.4 bits (168), Expect = 3e-13
Identities = 39/158 (24%), Positives = 59/158 (37%), Gaps = 41/158 (25%)
Query: 339 PYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIM 398
PY+ C+ANE I Y V + + + E+ +
Sbjct: 89 PYK--------ARDGKCKANEIQDKVTID----NYGVQILSNES-------TESEAESSL 129
Query: 399 REIFRHGPVEGSMTIYA-DMILYKTGIYKHVAGGPLGE-----HAIRIIGWGQEPLGEGT 452
+ P+ S++I A D Y GIY GG H + I+G+G E +G
Sbjct: 130 QSFVLEQPI--SVSIDAKDFHFYSGGIYD---GGNCSSPYGINHFVLIVGYGSE---DGV 181
Query: 453 SSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
YW+ NS+ +WG +G RI R CG+
Sbjct: 182 ----DYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGM 215
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 91.5 bits (228), Expect = 2e-20
Identities = 70/295 (23%), Positives = 107/295 (36%), Gaps = 70/295 (23%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQ---LSDPLEELPEGFDARINW----- 107
N + +T E G+ + L +N +P+ + + P FD W
Sbjct: 72 NLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFD----WRDQGM 127
Query: 108 --PYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNG 165
P +++QGSCGS WA + A+ ++ IA+ +S LV C + G
Sbjct: 128 VSP-------VKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNA-LG 179
Query: 166 CQGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECI 224
C GG+ A+ Y GI S G Y PYE + +C +
Sbjct: 180 CSGGWMNDAFTYVAQNGGIDSEGAY-------PYE--------MADGNCHYDPNQVA--- 221
Query: 225 RKCQPGYDVSYEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA--DMILYKTG 281
Y L +E + + GPV ++ A Y G
Sbjct: 222 -----ARLSGYVY----------LSGPDENMLADMVATKGPV--AVAFDADDPFGSYSGG 264
Query: 282 IYKHVAGGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y + HA+ I+G+G E G YWLV NS+ WG +G F+I
Sbjct: 265 VYYNPTCETNKFTHAVLIVGYGNE---NGQ----DYWLVKNSWGDGWGLDGYFKI 312
Score = 69.2 bits (170), Expect = 5e-13
Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 13/98 (13%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPLG 449
+E + + GPV ++ A Y G+Y + HA+ I+G+G E
Sbjct: 234 DENMLADMVATKGPV--AVAFDADDPFGSYSGGVYYNPTCETNKFTHAVLIVGYGNE--- 288
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
G YWLV NS+ WG +G F+I R N CGI
Sbjct: 289 NGQ----DYWLVKNSWGDGWGLDGYFKIARNANNHCGI 322
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 88.3 bits (220), Expect = 4e-20
Identities = 48/228 (21%), Positives = 71/228 (31%), Gaps = 54/228 (23%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
IR QG CGS WA V A + + L+ +LV C +GC G +
Sbjct: 24 PIRMQGGCGSAWAFSGVAATESAYLAYR-QQS-LDLAEQELVDCASQ--HGCHGDTIPRG 79
Query: 175 WKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
+Y G+V Y Y SC+ +
Sbjct: 80 IEYIQHNGVVQESYY-------RYV--------AREQSCRRPNAQR---------FGISN 115
Query: 235 YEDDLNFGRIAYSLPANEETIMRE--IFRHGPVEGSMTIYA----DMILYKTGIYKHVAG 288
Y + +RE H + ++ I Y
Sbjct: 116 YCQ----------IYPPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDN 163
Query: 289 GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ I+G+ + V YW+V NS++TNWG+NG
Sbjct: 164 GYQPNYHAVNIVGYS-------NAQGVDYWIVRNSWDTNWGDNGYGYF 204
Score = 64.5 bits (158), Expect = 5e-12
Identities = 24/102 (23%), Positives = 38/102 (37%), Gaps = 15/102 (14%)
Query: 391 PANEETIMREIFR-HGPVEGSMTIYA----DMILYKTGIYKHVAGGPLG-EHAIRIIGWG 444
P N I + + H + ++ I Y G HA+ I+G+
Sbjct: 121 PPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 178
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
+ V YW+V NS++TNWG+NG + I
Sbjct: 179 -------NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMI 213
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 86.9 bits (216), Expect = 7e-20
Identities = 54/224 (24%), Positives = 80/224 (35%), Gaps = 54/224 (24%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QGSCGS WA V + I + G + LS +LV C D N GC GG
Sbjct: 15 PVKNQGSCGSCWAFSTVSTVESINQIRT-GNL-ISLSEQELVDC--DKKNHGCLGGAFVF 70
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A++Y + GI + Y PY+ CQ
Sbjct: 71 AYQYIINNGGIDTQANY-------PYK--------AVQGPCQAASKVV----------SI 105
Query: 233 VSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI-YADMILYKTGIYKHVAGGPL 291
Y +P E +++ P ++ A Y +GI+ G L
Sbjct: 106 DGYNG----------VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKL 155
Query: 292 GEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H + I+G+ YW+V NS+ WGE G R+
Sbjct: 156 -NHGVTIVGYQA-----------NYWIVRNSWGRYWGEKGYIRM 187
Score = 66.5 bits (163), Expect = 9e-13
Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 15/100 (15%)
Query: 390 LPANEETIMREIFRHGPVEGSMTI-YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPL 448
+P E +++ P ++ A Y +GI+ G L H + I+G+
Sbjct: 111 VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKL-NHGVTIVGYQA--- 166
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVR--GQNECGI 486
YW+V NS+ WGE G R++R G CGI
Sbjct: 167 --------NYWIVRNSWGRYWGEKGYIRMLRVGGCGLCGI 198
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 89.2 bits (222), Expect = 9e-20
Identities = 60/289 (20%), Positives = 86/289 (29%), Gaps = 61/289 (21%)
Query: 57 NALSKLTLSEL-EMRMGVHPDSKL--PQNRLPLLVQLSDPLEELPEGFDARINWPYCPTI 113
N LS L+L E + + Q L P D R T
Sbjct: 47 NHLSDLSLDEFKNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMR--TVT- 103
Query: 114 QEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGK 173
IR QG CGS WA V A + + L+ +LV C +GC G +
Sbjct: 104 -PIRMQGGCGSAWAFSGVAATESAYLAYR-DQ-SLDLAEQELVDCASQ--HGCHGDTIPR 158
Query: 174 AWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+Y G+V Y Y SC+
Sbjct: 159 GIEYIQHNGVVQESYY-------RYV--------AREQSCRRPNAQR---------FGIS 194
Query: 234 SYEDDLNFGRIAYSLPANEETIMRE--IFRHGPVEGSMTIYA----DMILYKTGIYKHVA 287
+Y + +RE H + ++ I Y
Sbjct: 195 NYCQ----------IYPPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRD 242
Query: 288 GGPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ I+G+ + V YW+V NS++TNWG+NG
Sbjct: 243 NGYQPNYHAVNIVGYS-------NAQGVDYWIVRNSWDTNWGDNGYGYF 284
Score = 65.7 bits (161), Expect = 5e-12
Identities = 24/102 (23%), Positives = 38/102 (37%), Gaps = 15/102 (14%)
Query: 391 PANEETIMREIFR-HGPVEGSMTIYA----DMILYKTGIYKHVAGGPLG-EHAIRIIGWG 444
P N I + + H + ++ I Y G HA+ I+G+
Sbjct: 201 PPNANKIREALAQTHSAI--AVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 258
Query: 445 QEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNECGI 486
+ V YW+V NS++TNWG+NG + I
Sbjct: 259 -------NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMI 293
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 88.8 bits (221), Expect = 1e-19
Identities = 68/286 (23%), Positives = 112/286 (39%), Gaps = 58/286 (20%)
Query: 57 NALSKLTLSEL-EMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINWPYCPTIQE 115
N L +T E+ + G+ ++ L + + P+ D + +
Sbjct: 61 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYI--PEWEGRAPDSVD----YRKKGYVTP 114
Query: 116 IRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKAW 175
+++QG CGS WA +V A+ ++ + GK + LS +LV C + +GC GG+ A+
Sbjct: 115 VKNQGQCGSCWAFSSVGALEGQLKKKT-GKL-LNLSPQNLVDCVSEN-DGCGGGYMTNAF 171
Query: 176 KYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVS 234
+Y GI S Y PY G SC N
Sbjct: 172 QYVQKNRGIDSEDAY-------PYV--------GQEESCMYNPTGKA--------AKCRG 208
Query: 235 YEDDLNFGRIAYSLPA-NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
Y + +P NE+ + R + R GPV S+ I A Y G+Y +
Sbjct: 209 YRE----------IPEGNEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNS 256
Query: 291 LG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+ +G+G + +G K+W++ NS+ NWG G +
Sbjct: 257 DNLNHAVLAVGYGIQ---KGN----KHWIIKNSWGENWGNKGYILM 295
Score = 63.4 bits (155), Expect = 3e-11
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 14/99 (14%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPL 448
NE+ + R + R GPV S+ I A Y G+Y + HA+ +G+G +
Sbjct: 216 NEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-- 271
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
+G K+W++ NS+ NWG G + R + N CGI
Sbjct: 272 -KGN----KHWIIKNSWGENWGNKGYILMARNKNNACGI 305
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 86.0 bits (214), Expect = 2e-19
Identities = 64/230 (27%), Positives = 96/230 (41%), Gaps = 60/230 (26%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
+++QG CGS WA V A+ I + G + LS LV C N GC+GG+
Sbjct: 17 PVKNQGGCGSCWAFSTVAAVEGINQIVT-GDL-ISLSEQQLVDC--TTANHGCRGGWMNP 72
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG-Y 231
A+++ V GI S TY PY G C P
Sbjct: 73 AFQFIVNNGGINSEETY-------PYR--------GQDGICNST---------VNAPVVS 108
Query: 232 DVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
SYE+ +P++ E +++ + PV S+T+ A D LY++GI+
Sbjct: 109 IDSYEN----------VPSHNEQSLQKAVANQPV--SVTMDAAGRDFQLYRSGIFT---- 152
Query: 289 GPLGE---HAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
G HA+ ++G+G E +W+V NS+ NWGE+G R
Sbjct: 153 GSCNISANHALTVVGYGTE---NDK----DFWIVKNSWGKNWGESGYIRA 195
Score = 62.2 bits (152), Expect = 3e-11
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 23/107 (21%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGE---HAIRIIGW 443
+P++ E +++ + PV S+T+ A D LY++GI+ G HA+ ++G+
Sbjct: 115 VPSHNEQSLQKAVANQPV--SVTMDAAGRDFQLYRSGIFT----GSCNISANHALTVVGY 168
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
G E +W+V NS+ NWGE+G R R CGI
Sbjct: 169 GTE---NDK----DFWIVKNSWGKNWGESGYIRAERNIENPDGKCGI 208
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 84.9 bits (211), Expect = 5e-19
Identities = 56/225 (24%), Positives = 87/225 (38%), Gaps = 51/225 (22%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
I++Q CGS WA AV A+ I + G+ + LS +LV C +GC GG+ A
Sbjct: 15 SIKNQKQCGSCWAFSAVAAVESINKIRT-GQL-ISLSEQELVDCDTAS-HGCNGGWMNNA 71
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
++Y +T GI + Y PY SC+
Sbjct: 72 FQYIITNGGIDTQQNY-------PYS--------AVQGSCKPYRLRV---------VSIN 107
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGP 290
++ + N E+ ++ PV S+T+ A Y +GI+ G
Sbjct: 108 GFQR----------VTRNNESALQSAVASQPV--SVTVEAAGAPFQHYSSGIFTGPCGTA 155
Query: 291 LGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H + I+G+G + G YW+V NS+ NWG G +
Sbjct: 156 Q-NHGVVIVGYGTQ---SGK----NYWIVRNSWGQNWGNQGYIWM 192
Score = 61.0 bits (149), Expect = 6e-11
Identities = 29/104 (27%), Positives = 44/104 (42%), Gaps = 17/104 (16%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+ N E+ ++ PV S+T+ A Y +GI+ G H + I+G+G +
Sbjct: 112 VTRNNESALQSAVASQPV--SVTVEAAGAPFQHYSSGIFTGPCGTAQ-NHGVVIVGYGTQ 168
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQN----ECGI 486
G YW+V NS+ NWG G + R CGI
Sbjct: 169 ---SGK----NYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGI 205
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 84.9 bits (211), Expect = 5e-19
Identities = 55/227 (24%), Positives = 88/227 (38%), Gaps = 50/227 (22%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
++DQ CGS WA A+ C + GK V LS +L+ C + GN C GG
Sbjct: 21 PVKDQRDCGSCWAFSTTGALEGAHCAKT-GKL-VSLSEQELMDCSRAEGNQSCSGGEMND 78
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A++Y + + GI S Y PY C+ C+
Sbjct: 79 AFQYVLDSGGICSEDAY-------PYL--------ARDEECRAQ---------SCEKVVK 114
Query: 233 VS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
+ ++D +P E M+ PV S+ I A Y G++ G
Sbjct: 115 ILGFKD----------VPRRSEAAMKAALAKSPV--SIAIEADQMPFQFYHEGVFDASCG 162
Query: 289 GPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L +H + ++G+G + S +W++ NS+ T WG +G +
Sbjct: 163 TDL-DHGVLLVGYGTDK-----ESKKDFWIMKNSWGTGWGRDGYMYM 203
Score = 60.3 bits (147), Expect = 1e-10
Identities = 25/103 (24%), Positives = 45/103 (43%), Gaps = 14/103 (13%)
Query: 390 LPANEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
+P E M+ PV S+ I A Y G++ G L +H + ++G+G +
Sbjct: 121 VPRRSEAAMKAALAKSPV--SIAIEADQMPFQFYHEGVFDASCGTDL-DHGVLLVGYGTD 177
Query: 447 PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRG---QNECGI 486
S +W++ NS+ T WG +G + + +CG+
Sbjct: 178 K-----ESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 215
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 84.5 bits (210), Expect = 7e-19
Identities = 59/228 (25%), Positives = 92/228 (40%), Gaps = 53/228 (23%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
+++QG CGS WA +V A+ ++ + GK + LS +LV C + +GC GG+ A
Sbjct: 15 PVKNQGQCGSCWAFSSVGALEGQLKKKT-GKL-LNLSPQNLVDCVSEN-DGCGGGYMTNA 71
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPG-YD 232
++Y GI S Y PY G SC N
Sbjct: 72 FQYVQKNRGIDSEDAY-------PYV--------GQEESCMYN---------PTGKAAKC 107
Query: 233 VSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAG 288
Y + +P E+ + R + R GPV S+ I A Y G+Y +
Sbjct: 108 RGYRE----------IPEGNEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESC 155
Query: 289 GPLG-EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
HA+ +G+G + +G K+W++ NS+ NWG G +
Sbjct: 156 NSDNLNHAVLAVGYGIQ---KGN----KHWIIKNSWGENWGNKGYILM 196
Score = 59.4 bits (145), Expect = 2e-10
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 14/99 (14%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLG-EHAIRIIGWGQEPL 448
NE+ + R + R GPV S+ I A Y G+Y + HA+ +G+G +
Sbjct: 117 NEKALKRAVARVGPV--SVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-- 172
Query: 449 GEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
+G K+W++ NS+ NWG G + R + N CGI
Sbjct: 173 -KGN----KHWIIKNSWGENWGNKGYILMARNKNNACGI 206
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 83.4 bits (207), Expect = 3e-18
Identities = 54/229 (23%), Positives = 90/229 (39%), Gaps = 50/229 (21%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGN-GCQGGFHGK 173
++DQ +CGS WA ++ ++ + I K + LS +LV C N GC GG
Sbjct: 32 PVKDQKNCGSCWAFSSIGSVESQYAIRK-NK-LITLSEQELVDC--SFKNYGCNGGLINN 87
Query: 174 AWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYD 232
A++ + GI G Y PY + + + C + +C Y
Sbjct: 88 AFEDMIELGGICPDGDY-------PYV-------SDAPNLCNID---------RCTEKYG 124
Query: 233 VS-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGG 289
+ Y +P N+ + GP+ S+++ D YK GI+ G
Sbjct: 125 IKNYLS----------VPDNKL--KEALRFLGPI--SISVAVSDDFAFYKEGIFDGECGD 170
Query: 290 PLGEHAIRIIGWGQE---PLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
L HA+ ++G+G + Y+++ NS+ WGE G I
Sbjct: 171 QL-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINI 218
Score = 64.9 bits (159), Expect = 5e-12
Identities = 27/108 (25%), Positives = 47/108 (43%), Gaps = 14/108 (12%)
Query: 388 YSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
S+P N+ + GP+ S+++ D YK GI+ G L HA+ ++G+G
Sbjct: 129 LSVPDNKL--KEALRFLGPI--SISVAVSDDFAFYKEGIFDGECGDQL-NHAVMLVGFGM 183
Query: 446 E---PLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
+ Y+++ NS+ WGE G I ++ CG+
Sbjct: 184 KEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGL 231
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 83.0 bits (206), Expect = 3e-18
Identities = 50/228 (21%), Positives = 87/228 (38%), Gaps = 48/228 (21%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
++DQ CGS WA +V ++ + I S +LV C NGC GG+ A
Sbjct: 34 PVKDQALCGSCWAFSSVGSVESQYAIRK-KA-LFLFSEQELVDCSVKN-NGCYGGYITNA 90
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+ + G+ S Y PY + +C +C Y +
Sbjct: 91 FDDMIDLGGLCSQDDY-------PYV-------SNLPETCNLK---------RCNERYTI 127
Query: 234 S-YEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGP 290
Y +P ++ + GP+ S++I A D Y+ G Y G
Sbjct: 128 KSYVS----------IPDDKF--KEALRYLGPI--SISIAASDDFAFYRGGFYDGECGAA 173
Query: 291 LGEHAIRIIGWGQEPLGEGTSSV---VKYWLVANSFNTNWGENGLFRI 335
HA+ ++G+G + + + Y+++ NS+ ++WGE G +
Sbjct: 174 P-NHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 220
Score = 64.5 bits (158), Expect = 7e-12
Identities = 26/107 (24%), Positives = 49/107 (45%), Gaps = 14/107 (13%)
Query: 389 SLPANEETIMREIFRHGPVEGSMTIYA--DMILYKTGIYKHVAGGPLGEHAIRIIGWGQE 446
S+P ++ + GP+ S++I A D Y+ G Y G HA+ ++G+G +
Sbjct: 132 SIPDDKF--KEALRYLGPI--SISIAASDDFAFYRGGFYDGECGAAP-NHAVILVGYGMK 186
Query: 447 PLGEGTSSV---VKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
+ + Y+++ NS+ ++WGE G + +N C I
Sbjct: 187 DIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSI 233
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 83.8 bits (208), Expect = 5e-18
Identities = 72/294 (24%), Positives = 115/294 (39%), Gaps = 74/294 (25%)
Query: 57 NALSKLTLSE-LEMRMGVHPDSKLPQNRLPLLVQLSDPLEELPEGFDARINW-------P 108
N L +T E + + ++P + S+P LP+ D W
Sbjct: 62 NHLGDMTSEEVMSLMSS----LRVPSQWQRNITYKSNPNRILPDSVD----WREKGCVTE 113
Query: 109 YCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCG--NGC 166
++ QGSCG+ WA AV A+ ++ + + GK V LS+ +LV C + GC
Sbjct: 114 -------VKYQGSCGAAWAFSAVGALEAQLKLKT-GKL-VSLSAQNLVDCSTEKYGNKGC 164
Query: 167 QGGFHGKAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIR 225
GGF A++Y + GI S +Y PY+ CQ +
Sbjct: 165 NGGFMTTAFQYIIDNKGIDSDASY-------PYK--------AMDQKCQYDSKYRA---- 205
Query: 226 KCQPGYDVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTG 281
Y + LP E+ + + GPV S+ + A LY++G
Sbjct: 206 ----ATCSKYTE----------LPYGREDVLKEAVANKGPV--SVGVDARHPSFFLYRSG 249
Query: 282 IYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+Y + H + ++G+G G +YWLV NS+ N+GE G R+
Sbjct: 250 VYYEPSCTQNVNHGVLVVGYGDL---NGK----EYWLVKNSWGHNFGEEGYIRM 296
Score = 62.3 bits (152), Expect = 7e-11
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 13/98 (13%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
E+ + + GPV S+ + A LY++G+Y + H + ++G+G
Sbjct: 218 REDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL--- 272
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
G +YWLV NS+ N+GE G R+ R + N CGI
Sbjct: 273 NGK----EYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 306
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 80.6 bits (200), Expect = 1e-17
Identities = 65/228 (28%), Positives = 101/228 (44%), Gaps = 51/228 (22%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDC-GN-GCQGGFHG 172
E++ QGSCG+ WA AV A+ ++ + + GK V LS+ +LV C + GN GC GGF
Sbjct: 16 EVKYQGSCGACWAFSAVGALEAQLKLKT-GKL-VSLSAQNLVDCSTEKYGNKGCNGGFMT 73
Query: 173 KAWKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
A++Y + GI S +Y PY+ CQ + K +
Sbjct: 74 TAFQYIIDNKGIDSDASY-------PYK--------AMDQKCQYD--------SKYRAAT 110
Query: 232 DVSYEDDLNFGRIAYSLPAN-EETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVA 287
Y + LP E+ + + GPV S+ + A LY++G+Y +
Sbjct: 111 CSKYTE----------LPYGREDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPS 158
Query: 288 GGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
H + ++G+G G +YWLV NS+ N+GE G R+
Sbjct: 159 CTQNVNHGVLVVGYGDL---NGK----EYWLVKNSWGHNFGEEGYIRM 199
Score = 60.2 bits (147), Expect = 1e-10
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 13/98 (13%)
Query: 393 NEETIMREIFRHGPVEGSMTIYA---DMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLG 449
E+ + + GPV S+ + A LY++G+Y + H + ++G+G
Sbjct: 121 REDVLKEAVANKGPV--SVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL--- 175
Query: 450 EGTSSVVKYWLVANSFNTNWGENGLFRIVRGQ-NECGI 486
G +YWLV NS+ N+GE G R+ R + N CGI
Sbjct: 176 NGK----EYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 209
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 68.7 bits (168), Expect = 2e-13
Identities = 51/227 (22%), Positives = 83/227 (36%), Gaps = 50/227 (22%)
Query: 115 EIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCKDCGNGCQGGFHGKA 174
++DQG+CG WA GA A+ I + G+ + +S +V C GG A
Sbjct: 15 SVKDQGACGMCWAFGATGAIEGIDAITT-GRL-ISVSEQQIVDCDTXXXXXX-GGDADDA 71
Query: 175 WKYWVTT-GIVSGGTYASKQGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDV 233
+++ +T GI S Y PY G +C N+P
Sbjct: 72 FRWVITNGGIASDANY-------PYT--------GVDGTCDLNKPIAARI---------D 107
Query: 234 SYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI-YADMILYKT-GIYKHVAGGPL 291
Y + +P + ++ + PV ++ LY GI+ +
Sbjct: 108 GYTN----------VPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDD 156
Query: 292 G---EHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 335
+H + I+G+G YW+V NS+ T WG +G I
Sbjct: 157 PATVDHTVLIVGYGSNGTNA------DYWIVKNSWGTEWGIDGYILI 197
Score = 58.3 bits (141), Expect = 7e-10
Identities = 25/107 (23%), Positives = 42/107 (39%), Gaps = 16/107 (14%)
Query: 389 SLPANEETIMREIFRHGPVEGSMTI-YADMILYKT-GIYKHVAGGPLG---EHAIRIIGW 443
++P + ++ + PV ++ LY GI+ + +H + I+G+
Sbjct: 111 NVPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGY 169
Query: 444 GQEPLGEGTSSVVKYWLVANSFNTNWGENGLFRIVRGQNE----CGI 486
G YW+V NS+ T WG +G I R N C I
Sbjct: 170 GSNGTNA------DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAI 210
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 42.7 bits (100), Expect = 3e-04
Identities = 54/385 (14%), Positives = 107/385 (27%), Gaps = 143/385 (37%)
Query: 209 SHSSCQDNEPNTPE---------CIRKCQPGYDVSYEDDLN-----FGR----------I 244
+ D+EP TP +P ++ LN F +
Sbjct: 45 TEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHAL 104
Query: 245 AYSLPANEETIM---REIFRHGPVEGSMTIY--ADMILYKTGIYKHVAGGPLGEHA---- 295
A L +T + +E+ ++ Y A ++ + + + L
Sbjct: 105 AAKLLQENDTTLVKTKELIKN---------YITARIMAKRP--FDKKSNSALFRAVGEGN 153
Query: 296 IRII---GWGQEPLGEGTSSVVKYW--LVANSFNTNWGENGLFRIGCRPYEIPCERYMNG 350
+++ G GQ G + Y+ L L++ Y + +
Sbjct: 154 AQLVAIFG-GQ-----GNTDD--YFEELRD-----------LYQT----YHVLVGDLIKF 190
Query: 351 SRSSCQANEPNTPECIRKCQPGYDV--------SYEDDLNFGRIAYSLP-------ANEE 395
S + T + + G ++ + D I S P A+
Sbjct: 191 SAETLSELIRTTLDAEKVFTQGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYV 250
Query: 396 TIMREI-FRHGPV----EGSMTIYADMILYKTGIYKHVAGGPLGEH-------AIRI--- 440
+ + F G + +G+ ++ T + +A E AI +
Sbjct: 251 VTAKLLGFTPGELRSYLKGATGHSQGLV---TAVA--IAETDSWESFFVSVRKAITVLFF 305
Query: 441 IGW-GQE--PLGEGTSSVVKYWLVANSFNTNWGENG-------LFRIVRGQNECGIEADI 490
IG E P S+++ L EN L + + ++ +
Sbjct: 306 IGVRCYEAYPNTSLPPSILEDSL----------ENNEGVPSPML--SISNLTQEQVQDYV 353
Query: 491 T---AGLPK-----IGLEIDSNEIN 507
+ LP I L +N
Sbjct: 354 NKTNSHLPAGKQVEISL------VN 372
Score = 36.6 bits (84), Expect = 0.026
Identities = 39/265 (14%), Positives = 68/265 (25%), Gaps = 108/265 (40%)
Query: 351 SRSSCQANEPNTPE---------CIRKCQPGYDVSYEDDLN-----FGR----------I 386
+ +EP TP +P ++ LN F +
Sbjct: 45 TEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHAL 104
Query: 387 AYSLPANEETIM---REIFR---------HGPV-------------EGSMTIYADMI--- 418
A L +T + +E+ + P EG+ + A I
Sbjct: 105 AAKLLQENDTTLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEGNAQLVA--IFGG 162
Query: 419 -------------LYKTGIYK-------HVAGGPLGE-------------HAIRIIGWGQ 445
LY+T Y + L E + I+ W +
Sbjct: 163 QGNTDDYFEELRDLYQT--YHVLVGDLIKFSAETLSELIRTTLDAEKVFTQGLNILEWLE 220
Query: 446 EP--------LGEGTSSV--------VKYWLVANSFNTNWGENGLFRIVRGQNECGIEAD 489
P L S Y + A GE L ++G +
Sbjct: 221 NPSNTPDKDYLLSIPISCPLIGVIQLAHYVVTAKLLGFTPGE--LRSYLKGATGHS-QGL 277
Query: 490 ITAGLPKIGLEIDSNEINLGKMMTL 514
+TA +S +++ K +T+
Sbjct: 278 VTAVAIAETDSWESFFVSVRKAITV 302
Score = 30.4 bits (68), Expect = 1.7
Identities = 19/99 (19%), Positives = 30/99 (30%), Gaps = 26/99 (26%)
Query: 203 ERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSY-----EDDLNFGRIAYSLPANEETIMR 257
E Y + D + T E I K + SY + L+ ++ PA
Sbjct: 1686 ENYSAMIFETIVDGKLKT-EKIFKEINEHSTSYTFRSEKGLLS--ATQFTQPA------- 1735
Query: 258 EIFRHGPVEGSMTIYADMILYKTGIY---KHVAGGPLGE 293
+ + D+ G+ AG LGE
Sbjct: 1736 -LTLM-----EKAAFEDLK--SKGLIPADATFAGHSLGE 1766
Score = 30.4 bits (68), Expect = 1.9
Identities = 11/83 (13%), Positives = 22/83 (26%), Gaps = 32/83 (38%)
Query: 382 NFGRIAYSLPANEETIMREIFR-----------HGPVEG---------------SMTIYA 415
N+ + + + + +IF+ +G +
Sbjct: 1687 NYSAMIFETIVDGKLKTEKIFKEINEHSTSYTFRSE-KGLLSATQFTQPALTLMEKAAFE 1745
Query: 416 DMILYKTGIY---KHVAGGPLGE 435
D+ G+ AG LGE
Sbjct: 1746 DLK--SKGLIPADATFAGHSLGE 1766
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 42.1 bits (98), Expect = 5e-04
Identities = 63/489 (12%), Positives = 115/489 (23%), Gaps = 174/489 (35%)
Query: 1 MGKSTADAVATFLKDLDLSQSSRNHSNGVF------CD-----------LSKAFD----- 38
GK+ + +F C+ L D
Sbjct: 161 SGKTWV--ALDVCLSYKVQ---CKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTS 215
Query: 39 RVDHSILLPKLPFYGAEKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLV--QLSDPLEE 96
R DHS + L + + L +L ++ + + LLV + + +
Sbjct: 216 RSDHSSNIK-LRI-HSIQAELRRL------LKSKPYENC--------LLVLLNVQNA--K 257
Query: 97 LPEGFDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLV 156
F+ C + R + ++D + +S D
Sbjct: 258 AWNAFNLS-----CKILLTTRFKQ-------------VTDFL----SAATTTHISLDHH- 294
Query: 157 SCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASKQGCRPYEIPCERYMNGSH------ 210
CRP ++P E + ++
Sbjct: 295 ----------SMTLTPDE----------VKSLLLKYLDCRPQDLPRE--VLTTNPRRLSI 332
Query: 211 --SSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGS 268
S +D T + + + D L I SL E R++F
Sbjct: 333 IAESIRDG-LATWDNWKH----VNC---DKLT-TIIESSLNVLEPAEYRKMFD------R 377
Query: 269 MTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWG 328
++++ + + +I W + V K L S
Sbjct: 378 LSVFPP----------SA---HIPTILLSLI-WFDVIKSDVMVVVNK--LHKYSLVEKQP 421
Query: 329 ENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPE---------CIRKCQPGYDVS--Y 377
+ I I Y+ + N I K D+ Y
Sbjct: 422 KESTISI----PSI----YL---ELKVKLE--NEYALHRSIVDHYNIPKTFDSDDLIPPY 468
Query: 378 EDDLNFGRIAYSLPANEET----IMREIF----------RH-----GPVEGSMTIYADMI 418
D + I + L E + R +F RH + +
Sbjct: 469 LDQYFYSHIGHHLKNIEHPERMTLFRMVFLDFRFLEQKIRHDSTAWNASGSILNTLQQLK 528
Query: 419 LYKTGIYKH 427
YK I +
Sbjct: 529 FYKPYICDN 537
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
photosynthetic reaction center, peripheral antenna;
HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
Length = 154
Score = 36.9 bits (84), Expect = 0.004
Identities = 12/41 (29%), Positives = 17/41 (41%), Gaps = 11/41 (26%)
Query: 55 EKNALSKLTLSELEMRMGVHPDSKLPQNRLPLLVQLSDPLE 95
EK AL KL + L++ DS P L + +E
Sbjct: 18 EKQALKKLQ-ASLKL---YADDSA------PALA-IKATME 47
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 33.3 bits (75), Expect = 0.17
Identities = 50/392 (12%), Positives = 98/392 (25%), Gaps = 67/392 (17%)
Query: 101 FDARINWPYCPTIQEIRDQGSCGSGWALGAVEAMSDRVCIASRGKRHVRLSSDDLVSCCK 160
F P I +++Q G+ W + + + GK LS V
Sbjct: 14 FTTVKENP----ITSVKNQNRAGTCWCYSSYSFL--ESELLRMGKGEYDLSEMFTVYNTY 67
Query: 161 DCGNGCQGGFHGKAWKYWVTTGIVSGGTY------ASKQGCRPYEIPCERYMNGSHSSCQ 214
HG GG++ G P E G +
Sbjct: 68 LDRADAAVRTHGD-------VSFSQGGSFYDALYGMETFGLVPEEE----MRPGMMYADT 116
Query: 215 DNEPNTPE---CIRKCQPGYDVSYEDDLNFGRIAYSLPANEETIMREIFRHGPVEGSMTI 271
+ N E L L + +I+ P E
Sbjct: 117 LS--NHTELSALTDAMVAAIAKGKLRKLQSDENNAMLWKKAVAAVHQIYLGVPPE---KF 171
Query: 272 YADMILYKTGIYKHVAGGPLGEHAIRIIGWGQEPLGEGTSSVVKYWLVANSFNTNWGENG 331
Y + G ++ + + + P ++ L NW
Sbjct: 172 TYKGKEYTPKSFFESTGLKASDY-VSLTSYTHHPFYT------QFPL---EIQDNW---- 217
Query: 332 LFRIGCRPYEIPCERYMNGSRSSCQANEP------NTPECIRKCQPGYDVSYEDDLNFGR 385
Y +P + +M ++ + + E
Sbjct: 218 ---RHGMSYNLPLDEFMEVFDNAINTGYTIAWGSDVSESGFTRDGVAVMPDDEKVQELSG 274
Query: 386 IAYSLPANEETIMREIFRHGPVEGSMTIYADMILYKTGIYKHVAGGPLGEHAIRIIGWGQ 445
+ + +++ + T + Y + +H ++I G +
Sbjct: 275 SDMAHWLKLKPEEKKLNTKPQPQKWCTQ-----AERQLAYDNYETTD--DHGMQIYGIAK 327
Query: 446 EPLGEGTSSVVKYWLVANSFNTNWGENGLFRI 477
+ G +Y++V NS+ TN NG++
Sbjct: 328 DQEG------NEYYMVKNSWGTNSKYNGIWYA 353
>3r8s_0 50S ribosomal protein L32; protein biosynthesis, RNA, tRNA,
transfer RNA, 23S ribosomal subunit, ribosome recycling
factor, RRF, ribosome; 3.00A {Escherichia coli} PDB:
1p85_Z 1p86_Z 2awb_0 2aw4_0 2i2v_0 2j28_0 2i2t_0*
2qao_0* 2qba_0* 2qbc_0* 2qbe_0 2qbg_0 2qbi_0* 2qbk_0*
2qov_0 2qox_0 2qoz_0* 2qp1_0* 2rdo_0 2vhm_0 ...
Length = 56
Score = 27.2 bits (61), Expect = 2.0
Identities = 8/23 (34%), Positives = 11/23 (47%), Gaps = 2/23 (8%)
Query: 143 RGKR--HVRLSSDDLVSCCKDCG 163
RG R H L++ +S K G
Sbjct: 12 RGMRRSHDALTAVTSLSVDKTSG 34
>1lr7_A Follistatin, FS1; heparin-binding, cystine-rich, sucrose
octasulphate, hormone/growth factor complex; HET: SO4;
1.50A {Rattus norvegicus} SCOP: g.3.11.3 g.68.1.1 PDB:
1lr8_A* 1lr9_A
Length = 74
Score = 27.2 bits (60), Expect = 3.1
Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 2/29 (6%)
Query: 201 PCERYMNGSHSSCQDNEPNTPECIRKCQP 229
CE G C+ N+ N P C+ C P
Sbjct: 3 TCENVDCGPGKKCRMNKKNKPRCV--CAP 29
Score = 26.0 bits (57), Expect = 7.2
Identities = 10/29 (34%), Positives = 13/29 (44%), Gaps = 2/29 (6%)
Query: 343 PCERYMNGSRSSCQANEPNTPECIRKCQP 371
CE G C+ N+ N P C+ C P
Sbjct: 3 TCENVDCGPGKKCRMNKKNKPRCV--CAP 29
>2rjq_A Adamts-5; metalloprotease domain, aggrecanase, cleavage on PAIR of
BAS residues, extracellular matrix, glycoprotein,
hydrolase, ME binding; HET: NAG BAT; 2.60A {Homo
sapiens}
Length = 378
Score = 28.9 bits (64), Expect = 4.8
Identities = 14/84 (16%), Positives = 29/84 (34%), Gaps = 5/84 (5%)
Query: 163 GNGCQGGFHGKAWKYWVTTGIVSGGTYASKQG-CRPYEIPCERYMNGSHSSCQDNEPNTP 221
+ G + + I++ + C I +++ H +C + P
Sbjct: 161 DSKFCEETFGSTEDKRLMSSILTSIDASKPWSKCTSATI--TEFLDDGHGNCLLDLPRKQ 218
Query: 222 ECIRKCQPG--YDVSYEDDLNFGR 243
+ PG YD + + +L FG
Sbjct: 219 ILGPEELPGQTYDATQQCNLTFGP 242
>1igr_A Insulin-like growth factor receptor 1; hormone receptor, insulin
receptor family; HET: NAG FUC BMA MAN; 2.60A {Homo
sapiens} SCOP: c.10.2.5 c.10.2.5 g.3.9.1
Length = 478
Score = 28.6 bits (63), Expect = 5.2
Identities = 15/92 (16%), Positives = 29/92 (31%), Gaps = 9/92 (9%)
Query: 158 CCKDCGNGCQGGFHGKA----WKYWVTTGIVS---GGTYASKQGCRPYEIPCERYMNGSH 210
C +C C + A Y+ V TY + C + +
Sbjct: 201 CHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYRFEGWRCVDRDFC-ANILSAE 259
Query: 211 SSCQDNE-PNTPECIRKCQPGYDVSYEDDLNF 241
SS + + EC+++C G+ + +
Sbjct: 260 SSDSEGFVIHDGECMQECPSGFIRNGSQSMYC 291
>3hh2_C Follistatin; protein-protein complex, TB domain, cystine knot
motif, TGF- fold, disulfide linked dimer, CLE PAIR of
basic residues, cytokine; HET: CIT; 2.15A {Homo sapiens}
PDB: 2b0u_C* 2p6a_D
Length = 288
Score = 28.4 bits (62), Expect = 6.1
Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 2/45 (4%)
Query: 194 GCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDD 238
C P + CE G C+ N+ N P C+ C P
Sbjct: 58 NCIPCKETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSNITWKG 100
Score = 28.1 bits (61), Expect = 7.1
Identities = 14/62 (22%), Positives = 20/62 (32%), Gaps = 2/62 (3%)
Query: 319 VANSFNTNWGENGLFRIGCRPYEIPCERYMNGSRSSCQANEPNTPECIRKCQPGYDVSYE 378
V ++ W C P + CE G C+ N+ N P C+ C P
Sbjct: 41 VNDNTLFKWMIFNGGAPNCIPCKETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSNITW 98
Query: 379 DD 380
Sbjct: 99 KG 100
>1n1i_A Merozoite surface protein-1; MSP1, malaria, surface antigen,
glycoprotein, EGF domain, cell adhesion; HET: HIS; 2.40A
{Plasmodium knowlesi strain H} SCOP: g.3.11.4 g.3.11.4
Length = 105
Score = 26.6 bits (58), Expect = 8.2
Identities = 6/30 (20%), Positives = 11/30 (36%), Gaps = 2/30 (6%)
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
C +++C T E +C G+
Sbjct: 13 CIDTNVPENAACYRYLDGTEEW--RCLLGF 40
>1ob1_C Major merozoite surface protein; immune system,
immunoglobulin/complex, immunoglobulin, antib fragment,
MSP1-19, EGF-like domain; 2.90A {Plasmodium falciparum}
SCOP: g.3.11.4 g.3.11.4 PDB: 1cej_A 2flg_A
Length = 99
Score = 26.3 bits (57), Expect = 9.0
Identities = 8/30 (26%), Positives = 11/30 (36%), Gaps = 2/30 (6%)
Query: 202 CERYMNGSHSSCQDNEPNTPECIRKCQPGY 231
C + +S C + EC KC Y
Sbjct: 7 CVKKQCPQNSGCFRHLDEREEC--KCLLNY 34
>3nvx_A Protein A39; beta-propeller, viral protein; HET: NAG; 2.00A
{Vaccinia virus} PDB: 3nvn_A*
Length = 383
Score = 27.8 bits (61), Expect = 9.7
Identities = 13/142 (9%), Positives = 27/142 (19%), Gaps = 8/142 (5%)
Query: 134 MSDRVCIASRGKRHV-RLSSDDLVSCCKDCGNGCQGGFHGKAWKYWVTTGIVSGGTYASK 192
+ D + G V S++ L N + GT
Sbjct: 19 LDDVLYTGVNG--AVYTFSNNKLNKTGLTNNNYI----TTSIKVEDADKDTLVCGTNNGN 72
Query: 193 QGCRPYEIPCERYMNGSHSSCQDNEPNTPECIRKCQPGYDVSYEDDLNFGRIAYSLPANE 252
C + + G D++ + + P
Sbjct: 73 PKCWKIDGSDDPKHRG-RGYAPYQNSKVTIISYNECVLSDINISKEGIKRWRRFDGPCGY 131
Query: 253 ETIMREIFRHGPVEGSMTIYAD 274
+ + + D
Sbjct: 132 DLYTADNVIPKDGLRGAFVDKD 153
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.318 0.137 0.435
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 8,240,073
Number of extensions: 511290
Number of successful extensions: 1330
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1201
Number of HSP's successfully gapped: 129
Length of query: 524
Length of database: 6,701,793
Length adjustment: 98
Effective length of query: 426
Effective length of database: 3,965,535
Effective search space: 1689317910
Effective search space used: 1689317910
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.3 bits)
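
Note on the statistics block above: the effective lengths, the search space, and the bit values shown in parentheses can be re-derived from the other figures in the footer. The sketch below is a cross-check only, not part of the RPS-BLAST output. It assumes the usual BLAST relations: the length adjustment is subtracted once from the query and once per database sequence, bit score = (lambda * S - ln K) / ln 2, and drop-off values convert as lambda * X / ln 2. It also assumes that X1 and S1 use the ungapped lambda and K, while X2, X3 and S2 use the gapped ones. The variable and function names are illustrative. The footer prints lambda and K rounded, so agreement is expected only to the printed precision.

```python
import math

# Figures copied from the statistics footer of the report.
QUERY_LENGTH = 524
DB_LETTERS = 6_701_793
DB_SEQUENCES = 27_921
LENGTH_ADJUSTMENT = 98

# Karlin-Altschul parameters as printed (rounded by the program).
UNGAPPED = {"lam": 0.318, "k": 0.137}
GAPPED = {"lam": 0.267, "k": 0.0856}

# Assumed relation: the adjustment is removed once from the query
# and once from every database sequence.
effective_query = QUERY_LENGTH - LENGTH_ADJUSTMENT
effective_db = DB_LETTERS - DB_SEQUENCES * LENGTH_ADJUSTMENT
search_space = effective_query * effective_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert a raw X drop-off to bits: lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    print("Effective length of query:   ", effective_query)
    print("Effective length of database:", effective_db)
    print("Effective search space:      ", search_space)
    print("X1 bits:", round(dropoff_bits(16, UNGAPPED["lam"]), 1))
    print("X2 bits:", round(dropoff_bits(38, GAPPED["lam"]), 1))
    print("X3 bits:", round(dropoff_bits(64, GAPPED["lam"]), 1))
    print("S1 bits:", round(bit_score(41, **UNGAPPED), 1))
    print("S2 bits:", round(bit_score(59, **GAPPED), 1))
```

Under these assumptions the script reproduces the footer values: 426, 3,965,535 and 1,689,317,910 for the effective lengths and search space, and 7.3, 14.6, 24.7, 21.7 and 26.3 bits for X1, X2, X3, S1 and S2. The bit scores printed for individual HSPs are not checked here, since RPS-BLAST may apply profile-specific statistics to them.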