RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy14862
(263 letters)
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 132 bits (334), Expect = 3e-37
Identities = 45/153 (29%), Positives = 72/153 (47%), Gaps = 9/153 (5%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVN 126
EE LD ++ + + + +QY++ + R I+ NLK I + + G TY +N
Sbjct: 4 EEILD--THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMN 61
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
DMT E ++ L ++ S + +S++Y+ KG V P V++
Sbjct: 62 HLGDMTSEEVVQKMTGL---KVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTP-VKN 117
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S+V LE K +L+ LS Q
Sbjct: 118 QGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 150
Score = 39.1 bits (92), Expect = 8e-04
Identities = 14/35 (40%), Positives = 17/35 (48%), Gaps = 4/35 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+FY GV C +LNHAVL VGY +
Sbjct: 242 QFYSKGVYYDES--C--NSDNLNHAVLAVGYGIQK 272
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 130 bits (330), Expect = 1e-36
Identities = 47/145 (32%), Positives = 77/145 (53%), Gaps = 9/145 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDS 134
F ++ + + Y++ E RF+IF++NL ID + + +Y G+N FAD+++
Sbjct: 20 QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYID---ETNKKNNSYWLGLNEFADLSND 76
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EFN I+ F + + L E+++++ KG V P V+ Q CGSCW
Sbjct: 77 EFN---EKYVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTP-VRHQGSCGSCW 132
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SAVA +E I+ +L+ELS+Q
Sbjct: 133 AFSAVATVEGINKIRTGKLVELSEQ 157
Score = 38.8 bits (91), Expect = 0.001
Identities = 11/35 (31%), Positives = 15/35 (42%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ YKGG+ P C ++ AV VGY
Sbjct: 248 QLYKGGIFEGP---CGT---KVDGAVTAVGYGKSG 276
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 129 bits (327), Expect = 4e-36
Identities = 47/153 (30%), Positives = 76/153 (49%), Gaps = 11/153 (7%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVN 126
+ LDH + + + Y +QY +E R I+ NLK + + +H G +Y G+N
Sbjct: 5 DPTLDH--HWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 62
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
DMT E +SSL T++ S+ + L +S+++++KG V V+
Sbjct: 63 HLGDMTSEEVMSLMSSLRVPSQWQRNITYK-----SNPNRILPDSVDWREKGCVTE-VKY 116
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CG+ WA SAV LE+ +K +L+ LS Q
Sbjct: 117 QGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQ 149
Score = 38.4 bits (90), Expect = 0.001
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 5/35 (14%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
Y+ GV P C++ ++NH VL VGY + +
Sbjct: 244 FLYRSGVYYEPS--CTQ---NVNHGVLVVGYGDLN 273
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 128 bits (323), Expect = 2e-35
Identities = 42/149 (28%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
++++F Y R Y + E R IF+ L+T + + K+ QG +Y GVN F DMT
Sbjct: 21 KWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPE 80
Query: 135 EFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
E + + +N + S S +++D+G V P V++Q CGS
Sbjct: 81 EMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSP-VKNQGSCGS 139
Query: 193 CWAHSAVACLESAYAIKHNELIE--LSKQ 219
WA S+ +ES I + + +S+Q
Sbjct: 140 SWAFSSTGAIESQMKIANGAGYDSSVSEQ 168
Score = 38.8 bits (91), Expect = 0.001
Identities = 15/35 (42%), Positives = 16/35 (45%), Gaps = 4/35 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
Y GGV P C HAVL VGY NE+
Sbjct: 259 GSYSGGVYYNPT--C--ETNKFTHAVLIVGYGNEN 289
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 126 bits (319), Expect = 5e-35
Identities = 39/146 (26%), Positives = 68/146 (46%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+ + + R Y ++Y+ + + R +I+ N+K I + +H+ G TY G+N+F DMT
Sbjct: 3 DLWHQWKRMYNKEYNGADD-QHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + + L + N + + + I++++ G V V+DQ CGS
Sbjct: 62 EEFKAKYLTEMSRASDILSHGVPYEANNRA----VPDKIDWRESGYVTE-VKDQGNCGSG 116
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S +E Y I S+Q
Sbjct: 117 WAFSTTGTMEGQYMKNERTSISFSEQ 142
Score = 38.0 bits (89), Expect = 0.002
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 4/35 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
Y+ G+ CS P +NHAVL VGY +
Sbjct: 234 MMYRSGIYQSQ--TCS--PLRVNHAVLAVGYGTQG 264
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 127 bits (320), Expect = 6e-35
Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 8/145 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
Q+ F +++ Y S E RR IF++N+ I + K E+G TY +N+F DM+
Sbjct: 26 QWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKE 85
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF ++ + + K S+ LA S++++ V V+DQ CGS W
Sbjct: 86 EFLAYVNRG---KAQKPKHPENLRMPYVSSKKPLAASVDWRSNA-VSE-VKDQGQCGSSW 140
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
+ S +E A++ L LS+Q
Sbjct: 141 SFSTTGAVEGQLALQRGRLTSLSEQ 165
Score = 39.2 bits (92), Expect = 7e-04
Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 4/35 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+FY GG+ C LNH VL VGY +++
Sbjct: 257 QFYSGGLFYDQT--C--NQSDLNHGVLVVGYGSDN 287
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 125 bits (317), Expect = 1e-34
Identities = 30/144 (20%), Positives = 61/144 (42%), Gaps = 10/144 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
F+++ + + + Y + + E F ++K + +N +D++ EF
Sbjct: 6 KTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGG--------AINHLSDLSLDEF 57
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L S + + + + S + I+ + V P ++ Q CGS WA
Sbjct: 58 KNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTP-IRMQGGCGSAWA 116
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S VA ESAY ++ ++L++Q
Sbjct: 117 FSGVAATESAYLAYRDQSLDLAEQ 140
Score = 39.2 bits (92), Expect = 9e-04
Identities = 11/35 (31%), Positives = 12/35 (34%), Gaps = 4/35 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
R Y G + G HAV VGY N
Sbjct: 231 RHYDGRTIIQRD--N--GYQPNYHAVNIVGYSNAQ 261
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 123 bits (312), Expect = 6e-34
Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 11/146 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
Q+ + + R Y + E R ++ N+K I+ + ++ +G ++ +N F DMT
Sbjct: 10 AQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF ++ + + F Y S+++++KG V P V++Q CGSC
Sbjct: 69 EEFRQVMNGF------QNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTP-VKNQGQCGSC 121
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE K LI LS+Q
Sbjct: 122 WAFSATGALEGQMFRKTGRLISLSEQ 147
Score = 39.2 bits (92), Expect = 7e-04
Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNESTRT 263
FYK G+ P CS ++H VL VGY EST +
Sbjct: 240 LFYKEGIYFEPD--CS--SEDMDHGVLVVGYGFESTES 273
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 78.3 bits (193), Expect = 8e-17
Identities = 18/151 (11%), Positives = 49/151 (32%), Gaps = 8/151 (5%)
Query: 75 HGNQFKDFVREYER--QYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
G + + +S+ + +++ + + ++ + +T
Sbjct: 114 TGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLT 173
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKG--KVLPKVQDQHLC 190
+ + + L S ++++ + V++Q C
Sbjct: 174 LGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILF--LPTSWDWRNVHGINFVSPVRNQASC 231
Query: 191 GSCWAHSAVACLESAYAIKHN--ELIELSKQ 219
GSC++ +++ LE+ I N + LS Q
Sbjct: 232 GSCYSFASMGMLEARIRILTNNSQTPILSPQ 262
Score = 32.8 bits (75), Expect = 0.091
Identities = 12/34 (35%), Positives = 14/34 (41%), Gaps = 5/34 (14%)
Query: 226 RFYKGGVMNLPHMLCSK---GPYSLNHAVLNVGY 256
YK G+ + H NHAVL VGY
Sbjct: 357 LHYKKGIYH--HTGLRDPFNPFELTNHAVLLVGY 388
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 71.5 bits (176), Expect = 4e-15
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 159 SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSK 218
+ +++ + +++ G V P V+DQ LCGSCWA S+V +ES YAI+ L S+
Sbjct: 11 KYKPADAKLDRIAYDWRLHGGVTP-VKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSE 69
Query: 219 Q 219
Q
Sbjct: 70 Q 70
Score = 38.0 bits (89), Expect = 0.001
Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 6/37 (16%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNESTR 262
FY+GG + C + NHAV+ VGY +
Sbjct: 159 AFYRGGFYDGE---CGA---APNHAVILVGYGMKDIY 189
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 70.6 bits (174), Expect = 7e-15
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 157 TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIEL 216
T + + + + I+ + V P ++ Q CGS WA S VA ESAY + ++L
Sbjct: 1 TNACSINGN--APAEIDLRQMRTVTP-IRMQGGCGSAWAFSGVAATESAYLAYRQQSLDL 57
Query: 217 SKQ 219
++Q
Sbjct: 58 AEQ 60
Score = 39.0 bits (92), Expect = 6e-04
Identities = 11/34 (32%), Positives = 12/34 (35%), Gaps = 4/34 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
R Y G + G HAV VGY N
Sbjct: 151 RHYDGRTIIQRD--N--GYQPNYHAVNIVGYSNA 180
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 69.8 bits (172), Expect = 1e-14
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ SI+++ KG V P V++Q CGSCW S+VA +E I +L+ LS+Q
Sbjct: 1 IPTSIDWRQKGAVTP-VRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQ 51
Score = 39.4 bits (93), Expect = 5e-04
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 6/34 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
+ Y+GG+ P C S++HAV VGY N+
Sbjct: 142 QNYRGGIFAGP---CGT---SIDHAVAAVGYGND 169
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 69.8 bits (172), Expect = 1e-14
Identities = 25/52 (48%), Positives = 36/52 (69%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L E+++++ KG V P V+ Q CGSCWA SAVA +E I+ +L+ELS+Q
Sbjct: 1 LPENVDWRKKGAVTP-VRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQ 51
Score = 39.4 bits (93), Expect = 4e-04
Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ YKGG+ P C ++HAV VGY
Sbjct: 142 QLYKGGIFEGP---CGT---KVDHAVTAVGYGKSG 170
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 69.9 bits (172), Expect = 2e-14
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ S++++ KG V V+DQ CGSCWA S + +E IK N+L+ LS+Q
Sbjct: 2 VPASVDWRKKGAVTS-VKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQ 52
Score = 39.8 bits (94), Expect = 3e-04
Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+FY GV C L+H V VGY
Sbjct: 145 QFYSEGVFTGS---CGT---ELDHGVAIVGYGTTI 173
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 69.5 bits (171), Expect = 2e-14
Identities = 24/52 (46%), Positives = 35/52 (67%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L +SI++++ G V+P V++Q CGSCWA S VA +E I +LI LS+Q
Sbjct: 3 LPDSIDWRENGAVVP-VKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQ 53
Score = 39.4 bits (93), Expect = 4e-04
Identities = 11/35 (31%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ Y+ G+ C+ S NHA+ VGY E+
Sbjct: 144 QLYRSGIFTGS---CNI---SANHALTVVGYGTEN 172
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 69.9 bits (172), Expect = 2e-14
Identities = 23/50 (46%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ +++ V P V+DQ CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 20 AAYDWRLHSGVTP-VKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 68
Score = 37.6 bits (88), Expect = 0.002
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
FYK G+ + C LNHAV+ VG+ +
Sbjct: 157 AFYKEGIFDGE---CGD---QLNHAVMLVGFGMKE 185
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 69.4 bits (171), Expect = 2e-14
Identities = 23/50 (46%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+S++Y+ KG V P V++Q CGSCWA S+V LE K +L+ LS Q
Sbjct: 3 DSVDYRKKGYVTP-VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 51
Score = 39.8 bits (94), Expect = 4e-04
Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 4/31 (12%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
+FY GV C +LNHAVL VGY
Sbjct: 143 QFYSKGVYYDES--C--NSDNLNHAVLAVGY 169
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 69.5 bits (171), Expect = 2e-14
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L ++++ +G V P V+DQ CGSCWA S LE A+ K +L+ LS+Q
Sbjct: 7 LPAGVDWRSRGCVTP-VKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQ 57
Score = 37.1 bits (87), Expect = 0.003
Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 6/31 (19%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
+FY GV + C L+H VL VGY
Sbjct: 150 QFYHEGVFDAS---CGT---DLDHGVLLVGY 174
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 69.0 bits (170), Expect = 2e-14
Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
ESI++++KG V P V++Q+ CGSCWA S VA +E I +LI LS+Q
Sbjct: 3 ESIDWREKGAVTP-VKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQ 51
Score = 38.2 bits (90), Expect = 0.001
Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 6/31 (19%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
+FYKGG+ P C + +HAV VGY
Sbjct: 142 QFYKGGIYEGP---CGT---NTDHAVTAVGY 166
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 66.3 bits (162), Expect = 2e-14
Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F F Y + Y ++ E +RR+ IF+NNL I + + + + + +N F D++ EF
Sbjct: 23 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQ-QGYSYSLKMNHFGDLSRDEF 81
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 69.1 bits (170), Expect = 2e-14
Identities = 22/50 (44%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++++ KG V +++Q CGSCWA SAVA +ES I+ +LI LS+Q
Sbjct: 3 SFVDWRSKGAVNS-IKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQ 51
Score = 39.8 bits (94), Expect = 3e-04
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ Y G+ P C + NH V+ VGY +S
Sbjct: 141 QHYSSGIFTGP---CGT---AQNHGVVIVGYGTQS 169
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 69.0 bits (170), Expect = 2e-14
Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
E ++++ KG V P V++Q CGSCWA SAV +E I+ L + S+Q
Sbjct: 3 EYVDWRQKGAVTP-VKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQ 51
Score = 38.2 bits (90), Expect = 0.001
Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 6/31 (19%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
+ Y+GG+ P C ++HAV VGY
Sbjct: 142 QLYRGGIFVGP---CGN---KVDHAVAAVGY 166
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 69.1 bits (170), Expect = 2e-14
Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L +S+++++KG V V+ Q CG+CWA SAV LE+ +K +L+ LS Q
Sbjct: 2 LPDSVDWREKGCVTE-VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQ 52
Score = 39.0 bits (92), Expect = 6e-04
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 5/35 (14%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
Y+ GV P C++ ++NH VL VGY + +
Sbjct: 147 FLYRSGVYYEPS--CTQ---NVNHGVLVVGYGDLN 176
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 69.1 bits (170), Expect = 2e-14
Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ ++++ G V+ ++DQ CGS WA S +A +E I +LI LS+Q
Sbjct: 3 DYVDWRSSGAVVD-IKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQ 51
Score = 39.0 bits (92), Expect = 6e-04
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ Y G+ P C +++HAV VGY E
Sbjct: 145 QHYSSGIFTGP---CGT---AVDHAVTIVGYGTEG 173
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 69.1 bits (170), Expect = 3e-14
Identities = 23/50 (46%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+SI+++ KG V P V++Q CGSCWA S +A +E I L+ELS+Q
Sbjct: 3 QSIDWRAKGAVTP-VKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQ 51
Score = 39.8 bits (94), Expect = 3e-04
Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ YK GV + P C L+HAV VGY
Sbjct: 142 QLYKSGVFDGP---CGT---KLDHAVTAVGYGTSD 170
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 69.6 bits (171), Expect = 3e-14
Identities = 23/52 (44%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L S++++ KG V V+DQ CGSCWA S V +E AI+ L+ LS+Q
Sbjct: 4 LPPSVDWRQKGAVTG-VKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQ 54
Score = 39.6 bits (93), Expect = 5e-04
Identities = 11/35 (31%), Positives = 12/35 (34%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
FY GV C L+H V VGY
Sbjct: 150 MFYSEGVFTGE---CGT---ELDHGVAVVGYGVAE 178
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 69.1 bits (169), Expect = 3e-14
Identities = 21/50 (42%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
SI+++ KG V V+DQ CG CWA A +E AI LI +S+Q
Sbjct: 3 ASIDWRKKGAVTS-VKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQ 51
Score = 27.5 bits (61), Expect = 4.3
Identities = 22/102 (21%), Positives = 39/102 (38%), Gaps = 4/102 (3%)
Query: 164 NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIK-HNELIELSKQPPK 222
+ G+A NY G ++ + ++ V SA + + ++
Sbjct: 77 TNGGIASDANYPYTGVDGTCDLNKPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYTSS 136
Query: 223 THGRFYKG-GVMNLPHMLCSKGPYSLNHAVLNVGYDNESTRT 263
T + Y G G+ CS P +++H VL VGY + T
Sbjct: 137 TSFQLYTGPGIFAGSS--CSDDPATVDHTVLIVGYGSNGTNA 176
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 68.8 bits (169), Expect = 3e-14
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L E I+++ KG V P V++Q CGSCWA S V+ +ES I+ LI LS+Q
Sbjct: 1 LPEQIDWRKKGAVTP-VKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQ 51
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 69.2 bits (170), Expect = 3e-14
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
ES ++ KG + V+ Q CGS WA SA +E+A+AI L+ LS+Q
Sbjct: 2 APESWDWSKKGVITK-VKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQ 52
Score = 42.6 bits (101), Expect = 5e-05
Identities = 16/35 (45%), Positives = 21/35 (60%), Gaps = 3/35 (8%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
FY GG+ + + CS PY +NH VL VGY +E
Sbjct: 148 HFYSGGIYDGGN--CSS-PYGINHFVLIVGYGSED 179
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 68.3 bits (168), Expect = 4e-14
Identities = 23/50 (46%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
S+++++KG V P V++Q CGSCWA SA LE K LI LS+Q
Sbjct: 3 RSVDWREKGYVTP-VKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 51
Score = 40.6 bits (96), Expect = 2e-04
Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNESTRT 263
FYK G+ P CS ++H VL VGY EST +
Sbjct: 144 LFYKEGIYFEPD--CS--SEDMDHGVLVVGYGFESTES 177
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 67.5 bits (166), Expect = 8e-14
Identities = 18/50 (36%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++++ +G V V+DQ CGSCWA SA+ +E + + + L LS+Q
Sbjct: 3 AAVDWRARGAVTA-VKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQ 51
Score = 38.6 bits (91), Expect = 9e-04
Identities = 12/35 (34%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
Y GGV + S L+H VL VGY++ +
Sbjct: 145 MTYTGGV------MTSCVSEQLDHGVLLVGYNDSA 173
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 67.5 bits (166), Expect = 9e-14
Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ KG V V+DQ +CGSCWA S +E + + L+ LS+Q
Sbjct: 3 PEWDWRSKGAVTK-VKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 51
Score = 44.8 bits (107), Expect = 7e-06
Identities = 15/35 (42%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+FY+ G+ LCS P+ ++HAVL VGY S
Sbjct: 140 QFYRHGISRPLRPLCS--PWLIDHAVLLVGYGQRS 172
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 65.9 bits (162), Expect = 3e-13
Identities = 19/50 (38%), Positives = 31/50 (62%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
S++++ KG + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 3 PSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 52
Score = 44.8 bits (107), Expect = 8e-06
Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 2/34 (5%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
Y+ G+ + C K P +NHAVL VGY E
Sbjct: 145 LMYRKGIYSSTS--CHKTPDKVNHAVLAVGYGEE 176
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
disorder P like protein, hydrolase; NMR {Drosophila
melanogaster}
Length = 80
Score = 61.2 bits (149), Expect = 1e-12
Identities = 15/64 (23%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
++ ++ ++++ Y+++ + R I+ + I+ + K E+G T+ G+N AD+T
Sbjct: 9 EWVEYKSKFDKNYEAEED-LMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPE 67
Query: 135 EFNH 138
EF
Sbjct: 68 EFAQ 71
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 62.0 bits (151), Expect = 2e-11
Identities = 18/115 (15%), Positives = 40/115 (34%), Gaps = 7/115 (6%)
Query: 110 IDYYTKHEQGTATYGVN-RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGL 168
+D + +G + ++T E + ++ N + L
Sbjct: 15 VDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVI--KKNNNASILPKRRFTEEEARAPL 72
Query: 169 AESINYKDK---GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELSKQ 219
S + + +P++ DQ CGSCWA +A + + + + + +S
Sbjct: 73 PSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAG 127
Score = 27.7 bits (62), Expect = 3.9
Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 5/30 (16%)
Query: 227 FYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
Y GV + H+ Y HAV VG+
Sbjct: 243 AYNSGVYH--HVSGQ---YLGGHAVRLVGW 267
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 62.0 bits (151), Expect = 2e-11
Identities = 26/127 (20%), Positives = 45/127 (35%), Gaps = 16/127 (12%)
Query: 98 RRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFET 157
F + L ++Y K T G N F ++ S + L
Sbjct: 5 PSFHPLSDEL--VNYVNKR-NTTWQAGHN-FYNVDMSYLKRLCGTF-------LGGPKPP 53
Query: 158 YSFNSSNSYGLAESINYKDKG---KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELI 214
+ L S + +++ + +++DQ CGSCWA AV + I N +
Sbjct: 54 QRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHV 113
Query: 215 --ELSKQ 219
E+S +
Sbjct: 114 SVEVSAE 120
Score = 28.9 bits (65), Expect = 1.5
Identities = 8/31 (25%), Positives = 11/31 (35%), Gaps = 5/31 (16%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
YK GV G HA+ +G+
Sbjct: 244 LLYKSGVYQ-----HVTGEMMGGHAIRILGW 269
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 60.4 bits (147), Expect = 5e-11
Identities = 12/56 (21%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 164 NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
N + + L V+DQ C + W ++ LE+ +K E ++S
Sbjct: 6 NKEYCNRLKDENNCISNLQ-VEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISAL 60
Score = 34.6 bits (80), Expect = 0.024
Identities = 11/36 (30%), Positives = 14/36 (38%), Gaps = 4/36 (11%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNEST 261
+ G LC G + +HAV VGY N
Sbjct: 181 GYEFSGKKV--KNLC--GDDTADHAVNIVGYGNYVN 212
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 51.1 bits (123), Expect = 6e-08
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN--ELIELSKQ 219
K + ++DQ CGSCWA AV + I+ + +ELS
Sbjct: 17 KSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAV 59
Score = 29.9 bits (68), Expect = 0.62
Identities = 7/30 (23%), Positives = 11/30 (36%), Gaps = 5/30 (16%)
Query: 227 FYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
YK G+ G HA+ +G+
Sbjct: 184 NYKSGIYK-----HITGETLGGHAIRIIGW 208
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 50.8 bits (122), Expect = 1e-07
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 8/58 (13%)
Query: 170 ESINYKDKGKV--LPKVQDQHL---CGSCWAHSAVACLESAYAIKHN---ELIELSKQ 219
+S ++++ V ++QH+ CGSCWAH++ + + IK LS Q
Sbjct: 38 KSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 95
Score = 30.4 bits (69), Expect = 0.48
Identities = 7/30 (23%), Positives = 10/30 (33%), Gaps = 5/30 (16%)
Query: 227 FYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
Y GG+ +NH V G+
Sbjct: 198 NYTGGIYAEYQ-----DTTYINHVVSVAGW 222
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 49.2 bits (118), Expect = 3e-07
Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELI--ELSKQ 219
++DQ CGS WA AV + I N + E+S +
Sbjct: 26 IRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63
Score = 29.6 bits (67), Expect = 0.94
Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 5/34 (14%)
Query: 227 FYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
YK GV G HA+ +G+ E+
Sbjct: 188 LYKSGVYQ-----HVTGEMMGGHAIRILGWGVEN 216
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 48.7 bits (116), Expect = 6e-07
Identities = 15/112 (13%), Positives = 30/112 (26%), Gaps = 15/112 (13%)
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHG-LSSLDWEQIENLKSTFE----TYSFNSSNSYGL 168
+ H + G+ + S + + + +Y+ S L
Sbjct: 3 SSHHHHHHSSGLVPRGSHMQTVLKRRKKSGYGY-----IPDIADIRDFSYTPEKSVIAAL 57
Query: 169 AESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP 220
++ V DQ GSC A++ A ++ E
Sbjct: 58 PPKVDLTPP---FQ-VYDQGRIGSCTANALAAAIQFERIHDKQ-SPEFIPSR 104
Score = 36.0 bits (83), Expect = 0.008
Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 1/33 (3%)
Query: 227 FYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
+ +P + HAVL VGYD+E
Sbjct: 216 GNNSLPVRIP-LPTKNDTLEGGHAVLCVGYDDE 247
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 44.3 bits (104), Expect = 3e-05
Identities = 29/142 (20%), Positives = 44/142 (30%), Gaps = 45/142 (31%)
Query: 101 DIFRNNLKTIDYYTKHEQG---TATYGVNRFADMTDS---------EFNHGLSSLDWE-Q 147
DI NN + + E+G Y F + D E N +S + +
Sbjct: 1663 DIVINNPVNLTIHFGGEKGKRIRENYSAMIFETIVDGKLKTEKIFKEINEHSTSYTFRSE 1722
Query: 148 IENLKSTFETYSFNSSN--------SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS-- 197
L +T E + K KG ++P D G HS
Sbjct: 1723 KGLLSAT--------QFTQPALTLMEKAAFEDL--KSKG-LIPA--DATFAG----HSLG 1765
Query: 198 ---AVACLESAYAIKHNELIEL 216
A+A L A + L+E+
Sbjct: 1766 EYAALASL--ADVMSIESLVEV 1785
Score = 39.3 bits (91), Expect = 0.001
Identities = 39/270 (14%), Positives = 68/270 (25%), Gaps = 114/270 (42%)
Query: 14 ASVRGLTFQY---EAEWASGCVHTYLGWHPRWTGRVHNLILQRSQPNSYGS-----EEAS 65
S R LT + E V T + + P E +
Sbjct: 4 YSTRPLTLSHGSLEHVLL---VPTASFFI----ASQLQEQFNKILPEPTEGFAADDEPTT 56
Query: 66 TFDL-EEFLDH---------GNQFKDFVR----EYERQY---------------DSDSEI 96
+L +FL + QF + E+E Y ++D+ +
Sbjct: 57 PAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDTTL 116
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATYGVNR-FADMTDSEF----NHGLSSL-------- 143
+ ++ +N Y T A R F ++S G + L
Sbjct: 117 VKTKELIKN------YIT------ARIMAKRPFDKKSNSALFRAVGEGNAQLVAIFGGQG 164
Query: 144 ---DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
D+ E L+ ++TY V D
Sbjct: 165 NTDDY--FEELRDLYQTYH----------------------VLVGD-------------- 186
Query: 201 CLESAYAIKHNELIELSKQPPKTHGRFYKG 230
++ + L EL + F +G
Sbjct: 187 LIKFS----AETLSELIRTTLDAEKVFTQG 212
Score = 29.2 bits (65), Expect = 1.8
Identities = 27/174 (15%), Positives = 47/174 (27%), Gaps = 77/174 (44%)
Query: 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS--SLDWEQIENLKSTFETYS------ 159
++ ++ +G + M LS +L EQ+++ + ++
Sbjct: 322 SILEDSLENNEGVPSP-------M--------LSISNLTQEQVQDYVNKTNSHLPAGKQV 366
Query: 160 ----FNSSNSY---GLAES----INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAI 208
N + + G +S K K P DQ
Sbjct: 367 EISLVNGAKNLVVSGPPQSLYGLNLTLRKAKA-PSGLDQS-------------------- 405
Query: 209 KHNELIELSKQPPKTHGRFYKGGVMNLP-----HMLCSKGPYSLNHAVLNVGYD 257
+ I S++ K RF LP H S L A + D
Sbjct: 406 R----IPFSERKLKFSNRF-------LPVASPFH---S--HL-LVPASDLINKD 442
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 39.8 bits (92), Expect = 5e-04
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELS 217
V++Q+ G+CW +S+ + LES +LS
Sbjct: 24 SVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLS 58
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
photosynthetic reaction center, peripheral antenna; HET:
CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
Length = 154
Score = 32.6 bits (73), Expect = 0.057
Identities = 6/28 (21%), Positives = 17/28 (60%), Gaps = 4/28 (14%)
Query: 146 EQIENLKSTFETYSFNSSNSYGLAESIN 173
+ ++ L+++ + Y+ +S+ + LA I
Sbjct: 20 QALKKLQASLKLYADDSAPA--LA--IK 43
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 30.6 bits (68), Expect = 0.57
Identities = 21/130 (16%), Positives = 42/130 (32%), Gaps = 39/130 (30%)
Query: 27 WASGCVHTYLGWHPRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDH----------- 75
+ ++++G H + + L R + D FL+
Sbjct: 468 YLDQYFYSHIGHHLKNIEHPERMTLFR---MVF-------LDF-RFLEQKIRHDSTAWNA 516
Query: 76 ----GNQFKDFVREYERQY--DSDSEIERRFDIFRNNLKTID---YYTKHEQGTATYGVN 126
N + + Y + Y D+D + ER + + L I+ +K+ +
Sbjct: 517 SGSILNTLQQL-KFY-KPYICDNDPKYERLVNAILDFLPKIEENLICSKYTD------LL 568
Query: 127 RFADMTDSEF 136
R A M + E
Sbjct: 569 RIALMAEDEA 578
Score = 29.8 bits (66), Expect = 1.2
Identities = 30/177 (16%), Positives = 52/177 (29%), Gaps = 57/177 (32%)
Query: 103 FRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYS-FN 161
R+ L T D + KH ++ + + SSL+ + + F+ S F
Sbjct: 337 IRDGLATWDNW-KH------VNCDKLTTIIE-------SSLNVLEPAEYRKMFDRLSVFP 382
Query: 162 SSNSYGLAESINYKDKGKVL-------PKVQDQHLCGSCWAHSAVAC--LESAYAIKHNE 212
S + +L K + +S V ES +I +
Sbjct: 383 --------PSAHIPT--ILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISI-PSI 431
Query: 213 LIELSKQPPKT---HGRF---YKGGVMNLPHMLCSKGP-------YSLNHAVLNVGY 256
+EL + H Y N+P S Y +H +G+
Sbjct: 432 YLELKVKLENEYALHRSIVDHY-----NIPKTFDSDDLIPPYLDQYFYSH----IGH 479
>3qf4_A ABC transporter, ATP-binding protein; multidrug transporter,
transport protein; HET: ANP; 2.90A {Thermotoga maritima}
Length = 587
Score = 29.5 bits (67), Expect = 1.3
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERR 99
+G E+A+ ++ E Q DF+ Y DS +ER
Sbjct: 439 WGREDATDDEIVEAAKIA-QIHDFIISLPEGY--DSRVERG 476
>2qtp_A Uncharacterized protein; structural genomics, joint center for
structural genomics, J protein structure initiative,
PSI-2; 2.10A {Silicibacter pomeroyi dss-3} SCOP:
d.79.9.1
Length = 194
Score = 28.7 bits (64), Expect = 1.6
Identities = 10/30 (33%), Positives = 15/30 (50%), Gaps = 2/30 (6%)
Query: 12 GKASVRGLTFQYEAEWASGCVHTYLGWHPR 41
GK+++ G E E A+ +H LG R
Sbjct: 81 GKSAMVGE--NGELEHAAAILHPKLGAPLR 108
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 29.0 bits (64), Expect = 1.7
Identities = 10/61 (16%), Positives = 19/61 (31%), Gaps = 2/61 (3%)
Query: 159 SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN-ELIELS 217
+ + + P + +Q G W S + + + K N E E S
Sbjct: 39 CLKRATVQRAQHVFQHAVPQEGKP-ITNQKSSGRSWIFSCLNVMRLPFMKKLNIEEFEFS 97
Query: 218 K 218
+
Sbjct: 98 Q 98
>2ot3_A RAB5 GDP/GTP exchange factor; rabex-5, VPS9 domain, vesicular
traffic, protein transport; 2.10A {Homo sapiens} SCOP:
a.222.1.1 PDB: 1txu_A
Length = 274
Score = 27.8 bits (61), Expect = 3.2
Identities = 12/83 (14%), Positives = 26/83 (31%), Gaps = 4/83 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
K+F+ + + + EI ++ +F + + EQ F
Sbjct: 16 SKEFIEFLKTFHKTGQEIYKQTKLFLEGMHYKRDLSIEEQSEC---AQDFYHNVAERMQT 72
Query: 139 GLSSLDWEQIENLKSTFETYSFN 161
+ E++E + E Y
Sbjct: 73 -RGKVPPERVEKIMDQIEKYIMT 94
>1qgi_A Protein (chitosanase); hydrolase, chitosan degradation; HET: GCS
NAG; 1.60A {Bacillus circulans} SCOP: d.2.1.7 PDB:
2d05_A
Length = 259
Score = 27.8 bits (61), Expect = 3.8
Identities = 11/35 (31%), Positives = 12/35 (34%)
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
GL W I L + E N YG E I
Sbjct: 20 GLDGEQWNNIMKLINKPEQDDLNWIKYYGYCEDIE 54
>3l22_A SUSD superfamily protein; structural genomics, joint center
structural genomics, JCSG, protein structure initiative;
HET: MSE; 2.05A {Bacteroides fragilis}
Length = 441
Score = 27.5 bits (61), Expect = 5.3
Identities = 9/81 (11%), Positives = 22/81 (27%), Gaps = 9/81 (11%)
Query: 48 NLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNL 107
+ RS + + + + E++ + E R DI R
Sbjct: 353 KQVRGRSVDAATDPLNIDNLSGDALK-------EAIYN-EKRLEFIGEGIRGIDIMRRG- 403
Query: 108 KTIDYYTKHEQGTATYGVNRF 128
+ ++E ++
Sbjct: 404 EHFIKVGENETINVGPSDEKY 424
>3qbu_A Putative uncharacterized protein; metallo enzyme, peptidoglycan,
TIM barrel, deacetylase, HYDR; 2.57A {Helicobacter
pylori}
Length = 326
Score = 27.1 bits (60), Expect = 5.4
Identities = 9/62 (14%), Positives = 20/62 (32%), Gaps = 18/62 (29%)
Query: 20 TFQYEAEWASGCVHTYLGWHPRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQF 79
F + V + + HP + R L++ E+ ++H N+
Sbjct: 262 QFDWVYREMDYAVFS-MTIHPDVSARPQVLLM-----------------HEKIIEHINKH 303
Query: 80 KD 81
+
Sbjct: 304 EG 305
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 27.0 bits (59), Expect = 6.9
Identities = 8/29 (27%), Positives = 11/29 (37%)
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHN 211
V +Q G CW +A L + N
Sbjct: 67 PVTNQKSSGRCWLFAATNQLRLNVLSELN 95
>3vaa_A Shikimate kinase, SK; structural genomics, center for structural
genomics of infec diseases, csgid, metal binding,
transferase; 1.70A {Bacteroides thetaiotaomicron}
Length = 199
Score = 26.4 bits (59), Expect = 7.1
Identities = 7/21 (33%), Positives = 9/21 (42%), Gaps = 5/21 (23%)
Query: 90 YDSDSEIERRF-----DIFRN 105
D D IE RF ++F
Sbjct: 54 IDLDWYIEERFHKTVGELFTE 74
>3s6o_A Polysaccharide deacetylase family protein; ssgcid, NIH, structural
genomics, seattle structural genomic for infectious
disease; 1.85A {Burkholderia pseudomallei}
Length = 321
Score = 26.9 bits (59), Expect = 7.3
Identities = 10/46 (21%), Positives = 13/46 (28%), Gaps = 17/46 (36%)
Query: 36 LGWHPRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKD 81
+G H R GR L+ FLDH +
Sbjct: 264 IGMHCRLLGRPGRFRA-----------------LQRFLDHIERHDR 292
>2r2i_A Guanylyl cyclase-activating protein 1; EF hand, GCAP, guanylate
cyclase activating protein, GCAP1, GCAP-1, calcium,
lipoprotein, myristate; HET: MYR; 2.00A {Gallus gallus}
Length = 198
Score = 26.7 bits (59), Expect = 7.9
Identities = 15/84 (17%), Positives = 29/84 (34%), Gaps = 23/84 (27%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIF-RNNLKTIDYYTKHEQGTATYGVNRFADMTDSE 135
+FK F ++ +E+ F+ F N ID+ E
Sbjct: 35 YEFKQFFGLKNLSPSANKYVEQMFETFDFNKDGYIDF---------------------ME 73
Query: 136 FNHGLSSLDWEQI-ENLKSTFETY 158
+ LS + ++ + L+ F+ Y
Sbjct: 74 YVAALSLVLKGKVDQKLRWYFKLY 97
>3trf_A Shikimate kinase, SK; amino acid biosynthesis, transferase; 2.60A
{Coxiella burnetii}
Length = 185
Score = 26.0 bits (58), Expect = 9.4
Identities = 10/19 (52%), Positives = 11/19 (57%), Gaps = 5/19 (26%)
Query: 90 YDSDSEIERRF-----DIF 103
YDSD EIE+R IF
Sbjct: 34 YDSDKEIEKRTGADIAWIF 52
>2pt5_A Shikimate kinase, SK; aromatic amino acid biosynthesis, P-loop
kinase, SHI kinase, shikimate pathway; 2.10A {Aquifex
aeolicus}
Length = 168
Score = 25.9 bits (58), Expect = 9.9
Identities = 7/21 (33%), Positives = 10/21 (47%), Gaps = 5/21 (23%)
Query: 90 YDSDSEIERRF-----DIFRN 105
YD D E+++R IF
Sbjct: 29 YDVDEEVQKREGLSIPQIFEK 49
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.316 0.132 0.409
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,024,798
Number of extensions: 228597
Number of successful extensions: 581
Number of sequences better than 10.0: 1
Number of HSP's gapped: 521
Number of HSP's successfully gapped: 110
Length of query: 263
Length of database: 6,701,793
Length adjustment: 92
Effective length of query: 171
Effective length of database: 4,133,061
Effective search space: 706753431
Effective search space used: 706753431
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.1 bits)