RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy15346
(280 letters)
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 151 bits (383), Expect = 3e-44
Identities = 73/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD F
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD-F------ 243
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSGVY + +A
Sbjct: 244 ----------------LLYKSGVYQHVTGEMMGGHA------------------------ 263
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 264 -----------IRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 312
Query: 242 LPKDN 246
+P+ +
Sbjct: 313 IPRTD 317
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 149 bits (379), Expect = 3e-44
Identities = 70/242 (28%), Positives = 100/242 (41%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VTG + ++ GC+P FP C H + P C + P+C
Sbjct: 72 CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A +Y D F
Sbjct: 131 KQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED-F------ 182
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YKSG+Y + +A
Sbjct: 183 ----------------LNYKSGIYKHITGETLGGHA------------------------ 202
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+++IGWG EN PYW I +++ E +G+ G +I+RGR+E IES V
Sbjct: 203 -----------IRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 251
Query: 242 LP 243
Sbjct: 252 RI 253
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 146 bits (370), Expect = 9e-43
Identities = 70/245 (28%), Positives = 102/245 (41%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC + + P C PKC
Sbjct: 77 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEA-HVNGARPPCTG-EGDTPKC 134
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD F
Sbjct: 135 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD-F------ 186
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSGVY + +A
Sbjct: 187 ----------------LLYKSGVYQHVTGEMMGGHA------------------------ 206
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 207 -----------IRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 255
Query: 242 LPKDN 246
+P+ +
Sbjct: 256 IPRTD 260
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 138 bits (349), Expect = 4e-39
Identities = 58/244 (23%), Positives = 80/244 (32%), Gaps = 71/244 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G W + GLV+ CQP FP C +H+ P C PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVSDY-------CQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C + YR Y + E D +E+ GP +Y D F
Sbjct: 193 CDYTCDDPTIP----VVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYED-F----- 241
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y SGVY + + +A V++VGWG NG PYW
Sbjct: 242 -----------------IAYNSGVYHHVSGQYLGGHA-VRLVGWGTSNGVPYW------- 276
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+A N W ++G G I RG +E IE +
Sbjct: 277 ------KIA------------N---SW------NTEWGMDGYFLIRRGSSECGIEDGGSA 309
Query: 241 ALPK 244
+P
Sbjct: 310 GIPL 313
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 94.3 bits (235), Expect = 6e-23
Identities = 40/251 (15%), Positives = 61/251 (24%), Gaps = 88/251 (35%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + H+ G+ C +C T
Sbjct: 106 CEGGNDLSVWDYAHQHGIPDET-------CNNYQAKDQEC----DKFNQCGT-----CNE 149
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + EI NGP+
Sbjct: 150 FKECHAIR------NYTLWRVGDYGSLSGREKMMAEIYANGPI----------------- 186
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ + +Y G+YA + + V + GWG +G YW IVR
Sbjct: 187 -----SCG-IMATERLANYTGGIYAEYQDTTYINHV-VSVAGWGISDGTEYW-IVR---- 234
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI-------- 233
N W GE +G++G ++I+ +
Sbjct: 235 --------------------NS---W------GEPWGERGWLRIVTSTYKDGKGARYNLA 265
Query: 234 IESLVNGALPK 244
IE P
Sbjct: 266 IEEHCTFGDPI 276
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 88.7 bits (220), Expect = 3e-20
Identities = 47/243 (19%), Positives = 82/243 (33%), Gaps = 82/243 (33%)
Query: 2 CSSGISSST-WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C G + GLV C P YT ++ CK +
Sbjct: 274 CEGGFPYLIAGKYAQDFGLVEEA-------CFP----------YTGTDSPCK----MKED 312
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C R + + + +Y +E A ++ E++ +GP+ +Y D YK G
Sbjct: 313 CF---------RYYSSEYHYVGGFYGGCNE-ALMKLELVHHGPMAVAFEVYDDFLHYKKG 362
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + +G+ E+ +A V +VG+G ++
Sbjct: 363 IY-----------------HHTGLRDPFNPFELTNHA-VLLVGYGTDS------------ 392
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+G YW + +++G +G+ G +I RG +E IES+
Sbjct: 393 --------------------ASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVA 432
Query: 241 ALP 243
A P
Sbjct: 433 ATP 435
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 57.2 bits (138), Expect = 8e-10
Identities = 20/176 (11%), Positives = 44/176 (25%), Gaps = 42/176 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
S + +HK G+ P P + P P +C
Sbjct: 121 DSGAMIRDGIKVLHKLGVCPEK-------EWPYGDTPADP-RTEEFPPGAPASKKPSDQC 172
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ Y+ Y V ++ ++ + P V +Y+ S
Sbjct: 173 YK-----------DAQNYKITEYSRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLP 221
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+ + + + V VG+ ++ ++ +R
Sbjct: 222 V--------------------RIPLPTKNDTLEGGHAVLCVGY--DDEIRHF-RIR 254
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 45.4 bits (107), Expect = 2e-05
Identities = 34/234 (14%), Positives = 70/234 (29%), Gaps = 72/234 (30%)
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
D + Y + I +I+ N PV N+ ++ F + GK + Y
Sbjct: 1650 DNHFKDTYGF------SIL-DIVINNPV--NLTIH---FGGEKGKR-----IRENYSA-- 1690
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVG--WGEENG---RPYWTIVRVYAVSASAEIVAYA 191
+++ V + +I + E G +T + + A
Sbjct: 1691 -MIFETIVDGKLKTEKIFKEINEHSTSYTFRSEKGLLSATQFTQPALTLME-------KA 1742
Query: 192 TVKLIGWGEENG-RPYWTIVSTF-----GE-----------QFGDKGTIKILRGRNEAII 234
+ + + G P +TF GE ++++ R +
Sbjct: 1743 AFEDL---KSKGLIP---ADATFAGHSLGEYAALASLADVMSIES--LVEVVFYRGMTMQ 1794
Query: 235 ESLVNGALPKDNYG----------VEFGEESGERLSEEFGVRAESSEEFRE--N 276
++ L + NYG F +E+ + + E + + E N
Sbjct: 1795 VAVPRDELGRSNYGMIAINPGRVAASFSQEALQYVVERV---GKRTGWLVEIVN 1845
Score = 30.0 bits (67), Expect = 1.0
Identities = 45/278 (16%), Positives = 80/278 (28%), Gaps = 84/278 (30%)
Query: 43 NYTTSEPECKTLATPQPKCHT----RCTNDN----Y----GRG----FFQD-KYRFKRY- 84
NY T+ P K R + G+G +F++ + ++ Y
Sbjct: 125 NYITA---RIMAKRPFDKKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFEELRDLYQTYH 181
Query: 85 YWVND---EVADIQQEIMKNGPVVANMYLYS-DIFSYKSGKYGNGPVVANMYLYSDIFSY 140
V D A+ E+++ ++ +I + N P YL S S
Sbjct: 182 VLVGDLIKFSAETLSELIRTTLDAEKVFTQGLNILEWLENP-SNTPDKD--YLLSIPISC 238
Query: 141 K-SGVYAVSASAEIVAYA-TVKIVGW--GEENGRPYWTIVRVYAVSASAEIVAYAT---- 192
GV + Y T K++G+ GE + +A +A
Sbjct: 239 PLIGVIQ------LAHYVVTAKLLGFTPGELRSYLKGATGHSQGL-VTAVAIAETDSWES 291
Query: 193 --------VKL---IGWGEENGRPYWTIVSTFGEQFGDKG-----------------TIK 224
+ + IG P ++ + E +
Sbjct: 292 FFVSVRKAITVLFFIGVRCYEAYPNTSLPPSILEDSLENNEGVPSPMLSISNLTQEQVQD 351
Query: 225 ILRGRN------EAIIESLVNGA-------LPKDNYGV 249
+ N + + SLVNGA P+ YG+
Sbjct: 352 YVNKTNSHLPAGKQVEISLVNGAKNLVVSGPPQSLYGL 389
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 43.8 bits (104), Expect = 3e-05
Identities = 11/38 (28%), Positives = 24/38 (63%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G +NG+ YW + +++G +G+ G + +R
Sbjct: 278 VLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQVRNYG 315
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 42.8 bits (102), Expect = 3e-05
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V L+G+G+ + P+W I +++G +G+KG + RG
Sbjct: 163 VLLVGYGQRSDVPFWAIKNSWGTDWGEKGYYYLHRGSG 200
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 42.5 bits (101), Expect = 5e-05
Identities = 13/38 (34%), Positives = 27/38 (71%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G+ NG+ YW + +++G FG++G I++ R +
Sbjct: 167 VLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKG 204
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 42.5 bits (101), Expect = 5e-05
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G + G +W I +++GE +G+KG I + R +N
Sbjct: 164 VLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKN 201
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 43.0 bits (102), Expect = 5e-05
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G + G +W I +++GE +G+KG I + R +N
Sbjct: 263 VLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKN 300
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 42.6 bits (101), Expect = 6e-05
Identities = 13/38 (34%), Positives = 27/38 (71%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G+ NG+ YW + +++G FG++G I++ R +
Sbjct: 264 VLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKG 301
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 42.2 bits (100), Expect = 8e-05
Identities = 12/38 (31%), Positives = 25/38 (65%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G + G YW + +++G +G++G I+++R R
Sbjct: 255 VLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRG 292
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 41.7 bits (99), Expect = 8e-05
Identities = 11/38 (28%), Positives = 26/38 (68%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G ++G+ YW + +++G+ +G++G I + R
Sbjct: 160 VVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIWMERNVA 197
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 42.2 bits (100), Expect = 8e-05
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G+ G+ Y I +++G +G+KG I+I R
Sbjct: 267 VTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPG 304
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 41.7 bits (99), Expect = 9e-05
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V L+G+ + PYW I +++ Q+G++G I+I +G N
Sbjct: 164 VLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSN 201
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 41.7 bits (99), Expect = 9e-05
Identities = 12/39 (30%), Positives = 24/39 (61%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
V ++G+G E G YW + +++G +G++G ++I R
Sbjct: 164 VTIVGYGTEGGIDYWIVKNSWGTTWGEEGYMRIQRNVGG 202
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 41.7 bits (99), Expect = 1e-04
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
V ++G+ G YW + +++ +GD G + +IE
Sbjct: 172 VNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 215
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 41.8 bits (99), Expect = 1e-04
Identities = 10/44 (22%), Positives = 20/44 (45%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
V ++G+ G YW + +++ +GD G + +IE
Sbjct: 252 VNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 41.4 bits (98), Expect = 1e-04
Identities = 10/38 (26%), Positives = 24/38 (63%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
+ ++G+G EN + +W + +++G+ +G+ G I+ R
Sbjct: 163 LTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAERNIE 200
Score = 32.1 bits (74), Expect = 0.13
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 9/56 (16%)
Query: 124 NGPVVANMYLYSDIF-SYKSGVYAVSASAEI---VAYATVKIVGWGEENGRPYWTI 175
N PV M F Y+SG++ S + + +VG+G EN + +W +
Sbjct: 129 NQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALT-----VVGYGTENDKDFWIV 179
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 41.3 bits (98), Expect = 1e-04
Identities = 18/38 (47%), Positives = 26/38 (68%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+GEENG PYW + +++G Q+G G I RG+N
Sbjct: 168 VLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKN 205
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 40.6 bits (96), Expect = 2e-04
Identities = 11/38 (28%), Positives = 23/38 (60%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G +G+ Y I +++G +G+KG +++ R
Sbjct: 161 VTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSG 198
Score = 32.1 bits (74), Expect = 0.14
Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 9/56 (16%)
Query: 124 NGPVVANMYLYSDIF-SYKSGVYAVSASAEI---VAYATVKIVGWGEENGRPYWTI 175
N P+ + F YKSGV+ ++ V VG+G +G+ Y I
Sbjct: 127 NQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVT-----AVGYGTSDGKNYIII 177
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 40.7 bits (96), Expect = 2e-04
Identities = 14/38 (36%), Positives = 23/38 (60%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G E+G YW +++GE +G G I+I R
Sbjct: 170 VLIVGYGSEDGVDYWIAKNSWGEDWGIDGYIRIQRNTG 207
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 40.7 bits (96), Expect = 3e-04
Identities = 15/38 (39%), Positives = 25/38 (65%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G ENG+ YW + +++G+ +G G KI R N
Sbjct: 280 VLIVGYGNENGQDYWLVKNSWGDGWGLDGYFKIARNAN 317
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 38.6 bits (91), Expect = 0.001
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V +G+G+ G+ Y I +++G +G+KG I+I R
Sbjct: 161 VTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPG 198
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 38.0 bits (89), Expect = 0.002
Identities = 12/39 (30%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 193 VKLIGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G E+G+ YWT+ +++G +G++G I++ +
Sbjct: 169 VAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 207
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 38.1 bits (89), Expect = 0.002
Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 193 VKLIGWG-----EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE----AIIES--LVNGA 241
V ++G+G E + YW + +++G +GD+G K+ I S + N
Sbjct: 202 VNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIHSVVIFNVD 261
Query: 242 LPKD 245
LP +
Sbjct: 262 LPMN 265
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 37.5 bits (87), Expect = 0.002
Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 193 VKLIGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G YW + +++G ++G G I I R N
Sbjct: 164 VLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIRRNTN 202
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 35.2 bits (82), Expect = 0.011
Identities = 14/37 (37%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 193 VKLIGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRG 228
V ++G+G +G YWT+ +++G ++G+KG I++ RG
Sbjct: 164 VAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERG 200
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 35.3 bits (82), Expect = 0.014
Identities = 12/42 (28%), Positives = 25/42 (59%), Gaps = 4/42 (9%)
Query: 193 VKLIGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G E + YW + +++GE++G G +K+ + R
Sbjct: 261 VLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 302
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 34.8 bits (81), Expect = 0.016
Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 2/41 (4%)
Query: 193 VKLIGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
V L+G+G +E+ + +W + +++G +G G + + + E
Sbjct: 169 VLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE 209
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 34.8 bits (81), Expect = 0.017
Identities = 12/42 (28%), Positives = 25/42 (59%), Gaps = 4/42 (9%)
Query: 193 VKLIGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V ++G+G E + YW + +++GE++G G +K+ + R
Sbjct: 165 VLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 206
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 34.9 bits (81), Expect = 0.017
Identities = 19/110 (17%), Positives = 40/110 (36%), Gaps = 32/110 (29%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEI---VAYATVKIVGWGEENGRPYWTIVRVYA 180
GP+ ++ + D YK G++ ++ V +VG+G + T
Sbjct: 143 LGPISISVAVSDDFAFYKEGIFDGECGDQLNHAVM-----LVGFGMKEIVNPLT------ 191
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
+ Y+ I +++G+Q+G++G I I +
Sbjct: 192 ------------------KKGEKHYYYIIKNSWGQQWGERGFINIETDES 223
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 34.4 bits (78), Expect = 0.035
Identities = 25/185 (13%), Positives = 53/185 (28%), Gaps = 15/185 (8%)
Query: 66 TNDNYGRGF---FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKY 122
T+ + F QD +R Y N + + + + SD+ +
Sbjct: 201 THHPFYTQFPLEIQDNWRHGMSY--NLPLDEFMEVFDNAINTGYTIAWGSDVSESGFTRD 258
Query: 123 GNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVS 182
G V+ + ++ + E + W + R
Sbjct: 259 G-VAVMPDDEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDD 317
Query: 183 ASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG-RNEAIIESLVN-G 240
+I A ++ G Y+ + +++G G + + +V+
Sbjct: 318 HGMQIYGIAK-------DQEGNEYYMVKNSWGTNSKYNGIWYASKAFVRYKTMNIVVHKD 370
Query: 241 ALPKD 245
ALPK
Sbjct: 371 ALPKA 375
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 33.7 bits (78), Expect = 0.048
Identities = 16/95 (16%), Positives = 28/95 (29%), Gaps = 32/95 (33%)
Query: 140 YKSGVYAVSASAEI---VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
Y+ G Y A V +VG+G ++
Sbjct: 161 YRGGFYDGECGAAPNHAVI-----LVGYGMKDIYNE------------------------ 191
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
G Y+ I +++G +G+ G I + N
Sbjct: 192 DTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENG 226
>2g81_I BTCI, bowman-BIRK type SEED trypsin and chymotrypsin inhibitor;
proteinase inhibitor, protein structure, bowman-BIRK
inhibitor; HET: P6G PGE; 1.55A {Vigna unguiculata}
SCOP: g.3.13.1 PDB: 1h34_A 1tab_I 2r33_A 1bbi_A 2bbi_A
1d6r_I 1k9b_A 1pi2_A
Length = 83
Score = 29.9 bits (67), Expect = 0.23
Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 21 TGGAHHSNTGCQPVSFPPCNHANYTTSEP-ECKTLATPQPKCHTRC 65
+G S S P C+ T S P +C+ CH+ C
Sbjct: 1 SGHHEDSTDEASESSKPCCDRCECTKSIPPQCRCSDVRLNSCHSAC 46
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 31.4 bits (70), Expect = 0.35
Identities = 11/69 (15%), Positives = 27/69 (39%), Gaps = 13/69 (18%)
Query: 51 CKTLATPQPKC---------HTRCTNDNYGRGFFQDKYR--FKRYYWVNDEVADIQQEIM 99
CK L T + K T + D++ D+ + + +++ D+ +E++
Sbjct: 266 CKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLK--YLDCRPQDLPREVL 323
Query: 100 KNGPVVANM 108
P ++
Sbjct: 324 TTNPRRLSI 332
Score = 27.1 bits (59), Expect = 8.9
Identities = 9/60 (15%), Positives = 17/60 (28%), Gaps = 15/60 (25%)
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY---KSGKYGNGPVVANMYL 133
D Y + + +D P + Y YS I + + ++L
Sbjct: 451 DHYNIPKTFDSDDL-----------IPPYLDQYFYSHI-GHHLKNIEHPERMTLFRMVFL 498
>3tri_A Pyrroline-5-carboxylate reductase; amino acid biosynthesis,
oxidoreductase; HET: NAP; 2.50A {Coxiella burnetii}
Length = 280
Score = 29.0 bits (66), Expect = 1.7
Identities = 5/41 (12%), Positives = 11/41 (26%), Gaps = 1/41 (2%)
Query: 231 EAIIESLVNGALPKDN-YGVEFGEESGERLSEEFGVRAESS 270
I+ L+ + + + E+ GV
Sbjct: 16 RNIVVGLIANGYDPNRICVTNRSLDKLDFFKEKCGVHTTQD 56
>3hq1_A 2-isopropylmalate synthase; LEUA, mycobacterium tuberculosis
inhibition, bromopyruvate, amino-acid biosynthesis; HET:
FLC; 1.70A {Mycobacterium tuberculosis} PDB: 1sr9_A
3hpz_A 3hps_A* 3fig_A 3u6w_A 3hpx_A
Length = 644
Score = 28.6 bits (64), Expect = 2.6
Identities = 15/91 (16%), Positives = 26/91 (28%), Gaps = 8/91 (8%)
Query: 123 GNGPV------VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
GNGP+ +A++ + Y +A+SA + A A V+ +P
Sbjct: 531 GNGPLAAFVHALADVGFDVAVLDYY--EHAMSAGDDAQAAAYVEASVTIASPAQPGEAGR 588
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYW 207
A + W
Sbjct: 589 HASDPVTIASPAQPGEAGRHASDPVTSKTVW 619
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 27.4 bits (62), Expect = 4.1
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 203 GRPYWTIVSTFGEQFGDKGTIKILRGRN 230
G Y I +++G +G+ G I+I RG
Sbjct: 167 GPNYILIKNSWGTGWGENGYIRIKRGTG 194
>1rlj_A NRDI protein; flavoprotein, FMN, thioredoxin, alpha/beta/alpha
sandwich, structural genomics, PSI, protein structure
initiative; HET: FMN; 2.00A {Bacillus subtilis} SCOP:
c.23.5.7
Length = 139
Score = 27.1 bits (60), Expect = 4.4
Identities = 10/62 (16%), Positives = 21/62 (33%), Gaps = 4/62 (6%)
Query: 204 RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSEEF 263
P+ + T T L ++ +G +G F +S + +S ++
Sbjct: 45 TPFVLVTYTTNFGQVPASTQSFLEKYAHLLLGVAASGNK---VWGDNFA-KSADTISRQY 100
Query: 264 GV 265
V
Sbjct: 101 QV 102
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
vesicle, membrane, Ca structural protein; 7.94A {Bos
taurus}
Length = 190
Score = 27.4 bits (60), Expect = 4.5
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 253 EESGERLSEEFGVRAESSEEFRENGEEE 280
EE +RL E +E+RE +++
Sbjct: 92 EEQRKRLQELDAASKVMEQEWREKAKKD 119
>3fcp_A L-Ala-D/L-Glu epimerase, A muconate lactonizing enzyme; structural
genomics, nysgrc,target 9450E, PSI-2; 1.80A {Klebsiella
pneumoniae subsp}
Length = 381
Score = 27.3 bits (61), Expect = 5.5
Identities = 10/37 (27%), Positives = 13/37 (35%)
Query: 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIG 197
I G GE + + S+ I Y T L G
Sbjct: 46 ICGIGEATTIGGLSYGVESPEAISSAITHYLTPLLKG 82
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 27.0 bits (61), Expect = 6.2
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 203 GRPYWTIVSTFGEQFGDKGTIKILRGRN 230
G Y I +++G +G+ G I+I RG
Sbjct: 167 GNDYILIKNSWGTGWGEGGYIRIKRGSG 194
>3qld_A Mandelate racemase/muconate lactonizing protein; structural
genomics, PSI-2, isomerase; HET: MSE; 1.85A
{Alicyclobacillus acidocaldarius LAA1}
Length = 388
Score = 26.6 bits (59), Expect = 9.6
Identities = 11/59 (18%), Positives = 16/59 (27%)
Query: 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD 219
I GW E T +A +V + + W + T E D
Sbjct: 42 IEGWSECVALAEPTYTEECTDTAWVMLVHHLVPRFARWLRAASQDQDVDPRTVCEALRD 100
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.315 0.133 0.414
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,510,215
Number of extensions: 274203
Number of successful extensions: 786
Number of sequences better than 10.0: 1
Number of HSP's gapped: 728
Number of HSP's successfully gapped: 97
Length of query: 280
Length of database: 6,701,793
Length adjustment: 92
Effective length of query: 188
Effective length of database: 4,133,061
Effective search space: 777015468
Effective search space used: 777015468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.5 bits)
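
The effective-length figures in the statistics footer above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the RPS-BLAST output. It assumes the reported length adjustment is subtracted once from the query and once for every database sequence, an assumption that reproduces the reported numbers exactly. All variable names are illustrative.

```python
# Values copied from the statistics footer of the report.
query_length = 280           # "Length of query"
db_letters = 6_701_793       # "Length of database"
db_sequences = 27_921        # "Number of Sequences"
length_adjustment = 92       # "Length adjustment"

# Assumption: the adjustment is removed once from the query
# and once per sequence in the database.
effective_query = query_length - length_adjustment
effective_db = db_letters - db_sequences * length_adjustment

# The effective search space is the product of the two effective lengths.
effective_search_space = effective_query * effective_db

print("Effective length of query:   ", effective_query)
print("Effective length of database:", f"{effective_db:,}")
print("Effective search space:      ", effective_search_space)
```

Under that assumption the three printed values agree with the "Effective length of query", "Effective length of database" and "Effective search space" lines of the report.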