RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy15348
(298 letters)
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 115 bits (290), Expect = 2e-30
Identities = 64/198 (32%), Positives = 93/198 (46%), Gaps = 27/198 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
C Y + QDK+ G Y GP AF + F
Sbjct: 192 SKIC-EPGYSPTYKQDKHY--GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF----- 243
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + +A ++I+GWG ENG PYW + +++ +GD G KILRG
Sbjct: 244 --LLYKSG-VYQHVTGEMMGGHA-IRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRG 299
Query: 247 RNEAIIESLVNGALPKDN 264
++ IES V +P+ +
Sbjct: 300 QDHCGIESEVVAGIPRTD 317
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 113 bits (286), Expect = 2e-30
Identities = 57/195 (29%), Positives = 86/195 (44%), Gaps = 26/195 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C GI W + K G+VTG + ++ GC+P FP C H + P C + P+C
Sbjct: 72 CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
C Y + QDK++ G Y +GP F + F
Sbjct: 131 KQTC-QKKYKTPYTQDKHR--GKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDF----- 182
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G +Y + +A ++I+GWG EN PYW I +++ E +G+ G +I+RG
Sbjct: 183 --LNYKSG-IYKHITGETLGGHA-IRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRG 238
Query: 247 RNEAIIESLVNGALP 261
R+E IES V
Sbjct: 239 RDECSIESEVTAGRI 253
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 110 bits (277), Expect = 6e-29
Identities = 61/198 (30%), Positives = 91/198 (45%), Gaps = 27/198 (13%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C+ G + W + ++GLV+GG + S+ GC+P S PPC + + P C PKC
Sbjct: 77 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEA-HVNGARPPCTG-EGDTPKC 134
Query: 140 HTRCTNDNYGRGFFQDKYQINGLGLYFDP-----------HFGPFWPAF--WRSFCTKYT 186
C Y + QDK+ G Y GP AF + F
Sbjct: 135 SKIC-EPGYSPTYKQDKHY--GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF----- 186
Query: 187 RPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 246
L +G VY + +A ++I+GWG ENG PYW + +++ +GD G KILRG
Sbjct: 187 --LLYKSG-VYQHVTGEMMGGHA-IRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRG 242
Query: 247 RNEAIIESLVNGALPKDN 264
++ IES V +P+ +
Sbjct: 243 QDHCGIESEVVAGIPRTD 260
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 99.0 bits (247), Expect = 3e-24
Identities = 53/195 (27%), Positives = 78/195 (40%), Gaps = 33/195 (16%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 138
C+ G WA+ GLV+ CQP FP C +H+ P C PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVSDY-------CQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192
Query: 139 CHTRCTNDNYGRGFFQDK--YQING-------LGLYFDPHFGPFWPAF--WRSFCTKYTR 187
C C + ++ Y + G L GPF AF + F
Sbjct: 193 CDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFR-----GPFEVAFDVYEDF------ 241
Query: 188 PLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 247
+ +G VY + + +A V++VGWG NG PYW I +++ ++G G I RG
Sbjct: 242 -IAYNSG-VYHHVSGQYLGGHA-VRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGS 298
Query: 248 NEAIIESLVNGALPK 262
+E IE + +P
Sbjct: 299 SECGIEDGGSAGIPL 313
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 68.5 bits (168), Expect = 1e-13
Identities = 35/198 (17%), Positives = 61/198 (30%), Gaps = 42/198 (21%)
Query: 80 CSSGISSSTWAWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 139
C G S W + H+ G+ C +C T +C
Sbjct: 106 CEGGNDLSVWDYAHQHGIPDET-------CNNYQAKDQEC----DKFNQCGT-CNEFKEC 153
Query: 140 HTRCTNDNYGRGFFQDKYQING-----LGLYFDPHFGPFWPAF--WRSFCTKYTRPLFQT 192
H + G + ++G +Y GP T
Sbjct: 154 HAIRNYTLWRVGDY---GSLSGREKMMAEIY---ANGPISCGIMATERL-------ANYT 200
Query: 193 NGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI- 251
G +YA + + V + GWG +G YW + +++GE +G++G ++I+ +
Sbjct: 201 GG-IYAEYQDTTYINHV-VSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGK 258
Query: 252 -------IESLVNGALPK 262
IE P
Sbjct: 259 GARYNLAIEEHCTFGDPI 276
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 65.2 bits (159), Expect = 4e-12
Identities = 19/53 (35%), Positives = 32/53 (60%), Gaps = 2/53 (3%)
Query: 211 VKIVGWGEEN--GRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 261
V +VG+G ++ G YW + +++G +G+ G +I RG +E IES+ A P
Sbjct: 383 VLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 47.7 bits (114), Expect = 2e-06
Identities = 12/38 (31%), Positives = 24/38 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G +NG+ YW + +++G +G+ G + +R
Sbjct: 278 VLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQVRNYG 315
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 46.7 bits (112), Expect = 2e-06
Identities = 13/38 (34%), Positives = 24/38 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G+ + P+W I +++G +G+KG + RG
Sbjct: 163 VLLVGYGQRSDVPFWAIKNSWGTDWGEKGYYYLHRGSG 200
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 47.2 bits (113), Expect = 2e-06
Identities = 15/38 (39%), Positives = 25/38 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G + G +W I +++GE +G+KG I + R +N
Sbjct: 263 VLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKN 300
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 46.3 bits (111), Expect = 3e-06
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G+ NG+ YW + +++G FG++G I++ R +
Sbjct: 167 VLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKG 204
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 46.3 bits (111), Expect = 3e-06
Identities = 15/38 (39%), Positives = 25/38 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G + G +W I +++GE +G+KG I + R +N
Sbjct: 164 VLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKN 201
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 46.5 bits (111), Expect = 4e-06
Identities = 14/38 (36%), Positives = 27/38 (71%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G+ NG+ YW + +++G FG++G I++ R +
Sbjct: 264 VLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKG 301
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 46.5 bits (111), Expect = 4e-06
Identities = 13/38 (34%), Positives = 25/38 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G + G YW + +++G +G++G I+++R R
Sbjct: 255 VLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRG 292
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 46.0 bits (110), Expect = 4e-06
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V IVG+G E G YW + +++G +G++G ++I R
Sbjct: 164 VTIVGYGTEGGIDYWIVKNSWGTTWGEEGYMRIQRNVGG 202
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 46.5 bits (111), Expect = 4e-06
Identities = 14/38 (36%), Positives = 23/38 (60%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G+ G+ Y I +++G +G+KG I+I R
Sbjct: 267 VTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPG 304
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 46.0 bits (110), Expect = 4e-06
Identities = 13/38 (34%), Positives = 26/38 (68%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V IVG+G ++G+ YW + +++G+ +G++G I + R
Sbjct: 160 VVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIWMERNVA 197
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 45.6 bits (109), Expect = 5e-06
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V IVG+ G YW + +++ +GD G + +IE
Sbjct: 172 VNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 215
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 45.6 bits (109), Expect = 5e-06
Identities = 11/38 (28%), Positives = 24/38 (63%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
+ +VG+G EN + +W + +++G+ +G+ G I+ R
Sbjct: 163 LTVVGYGTENDKDFWIVKNSWGKNWGESGYIRAERNIE 200
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 45.6 bits (109), Expect = 6e-06
Identities = 14/38 (36%), Positives = 25/38 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+ + PYW I +++ Q+G++G I+I +G N
Sbjct: 164 VLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSN 201
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 46.1 bits (110), Expect = 6e-06
Identities = 12/44 (27%), Positives = 20/44 (45%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 254
V IVG+ G YW + +++ +GD G + +IE
Sbjct: 252 VNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 45.1 bits (108), Expect = 6e-06
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+GEENG PYW + +++G Q+G G I RG+N
Sbjct: 168 VLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKN 205
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 44.8 bits (107), Expect = 8e-06
Identities = 12/38 (31%), Positives = 23/38 (60%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G +G+ Y I +++G +G+KG +++ R
Sbjct: 161 VTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSG 198
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 44.5 bits (106), Expect = 1e-05
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V IVG+G E+G YW +++GE +G G I+I R
Sbjct: 170 VLIVGYGSEDGVDYWIAKNSWGEDWGIDGYIRIQRNTG 207
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 44.9 bits (107), Expect = 1e-05
Identities = 17/38 (44%), Positives = 25/38 (65%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V IVG+G ENG+ YW + +++G+ +G G KI R N
Sbjct: 280 VLIVGYGNENGQDYWLVKNSWGDGWGLDGYFKIARNAN 317
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 42.8 bits (102), Expect = 5e-05
Identities = 14/38 (36%), Positives = 23/38 (60%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G+ G+ Y I +++G +G+KG I+I R
Sbjct: 161 VTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPG 198
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 42.3 bits (100), Expect = 8e-05
Identities = 13/39 (33%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 211 VKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G E+G+ YWT+ +++G +G++G I++ +
Sbjct: 169 VAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 207
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 41.6 bits (98), Expect = 1e-04
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 11/64 (17%)
Query: 211 VKIVGWG-----EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE----AIIES--LVNGA 259
V IVG+G E + YW + +++G +GD+G K+ I S + N
Sbjct: 202 VNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIHSVVIFNVD 261
Query: 260 LPKD 263
LP +
Sbjct: 262 LPMN 265
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 39.4 bits (93), Expect = 5e-04
Identities = 16/37 (43%), Positives = 27/37 (72%), Gaps = 1/37 (2%)
Query: 211 VKIVGWGEE-NGRPYWTIVSTFGEQFGDKGTIKILRG 246
V IVG+G +G YWT+ +++G ++G+KG I++ RG
Sbjct: 164 VAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERG 200
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 39.2 bits (92), Expect = 0.001
Identities = 13/42 (30%), Positives = 25/42 (59%), Gaps = 4/42 (9%)
Query: 211 VKIVGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G E + YW + +++GE++G G +K+ + R
Sbjct: 261 VLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 302
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 38.7 bits (91), Expect = 0.001
Identities = 10/41 (24%), Positives = 24/41 (58%), Gaps = 2/41 (4%)
Query: 211 VKIVGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+G +E+ + +W + +++G +G G + + + E
Sbjct: 169 VLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGE 209
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 38.7 bits (91), Expect = 0.001
Identities = 13/42 (30%), Positives = 25/42 (59%), Gaps = 4/42 (9%)
Query: 211 VKIVGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G E + YW + +++GE++G G +K+ + R
Sbjct: 165 VLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 206
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 39.6 bits (92), Expect = 0.001
Identities = 20/141 (14%), Positives = 41/141 (29%), Gaps = 53/141 (37%)
Query: 183 TKYTRP-LFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVSTF-----GE--- 233
T++T+P L + A + + P +TF GE
Sbjct: 1729 TQFTQPALT-------LME-------KAAFED--LKSKGLIP---ADATFAGHSLGEYAA 1769
Query: 234 --------QFGDKGTIKILRGRNEAIIESLVNGALPKDNYG----------VEFGEESGE 275
++++ R + ++ L + NYG F +E+ +
Sbjct: 1770 LASLADVMSIES--LVEVVFYRGMTMQVAVPRDELGRSNYGMIAINPGRVAASFSQEALQ 1827
Query: 276 RLSEEFGVRAESSEEFRE--N 294
+ E + + E N
Sbjct: 1828 YVVERV---GKRTGWLVEIVN 1845
Score = 34.6 bits (79), Expect = 0.045
Identities = 68/430 (15%), Positives = 116/430 (26%), Gaps = 183/430 (42%)
Query: 2 GG-GTSSR----IRDMSYGATVYNRRPYALSCIEARAVATATPLAFAVCRSSKMHVE--- 53
GG G + +RD+ Y Y I+ + T + L + K+ +
Sbjct: 161 GGQGNTDDYFEELRDL-Y--QTY--HVLVGDLIK-FSAETLSELIRTTLDAEKVFTQGLN 214
Query: 54 -----------------CT---SFRFIAGVKQRCAWLVS---------------RWMTIW 78
+ S I GV Q ++V+ + T
Sbjct: 215 ILEWLENPSNTPDKDYLLSIPISCPLI-GVIQLAHYVVTAKLLGFTPGELRSYLKGAT-- 271
Query: 79 VCSSGI-------SSSTWA--------------WVHKRGLVTGGAHHSNTGCQ----PVS 113
S G+ + +W ++ G C S
Sbjct: 272 GHSQGLVTAVAIAETDSWESFFVSVRKAITVLFFI--------GVR-----CYEAYPNTS 318
Query: 114 FPPCNHANYTTSEPECKTLATP----------QPKCHTRCTNDNYGRGFFQDKYQI---N 160
PP + S + + +P Q + + TN + G + +I N
Sbjct: 319 LPP---SILEDSLENNEGVPSPMLSISNLTQEQVQDYVNKTNSHLPAG---KQVEISLVN 372
Query: 161 GLGLYFDP-HF---GPFWPAFWRSFCT---KYTRP--LFQTNGRV--------------- 196
G + GP P K P L Q+ R+
Sbjct: 373 G------AKNLVVSGP--PQSLYGLNLTLRKAKAPSGLDQS--RIPFSERKLKFSNRFLP 422
Query: 197 --------YAVSASAEIVAYATVKIVGW-GEENGRPYWTIVSTF-GEQFGDKGTIKILRG 246
V AS I V + ++ P + TF G D LR
Sbjct: 423 VASPFHSHLLVPASDLINKDLVKNNVSFNAKDIQIP---VYDTFDGS---D------LRV 470
Query: 247 RNEAIIESLVNGAL--PKD---------NYGVEFG--EESG-----ERLSEEFGVRAESS 288
+ +I E +V+ + P + ++FG SG R + GVR +
Sbjct: 471 LSGSISERIVDCIIRLPVKWETTTQFKATHILDFGPGGASGLGVLTHRNKDGTGVRVIVA 530
Query: 289 EEFRENGEEE 298
N +++
Sbjct: 531 GTLDINPDDD 540
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 38.3 bits (89), Expect = 0.001
Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 211 VKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V IVG+G YW + +++G ++G G I I R N
Sbjct: 164 VLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIRRNTN 202
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 35.7 bits (83), Expect = 0.011
Identities = 11/49 (22%), Positives = 22/49 (44%), Gaps = 10/49 (20%)
Query: 211 VKIVGWGEEN----------GRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V +VG+G ++ Y+ I +++G +G+ G I + N
Sbjct: 178 VILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENG 226
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 34.9 bits (81), Expect = 0.017
Identities = 12/48 (25%), Positives = 24/48 (50%), Gaps = 10/48 (20%)
Query: 211 VKIVGWGEENG----------RPYWTIVSTFGEQFGDKGTIKILRGRN 248
V +VG+G + Y+ I +++G+Q+G++G I I +
Sbjct: 176 VMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDES 223
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 34.1 bits (78), Expect = 0.041
Identities = 7/39 (17%), Positives = 17/39 (43%), Gaps = 2/39 (5%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V VG+ ++ ++ I +++G G+ G +
Sbjct: 239 VLCVGY--DDEIRHFRIRNSWGNNVGEDGYFWMPYEYIS 275
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 30.9 bits (71), Expect = 0.39
Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G + Y I +++G +G+ G I+I RG
Sbjct: 161 VAAVGYGND----YILIKNSWGTGWGEGGYIRIKRGSG 194
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 30.5 bits (70), Expect = 0.46
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G Y I +++G +G+ G I+I RG
Sbjct: 161 VAAVGYGPN----YILIKNSWGTGWGENGYIRIKRGTG 194
>1rlj_A NRDI protein; flavoprotein, FMN, thioredoxin, alpha/beta/alpha
sandwich, structural genomics, PSI, protein structure
initiative; HET: FMN; 2.00A {Bacillus subtilis} SCOP:
c.23.5.7
Length = 139
Score = 29.1 bits (65), Expect = 0.89
Identities = 11/81 (13%), Positives = 26/81 (32%), Gaps = 4/81 (4%)
Query: 203 AEIVAYATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 262
+ ++ V + P+ + T T L ++ +G
Sbjct: 26 VNKTGFQQIRKVDEMDHVDTPFVLVTYTTNFGQVPASTQSFLEKYAHLLLGVAASGNK-- 83
Query: 263 DNYGVEFGEESGERLSEEFGV 283
+G F +S + +S ++ V
Sbjct: 84 -VWGDNFA-KSADTISRQYQV 102
>3tri_A Pyrroline-5-carboxylate reductase; amino acid biosynthesis,
oxidoreductase; HET: NAP; 2.50A {Coxiella burnetii}
Length = 280
Score = 29.0 bits (66), Expect = 1.8
Identities = 5/41 (12%), Positives = 11/41 (26%), Gaps = 1/41 (2%)
Query: 249 EAIIESLVNGALPKDN-YGVEFGEESGERLSEEFGVRAESS 288
I+ L+ + + + E+ GV
Sbjct: 16 RNIVVGLIANGYDPNRICVTNRSLDKLDFFKEKCGVHTTQD 56
>2g81_I BTCI, bowman-BIRK type SEED trypsin and chymotrypsin inhibitor;
proteinase inhibitor, protein structure, bowman-BIRK
inhibitor; HET: P6G PGE; 1.55A {Vigna unguiculata} SCOP:
g.3.13.1 PDB: 1h34_A 1tab_I 2r33_A 1bbi_A 2bbi_A 1d6r_I
1k9b_A 1pi2_A
Length = 83
Score = 26.9 bits (59), Expect = 2.4
Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 99 TGGAHHSNTGCQPVSFPPCNHANYTTSEP-ECKTLATPQPKCHTRC 143
+G S S P C+ T S P +C+ CH+ C
Sbjct: 1 SGHHEDSTDEASESSKPCCDRCECTKSIPPQCRCSDVRLNSCHSAC 46
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 28.2 bits (64), Expect = 2.5
Identities = 12/38 (31%), Positives = 21/38 (55%), Gaps = 4/38 (10%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 248
V VG+G+ Y + +++G +G+KG I+I R
Sbjct: 161 VTAVGYGKT----YLLLKNSWGPNWGEKGYIRIKRASG 194
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 28.3 bits (62), Expect = 3.7
Identities = 18/125 (14%), Positives = 35/125 (28%), Gaps = 4/125 (3%)
Query: 142 RCTNDNYGRGFFQDKYQINGLGLYFDPHFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSA 201
G D ++ L H+ P + + R A
Sbjct: 252 ESGFTRDGVAVMPDDEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDN 311
Query: 202 SAEIVAYATVKIVGWG-EENGRPYWTIVSTFGEQFGDKGTIKILRG-RNEAIIESLVN-G 258
+ I G ++ G Y+ + +++G G + + +V+
Sbjct: 312 YETTDDHGMQ-IYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYASKAFVRYKTMNIVVHKD 370
Query: 259 ALPKD 263
ALPK
Sbjct: 371 ALPKA 375
>3psh_A Protein HI_1472; substrate binding protein, periplasmic binding
protein, MOLY binding protein, metal transport; 1.50A
{Haemophilus influenzae} PDB: 3psa_A
Length = 326
Score = 28.1 bits (63), Expect = 3.8
Identities = 10/63 (15%), Positives = 18/63 (28%), Gaps = 4/63 (6%)
Query: 195 RVYAVSASA-EIVAY--ATVKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-EA 250
R + I AT +IVG + + + L N E+
Sbjct: 20 RAVVLQHQTLNIAVQLDATKQIVGVLSNWKKQLGKNYVRLAPELENMAMPGDLNSVNIES 79
Query: 251 IIE 253
++
Sbjct: 80 LLA 82
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
vesicle, membrane, Ca structural protein; 7.94A {Bos
taurus}
Length = 190
Score = 27.4 bits (60), Expect = 4.1
Identities = 8/28 (28%), Positives = 14/28 (50%)
Query: 271 EESGERLSEEFGVRAESSEEFRENGEEE 298
EE +RL E +E+RE +++
Sbjct: 92 EEQRKRLQELDAASKVMEQEWREKAKKD 119
>2z9v_A Aspartate aminotransferase; pyridoxamine, pyruvate; HET: PXM; 1.70A
{Mesorhizobium loti} PDB: 2z9u_A* 2z9w_A* 2z9x_A*
Length = 392
Score = 27.6 bits (62), Expect = 5.1
Identities = 2/35 (5%), Positives = 10/35 (28%), Gaps = 1/35 (2%)
Query: 169 HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSASA 203
+ P + + K + + + + +
Sbjct: 35 DYDPAFQLLYEKVVDKA-QKAMRLSNKPVILHGEP 68
>1iug_A Putative aspartate aminotransferase; wild type,
pyridoxal-5'-phosphate form, riken structural
genomics/proteomics initiative, RSGI; HET: LLP; 2.20A
{Thermus thermophilus} SCOP: c.67.1.3
Length = 352
Score = 27.6 bits (62), Expect = 5.3
Identities = 7/34 (20%), Positives = 11/34 (32%), Gaps = 1/34 (2%)
Query: 169 HFGPFWPAFWRSFCTKYTRPLFQTNGRVYAVSAS 202
H + R F+T G V ++ S
Sbjct: 27 HRTEAAREVFLK-ARGLLREAFRTEGEVLILTGS 59
>1yb4_A Tartronic semialdehyde reductase; structural genomics,
oxidoreductase, salmonella typhimurium LT2, PSI,
protein ST initiative; 2.40A {Salmonella typhimurium}
Length = 295
Score = 27.5 bits (62), Expect = 5.8
Identities = 8/39 (20%), Positives = 12/39 (30%)
Query: 1 MGGGTSSRIRDMSYGATVYNRRPYALSCIEARAVATATP 39
MG + + + V P A + AV T
Sbjct: 14 MGSPMAINLARAGHQLHVTTIGPVADELLSLGAVNVETA 52
>2fyx_A Transposase, putative; structural genomics, joint center for
structural genomics, J protein structure initiative,
PSI-2; 1.90A {Deinococcus radiodurans} SCOP: d.58.57.1
PDB: 2xqc_A 2xm3_A 2xma_A 2xo6_A
Length = 143
Score = 26.6 bits (57), Expect = 6.1
Identities = 5/47 (10%), Positives = 13/47 (27%)
Query: 181 FCTKYTRPLFQTNGRVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 227
+ TKY + +I +++V + +
Sbjct: 29 WATKYRHQVLVDEVADGLKDILRDIATQNGLELVALEVMPDYVHLLL 75
>3isl_A Purine catabolism protein PUCG; pyridoxalphosphate, PLP dependent
enzymes, purine metabolism transaminases,
aminotransferases; HET: PLP; 2.06A {Bacillus subtilis}
Length = 416
Score = 27.3 bits (61), Expect = 7.0
Identities = 9/36 (25%), Positives = 14/36 (38%), Gaps = 2/36 (5%)
Query: 169 HFGPFWPAFWRSFCTKYTRPLFQT-NGRVYAVSASA 203
F P + + R LFQT N Y + ++
Sbjct: 37 QFDPAFTGIMNE-TMEMLRELFQTKNRWAYPIDGTS 71
>1wvq_A Hypothetical protein PAE2307; phosphorylated histidine, structural
genomics, unknown funct; HET: NEP; 1.45A {Pyrobaculum
aerophilum} PDB: 2gl0_A* 2jb7_A*
Length = 167
Score = 26.7 bits (59), Expect = 8.1
Identities = 12/29 (41%), Positives = 19/29 (65%), Gaps = 1/29 (3%)
Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERL 277
E + E+LV ++P +G+ F E SG+RL
Sbjct: 33 EDLYEALVT-SVPGVKFGIAFCEASGKRL 60
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 26.8 bits (60), Expect = 8.2
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 211 VKIVGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 249
V IVG+ YW + +++G +G+KG I++LR
Sbjct: 159 VTIVGYQAN----YWIVRNSWGRYWGEKGYIRMLRVGGC 193
>2ekm_A Hypothetical protein ST1511; NPPSFA, national project on protein ST
and functional analyses, riken structural
genomics/proteomi initiative; 2.06A {Sulfolobus
tokodaii}
Length = 162
Score = 26.2 bits (58), Expect = 8.6
Identities = 11/29 (37%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERL 277
E + E+L + + P +G+ F E SG+RL
Sbjct: 30 EDLYETLAS-SSPHLKFGIAFCEASGKRL 57
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 27.1 bits (59), Expect = 8.8
Identities = 11/45 (24%), Positives = 18/45 (40%), Gaps = 14/45 (31%)
Query: 38 TPLAFAVCRSSKMHVECTSFRFIAGVKQRCAWLVSRWMTIWVCSS 82
T +A VC S K V+C + + W+ + C+S
Sbjct: 164 TWVALDVCLSYK--VQC---KMDFKI---------FWLNLKNCNS 194
>1vgg_A Conserved hypothetical protein TT1634; thermus thermophilus HB8,
structural genomics, riken structural
genomics/proteomics initiative, RSGI; 1.75A {Thermus
thermophilus} SCOP: d.256.1.1
Length = 161
Score = 26.2 bits (58), Expect = 9.1
Identities = 13/29 (44%), Positives = 19/29 (65%), Gaps = 1/29 (3%)
Query: 249 EAIIESLVNGALPKDNYGVEFGEESGERL 277
E + E+LV A+P +G+ F E SG+RL
Sbjct: 28 EDLHEALVT-AVPGIRFGLAFSEASGKRL 55
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.320 0.134 0.438
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,774,363
Number of extensions: 280172
Number of successful extensions: 791
Number of sequences better than 10.0: 1
Number of HSP's gapped: 772
Number of HSP's successfully gapped: 71
Length of query: 298
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 205
Effective length of database: 4,105,140
Effective search space: 841553700
Effective search space used: 841553700
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.5 bits)