RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy15353
(344 letters)
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 317 bits (814), Expect = e-108
Identities = 122/329 (37%), Positives = 165/329 (50%), Gaps = 17/329 (5%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRK 74
R + SD ++ +N+ TW AG NF N+ YL++ F +P
Sbjct: 4 RPSFHPLSDELVNYVNKRNTTWQAGHNF-YNVDMSYLKRLCGT----FLGGPKPPQRVMF 58
Query: 75 TYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQN 134
T D +P FDAREQWP C TI + D G+C + F AV A SDR CI + +
Sbjct: 59 TED----LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVS 114
Query: 135 RPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPC 194
+S E + +CC C+ G WNF ++G V+GG Y GC+P +I PC
Sbjct: 115 VEVSAEDLLTCCGSMC---GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPC 171
Query: 195 SHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEIL 254
HH + P PK C C P Y + QDKH +Y V ++E I EI
Sbjct: 172 EHHVNGSRPPCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIY 228
Query: 255 AHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGP 314
+GP F++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W
Sbjct: 229 KNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNT 286
Query: 315 HWGDRGTVKILRGKYECAFEYLIAAGKPK 343
WGD G KILRG+ C E + AG P+
Sbjct: 287 DWGDNGFFKILRGQDHCGIESEVVAGIPR 315
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 293 bits (751), Expect = 2e-98
Identities = 104/337 (30%), Positives = 145/337 (43%), Gaps = 31/337 (9%)
Query: 13 LVRGELYKFSDAYIDQINREAN-TWTAGRN-FPANLSEEYLRQFLIADAKYFDQSDRPLP 70
LV + S A++D++NR W A + N++ ++ ++ +
Sbjct: 2 LVAEDAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGV---IKKNNNASIL 58
Query: 71 GDRKTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACA---APHIFAAVGAFSDRRCI 127
R+ + E A +P FD+ E WPNC TI + D AC A AA A SDR C
Sbjct: 59 PKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWA---VAAASAMSDRFCT 115
Query: 128 KSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQ 187
G Q+ +S + +CC C C+ G R W + G V+ CQ
Sbjct: 116 MG-GVQDVHISAGDLLACCSDC----GDGCNGGDPDRAWAYFSSTGLVSDY-------CQ 163
Query: 188 PSTISPCSHHGSAPT-LPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
P CSHH + P C KC C +PT +R+ +Y + E
Sbjct: 164 PYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIP----VVNYRSWTSYALQ-GE 218
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
D +E+ GP F +Y+DF Y SGVY H S L H+ +L+GWGT NG PYW
Sbjct: 219 DDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHVSGQYLGG--HAVRLVGWGTSNGVPYW 276
Query: 307 LVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKPK 343
+ N+W WG G I RG EC E +AG P
Sbjct: 277 KIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIPL 313
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 286 bits (733), Expect = 2e-96
Identities = 104/260 (40%), Positives = 138/260 (53%), Gaps = 8/260 (3%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVA 143
+P FDAREQWP C TI + D G+C + F AV A SDR CI + + +S E +
Sbjct: 7 LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66
Query: 144 SCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSAPTL 203
+CC C+ G WNF ++G V+GG Y GC+P +I PC H +
Sbjct: 67 TCCGSMC---GDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARP 123
Query: 204 PSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF 263
P PK C C P Y + QDKH +Y V ++E I EI +GP F
Sbjct: 124 PCTGEGDTPK--CSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 180
Query: 264 ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVK 323
++Y DF YKSGVY+H + + H+ +++GWG ENGTPYWLV N+W WGD G K
Sbjct: 181 SVYSDFLLYKSGVYQHVTGEMMGG--HAIRILGWGVENGTPYWLVANSWNTDWGDNGFFK 238
Query: 324 ILRGKYECAFEYLIAAGKPK 343
ILRG+ C E + AG P+
Sbjct: 239 ILRGQDHCGIESEVVAGIPR 258
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 284 bits (728), Expect = 8e-96
Identities = 101/262 (38%), Positives = 143/262 (54%), Gaps = 14/262 (5%)
Query: 84 VPDRFDAREQWPNCGTIGHVPDTGACA---APHIFAAVGAFSDRRCIKSKGQQNRPLSTE 140
+P FD+R++WP C +I + D C A F AV A SDR CI+S G+QN LS
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWA---FGAVEAMSDRSCIQSGGKQNVELSAV 59
Query: 141 YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGCQPSTISPCSHHGSA 200
+ SCC+ C C G + W++ K G VTG + GC+P C HH
Sbjct: 60 DLLSCCESC----GLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKG 115
Query: 201 PTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTT 260
P C ++ +C C Y + QDKHR +Y V ++E AI+KEI+ +GP
Sbjct: 116 -KYPPCGSKIYKTPRCKQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 173
Query: 261 ATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRG 320
A F +Y+DF +YKSG+YKH + L H+ ++IGWG EN PYWL+ N+W WG+ G
Sbjct: 174 AGFTVYEDFLNYKSGIYKHITGETLGG--HAIRIIGWGVENKAPYWLIANSWNEDWGENG 231
Query: 321 TVKILRGKYECAFEYLIAAGKP 342
+I+RG+ EC+ E + AG+
Sbjct: 232 YFRIVRGRDECSIESEVTAGRI 253
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 210 bits (537), Expect = 6e-65
Identities = 80/339 (23%), Positives = 127/339 (37%), Gaps = 54/339 (15%)
Query: 15 RGELYKFSDAYIDQINREANTWTAGRNF-PANLSEEYLRQFLIADAKYFDQSDRPLPGDR 73
LYK+ ++ IN +WTA L+ + + + + RP P
Sbjct: 140 SNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGG---HSRKIPRPKPAPL 196
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQ 133
+ +P +D R + V + +C + + FA++G R I + Q
Sbjct: 197 TAEIQQKILFLPTSWDWRNVHG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQ 255
Query: 134 NRPLSTEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGDY---GDRTGCQPS 189
LS + V SC + + C G + + G V + G + C+
Sbjct: 256 TPILSPQEVVSCSQY-----AQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM- 309
Query: 190 TISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNEDAI 249
R + + H ++ NE +
Sbjct: 310 --------------------------------KEDCFRYYSSEYHYVG-GFYGGCNEALM 336
Query: 250 KKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL----HSGKLIGWGTEN--GT 303
K E++ HGP F +YDDF HYK G+Y HT N H+ L+G+GT++ G
Sbjct: 337 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM 396
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYECAFEYLIAAGKP 342
YW+V N+WG WG+ G +I RG ECA E + A P
Sbjct: 397 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 191 bits (487), Expect = 3e-59
Identities = 55/265 (20%), Positives = 90/265 (33%), Gaps = 41/265 (15%)
Query: 74 KTYDPEYSATVPDRFDAREQWPNCGTIGHVPD------TGACAAPHIFAAVGAFSDRRCI 127
+ ++ A +P +D R + G+C A A+ A +DR I
Sbjct: 26 RPHEYLSPADLPKSWDWRNVDG-VNYASITRNQHIPQYCGSCWA---HASTSAMADRINI 81
Query: 128 KSKGQQNRP-LSTEYVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVTGGDYGDRTGC 186
K KG LS + V C + SC G+ W++ H+ G C
Sbjct: 82 KRKGAWPSTLLSVQNVIDCG------NAGSCEGGNDLSVWDYAHQHGIPDET-------C 128
Query: 187 QPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLTYWVDDNE 246
C + ++ Y
Sbjct: 129 NNYQAKDQECDKFNQC---------GTCNEFKEC-HAIRNYTLWRVGD-----YGSLSGR 173
Query: 247 DAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGTPYW 306
+ + EI A+GP + + +Y G+Y + N H + GWG +GT YW
Sbjct: 174 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN--HVVSVAGWGISDGTEYW 231
Query: 307 LVINTWGPHWGDRGTVKILRGKYEC 331
+V N+WG WG+RG ++I+ Y+
Sbjct: 232 IVRNSWGEPWGERGWLRIVTSTYKD 256
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 123 bits (311), Expect = 3e-33
Identities = 42/267 (15%), Positives = 86/267 (32%), Gaps = 46/267 (17%)
Query: 68 PLPGDRKTYDPEYS--ATVPDRFDAREQWP-----NCGTIGHVPDTGACAAPHIFAAVGA 120
+Y PE S A +P + D + G+ C A A A
Sbjct: 39 IADIRDFSYTPEKSVIAALPPKVDLTPPFQVYDQGRIGS---------CTA---NALAAA 86
Query: 121 FSDRRCIKSKGQQNRPLSTEYVASCCKICRYDDNKSCSHGSVFRT-WNFLHKRGSVTGGD 179
R + + P ++ + + + + + G++ R LHK G +
Sbjct: 87 IQFERIHDKQSPEFIPSR-LFIYYNER--KIEGHVNYDSGAMIRDGIKVLHKLGVCPEKE 143
Query: 180 YGDRTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKHRTTLT 239
+ P +P P K P +C+ ++ T
Sbjct: 144 W-------PYGDTPADPRTEEFP-PGAPASKKPSDQCYK-----------DAQNYKITEY 184
Query: 240 YWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL--HSGKLIGW 297
V + D +K + P F++Y+ + S + K + H+ +G+
Sbjct: 185 SRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGY 244
Query: 298 GTENGTPYWLVINTWGPHWGDRGTVKI 324
++ ++ + N+WG + G+ G +
Sbjct: 245 --DDEIRHFRIRNSWGNNVGEDGYFWM 269
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 94.3 bits (235), Expect = 1e-22
Identities = 53/247 (21%), Positives = 81/247 (32%), Gaps = 31/247 (12%)
Query: 94 WPNCGTIGHVPDTGACAAPHIFAAVGAFSDRRCIKSKGQQNRPLSTEYVASCCKICRYDD 153
NC + V D G C IFA+ RC+K G + +S YVA+C K +
Sbjct: 16 ENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMK--GYEPTKISALYVANCYKG---EH 70
Query: 154 NKSCSHGSVFRT-WNFLHKRGSV-TGGDY---GDRTGCQPSTISPCSHHGSAPTLPSCEN 208
C GS + G + +Y + G Q P +
Sbjct: 71 KDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQ------------CPKVEDHWM 118
Query: 209 QKVPKLKCHTRCTNP-TYGRGFFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATF-ALY 266
K P + + +D IK E++ G A A
Sbjct: 119 NLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAEN 178
Query: 267 DDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-----NGTPYWLVINTWGPHWGDRGT 321
Y + K+ + H+ ++G+G YW+V N+WGP+WGD G
Sbjct: 179 VMGYEFSGKKVKNLCGDDTAD--HAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGY 236
Query: 322 VKILRGK 328
K+
Sbjct: 237 FKVDMYG 243
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 80.2 bits (199), Expect = 7e-18
Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
+ +E I + +GP A + Y GV + +L+ H L+G+
Sbjct: 118 LPQDEAQIAAWLAVNGPVAVAVDA--SSWMTYTGGVMTSCVSEQLD---HGVLLVGYNDS 172
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
PYW++ N+W WG+ G ++I +G +C
Sbjct: 173 AAVPYWIIKNSWTTQWGEEGYIRIAKGSNQC 203
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 79.8 bits (198), Expect = 1e-17
Identities = 34/88 (38%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGTENGT 303
+E+A+ + + + P + F + +DF Y+ G+Y TS K + + H+ +G+G ENG
Sbjct: 120 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI 179
Query: 304 PYWLVINTWGPHWGDRGTVKILRGKYEC 331
PYW+V N+WGP WG G I RGK C
Sbjct: 180 PYWIVKNSWGPQWGMNGYFLIERGKNMC 207
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 78.3 bits (194), Expect = 3e-17
Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 4/92 (4%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYL-HSGKLIGWGT 299
+ NE + + GP + A Y+ G+ + + H+ L+G+G
Sbjct: 113 LSQNEQKLAAWLAKRGPISVAINA--FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQ 170
Query: 300 ENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
+ P+W + N+WG WG++G + RG C
Sbjct: 171 RSDVPFWAIKNSWGTDWGEKGYYYLHRGSGAC 202
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 78.8 bits (195), Expect = 1e-16
Identities = 30/87 (34%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
+E+ + + GP F D F Y GVY + K H+ ++G+G ENG
Sbjct: 234 DENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFT---HAVLIVGYGNENG 290
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGKY 329
YWLV N+WG WG G KI R
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIARNAN 317
Score = 33.0 bits (76), Expect = 0.12
Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 9/101 (8%)
Query: 25 YIDQINREAN----TWTAGRNFPANLS-EEYLRQFLIADAKYFDQSDRPLPGDRKTYDPE 79
++ N + ++T G N +++ EE + R+
Sbjct: 52 TFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLN 111
Query: 80 YSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGA 120
S P FD R+Q G + V + G+C + F++ GA
Sbjct: 112 ASVRYPASFDWRDQ----GMVSPVKNQGSCGSSWAFSSTGA 148
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 76.0 bits (188), Expect = 2e-16
Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 5/85 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE + +A P + + F YK G+++ K++ H+ +G+G G
Sbjct: 117 NEGNLLN-AIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD---HAVTAVGYGKSGGK 172
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
Y L+ N+WG WG++G ++I R
Sbjct: 173 GYILIKNSWGTAWGEKGYIRIKRAP 197
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 74.1 bits (183), Expect = 1e-15
Identities = 28/85 (32%), Positives = 47/85 (55%), Gaps = 5/85 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE A++ +A+ P + A +F HY SG++ ++ H+ ++G+GTE G
Sbjct: 120 NEWALQT-AVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVD---HAVTIVGYGTEGGI 175
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
YW+V N+WG WG+ G ++I R
Sbjct: 176 DYWIVKNSWGTTWGEEGYMRIQRNV 200
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 73.7 bits (182), Expect = 1e-15
Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 8/103 (7%)
Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKL 285
++ + +NE A++ +A P + T A F HY SG++
Sbjct: 98 PYRLRVVSINGFQRVTRNNESALQS-AVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQ 156
Query: 286 ENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGK 328
H ++G+GT++G YW+V N+WG +WG++G + + R
Sbjct: 157 N---HGVVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIWMERNV 196
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 73.7 bits (182), Expect = 1e-15
Identities = 28/85 (32%), Positives = 49/85 (57%), Gaps = 5/85 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE +++K +A+ P + T A DF Y+SG++ + N H+ ++G+GTEN
Sbjct: 119 NEQSLQK-AVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISAN---HALTVVGYGTENDK 174
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
+W+V N+WG +WG+ G ++ R
Sbjct: 175 DFWIVKNSWGKNWGESGYIRAERNI 199
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 73.7 bits (182), Expect = 1e-15
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTEN 301
NE A+K+ + GP + A F Y GVY ++ L H+ +G+G +
Sbjct: 117 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLN---HAVLAVGYGIQK 173
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
G +W++ N+WG +WG++G + + R K
Sbjct: 174 GNKHWIIKNSWGENWGNKGYILMARNK 200
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 75.0 bits (185), Expect = 2e-15
Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
+E +K + A GP + DF Y+SG+Y+ S ++ H+ +G+GT+ G
Sbjct: 209 SEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQSQTCSPLRVN---HAVLAVGYGTQGG 265
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
T YW+V N+WG WG+RG ++++R +
Sbjct: 266 TDYWIVKNSWGLSWGERGYIRMVRNR 291
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 73.3 bits (181), Expect = 2e-15
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
E + LA+ P + A F YKSGV+ KL+ H+ +G+GT +G
Sbjct: 117 CETSFLG-ALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLD---HAVTAVGYGTSDGK 172
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
Y ++ N+WGP+WG++G +++ R
Sbjct: 173 NYIIIKNSWGPNWGEKGYMRLKRQS 197
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 73.3 bits (181), Expect = 2e-15
Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 7/108 (6%)
Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEI-LAHGPTT--ATFALYDDFYHYKSGVYKHTSNA 283
+ R ++ + N + I++ + H D F HY N
Sbjct: 105 RPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNG 164
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
NY H+ ++G+ G YW+V N+W +WGD G
Sbjct: 165 YQPNY-HAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLM 211
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 75.0 bits (185), Expect = 2e-15
Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE + I A P + + F YK G+++ K++ + +G+G G
Sbjct: 223 NEGNLLNAI-AKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD---GAVTAVGYGKSGGK 278
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
Y L+ N+WG WG++G ++I R
Sbjct: 279 GYILIKNSWGTAWGEKGYIRIKRAP 303
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 74.2 bits (183), Expect = 3e-15
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 6/87 (6%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWGTEN 301
NE A+K+ + GP + A F Y GVY + ++ L H+ +G+G +
Sbjct: 216 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLN---HAVLAVGYGIQK 272
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
G +W++ N+WG +WG++G + + R K
Sbjct: 273 GNKHWIIKNSWGENWGNKGYILMARNK 299
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 73.8 bits (182), Expect = 4e-15
Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 5/86 (5%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHT--SNAKLENYLHSGKLIGWGTENG 302
+E+++ + GP D+ Y G++ + + L H ++G+G++NG
Sbjct: 232 DENSLADAVGQAGPVAVAIDATDELQFYSGGLFYDQTCNQSDLN---HGVLVVGYGSDNG 288
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
YW++ N+WG WG+ G + +R
Sbjct: 289 QDYWILKNSWGSGWGESGYWRQVRNY 314
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 72.6 bits (179), Expect = 5e-15
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 4/86 (4%)
Query: 244 DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENG 302
+ E +++ + P + + A DF+ Y G+Y + + H ++G+G+E+G
Sbjct: 124 EAESSLQS-FVLEQPISVSIDA--KDFHFYSGGIYDGGNCSSPYGINHFVLIVGYGSEDG 180
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
YW+ N+WG WG G ++I R
Sbjct: 181 VDYWIAKNSWGEDWGIDGYIRIQRNT 206
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 71.0 bits (175), Expect = 1e-14
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTENG 302
ED +K+ + GP + A + F+ Y+SGVY S + H ++G+G NG
Sbjct: 121 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNG 177
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
YWLV N+WG ++G+ G +++ R K
Sbjct: 178 KEYWLVKNSWGHNFGEEGYIRMARNK 203
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 71.9 bits (177), Expect = 2e-14
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTS-NAKLENYLHSGKLIGWGTENG 302
ED +K+ + GP + A + F+ Y+SGVY S + H ++G+G NG
Sbjct: 218 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVN---HGVLVVGYGDLNG 274
Query: 303 TPYWLVINTWGPHWGDRGTVKILRGK 328
YWLV N+WG ++G+ G +++ R K
Sbjct: 275 KEYWLVKNSWGHNFGEEGYIRMARNK 300
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 71.5 bits (176), Expect = 3e-14
Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 7/108 (6%)
Query: 230 FQDKHRTTLTYWVD---DNEDAIKKEI-LAHGPTT--ATFALYDDFYHYKSGVYKHTSNA 283
+ R ++ + N + I++ + H D F HY N
Sbjct: 185 RPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNG 244
Query: 284 KLENYLHSGKLIGWGTENGTPYWLVINTWGPHWGDRGTVKILRGKYEC 331
NY H+ ++G+ G YW+V N+W +WGD G
Sbjct: 245 YQPNY-HAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLM 291
Score = 29.1 bits (66), Expect = 2.2
Identities = 17/100 (17%), Positives = 33/100 (33%), Gaps = 19/100 (19%)
Query: 25 YIDQINREANTWTAGRNFPANLSEEYLRQFLIADAKYFDQSDRPLPGDRKTYDPEYSATV 84
Y+ N + ++LS + + + A+ F+ + +T +
Sbjct: 38 YVQSNGGAIN------HL-SDLSLDEFKNRFLMSAEAFEHLKTQFDLNAETNACSINGNA 90
Query: 85 PDRFDAREQWPNCGTIGHVPDTGAC----AAPHIFAAVGA 120
P D R+ T+ + G C A F+ V A
Sbjct: 91 PAEIDLRQM----RTVTPIRMQGGCGSAWA----FSGVAA 122
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 70.3 bits (173), Expect = 3e-14
Identities = 26/108 (24%), Positives = 48/108 (44%), Gaps = 13/108 (12%)
Query: 231 QDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLH 290
+ R T+ +V +D K+ + GP + + A DDF Y+ G Y A H
Sbjct: 120 RCNERYTIKSYVSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPN---H 176
Query: 291 SGKLIGWGTEN----------GTPYWLVINTWGPHWGDRGTVKILRGK 328
+ L+G+G ++ Y+++ N+WG WG+ G + + +
Sbjct: 177 AVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDE 224
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 69.9 bits (172), Expect = 4e-14
Identities = 24/110 (21%), Positives = 51/110 (46%), Gaps = 13/110 (11%)
Query: 229 FFQDKHRTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENY 288
+ + + ++ ++ +K+ + GP + + A+ DDF YK G++ +L
Sbjct: 116 IDRCTEKYGIKNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQLN-- 173
Query: 289 LHSGKLIGWGTENG----------TPYWLVINTWGPHWGDRGTVKILRGK 328
H+ L+G+G + Y+++ N+WG WG+RG + I +
Sbjct: 174 -HAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE 222
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 68.3 bits (168), Expect = 1e-13
Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 10/94 (10%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWG 298
+ E A+ K + GP + A ++ F YK G+Y S+ ++ H ++G+G
Sbjct: 115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---HGVLVVGYG 171
Query: 299 TE----NGTPYWLVINTWGPHWGDRGTVKILRGK 328
E + YWLV N+WG WG G VK+ + +
Sbjct: 172 FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 205
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 67.2 bits (165), Expect = 3e-13
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 6/85 (7%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE-NG 302
+E+A+ K +A+ P + A DF Y GV+ + +L+ H ++G+GT +G
Sbjct: 120 DENALLK-AVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELD---HGVAIVGYGTTIDG 175
Query: 303 TPYWLVINTWGPHWGDRGTVKILRG 327
T YW V N+WGP WG++G +++ RG
Sbjct: 176 TKYWTVKNSWGPEWGEKGYIRMERG 200
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 66.4 bits (163), Expect = 5e-13
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 7/87 (8%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGT--EN 301
+E A+K LA P + A F Y GV+ + L+ H L+G+GT E+
Sbjct: 125 SEAAMKA-ALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLD---HGVLLVGYGTDKES 180
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
+W++ N+WG WG G + + K
Sbjct: 181 KKDFWIMKNSWGTGWGRDGYMYMAMHK 207
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 67.7 bits (166), Expect = 5e-13
Identities = 28/94 (29%), Positives = 46/94 (48%), Gaps = 10/94 (10%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVY--KHTSNAKLENYLHSGKLIGWG 298
+ E A+ K + GP + A ++ F YK G+Y S+ ++ H ++G+G
Sbjct: 211 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---HGVLVVGYG 267
Query: 299 TE----NGTPYWLVINTWGPHWGDRGTVKILRGK 328
E + YWLV N+WG WG G VK+ + +
Sbjct: 268 FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 301
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 66.5 bits (163), Expect = 8e-13
Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 6/87 (6%)
Query: 244 DNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWG-TEN 301
++E+ + + +A+ P + A F Y GV+ +L+ H ++G+G E+
Sbjct: 124 NSEEDLAR-AVANQPVSVAVEASGKAFMFYSEGVFTGECGTELD---HGVAVVGYGVAED 179
Query: 302 GTPYWLVINTWGPHWGDRGTVKILRGK 328
G YW V N+WGP WG++G +++ +
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDS 206
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 64.0 bits (157), Expect = 3e-12
Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 9/85 (10%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE A+ +A+ P + A DF Y+ G++ K++ H+ +G+G
Sbjct: 117 NEGALLY-SIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD---HAVAAVGYGPN--- 169
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
Y L+ N+WG WG+ G ++I RG
Sbjct: 170 -YILIKNSWGTGWGENGYIRIKRGT 193
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 62.9 bits (154), Expect = 7e-12
Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 9/85 (10%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
NE A+ + +A P + A F +Y+ G++ ++ H+ +G+G +
Sbjct: 117 NEQALIQ-RIAIQPVSIVVEAKGRAFQNYRGGIFAGPCGTSID---HAVAAVGYGND--- 169
Query: 304 PYWLVINTWGPHWGDRGTVKILRGK 328
Y L+ N+WG WG+ G ++I RG
Sbjct: 170 -YILIKNSWGTGWGEGGYIRIKRGS 193
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 61.4 bits (149), Expect = 3e-11
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 5/91 (5%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKS-GVYKHTSNAKLENYL-HSGKLIGWG 298
V ++ A+ + A P + F Y G++ +S + + H+ ++G+G
Sbjct: 112 VPNSSSALLDAV-AKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYG 170
Query: 299 TE-NGTPYWLVINTWGPHWGDRGTVKILRGK 328
+ YW+V N+WG WG G + I R
Sbjct: 171 SNGTNADYWIVKNSWGTEWGIDGYILIRRNT 201
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 60.9 bits (149), Expect = 3e-11
Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 9/84 (10%)
Query: 245 NEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTENGT 303
+E ++ + +A+ P + + F YK G+Y+ + H+ +G+G
Sbjct: 117 DEISLIQ-AIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTD---HAVTAVGYGKT--- 169
Query: 304 PYWLVINTWGPHWGDRGTVKILRG 327
Y L+ N+WGP+WG++G ++I R
Sbjct: 170 -YLLLKNSWGPNWGEKGYIRIKRA 192
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 58.4 bits (142), Expect = 3e-10
Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 8/87 (9%)
Query: 242 VDDNEDAIKKEILAHGPTTATF-ALYDDFYHYKSGVYKHTSNAKLENYLHSGKLIGWGTE 300
V + K+ +A P+T A F Y SG++ KL H ++G+
Sbjct: 111 VPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLN---HGVTIVGYQAN 167
Query: 301 NGTPYWLVINTWGPHWGDRGTVKILRG 327
YW+V N+WG +WG++G +++LR
Sbjct: 168 ----YWIVRNSWGRYWGEKGYIRMLRV 190
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 38.1 bits (88), Expect = 0.005
Identities = 36/222 (16%), Positives = 58/222 (26%), Gaps = 81/222 (36%)
Query: 17 ELYKFSDAYIDQINREANTWT-AGRNFPANLSEEYLRQFLIAD---------AKYFD-QS 65
+LYK S A ++ W A +F F I D +F +
Sbjct: 1634 DLYKTSKAA-----QDV--WNRADNHFKDTYG------FSILDIVINNPVNLTIHFGGEK 1680
Query: 66 DRPLPGDRKTYDPEYSATVPDRFDAREQW-----PNCGTIGHVPDTGACAA-----PHIF 115
+ + R+ Y T+ D E+ + + + G +A P +
Sbjct: 1681 GKRI---RENYSAMIFETIVDGKLKTEKIFKEINEHSTSYTFRSEKGLLSATQFTQPALT 1737
Query: 116 AA-VGAFSDRRCIKSKGQQNRPLST--------EYVASCCKICRYDDNKSCSHGSVFRTW 166
AF +KSKG P EY A V
Sbjct: 1738 LMEKAAF---EDLKSKG--LIPADATFAGHSLGEYAALASL------------ADVM--- 1777
Query: 167 NF------LHKRGS-----VTGGDYGDRT----GCQPSTISP 193
+ + RG V + G P ++
Sbjct: 1778 SIESLVEVVFYRGMTMQVAVPRDELGRSNYGMIAINPGRVAA 1819
Score = 33.5 bits (76), Expect = 0.12
Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 32/95 (33%)
Query: 14 VRGELYKFSDAYIDQINREA-----------NTWTAGRNF-----PANLS--EEYLRQFL 55
+ + Y+++ N N +N P +L LR+
Sbjct: 341 ISNLTQEQVQDYVNKTNSHLPAGKQVEISLVN---GAKNLVVSGPPQSLYGLNLTLRK-A 396
Query: 56 IADAKYFDQSDRPLPGDRKTYDPEYSA-----TVP 85
A + DQS P +RK ++S P
Sbjct: 397 KAPSG-LDQSRIPFS-ERK---LKFSNRFLPVASP 426
Score = 33.5 bits (76), Expect = 0.13
Identities = 51/307 (16%), Positives = 90/307 (29%), Gaps = 100/307 (32%)
Query: 43 PANLSEEYLRQ-FLIADAKYF------DQ--SDRPLPGDRKTYDPEYSATVP--DRFDAR 91
P LS L L+ A +F +Q P P + D E + +F
Sbjct: 8 PLTLSHGSLEHVLLVPTASFFIASQLQEQFNKILPEPTEGFAADDEPTTPAELVGKF--- 64
Query: 92 EQWPNCGTIGHV-----PDTGACAAPHIFAAVGAFSDRRCIKSK----------GQQNRP 136
+G+V P + + F + ++ + +
Sbjct: 65 --------LGYVSSLVEPSKVGQFDQVLNLCLTEF-ENCYLEGNDIHALAAKLLQENDTT 115
Query: 137 LSTE------YVASCCKICRYDDNKSCSHGSVFRTWNFLHKRGSVT-----GG-----DY 180
L Y+ + R D KS S ++FR G+ GG DY
Sbjct: 116 LVKTKELIKNYITARIMAKRPFDKKSNS--ALFRA----VGEGNAQLVAIFGGQGNTDDY 169
Query: 181 GD------RTGCQPSTISPCSHHGSAPTLPSCENQKVPKLKCHTRCTNPTYGRGFFQDKH 234
+ +T + SA TL L T + +G +
Sbjct: 170 FEELRDLYQTY--HVLVGDLIKF-SAETLSE--------LIRTTLDAEKVFTQGL--N-- 214
Query: 235 RTTLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSGKL 294
+ W+++ + K+ L P + L GV +L +Y+ + KL
Sbjct: 215 ---ILEWLENPSNTPDKDYLLSIP--ISCPL--------IGVI------QLAHYVVTAKL 255
Query: 295 IGWGTEN 301
+G+
Sbjct: 256 LGFTPGE 262
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 34.8 bits (79), Expect = 0.041
Identities = 18/102 (17%), Positives = 36/102 (35%), Gaps = 18/102 (17%)
Query: 3 HILVFLLGC-TLV--RGELYKFSDAYI-DQINREANTWTAGRNFPANLSEEYLRQFLIAD 58
F L C L+ R + D ++ T + + L+ + ++ L
Sbjct: 258 AWNAFNLSCKILLTTR-------FKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL--- 307
Query: 59 AKYFDQSDRPLPGDRKTYDPEY----SATVPDRFDAREQWPN 96
KY D + LP + T +P + ++ D + W +
Sbjct: 308 LKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKH 349
Score = 29.1 bits (64), Expect = 2.4
Identities = 12/63 (19%), Positives = 26/63 (41%), Gaps = 5/63 (7%)
Query: 228 GFFQDKHRTTLTYWVDDNEDAIKKEILAHGP-TTATFA--LYDDFYHYKSGVYKHTSNAK 284
D+ ++ L ++D + +E+L P + A + D + + +KH + K
Sbjct: 297 TLTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDN--WKHVNCDK 354
Query: 285 LEN 287
L
Sbjct: 355 LTT 357
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 34.4 bits (78), Expect = 0.048
Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 290 HSGKLIGWGT-ENGTPYWLVINTWGPHWGDRG 320
H ++ G + G Y++V N+WG + G
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNG 349
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
photosynthetic reaction center, peripheral antenna;
HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
Length = 154
Score = 29.9 bits (66), Expect = 0.64
Identities = 7/22 (31%), Positives = 10/22 (45%), Gaps = 3/22 (13%)
Query: 64 QSDRPLPGDRKTYDPEYSATVP 85
Q+ + L K Y + SA P
Sbjct: 20 QALKKLQASLKLYADD-SA--P 38
>3s88_I GP1, GP, envelope glycoprotein; glycosylation, viral membrane,
immune system-viral protein C; HET: NAG; 3.35A {Sudan
ebolavirus} PDB: 3ve0_I*
Length = 298
Score = 29.4 bits (65), Expect = 1.4
Identities = 13/45 (28%), Positives = 18/45 (40%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
P+ S +P D +P C + TG C + F GAF
Sbjct: 100 KPDGSECLPPPPDGVRGFPRCRYVHKAQGTGPCPGDYAFHKDGAF 144
>3csy_I Envelope glycoprotein GP1; glycoprotein-antibody complex, immune
system-viral protein C; HET: NAG BMA MAN; 3.40A {Zaire
ebola virus}
Length = 334
Score = 29.5 bits (65), Expect = 1.7
Identities = 15/45 (33%), Positives = 19/45 (42%)
Query: 77 DPEYSATVPDRFDAREQWPNCGTIGHVPDTGACAAPHIFAAVGAF 121
P+ S +P D +P C + V TG CA F GAF
Sbjct: 100 KPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAF 144
>2e8b_A Probable molybdopterin-guanine dinucleotide biosy protein A;
putative protein, molybdenum cofactor, structural G
NPPSFA; 1.61A {Aquifex aeolicus}
Length = 201
Score = 28.0 bits (63), Expect = 3.7
Identities = 7/55 (12%), Positives = 16/55 (29%), Gaps = 4/55 (7%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG--KLIGW 297
++ + + + H GVY K+E + G ++
Sbjct: 111 KKETVLY--VLENFKEPVSVAKTEKLHTLVGVYSKKLLEKIEERIKKGDYRIWAL 163
>3ejf_A Non-structural protein 3; IBV, coronavirus, X-domain, macro domain,
NSP3, ADRP, hydrolase, ribosomal frameshifting; 1.60A
{Avian infectious bronchitis virus} PDB: 3eke_A* 3ewo_A
3ewp_A*
Length = 176
Score = 27.6 bits (62), Expect = 4.9
Identities = 8/28 (28%), Positives = 10/28 (35%)
Query: 308 VINTWGPHWGDRGTVKILRGKYECAFEY 335
V N GP GD + L Y+
Sbjct: 94 VNNVVGPRHGDNNLHEKLVAAYKNVLVD 121
>1olr_A Endo-beta-1,4-glucanase; hydrolase, cellulase, cellulose
degradation, endoglucanase, glycosyl hydrolase, GH
family 12, humicola grisea CEL12A; HET: PCA; 1.2A
{Humicola grisea} SCOP: b.29.1.11 PDB: 1uu4_A* 1uu5_A*
1uu6_A* 1w2u_A*
Length = 224
Score = 27.6 bits (61), Expect = 5.8
Identities = 8/28 (28%), Positives = 11/28 (39%)
Query: 297 WGTENGTPYWLVINTWGPHWGDRGTVKI 324
+G +G Y L+ N WG G
Sbjct: 9 YGYWSGNGYELLNNLWGKDTATSGWQCT 36
>2xd3_A MALX, maltose/maltodextrin-binding protein; solute-binding protein,
sugar binding protein, virulence, alpha-glucan, sugar
transport; HET: GLC; 2.00A {Streptococcus pneumoniae}
PDB: 2xd2_A*
Length = 416
Score = 27.4 bits (61), Expect = 6.9
Identities = 5/20 (25%), Positives = 10/20 (50%)
Query: 236 TTLTYWVDDNEDAIKKEILA 255
LT +VD+ + +E+
Sbjct: 35 KELTVYVDEGYKSYIEEVAK 54
>1e5k_A Molybdopterin-guanine dinucleotide biosynthesis protein A;
molybdopterin nucleotidyl-transferase,; HET: CIT; 1.35A
{Escherichia coli} SCOP: c.68.1.8 PDB: 1h4e_A* 1hjl_A*
1hjj_A* 1h4c_A* 1h4d_A* 1fr9_A 1frw_A*
Length = 201
Score = 26.9 bits (60), Expect = 8.9
Identities = 6/55 (10%), Positives = 15/55 (27%), Gaps = 2/55 (3%)
Query: 245 NEDAIKKEILAHGPTTATFALYDDFYHYKSGVYKHTSNAKLENYLHSG--KLIGW 297
D + + + H + L YL +G +++ +
Sbjct: 106 PPDLAARLNHQRKDAPVVWVHDGERDHPTIALVNRAIEPLLLEYLQAGERRVMVF 160
>1y08_A Hypothetical protein SPY0861; cysteine proteinase, papain-like fold
with major insertions, hydrolase; 1.93A {Streptococcus
pyogenes} SCOP: d.3.1.12 PDB: 2avw_A 2au1_A
Length = 323
Score = 26.9 bits (58), Expect = 9.7
Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 3/79 (3%)
Query: 237 TLTYWVDDNEDAIKKEILAHGPTTATFALYDDFYHYKSGV--YKHTSNAKLENYLHSGKL 294
L +W D N+D IK+ + H + + K + H ++KL Y
Sbjct: 86 MLHWWFDQNKDQIKRYLEEHPEKQKINFNGEQMFDVKEAIDTKNHQLDSKLFEYFKEKAF 145
Query: 295 IGWGTENGTPY-WLVINTW 312
T++ + VI+ +
Sbjct: 146 PYLSTKHLGVFPDHVIDMF 164
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.320 0.137 0.455
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,571,439
Number of extensions: 329752
Number of successful extensions: 854
Number of sequences better than 10.0: 1
Number of HSP's gapped: 775
Number of HSP's successfully gapped: 62
Length of query: 344
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 250
Effective length of database: 4,077,219
Effective search space: 1019304750
Effective search space used: 1019304750
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (25.9 bits)