RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy282
(233 letters)
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 123 bits (311), Expect = 4e-35
Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 5/128 (3%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
GLE+E DY Y+ C + K K+ +D + + +E + L K GP+SV +N
Sbjct: 80 GLETEDDYSYQGHMQ---SCQFSAEKAKV-YIQDSVELSQNEQKLAAWLAKRGPISVAIN 135
Query: 61 SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 120
+ + Y R CSP+ + HAVLLVGYG++ D+P+W ++NSWG ++G++ +
Sbjct: 136 AFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWGEKGYYYL 195
Query: 121 ERGNNACG 128
RG+ ACG
Sbjct: 196 HRGSGACG 203
Score = 88.7 bits (221), Expect = 8e-22
Identities = 30/79 (37%), Positives = 49/79 (62%)
Query: 138 ETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYW 197
+ + L K GP+SV +N+ + FY R CSP+ + HAVLLVGYG++ D+P+W
Sbjct: 118 QKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFW 177
Query: 198 LVRNSWGPIGPDEGFFKIE 216
++NSWG ++G++ +
Sbjct: 178 AIKNSWGTDWGEKGYYYLH 196
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 117 bits (296), Expect = 6e-33
Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 12/131 (9%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
+ +E YPY + G C V TG L + + L GP++V ++
Sbjct: 82 AVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD-EAQIAAWLAVNGPVAVAVD 140
Query: 61 SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y G T C L H VLLVGY +PYW+++NSW +EG+
Sbjct: 141 ASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGY 193
Query: 118 FKIERGNNACG 128
+I +G+N C
Sbjct: 194 IRIAKGSNQCL 204
Score = 81.8 bits (203), Expect = 2e-19
Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 10/84 (11%)
Query: 136 GSETMKKILYKYGPLSVGLNSHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQD 192
+ L GP++V +++ Y G T C L H VLLVGY
Sbjct: 121 DEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDSA 173
Query: 193 DIPYWLVRNSWGPIGPDEGFFKIE 216
+PYW+++NSW +EG+ +I
Sbjct: 174 AVPYWIIKNSWTTQWGEEGYIRIA 197
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 107 bits (269), Expect = 7e-29
Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 9/131 (6%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLN 60
G+ E Y Y C ++ + ++ + +++ L + + ++V++
Sbjct: 87 GVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIG 143
Query: 61 SDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y+G I HAV +VGY + YW+VRNSW D G+
Sbjct: 144 IKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGY 201
Query: 118 FKIERGNNACG 128
+
Sbjct: 202 GYFAANIDLMM 212
Score = 75.6 bits (187), Expect = 6e-17
Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 7/86 (8%)
Query: 136 GSET--MKKILYKYGPLSVGLN-SHLI--HFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 190
+ + + + ++V + L Y+G I HAV +VGY
Sbjct: 122 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSN 179
Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
+ YW+VRNSW D G+
Sbjct: 180 AQGVDYWIVRNSWDTNWGDNGYGYFA 205
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 110 bits (277), Expect = 1e-28
Identities = 40/136 (29%), Positives = 57/136 (41%), Gaps = 10/136 (7%)
Query: 2 GLESEKDYPYKNANGE-KFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLL 59
GL E +PY + K K + + + +E MK L +GP++V
Sbjct: 291 GLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAF 350
Query: 60 N--SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG--KQDDIPYWLVRNSWGPIG 112
D +H Y D HAVLLVGYG + YW+V+NSWG
Sbjct: 351 EVYDDFLH-YKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGW 409
Query: 113 PDEGFFKIERGNNACG 128
+ G+F+I RG + C
Sbjct: 410 GENGYFRIRRGTDECA 425
Score = 76.8 bits (189), Expect = 2e-16
Identities = 27/85 (31%), Positives = 37/85 (43%), Gaps = 6/85 (7%)
Query: 138 ETMKKILYKYGPLSVGLNSHL-IHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG--KQ 191
MK L +GP++V + Y D HAVLLVGYG
Sbjct: 334 ALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSA 393
Query: 192 DDIPYWLVRNSWGPIGPDEGFFKIE 216
+ YW+V+NSWG + G+F+I
Sbjct: 394 SGMDYWIVKNSWGTGWGENGYFRIR 418
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 105 bits (265), Expect = 3e-28
Identities = 46/138 (33%), Positives = 64/138 (46%), Gaps = 23/138 (16%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G+ E YPYK + C + K F + N E M + + Y P+S
Sbjct: 83 GIMGEDTYPYKGQDD---HCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFE 139
Query: 61 SDLIHD--------YNGTPIRKNDETC--SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
+D Y+ +C +P + HAVL VGYG+++ IPYW+V+NSWGP
Sbjct: 140 VT--NDFLMYRKGIYS-------STSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGP 190
Query: 111 IGPDEGFFKIERGNNACG 128
G+F IERG N CG
Sbjct: 191 QWGMNGYFLIERGKNMCG 208
Score = 78.3 bits (194), Expect = 7e-18
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 12/88 (13%)
Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRK---NDETC--SPYDLGHAVLLVGY 188
E M + + Y P+S ++ Y RK + +C +P + HAVL VGY
Sbjct: 119 NDEEAMVEAVALYNPVSFAFEVTNDFLMY-----RKGIYSSTSCHKTPDKVNHAVLAVGY 173
Query: 189 GKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
G+++ IPYW+V+NSWGP G+F IE
Sbjct: 174 GEENGIPYWIVKNSWGPQWGMNGYFLIE 201
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 105 bits (264), Expect = 1e-27
Identities = 37/155 (23%), Positives = 57/155 (36%), Gaps = 23/155 (14%)
Query: 2 GLESEKDYPYKNANGE-----------KFKCAYDKSKVKLFTGKDFLHFNGSETMKKILY 50
G+ E Y+ + E +FK + L+ D+ +G E M +Y
Sbjct: 122 GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIY 181
Query: 51 KYGPLSVLLN--SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 105
GP+S + L + Y G E + H V + G+G D YW+VR
Sbjct: 182 ANGPISCGIMATERLAN-YTGGIYA------EYQDTTYINHVVSVAGWGISDGTEYWIVR 234
Query: 106 NSWGPIGPDEGFFKIERGNNACGKDFLHFNGSETM 140
NSWG + G+ +I GK + E
Sbjct: 235 NSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEH 269
Score = 81.6 bits (202), Expect = 8e-19
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 10/92 (10%)
Query: 129 KDFLHFNGSETMKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVL 184
D+ +G E M +Y GP+S G+ + + Y G E + H V
Sbjct: 165 GDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYA------EYQDTTYINHVVS 218
Query: 185 LVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
+ G+G D YW+VRNSWG + G+ +I
Sbjct: 219 VAGWGISDGTEYWIVRNSWGEPWGERGWLRIV 250
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 105 bits (265), Expect = 1e-27
Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 9/131 (6%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYK-YGPLSVLLN 60
G+ E Y Y C ++ + ++ + +++ L + + ++V++
Sbjct: 167 GVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQIYPPNANKIREALAQTHSAIAVIIG 223
Query: 61 SDLIHD---YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y+G I HAV +VGY + YW+VRNSW D G+
Sbjct: 224 IKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGY 281
Query: 118 FKIERGNNACG 128
+
Sbjct: 282 GYFAANIDLMM 292
Score = 74.6 bits (184), Expect = 5e-16
Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 7/86 (8%)
Query: 136 GSET--MKKILYKYGPLSVGLN-SHLI--HFYNGTPIRKNDETCSPYDLGHAVLLVGYGK 190
+ + + + ++V + L Y+G I HAV +VGY
Sbjct: 202 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVGYSN 259
Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
+ YW+VRNSW D G+
Sbjct: 260 AQGVDYWIVRNSWDTNWGDNGYGYFA 285
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 105 bits (263), Expect = 4e-27
Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 22/137 (16%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G++SE YPY+ A+G C YD ++V +G +L + ++ GP++V +
Sbjct: 197 GIDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFD 253
Query: 61 SDLIHD--------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIG 112
+D Y TC HAVL+VGYG ++ YWLV+NSWG
Sbjct: 254 AD--DPFGSYSGGVYYN-------PTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGW 304
Query: 113 PDEGFFKIERG-NNACG 128
+G+FKI R NN CG
Sbjct: 305 GLDGYFKIARNANNHCG 321
Score = 81.1 bits (201), Expect = 2e-18
Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
E + ++ GP++V + Y+G + TC HAVL+VGYG ++
Sbjct: 233 PDENMLADMVATKGPVAVAFDADDPFGSYSGGVY--YNPTCETNKFTHAVLIVGYGNENG 290
Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
YWLV+NSWG +G+FKI
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIA 313
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 103 bits (260), Expect = 9e-27
Identities = 38/155 (24%), Positives = 55/155 (35%), Gaps = 36/155 (23%)
Query: 2 GLESEKDYPYKNANGE----------------------KFKCAYDKSKVKLFTGKDFLHF 39
GL S+ PY + + C V +
Sbjct: 156 GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYAL 215
Query: 40 NGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCSPYDLGHAVLLVGYG 94
G + + L+ GP V + D I Y + Y GHAV LVG+G
Sbjct: 216 QGEDDYMRELFFRGPFEVAFDVYEDFIA-Y------NSGVYHHVSGQYLGGHAVRLVGWG 268
Query: 95 KQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
+ +PYW + NSW G +G+F I RG++ CG
Sbjct: 269 TSNGVPYWKIANSWNTEWG-MDGYFLIRRGSSECG 302
Score = 75.5 bits (186), Expect = 3e-16
Identities = 25/94 (26%), Positives = 37/94 (39%), Gaps = 12/94 (12%)
Query: 128 GKDFLHFNGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAV 183
G + + L+ GP V + + Y + Y GHAV
Sbjct: 209 SWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAY------NSGVYHHVSGQYLGGHAV 262
Query: 184 LLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIE 216
LVG+G + +PYW + NSW G +G+F I
Sbjct: 263 RLVGWGTSNGVPYWKIANSWNTEWG-MDGYFLIR 295
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 101 bits (254), Expect = 2e-26
Identities = 39/165 (23%), Positives = 68/165 (41%), Gaps = 36/165 (21%)
Query: 2 GLESEKDYPYK---------------NANGEKFKCAYDKSKVKLFTGKDFLHFNGS---- 42
L +E +YPY + K ++K++ GK + +
Sbjct: 92 FLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHD 151
Query: 43 ------ETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYG 94
+ +K + G + + + + ++++G +K C HAV +VGYG
Sbjct: 152 NMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSG---KKVKNLCGDDTADHAVNIVGYG 208
Query: 95 KQDDI-----PYWLVRNSWGPIGPDEGFFKIER-GNNACGKDFLH 133
+ YW+VRNSWGP DEG+FK++ G C +F+H
Sbjct: 209 NYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFNFIH 253
Score = 78.1 bits (193), Expect = 1e-17
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 10/86 (11%)
Query: 138 ETMKKILYKYGPLSVGLN-SHLI-HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI- 194
+ +K + G + + +++ + ++G +K C HAV +VGYG +
Sbjct: 158 KIIKTEVMNKGSVIAYIKAENVMGYEFSG---KKVKNLCGDDTADHAVNIVGYGNYVNSE 214
Query: 195 ----PYWLVRNSWGPIGPDEGFFKIE 216
YW+VRNSWGP DEG+FK++
Sbjct: 215 GEKKSYWIVRNSWGPYWGDEGYFKVD 240
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 99.1 bits (248), Expect = 8e-26
Identities = 40/138 (28%), Positives = 65/138 (47%), Gaps = 23/138 (16%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G++SE YPY C Y+ + G + + +K+ + + GP+SV ++
Sbjct: 80 GIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 136
Query: 61 SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI 111
+ Y DE+C+ +L HAVL VGYG Q +W+++NSWG
Sbjct: 137 AS--LTSFQFYSKGVY-------YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 187
Query: 112 GPDEGFFKIERG-NNACG 128
++G+ + R NNACG
Sbjct: 188 WGNKGYILMARNKNNACG 205
Score = 76.0 bits (188), Expect = 4e-17
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 15/89 (16%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
G+E +K+ + + GP+SV +++ L F Y DE+C+ +L HAVL VG
Sbjct: 116 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY-------DESCNSDNLNHAVLAVG 168
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG Q +W+++NSWG ++G+ +
Sbjct: 169 YGIQKGNKHWIIKNSWGENWGNKGYILMA 197
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 100 bits (252), Expect = 1e-25
Identities = 45/137 (32%), Positives = 64/137 (46%), Gaps = 22/137 (16%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
GLE+E YPY G +C Y+K TG +H +K ++ GP +V ++
Sbjct: 172 GLETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVD 228
Query: 61 SDLIHD--------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIG 112
+ D Y +TCSP + HAVL VGYG Q YW+V+NSWG
Sbjct: 229 VE--SDFMMYRSGIYQS-------QTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSW 279
Query: 113 PDEGFFKIERG-NNACG 128
+ G+ ++ R N CG
Sbjct: 280 GERGYIRMVRNRGNMCG 296
Score = 77.7 bits (192), Expect = 3e-17
Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 10/86 (11%)
Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRK---NDETCSPYDLGHAVLLVGYGK 190
GSE +K ++ GP +V ++ Y R +TCSP + HAVL VGYG
Sbjct: 208 GSEVELKNLVGAEGPAAVAVDVESDFMMY-----RSGIYQSQTCSPLRVNHAVLAVGYGT 262
Query: 191 QDDIPYWLVRNSWGPIGPDEGFFKIE 216
Q YW+V+NSWG + G+ ++
Sbjct: 263 QGGTDYWIVKNSWGLSWGERGYIRMV 288
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 100 bits (252), Expect = 2e-25
Identities = 40/130 (30%), Positives = 67/130 (51%), Gaps = 8/130 (6%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G+ SE YPY+ C +D S+ +G L ++ + + GP++V ++
Sbjct: 195 GIMSESAYPYEAQGD---YCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAID 251
Query: 61 -SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
+D + Y+G D+TC+ DL H VL+VGYG + YW+++NSWG + G+++
Sbjct: 252 ATDELQFYSGGLF--YDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWR 309
Query: 120 IERG-NNACG 128
R N CG
Sbjct: 310 QVRNYGNNCG 319
Score = 78.5 bits (194), Expect = 2e-17
Identities = 27/83 (32%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 136 GSET-MKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
G E + + + GP++V ++ + + FY+G D+TC+ DL H VL+VGYG +
Sbjct: 231 GDENSLADAVGQAGPVAVAIDATDELQFYSGGLF--YDQTCNQSDLNHGVLVVGYGSDNG 288
Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
YW+++NSWG + G+++
Sbjct: 289 QDYWILKNSWGSGWGESGYWRQV 311
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 98.8 bits (247), Expect = 2e-25
Identities = 42/139 (30%), Positives = 71/139 (51%), Gaps = 17/139 (12%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKL----FTGKDFLHFN---GSETMKKILYKYGP 54
G+ SE DYPYK +G KC ++ + K+ + + + + +E+ + P
Sbjct: 81 GIASEADYPYKARDG---KCKANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLEQP 137
Query: 55 LSVLLNSDLIHDYNGTPIRKNDETCS-PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGP 113
+SV +++ H Y+G + CS PY + H VL+VGYG +D + YW+ +NSWG
Sbjct: 138 ISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVGYGSEDGVDYWIAKNSWGEDWG 195
Query: 114 DEGFFKIERGNNA----CG 128
+G+ +I+R CG
Sbjct: 196 IDGYIRIQRNTGNLLGVCG 214
Score = 76.5 bits (189), Expect = 5e-17
Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 3/82 (3%)
Query: 136 GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCS-PYDLGHAVLLVGYGKQDDI 194
+E+ + P+SV +++ HFY+G + CS PY + H VL+VGYG +D +
Sbjct: 124 EAESSLQSFVLEQPISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVGYGSEDGV 181
Query: 195 PYWLVRNSWGPIGPDEGFFKIE 216
YW+ +NSWG +G+ +I+
Sbjct: 182 DYWIAKNSWGEDWGIDGYIRIQ 203
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 98.1 bits (245), Expect = 1e-24
Identities = 42/134 (31%), Positives = 67/134 (50%), Gaps = 15/134 (11%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G++SE YPY C Y+ + G + + +K+ + + GP+SV ++
Sbjct: 179 GIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 235
Query: 61 SDLI--HDYNGTPIRK---NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDE 115
+ L Y K DE+C+ +L HAVL VGYG Q +W+++NSWG ++
Sbjct: 236 ASLTSFQFY-----SKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 290
Query: 116 GFFKIERG-NNACG 128
G+ + R NNACG
Sbjct: 291 GYILMARNKNNACG 304
Score = 75.0 bits (185), Expect = 4e-16
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 15/89 (16%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
G+E +K+ + + GP+SV +++ L F Y DE+C+ +L HAVL VG
Sbjct: 215 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY-------DESCNSDNLNHAVLAVG 267
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG Q +W+++NSWG ++G+ +
Sbjct: 268 YGIQKGNKHWIIKNSWGENWGNKGYILMA 296
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 93.7 bits (234), Expect = 1e-23
Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 28/142 (19%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
GL+SE+ YPY+ C Y+ F+ E + K + GP+SV ++
Sbjct: 82 GLDSEESYPYEATEE---SCKYNPKYSVA-NDTGFVDIPKQEKALMKAVATVGPISVAID 137
Query: 61 SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQ----DDIPYWLVRNS 107
+ H+ Y + CS D+ H VL+VGYG + D+ YWLV+NS
Sbjct: 138 AG--HESFLFYKEGIY-------FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS 188
Query: 108 WGPIGPDEGFFKIERG-NNACG 128
WG G+ K+ + N CG
Sbjct: 189 WGEEWGMGGYVKMAKDRRNHCG 210
Score = 69.5 bits (171), Expect = 1e-14
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 18/93 (19%)
Query: 135 NGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
+ + K + GP+SV +++ F Y + CS D+ H VL+VG
Sbjct: 117 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF-------EPDCSSEDMDHGVLVVG 169
Query: 188 YGKQ----DDIPYWLVRNSWGPIGPDEGFFKIE 216
YG + D+ YWLV+NSWG G+ K+
Sbjct: 170 YGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 202
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 93.3 bits (233), Expect = 1e-23
Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 12/132 (9%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G++S+ YPYK + KC YD + L + + +K+ + GP+SV ++
Sbjct: 84 GIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVD 140
Query: 61 ---SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+G + +C+ ++ H VL+VGYG + YWLV+NSWG +EG+
Sbjct: 141 ARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 196
Query: 118 FKIERG-NNACG 128
++ R N CG
Sbjct: 197 IRMARNKGNHCG 208
Score = 69.5 bits (171), Expect = 1e-14
Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 8/85 (9%)
Query: 136 GSET-MKKILYKYGPLSVGLN-SHLI-HFY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQ 191
G E +K+ + GP+SVG++ H Y +G + +C+ ++ H VL+VGYG
Sbjct: 120 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDL 175
Query: 192 DDIPYWLVRNSWGPIGPDEGFFKIE 216
+ YWLV+NSWG +EG+ ++
Sbjct: 176 NGKEYWLVKNSWGHNFGEEGYIRMA 200
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 92.7 bits (231), Expect = 6e-23
Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 14/109 (12%)
Query: 26 SKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCS 80
+ K + + N + + +YK GP+ + SD + Y K+
Sbjct: 147 KQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-Y------KSGVYQHVTG 199
Query: 81 PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
GHA+ ++G+G ++ PYWLV NSW G D GFFKI RG + CG
Sbjct: 200 EMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHCG 247
Score = 75.8 bits (187), Expect = 9e-17
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 12/86 (13%)
Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
N + + +YK GP+ + + Y K+ GHA+ ++G+G
Sbjct: 161 NSEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGV 214
Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKI 215
++ PYWLV NSW G D GFFKI
Sbjct: 215 ENGTPYWLVANSWNTDWG-DNGFFKI 239
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 92.7 bits (231), Expect = 1e-22
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 12/132 (9%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G++S+ YPYK + KC YD + L + + +K+ + GP+SV ++
Sbjct: 181 GIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVD 237
Query: 61 SDLI--HDY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y +G + +C+ ++ H VL+VGYG + YWLV+NSWG +EG+
Sbjct: 238 ARHPSFFLYRSGV---YYEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGY 293
Query: 118 FKIERG-NNACG 128
++ R N CG
Sbjct: 294 IRMARNKGNHCG 305
Score = 68.4 bits (168), Expect = 6e-14
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 16/89 (17%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
G E +K+ + GP+SVG+++ F Y + +C+ ++ H VL+VG
Sbjct: 217 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYY-------EPSCTQ-NVNHGVLVVG 268
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG + YWLV+NSWG +EG+ ++
Sbjct: 269 YGDLNGKEYWLVKNSWGHNFGEEGYIRMA 297
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 91.1 bits (227), Expect = 2e-22
Identities = 30/115 (26%), Positives = 52/115 (45%), Gaps = 14/115 (12%)
Query: 20 KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN-- 75
K ++ K + N + ++K + KYGP+ D ++ Y K+
Sbjct: 137 KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLN-Y------KSGI 189
Query: 76 -DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
GHA+ ++G+G ++ PYWL+ NSW G + G+F+I RG + C
Sbjct: 190 YKHITGETLGGHAIRIIGWGVENKAPYWLIANSWNEDWG-ENGYFRIVRGRDECS 243
Score = 76.9 bits (190), Expect = 3e-17
Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 12/87 (13%)
Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
N + ++K + KYGP+ G + Y K+ GHA+ ++G+G
Sbjct: 157 NDEKAIQKEIMKYGPVEAGFTVYEDFLNY------KSGIYKHITGETLGGHAIRIIGWGV 210
Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKIE 216
++ PYWL+ NSW G + G+F+I
Sbjct: 211 ENKAPYWLIANSWNEDWG-ENGYFRIV 236
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 92.0 bits (229), Expect = 3e-22
Identities = 34/109 (31%), Positives = 51/109 (46%), Gaps = 14/109 (12%)
Query: 26 SKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKN---DETCS 80
+ K + + N + + +YK GP+ + SD + Y K+
Sbjct: 204 KQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLL-Y------KSGVYQHVTG 256
Query: 81 PYDLGHAVLLVGYGKQDDIPYWLVRNSWGPI-GPDEGFFKIERGNNACG 128
GHA+ ++G+G ++ PYWLV NSW G D GFFKI RG + CG
Sbjct: 257 EMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHCG 304
Score = 74.7 bits (184), Expect = 4e-16
Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 12/87 (13%)
Query: 135 NGSETMKKILYKYGPLSVGLNSHL-IHFYNGTPIRKN---DETCSPYDLGHAVLLVGYGK 190
N + + +YK GP+ + + Y K+ GHA+ ++G+G
Sbjct: 218 NSEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGV 271
Query: 191 QDDIPYWLVRNSWGPI-GPDEGFFKIE 216
++ PYWLV NSW G D GFFKI
Sbjct: 272 ENGTPYWLVANSWNTDWG-DNGFFKIL 297
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 91.9 bits (229), Expect = 3e-22
Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 28/142 (19%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
GL+SE+ YPY+ C Y+ F+ E + K + GP+SV ++
Sbjct: 178 GLDSEESYPYEATEE---SCKYNPKYSVA-NDAGFVDIPKQEKALMKAVATVGPISVAID 233
Query: 61 SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYG----KQDDIPYWLVRNS 107
+ H+ Y + CS D+ H VL+VGYG + D+ YWLV+NS
Sbjct: 234 AG--HESFLFYKEGIY-------FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS 284
Query: 108 WGPIGPDEGFFKIERG-NNACG 128
WG G+ K+ + N CG
Sbjct: 285 WGEEWGMGGYVKMAKDRRNHCG 306
Score = 68.8 bits (169), Expect = 6e-14
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 18/93 (19%)
Query: 135 NGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
+ + K + GP+SV +++ F Y + CS D+ H VL+VG
Sbjct: 213 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF-------EPDCSSEDMDHGVLVVG 265
Query: 188 YG----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG + D+ YWLV+NSWG G+ K+
Sbjct: 266 YGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 298
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 89.8 bits (224), Expect = 3e-22
Identities = 35/135 (25%), Positives = 53/135 (39%), Gaps = 16/135 (11%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ YPYK G C + + + N + + K P+SV++
Sbjct: 79 GIHLRSKYPYKAKQG---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVV 134
Query: 60 NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
S Y G + C + HAV VGYGK Y L++NSWG ++G+
Sbjct: 135 ESKGRPFQLYKGGIF---EGPCGT-KVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGY 190
Query: 118 FKIERGNNA----CG 128
+I+R CG
Sbjct: 191 IRIKRAPGNSPGVCG 205
Score = 65.9 bits (162), Expect = 2e-13
Identities = 25/89 (28%), Positives = 39/89 (43%), Gaps = 18/89 (20%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
+E + + K P+SV + S F + G C + HAV VG
Sbjct: 116 NNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP--------CGT-KVDHAVTAVG 165
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YGK Y L++NSWG ++G+ +I+
Sbjct: 166 YGKSGGKGYILIKNSWGTAWGEKGYIRIK 194
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 89.5 bits (223), Expect = 3e-22
Identities = 34/133 (25%), Positives = 58/133 (43%), Gaps = 14/133 (10%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
G++++++YPY G C + +V G + N ++ + P+SV + +
Sbjct: 80 GIDTQQNYPYSAVQG---SCKPYRLRVVSINGFQRVTRNNESALQSAVAS-QPVSVTVEA 135
Query: 62 DLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
Y+ C H V++VGYG Q YW+VRNSWG ++G+
Sbjct: 136 AGAPFQHYSSGIF---TGPCGT-AQNHGVVIVGYGTQSGKNYWIVRNSWGQNWGNQGYIW 191
Query: 120 IERGNNA----CG 128
+ER + CG
Sbjct: 192 MERNVASSAGLCG 204
Score = 63.3 bits (155), Expect = 2e-12
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 16/76 (21%)
Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
P+SV + + F + G C H V++VGYG Q YW+VR
Sbjct: 127 QPVSVTVEAAGAPFQHYSSGIFTGP--------CGT-AQNHGVVIVGYGTQSGKNYWIVR 177
Query: 201 NSWGPIGPDEGFFKIE 216
NSWG ++G+ +E
Sbjct: 178 NSWGQNWGNQGYIWME 193
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 90.7 bits (225), Expect = 5e-22
Identities = 25/144 (17%), Positives = 51/144 (35%), Gaps = 19/144 (13%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKS-------------KVKLFTGKDFLHFNGSET-MKK 47
G+ EK++PY + + + + + ++ +K
Sbjct: 137 GVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEYSRVAQDIDHLKA 196
Query: 48 ILYKYGPLSV--LLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 105
L P + + + + + + GHAVL VGY D+I ++ +R
Sbjct: 197 CLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD--DEIRHFRIR 254
Query: 106 NSWGPIGPDEGFFKIERG-NNACG 128
NSWG ++G+F + +
Sbjct: 255 NSWGNNVGEDGYFWMPYEYISNTQ 278
Score = 78.7 bits (194), Expect = 1e-17
Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 4/80 (5%)
Query: 138 ETMKKILYKYGPLSVGLNSH--LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP 195
+ +K L P G + + + + + GHAVL VGY D+I
Sbjct: 192 DHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYD--DEIR 249
Query: 196 YWLVRNSWGPIGPDEGFFKI 215
++ +RNSWG ++G+F +
Sbjct: 250 HFRIRNSWGNNVGEDGYFWM 269
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 88.4 bits (220), Expect = 2e-21
Identities = 38/145 (26%), Positives = 59/145 (40%), Gaps = 29/145 (20%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLN- 60
GL S+ DYPY + E C + + T K ++ + K+ L GP+S+ +
Sbjct: 99 GLCSQDDYPYVSNLPET--CNLKRCNERY-TIKSYVSIP-DDKFKEALRYLGPISISIAA 154
Query: 61 SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG----------KQDDIPYWLVRNS 107
SD Y G C HAV+LVGYG + + Y++++NS
Sbjct: 155 SDDFAFYRGGFYDG------ECGA-APNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNS 207
Query: 108 WGPIGPDEGFFKIERGNNA----CG 128
WG + G+ +E N C
Sbjct: 208 WGSDWGEGGYINLETDENGYKKTCS 232
Score = 68.4 bits (168), Expect = 4e-14
Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 21/91 (23%)
Query: 140 MKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG------ 189
K+ L GP+S+ + S FY G C HAV+LVGYG
Sbjct: 138 FKEALRYLGPISISIAASDDFAFYRGGFYDG------ECGA-APNHAVILVGYGMKDIYN 190
Query: 190 ----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
+ + Y++++NSWG + G+ +E
Sbjct: 191 EDTGRMEKFYYYIIKNSWGSDWGEGGYINLE 221
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 87.6 bits (218), Expect = 2e-21
Identities = 32/134 (23%), Positives = 62/134 (46%), Gaps = 15/134 (11%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G+ SE+ YPY+ +G C + + + + + ++++K + P+SV ++
Sbjct: 82 GINSEETYPYRGQDG---ICNSTVNAPVVSIDSYENVPSHNEQSLQKAVAN-QPVSVTMD 137
Query: 61 SDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFF 118
+ Y +C+ HA+ +VGYG ++D +W+V+NSWG + G+
Sbjct: 138 AAGRDFQLYRSGIF---TGSCNI-SANHALTVVGYGTENDKDFWIVKNSWGKNWGESGYI 193
Query: 119 KIERGNNA----CG 128
+ ER CG
Sbjct: 194 RAERNIENPDGKCG 207
Score = 65.6 bits (161), Expect = 3e-13
Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 16/88 (18%)
Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
+E + P+SV +++ F + G+ C+ HA+ +VGY
Sbjct: 118 HNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGS--------CNI-SANHALTVVGY 168
Query: 189 GKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
G ++D +W+V+NSWG + G+ + E
Sbjct: 169 GTENDKDFWIVKNSWGKNWGESGYIRAE 196
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 86.8 bits (216), Expect = 5e-21
Identities = 37/143 (25%), Positives = 60/143 (41%), Gaps = 31/143 (21%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ +E +YPY+ +G C K + + N + K + P+SV +
Sbjct: 82 GITTEANYPYEAYDG---TCDVSKENAPAVSIDGHENVPENDENALLKAVAN-QPVSVAI 137
Query: 60 NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWG 109
++ + G+ C +L H V +VGYG D YW V+NSWG
Sbjct: 138 DAG--GSDFQFYSEGVFTGS--------CGT-ELDHGVAIVGYGTTIDGTKYWTVKNSWG 186
Query: 110 PIGPDEGFFKIERG----NNACG 128
P ++G+ ++ERG CG
Sbjct: 187 PEWGEKGYIRMERGISDKEGLCG 209
Score = 61.0 bits (149), Expect = 1e-11
Identities = 25/89 (28%), Positives = 39/89 (43%), Gaps = 17/89 (19%)
Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
E P+SV +++ F + G+ C +L H V +VGY
Sbjct: 119 NDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGS--------CGT-ELDHGVAIVGY 169
Query: 189 GKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
G D YW V+NSWGP ++G+ ++E
Sbjct: 170 GTTIDGTKYWTVKNSWGPEWGEKGYIRME 198
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 86.5 bits (215), Expect = 9e-21
Identities = 37/146 (25%), Positives = 58/146 (39%), Gaps = 31/146 (21%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSET-MKKILYKYGPLSVLLN 60
G+ + DYPY + C D+ K + + + +K+ L GP+S+ +
Sbjct: 97 GICPDGDYPYVSDAPNL--CNIDRCTEK---YGIKNYLSVPDNKLKEALRFLGPISISVA 151
Query: 61 -SDLIHDYNG---TPIRKNDETCSPYDLGHAVLLVGYG----------KQDDIPYWLVRN 106
SD Y C L HAV+LVG+G K + Y++++N
Sbjct: 152 VSDDFAFYKEGIFDG------ECGD-QLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKN 204
Query: 107 SWGPIGPDEGFFKIERGNNA----CG 128
SWG + GF IE + CG
Sbjct: 205 SWGQQWGERGFINIETDESGLMRKCG 230
Score = 66.1 bits (162), Expect = 2e-13
Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 21/91 (23%)
Query: 140 MKKILYKYGPLSVGLN-SHLIHFYNG---TPIRKNDETCSPYDLGHAVLLVGYG------ 189
+K+ L GP+S+ + S FY C L HAV+LVG+G
Sbjct: 136 LKEALRFLGPISISVAVSDDFAFYKEGIFDG------ECGD-QLNHAVMLVGFGMKEIVN 188
Query: 190 ----KQDDIPYWLVRNSWGPIGPDEGFFKIE 216
K + Y++++NSWG + GF IE
Sbjct: 189 PLTKKGEKHYYYIIKNSWGQQWGERGFINIE 219
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 85.2 bits (212), Expect = 1e-20
Identities = 37/141 (26%), Positives = 61/141 (43%), Gaps = 29/141 (20%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ +E +YPY G +C D + K + + + +N ++ + P+SV L
Sbjct: 82 GINTEANYPYTAEEG---QCNLDLQQEKYVSIDTYENVPYNNEWALQTAVAY-QPVSVAL 137
Query: 60 NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
+ + G C + HAV +VGYG + I YW+V+NSWG
Sbjct: 138 EAA--GYNFQHYSSGIFTG--------PCGT-AVDHAVTIVGYGTEGGIDYWIVKNSWGT 186
Query: 111 IGPDEGFFKIERG---NNACG 128
+EG+ +I+R CG
Sbjct: 187 TWGEEGYMRIQRNVGGVGQCG 207
Score = 63.3 bits (155), Expect = 2e-12
Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 6/71 (8%)
Query: 148 GPLSVGLN-SHLI-HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 205
P+SV L + Y+ C + HAV +VGYG + I YW+V+NSWG
Sbjct: 131 QPVSVALEAAGYNFQHYSSGIF---TGPCGT-AVDHAVTIVGYGTEGGIDYWIVKNSWGT 186
Query: 206 IGPDEGFFKIE 216
+EG+ +I+
Sbjct: 187 TWGEEGYMRIQ 197
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 83.3 bits (207), Expect = 7e-20
Identities = 31/135 (22%), Positives = 56/135 (41%), Gaps = 20/135 (14%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ + YPY+ +C ++K + N + + + + P+S+++
Sbjct: 79 GIHLRQYYPYEGVQR---QCRASQAKGPKVKTDGVGRVPRNNEQALIQRIAI-QPVSIVV 134
Query: 60 NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ +Y G C + HAV VGYG Y L++NSWG + G+
Sbjct: 135 EAKGRAFQNYRGGIF---AGPCGT-SIDHAVAAVGYGN----DYILIKNSWGTGWGEGGY 186
Query: 118 FKIERGNNA----CG 128
+I+RG+ CG
Sbjct: 187 IRIKRGSGNPQGACG 201
Score = 59.0 bits (144), Expect = 5e-11
Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 20/76 (26%)
Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
P+S+ + + F + G C + HAV VGYG Y L++
Sbjct: 128 QPVSIVVEAKGRAFQNYRGGIFAGP--------CGT-SIDHAVAAVGYGN----DYILIK 174
Query: 201 NSWGPIGPDEGFFKIE 216
NSWG + G+ +I+
Sbjct: 175 NSWGTGWGEGGYIRIK 190
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 83.3 bits (207), Expect = 7e-20
Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 16/135 (11%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ + K YPY+ KC + + N + L PLSVL+
Sbjct: 79 GVHTSKVYPYQAKQY---KCRATDKPGPKVKITGYKRVPSNCETSFLGALAN-QPLSVLV 134
Query: 60 NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y D C L HAV VGYG D Y +++NSWGP ++G+
Sbjct: 135 EAGGKPFQLYKSGVF---DGPCGT-KLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGY 190
Query: 118 FKIERGNNA----CG 128
+++R + CG
Sbjct: 191 MRLKRQSGNSQGTCG 205
Score = 63.3 bits (155), Expect = 2e-12
Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 18/89 (20%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
ET L PLSV + + F ++G C L HAV VG
Sbjct: 116 NCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP--------CGT-KLDHAVTAVG 165
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG D Y +++NSWGP ++G+ +++
Sbjct: 166 YGTSDGKNYIIIKNSWGPNWGEKGYMRLK 194
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 83.3 bits (207), Expect = 8e-20
Identities = 36/135 (26%), Positives = 60/135 (44%), Gaps = 20/135 (14%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ +E++YPY+ G +C K + + N ++ + + P+SV+
Sbjct: 79 GVHTEREYPYEKKQG---RCRAKDKKGPKVYITGYKYVPANDEISLIQAIAN-QPVSVVT 134
Query: 60 NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+S Y G + C + HAV VGYGK Y L++NSWGP ++G+
Sbjct: 135 DSRGRGFQFYKGGIY---EGPCGT-NTDHAVTAVGYGK----TYLLLKNSWGPNWGEKGY 186
Query: 118 FKIERG----NNACG 128
+I+R CG
Sbjct: 187 IRIKRASGRSKGTCG 201
Score = 59.4 bits (145), Expect = 5e-11
Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 22/89 (24%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
E + + + P+SV +S F Y G C + HAV VG
Sbjct: 116 NDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP--------CGT-NTDHAVTAVG 165
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YGK Y L++NSWGP ++G+ +I+
Sbjct: 166 YGK----TYLLLKNSWGPNWGEKGYIRIK 190
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 84.2 bits (209), Expect = 2e-19
Identities = 33/142 (23%), Positives = 51/142 (35%), Gaps = 30/142 (21%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ YPYK G C + + + N + + K P+SV++
Sbjct: 185 GIHLRSKYPYKAKQG---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVV 240
Query: 60 NSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGP 110
S + G C + AV VGYGK Y L++NSWG
Sbjct: 241 ESK--GRPFQLYKGGIFEG--------PCGT-KVDGAVTAVGYGKSGGKGYILIKNSWGT 289
Query: 111 IGPDEGFFKIERGNNA----CG 128
++G+ +I+R CG
Sbjct: 290 AWGEKGYIRIKRAPGNSPGVCG 311
Score = 62.3 bits (152), Expect = 1e-11
Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 16/76 (21%)
Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVR 200
P+SV + S F + G C + AV VGYGK Y L++
Sbjct: 234 QPVSVVVESKGRPFQLYKGGIFEGP--------CGT-KVDGAVTAVGYGKSGGKGYILIK 284
Query: 201 NSWGPIGPDEGFFKIE 216
NSWG ++G+ +I+
Sbjct: 285 NSWGTAWGEKGYIRIK 300
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 81.9 bits (203), Expect = 2e-19
Identities = 34/131 (25%), Positives = 55/131 (41%), Gaps = 17/131 (12%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
G++++ +YPYK G C V + G + + F +E K P +V +++
Sbjct: 80 GIDTQANYPYKAVQG---PCQAASKVVSI-DGYNGVPFC-NEXALKQAVAVQPSTVAIDA 134
Query: 62 DL--IHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 119
Y+ L H V +VGY YW+VRNSWG ++G+ +
Sbjct: 135 SSAQFQQYSSGIF----SGPCGTKLNHGVTIVGYQA----NYWIVRNSWGRYWGEKGYIR 186
Query: 120 IER--GNNACG 128
+ R G CG
Sbjct: 187 MLRVGGCGLCG 197
Score = 52.3 bits (126), Expect = 1e-08
Identities = 20/83 (24%), Positives = 33/83 (39%), Gaps = 10/83 (12%)
Query: 136 GSETMKKILYKYGPLSVGLNS--HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD 193
+E K P +V +++ Y+ L H V +VGY
Sbjct: 114 CNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIF----SGPCGTKLNHGVTIVGYQA--- 166
Query: 194 IPYWLVRNSWGPIGPDEGFFKIE 216
YW+VRNSWG ++G+ ++
Sbjct: 167 -NYWIVRNSWGRYWGEKGYIRML 188
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 81.3 bits (202), Expect = 4e-19
Identities = 31/135 (22%), Positives = 47/135 (34%), Gaps = 20/135 (14%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFNGSETMKKILYKYGPLSVLL 59
G+ YPY+ C + + + + P+SV+L
Sbjct: 79 GIHYRNTYPYEGVQR---YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVL 134
Query: 60 NSDLI--HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGF 117
+ Y G C + HAV VGYG Y L++NSWG + G+
Sbjct: 135 EAAGKDFQLYRGGIF---VGPCGN-KVDHAVAAVGYGP----NYILIKNSWGTGWGENGY 186
Query: 118 FKIERGNNA----CG 128
+I+RG CG
Sbjct: 187 IRIKRGTGNSYGVCG 201
Score = 59.0 bits (144), Expect = 5e-11
Identities = 23/89 (25%), Positives = 36/89 (40%), Gaps = 22/89 (24%)
Query: 136 GSET-MKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVG 187
+E + + P+SV L + F + G C + HAV VG
Sbjct: 116 YNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP--------CGN-KVDHAVAAVG 165
Query: 188 YGKQDDIPYWLVRNSWGPIGPDEGFFKIE 216
YG Y L++NSWG + G+ +I+
Sbjct: 166 YGP----NYILIKNSWGTGWGENGYIRIK 190
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 81.4 bits (202), Expect = 5e-19
Identities = 34/142 (23%), Positives = 55/142 (38%), Gaps = 30/142 (21%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKL-FTGKDFLHFNGSETMKKILYKYGPLSVLLN 60
G+ SE YPY + +C + + G + MK L K P+S+ +
Sbjct: 88 GICSEDAYPYLARDE---ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAK-SPVSIAIE 143
Query: 61 SDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD--IPYWLVRNSWG 109
+D ++ +C DL H VLLVGYG + +W+++NSWG
Sbjct: 144 AD--QMPFQFYHEGVFDA--------SCGT-DLDHGVLLVGYGTDKESKKDFWIMKNSWG 192
Query: 110 PIGPDEGFFKIERG---NNACG 128
+G+ + CG
Sbjct: 193 TGWGRDGYMYMAMHKGEEGQCG 214
Score = 59.1 bits (144), Expect = 7e-11
Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 18/78 (23%)
Query: 148 GPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD--IPYWL 198
P+S+ + + + F ++ + C DL H VLLVGYG + +W+
Sbjct: 136 SPVSIAIEADQMPFQFYHEGVFDAS--------CGT-DLDHGVLLVGYGTDKESKKDFWI 186
Query: 199 VRNSWGPIGPDEGFFKIE 216
++NSWG +G+ +
Sbjct: 187 MKNSWGTGWGRDGYMYMA 204
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 80.3 bits (198), Expect = 1e-18
Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 13/135 (9%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS 61
G+ S+ +YPY +G C +K G + + S + P+SV + +
Sbjct: 80 GIASDANYPYTGVDG---TCDLNKPIAARIDG--YTNVPNSSSALLDAVAKQPVSVNIYT 134
Query: 62 DL--IHDYNGTPIRKNDE-TCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGF 117
Y G I + P + H VL+VGYG YW+V+NSWG +G+
Sbjct: 135 SSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGY 194
Query: 118 FKIERGNNA----CG 128
I R N C
Sbjct: 195 ILIRRNTNRPDGVCA 209
Score = 60.6 bits (147), Expect = 2e-11
Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 4/111 (3%)
Query: 110 PIGPDEGFFKIERGNNACGKDFLHFNGSETMKKILYKYGPLSVGLNS--HLIHFYNGTPI 167
P +G + + A + + S + P+SV + + Y G I
Sbjct: 88 PYTGVDGTCDLNKPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGI 147
Query: 168 RKNDE-TCSPYDLGHAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
+ P + H VL+VGYG YW+V+NSWG +G+ I
Sbjct: 148 FAGSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWGIDGYILIR 198
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 80.0 bits (198), Expect = 3e-18
Identities = 38/146 (26%), Positives = 61/146 (41%), Gaps = 34/146 (23%)
Query: 2 GLESEKDYPYKNANGEKFKCAYDKSKVKL-----FTGKDFLHFNGSETMKKILYKYGPLS 56
GL +E YPY+ A G C ++ G + N E + + + P+S
Sbjct: 84 GLITEAAYPYRAARG---TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVS 139
Query: 57 VLLNSDLIHD---------YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDD-IPYWLVRN 106
V + + + G C +L H V +VGYG +D YW V+N
Sbjct: 140 VAVEAS--GKAFMFYSEGVFTGE--------CGT-ELDHGVAVVGYGVAEDGKAYWTVKN 188
Query: 107 SWGPIGPDEGFFKIERGNNA----CG 128
SWGP ++G+ ++E+ + A CG
Sbjct: 189 SWGPSWGEQGYIRVEKDSGASGGLCG 214
Score = 61.5 bits (150), Expect = 1e-11
Identities = 26/89 (29%), Positives = 39/89 (43%), Gaps = 17/89 (19%)
Query: 136 GSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLLVGY 188
SE P+SV + + F + G C +L H V +VGY
Sbjct: 124 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGE--------CGT-ELDHGVAVVGY 174
Query: 189 GKQDD-IPYWLVRNSWGPIGPDEGFFKIE 216
G +D YW V+NSWGP ++G+ ++E
Sbjct: 175 GVAEDGKAYWTVKNSWGPSWGEQGYIRVE 203
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 38.7 bits (89), Expect = 9e-04
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 86 HAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKI 120
H + + G K Q+ Y++V+NSWG G +
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYA 353
Score = 38.7 bits (89), Expect = 9e-04
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 181 HAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKI 215
H + + G K Q+ Y++V+NSWG G +
Sbjct: 318 HGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWYA 353
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 33.2 bits (75), Expect = 0.072
Identities = 9/36 (25%), Positives = 15/36 (41%), Gaps = 3/36 (8%)
Query: 86 HAVLLVGYGKQDD---IPYWLVRNSWGPIGPDEGFF 118
A+L+ G + + V NSWG +G +
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408
Score = 33.2 bits (75), Expect = 0.072
Identities = 9/36 (25%), Positives = 15/36 (41%), Gaps = 3/36 (8%)
Query: 181 HAVLLVGYGKQDD---IPYWLVRNSWGPIGPDEGFF 213
A+L+ G + + V NSWG +G +
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 31.3 bits (70), Expect = 0.26
Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 4/37 (10%)
Query: 86 HAVLLVGYGKQDDIP----YWLVRNSWGPIGPDEGFF 118
HA+ ++DD W V NSWG +G+
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407
Score = 31.3 bits (70), Expect = 0.26
Identities = 11/37 (29%), Positives = 16/37 (43%), Gaps = 4/37 (10%)
Query: 181 HAVLLVGYGKQDDIP----YWLVRNSWGPIGPDEGFF 213
HA+ ++DD W V NSWG +G+
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407
>1atg_A MODA, periplasmic molybdate-binding protein; tungstate, ABC
transporter; 1.20A {Azotobacter vinelandii} SCOP:
c.94.1.1
Length = 231
Score = 27.2 bits (61), Expect = 3.8
Identities = 4/25 (16%), Positives = 8/25 (32%)
Query: 124 NNACGKDFLHFNGSETMKKILYKYG 148
A + F+ + I+ G
Sbjct: 202 EKANAEQFMSWMKGPKAVAIIKAAG 226
>1x9y_A Cysteine proteinase; half-barrel, barrel-sandwich-hybrid,
hydrolase; 2.50A {Staphylococcus aureus} SCOP: d.3.1.1
d.17.1.4
Length = 367
Score = 27.3 bits (59), Expect = 4.3
Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 20/108 (18%)
Query: 8 DYPYKNANGEKFK-CAYDKSKVKLFT---GKDFLHFNGSET---MKKILYKYGPLSVLLN 60
Y + + CA +++ + G+D + G + + ++ + +L
Sbjct: 234 RTLYPEVSEQDLPNCATFPNQMIEYGKSQGRDIHYQEGVPSYNQVDQLTKDNVGIMILAQ 293
Query: 61 SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 108
S + + LGHA+ +VG K +D + N W
Sbjct: 294 S-------------VSQNPNDPHLGHALAVVGNAKINDQEKLIYWNPW 328
Score = 26.9 bits (58), Expect = 6.2
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 171 DETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 203
+ + LGHA+ +VG K +D + N W
Sbjct: 296 SQNPNDPHLGHALAVVGNAKINDQEKLIYWNPW 328
>2b9s_A Topoisomerase I-like protein; vanadate complex, isomerase/DNA
complex; HET: DNA; 2.27A {Leishmania donovani}
Length = 432
Score = 27.3 bits (60), Expect = 5.5
Identities = 9/47 (19%), Positives = 19/47 (40%)
Query: 22 AYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSDLIHDYN 68
+ KS + + K F +N S T+ + + +D + +N
Sbjct: 371 DHLKSFMDGLSAKVFRTYNASITLDRWFKEKPVDPKWSTADKLAYFN 417
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.319 0.142 0.459
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,732,767
Number of extensions: 217514
Number of successful extensions: 546
Number of sequences better than 10.0: 1
Number of HSP's gapped: 446
Number of HSP's successfully gapped: 94
Length of query: 233
Length of database: 6,701,793
Length adjustment: 90
Effective length of query: 143
Effective length of database: 4,188,903
Effective search space: 599013129
Effective search space used: 599013129
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 56 (25.1 bits)