RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy4960
(341 letters)
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 210 bits (537), Expect = 2e-66
Identities = 74/308 (24%), Positives = 118/308 (38%), Gaps = 25/308 (8%)
Query: 40 VDAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY-YGTSGSSDRSPQEILQR-TGL 97
+ F+ Y +N++Y + + + F + K + SD S E R
Sbjct: 5 IKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGGAINHLSDLSLDEFKNRFLMS 64
Query: 98 RLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFAT 157
+ E L+ + + G P +D RQ + + P+ QG CGS WAF+
Sbjct: 65 A---EAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRT--VTPIRMQGGCGSAWAFSG 119
Query: 158 TAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRN 217
A ES + L++ +LV+C C+G I EY++ G+ ++ Y Y
Sbjct: 120 VAATESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYVA 178
Query: 218 KENITFRCTYEKEKA---KVFVQDTWVTSG-VDHMMHLL--QSGPIGVYLNHRLIES--- 268
+E C + + Q + + + L I V + + +++
Sbjct: 179 REQ---SCRRPNAQRFGISNYCQ---IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRH 232
Query: 269 YDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA 328
YDG I HAV IVGY G+ WIVRNSW D+GY +
Sbjct: 233 YDGRTI--IQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDL 290
Query: 329 CGIESYAY 336
IE Y Y
Sbjct: 291 MMIEEYPY 298
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 203 bits (518), Expect = 1e-64
Identities = 77/212 (36%), Positives = 115/212 (54%), Gaps = 8/212 (3%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P DWR K V V+ QG CGSCWAF+ T +E Q L + TL LS+ +L++CD
Sbjct: 2 PPEWDWRS-KGAV-TKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM 59
Query: 187 NLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
+ C GG A+ +K GLE++ DY Y+ C + EKAKV++QD ++
Sbjct: 60 DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQ---SCQFSAEKAKVYIQDSVELSQN 116
Query: 245 VDHMMHLLQS-GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILT 303
+ L GPI V +N ++ Y R C+P +DHAV +VGYG+++ +
Sbjct: 117 EQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPF 176
Query: 304 WIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
W ++NSWG + GY+ + RG+ ACG+ + A
Sbjct: 177 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMA 208
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 200 bits (511), Expect = 1e-63
Identities = 62/222 (27%), Positives = 92/222 (41%), Gaps = 20/222 (9%)
Query: 124 GPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC 183
G P +D RQ V P+ QG CGS WAF+ A ES ++ L++ +LV+C
Sbjct: 8 GNAPAEIDLRQ-MRTV-TPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELVDC 65
Query: 184 DHGNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEK-AKV--FVQDTW 240
C+G I EY++ G+ ++ Y Y +E C + + + Q
Sbjct: 66 A-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYVAREQ---SCRRPNAQRFGISNYCQ--- 118
Query: 241 VTSG-VDHMMHLL--QSGPIGVYLNHRLIES---YDGNPIRRNDWACNPHKLDHAVAIVG 294
+ + + L I V + + +++ YDG I HAV IVG
Sbjct: 119 IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI--IQRDNGYQPNYHAVNIVG 176
Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYAY 336
Y G+ WIVRNSW D+GY + IE Y Y
Sbjct: 177 YSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPY 218
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 202 bits (517), Expect = 4e-63
Identities = 85/329 (25%), Positives = 143/329 (43%), Gaps = 63/329 (19%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
+ + + + ++Y+ E R FK + + E+ + + D S
Sbjct: 25 EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84
Query: 89 QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
+E L + + + K PL S+DWR + V + V+ QG
Sbjct: 85 EEFLAYVNRGKAQKPK-------HPENLRMPYVSSKKPLAASVDWRSNAV---SEVKDQG 134
Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQY 205
+CGS W+F+TT +E Q+AL + L LS+ L++C GN C+GG +D AF Y+ Y
Sbjct: 135 QCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDY 194
Query: 206 GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLL-QSGPI----- 257
G+ S++ YPY + + C ++ ++ + + SG + + + Q+GP+
Sbjct: 195 GIMSESAYPYEAQGD---YCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAID 251
Query: 258 ----------GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVR 307
G++ D CN L+H V +VGYG NG WI++
Sbjct: 252 ATDELQFYSGGLF----------------YDQTCNQSDLNHGVLVVGYGSDNGQDYWILK 295
Query: 308 NSWGDIGPDHGYFQIERG-ANACGIESYA 335
NSWG + GY++ R N CGI + A
Sbjct: 296 NSWGSGWGESGYWRQVRNYGNNCGIATAA 324
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 202 bits (515), Expect = 4e-63
Identities = 84/315 (26%), Positives = 140/315 (44%), Gaps = 34/315 (10%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
++ + + Y + + +R ++++ K + + D +
Sbjct: 9 THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68
Query: 89 QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
+E++Q TGL++ +G P S+D+R+ K V PV++QG
Sbjct: 69 EEVVQKMTGLKVPLSH-------SRSNDTLYIPEWEGRAPDSVDYRK-KGYV-TPVKNQG 119
Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YG 206
+CGSCWAF++ LE Q+ L LS LV+C N C GG + AF+YV++ G
Sbjct: 120 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRG 179
Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMH-LLQSGPIGVYLN- 262
++S+ YPY +E C Y + + G + + + GP+ V ++
Sbjct: 180 IDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDA 236
Query: 263 -HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
+ Y D +CN L+HAV VGYG + G WI++NSWG+ + GY
Sbjct: 237 SLTSFQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYIL 294
Query: 322 IERG-ANACGIESYA 335
+ R NACGI + A
Sbjct: 295 MARNKNNACGIANLA 309
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 200 bits (510), Expect = 4e-62
Identities = 84/319 (26%), Positives = 139/319 (43%), Gaps = 44/319 (13%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEYYGTSGS--------SDRSPQEIL 92
F ++++ N+ Y + +E RFE FK + DE + S +D S E
Sbjct: 20 QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFN 79
Query: 93 Q-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGS 151
+ G + + + LP+++DWR+ K V PV QG CGS
Sbjct: 80 EKYVGSLIDATI-------EQSYDEEFINEDIVNLPENVDWRK-KGAV-TPVRHQGSCGS 130
Query: 152 CWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQYGLESQA 211
CWAF+ A +E + L LS+ +LV+C+ + C GG A EYV + G+ ++
Sbjct: 131 CWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRS 190
Query: 212 DYPYRNKENITFRCTYEKEKAKVFVQDT---WVTSG-VDHMMHLLQSGPIGVYLN--HRL 265
YPY+ K+ C ++ + V+ + V ++++ + P+ V + R
Sbjct: 191 KYPYKAKQG---TCRAKQVGGPI-VKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 246
Query: 266 IESYDG----NPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQ 321
+ Y G P C K+D AV VGYG+ G +++NSWG + GY +
Sbjct: 247 FQLYKGGIFEGP-------CGT-KVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIR 298
Query: 322 IERGANA----CGIESYAY 336
I+R CG+ +Y
Sbjct: 299 IKRAPGNSPGVCGLYKSSY 317
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 200 bits (510), Expect = 5e-62
Identities = 91/316 (28%), Positives = 148/316 (46%), Gaps = 30/316 (9%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
+ ++ + + R+Y + E R + F++ + +E+ Y S D +P
Sbjct: 20 EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79
Query: 89 QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
+E+ GL + ++ L P S DWR + V +PV++QG
Sbjct: 80 EEMKAYTHGLI--MPADLHKNGIPIKTREDLGLNASVRYPASFDWRD-QGMV-SPVKNQG 135
Query: 148 RCGSCWAFATTAILESQVALLKKTL--YPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQY 205
CGS WAF++T +ESQ+ + +S+ QLV+C L C+GG ++ AF YV Q
Sbjct: 136 SCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQN 195
Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVYL 261
G++S+ YPY + C Y+ + + +++ + + ++ + GP+ V
Sbjct: 196 GGIDSEGAYPYEMADG---NCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAF 252
Query: 262 N-HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYF 320
+ SY G + C +K HAV IVGYG +NG W+V+NSWGD GYF
Sbjct: 253 DADDPFGSYSGGVY--YNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDGYF 310
Query: 321 QIERGA-NACGIESYA 335
+I R A N CGI A
Sbjct: 311 KIARNANNHCGIAGVA 326
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 194 bits (496), Expect = 2e-61
Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 17/217 (7%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P ++DWR + V V+ QG+CGSCWAF+ +E Q L L LS+ LV CD
Sbjct: 2 PAAVDWRA-RGAV-TAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKT 59
Query: 187 NLNCNGGNIDVAFEYVKQY---GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVT 242
+ C+GG ++ AFE++ Q + ++ YPY + E I+ CT + +
Sbjct: 60 DSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELP 119
Query: 243 SGVDHMMH-LLQSGPIGVYLNHRLIESYDG---NPIRRNDWACNPHKLDHAVAIVGYGEK 298
+ L +GP+ V ++ +Y G C +LDH V +VGY +
Sbjct: 120 QDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS-------CVSEQLDHGVLLVGYNDS 172
Query: 299 NGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
+ WI++NSW + GY +I +G+N C ++ A
Sbjct: 173 AAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEA 209
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 197 bits (503), Expect = 3e-61
Identities = 88/314 (28%), Positives = 143/314 (45%), Gaps = 34/314 (10%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
D + + +N+ Y ++ + R ++++ K E+ + + D +
Sbjct: 3 DLWHQWKRMYNKEYNGADD-QHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61
Query: 89 QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
+E + + A E +P +DWR+S + V+ QG
Sbjct: 62 EEFKAK----YLTE---MSRASDILSHGVPYEANNRAVPDKIDWRESGY--VTEVKDQGN 112
Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQYG 206
CGS WAF+TT +E Q ++T S+ QLV+C GN C GG ++ A++Y+KQ+G
Sbjct: 113 CGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQFG 172
Query: 207 LESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVYLN- 262
LE+++ YPY E +C Y K+ V V SG + +L+ + GP V ++
Sbjct: 173 LETESSYPYTAVEG---QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDV 229
Query: 263 HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQI 322
Y C+P +++HAV VGYG + G WIV+NSWG + GY ++
Sbjct: 230 ESDFMMYRSGIY--QSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRM 287
Query: 323 ERG-ANACGIESYA 335
R N CGI S A
Sbjct: 288 VRNRGNMCGIASLA 301
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 189 bits (484), Expect = 1e-59
Identities = 67/225 (29%), Positives = 109/225 (48%), Gaps = 28/225 (12%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP+++DWR+ + PV QG CGSCWAF+ A +E + L LS+ +LV+C+
Sbjct: 1 LPENVDWRKKGA--VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER 58
Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
+ C GG A EYV + G+ ++ YPY+ K+ C ++ + V+ + V
Sbjct: 59 RSHGCKGGYPPYALEYVAKNGIHLRSKYPYKAKQG---TCRAKQVGGPI-VKTSGVGRVQ 114
Query: 243 SGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
++++ + P+ V + R + Y G P C K+DHAV VGY
Sbjct: 115 PNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGP-------CGT-KVDHAVTAVGY 166
Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
G+ G +++NSWG + GY +I+R CG+ +Y
Sbjct: 167 GKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSY 211
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 190 bits (485), Expect = 2e-59
Identities = 66/225 (29%), Positives = 113/225 (50%), Gaps = 22/225 (9%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
P+S DW + K + V+ QG+CGS WAF+ T +E+ A+ L LS+ +L++C
Sbjct: 2 APESWDWSK-KGVI-TKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVD 59
Query: 186 GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSG 244
+ C G +FE+V ++ G+ S+ADYPY+ ++ +C + + KV + + V
Sbjct: 60 ESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDG---KCKANEIQDKVTIDNYGVQIL 116
Query: 245 V---------DHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWAC-NPHKLDHAVAIVG 294
+ + PI V ++ + Y G + C +P+ ++H V IVG
Sbjct: 117 SNESTESEAESSLQSFVLEQPISVSIDAKDFHFYSGGIY--DGGNCSSPYGINHFVLIVG 174
Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
YG ++G+ WI +NSWG+ GY +I+R CG+ +A
Sbjct: 175 YGSEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMNYFA 219
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 188 bits (479), Expect = 7e-59
Identities = 74/216 (34%), Positives = 110/216 (50%), Gaps = 14/216 (6%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P S+D+R+ K V PV++QG+CGSCWAF++ LE Q+ L LS LV+C
Sbjct: 2 PDSVDYRK-KGYV-TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE 59
Query: 187 NLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG 244
N C GG + AF+YV++ G++S+ YPY +E C Y + + G
Sbjct: 60 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEE---SCMYNPTGKAAKCRGYREIPEG 116
Query: 245 -VDHMMHLL-QSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
+ + + GP+ V ++ + Y D +CN L+HAV VGYG + G
Sbjct: 117 NEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQKG 174
Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA-NACGIESYA 335
WI++NSWG+ + GY + R NACGI + A
Sbjct: 175 NKHWIIKNSWGENWGNKGYILMARNKNNACGIANLA 210
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 187 bits (477), Expect = 2e-58
Identities = 83/232 (35%), Positives = 122/232 (52%), Gaps = 43/232 (18%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
P S+DWR+ V +PV++QG CGSCW F+TT LES VA+ + L++ QLV+C
Sbjct: 2 PPSMDWRKKGNFV-SPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQN 60
Query: 186 -GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTW-VT 242
N C GG AFEY++ G+ + YPY+ +++ C ++ +KA FV+D +T
Sbjct: 61 FNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDD---HCKFQPDKAIAFVKDVANIT 117
Query: 243 SG-VDHMMHLLQS-GPI---------------GVYLNHRLIESYDGNPIRRNDWACN--P 283
+ M+ + P+ G+Y + +C+ P
Sbjct: 118 MNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIY----------------SSTSCHKTP 161
Query: 284 HKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANACGIESYA 335
K++HAV VGYGE+NGI WIV+NSWG +GYF IERG N CG+ + A
Sbjct: 162 DKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACA 213
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 186 bits (475), Expect = 3e-58
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 40/230 (17%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P+S+DWR + PV++QG CGSCWAF+T A +E ++ L LS+ +LV+CD
Sbjct: 2 PQSIDWRAKGA--VTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH 59
Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVTS 243
+ C GG + +YV G+ + YPY+ K+ +C + V+ T V S
Sbjct: 60 SYGCKGGYQTTSLQYVANNGVHTSKVYPYQAKQY---KCRATDKPGPK-VKITGYKRVPS 115
Query: 244 GV-DHMMHLLQSGPIGVYLNHRLIES------------YDGNPIRRNDWACNPHKLDHAV 290
+ L + P+ V +E+ +DG C KLDHAV
Sbjct: 116 NCETSFLGALANQPLSVL-----VEAGGKPFQLYKSGVFDGP--------CGT-KLDHAV 161
Query: 291 AIVGYGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
VGYG +G I++NSWG + GY +++R + CG+ +Y
Sbjct: 162 TAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSY 211
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 185 bits (472), Expect = 8e-58
Identities = 73/225 (32%), Positives = 110/225 (48%), Gaps = 28/225 (12%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP S+DWR+ V PV++QG CGSCWAF+T A +E ++ L LS+ QLV+C
Sbjct: 3 LPDSIDWRE-NGAV-VPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT 60
Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTS 243
N C GG ++ AF+++ G+ S+ YPYR ++ C V + V S
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDG---ICNSTVNAPVVSIDSYENVPS 117
Query: 244 GV-DHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAIVGY 295
+ + + P+ V ++ +L S + G+ CN +HA+ +VGY
Sbjct: 118 HNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGS--------CNI-SANHALTVVGY 168
Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
G +N WIV+NSWG + GY + ER CGI +A
Sbjct: 169 GTENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFAS 213
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 188 bits (480), Expect = 9e-58
Identities = 94/320 (29%), Positives = 139/320 (43%), Gaps = 43/320 (13%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
+ + NR Y NE R ++++ K + + Y S D +
Sbjct: 10 AQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68
Query: 89 QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
+E Q G + + R K E P+S+DWR+ K V PV++QG
Sbjct: 69 EEFRQVMNGFQ----------NRKPRKGKVFQEPLFYEAPRSVDWRE-KGYV-TPVKNQG 116
Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH--GNLNCNGGNIDVAFEYVKQY 205
+CGSCWAF+ T LE Q+ L LS+ LV+C GN CNGG +D AF+YV+
Sbjct: 117 QCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDN 176
Query: 206 -GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSGVDHMMH-LLQSGPIGVYLN 262
GL+S+ YPY E C Y + + + +M + GPI V ++
Sbjct: 177 GGLDSEESYPYEATEE---SCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAID 233
Query: 263 --HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG----EKNGILTWIVRNSWGDIGPD 316
H Y + C+ +DH V +VGYG E + W+V+NSWG+
Sbjct: 234 AGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGM 291
Query: 317 HGYFQIERGA-NACGIESYA 335
GY ++ + N CGI S A
Sbjct: 292 GGYVKMAKDRRNHCGIASAA 311
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 187 bits (476), Expect = 4e-57
Identities = 82/318 (25%), Positives = 140/318 (44%), Gaps = 40/318 (12%)
Query: 41 DAFKTYIVKWNRTYTDDNEIKTRFEYFKQDGKETDEY---YGTSGSS---------DRSP 88
+ + + + Y + NE R ++++ K + + S D +
Sbjct: 10 HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69
Query: 89 QEILQ-RTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQG 147
+E++ + LR+ + + + LP S+DWR+ + V+ QG
Sbjct: 70 EEVMSLMSSLRVPSQWQRNITYKSNPN---------RILPDSVDWREK--GCVTEVKYQG 118
Query: 148 RCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH---GNLNCNGGNIDVAFEYVKQ 204
CG+ WAF+ LE+Q+ L L LS LV+C GN CNGG + AF+Y+
Sbjct: 119 SCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIID 178
Query: 205 -YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVTSG-VDHMMHLLQS-GPIGVY 260
G++S A YPY+ + +C Y+ + T + G D + + + GP+ V
Sbjct: 179 NKGIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVG 235
Query: 261 LN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWGDIGPDHG 318
++ H Y + +C ++H V +VGYG+ NG W+V+NSWG + G
Sbjct: 236 VDARHPSFFLYRSGVY--YEPSCTQ-NVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEG 292
Query: 319 YFQIERG-ANACGIESYA 335
Y ++ R N CGI S+
Sbjct: 293 YIRMARNKGNHCGIASFP 310
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 182 bits (465), Expect = 8e-57
Identities = 66/225 (29%), Positives = 105/225 (46%), Gaps = 32/225 (14%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
+P S+DWRQ K V PV +QG CGSCW F++ A +E ++ L LS+ +L++C+
Sbjct: 1 IPTSIDWRQ-KGAV-TPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCER 58
Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
+ C GG A +YV G+ + YPY + +C + K V+ V
Sbjct: 59 RSYGCRGGFPLYALQYVANSGIHLRQYYPYEGVQR---QCRASQAKGPK-VKTDGVGRVP 114
Query: 243 SGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
++ + P+ + + R ++Y G P C +DHAVA VGY
Sbjct: 115 RNNEQALIQRIAIQPVSIVVEAKGRAFQNYRGGIFAGP-------CGT-SIDHAVAAVGY 166
Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
G +++NSWG + GY +I+RG+ CG+ S +
Sbjct: 167 GNDY----ILIKNSWGTGWGEGGYIRIKRGSGNPQGACGVLSDSV 207
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 182 bits (465), Expect = 9e-57
Identities = 65/220 (29%), Positives = 112/220 (50%), Gaps = 24/220 (10%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P+S+DWR+ K V PV++Q CGSCWAF+T A +E ++ L LS+ +L++C+
Sbjct: 2 PESIDWRE-KGAV-TPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERR 59
Query: 187 NLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVTS 243
+ C+GG + +YV G+ ++ +YPY K+ RC + +K V T +V +
Sbjct: 60 SHGCDGGYQTTSLQYVVDNGVHTEREYPYEKKQG---RCRAKDKKGPK-VYITGYKYVPA 115
Query: 244 GV-DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNG 300
++ + + P+ V + R + Y G + C DHAV VGYG+
Sbjct: 116 NDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIY---EGPCG-TNTDHAVTAVGYGKT-- 169
Query: 301 ILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYAY 336
+++NSWG + GY +I+R + CG+ + ++
Sbjct: 170 --YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSF 207
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 181 bits (463), Expect = 1e-56
Identities = 66/225 (29%), Positives = 107/225 (47%), Gaps = 32/225 (14%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
+P+ +DWRQ K V PV++QG CGSCWAF+ +E + + L S+ +L++CD
Sbjct: 1 IPEYVDWRQ-KGAV-TPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDR 58
Query: 186 GNLNCNGGNIDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WVT 242
+ CNGG A + V QYG+ + YPY + C ++ + V
Sbjct: 59 RSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQR---YCRSREKGPYA-AKTDGVRQVQ 114
Query: 243 SG-VDHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVGY 295
+++ + + P+ V L + + Y G P C K+DHAVA VGY
Sbjct: 115 PYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGP-------CGN-KVDHAVAAVGY 166
Query: 296 GEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
G +++NSWG ++GY +I+RG CG+ + ++
Sbjct: 167 GPN----YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSF 207
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 181 bits (462), Expect = 3e-56
Identities = 71/225 (31%), Positives = 105/225 (46%), Gaps = 31/225 (13%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP +DWR K V N +++Q +CGSCWAF+ A +ES + L LS+ +LV+CD
Sbjct: 1 LPSFVDWRS-KGAV-NSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT 58
Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---WV 241
+ CNGG ++ AF+Y+ G+++Q +YPY + C + + V V
Sbjct: 59 ASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQG---SCKPYRLRV---VSINGFQRV 112
Query: 242 TSGV-DHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAIVG 294
T + + S P+ V + + Y P C +H V IVG
Sbjct: 113 TRNNESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGP-------CGT-AQNHGVVIVG 164
Query: 295 YGEKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYA 335
YG ++G WIVRNSWG + GY +ER + CGI
Sbjct: 165 YGTQSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLP 209
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 182 bits (463), Expect = 3e-56
Identities = 77/254 (30%), Positives = 114/254 (44%), Gaps = 32/254 (12%)
Query: 108 EADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVAL 167
EA+ E V K + DWR V PV+ Q CGSCWAF++ +ESQ A+
Sbjct: 2 EANYEDVIKKYKPADAKLDRIAYDWRL-HGGV-TPVKDQALCGSCWAFSSVGSVESQYAI 59
Query: 168 LKKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCT 226
KK L+ S+ +LV+C N C GG I AF+ + GL SQ DYPY + T C
Sbjct: 60 RKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET--CN 117
Query: 227 YEKEKAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLN-HRLIESYDG---NPIRRNDWAC 281
++ + + ++V+ D L+ GPI + + Y G + C
Sbjct: 118 LKRCNERYTI-KSYVSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDG------EC 170
Query: 282 NPHKLDHAVAIVGYGEKNGILT----------WIVRNSWGDIGPDHGYFQIERGANA--- 328
+HAV +VGYG K+ +I++NSWG + GY +E N
Sbjct: 171 GA-APNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKK 229
Query: 329 -CGIESYAYLASVK 341
C I + AY+ ++
Sbjct: 230 TCSIGTEAYVPLLE 243
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 180 bits (460), Expect = 5e-56
Identities = 78/221 (35%), Positives = 112/221 (50%), Gaps = 19/221 (8%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH- 185
P+S+DWR+ K V PV++QG+CGSCWAF+ T LE Q+ L LS+ LV+C
Sbjct: 2 PRSVDWRE-KGYV-TPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 59
Query: 186 -GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWVT 242
GN CNGG +D AF+YV+ GL+S+ YPY E C Y + + +
Sbjct: 60 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE---SCKYNPKYSVANDTGFVDIP 116
Query: 243 SGVDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG--- 296
+M + + GPI V ++ H Y + C+ +DH V +VGYG
Sbjct: 117 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFES 174
Query: 297 -EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
E + W+V+NSWG+ GY ++ + N CGI S A
Sbjct: 175 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAA 215
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 180 bits (460), Expect = 1e-55
Identities = 70/250 (28%), Positives = 116/250 (46%), Gaps = 34/250 (13%)
Query: 112 ERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKT 171
E +KK+ E + + DWR V PV+ Q CGSCWAF++ +ESQ A+ K
Sbjct: 6 EVIKKYRGE--ENFDHAAYDWRL-HSGV-TPVKDQKNCGSCWAFSSIGSVESQYAIRKNK 61
Query: 172 LYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKE 230
L LS+ +LV+C N CNGG I+ AFE + + G+ DYPY + C ++
Sbjct: 62 LITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDAPNL--CNIDRC 119
Query: 231 KAKVFVQDTWVTSGVDHMMHLLQS-GPIGVYLN-HRLIESYDG---NPIRRNDWACNPHK 285
K + +++ + + L+ GPI + + Y + C +
Sbjct: 120 TEKYGI-KNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDG------ECGD-Q 171
Query: 286 LDHAVAIVGYGEKNGILT----------WIVRNSWGDIGPDHGYFQIERGANA----CGI 331
L+HAV +VG+G K + +I++NSWG + G+ IE + CG+
Sbjct: 172 LNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGL 231
Query: 332 ESYAYLASVK 341
+ A++ ++
Sbjct: 232 GTDAFIPLIE 241
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 185 bits (472), Expect = 2e-55
Identities = 74/315 (23%), Positives = 130/315 (41%), Gaps = 36/315 (11%)
Query: 48 VKWNRTYTDDNEIKTRFEYFKQDGKETDE-----YYGTSGS----SDRSPQEILQRTGLR 98
V N + +++ K +K D T+ + + ++++R
Sbjct: 125 VYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRR---- 180
Query: 99 LTGKEKERLEADRERVKKFLNERKKGPLPKSLDWR-QSKVKVLNPVESQGRCGSCWAFAT 157
G ++ + + + K LP S DWR + ++PV +Q CGSC++FA+
Sbjct: 181 SGGHSRKIPRPKPAPLTAEIQQ-KILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFAS 239
Query: 158 TAILESQVALL--KKTLYPLSKSQLVECDHGNLNCNGGNIDVAFEYVKQ-YGLESQADYP 214
+LE+++ +L LS ++V C C GG + Q +GL +A +P
Sbjct: 240 MGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFP 299
Query: 215 YRNKENITFRCTYEKEKAKVFVQD------TWVTSGVDHMMH-LLQSGPIGVYLN-HRLI 266
Y ++ C +++ + + + + M L+ GP+ V +
Sbjct: 300 YTGTDS---PCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDF 356
Query: 267 ESYDG---NPIRRNDWACNPHKLDHAVAIVGYGEKNGILT--WIVRNSWG-DIGPDHGYF 320
Y + D +HAV +VGYG + WIV+NSWG G ++GYF
Sbjct: 357 LHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWG-ENGYF 415
Query: 321 QIERGANACGIESYA 335
+I RG + C IES A
Sbjct: 416 RIRRGTDECAIESIA 430
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 179 bits (457), Expect = 2e-55
Identities = 78/229 (34%), Positives = 114/229 (49%), Gaps = 33/229 (14%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
+P S+DWR+ K V V+ QG+CGSCWAF+T +E + L LS+ +LV+CD
Sbjct: 2 VPASVDWRK-KGAV-TSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT 59
Query: 186 -GNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKEKAKVFVQDT---W 240
N CNGG +D AFE++KQ G+ ++A+YPY + C KE A V
Sbjct: 60 DQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDG---TCDVSKENAPA-VSIDGHEN 115
Query: 241 VTSGV-DHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAI 292
V + ++ + + P+ V ++ + + G+ C +LDH VAI
Sbjct: 116 VPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGS--------CGT-ELDHGVAI 166
Query: 293 VGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGA----NACGIESYAY 336
VGYG +G W V+NSWG + GY ++ERG CGI A
Sbjct: 167 VGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEAS 215
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 178 bits (454), Expect = 4e-55
Identities = 69/227 (30%), Positives = 104/227 (45%), Gaps = 30/227 (13%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP +DWR S + ++ QG+CGS WAF+T A +E + L LS+ +LV+C
Sbjct: 1 LPDYVDWRSSGA--VVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGR 58
Query: 186 --GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDT--- 239
C+GG + F+++ G+ ++A+YPY +E +C + ++ K V
Sbjct: 59 TQNTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEG---QCNLDLQQEKY-VSIDTYE 114
Query: 240 WVTSG-VDHMMHLLQSGPIGVYLN--HRLIESYDG----NPIRRNDWACNPHKLDHAVAI 292
V + + P+ V L + Y P C +DHAV I
Sbjct: 115 NVPYNNEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGP-------CGT-AVDHAVTI 166
Query: 293 VGYGEKNGILTWIVRNSWGDIGPDHGYFQIERG---ANACGIESYAY 336
VGYG + GI WIV+NSWG + GY +I+R CGI A
Sbjct: 167 VGYGTEGGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKAS 213
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 178 bits (453), Expect = 6e-55
Identities = 73/220 (33%), Positives = 111/220 (50%), Gaps = 18/220 (8%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP S+DWR+ + V+ QG CG+CWAF+ LE+Q+ L L LS LV+C
Sbjct: 2 LPDSVDWREKGC--VTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 59
Query: 186 ---GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TW 240
GN CNGG + AF+Y+ G++S A YPY+ + +C Y+ + T
Sbjct: 60 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ---KCQYDSKYRAATCSKYTE 116
Query: 241 VTSG-VDHMMHLLQS-GPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYG 296
+ G D + + + GP+ V ++ H Y + +C ++H V +VGYG
Sbjct: 117 LPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVY--YEPSCTQ-NVNHGVLVVGYG 173
Query: 297 EKNGILTWIVRNSWGDIGPDHGYFQIERG-ANACGIESYA 335
+ NG W+V+NSWG + GY ++ R N CGI S+
Sbjct: 174 DLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFP 213
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 174 bits (443), Expect = 1e-53
Identities = 69/219 (31%), Positives = 103/219 (47%), Gaps = 24/219 (10%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP+ +DWR+ K V PV++QG CGSCWAF+T + +ES + L LS+ +LV+CD
Sbjct: 1 LPEQIDWRK-KGAV-TPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK 58
Query: 186 GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKV--FVQDTWVT 242
N C GG A++Y+ G+++QA+YPY+ + C + + + V
Sbjct: 59 KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQG---PCQAASKVVSIDGYNG---VP 112
Query: 243 SGVDH-MMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKN 299
+ + + P V ++ + Y KL+H V IVGY
Sbjct: 113 FCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFS----GPCGTKLNHGVTIVGYQAN- 167
Query: 300 GILTWIVRNSWGDIGPDHGYFQIER--GANACGIESYAY 336
WIVRNSWG + GY ++ R G CGI Y
Sbjct: 168 ---YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPY 203
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 175 bits (447), Expect = 1e-53
Identities = 77/229 (33%), Positives = 112/229 (48%), Gaps = 30/229 (13%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECD- 184
LP S+DWRQ K V V+ QG+CGSCWAF+T +E A+ +L LS+ +L++CD
Sbjct: 4 LPPSVDWRQ-KGAV-TGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61
Query: 185 HGNLNCNGGNIDVAFEYVKQY-GLESQADYPYRNKENITFRCTYEKE--KAKVFVQDT-- 239
N C GG +D AFEY+K GL ++A YPYR C + + V V
Sbjct: 62 ADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARG---TCNVARAAQNSPVVVHIDGH 118
Query: 240 -WVTSGV-DHMMHLLQSGPIGVYLN--HRLIESYDG---NPIRRNDWACNPHKLDHAVAI 292
V + + + + + P+ V + + Y C +LDH VA+
Sbjct: 119 QDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTG------ECGT-ELDHGVAV 171
Query: 293 VGYG-EKNGILTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
VGYG ++G W V+NSWG + GY ++E+ + A CGI A
Sbjct: 172 VGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEAS 220
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 174 bits (443), Expect = 2e-53
Identities = 66/227 (29%), Positives = 100/227 (44%), Gaps = 31/227 (13%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
LP +DWR + V PV+ Q CGSCWAF+TT LE L LS+ +L++C
Sbjct: 7 LPAGVDWRS-RGCV-TPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR 64
Query: 186 --GNLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQD-TWV 241
GN +C+GG ++ AF+YV G+ S+ YPY ++ C + + V + V
Sbjct: 65 AEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE---ECRAQSCEKVVKILGFKDV 121
Query: 242 TSG-VDHMMHLLQSGPIGVYLNH-----RLIES--YDGNPIRRNDWACNPHKLDHAVAIV 293
M L P+ + + + +D + C LDH V +V
Sbjct: 122 PRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDAS--------CGT-DLDHGVLLV 172
Query: 294 GYG--EKNGILTWIVRNSWGDIGPDHGYFQIERG---ANACGIESYA 335
GYG +++ WI++NSWG GY + CG+ A
Sbjct: 173 GYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDA 219
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 170 bits (433), Expect = 9e-51
Identities = 69/297 (23%), Positives = 102/297 (34%), Gaps = 67/297 (22%)
Query: 84 SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ--SKVKVLN 141
+ + +E + G+ L ++F E + PLP S D + +
Sbjct: 35 QNITLREAKRLNGVIKKNNNASIL-----PKRRFTEEEARAPLPSSFDSAEAWPNCPTIP 89
Query: 142 PVESQGRCGSCWAFATTAILESQVALLK-KTLYPLSKSQLVECDHGNLN-CNGGNIDVAF 199
+ Q CGSCWA A + + + + +S L+ C + CNGG+ D A+
Sbjct: 90 QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 149
Query: 200 EYVKQYGLESQADYPYR------------------NKENITFRCTYEKEK-----AKVFV 236
Y GL S PY T +C Y +
Sbjct: 150 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRS 209
Query: 237 QDTWVTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYDGNPIRRNDWA 280
++ G D M L GP GVY +H
Sbjct: 210 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVY-HHV---------------- 252
Query: 281 CNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANACGIESYAY 336
+ HAV +VG+G NG+ W + NSW + G GYF I RG++ CGIE
Sbjct: 253 SGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWG-MDGYFLIRRGSSECGIEDGGS 308
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 161 bits (410), Expect = 1e-48
Identities = 70/219 (31%), Positives = 98/219 (44%), Gaps = 14/219 (6%)
Query: 127 PKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHG 186
P S+DWR+ K V V+ QG CG CWAF T +E A+ L +S+ Q+V+CD
Sbjct: 2 PASIDWRK-KGAV-TSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTX 59
Query: 187 NLNCNGGNIDVAFEYVKQ-YGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV 245
GG+ D AF +V G+ S A+YPY + C K A T V +
Sbjct: 60 XXXXXGGDADDAFRWVITNGGIASDANYPYTGVDG---TCDLNKPIAARIDGYTNVPNSS 116
Query: 246 DHMMHLLQSGPIGVYLN--HRLIESYDGNPIRRNDW-ACNPHKLDHAVAIVGYG-EKNGI 301
++ + P+ V + + Y G I + +P +DH V IVGYG
Sbjct: 117 SALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNA 176
Query: 302 LTWIVRNSWGDIGPDHGYFQIERGANA----CGIESYAY 336
WIV+NSWG GY I R N C I+++
Sbjct: 177 DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAIDAWGS 215
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 162 bits (413), Expect = 2e-48
Identities = 62/255 (24%), Positives = 99/255 (38%), Gaps = 63/255 (24%)
Query: 126 LPKSLDWR-QSKVKVLNPVESQ---GRCGSCWAFATTAILESQVALLKKTLYP---LSKS 178
LPKS DWR V + +Q CGSCWA A+T+ + ++ + +K +P LS
Sbjct: 36 LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 95
Query: 179 QLVECDHGNLNCNGGNIDVAFEYVKQYGLESQADYPY------------RNKENITFRCT 226
+++C + +C GGN ++Y Q+G+ + Y N C
Sbjct: 96 NVIDCGNAG-SCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH 154
Query: 227 YEKEKAKVFVQDTWVTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYD 270
+ V D SG + MM + +GPI G+Y
Sbjct: 155 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIY---------- 204
Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANAC 329
++ ++H V++ G+G +G WIVRNSWG G + G+ +I
Sbjct: 205 ------AEYQDTT-YINHVVSVAGWGISDGTEYWIVRNSWGEPWG-ERGWLRIVTSTYKD 256
Query: 330 G--------IESYAY 336
G IE +
Sbjct: 257 GKGARYNLAIEEHCT 271
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 162 bits (412), Expect = 9e-48
Identities = 68/307 (22%), Positives = 111/307 (36%), Gaps = 82/307 (26%)
Query: 84 SDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQ--SKVKVLN 141
+ + + G L G + + E +K LP S D R+ + +
Sbjct: 32 YNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLK----------LPASFDAREQWPQCPTIK 81
Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLY--PLSKSQLVEC--DHGNLNCNGGNIDV 197
+ QG CGSCWAF + ++ + +S L+ C CNGG
Sbjct: 82 EIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAE 141
Query: 198 AFEYVKQYGLESQADYPYR----------------------NKENITFRC--------TY 227
A+ + + GL S Y E T +C +
Sbjct: 142 AWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSP 201
Query: 228 EKEKAKVFVQDTW-VTSGVDHMMH-LLQSGPI---------------GVYLNHRLIESYD 270
++ K + +++ V++ +M + ++GP+ GVY H
Sbjct: 202 TYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY-QHV------ 254
Query: 271 GNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG-DIGPDHGYFQIERGANAC 329
HA+ I+G+G +NG W+V NSW D G D+G+F+I RG + C
Sbjct: 255 ----------TGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG-DNGFFKILRGQDHC 303
Query: 330 GIESYAY 336
GIES
Sbjct: 304 GIESEVV 310
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 159 bits (405), Expect = 2e-47
Identities = 62/265 (23%), Positives = 99/265 (37%), Gaps = 72/265 (27%)
Query: 126 LPKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQVALLK--KTLYPLSKSQLV 181
+P S D R+ + K + + Q RCGSCWAF + + + K LS L+
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62
Query: 182 ECDH-GNLNCNGGNIDVAFEYVKQYGLESQADYPYR-----------------------N 217
C L C GG + A++Y + G+ + + +
Sbjct: 63 SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122
Query: 218 KENITFRC--TYEKEKAKVFVQDT-------WVTSGVDHMMH-LLQSGPI---------- 257
K T RC T +K+ + QD V + + +++ GP+
Sbjct: 123 KIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDF 182
Query: 258 -----GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG- 311
G+Y H HA+ I+G+G +N W++ NSW
Sbjct: 183 LNYKSGIY-KHI----------------TGETLGGHAIRIIGWGVENKAPYWLIANSWNE 225
Query: 312 DIGPDHGYFQIERGANACGIESYAY 336
D G ++GYF+I RG + C IES
Sbjct: 226 DWG-ENGYFRIVRGRDECSIESEVT 249
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 158 bits (403), Expect = 6e-47
Identities = 64/265 (24%), Positives = 100/265 (37%), Gaps = 72/265 (27%)
Query: 126 LPKSLDWRQ--SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLY--PLSKSQLV 181
LP S D R+ + + + QG CGS WAF + ++ + +S L+
Sbjct: 7 LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66
Query: 182 EC--DHGNLNCNGGNIDVAFEYVKQYGLESQADY-------PYR---------------N 217
C CNGG A+ + + GL S Y PY
Sbjct: 67 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126
Query: 218 KENITFRC--------TYEKEKAKVFVQDT-WVTSGVDHMMH-LLQSGPI---------- 257
E T +C + ++ K + ++ V++ +M + ++GP+
Sbjct: 127 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 186
Query: 258 -----GVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSWG- 311
GVY H HA+ I+G+G +NG W+V NSW
Sbjct: 187 LLYKSGVY-QHV----------------TGEMMGGHAIRILGWGVENGTPYWLVANSWNT 229
Query: 312 DIGPDHGYFQIERGANACGIESYAY 336
D G D+G+F+I RG + CGIES
Sbjct: 230 DWG-DNGFFKILRGQDHCGIESEVV 253
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 157 bits (398), Expect = 4e-46
Identities = 50/249 (20%), Positives = 85/249 (34%), Gaps = 44/249 (17%)
Query: 126 LPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDH 185
+ D + VE QG C + W FA+ LE+ + +S + C
Sbjct: 10 CNRLKDENN-CISN-LQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYK 67
Query: 186 --GNLNCNGGNIDVAF-EYVKQYG-LESQADYPYRNKENITF---------------RCT 226
C+ G+ + F + ++ YG L ++++YPY + +
Sbjct: 68 GEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKIL 127
Query: 227 YEKEKAKVFVQDTWVTSGVDHMMHLLQS------------GPIGVYLN--HRLIESYDGN 272
+ K + + + + + G + Y+ + + + G
Sbjct: 128 HNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGK 187
Query: 273 PIRRNDWACNPHKLDHAVAIVGYGEKNGILT-----WIVRNSWGDIGPDHGYFQIER-GA 326
+ C DHAV IVGYG WIVRNSWG D GYF+++ G
Sbjct: 188 KV---KNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGP 244
Query: 327 NACGIESYA 335
C
Sbjct: 245 THCHFNFIH 253
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 145 bits (366), Expect = 3e-41
Identities = 51/274 (18%), Positives = 100/274 (36%), Gaps = 37/274 (13%)
Query: 89 QEILQRTGLRLTGKEKERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVLNPVESQGR 148
Q +L+R G + + R+ + LP +D V QGR
Sbjct: 22 QTVLKRRKKSGYGYIPDIADI-RDFSYTP-EKSVIAALPPKVDLTPP-----FQVYDQGR 74
Query: 149 CGSCWAFATTAILESQVALLKKTLYPLSKSQLVEC-----DHGNLNCNGG-NIDVAFEYV 202
GSC A A A ++ + K++ S+L G++N + G I + +
Sbjct: 75 IGSCTANALAAAIQFERIHDKQSP-EFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVL 133
Query: 203 KQYGLESQADYPYR--------NKENITFRCTYEK-----EKAKVFVQDTW--VTSGVDH 247
+ G+ + ++PY + + + + A+ + + V +DH
Sbjct: 134 HKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKITEYSRVAQDIDH 193
Query: 248 MMHLL-QSGPIGVYLN-HRLIESYDGNPIRRNDW-ACNPHKLDHAVAIVGYGEKNGILTW 304
+ L P + + + P+R + + HAV VGY ++ +
Sbjct: 194 LKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYDDEIRH--F 251
Query: 305 IVRNSWG-DIGPDHGYFQIERGANA-CGIESYAY 336
+RNSWG ++G + GYF + + + +
Sbjct: 252 RIRNSWGNNVG-EDGYFWMPYEYISNTQLADDFW 284
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 61.8 bits (149), Expect = 8e-11
Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 12/86 (13%)
Query: 142 PVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGN------------LN 189
V++Q R G+CW +++ + LES++ + K Y LS+ V + +
Sbjct: 24 SVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDVSF 83
Query: 190 CNGGNIDVAFEYVKQYGLESQADYPY 215
GG+ A ++ +GL + +
Sbjct: 84 SQGGSFYDALYGMETFGLVPEEEMRP 109
Score = 42.1 bits (98), Expect = 2e-04
Identities = 16/86 (18%), Positives = 29/86 (33%), Gaps = 1/86 (1%)
Query: 236 VQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGY 295
+ SG D L + + R+ + DH + I G
Sbjct: 266 DEKVQELSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIYGI 325
Query: 296 G-EKNGILTWIVRNSWGDIGPDHGYF 320
++ G ++V+NSWG +G +
Sbjct: 326 AKDQEGNEYYMVKNSWGTNSKYNGIW 351
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 49.5 bits (117), Expect = 1e-06
Identities = 54/422 (12%), Positives = 114/422 (27%), Gaps = 153/422 (36%)
Query: 6 CDHQETNTEQVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIK---- 61
QE ++ V + ++ + + + + YI + +R Y D+
Sbjct: 72 LSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNV 131
Query: 62 TRFEYFKQ---------------------DGKET-------DE----------YYGTSGS 83
+R + + + GK ++ +
Sbjct: 132 SRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKN 191
Query: 84 --SDRSPQEILQRTGLRLTGK----------EKERLEADRERVKKFLNERKKGPLPKSL- 130
S + E+LQ+ ++ K R+ + + +++ L + P L
Sbjct: 192 CNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSK---PYENCLL 248
Query: 131 --D--WRQSKVKVLNPVESQGRCGSCWAFATT----------AILESQVAL--LKKTLYP 174
N SC TT A + ++L TL P
Sbjct: 249 VLLNVQNAKAWNAFN--------LSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTP 300
Query: 175 -LSKSQL---VECDHGNL---NCNGGNIDVA------------FEYVKQYGLESQA---D 212
KS L ++C +L ++ ++ K + +
Sbjct: 301 DEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIE 360
Query: 213 YPYRNKENITFRCTYEKEKAKVFVQDTWVTSGV-------------DHMMH------LLQ 253
E +R +++ VF + + + +++ L++
Sbjct: 361 SSLNVLEPAEYRKMFDR--LSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSLVE 418
Query: 254 SGP------I-GVYLN-----------HR-LIESYDGNPIRRND-WACNPHKLDHAVAIV 293
P I +YL HR +++ Y N + D P LD
Sbjct: 419 KQPKESTISIPSIYLELKVKLENEYALHRSIVDHY--NIPKTFDSDDLIPPYLD------ 470
Query: 294 GY 295
Y
Sbjct: 471 QY 472
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 41.6 bits (97), Expect = 4e-04
Identities = 51/287 (17%), Positives = 92/287 (32%), Gaps = 100/287 (34%)
Query: 69 QDGKETDEYYGTSGSSDRSPQEILQRTGLRLTGKEKERLEADRERVKKFLNERKK----- 123
+D E +E G SP L ++ +E+++ + L K+
Sbjct: 325 EDSLENNE--GVP-----SPM-------LSISNLTQEQVQDYVNKTNSHLPAGKQVEISL 370
Query: 124 ----------GPLPKSL---DWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKK 170
GP P+SL + K K A + + +S++
Sbjct: 371 VNGAKNLVVSGP-PQSLYGLNLTLRKAK-----------------APSGLDQSRI----- 407
Query: 171 TLYPLSKSQLVECDHGNLNCNGGNIDVAF--EYVKQYGLESQADYPYRNKENITFRCTYE 228
P S+ +L + L + F + D K N++F
Sbjct: 408 ---PFSERKLK-FSNRFL-----PVASPFHSHLLVPASDLINKDLV---KNNVSFN---- 451
Query: 229 KEKAKVFVQDTWVTSGVDHMMHLLQSGPIGVYLNHRLIESYDGNPIRRNDW-ACNPHKLD 287
+ ++ V DT+ G D L+ + ++ R+++ P+ W K
Sbjct: 452 AKDIQIPVYDTF--DGSD-----LRV--LSGSISERIVDCIIRLPV---KWETTTQFKAT 499
Query: 288 HAVAIVGYGEKNGILTWIVRNSWG-----------DIGP--DHGYFQ 321
H + G G +G+ RN G DI P D+G+ Q
Sbjct: 500 HILDF-GPGGASGLGVLTHRNKDGTGVRVIVAGTLDINPDDDYGFKQ 545
Score = 32.3 bits (73), Expect = 0.30
Identities = 56/342 (16%), Positives = 97/342 (28%), Gaps = 115/342 (33%)
Query: 15 QVTYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKT------RFEYFK 68
+ V T S ++ + F + + + D+E T +F +
Sbjct: 17 EHVLLVPTASFF------IASQLQ--EQFNKILPEPTEGFAADDEPTTPAELVGKFLGYV 68
Query: 69 QDGKETDEYYGTSGSSDRSPQEILQRTGLR------LTGKEKERLEADRERVKKFLNERK 122
E + G D ++L L L G + L A L +
Sbjct: 69 SSLVEPSK----VGQFD----QVL-NLCLTEFENCYLEGNDIHALAAK-------LLQEN 112
Query: 123 KGPLPKSLDWRQSKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVE 182
L K+ + ++ + + K+ S S L
Sbjct: 113 DTTLVKTKELIKNYITAR-------------------------IMAKRPFDKKSNSALFR 147
Query: 183 -CDHGNLNC----NG-GNIDVAF-EYVKQYGLESQADYPYRNKENITF-------RCTYE 228
GN G GN D F E Y Y + I F
Sbjct: 148 AVGEGNAQLVAIFGGQGNTDDYFEELRDLY-----QTYHVLVGDLIKFSAETLSELIRTT 202
Query: 229 KEKAKVFVQ--D--TWVTS-----GVDHMMHLLQSGP-IGVY-LNHRLIESYDGNPIRRN 277
+ KVF Q + W+ + D+++ + S P IGV L H ++
Sbjct: 203 LDAEKVFTQGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYVV----------- 251
Query: 278 DWAC-----NPHKL-DHAVAIVGYGEKNGILTWIV---RNSW 310
P +L + G+ + G++T + +SW
Sbjct: 252 --TAKLLGFTPGELRSYLKGATGHSQ--GLVTAVAIAETDSW 289
Score = 31.6 bits (71), Expect = 0.50
Identities = 56/294 (19%), Positives = 86/294 (29%), Gaps = 109/294 (37%)
Query: 18 YNVNTDSAIYVWRDLAYDSIKQVDAFK-TYIVKWNRT------------YTDDNEIKTRF 64
Y + +A VW + A + K F IV N +N F
Sbjct: 1636 YK-TSKAAQDVW-NRADNHFKDTYGFSILDIVINNPVNLTIHFGGEKGKRIRENYSAMIF 1693
Query: 65 EYFKQDG--------KETDEYYGTSGSSDRSPQEILQRT-----GLRLTGKEKERLEADR 111
E DG KE +E+ ++ + RS + +L T L L K A
Sbjct: 1694 ETIV-DGKLKTEKIFKEINEH--STSYTFRSEKGLLSATQFTQPALTLMEK------AAF 1744
Query: 112 ERVKKFLNERKKGPLPK-------SL----------------D------WRQSKVKVLNP 142
E +K KG +P SL +R ++V P
Sbjct: 1745 EDLK------SKGLIPADATFAGHSLGEYAALASLADVMSIESLVEVVFYRGMTMQVAVP 1798
Query: 143 VESQGRCGSCWAFATTAILESQVAL------LKKTLYPLSKS--QLVECDHGNLNCNGGN 194
+ GR + AI +VA L+ + + K LVE N N
Sbjct: 1799 RDELGRSN----YGMIAINPGRVAASFSQEALQYVVERVGKRTGWLVEI--VNYNVENQ- 1851
Query: 195 IDVAFEYVKQY-------GLESQADYPYRNKE-NITFR-----CTYEKEKAKVF 235
QY L++ + K I + E+ + +F
Sbjct: 1852 ---------QYVAAGDLRALDTVTNVLNFIKLQKIDIIELQKSLSLEEVEGHLF 1896
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
vesicle, membrane, Ca structural protein; 7.94A {Bos
taurus}
Length = 190
Score = 37.1 bits (85), Expect = 0.004
Identities = 14/82 (17%), Positives = 24/82 (29%), Gaps = 24/82 (29%)
Query: 47 IVKWNRTYT------DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQEILQRTGLRLT 100
I KW D E+ ++ K+ +E+ +S Q
Sbjct: 87 IRKWREEQRKRLQELDAASKVMEQEWREKAKKDLEEWN-----QRQSEQ----------- 130
Query: 101 GKEKERLEADRERVKKFLNERK 122
EK + +R K F +
Sbjct: 131 -VEKNK-INNRIADKAFYQQPD 150
>2pff_A Fatty acid synthase subunit alpha, 3-oxoacyl-[acyl-carrier-PR;
fatty acid synthase, acyl-carrier-protein, beta-ketoacyl
RED beta-ketoacyl synthase, dehydratase; 4.00A
{Saccharomyces cerevisiae}
Length = 1688
Score = 35.2 bits (81), Expect = 0.031
Identities = 16/90 (17%), Positives = 27/90 (30%), Gaps = 31/90 (34%)
Query: 37 IKQVD--AFKTYIVKWNRTYT----DDNEIKTRFEYFKQDGKETDEYYGTSGSSDRSPQE 90
I + W + T DD ++K ++E
Sbjct: 861 ISYHNGNLKGRPYTGWVDSKTKEPVDDKDVKAKYE-----------------------TS 897
Query: 91 ILQRTGLRLTGKEKERLEADRERVKKFLNE 120
IL+ +G+RL E E K+ + E
Sbjct: 898 ILEHSGIRLI--EPELFNGYNPEKKEMIQE 925
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 33.2 bits (75), Expect = 0.11
Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 3/37 (8%)
Query: 287 DHAVAIVGYG---EKNGILTWIVRNSWGDIGPDHGYF 320
A+ I G L + V NSWG G +
Sbjct: 372 TAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408
Score = 32.8 bits (74), Expect = 0.14
Identities = 16/45 (35%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 141 NPVESQGRCGSCWAFATTAILESQVA-LLKKTLYPLSKSQLVECD 184
PV +Q G CW FA T L V L + LS++ L D
Sbjct: 66 TPVTNQKSSGRCWLFAATNQLRLNVLSELNLKEFELSQAYLFFYD 110
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 33.2 bits (75), Expect = 0.13
Identities = 10/38 (26%), Positives = 13/38 (34%), Gaps = 4/38 (10%)
Query: 287 DHAVAIVGYG----EKNGILTWIVRNSWGDIGPDHGYF 320
HA+ + W V NSWG+ GY
Sbjct: 370 THAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407
Score = 32.4 bits (73), Expect = 0.19
Identities = 10/46 (21%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 141 NPVESQGRCGSCWAFATTAILESQVA-LLKKTLYPLSKSQLVECDH 185
P+ +Q G W F+ ++ L + S+S L D
Sbjct: 61 KPITNQKSSGRSWIFSCLNVMRLPFMKKLNIEEFEFSQSYLFFWDK 106
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics
of pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 29.3 bits (66), Expect = 0.59
Identities = 10/53 (18%), Positives = 27/53 (50%), Gaps = 5/53 (9%)
Query: 17 TYNVNTDSAIYVWRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKTRFEYFKQ 69
+++ + +I+ W++ + DAF ++ + ++Y + E + R+ FK
Sbjct: 4 SHHHHHHGSIWEWKEAHFQ-----DAFSSFQAMYAKSYATEEEKQRRYAIFKN 51
>3b21_A ORF169B, OSPI; bacterial protein, effector, type 3 secretion SYST
unknown function; 2.01A {Shigella flexneri}
Length = 220
Score = 30.1 bits (67), Expect = 0.86
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 3/78 (3%)
Query: 87 SPQEILQRTGLRLTGKEKERLEADRERVKKFL---NERKKGPLPKSLDWRQSKVKVLNPV 143
SP+ ++ L+ T + E VKK L N + G + K + + K++NP
Sbjct: 5 SPEFMINGVSLQGTAGYEAHTEEGNVNVKKLLESLNSKSLGDMDKDSELAATLQKMINPS 64
Query: 144 ESQGRCGSCWAFATTAIL 161
G C C A A+L
Sbjct: 65 GGDGNCSGCALHACMAML 82
>1x9y_A Cysteine proteinase; half-barrel, barrel-sandwich-hybrid,
hydrolase; 2.50A {Staphylococcus aureus} SCOP: d.3.1.1
d.17.1.4
Length = 367
Score = 29.2 bits (64), Expect = 2.2
Identities = 30/176 (17%), Positives = 49/176 (27%), Gaps = 39/176 (22%)
Query: 135 SKVKVLNPVESQGRCGSCWAFATTAILESQVALLKKTLYPLSKSQLVECDHGNLNCNGGN 194
+ +K E Q C F+ A+L + + + ++ E +L
Sbjct: 192 NTLKNFKIREQQFDNSWCAGFSMAALLNATKNTDTYNAHDIMRTLYPEVSEQDLPNCATF 251
Query: 195 IDVAFEYVKQYGLESQADYPYRNKENITFRCTYEKEKAKVFVQDTWVTSGVDHMMHLLQS 254
+ EY K G + + + D V M+
Sbjct: 252 PNQMIEYGKSQGRDIHYQEGVPSYNQV----------------DQLTKDNVGIMILA--- 292
Query: 255 GPIGVYLNHRLIESYDGNPIRRNDWACNPHKLDHAVAIVGYGEKNGILTWIVRNSW 310
+S NP N L HA+A+VG + N I N W
Sbjct: 293 ------------QSVSQNP--------NDPHLGHALAVVGNAKINDQEKLIYWNPW 328
>1m65_A Hypothetical protein YCDX; structural genomics, beta-alpha-barrel,
metallo-enzyme, STRU function project, S2F, unknown
function; 1.57A {Escherichia coli} SCOP: c.6.3.1 PDB:
1m68_A 1pb0_A
Length = 245
Score = 27.8 bits (62), Expect = 4.7
Identities = 10/39 (25%), Positives = 13/39 (33%), Gaps = 6/39 (15%)
Query: 90 EILQRTGLRLTGKEKER-LEADRERVKKFLNERKKGPLP 127
+IL ER L R+ FL R P+
Sbjct: 207 KILDAVDF-----PPERILNVSPRRLLNFLESRGMAPIA 240
>1qht_A Protein (DNA polymerase); archaea, hyperthermostable, family B
polymer alpha family polymerase, transferase; 2.10A
{Thermococcus SP} SCOP: c.55.3.5 e.8.1.1 PDB: 1tgo_A
2xhb_A* 2vwj_A* 2vwk_A* 1wns_A* 1wn7_A 1qqc_A* 4ahc_A*
4ail_C* 3a2f_A* 2jgu_A* 1d5a_A
Length = 775
Score = 28.2 bits (63), Expect = 5.6
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 105 ERLEADRERVKKFLNERKKGPLPKSLDWRQSKVKVL 140
L +R+++K+ + K LD+RQ +K+L
Sbjct: 454 GDLLEERQKIKRKMKATVDPLEKKLLDYRQRAIKIL 489
>2edd_A Netrin receptor DCC; tumor suppressor protein DCC, colorectal
cancer suppressor, structural genomics, NPPSFA; NMR
{Homo sapiens}
Length = 123
Score = 26.4 bits (58), Expect = 6.7
Identities = 10/49 (20%), Positives = 21/49 (42%), Gaps = 1/49 (2%)
Query: 15 QVTYNVNTDSAIYV-WRDLAYDSIKQVDAFKTYIVKWNRTYTDDNEIKT 62
V T A+ V W D + ++ + Y V+W +++ + K+
Sbjct: 24 GVQAVALTHDAVRVSWADNSVPKNQKTSEVRLYTVRWRTSFSASAKYKS 72
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.316 0.133 0.410
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,199,069
Number of extensions: 301791
Number of successful extensions: 926
Number of sequences better than 10.0: 1
Number of HSP's gapped: 752
Number of HSP's successfully gapped: 63
Length of query: 341
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 247
Effective length of database: 4,077,219
Effective search space: 1007073093
Effective search space used: 1007073093
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (25.9 bits)