RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy2558
(348 letters)
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 347 bits (893), Expect = e-119
Identities = 106/333 (31%), Positives = 156/333 (46%), Gaps = 35/333 (10%)
Query: 31 HHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEH 81
HH HH++ +AL + F + ++Y E R IF L + +
Sbjct: 3 HHHHHLEGSALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQ 62
Query: 82 GSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-------PNITLPRAFDW 132
G Y G+N F+D++ E +A G + +P ++ P +FDW
Sbjct: 63 GLVSYTLGVNLFTDMTPEEMKAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDW 122
Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV--SLSEQELIDCDQEDDGCEG 190
R+ V+ VK+Q CGSSWAFS+TG IE S+SEQ+L+DC GC G
Sbjct: 123 RDQGMVSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALGCSG 182
Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAK 249
G +++AF + GG++ E YPY D C + +++GYV + DE +A
Sbjct: 183 GWMNDAFTYVAQN--GGIDSEGAYPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLAD 240
Query: 250 YLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
+ GP+AVA +A Y GV + C +H+VLIVGYG
Sbjct: 241 MVATKGPVAVAFDADDPFGSYSGGVYYNPT--C--ETNKFTHAVLIVGYG------NENG 290
Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
YW++KNSWG+GWG GYF++ R + CGI
Sbjct: 291 QDYWLVKNSWGDGWGLDGYFKIARNANNHCGIA 323
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 336 bits (863), Expect = e-115
Identities = 81/320 (25%), Positives = 129/320 (40%), Gaps = 34/320 (10%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
F + + NK+YAT + + F +++ +Q +G +N SDLS EF
Sbjct: 6 KTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQ-----SNGGA---INHLSDLSLDEF 57
Query: 100 QAKYLGFKLKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
+ ++L + A N P D R+ VT ++ Q CGS+WAF
Sbjct: 58 KNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAF 117
Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
S E Y A + + L+EQEL+DC GC G +I + I G+ +E
Sbjct: 118 SGVAATESAYLAYRDQSLDLAEQELVDCAS-QHGCHGDTIPRGIEYIQH---NGVVQESY 173
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVE-NGPMAVAINAY---ALQF 268
Y Y +++CR I+ Y + + + + L + + +AV I A +
Sbjct: 174 YRYVAREQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRH 232
Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
Y G + H+V IVGY + V YWI++NSW WG+ GY
Sbjct: 233 YDGRTIIQRD--N--GYQPNYHAVNIVGYS------NAQGVDYWIVRNSWDTNWGDNGYG 282
Query: 329 RLYRGDGSCGINDYVRSALV 348
I +Y ++
Sbjct: 283 YFAANIDLMMIEEYPYVVIL 302
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 330 bits (849), Expect = e-113
Identities = 105/309 (33%), Positives = 163/309 (52%), Gaps = 18/309 (5%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
A + + HN+ Y E R ++ N++ I+L Q+ G + +N F D+++
Sbjct: 10 AQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
EF+ GF+ + + PR+ DWRE VT VK+Q CGS WAFS T
Sbjct: 69 EEFRQVMNGFQNRKPRKGKVFQEP-LFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSAT 127
Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
G +EG KT +L+SLSEQ L+DC ++GC GG + AF + GGL+ E++Y
Sbjct: 128 GALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDN--GGLDSEESY 185
Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
PY +++C+ N K + G+V + + E + K + GP++VAI+A + FY G
Sbjct: 186 PYEATEESCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG 245
Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
+ C +E++ H VL+VGYG + T+ + YW++KNSWGE WG GY ++ +
Sbjct: 246 IYFEPD--C--SSEDMDHGVLVVGYGFESTESDNN--KYWLVKNSWGEEWGMGGYVKMAK 299
Query: 333 G-DGSCGIN 340
CGI
Sbjct: 300 DRRNHCGIA 308
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 330 bits (848), Expect = e-113
Identities = 109/312 (34%), Positives = 163/312 (52%), Gaps = 25/312 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
+ + + H K Y V+ SR I+ NL+ I + + G Y +N D+++
Sbjct: 9 THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
E K G K+ S++ + IP P + D+R+ VT VK+Q CGS WAFS
Sbjct: 69 EEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 128
Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
+ G +EG KT KL++LS Q L+DC E+DGC GG ++NAF + G++ E Y
Sbjct: 129 SVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKN--RGIDSEDAY 186
Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
PY G +++C N K GY + +E + + + GP++VAI+A + QFY
Sbjct: 187 PYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSK 246
Query: 272 GVSHPIQFFCDG--GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
GV + D ++NL+H+VL VGYG+ + +WIIKNSWGE WG KGY
Sbjct: 247 GV------YYDESCNSDNLNHAVLAVGYGIQKGN------KHWIIKNSWGENWGNKGYIL 294
Query: 330 LYRG-DGSCGIN 340
+ R + +CGI
Sbjct: 295 MARNKNNACGIA 306
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 322 bits (829), Expect = e-111
Identities = 105/222 (47%), Positives = 142/222 (63%), Gaps = 10/222 (4%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P +DWR AVT VKDQ MCGS WAFS TGN+EG + L+SLSEQEL+DCD+ D
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
C GG SNA+ I + GGLE E Y Y+G ++C+ + + +V I V +S++E
Sbjct: 62 ACMGGLPSNAYSAIKNL--GGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVELSQNEQK 119
Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
+A +L + GP++VAINA+ +QFY G+S P++ C + H+VL+VGYG
Sbjct: 120 LAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLC--SPWLIDHAVLLVGYG------QR 171
Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
VP+W IKNSWG WGEKGY+ L+RG G+CG+N SA+V
Sbjct: 172 SDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 324 bits (834), Expect = e-111
Identities = 96/313 (30%), Positives = 153/313 (48%), Gaps = 27/313 (8%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
++ + + + K Y E R I+ NL+ + L EH G++ G+N D++
Sbjct: 10 HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHN-LEHSMGMHSYDLGMNHLGDMT 68
Query: 96 TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
+ E + ++ + PN LP + DWRE VT VK Q CG++WAFS
Sbjct: 69 SEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGAAWAFSA 128
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
G +E KT KLVSLS Q L+DC E + GC GG ++ AF I+ G++ +
Sbjct: 129 VGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN--KGIDSDA 186
Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFY 269
+YPY+ D+ C+ + K + Y + E + + + GP++V ++A + Y
Sbjct: 187 SYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLY 246
Query: 270 VTGVSHPIQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
+GV + + +N++H VL+VGYG YW++KNSWG +GE+GY
Sbjct: 247 RSGV------YYEPSCTQNVNHGVLVVGYG------DLNGKEYWLVKNSWGHNFGEEGYI 294
Query: 329 RLYRG-DGSCGIN 340
R+ R CGI
Sbjct: 295 RMARNKGNHCGIA 307
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 324 bits (833), Expect = e-111
Identities = 109/310 (35%), Positives = 156/310 (50%), Gaps = 23/310 (7%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
L++ + +NK Y + R +I+ N++ IQ + G Y GLN+F+D++
Sbjct: 3 DLWHQWKRMYNKEYNG-ADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61
Query: 97 AEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
EF+AKYL + S N +P DWRE VT VKDQ CGS WAFST
Sbjct: 62 EEFKAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSGWAFST 121
Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
TG +EG Y + +S SEQ+L+DC + ++GC GG + NA+ + GLE E +
Sbjct: 122 TGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQ---FGLETESS 178
Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA-YALQFYVT 271
YPY + CR NK+ K+ G+ +V S E ++ + GP AVA++ Y +
Sbjct: 179 YPYTAVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRS 238
Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
G+ ++H+VL VGYG YWI+KNSWG WGE+GY R+
Sbjct: 239 GIYQS----QTCSPLRVNHAVLAVGYGTQGGT------DYWIVKNSWGLSWGERGYIRMV 288
Query: 332 RG-DGSCGIN 340
R CGI
Sbjct: 289 RNRGNMCGIA 298
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 321 bits (826), Expect = e-109
Identities = 120/326 (36%), Positives = 166/326 (50%), Gaps = 28/326 (8%)
Query: 27 DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGV 85
D ++ L ++ F H K+Y++ +E R IF N+ KI E G
Sbjct: 12 DLEICSLPKSLFQEQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVT 71
Query: 86 Y--GLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGV 141
Y +N+F D+S EF A K + + +P + L + DWR +AV+ V
Sbjct: 72 YSKAMNQFGDMSKEEFLAYVNRGKAQKPKHPENLRMPYVSSKKPLAASVDWRS-NAVSEV 130
Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDT 199
KDQ CGSSW+FSTTG +EG A + +L SLSEQ LIDC + GC+GG + +AF
Sbjct: 131 KDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSY 190
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMA 258
I G+ E YPY CR + + ++GY + S DE +A + + GP+A
Sbjct: 191 IHD---YGIMSESAYPYEAQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVA 247
Query: 259 VAINA-YALQFYVTGVSHPIQFFCDG--GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
VAI+A LQFY G+ F D +L+H VL+VGYG D + YWI+K
Sbjct: 248 VAIDATDELQFYSGGL------FYDQTCNQSDLNHGVLVVGYGSDNGQ------DYWILK 295
Query: 316 NSWGEGWGEKGYFRLYRG-DGSCGIN 340
NSWG GWGE GY+R R +CGI
Sbjct: 296 NSWGSGWGESGYWRQVRNYGNNCGIA 321
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 320 bits (822), Expect = e-109
Identities = 109/329 (33%), Positives = 152/329 (46%), Gaps = 27/329 (8%)
Query: 22 FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
F +VG + + LFN ++ HNK Y + E R IF NL I + ++
Sbjct: 2 FSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE-TNKKN 60
Query: 82 GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVT 139
S GLNEF+DLS EF KY+G + + I + LP DWR+ AVT
Sbjct: 61 NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVT 120
Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
V+ Q CGS WAFS +EG+ +T KLV LSEQEL+DC++ GC+GG A +
Sbjct: 121 PVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEY 180
Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRDETDMAKYLVENGPM 257
+ G+ YPY+ CR + VK +G V +E ++ + P+
Sbjct: 181 VAK---NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLN-AIAKQPV 236
Query: 258 AVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
+V + + Q Y G+ F + +V VGYG K Y +IK
Sbjct: 237 SVVVESKGRPFQLYKGGI------FEGPCGTKVDGAVTAVGYGKSGGK------GYILIK 284
Query: 316 NSWGEGWGEKGYFRLYRG----DGSCGIN 340
NSWG WGEKGY R+ R G CG+
Sbjct: 285 NSWGTAWGEKGYIRIKRAPGNSPGVCGLY 313
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 315 bits (809), Expect = e-108
Identities = 93/225 (41%), Positives = 123/225 (54%), Gaps = 15/225 (6%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P A DWR AVT VKDQ CGS WAFS GN+E + L +LSEQ L+ CD+ D
Sbjct: 2 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRD 243
GC GG ++NAF+ I+ + G + E +YPY + C + I G+V + +D
Sbjct: 62 GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 121
Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
E +A +L NGP+AVA++A + Y GV C E L H VL+VGY
Sbjct: 122 EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTS----CVS--EQLDHGVLLVGYN----- 170
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
AVPYWIIKNSW WGE+GY R+ +G C + + SA+V
Sbjct: 171 -DSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 214
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 298 bits (765), Expect = 7e-99
Identities = 87/317 (27%), Positives = 133/317 (41%), Gaps = 21/317 (6%)
Query: 47 EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
N + + ++ + ++ + + E+ L+ + + G
Sbjct: 125 VYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGH 184
Query: 107 KL---KPSYADRSVPAMIPNITLPRAFDWREYDA---VTGVKDQTMCGSSWAFSTTGNIE 160
+P A + + LP ++DWR V+ V++Q CGS ++F++ G +E
Sbjct: 185 SRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLE 244
Query: 161 GVYAAKTKKLVS--LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
T + LS QE++ C Q GCEGG GL EE +PY G
Sbjct: 245 ARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQD--FGLVEEACFPYTG 302
Query: 219 DDKACRLNKKATQVKINGYVSVSR-----DETDMAKYLVENGPMAVAINAYA-LQFYVTG 272
D C++ + + + Y V +E M LV +GPMAVA Y Y G
Sbjct: 303 TDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKG 362
Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
+ H E +H+VL+VGYG D + + YWI+KNSWG GWGE GYFR+
Sbjct: 363 IYHHTGLRDPFNPFELTNHAVLLVGYGTD----SASGMDYWIVKNSWGTGWGENGYFRIR 418
Query: 332 RGDGSCGINDYVRSALV 348
RG C I +A
Sbjct: 419 RGTDECAIESIAVAATP 435
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 289 bits (741), Expect = 6e-98
Identities = 83/234 (35%), Positives = 118/234 (50%), Gaps = 20/234 (8%)
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
A+DWR + VT VKDQ +CGS WAFS+ G++E YA + K L SEQEL+DC +
Sbjct: 19 LDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK 78
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRD 243
++GC GG I+NAFD ++ GGL + YPY + C L + + I YVS+
Sbjct: 79 NNGCYGGYITNAFDDMIDL--GGLCSQDDYPYVSNLPETCNLKRCNERYTIKSYVSI--P 134
Query: 244 ETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV--- 299
+ + L GP++++I A FY G C +H+V++VGYG+
Sbjct: 135 DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE---C---GAAPNHAVILVGYGMKDI 188
Query: 300 -DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGINDYVRSALV 348
+ + Y+IIKNSWG WGE GY L +C I L+
Sbjct: 189 YNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 287 bits (736), Expect = 4e-97
Identities = 78/248 (31%), Positives = 126/248 (50%), Gaps = 20/248 (8%)
Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
+Y + A+DWR + VT VKDQ CGS WAFS+ G++E YA + KL
Sbjct: 3 NYEEVIKKYRGEENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKL 62
Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKA 229
++LSEQEL+DC ++ GC GG I+NAF+ ++ GG+ + YPY C +++
Sbjct: 63 ITLSEQELVDCSFKNYGCNGGLINNAFEDMIEL--GGICPDGDYPYVSDAPNLCNIDRCT 120
Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENL 288
+ I Y+SV + + + L GP+++++ FY G+ C + L
Sbjct: 121 EKYGIKNYLSV--PDNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---C---GDQL 172
Query: 289 SHSVLIVGYGVD----RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
+H+V++VG+G+ + Y+IIKNSWG+ WGE+G+ + CG+
Sbjct: 173 NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLG 232
Query: 341 DYVRSALV 348
L+
Sbjct: 233 TDAFIPLI 240
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 284 bits (730), Expect = 1e-96
Identities = 79/219 (36%), Positives = 111/219 (50%), Gaps = 15/219 (6%)
Query: 127 PRAFDWREYDA-VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
P + DWR+ V+ VK+Q CGS W FSTTG +E A T K++SL+EQ+L+DC Q
Sbjct: 2 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SR 242
+ GC+GG S AF+ I G+ E TYPY+G D C+ + ++
Sbjct: 62 NNHGCQGGLPSQAFEYIRYN--KGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMN 119
Query: 243 DETDMAKYLVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
DE M + + P++ A Y G+ C + ++H+VL VGYG
Sbjct: 120 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSS--TSCHKTPDKVNHAVLAVGYG--- 174
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
+PYWI+KNSWG WG GYF + RG CG+
Sbjct: 175 ---EENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLA 210
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 282 bits (723), Expect = 3e-95
Identities = 88/227 (38%), Positives = 116/227 (51%), Gaps = 24/227 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
P ++DW + +T VK Q CGS WAFS TG IE +A T LVSLSEQELIDC E
Sbjct: 2 APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVDES 61
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV----- 240
+GC G +F+ ++ GG+ E YPY+ D C+ N+ +V I+ Y
Sbjct: 62 EGCYNGWHYQSFEWVVKH--GGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNE 119
Query: 241 ---SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
S E+ + V P++V+I+A FY G+ C ++H VLIVGY
Sbjct: 120 STESEAESSLQS-FVLEQPISVSIDAKDFHFYSGGIYDGGN--C-SSPYGINHFVLIVGY 175
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
G + V YWI KNSWGE WG GY R+ R G CG+N
Sbjct: 176 G------SEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMN 216
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 276 bits (709), Expect = 2e-93
Identities = 87/219 (39%), Positives = 127/219 (57%), Gaps = 13/219 (5%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-- 184
PR+ DWRE VT VK+Q CGS WAFS TG +EG KT +L+SLSEQ L+DC
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
++GC GG + AF + GGL+ E++YPY +++C+ N K + G+V + + E
Sbjct: 62 NEGCNGGLMDYAFQYVQDN--GGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQE 119
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ K + GP++VAI+A + FY G+ C +E++ H VL+VGYG + T
Sbjct: 120 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPD--C--SSEDMDHGVLVVGYGFEST 175
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
+ + YW++KNSWGE WG GY ++ + CGI
Sbjct: 176 ESDNN--KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIA 212
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 271 bits (696), Expect = 1e-91
Identities = 81/223 (36%), Positives = 107/223 (47%), Gaps = 28/223 (12%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
+P DWR+ AVT VK+Q CGS WAFS IEG+ +T L SEQEL+DCD+
Sbjct: 1 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRS 60
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRD 243
GC GG +A + G+ TYPY G + CR +K K +G V +
Sbjct: 61 YGCNGGYPWSALQLVAQ---YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYN 117
Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
E + + N P++V + A Q Y G+ F + H+V VGYG +
Sbjct: 118 EGALLY-SIANQPVSVVLEAAGKDFQLYRGGI------FVGPCGNKVDHAVAAVGYGPN- 169
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
Y +IKNSWG GWGE GY R+ RG G CG+
Sbjct: 170 ---------YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLY 203
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 272 bits (697), Expect = 1e-91
Identities = 62/225 (27%), Positives = 94/225 (41%), Gaps = 20/225 (8%)
Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
N P D R+ VT ++ Q CGS+WAFS E Y A ++ + L+EQEL+DC
Sbjct: 7 NGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELVDCA 66
Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
GC G +I + I G+ +E Y Y +++CR I+ Y +
Sbjct: 67 S-QHGCHGDTIPRGIEYIQH---NGVVQESYYRYVAREQSCRRPNAQR-FGISNYCQIYP 121
Query: 242 RDETDMAKYLVE-NGPMAVAINA---YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
+ + + L + + +AV I A + Y G + H+V IVGY
Sbjct: 122 PNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRD--N--GYQPNYHAVNIVGY 177
Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
+ YWI++NSW WG+ GY I +Y
Sbjct: 178 SNAQGV------DYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEY 216
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 270 bits (693), Expect = 4e-91
Identities = 88/218 (40%), Positives = 126/218 (57%), Gaps = 16/218 (7%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P + D+R+ VT VK+Q CGS WAFS+ G +EG KT KL++LS Q L+DC E+D
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDET 245
GC GG ++NAF + G++ E YPY G +++C N K GY + +E
Sbjct: 62 GCGGGYMTNAFQYVQKN--RGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119
Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+ + + GP++VAI+A + QFY GV + C ++NL+H+VL VGYG+ +
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDES--C--NSDNLNHAVLAVGYGIQKGN 175
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
+WIIKNSWGE WG KGY + R + +CGI
Sbjct: 176 ------KHWIIKNSWGENWGNKGYILMARNKNNACGIA 207
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 270 bits (692), Expect = 6e-91
Identities = 79/223 (35%), Positives = 117/223 (52%), Gaps = 28/223 (12%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
+P + DWR+ AVT V++Q CGS W FS+ +EG+ T +L+SLSEQEL+DC++
Sbjct: 1 IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCERRS 60
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRD 243
GC GG A + + G+ + YPY G + CR ++ K +VK +G V +
Sbjct: 61 YGCRGGFPLYALQYVAN---SGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGVGRVPRNN 117
Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
E + + + P+++ + A A Q Y G+ F ++ H+V VGYG D
Sbjct: 118 EQALIQ-RIAIQPVSIVVEAKGRAFQNYRGGI------FAGPCGTSIDHAVAAVGYGND- 169
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
Y +IKNSWG GWGE GY R+ RG G+CG+
Sbjct: 170 ---------YILIKNSWGTGWGEGGYIRIKRGSGNPQGACGVL 203
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 268 bits (687), Expect = 5e-90
Identities = 77/223 (34%), Positives = 107/223 (47%), Gaps = 24/223 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP DWR+ AVT V+ Q CGS WAFS +EG+ +T KLV LSEQEL+DC++
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS 60
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-SRD 243
GC+GG A + + G+ YPY+ CR + + K +G V +
Sbjct: 61 HGCKGGYPPYALEYVAK---NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNN 117
Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
E ++ + P++V + + Q Y G+ F + H+V VGYG
Sbjct: 118 EGNLLN-AIAKQPVSVVVESKGRPFQLYKGGI------FEGPCGTKVDHAVTAVGYGKSG 170
Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
K Y +IKNSWG WGEKGY R+ R G CG+
Sbjct: 171 GK------GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLY 207
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 267 bits (684), Expect = 1e-89
Identities = 92/222 (41%), Positives = 125/222 (56%), Gaps = 22/222 (9%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP + DWRE AV VK+Q CGS WAFST +EG+ T L+SLSEQ+L+DC +
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTAN 62
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDE 244
GC GG ++ AF I++ GG+ E+TYPYRG D C A V I+ Y +V S +E
Sbjct: 63 HGCRGGWMNPAFQFIVNN--GGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ K V N P++V ++A Q Y +G+ F N + +H++ +VGYG +
Sbjct: 121 QSLQK-AVANQPVSVTMDAAGRDFQLYRSGI------FTGSCNISANHALTVVGYGTEND 173
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
K +WI+KNSWG+ WGE GY R R DG CGI
Sbjct: 174 K------DFWIVKNSWGKNWGESGYIRAERNIENPDGKCGIT 209
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 266 bits (683), Expect = 1e-89
Identities = 83/222 (37%), Positives = 115/222 (51%), Gaps = 28/222 (12%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P + DWRE AVT VK+Q CGS WAFST IEG+ T +L+SLSEQEL+DC++
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRSH 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSV-SRDE 244
GC+GG + + ++ G+ E+ YPY CR K +V I GY V + DE
Sbjct: 62 GCDGGYQTTSLQYVVD---NGVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDE 118
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ + + N P++V ++ QFY G+ + N H+V VGYG
Sbjct: 119 ISLIQ-AIANQPVSVVTDSRGRGFQFYKGGI------YEGPCGTNTDHAVTAVGYGKT-- 169
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
Y ++KNSWG WGEKGY R+ R G+CG+
Sbjct: 170 --------YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVY 203
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 264 bits (676), Expect = 1e-88
Identities = 88/219 (40%), Positives = 116/219 (52%), Gaps = 24/219 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP DWR+ AVT VK+Q CGS WAFST +E + +T L+SLSEQEL+DCD+++
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
GC GG+ A+ I++ GG++ + YPY+ C+ K V I+GY V
Sbjct: 61 HGCLGGAFVFAYQYIINN--GGIDTQANYPYKAVQGPCQAASKV--VSIDGYNGVPFCNE 116
Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
K V P VAI+A Q Y +G+ F L+H V IVGY +
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGI------FSGPCGTKLNHGVTIVGYQAN--- 167
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
YWI++NSWG WGEKGY R+ R G G CGI
Sbjct: 168 -------YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGIA 199
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 263 bits (674), Expect = 6e-88
Identities = 100/225 (44%), Positives = 125/225 (55%), Gaps = 23/225 (10%)
Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
T+P + DWR+ AVT VKDQ CGS WAFST +EG+ KT KLVSLSEQEL+DCD +
Sbjct: 1 TVPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD 60
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-S 241
+ GC GG + AF+ I + GG+ E YPY D C ++K+ I+G+ +V
Sbjct: 61 QNQGCNGGLMDYAFEFIKQR--GGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPE 118
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
DE + K V N P++VAI+A QFY GV F L H V IVGYG
Sbjct: 119 NDENALLK-AVANQPVSVAIDAGGSDFQFYSEGV------FTGSCGTELDHGVAIVGYGT 171
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
YW +KNSWG WGEKGY R+ RG +G CGI
Sbjct: 172 TI-----DGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIA 211
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 262 bits (671), Expect = 1e-87
Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 20/222 (9%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
LP + DWRE VT VK Q CG+ WAFS G +E KT KLVSLS Q L+DC E
Sbjct: 2 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61
Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
+ GC GG ++ AF I+ G++ + +YPY+ D+ C+ + K + Y +
Sbjct: 62 YGNKGCNGGFMTTAFQYIIDN--KGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 119
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
E + + + GP++V ++A + Y +GV ++ +N++H VL+VGYG
Sbjct: 120 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV-----YYEPSCTQNVNHGVLVVGYG- 173
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIN 340
YW++KNSWG +GE+GY R+ R CGI
Sbjct: 174 -----DLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIA 210
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 261 bits (670), Expect = 2e-87
Identities = 94/223 (42%), Positives = 119/223 (53%), Gaps = 21/223 (9%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
LP DWR VT VKDQ CGS WAFSTTG +EG + AKT KLVSLSEQEL+DC +
Sbjct: 7 LPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAE 66
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SR 242
+ C GG +++AF ++ GG+ E YPY D+ CR VKI G+ V R
Sbjct: 67 GNQSCSGGEMNDAFQYVLDS--GGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRR 124
Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
E M + P+++AI A QFY GV C +L H VL+VGYG D
Sbjct: 125 SEAAMKA-ALAKSPVSIAIEADQMPFQFYHEGVFDA---SC---GTDLDHGVLLVGYGTD 177
Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGIN 340
+ +WI+KNSWG GWG GY + +G CG+
Sbjct: 178 KESKKD----FWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLL 216
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 260 bits (666), Expect = 5e-87
Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 23/222 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP DWR AV +K+Q CGS WAFS +E + +T +L+SLSEQEL+DCD
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60
Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDE 244
GC GG ++NAF I++ GG++ ++ YPY +C+ + V ING+ V +E
Sbjct: 61 HGCNGGWMNNAFQYIITN--GGIDTQQNYPYSAVQGSCKPYRLRV-VSINGFQRVTRNNE 117
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
+ + V + P++V + A Q Y +G+ F +H V+IVGYG
Sbjct: 118 SALQS-AVASQPVSVTVEAAGAPFQHYSSGI------FTGPCGTAQNHGVVIVGYGTQSG 170
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
K YWI++NSWG+ WG +GY + R G CGI
Sbjct: 171 K------NYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIA 206
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 261 bits (670), Expect = 7e-87
Identities = 98/226 (43%), Positives = 123/226 (54%), Gaps = 24/226 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
LP + DWR+ AVTGVKDQ CGS WAFST ++EG+ A +T LVSLSEQELIDCD D
Sbjct: 4 LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTAD 63
Query: 186 D-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSV 240
+ GC+GG + NAF+ I + GGL E YPYR C + + A V I+G+ V
Sbjct: 64 NDGCQGGLMDNAFEYIKNN--GGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDV 121
Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
+ + V N P++VA+ A A FY GV F L H V +VGYG
Sbjct: 122 PANSEEDLARAVANQPVSVAVEASGKAFMFYSEGV------FTGECGTELDHGVAVVGYG 175
Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
V YW +KNSWG WGE+GY R+ + G CGI
Sbjct: 176 V-----AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIA 216
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 258 bits (662), Expect = 3e-86
Identities = 89/222 (40%), Positives = 112/222 (50%), Gaps = 24/222 (10%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P++ DWR AVT VK+Q CGS WAFST +EG+ T L+ LSEQEL+DCD+
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSY 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDE 244
GC+GG + + + + G+ K YPY+ CR K VKI GY V S E
Sbjct: 62 GCKGGYQTTSLQYVAN---NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCE 118
Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
T + N P++V + A Q Y +GV F L H+V VGYG
Sbjct: 119 TSFLG-ALANQPLSVLVEAGGKPFQLYKSGV------FDGPCGTKLDHAVTAVGYGTSDG 171
Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
K Y IIKNSWG WGEKGY RL R G+CG+
Sbjct: 172 K------NYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVY 207
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 256 bits (656), Expect = 2e-85
Identities = 85/224 (37%), Positives = 120/224 (53%), Gaps = 24/224 (10%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
LP DWR AV +KDQ CGS+WAFST +EG+ T L+SLSEQEL+DC +
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV-S 241
GC+GG +++ F I++ GG+ E YPY ++ C L+ + + I+ Y +V
Sbjct: 61 NTRGCDGGFMTDGFQFIINN--GGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPY 118
Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
+E + V P++VA+ A Y Q Y +G+ F + H+V IVGYG
Sbjct: 119 NNEWALQT-AVAYQPVSVALEAAGYNFQHYSSGI------FTGPCGTAVDHAVTIVGYGT 171
Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGIN 340
+ + YWI+KNSWG WGE+GY R+ R G CGI
Sbjct: 172 E------GGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIA 209
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 253 bits (649), Expect = 9e-84
Identities = 59/255 (23%), Positives = 94/255 (36%), Gaps = 38/255 (14%)
Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
R D + V+DQ C +SW F++ ++E + K + +S + +C +
Sbjct: 10 CNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGE 69
Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG------------------DDKACRL 225
D C+ GS F I+ G L E YPY D+
Sbjct: 70 HKDRCDEGSSPMEFLQIIED-YGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILH 128
Query: 226 NKKATQ-VKINGYVSV---------SRDETDMAKYLVENGPMAVAINAY-ALQFYVTGVS 274
NK + GY + + ++ G + I A + + +G
Sbjct: 129 NKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGKK 188
Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-G 333
C G++ H+V IVGYG K YWI++NSWG WG++GYF++ G
Sbjct: 189 VK--NLC--GDDTADHAVNIVGYGNYVNSEGEK-KSYWIVRNSWGPYWGDEGYFKVDMYG 243
Query: 334 DGSCGINDYVRSALV 348
C N +
Sbjct: 244 PTHCHFNFIHSVVIF 258
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 237 bits (606), Expect = 5e-78
Identities = 83/221 (37%), Positives = 113/221 (51%), Gaps = 18/221 (8%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
P + DWR+ AVT VKDQ CG WAF TG IEG+ A T +L+S+SEQ+++DCD
Sbjct: 2 PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTXXX 61
Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
GG +AF +++ GG+ + YPY G D C LNK +I+GY +V +
Sbjct: 62 XXXGGDADDAFRWVITN--GGIASDANYPYTGVDGTCDLNKPIA-ARIDGYTNVPNSSSA 118
Query: 247 MAKYLVENGPMAVAINA--YALQFYVT-GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
+ V P++V I + Q Y G+ C + H+VLIVGYG +
Sbjct: 119 LLD-AVAKQPVSVNIYTSSTSFQLYTGPGIFAGSS--CSDDPATVDHTVLIVGYGSNG-- 173
Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
YWI+KNSWG WG GY + R DG C I+
Sbjct: 174 ---TNADYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAID 211
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 230 bits (589), Expect = 7e-74
Identities = 71/292 (24%), Positives = 103/292 (35%), Gaps = 53/292 (18%)
Query: 89 NEFSDLSTAEFQAKYLGFKLKPSYA----DRSVPAMIPNITLPRAFD----WREYDAVTG 140
+++ E + + G K + A R LP +FD W +
Sbjct: 32 GVMQNITLREAK-RLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQ 90
Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKT-KKLVSLSEQELIDCDQE-DDGCEGGSISNAFD 198
+ DQ+ CGS WA + + + + V +S +L+ C + DGC GG A+
Sbjct: 91 IADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA 150
Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRL------------------------NKKATQVKI 234
S GL + PY + + V
Sbjct: 151 YFSST---GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNY 207
Query: 235 NGYVSVS-RDETDMAKYLVENGPMAVAINAYA-LQFYVTGV-SHPIQFFCDGGNENLSHS 291
+ S + + E D + L GP VA + Y Y +GV H G H+
Sbjct: 208 RSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHHV------SGQYLGGHA 261
Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
V +VG+G VPYW I NSW WG GYF + RG CGI D
Sbjct: 262 VRLVGWGTSNG------VPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGG 307
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 216 bits (552), Expect = 6e-69
Identities = 67/277 (24%), Positives = 109/277 (39%), Gaps = 51/277 (18%)
Query: 99 FQAKYLGFKLKPSYADRSVPA--------MIPNITLPRAFDWREYDAV---TGVKDQTM- 146
F+ ++ + + LP+++DWR D V + ++Q +
Sbjct: 1 FRRGQTCYRPLRGDGLAPLGRTTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIP 60
Query: 147 --CGSSWAFSTTGNIEGVYAAKTK---KLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
CGS WA ++T + K K LS Q +IDC CEGG+ + +D
Sbjct: 61 QYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAH 119
Query: 202 SKLGGGLEEEKTYPYRGDD---------------KACRLNKKATQVKINGYVSVSRDETD 246
G+ +E Y+ D K C + T ++ Y S+S E
Sbjct: 120 QH---GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKM 176
Query: 247 MAKYLVENGPMAVAINAY-ALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
MA+ + NGP++ I A L Y G+ + ++H V + G+G+
Sbjct: 177 MAE-IYANGPISCGIMATERLANYTGGIYAEY------QDTTYINHVVSVAGWGIS---- 225
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
YWI++NSWGE WGE+G+ R+ G
Sbjct: 226 --DGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGA 260
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 214 bits (548), Expect = 7e-68
Identities = 75/299 (25%), Positives = 113/299 (37%), Gaps = 59/299 (19%)
Query: 87 GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY----DAVTGVK 142
G N F ++ + + + G L ++ LP +FD RE + ++
Sbjct: 28 GHN-FYNVDMSYLK-RLCGTFLGGP-KPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIR 84
Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVS--LSEQELIDC--DQEDDGCEGGSISNAFD 198
DQ CGS WAF I T VS +S ++L+ C DGC GG + A++
Sbjct: 85 DQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWN 144
Query: 199 TIMSKLGGGLEEEKTY-------PYRGDDKACRLNKKATQVKING--------------- 236
K GL Y PY +N G
Sbjct: 145 FWTRK---GLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSP 201
Query: 237 -----------YVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGG 284
SVS E D+ + +NGP+ A + Y+ Y +GV + G
Sbjct: 202 TYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV-----YQHVTG 256
Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
H++ I+G+GV+ PYW++ NSW WG+ G+F++ RG CGI V
Sbjct: 257 EMMGGHAIRILGWGVENG------TPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEV 309
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 212 bits (541), Expect = 1e-67
Identities = 65/266 (24%), Positives = 110/266 (41%), Gaps = 54/266 (20%)
Query: 124 ITLPRAFDWRE----YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKT--KKLVSLSEQE 177
+ +P +FD R+ ++ ++DQ+ CGS WAF + ++ K+ V LS +
Sbjct: 1 VEIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60
Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE--------KTYPYRGDDKACRLN-- 226
L+ C + GCEGG + A+D + + G + + YP+ + +
Sbjct: 61 LLSCCESCGLGCEGGILGPAWDYWVKE--GIVTGSSKENHAGCEPYPFPKCEHHTKGKYP 118
Query: 227 ------------------KKATQVKINGY-----VSVSRDETDMAKYLVENGPMAVAINA 263
K T + + +V DE + K +++ GP+
Sbjct: 119 PCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTV 178
Query: 264 YA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
Y Y +G+ + G H++ I+G+GV+ PYW+I NSW E W
Sbjct: 179 YEDFLNYKSGI-----YKHITGETLGGHAIRIIGWGVENK------APYWLIANSWNEDW 227
Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
GE GYFR+ RG C I V + +
Sbjct: 228 GENGYFRIVRGRDECSIESEVTAGRI 253
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 208 bits (532), Expect = 4e-66
Identities = 70/263 (26%), Positives = 105/263 (39%), Gaps = 56/263 (21%)
Query: 123 NITLPRAFDWREY----DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS--LSEQ 176
++ LP +FD RE + ++DQ CGS+WAF I T VS +S +
Sbjct: 4 DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63
Query: 177 ELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY-------PYRGDDKACRLNK 227
+L+ C DGC GG + A++ K GL Y PY +N
Sbjct: 64 DLLTCCGSMCGDGCNGGYPAEAWNFWTRK---GLVSGGLYESHVGCRPYSIPPCEAHVNG 120
Query: 228 --------------------------KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
K + SVS E D+ + +NGP+ A
Sbjct: 121 ARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAF 180
Query: 262 NAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
+ Y+ Y +GV + G H++ I+G+GV+ PYW++ NSW
Sbjct: 181 SVYSDFLLYKSGVYQHVT-----GEMMGGHAIRILGWGVENG------TPYWLVANSWNT 229
Query: 321 GWGEKGYFRLYRGDGSCGINDYV 343
WG+ G+F++ RG CGI V
Sbjct: 230 DWGDNGFFKILRGQDHCGIESEV 252
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 202 bits (514), Expect = 4e-63
Identities = 55/286 (19%), Positives = 85/286 (29%), Gaps = 43/286 (15%)
Query: 81 HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD----RSVPAMIPNITLPRAFDWREYD 136
H SG+ + G+ P AD P LP D
Sbjct: 10 HSSGLVPRGSHMQTVLKRRKKSGYGYI--PDIADIRDFSYTPEKSVIAALPPKVDLTPP- 66
Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----DDGCEGGS 192
V DQ GS A + I+ + + + I ++ + G+
Sbjct: 67 --FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGA 124
Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-----------------KKATQVKIN 235
+ ++ K G+ EK +PY R K A KI
Sbjct: 125 MIRDGIKVLHK--LGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKIT 182
Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
Y V++D + L P + Y + I H+VL
Sbjct: 183 EYSRVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVR-IPLPTKNDTLEGGHAVLC 241
Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL-YRGDGSCGI 339
VGY + ++ I+NSWG GE GYF + Y + +
Sbjct: 242 VGYD--------DEIRHFRIRNSWGNNVGEDGYFWMPYEYISNTQL 279
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 78.2 bits (193), Expect = 2e-18
Identities = 31/95 (32%), Positives = 39/95 (41%), Gaps = 9/95 (9%)
Query: 31 HHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
HH HH F+ F + K+YAT E R IF NL I +
Sbjct: 6 HHHHHGSIWEWKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY- 64
Query: 83 SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
S +N F DLS EF+ KYLGFK + +
Sbjct: 65 SYSLKMNHFGDLSRDEFRRKYLGFKKSRNLKSHHL 99
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
disorder P like protein, hydrolase; NMR {Drosophila
melanogaster}
Length = 80
Score = 58.9 bits (143), Expect = 1e-11
Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 5/75 (6%)
Query: 40 ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLST 96
+ + + +K Y E R I++ + +I+ + E G + G+N +DL+
Sbjct: 8 EEWVEYKSKFDKNYEAE-EDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTP 66
Query: 97 AEFQAKYLGFKLKPS 111
EF + G K+ P+
Sbjct: 67 EEFAQRS-GKKVPPN 80
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 53.7 bits (128), Expect = 3e-08
Identities = 31/166 (18%), Positives = 49/166 (29%), Gaps = 16/166 (9%)
Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED- 185
F + + +T VK+Q G+ W +S+ +E K LSE + D
Sbjct: 11 GFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDR 70
Query: 186 -----------DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
+GGS +A + + GL E+ N
Sbjct: 71 ADAAVRTHGDVSFSQGGSFYDALYGMETF---GLVPEEEMRPGMMYADTLSNHTELSALT 127
Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
+ V+ EN M A+ GV P +F
Sbjct: 128 DAMVAAIAKGKLRKLQSDENNAMLWKKAVAAVHQIYLGVP-PEKFT 172
Score = 46.8 bits (110), Expect = 5e-06
Identities = 18/86 (20%), Positives = 29/86 (33%), Gaps = 7/86 (8%)
Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
+DMA +L Q + T + + D H + I G D+
Sbjct: 275 SDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAY--DNYETTDDHGMQIYGIAKDQEG- 331
Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRL 330
Y+++KNSWG G +
Sbjct: 332 ----NEYYMVKNSWGTNSKYNGIWYA 353
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 47.4 bits (112), Expect = 6e-06
Identities = 48/274 (17%), Positives = 85/274 (31%), Gaps = 94/274 (34%)
Query: 60 YSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFK----LKPSY 112
+S L I N + + E G + Y F + + + + + FK SY
Sbjct: 1659 FSILDIVINNPVNLTIHFGGEKGKRIRENYSAMIFETIVDGKLKTEKI-FKEINEHSTSY 1717
Query: 113 ---ADRSV--------PAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
+++ + PA+ + +A D + + D T G S G
Sbjct: 1718 TFRSEKGLLSATQFTQPALT---LMEKAAFEDLKSKGLI--PADATFAGHS-----LG-- 1765
Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAF-DTIMSKLGGGLEEEKTYPYRG 218
E YAA L +S + ++ + YRG
Sbjct: 1766 E--YAA----LA----------------SLADVMS--IESLV--EV---VF------YRG 1790
Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYL---VENGPMAVAINAYALQFYVTGVSH 275
QV +V RDE + Y + G +A + + ALQ+ V V
Sbjct: 1791 ---------MTMQV------AVPRDELGRSNYGMIAINPGRVAASFSQEALQYVVERVGK 1835
Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
+ + N N+ + + G +A+
Sbjct: 1836 RTGWLVEIVNYNVENQQYVAA-G------DLRAL 1862
Score = 45.8 bits (108), Expect = 2e-05
Identities = 50/301 (16%), Positives = 96/301 (31%), Gaps = 96/301 (31%)
Query: 11 ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALF-------NYFLE-QH-NKTYATLVEYYS 61
AL +V G+ +L A+F +YF E + +TY LV
Sbjct: 144 ALFR---AVGE----GNAQLV--------AIFGGQGNTDDYFEELRDLYQTYHVLVGDL- 187
Query: 62 RLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQ--AKYLGFKLKPSYADRSV 117
+ + L +L++ T V+ GLN L YL S+
Sbjct: 188 -IKFSAETLS--ELIRTTLDAEKVFTQGLNILEWLENPSNTPDKDYL----------LSI 234
Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL--SE 175
P P I + + + + G + S+ TG+ +G+ A ++ S
Sbjct: 235 PISCPLIGVIQLAHYVVTAKLLGFTPGEL--RSYLKGATGHSQGLVTA---VAIAETDSW 289
Query: 176 QELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP-------------YRGDDK- 221
+ ++ +I+ F I G+ + YP +
Sbjct: 290 ESFFVSVRK-------AITVLF-FI------GVRCYEAYPNTSLPPSILEDSLENNEGVP 335
Query: 222 ----ACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA-INAYALQFYVTGVSH 275
+ L ++ Q + ++T+ +L + ++ +N A V+G
Sbjct: 336 SPMLSISNLTQEQVQDYV--------NKTN--SHLPAGKQVEISLVNG-AKNLVVSG--P 382
Query: 276 P 276
P
Sbjct: 383 P 383
Score = 33.9 bits (77), Expect = 0.089
Identities = 32/188 (17%), Positives = 50/188 (26%), Gaps = 57/188 (30%)
Query: 4 FYFFAGVA------LLSLTVSVS-----------SFM-VVGDEKLHHLHHVKHTALFNYF 45
FF GV SL S+ S M + + + + N
Sbjct: 302 VLFFIGVRCYEAYPNTSLPPSILEDSLENNEGVPSPMLSISNLTQEQVQ--DYVNKTNSH 359
Query: 46 LEQHNKTYATLVE---------YYSRLHIFSGNLRKIQ----LLQDTEHGSG--VYGLNE 90
L + +LV L+ + LRK + L Q S + N
Sbjct: 360 LPAGKQVEISLVNGAKNLVVSGPPQSLYGLNLTLRKAKAPSGLDQSRIPFSERKLKFSNR 419
Query: 91 FSDLS-TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWRE-------YDAVTGVK 142
F L + F + L +I + + YD G
Sbjct: 420 F--LPVASPFHSHLL----------VPASDLINKDLVKNNVSFNAKDIQIPVYDTFDG-S 466
Query: 143 D-QTMCGS 149
D + + GS
Sbjct: 467 DLRVLSGS 474
Score = 33.5 bits (76), Expect = 0.13
Identities = 47/314 (14%), Positives = 86/314 (27%), Gaps = 107/314 (34%)
Query: 46 LEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAK 102
L + + LV + + L++ L + TE G +E + + AE K
Sbjct: 11 LSHGSLEHVLLVP--TASFFIASQLQEQFNKILPEPTE---GFAADDEPT--TPAELVGK 63
Query: 103 YLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS---SWAFSTT 156
+LG+ ++PS + + N+ L F+ Y + G+ + A
Sbjct: 64 FLGYVSSLVEPSKVGQFDQVL--NLCL-TEFE-NCY----------LEGNDIHALAAKLL 109
Query: 157 GNIEGV-----------YAAKT---KKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMS 202
+ A+ + S L E + +++
Sbjct: 110 QENDTTLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEGNA-----------QLVA 158
Query: 203 KLGG-G-----LEE--E--KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
GG G EE + +TY D L K + + + R D K
Sbjct: 159 IFGGQGNTDDYFEELRDLYQTYHVLVGD----LIKFSAET----LSELIRTTLDAEKVFT 210
Query: 253 E--------NGP--------MAVA------INAYAL-QFYVT----GVSHPIQFFCDGGN 285
+ P + I L + VT G P +
Sbjct: 211 QGLNILEWLENPSNTPDKDYLLSIPISCPLIGVIQLAHYVVTAKLLGF-TPGELR----- 264
Query: 286 ENLSHSVLIVGYGV 299
+ G+
Sbjct: 265 -SYLKGATGHSQGL 277
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
photosynthetic reaction center, peripheral antenna; HET:
CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
Length = 154
Score = 43.4 bits (101), Expect = 2e-05
Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 2/30 (6%)
Query: 98 EFQA-KYLGFKLKPSYADRSVPAMIPNITL 126
E QA K L LK YAD S PA+ T+
Sbjct: 18 EKQALKKLQASLKL-YADDSAPALAIKATM 46
Score = 38.4 bits (88), Expect = 8e-04
Identities = 6/25 (24%), Positives = 14/25 (56%), Gaps = 1/25 (4%)
Query: 239 SVSRDETDMAKYLVENGPMAVAINA 263
++ + + + Y ++ P A+AI A
Sbjct: 21 ALKKLQASLKLYADDSAP-ALAIKA 44
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 43.6 bits (102), Expect = 7e-05
Identities = 19/84 (22%), Positives = 33/84 (39%), Gaps = 3/84 (3%)
Query: 246 DMAKYL-VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
D+ K+ + G + + + L F V+ + G ++H++
Sbjct: 326 DVGKHFNSKLGLSDMNLYDHELVFGVSLKNMNKAERLTFGESLMTHAMTFTAVSEK--DD 383
Query: 305 THKAVPYWIIKNSWGEGWGEKGYF 328
A W ++NSWGE G KGY
Sbjct: 384 QDGAFTKWRVENSWGEDHGHKGYL 407
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 41.6 bits (97), Expect = 2e-04
Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 6/84 (7%)
Query: 246 DMAKYL-VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
K++ + G M + + Y Y + ++ ++LI G VD T
Sbjct: 330 HTPKFMDKKTGVMDIELWNYPAIGYNLPQQKASRI--RYHESLMTAAMLITGCHVDET-- 385
Query: 305 THKAVPYWIIKNSWGEGWGEKGYF 328
K + ++NSWG+ G+ G +
Sbjct: 386 -SKLPLRYRVENSWGKDSGKDGLY 408
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 34.1 bits (77), Expect = 0.088
Identities = 45/343 (13%), Positives = 88/343 (25%), Gaps = 102/343 (29%)
Query: 33 LHHVKHTALFNYFLEQHNKTYATLV--EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
L +V++ +N F N + L+ + S L
Sbjct: 250 LLNVQNAKAWNAF----NLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMT-------- 297
Query: 91 FSDLSTAEFQAKYLGFKLK--PSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
+ KYL + + P + P ++I W + V K T
Sbjct: 298 LTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTT 357
Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI----------DCDQEDDGCEGGSISN 195
+ SS ++ +L I D + D
Sbjct: 358 IIESSLNVLEPAEYRKMF----DRLSVFPPSAHIPTILLSLIWFDVIKSDV--------- 404
Query: 196 AFDTIMSKL-GGGLEEE--KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
+++KL L E+ K L K +E + + +V
Sbjct: 405 --MVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKL------------ENEYALHRSIV 450
Query: 253 ENGPMAVAINAYALQ--FYVTGVSHPI--QFFCDGGNENLSHSVLIVGYGVDRTKFTHKA 308
+ Y + F + P Q+F SH +G+ + + +
Sbjct: 451 D---------HYNIPKTFDSDDLIPPYLDQYFY-------SH----IGHHLKNIEHPERM 490
Query: 309 VPY--------WI---IKNSWGEGWGEKGY-------FRLYRG 333
+ ++ I++ W G + Y+
Sbjct: 491 TLFRMVFLDFRFLEQKIRHD-STAWNASGSILNTLQQLKFYKP 532
Score = 29.1 bits (64), Expect = 2.9
Identities = 41/331 (12%), Positives = 95/331 (28%), Gaps = 112/331 (33%)
Query: 19 VSSFM--VVGDEKLHHLHHVKHTA-----LFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
V ++ E++ H+ K LF L + + VE
Sbjct: 38 VQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQEEMVQKFVE------------- 84
Query: 72 KIQLLQDTEHGSGVYG--LNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIP-NIT 125
++L+ Y ++ E + + ++ DR N++
Sbjct: 85 --EVLRIN------YKFLMSPIK----TEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNVS 132
Query: 126 LPRAFDWREYDAVTGVKDQT------M--CGSSW-AFSTTGNIE-------GVY------ 163
+ + + A+ ++ + G +W A + + ++
Sbjct: 133 RLQPY-LKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKN 191
Query: 164 ----AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
+ L L Q ID + +I +I ++L L + K Y
Sbjct: 192 CNSPETVLEMLQKLLYQ--IDPNWTSRSDHSSNIKLRIHSIQAEL-RRLLKSKPYE-NCL 247
Query: 219 ---DD-------KA----CRL--------------NKKATQVKINGYV-SVSRDETD--M 247
+ A C++ T + ++ + +++ DE +
Sbjct: 248 LVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL 307
Query: 248 AKYL-----------VENGPMAVAINAYALQ 267
KYL + P ++I A +++
Sbjct: 308 LKYLDCRPQDLPREVLTTNPRRLSIIAESIR 338
Score = 27.1 bits (59), Expect = 10.0
Identities = 17/105 (16%), Positives = 33/105 (31%), Gaps = 16/105 (15%)
Query: 31 HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
HH HH+ ++ +H Y ++ + F N + +QD + E
Sbjct: 2 HHHHHM------DFETGEHQYQYKDILSVF--EDAFVDNF-DCKDVQDMP--KSILSKEE 50
Query: 91 FSDL---STAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
+ A L F S + V + + L + +
Sbjct: 51 IDHIIMSKDAVSGTLRL-FWTLLSKQEEMVQKFVEEV-LRINYKF 93
>3t4l_A Histidine kinase 4; PAS domain, hormone receptor, endop reticulum;
HET: ZEA; 1.53A {Arabidopsis thaliana} PDB: 3t4k_A*
3t4j_A* 3t4o_A* 3t4q_A* 3t4s_A* 3t4t_A*
Length = 270
Score = 32.1 bits (72), Expect = 0.22
Identities = 14/90 (15%), Positives = 27/90 (30%), Gaps = 4/90 (4%)
Query: 72 KIQLLQDTEHG----SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLP 127
+LL+ G VY + + + E A G+ + V ++ +
Sbjct: 155 PFRLLETHHLGVVLTFPVYKSSLPENPTVEERIAATAGYLGGAFDVESLVENLLGQLAGN 214
Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
+A YD M G+ +
Sbjct: 215 QAIVVHVYDITNASDPLVMYGNQDEEADRS 244
>3n89_A Defective in GERM LINE development protein 3, ISO; KH domains, RNA
binding, cell cycle; 2.79A {Caenorhabditis elegans}
Length = 376
Score = 29.1 bits (64), Expect = 2.2
Identities = 10/62 (16%), Positives = 18/62 (29%)
Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
G+ + GNI+ V A+ + L + D I+ +
Sbjct: 230 NETRGNIYEIKVVGNIDNVLKARRYIMDLLPISMCFNIKNTDMAEPSRVSDRNIHMIIDE 289
Query: 204 LG 205
G
Sbjct: 290 SG 291
>3eye_A PTS system N-acetylgalactosamine-specific IIB component 1;
structural genomics, phosphotransferase, PSI-2, protein
structure initiative; 1.45A {Escherichia coli O157}
Length = 168
Score = 27.4 bits (61), Expect = 4.6
Identities = 4/30 (13%), Positives = 13/30 (43%)
Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
+ + +I+ V V + +++ + G
Sbjct: 113 HFSEGKKQISSKVYVDDQDLTDLRFIKQRG 142
>1ble_A Fructose permease; phosphotransferase, sugar transport; 2.90A
{Bacillus subtilis} SCOP: c.38.1.1
Length = 163
Score = 27.0 bits (60), Expect = 5.7
Identities = 6/30 (20%), Positives = 13/30 (43%)
Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
+ + +I VSV+ + + L + G
Sbjct: 109 RFENHRRQITKSVSVTEQDIKAFETLSDKG 138
>1nrz_A PTS system, sorbose-specific IIB component; beta sheet core,
flanking helices, right handed beta-alpha-B crossover,
transferase; 1.75A {Klebsiella pneumoniae} SCOP:
c.38.1.1
Length = 164
Score = 27.0 bits (60), Expect = 5.7
Identities = 4/30 (13%), Positives = 12/30 (40%)
Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
+ + ++ VS+ + + L + G
Sbjct: 108 AWRPGKKQLTKAVSLDPQDIQAFRELDKLG 137
>3bde_A MLL5499 protein; stress responsive A/B barrel domain, structural
genomics, JO center for structural genomics, JCSG; 1.79A
{Mesorhizobium loti}
Length = 120
Score = 26.8 bits (59), Expect = 6.0
Identities = 24/125 (19%), Positives = 44/125 (35%), Gaps = 17/125 (13%)
Query: 25 VGDEKLHHLHH---------VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
+G +K+HH HH ++HT +F K + +E L L I+
Sbjct: 1 MGSDKIHHHHHHENLYFQGMIRHTVVFTL------KHASHSLEEKRFLVDAKKILSAIRG 54
Query: 76 LQDTEHGSGVYGLNEFSDLSTAEF--QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWR 133
+ E + ++ + EF QA Y + P + +P + D+
Sbjct: 55 VTHFEQLRQISPKIDYHFGFSMEFADQAAYTRYNDHPDHVAFVRDRWVPEVEKFLEIDYV 114
Query: 134 EYDAV 138
+V
Sbjct: 115 PLGSV 119
>1vsq_C Mannose-specific phosphotransferase enzyme IIB component; sugar
transport, complex (transferase/phosphocarrier,
cytoplasm, membrane; HET: NEP; NMR {Escherichia coli}
PDB: 2jzn_C 2jzo_D 2jzh_A
Length = 165
Score = 27.0 bits (60), Expect = 6.5
Identities = 7/30 (23%), Positives = 13/30 (43%)
Query: 226 NKKATQVKINGYVSVSRDETDMAKYLVENG 255
+ + ++N VSV + + K L G
Sbjct: 111 AFRQGKTQVNNAVSVDEKDIEAFKKLNARG 140
>3ic6_A Putative methylase family protein; putative methylase family Pro
structural genomics, PSI-2, protein structure
initiative; 2.59A {Neisseria gonorrhoeae fa 1090}
Length = 223
Score = 26.9 bits (60), Expect = 8.4
Identities = 5/11 (45%), Positives = 8/11 (72%)
Query: 287 NLSHSVLIVGY 297
NL+ +V +V Y
Sbjct: 178 NLAQAVQVVCY 188
>3kty_A Probable methyltransferase; alpha-beta-alpha sandwich, structural
genomics, PSI-2, prote structure initiative; 2.30A
{Bordetella pertussis}
Length = 173
Score = 26.7 bits (60), Expect = 8.5
Identities = 1/11 (9%), Positives = 7/11 (63%)
Query: 287 NLSHSVLIVGY 297
N++ ++ + +
Sbjct: 156 NVAQALQLAAW 166
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.319 0.136 0.414
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 5,411,979
Number of extensions: 323407
Number of successful extensions: 940
Number of sequences better than 10.0: 1
Number of HSP's gapped: 727
Number of HSP's successfully gapped: 66
Length of query: 348
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 254
Effective length of database: 4,077,219
Effective search space: 1035613626
Effective search space used: 1035613626
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (25.9 bits)