RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy1705
(309 letters)
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 334 bits (858), Expect = e-115
Identities = 117/302 (38%), Positives = 170/302 (56%), Gaps = 8/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K ++K Y K + ++L W+ N K I HN EA G+H Y L NHL D+ +++
Sbjct: 15 KKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK 74
Query: 74 MTRLTHSRIRRTLVRSPESNESVL-IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
MT L + E PD +D+R+KG++TP NQ CG+C+AFS A++
Sbjct: 75 MTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALE 134
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ K T ++ LS Q +VDC S N GC GG + N YVQ G+ E+ YPY G++
Sbjct: 135 GQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE 192
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C + + +P +E ALK +A VGP++V+I+AS +FQ Y+ G+Y DE
Sbjct: 193 ESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDE 252
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
+C SD +NHA+L VGY WI+KN W +WG+ GY+ + R NN CGIAN A +
Sbjct: 253 SCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 312
Query: 308 LI 309
+
Sbjct: 313 KM 314
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 329 bits (847), Expect = e-113
Identities = 106/304 (34%), Positives = 161/304 (52%), Gaps = 13/304 (4%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
+ + Y + ++ W+ N K I HNQE ++G H +T+ N D+ + + M
Sbjct: 17 AMHNRLYGMNE-EGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVM 75
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+ + R+ V P +DWREKG++TP NQ CG+C+AFS A++GQ
Sbjct: 76 NGFQNRKPRKGKVFQEPLFYEA--PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQ 133
Query: 135 IFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+F+ T + LS Q +VDCS GN GC GG + YVQ GGL EE YPY+ +
Sbjct: 134 MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES 193
Query: 195 CKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC 254
CK+ V + + + + P+ E AL +ATVGPI+V+I+A +F Y GIY + C
Sbjct: 194 CKYNPKYSVANDAGFVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC 252
Query: 255 TSDYVNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAV 305
+S+ ++H +L+VGY W++KN W WG GY+ + + N CGIA+ A
Sbjct: 253 SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 312
Query: 306 YALI 309
Y +
Sbjct: 313 YPTV 316
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 326 bits (839), Expect = e-112
Identities = 104/301 (34%), Positives = 159/301 (52%), Gaps = 8/301 (2%)
Query: 15 KKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM 74
K Y K Y++K ++ ++L W+ N K + HN E G+H Y L NHL D+ + M
Sbjct: 17 KTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLM 76
Query: 75 TRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQ 134
+ L + + + +L PD +DWREKG +T Q CGA +AFS A++ Q
Sbjct: 77 SSLRVPSQWQRNITYKSNPNRIL-PDSVDWREKGCVTEVKYQGSCGAAWAFSAVGALEAQ 135
Query: 135 IFKSTSEIEELSIQQVVDCSIIS-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
+ T ++ LS Q +VDCS GN GC GG + Y+ G+ + YPYK
Sbjct: 136 LKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ 195
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ S ++ LP E LK +A GP++V ++A +F LY SG+Y + +
Sbjct: 196 KCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPS 255
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
CT + VNH +L+VGY + W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 256 CTQN-VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPE 314
Query: 309 I 309
I
Sbjct: 315 I 315
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 324 bits (833), Expect = e-111
Identities = 97/301 (32%), Positives = 147/301 (48%), Gaps = 8/301 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
++ Y K+Y A D ++ W+ N K I HN GL YTL N +D+ + +
Sbjct: 9 KRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAK 67
Query: 74 MTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQG 133
P + +PD +DWRE G++T +Q +CG+ +AFS ++G
Sbjct: 68 YLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSGWAFSTTGTMEG 127
Query: 134 QIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQS 193
Q K+ S QQ+VDCS GN GC GG + N Y++ GL E YPY +
Sbjct: 128 QYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLK-QFGLETESSYPYTAVEG 186
Query: 194 ICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEA 253
C++ + V ++ + + E LK + GP AV+++ F +Y SGIY +
Sbjct: 187 QCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVE-SDFMMYRSGIYQSQT 245
Query: 254 CTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYAL 308
C+ VNHA+L VGY + WI+KN W WG+ GY+ + R N CGIA+ A +
Sbjct: 246 CSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGERGYIRMVRNRGNMCGIASLASLPM 305
Query: 309 I 309
+
Sbjct: 306 V 306
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larval midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 314 bits (807), Expect = e-107
Identities = 93/309 (30%), Positives = 145/309 (46%), Gaps = 16/309 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ Y + Y ++ +K +Q + HN++ +QGL YTL N +D+ P
Sbjct: 26 KTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTPEEMKAY 85
Query: 74 MTRLTHSRIRRTLVRSPESNESVL------IPDHLDWREKGFITPDWNQEDCGACYAFSI 127
L ++ E + P DWR++G ++P NQ CG+ +AFS
Sbjct: 86 THGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGMVSPVKNQGSCGSSWAFSS 145
Query: 128 ASAIQGQIFKSTSE--IEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEED 185
AI+ Q+ + +S QQ+VDC LGC+GG + + YV GG+ E
Sbjct: 146 TGAIESQMKIANGAGYDSSVSEQQLVDCV--PNALGCSGGWMNDAFTYVAQNGGIDSEGA 203
Query: 186 YPYKGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYA 245
YPY+ C + + +S + L DE+ L +AT GP+AV+ +A F Y+
Sbjct: 204 YPYEMADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDAD-DPFGSYS 262
Query: 246 SGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRG-NNRCGI 300
G+Y + C ++ HA+L+VGY N W++KN W WG +GY + R NN CGI
Sbjct: 263 GGVYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWGLDGYFKIARNANNHCGI 322
Query: 301 ANYAVYALI 309
A A +
Sbjct: 323 AGVASVPTL 331
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larval midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 312 bits (801), Expect = e-106
Identities = 90/302 (29%), Positives = 153/302 (50%), Gaps = 9/302 (2%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ +KK Y + +++L ++ N KI HN + ++G Y+ N D+ ++
Sbjct: 31 KLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAY 90
Query: 74 MTRLTHSRIRR-TLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
+ R + + +R P + + +DWR ++ +Q CG+ ++FS A++
Sbjct: 91 VNRGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNA-VSEVKDQGQCGSSWSFSTTGAVE 149
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
GQ+ + LS Q ++DCS GN GC GG + + +Y+ G+M E YPY+ +
Sbjct: 150 GQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIH-DYGIMSESAYPYEAQG 208
Query: 193 SICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDE 252
C+F V +S + LP DE++L + GP+AV+I+A+ Q Y+ G++ D+
Sbjct: 209 DYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDAT-DELQFYSGGLFYDQ 267
Query: 253 ACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYA 307
C +NH +L+VGY ++ WILKN W WG++GY R N CGIA A Y
Sbjct: 268 TCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGESGYWRQVRNYGNNCGIATAASYP 327
Query: 308 LI 309
+
Sbjct: 328 AL 329
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 277 bits (711), Expect = 6e-93
Identities = 85/303 (28%), Positives = 142/303 (46%), Gaps = 20/303 (6%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+ K Y + ++ N I N++ + Y L N +DL + ++
Sbjct: 26 MLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN----NSYWLGLNEFADLSNDEFNEK 81
Query: 74 MT-RLTHSRIRRTLVRSPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQ 132
L + I ++ + + V +P+++DWR+KG +TP +Q CG+C+AFS + ++
Sbjct: 82 YVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVE 141
Query: 133 GQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ 192
G T ++ ELS Q++VDC + GC GG L YV G+ YPYK KQ
Sbjct: 142 GINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYVA-KNGIHLRSKYPYKAKQ 198
Query: 193 SICKFKRPNI-VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDD 251
C+ K+ +V S + P +E L +A P++V + + FQLY GI++
Sbjct: 199 GTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEG 257
Query: 252 EACTSDYVNHAMLLVGY----TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANY 303
C + V+ A+ VGY + ++KN W WG+ GY+ +KR CG+
Sbjct: 258 P-CGTK-VDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKS 315
Query: 304 AVY 306
+ Y
Sbjct: 316 SYY 318
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house dust mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 274 bits (703), Expect = 7e-92
Identities = 67/307 (21%), Positives = 120/307 (39%), Gaps = 27/307 (8%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
+K + K Y + + ++ + K + ++ NHLSDL +
Sbjct: 12 KKAFNKSYATFEDEEAARKNFLESVKYVQSNG-----------GAINHLSDLSLDEFKNR 60
Query: 74 MTRLTHSRIRRTLVRSPESNESVL-----IPDHLDWREKGFITPDWNQEDCGACYAFSIA 128
+ + + P +D R+ +TP Q CG+ +AFS
Sbjct: 61 FLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGV 120
Query: 129 SAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
+A + + +L+ Q++VDC+ GC G ++ + Y+Q G+++E Y Y
Sbjct: 121 AATESAYLAYRDQSLDLAEQELVDCA---SQHGCHGDTIPRGIEYIQ-HNGVVQESYYRY 176
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLA-TVGPIAVSINAS-PHTFQLYAS 246
++ C+ IS++ + P + + ++ LA T IAV I F+ Y
Sbjct: 177 VAREQSCRRPNAQ-RFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDG 235
Query: 247 GIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIAN 302
HA+ +VGY WI++N W +WGDNGY Y + I
Sbjct: 236 RTIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295
Query: 303 YAVYALI 309
Y ++
Sbjct: 296 YPYVVIL 302
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 270 bits (694), Expect = 8e-92
Identities = 92/216 (42%), Positives = 132/216 (61%), Gaps = 7/216 (3%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
PD +D+R+KG++TP NQ CG+C+AFS A++GQ+ K T ++ LS Q +VDC S
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SE 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + N YVQ G+ E+ YPY G++ C + + +P +E
Sbjct: 60 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEK 119
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSWI 274
ALK +A VGP++V+I+AS +FQ Y+ G+Y DE+C SD +NHA+L VGY WI
Sbjct: 120 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 179
Query: 275 LKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
+KN W +WG+ GY+ + R NN CGIAN A + +
Sbjct: 180 IKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 215
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 268 bits (689), Expect = 6e-91
Identities = 80/217 (36%), Positives = 121/217 (55%), Gaps = 8/217 (3%)
Query: 99 PDHLDWREKG-FITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
P +DWR+KG F++P NQ CG+C+ FS A++ + +T ++ L+ QQ+VDC+
Sbjct: 2 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
N GC GG Y+++ G+M E+ YPYKG+ CKF+ + + + + DE
Sbjct: 62 NNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDE 121
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC--TSDYVNHAMLLVGY-TRNS-- 272
A+ +A P++ + + + F +Y GIY +C T D VNHA+L VGY N
Sbjct: 122 EAMVEAVALYNPVSFAFEVT-NDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIP 180
Query: 273 -WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYAL 308
WI+KN W WG NGY ++RG N CG+A A Y +
Sbjct: 181 YWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 217
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 268 bits (687), Expect = 1e-90
Identities = 89/220 (40%), Positives = 128/220 (58%), Gaps = 10/220 (4%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWREKG++TP NQ CG+C+AFS A++GQ+F+ T + LS Q +VDCS G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
N GC GG + YVQ GGL EE YPY+ + CK+ V + + + + P+ E
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEK 120
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY--------TR 270
AL +ATVGPI+V+I+A +F Y GIY + C+S+ ++H +L+VGY
Sbjct: 121 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 180
Query: 271 NSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVYALI 309
W++KN W WG GY+ + + N CGIA+ A Y +
Sbjct: 181 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 220
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 267 bits (686), Expect = 1e-90
Identities = 81/216 (37%), Positives = 122/216 (56%), Gaps = 7/216 (3%)
Query: 97 LIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSII 156
++PD +DWREKG +T Q CGAC+AFS A++ Q+ T ++ LS Q +VDCS
Sbjct: 1 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 60
Query: 157 S-GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQ 215
GN GC GG + Y+ G+ + YPYK C++ S ++ LP
Sbjct: 61 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYG 120
Query: 216 DEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRN 271
E LK +A GP++V ++A +F LY SG+Y + +CT + VNH +L+VGY +
Sbjct: 121 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQN-VNHGVLVVGYGDLNGKE 179
Query: 272 SWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYAVY 306
W++KN W H++G+ GY+ + R N CGIA++ Y
Sbjct: 180 YWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 215
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 259 bits (664), Expect = 3e-87
Identities = 80/217 (36%), Positives = 113/217 (52%), Gaps = 11/217 (5%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P DWR KG +T +Q CG+C+AFS+ ++GQ F + + LS Q+++DC
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KM 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
+ C GG N + ++ GGL E+DY Y+G C+F V I L Q+E
Sbjct: 60 DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVEL-SQNEQ 118
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYASGIYD--DEACTSDYVNHAMLLVGY-TRNS--- 272
L LA GPI+V+INA Q Y GI C+ ++HA+LLVGY R+
Sbjct: 119 KLAAWLAKRGPISVAINAF--GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPF 176
Query: 273 WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
W +KN W WG+ GY YL RG+ CG+ A A++
Sbjct: 177 WAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213
>3f5v_A DER P 1 allergen; allergy, asthma, dust mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 254 bits (652), Expect = 3e-85
Identities = 57/224 (25%), Positives = 98/224 (43%), Gaps = 11/224 (4%)
Query: 92 SNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVV 151
+ + P +D R+ +TP Q CG+ +AFS +A + + +L+ Q++V
Sbjct: 4 CSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQELV 63
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV 211
DC + GC G ++ + Y+Q G+++E Y Y ++ C+ IS++
Sbjct: 64 DC---ASQHGCHGDTIPRGIEYIQ-HNGVVQESYYRYVAREQSCRRPNAQ-RFGISNYCQ 118
Query: 212 LPPQDEHALKVTLA-TVGPIAVSINAS-PHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+ P + + ++ LA T IAV I F+ Y HA+ +VGY
Sbjct: 119 IYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYS 178
Query: 269 TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
WI++N W +WGDNGY Y + I Y ++
Sbjct: 179 NAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPYVVIL 222
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 253 bits (649), Expect = 7e-85
Identities = 77/218 (35%), Positives = 122/218 (55%), Gaps = 12/218 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR +G +TP +Q DCG+C+AFS A++G T ++ LS Q+++DCS
Sbjct: 7 LPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAE 66
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
GN C+GG + + YV +GG+ E+ YPY + C+ + VV I + +P + E
Sbjct: 67 GNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEKVVKILGFKDVPRRSE 126
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY------TRN 271
A+K LA P++++I A FQ Y G++D C +D ++H +LLVGY ++
Sbjct: 127 AAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDAS-CGTD-LDHGVLLVGYGTDKESKKD 183
Query: 272 SWILKNWWSHHWGDNGYMYLKRG---NNRCGIANYAVY 306
WI+KN W WG +GYMY+ +CG+ A +
Sbjct: 184 FWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASF 221
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 250 bits (641), Expect = 1e-83
Identities = 72/217 (33%), Positives = 119/217 (54%), Gaps = 11/217 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+PD++DWR G + +Q CG+ +AFS +A++G +T ++ LS Q++VDC
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
GC GG + + ++ GG+ E +YPY ++ C V I ++ +P +
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNS 272
E AL+ +A P++V++ A+ + FQ Y+SGI+ T+ V+HA+ +VGY +
Sbjct: 121 EWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGPCGTA--VDHAVTIVGYGTEGGIDY 177
Query: 273 WILKNWWSHHWGDNGYMYLKRG---NNRCGIANYAVY 306
WI+KN W WG+ GYM ++R +CGIA A Y
Sbjct: 178 WIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKASY 214
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 249 bits (638), Expect = 3e-83
Identities = 81/217 (37%), Positives = 125/217 (57%), Gaps = 13/217 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+PD +DWRE G + P NQ CG+C+AFS +A++G T ++ LS QQ+VDC+ +
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCT--T 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
N GC GG + ++ GG+ EE YPY+G+ IC VV I S+ +P +E
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNE 120
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
+L+ +A P++V+++A+ FQLY SGI+ S NHA+ +VGY ++ W
Sbjct: 121 QSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGSCNIS--ANHALTVVGYGTENDKDFW 177
Query: 274 ILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
I+KN W +WG++GY+ +R +CGI +A Y
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASY 214
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic dyad,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 248 bits (635), Expect = 7e-83
Identities = 74/217 (34%), Positives = 122/217 (56%), Gaps = 14/217 (6%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR KG + NQ+ CG+C+AFS +A++ T ++ LS Q++VDC +
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCD--T 58
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
+ GC GG + N Y+ GG+ +++YPY Q CK R VV I+ + + +E
Sbjct: 59 ASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRLR-VVSINGFQRVTRNNE 117
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
AL+ +A+ P++V++ A+ FQ Y+SGI+ T+ NH +++VGY +N W
Sbjct: 118 SALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGPCGTA--QNHGVVIVGYGTQSGKNYW 174
Query: 274 ILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
I++N W +WG+ GY++++R CGIA Y
Sbjct: 175 IVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSY 211
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 248 bits (635), Expect = 7e-83
Identities = 74/213 (34%), Positives = 114/213 (53%), Gaps = 11/213 (5%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P+ +DWREKG +TP NQ CG+C+AFS + I+G T ++ LS Q+++DC
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCE--RR 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQDE 217
+ GC GG +L YV G+ E +YPY+ KQ C+ K V I+ + +P DE
Sbjct: 60 SHGCDGGYQTTSLQYVV-DNGVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDE 118
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKN 277
+L +A P++V ++ FQ Y GIY+ T+ +HA+ VGY + +LKN
Sbjct: 119 ISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGPCGTN--TDHAVTAVGYGKTYLLLKN 175
Query: 278 WWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
W +WG+ GY+ +KR CG+ + +
Sbjct: 176 SWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFF 208
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, seed germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 248 bits (635), Expect = 1e-82
Identities = 73/219 (33%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR+KG +T +Q CG+C+AFS A++G T+++ LS Q++VDC
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT-D 60
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI-VVDISSWSVLPPQD 216
N GC GG + +++ GG+ E +YPY+ C + N V I +P D
Sbjct: 61 QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPEND 120
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-----TRN 271
E+AL +A P++V+I+A FQ Y+ G++ C ++ ++H + +VGY
Sbjct: 121 ENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS-CGTE-LDHGVAIVGYGTTIDGTK 177
Query: 272 SWILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
W +KN W WG+ GY+ ++RG CGIA A Y
Sbjct: 178 YWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASY 216
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 246 bits (631), Expect = 3e-82
Identities = 73/214 (34%), Positives = 110/214 (51%), Gaps = 11/214 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP+++DWR+KG +TP NQ CG+C+AFS I+G I T + + S Q+++DC
Sbjct: 1 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCD--R 58
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
+ GC GG + L V G+ YPY+G Q C+ + + P +
Sbjct: 59 RSYGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYN 117
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILK 276
E AL ++A P++V + A+ FQLY GI+ V+HA+ VGY N ++K
Sbjct: 118 EGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGPCGNK--VDHAVAAVGYGPNYILIK 174
Query: 277 NWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
N W WG+NGY+ +KRG CG+ + Y
Sbjct: 175 NSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFY 208
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 246 bits (630), Expect = 3e-82
Identities = 71/214 (33%), Positives = 112/214 (52%), Gaps = 11/214 (5%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
IP +DWR+KG +TP NQ CG+C+ FS +A++G T ++ LS Q+++DC
Sbjct: 1 IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCE--R 58
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
+ GC GG L YV G+ + YPY+G Q C+ + V +P +
Sbjct: 59 RSYGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGVGRVPRNN 117
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILK 276
E AL +A + P+++ + A FQ Y GI+ TS ++HA+ VGY + ++K
Sbjct: 118 EQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTS--IDHAVAAVGYGNDYILIK 174
Query: 277 NWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
N W WG+ GY+ +KRG CG+ + +V+
Sbjct: 175 NSWGTGWGEGGYIRIKRGSGNPQGACGVLSDSVF 208
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 243 bits (623), Expect = 5e-81
Identities = 69/220 (31%), Positives = 114/220 (51%), Gaps = 16/220 (7%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR +G +T +Q CG+C+AFS ++ Q F + + LS Q +V C
Sbjct: 2 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD--KT 59
Query: 159 NLGCAGGSLRNTLNYVQFA--GGLMKEEDYPYKGKQ---SICKFKRPNIVVDISSWSVLP 213
+ GC+GG + N ++ G + E+ YPY + C + I+ L
Sbjct: 60 DSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVEL- 118
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS 272
PQDE + LA GP+AV+++AS ++ Y G+ C S+ ++H +LLVGY +
Sbjct: 119 PQDEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVMTS--CVSEQLDHGVLLVGYNDSAA 174
Query: 273 ---WILKNWWSHHWGDNGYMYLKRGNNRCGIANYAVYALI 309
WI+KN W+ WG+ GY+ + +G+N+C + A A++
Sbjct: 175 VPYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 214
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 242 bits (620), Expect = 1e-80
Identities = 73/218 (33%), Positives = 112/218 (51%), Gaps = 15/218 (6%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P+++DWR+KG +TP +Q CG+C+AFS + ++G T ++ ELS Q++VDC
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCE--R 58
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN-IVVDISSWSVLPPQD 216
+ GC GG L YV G+ YPYK KQ C+ K+ +V S + P +
Sbjct: 59 RSHGCKGGYPPYALEYVA-KNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNN 117
Query: 217 EHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS--- 272
E L +A P++V + + FQLY GI++ T V+HA+ VGY
Sbjct: 118 EGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGPCGTK--VDHAVTAVGYGKSGGKGY 174
Query: 273 WILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
++KN W WG+ GY+ +KR CG+ + Y
Sbjct: 175 ILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYY 212
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 242 bits (620), Expect = 4e-80
Identities = 74/228 (32%), Positives = 111/228 (48%), Gaps = 21/228 (9%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
P+ DW +KG IT Q CG+ +AFS AI+ +T + LS Q+++DC
Sbjct: 2 APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCV--D 59
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSV------ 211
+ GC G + +V GG+ E DYPYK + CK V I ++ V
Sbjct: 60 ESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNE 119
Query: 212 -LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEAC-TSDYVNHAMLLVGY- 268
+ E +L+ + PI+VSI+A F Y+ GIYD C + +NH +L+VGY
Sbjct: 120 STESEAESSLQSFVLE-QPISVSIDAK--DFHFYSGGIYDGGNCSSPYGINHFVLIVGYG 176
Query: 269 TRNS---WILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVYALI 309
+ + WI KN W WG +GY+ ++R CG+ +A Y +I
Sbjct: 177 SEDGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMNYFASYPII 224
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 242 bits (619), Expect = 8e-80
Identities = 71/222 (31%), Positives = 116/222 (52%), Gaps = 17/222 (7%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P +DWR+KG +T +Q CG+C+AFS +++G T + LS Q+++DC +
Sbjct: 4 LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-A 62
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPN----IVVDISSWSVLP 213
N GC GG + N Y++ GGL+ E YPY+ + C R +VV I +P
Sbjct: 63 DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122
Query: 214 PQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----- 268
E L +A P++V++ AS F Y+ G++ E C ++ ++H + +VGY
Sbjct: 123 ANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGE-CGTE-LDHGVAVVGYGVAED 179
Query: 269 TRNSWILKNWWSHHWGDNGYMYLKRGNN----RCGIANYAVY 306
+ W +KN W WG+ GY+ +++ + CGIA A Y
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASY 221
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 239 bits (613), Expect = 2e-79
Identities = 78/217 (35%), Positives = 114/217 (52%), Gaps = 15/217 (6%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR KG +TP NQ CG+C+AFS + ++G T + ELS Q++VDC
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCD--KH 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICK-FKRPNIVVDISSWSVLPPQDE 217
+ GC GG +L YV G+ + YPY+ KQ C+ +P V I+ + +P E
Sbjct: 60 SYGCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCE 118
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY----TRNSW 273
+ LA P++V + A FQLY SG++D C + ++HA+ VGY +N
Sbjct: 119 TSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP-CGTK-LDHAVTAVGYGTSDGKNYI 175
Query: 274 ILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVY 306
I+KN W +WG+ GYM LKR CG+ + Y
Sbjct: 176 IIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYY 212
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 239 bits (612), Expect = 1e-76
Identities = 83/315 (26%), Positives = 127/315 (40%), Gaps = 28/315 (8%)
Query: 16 KYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEM- 74
+ K + + ++ +H + N L I+
Sbjct: 126 YVNTAHLKNSQEKYSNRLYKYDHNFVKAINA---IQKSWTATTYMEYETLTLGDMIRRSG 182
Query: 75 -TRLTHSRIRRTLVRSPESNESVLIPDHLDWREK---GFITPDWNQEDCGACYAFSIASA 130
R + + + + + +P DWR F++P NQ CG+CY+F+
Sbjct: 183 GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGM 242
Query: 131 IQGQIFKST--SEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPY 188
++ +I T S+ LS Q+VV CS GC GG GL++E +PY
Sbjct: 243 LEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPY 300
Query: 189 KGKQSICKFKRPNIVVDISSWSVLPP----QDEHALKVTLATVGPIAVSINASPHTFQLY 244
G S CK K S + + +E +K+ L GP+AV+ F Y
Sbjct: 301 TGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVY-DDFLHY 359
Query: 245 ASGIYD-----DEACTSDYVNHAMLLVGY-TRNS-----WILKNWWSHHWGDNGYMYLKR 293
GIY D + NHA+LLVGY T ++ WI+KN W WG+NGY ++R
Sbjct: 360 KKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRR 419
Query: 294 GNNRCGIANYAVYAL 308
G + C I + AV A
Sbjct: 420 GTDECAIESIAVAAT 434
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 231 bits (591), Expect = 2e-76
Identities = 81/211 (38%), Positives = 117/211 (55%), Gaps = 9/211 (4%)
Query: 98 IPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIIS 157
+P+ +DWR+KG +TP NQ CG+C+AFS S ++ T + LS Q++VDC
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCD--K 58
Query: 158 GNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDE 217
N GC GG+ Y+ GG+ + +YPYK Q C+ VV I ++ +P +E
Sbjct: 59 KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK--VVSIDGYNGVPFCNE 116
Query: 218 HALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKN 277
ALK +A V P V+I+AS FQ Y+SGI+ T +NH + +VGY N WI++N
Sbjct: 117 XALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCGTK--LNHGVTIVGYQANYWIVRN 173
Query: 278 WWSHHWGDNGYMYLKR--GNNRCGIANYAVY 306
W +WG+ GY+ + R G CGIA Y
Sbjct: 174 SWGRYWGEKGYIRMLRVGGCGLCGIARLPYY 204
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 231 bits (591), Expect = 8e-76
Identities = 77/238 (32%), Positives = 112/238 (47%), Gaps = 27/238 (11%)
Query: 91 ESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQV 150
+ ++ L DWR G +TP +Q CG+C+AFS +++ Q + S Q++
Sbjct: 13 KPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQEL 72
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPNIVVDISSW 209
VDCS N GC GG + N + + GGL ++DYPY C KR N I S+
Sbjct: 73 VDCS--VKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNERYTIKSY 130
Query: 210 SVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY- 268
+P + K L +GPI++SI AS F Y G YD E C + NHA++LVGY
Sbjct: 131 VSIP---DDKFKEALRYLGPISISIAAS-DDFAFYRGGFYDGE-CGAA-PNHAVILVGYG 184
Query: 269 TRNS-------------WILKNWWSHHWGDNGYMYLKRG----NNRCGIANYAVYALI 309
++ +I+KN W WG+ GY+ L+ C I A L+
Sbjct: 185 MKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPLL 242
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 230 bits (590), Expect = 1e-75
Identities = 70/235 (29%), Positives = 114/235 (48%), Gaps = 27/235 (11%)
Query: 94 ESVLIPDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDC 153
E DWR +TP +Q++CG+C+AFS +++ Q +++ LS Q++VDC
Sbjct: 14 EENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC 73
Query: 154 SIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQ-SICKFKRPNIVVDISSWSVL 212
S N GC GG + N + GG+ + DYPY ++C R I ++ +
Sbjct: 74 S--FKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDAPNLCNIDRCTEKYGIKNYLSV 131
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRN 271
P ++ LK L +GPI++S+ S F Y GI+D E C +NHA++LVG+ +
Sbjct: 132 P---DNKLKEALRFLGPISISVAVS-DDFAFYKEGIFDGE-CGDQ-LNHAVMLVGFGMKE 185
Query: 272 S-------------WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVYALI 309
+I+KN W WG+ G++ ++ + CG+ A LI
Sbjct: 186 IVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPLI 240
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 209 bits (534), Expect = 6e-67
Identities = 42/249 (16%), Positives = 71/249 (28%), Gaps = 39/249 (15%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
D +Q +C + F+ ++ E ++S V +C
Sbjct: 11 NRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEH 70
Query: 159 NLGCAGGSLRNT-LNYVQFAGGLMKEEDYPYKGKQSI------------------CKFKR 199
C GS L ++ G L E +YPY + +
Sbjct: 71 KDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNK 130
Query: 200 PN---------IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD 250
+ + +K + G + I A + SG
Sbjct: 131 NEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEF-SGKKV 189
Query: 251 DEACTSDYVNHAMLLVGY-TRNS--------WILKNWWSHHWGDNGYMYLKR-GNNRCGI 300
C D +HA+ +VGY + WI++N W +WGD GY + G C
Sbjct: 190 KNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHF 249
Query: 301 ANYAVYALI 309
+
Sbjct: 250 NFIHSVVIF 258
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 195 bits (498), Expect = 7e-61
Identities = 64/308 (20%), Positives = 110/308 (35%), Gaps = 43/308 (13%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLREN-HLSDLHPRHYIKEM--TRLTHSRIRRTLVRSPESN 93
+ + N+ + + + + + ++ R + + ++ R E
Sbjct: 11 SKAFVDRVNRLNR---GIWKAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEE 67
Query: 94 ESVLIPDHLD----WREKGFITPDWNQEDCGACYAFSIASAIQGQIF-KSTSEIEELSIQ 148
+P D W I +Q CG+C+A + ASA+ + + +S
Sbjct: 68 ARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAG 127
Query: 149 QVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNI------ 202
++ C G+ GC GG Y + GL+ + PY K N
Sbjct: 128 DLLACCSDCGD-GCNGGDPDRAWAYFS-STGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQ 185
Query: 203 ------------------VVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLY 244
VV+ SW+ Q E L GP V+ + F Y
Sbjct: 186 FNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYE-DFIAY 244
Query: 245 ASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGNNRCGI 300
SG+Y + HA+ LVG+ T N W + N W+ WG +GY ++RG++ CGI
Sbjct: 245 NSGVYHHVSGQYL-GGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGI 303
Query: 301 ANYAVYAL 308
+ +
Sbjct: 304 EDGGSAGI 311
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 189 bits (481), Expect = 1e-59
Identities = 73/220 (33%), Positives = 109/220 (49%), Gaps = 17/220 (7%)
Query: 99 PDHLDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISG 158
P +DWR+KG +T +Q CG C+AF AI+G +T + +S QQ+VDC +
Sbjct: 2 PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCD--TX 59
Query: 159 NLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIVVDISSWSVLPPQDEH 218
GG + +V GG+ + +YPY G C I I ++ + P
Sbjct: 60 XXXXXGGDADDAFRWVITNGGIASDANYPYTGVDGTCDLN-KPIAARIDGYTNV-PNSSS 117
Query: 219 ALKVTLATVGPIAVSINASPHTFQLYAS-GIYDDEACTSDY--VNHAMLLVGYTRNS--- 272
AL +A P++V+I S +FQLY GI+ +C+ D V+H +L+VGY N
Sbjct: 118 ALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTNA 176
Query: 273 --WILKNWWSHHWGDNGYMYLKRGNNR----CGIANYAVY 306
WI+KN W WG +GY+ ++R NR C I + Y
Sbjct: 177 DYWIVKNSWGTEWGIDGYILIRRNTNRPDGVCAIDAWGSY 216
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 184 bits (470), Expect = 3e-57
Identities = 56/275 (20%), Positives = 94/275 (34%), Gaps = 44/275 (16%)
Query: 70 YIKEMTRLTHSRIRRTLVRSPESNESVLIPDHLDWREKG---FITPDWNQED---CGACY 123
Y R T R E +P DWR + + NQ CG+C+
Sbjct: 8 YRPLRGDGLAPLGRTTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCW 67
Query: 124 AFSIASAIQGQIF---KSTSEIEELSIQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGL 180
A + SA+ +I K LS+Q V+DC C GG+ + +Y G+
Sbjct: 68 AHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDLSVWDYAH-QHGI 123
Query: 181 MKEEDYPYKGKQSICKFKRPNI---------------VVDISSWSVLPPQDEHALKVTLA 225
E Y+ K C + + + + +
Sbjct: 124 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGS--LSGREKMMAEIY 181
Query: 226 TVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSH 281
GPI+ I A+ Y GIY + T+ +NH + + G+ + WI++N W
Sbjct: 182 ANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGISDGTEYWIVRNSWGE 239
Query: 282 HWGDNGYMYLKRGNNRCG--------IANYAVYAL 308
WG+ G++ + + G I + +
Sbjct: 240 PWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD 274
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 179 bits (457), Expect = 9e-55
Identities = 58/313 (18%), Positives = 103/313 (32%), Gaps = 51/313 (16%)
Query: 37 NHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESV 96
+ + ++ N+ + ++ ++ + + V E +
Sbjct: 11 SDELVNYVNKRN----TTWQA-GHNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLK-- 63
Query: 97 LIPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSIQQV 150
+P D RE+ I +Q CG+C+AF AI +I T E+S + +
Sbjct: 64 -LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 122
Query: 151 VDCSIISGNLGCAGGSLRNTLNYVQFAGGL------MKEEDYPYKGK------------- 191
+ C GC GG N+ G + PY
Sbjct: 123 LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC 182
Query: 192 ---------QSICKFKRPNIVVDISSWSVLP---PQDEHALKVTLATVGPIAVSINASPH 239
IC+ + E + + GP+ + +
Sbjct: 183 TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS- 241
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
F LY SG+Y HA+ ++G+ N W++ N W+ WGDNG+ + RG
Sbjct: 242 DFLLYKSGVYQHVTGEMM-GGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQ 300
Query: 296 NRCGIANYAVYAL 308
+ CGI + V +
Sbjct: 301 DHCGIESEVVAGI 313
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 175 bits (445), Expect = 1e-53
Identities = 61/254 (24%), Positives = 91/254 (35%), Gaps = 45/254 (17%)
Query: 98 IPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSIQQVV 151
IP D R+K I +Q CG+C+AF A+ + + + ELS ++
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62
Query: 152 DCSIISGNLGCAGGSLRNTLNYVQFAGGLMKE--------EDYPYKGKQSICKFKRPNIV 203
C G GC GG L +Y G + E YP+ + K K P
Sbjct: 63 SCCESCGL-GCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121
Query: 204 VDISSW------------------------SVLPPQDEHALKVTLATVGPIAVSINASPH 239
I S DE A++ + GP+
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYED 181
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLKRGN 295
F Y SGIY + HA+ ++G+ N W++ N W+ WG+NGY + RG
Sbjct: 182 -FLNYKSGIYKHITGETL-GGHAIRIIGWGVENKAPYWLIANSWNEDWGENGYFRIVRGR 239
Query: 296 NRCGIANYAVYALI 309
+ C I + I
Sbjct: 240 DECSIESEVTAGRI 253
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 175 bits (445), Expect = 1e-53
Identities = 55/256 (21%), Positives = 87/256 (33%), Gaps = 43/256 (16%)
Query: 94 ESVLIPDHLDWREK----GFITPDWNQEDCGACYAFSIASAIQGQIFKST--SEIEELSI 147
E + +P D RE+ I +Q CG+ +AF AI +I T E+S
Sbjct: 3 EDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSA 62
Query: 148 QQVVDCSIISGNLGCAGGSLRNTLNYVQFAGG------LMKEEDYPYKGK---------- 191
+ ++ C GC GG N+ G PY
Sbjct: 63 EDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGAR 122
Query: 192 ------------QSICKFKRPNIVVDISSWSVLP---PQDEHALKVTLATVGPIAVSINA 236
IC+ + E + + GP+ + +
Sbjct: 123 PPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182
Query: 237 SPHTFQLYASGIYDDEACTSDYVNHAMLLVGY-TRNS---WILKNWWSHHWGDNGYMYLK 292
F LY SG+Y HA+ ++G+ N W++ N W+ WGDNG+ +
Sbjct: 183 YS-DFLLYKSGVYQHVTG-EMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKIL 240
Query: 293 RGNNRCGIANYAVYAL 308
RG + CGI + V +
Sbjct: 241 RGQDHCGIESEVVAGI 256
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 175 bits (446), Expect = 2e-53
Identities = 50/288 (17%), Positives = 101/288 (35%), Gaps = 34/288 (11%)
Query: 42 HTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLIPDH 101
H H+ + G R +H+ + R + R +PE + +P
Sbjct: 5 HHHHHHSS----GLVPRGSHMQTVLKRRKKSGYGYIPDIADIRDFSYTPEKSVIAALPPK 60
Query: 102 LDWREKGFITPDWNQEDCGACYAFSIASAIQGQIFKS--TSEIEELSIQQVVDCSIISGN 159
+D ++Q G+C A ++A+AIQ + + E + + I G+
Sbjct: 61 VDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGH 117
Query: 160 LGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGKQSICKFKRPNIV---------------- 203
+ G++ V G+ E+++PY + + +
Sbjct: 118 VNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQCYKDAQ 177
Query: 204 -VDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYD---DEACTSDYV 259
I+ +S + QD LK LA P + ++ S +
Sbjct: 178 NYKITEYSRV-AQDIDHLKACLAVGSPFVFGFSVYN-SWVGNNSLPVRIPLPTKNDTLEG 235
Query: 260 NHAMLLVGY--TRNSWILKNWWSHHWGDNGYMYLKRG-NNRCGIANYA 304
HA+L VGY + ++N W ++ G++GY ++ + +A+
Sbjct: 236 GHAVLCVGYDDEIRHFRIRNSWGNNVGEDGYFWMPYEYISNTQLADDF 283
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
disorder P like protein, hydrolase; NMR {Drosophila
melanogaster}
Length = 80
Score = 51.2 bits (123), Expect = 6e-09
Identities = 12/52 (23%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDL 65
+ K+ K+Y + ++++ + + +I HN++ ++G + + NHL+DL
Sbjct: 14 KSKFDKNYEAEEDLMRRRI-YAESKARIEEHNRKFEKGEVTWKMGINHLADL 64
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics
of pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 48.2 bits (115), Expect = 1e-07
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 4/72 (5%)
Query: 14 QKKYKKDYRKKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKE 73
Q Y K Y + ++ +++N IHTHNQ+ + Y+L+ NH DL + ++
Sbjct: 29 QAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG----YSYSLKMNHFGDLSRDEFRRK 84
Query: 74 MTRLTHSRIRRT 85
SR ++
Sbjct: 85 YLGFKKSRNLKS 96
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 46.8 bits (110), Expect = 6e-06
Identities = 52/323 (16%), Positives = 89/323 (27%), Gaps = 94/323 (29%)
Query: 31 KLHWQSNHKKIHTHNQ--EAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVR 88
K+ W N K ++ E Q L Y + N S IK + +RR L++
Sbjct: 183 KIFW-LNLKNCNSPETVLEMLQKLL-YQIDPNWTSRSDHSSNIKLRIHSIQAELRR-LLK 239
Query: 89 SPESNESVLIPDHLDWREKGFITPDWNQEDCGACYAF-------------SIASAIQGQI 135
S +L+ ++ K WN AF + +
Sbjct: 240 SKPYENCLLVLLNV-QNAK-----AWN--------AFNLSCKILLTTRFKQVTDFLSAAT 285
Query: 136 FK-----------STSEIEELSIQQVVDCSI-------ISGN---LGCAGGSLR---NTL 171
+ E++ L + + +DC ++ N L S+R T
Sbjct: 286 TTHISLDHHSMTLTPDEVKSL-LLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATW 344
Query: 172 NYVQFAGG-------------LMKEEDYPYKGKQSICKFKRPNIVVDISS------WSVL 212
+ + L E + S+ F P+ I + W +
Sbjct: 345 DNWKHVNCDKLTTIIESSLNVLEPAEYRKMFDRLSV--F-PPS--AHIPTILLSLIWFDV 399
Query: 213 PPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSD--YVNHAMLLVGYTR 270
D + L V T + IY + + Y H ++ Y
Sbjct: 400 IKSDVMVVVNKLHKYS--LVEKQPKESTISIP--SIYLELKVKLENEYALHRSIVDHYN- 454
Query: 271 NSWILKNWWSHHWGDN---GYMY 290
I K + S Y Y
Sbjct: 455 ---IPKTFDSDDLIPPYLDQYFY 474
Score = 32.5 bits (73), Expect = 0.20
Identities = 34/238 (14%), Positives = 69/238 (28%), Gaps = 76/238 (31%)
Query: 30 KKLHWQSNHKKIHTHNQEAQQGLHGYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRS 89
KLH S +K +E+ + L + ++ L H I
Sbjct: 409 NKLHKYSLVEK---QPKESTISIPSIYLEL--------KVKLENEYAL-HRSIV------ 450
Query: 90 PESNESVLIPDHLDWREKGFITPDWNQEDCGACYAFS-----IASAIQGQIFKSTSEI-E 143
+ IP D + + P + + Y +S + + + +
Sbjct: 451 ----DHYNIPKTFDSDD---LIPPYLDQ-----YFYSHIGHHLKNIEHPERMTLFRMVFL 498
Query: 144 ELS-IQQVVDCSIISGNLGCAGGSLRNTLNYVQFAGGLMKEEDYPYKGK-QSICKFKRPN 201
+ ++Q + A GS+ NTL ++F + + D Y+ +I F
Sbjct: 499 DFRFLEQKI---RHDSTAWNASGSILNTLQQLKFYKPYICDNDPKYERLVNAILDF---- 551
Query: 202 IVVDISSWSVLPPQDEHALKVTLATVGPIAVSINASPHT------FQLYASGIYDDEA 253
LP +E+ + S +T I+++
Sbjct: 552 ----------LPKIEENLIC---------------SKYTDLLRIALMAEDEAIFEEAH 584
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant
photosynthetic reaction center, peripheral antenna; HET:
CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1
Length = 154
Score = 35.3 bits (80), Expect = 0.009
Identities = 9/47 (19%), Positives = 17/47 (36%), Gaps = 20/47 (42%)
Query: 191 KQSICKFKRPNIVVDISSWSVLPPQDEHALKVTLATVGPIAVSINAS 237
KQ++ K + +LK+ P A++I A+
Sbjct: 19 KQALKKL-------------------QASLKLYADDSAP-ALAIKAT 45
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 34.8 bits (79), Expect = 0.030
Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 8/87 (9%)
Query: 212 LPPQDEHALKVTLATVGPIAVSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRN 271
L D + Q YD+ T D H M + G ++
Sbjct: 272 LSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDD---HGMQIYGIAKD 328
Query: 272 S-----WILKNWWSHHWGDNGYMYLKR 293
+++KN W + NG Y +
Sbjct: 329 QEGNEYYMVKNSWGTNSKYNGIWYASK 355
Score = 30.2 bits (67), Expect = 0.80
Identities = 15/72 (20%), Positives = 28/72 (38%), Gaps = 4/72 (5%)
Query: 110 ITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLGCAGGSLRN 169
IT NQ G C+ +S S ++ ++ + +LS V + + A ++R
Sbjct: 22 ITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLD----RADAAVRT 77
Query: 170 TLNYVQFAGGLM 181
+ GG
Sbjct: 78 HGDVSFSQGGSF 89
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 34.3 bits (78), Expect = 0.047
Identities = 8/40 (20%), Positives = 16/40 (40%), Gaps = 7/40 (17%)
Query: 259 VNHAMLLVGYTRNS-------WILKNWWSHHWGDNGYMYL 291
+ AML+ G + + ++N W G +G +
Sbjct: 371 MTAAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLYVM 410
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 34.3 bits (78), Expect = 0.055
Identities = 37/238 (15%), Positives = 59/238 (24%), Gaps = 104/238 (43%)
Query: 35 QSNHKKIHTHNQEAQQGLHGYTLRENHLS---------DLHPRHYIKEMTRLTHSRIRRT 85
+N + H G G +REN+ + L KE+ + S
Sbjct: 1666 INNPVNLTIHFG----GEKGKRIRENYSAMIFETIVDGKLKTEKIFKEINEHSTSYT--- 1718
Query: 86 LVRSPES--NE-----------SVLIPDHLDWREKGFITPDWNQEDC---G--------- 120
RS + + D + KG I D G
Sbjct: 1719 -FRSEKGLLSATQFTQPALTLMEKAA--FEDLKSKGLI-----PADATFAGHSLGEYAAL 1770
Query: 121 ACYA--FSIASAIQ-----GQIFKSTSEIEEL----------------------SIQQVV 151
A A SI S ++ G + +EL ++Q VV
Sbjct: 1771 ASLADVMSIESLVEVVFYRGMTMQVAVPRDELGRSNYGMIAINPGRVAASFSQEALQYVV 1830
Query: 152 D---------CSIISGNLGCAG------GSLRNTLNYVQFAGGLMKEEDYPYKGKQSI 194
+ I+ N G LR + ++ + Q I
Sbjct: 1831 ERVGKRTGWLVEIV--NYNVENQQYVAAGDLRA----LDTVTNVLN-----FIKLQKI 1877
>3cam_A Cold-shock domain family protein; cold shock protein, chain SWAP,
STRU genomics, oxford protein production facility, OPPF,
gene RE; 2.60A {Neisseria meningitidis MC58}
Length = 67
Score = 31.0 bits (71), Expect = 0.059
Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 11/54 (20%)
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
GFITPD ED A + SAI + FK+ E Q V + +G G
Sbjct: 16 GFITPDEGGEDLFAHF-----SAINMEGFKTLKE------GQRVSFDVTTGPKG 58
>3i2z_B RNA chaperone, negative regulator of CSPA transcription; beta
barrel, DNA binding protein/transcription, cytoplasm,
gene regulation; 1.10A {Salmonella typhimurium} PDB:
2l15_A 1mjc_A 3mef_A
Length = 71
Score = 30.3 bits (69), Expect = 0.10
Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 11/54 (20%)
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
GFITP+ +D + SAIQ FK+ +E Q V+ I +G G
Sbjct: 20 GFITPEDGSKDVFVHF-----SAIQTNGFKTLAE------GQRVEFEITNGAKG 62
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 32.4 bits (73), Expect = 0.18
Identities = 9/41 (21%), Positives = 14/41 (34%), Gaps = 8/41 (19%)
Query: 259 VNHAMLLVGY--------TRNSWILKNWWSHHWGDNGYMYL 291
+ HAM W ++N W G GY+ +
Sbjct: 369 MTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCM 409
>2lss_A Cold shock-like protein; CSD, CSP, oligonucleotide binding F fold,
RNA binding protein, DNA binding protein; NMR
{Rickettsia rickettsii}
Length = 70
Score = 28.4 bits (64), Expect = 0.52
Identities = 10/34 (29%), Positives = 12/34 (35%), Gaps = 5/34 (14%)
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSE 141
GFI D +D F SA+ S E
Sbjct: 19 GFIEQDNGGKDV-----FVHKSAVDAAGLHSLEE 47
>1g6p_A Cold shock protein TMCSP; greek-KEY, beta barrel, OB-fold,
structural genomics; NMR {Thermotoga maritima} SCOP:
b.40.4.5
Length = 66
Score = 27.2 bits (61), Expect = 1.6
Identities = 18/54 (33%), Positives = 23/54 (42%), Gaps = 12/54 (22%)
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
GFIT D D + SAI+ + FK+ E QVV+ I G G
Sbjct: 15 GFITKD-EGGDVFVHW-----SAIEMEGFKTLKE------GQVVEFEIQEGKKG 56
>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic
vesicle, membrane, Ca structural protein; 7.94A {Bos
taurus}
Length = 190
Score = 29.0 bits (64), Expect = 1.7
Identities = 8/63 (12%), Positives = 19/63 (30%), Gaps = 23/63 (36%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKLH------------WQSNHK----KIHTHNQE 47
++W +++ +K ++ SK W K +N+
Sbjct: 88 RKW-------REEQRKRLQELDAASKVMEQEWREKAKKDLEEWNQRQSEQVEKNKINNRI 140
Query: 48 AQQ 50
A +
Sbjct: 141 ADK 143
>1c9o_A CSPB, cold-shock protein; beta barrel, homodimer, transcription;
1.17A {Bacillus caldolyticus} SCOP: b.40.4.5 PDB: 2hax_A
1hz9_A 1hzb_A 1i5f_A 1hza_A 1hzc_A 3pf4_A 1csq_A 1nmf_A
1nmg_A 1csp_A 2f52_A 2es2_A 3pf5_A 2i5m_X 2i5l_X
Length = 66
Score = 26.8 bits (60), Expect = 1.7
Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 12/54 (22%)
Query: 108 GFITPDWNQEDCGACYAFSIASAIQGQIFKSTSEIEELSIQQVVDCSIISGNLG 161
GFI + D + +AIQG+ FK+ E Q V I+ GN G
Sbjct: 16 GFIEVE-GGSDVFVHF-----TAIQGEGFKTLEE------GQEVSFEIVQGNRG 57
>3suk_A Cerato-platanin-like protein; double PSI beta barrel, unknown
function; 1.34A {Moniliophthora perniciosa}
Length = 125
Score = 27.8 bits (61), Expect = 2.2
Identities = 6/32 (18%), Positives = 9/32 (28%)
Query: 109 FITPDWNQEDCGACYAFSIASAIQGQIFKSTS 140
+N CG CY S + +
Sbjct: 51 SDIGGFNSPACGNCYTISFTFQGVTRSINLVA 82
>3m3g_A EPL1 protein; fungal, plant defense, fungus, polysaccharide-binding
protei; 1.39A {Hypocrea virens}
Length = 120
Score = 27.7 bits (61), Expect = 2.6
Identities = 5/22 (22%), Positives = 7/22 (31%)
Query: 109 FITPDWNQEDCGACYAFSIASA 130
WN CG C+ +
Sbjct: 50 AAVAGWNSASCGTCWKLQYSGH 71
>3szv_A Pyroglutatmate porin OPDO; beta-barrel, channel, bacterial outer
membrane, membrane Pro; HET: C8E; 1.45A {Pseudomonas
aeruginosa} PDB: 2y0k_A*
Length = 401
Score = 28.3 bits (62), Expect = 4.0
Identities = 7/56 (12%), Positives = 15/56 (26%)
Query: 240 TFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGN 295
L + +D L + + + GD+ Y Y+ +
Sbjct: 238 KSDLRFARASEDGGFRELDNRAFGALFSLRLGAHAVAAGYQRISGDDPYPYIAGSD 293
>3sul_A Cerato-platanin-like protein; double PSI beta barrel, unknown
function; 1.63A {Moniliophthora perniciosa}
Length = 122
Score = 27.0 bits (59), Expect = 4.0
Identities = 7/22 (31%), Positives = 9/22 (40%)
Query: 109 FITPDWNQEDCGACYAFSIASA 130
WN E CG CY + +
Sbjct: 50 DTITGWNSESCGTCYQITWSGT 71
>2kqa_A Cerato-platanin; elicitor, secreted, toxin; NMR {Ceratocystis
platani}
Length = 129
Score = 27.0 bits (59), Expect = 4.2
Identities = 5/22 (22%), Positives = 9/22 (40%)
Query: 109 FITPDWNQEDCGACYAFSIASA 130
W+ CG C+ +I +
Sbjct: 56 PDIAGWDSPSCGTCWKVTIPNG 77
>1qht_A Protein (DNA polymerase); archaea, hyperthermostable, family B
polymer alpha family polymerase, transferase; 2.10A
{Thermococcus SP} SCOP: c.55.3.5 e.8.1.1 PDB: 1tgo_A
2xhb_A* 2vwj_A* 2vwk_A* 1wns_A* 1wn7_A 1qqc_A* 4ahc_A*
4ail_C* 3a2f_A* 2jgu_A* 1d5a_A
Length = 775
Score = 28.2 bits (63), Expect = 4.7
Identities = 6/44 (13%), Positives = 15/44 (34%), Gaps = 4/44 (9%)
Query: 71 IKEMTRLTHSRIRRTLVRSP-ESNESVLIPDHLDWREKGFITPD 113
+++RL + S E L+ ++ + P+
Sbjct: 330 EAQLSRLIGQSLWDVSRSSTGNLVEWFLLRKA---YKRNELAPN 370
>2k9m_A RNA polymerase sigma factor RPON; core binding domain,
transcription; NMR {Aquifex aeolicus}
Length = 130
Score = 26.8 bits (60), Expect = 4.8
Identities = 7/31 (22%), Positives = 12/31 (38%)
Query: 55 YTLRENHLSDLHPRHYIKEMTRLTHSRIRRT 85
+ + L DL +K + SR+R
Sbjct: 92 EEILKKALRDLKRGKKLKPEIKGKLSRLRLF 122
>3u5c_P 40S ribosomal protein S15; translation, ribosome, ribosomal,
ribosomal R ribosomal protein, eukaryotic ribosome,
RNA-protein C; 3.00A {Saccharomyces cerevisiae} PDB:
3izb_R 3o30_I 3o2z_I 3u5g_P 1s1h_S 3jyv_S*
Length = 142
Score = 26.9 bits (60), Expect = 4.8
Identities = 9/45 (20%), Positives = 19/45 (42%), Gaps = 6/45 (13%)
Query: 54 GYTLRENHLSDLHPRHYIKEMTRLTHSRIRRTLVRSPESNESVLI 98
G L + L ++ ++ +L +R+RR R S + +
Sbjct: 19 GVDLEK--LLEMS----TEDFVKLAPARVRRRFARGMTSKPAGFM 57
>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
PSI-2, protein structure initiative; 2.10A {Bacillus
anthracis}
Length = 236
Score = 27.4 bits (60), Expect = 5.2
Identities = 7/49 (14%), Positives = 21/49 (42%)
Query: 232 VSINASPHTFQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWS 280
+ +P + + ++ + Y H ++L+GY + S +++
Sbjct: 149 TNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLIGYDQESVYIRDPLK 197
>2au5_A Conserved domain protein; structural genomics, PSI, protein STR
initiative, midwest center for structural genomics,
MCSG, U function; 2.10A {Enterococcus faecalis} SCOP:
a.244.1.1
Length = 139
Score = 27.0 bits (59), Expect = 5.7
Identities = 7/31 (22%), Positives = 15/31 (48%)
Query: 112 PDWNQEDCGACYAFSIASAIQGQIFKSTSEI 142
P+W +E G + S ++ + F S ++
Sbjct: 55 PNWLEEAAGGMQGVIVQSLLEDENFSSVEQL 85
>2ykt_A Brain-specific angiogenesis inhibitor 1-associate protein 2;
signaling protein, NPY motif, binding pocket; 2.11A
{Homo sapiens} PDB: 1y2o_A 1wdz_A
Length = 253
Score = 27.2 bits (59), Expect = 6.1
Identities = 12/79 (15%), Positives = 25/79 (31%), Gaps = 7/79 (8%)
Query: 4 KEWIIIFIFPQKKYKKDYR-------KKATDSKKKLHWQSNHKKIHTHNQEAQQGLHGYT 56
+ KKY+ + R K + KK K ++ + Q + +
Sbjct: 113 ELDSRYLSAALKKYQTEQRSKGDALDKCQAELKKLRKKSQGSKNPQKYSDKELQYIDAIS 172
Query: 57 LRENHLSDLHPRHYIKEMT 75
++ L + Y +T
Sbjct: 173 NKQGELENYVSDGYKTALT 191
>3ok8_A Brain-specific angiogenesis inhibitor 1-associate 2-like protein 2;
I-BAR, protein binding; 2.25A {Mus musculus}
Length = 222
Score = 27.1 bits (59), Expect = 7.3
Identities = 8/51 (15%), Positives = 23/51 (45%), Gaps = 1/51 (1%)
Query: 4 KEWIIIFIFPQKKYKKDYRKKATDSKKKL-HWQSNHKKIHTHNQEAQQGLH 53
K + + Y+ +YR +A + +K + +K + +E ++ ++
Sbjct: 111 KLDMQFIKDSCQHYEIEYRHRAANLEKCMSELWRMERKRDKNAREMKESVN 161
>2y2x_A OPDK, vanillate porin OPDK; membrane protein, outer membrane, OPRD,
transport; HET: C8E VNL; 1.65A {Pseudomonas aeruginosa
PA01} PDB: 2qtk_A* 3sys_A*
Length = 390
Score = 27.0 bits (59), Expect = 8.0
Identities = 5/56 (8%), Positives = 11/56 (19%)
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNN 296
L+ + L GD+G+ + +
Sbjct: 228 LGLFVDRDDGAARAGEIDSHTVYGLFSAGIGLHTFYLGLQKVGGDSGWQSVYGSSG 283
>2dqb_A Deoxyguanosinetriphosphate triphosphohydrolase, P; dntpase, DNTP,
single-stranded DNA, DNA dGTPase, HD superfamily,
structural genomics; 2.20A {Thermus thermophilus}
Length = 376
Score = 27.3 bits (61), Expect = 8.0
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 11/39 (28%)
Query: 60 NHLSDLHPRHYIKEMTRLTHS----RIRRTLVRSPESNE 94
D + R TRLTH+ ++ R++ R+ NE
Sbjct: 67 GWAGD-YYR------TRLTHTLEVAQVSRSIARALGLNE 98
>3szd_B Porin; beta-barrel, channel, bacterial outer membrane, membrane
Pro; HET: C8E 3PE; 2.31A {Pseudomonas aeruginosa} PDB:
3jty_A*
Length = 405
Score = 27.0 bits (59), Expect = 8.1
Identities = 6/56 (10%), Positives = 11/56 (19%)
Query: 241 FQLYASGIYDDEACTSDYVNHAMLLVGYTRNSWILKNWWSHHWGDNGYMYLKRGNN 296
+ L L GD+G+M + +
Sbjct: 231 LGGFRGRDAGSARAGKLDNRTVSALFSARYGLHTLYLGLQKVSGDDGWMRVNGTSG 286
>1cmx_A Protein (ubiquitin YUH1-UBAL); ubiquitin hydrolase,
deubiquitinating enzyme, cysteine protease, enzyme
specificity; 2.25A {Synthetic} SCOP: d.3.1.6
Length = 235
Score = 26.6 bits (58), Expect = 9.6
Identities = 6/31 (19%), Positives = 12/31 (38%)
Query: 8 IIFIFPQKKYKKDYRKKATDSKKKLHWQSNH 38
I+ +FP + +K + S + W
Sbjct: 55 IVLLFPINEDRKSSTSQQITSSYDVIWFKQS 85
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.319 0.134 0.425
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,926,246
Number of extensions: 282931
Number of successful extensions: 868
Number of sequences better than 10.0: 1
Number of HSP's gapped: 625
Number of HSP's successfully gapped: 74
Length of query: 309
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 216
Effective length of database: 4,105,140
Effective search space: 886710240
Effective search space used: 886710240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.5 bits)
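
The figures in the statistics block above are related by simple arithmetic, and a short sketch can make that explicit. The Python below is a minimal illustration, not part of the RPS-BLAST output. It assumes the usual BLAST conventions: the length adjustment is subtracted once from the query and once per database sequence, and a raw score S converts to bits as (Lambda*S - ln K) / ln 2. The function names are hypothetical. Lambda and K are printed rounded in the report, so results should agree only to the printed precision.

```python
import math


def effective_lengths(query_len, db_letters, db_seqs, length_adjustment):
    """Return (effective query length, effective database length, search space).

    Assumes the adjustment is removed once from the query and once from
    every sequence in the database.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_letters - db_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits using Karlin-Altschul Lambda and K."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report footer.
eff_q, eff_db, space = effective_lengths(309, 6_701_793, 27_921, 93)
print(eff_q, eff_db, space)

# Gapped parameters (Lambda = 0.267, K = 0.0856) applied to S2 = 57.
print(round(bit_score(57, 0.267, 0.0856), 1))

# Ungapped parameters (Lambda = 0.319, K = 0.134) applied to S1 = 41.
print(round(bit_score(41, 0.319, 0.134), 1))
```

With the footer values, the first call reproduces the reported effective query length (216), effective database length (4,105,140) and effective search space (886,710,240). The two conversions match the 25.5 and 21.8 bits printed for S2 and S1.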