RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy11694
(655 letters)
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 260 bits (666), Expect = 1e-81
Identities = 89/317 (28%), Positives = 145/317 (45%), Gaps = 43/317 (13%)
Query: 185 HAIQGNNLTELSVQH--------HDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTA 235
H ++G+ L V + + Y + ++ R + F +E E++ + G
Sbjct: 6 HHLEGSALPSTFVAEKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLV 65
Query: 236 VF--GVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 293
+ GVN F D++ +++ T L P ++ ++ + L
Sbjct: 66 SYTLGVNLFTDMTPEEMKAYTH------------GLIMPADLHKNGIPIKTREDLGLNAS 113
Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LSVQQLVDC 351
P +FDWR +G++S VK QG C WAFS+ G +E+ I + + +S QQLVDC
Sbjct: 114 VRYPASFDWRDQGMVSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDC 173
Query: 352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRI 411
+ GC+GG M+DA Y+ NGG+ S+ AYPY+ ++ C + ++ Y +
Sbjct: 174 VPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEMADGN--CHY-DPNQVAARLSGYVYL 230
Query: 412 PYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID----LNQRL--------YGTS-- 456
+E + VAT+GP++V +A+ F YSGGV + YG
Sbjct: 231 SGPDENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFTHAVLIVGYGNENG 290
Query: 457 IPYWIVKNSWGSDWGEK 473
YW+VKNSWG WG
Sbjct: 291 QDYWLVKNSWGDGWGLD 307
Score = 177 bits (451), Expect = 2e-50
Identities = 56/148 (37%), Positives = 80/148 (54%), Gaps = 11/148 (7%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
LVDC + GC+GG M+DA Y+ NGG+ S+ AYPY+ ++ C + ++
Sbjct: 170 LVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYEMADGN--CHY-DPNQVAARLSG 226
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVG 624
Y + +E + VAT+GP++V +A+ F YSGGV C HA++IVG
Sbjct: 227 YVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGGV--YYNPTCETNKFTHAVLIVG 284
Query: 625 YGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
YG E +D YW+VKNSWG WG
Sbjct: 285 YGNENGQD-----YWLVKNSWGDGWGLD 307
Score = 114 bits (288), Expect = 4e-28
Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 17/161 (10%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
++ NF + + Y + ++ R + F +E E++ + G + VN F D++
Sbjct: 20 EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79
Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
+++ T L P ++ ++ + L P +FDWR +G+
Sbjct: 80 EEMKAYTH------------GLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQGM 127
Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE--LSVQ 198
+S VK QG C WAFS+ G +E+ I + +S Q
Sbjct: 128 VSPVKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQ 168
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 244 bits (625), Expect = 8e-76
Identities = 68/295 (23%), Positives = 116/295 (39%), Gaps = 47/295 (15%)
Query: 198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGL 256
+ +K Y++ ED +NF+ +V+ + +N DLS + +
Sbjct: 13 KAFNKSYATFEDEEAARKNFLESVKYVQSNGG--------AINHLSDLSLDEFKNRFLMS 64
Query: 257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
Q L A + N+ + P D R ++ ++ QG
Sbjct: 65 AEAFEHLKTQFDLNA--------------ETNACSINGNAPAEIDLRQMRTVTPIRMQGG 110
Query: 317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
C WAFS V E+ + + +L+ Q+LVDC S GC+G + ++YI N GV
Sbjct: 111 CGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHN-GV 168
Query: 377 VSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNA 435
V + Y Y A E C + + Y +I ++++ +A T ++V +
Sbjct: 169 VQESYYRYVAREQS--CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGI 224
Query: 436 NGLF---YYSGGVIDL----NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
L +Y G I Q Y + + YWIV+NSW ++WG+
Sbjct: 225 KDLDAFRHYDGRTIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWGDN 279
Score = 158 bits (402), Expect = 8e-44
Identities = 48/166 (28%), Positives = 74/166 (44%), Gaps = 21/166 (12%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
LA ++LVDC S GC+G + ++YI NG VV + Y Y A E
Sbjct: 131 RDQSLD----LAEQELVDCA-SQHGCHGDTIPRGIEYIQHNG-VVQESYYRYVAREQS-- 182
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL 606
C + + Y +I ++++ +A T ++V + L +Y G I
Sbjct: 183 CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTII- 239
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
QR + HA+ IVGY + D YWIV+NSW ++WG+
Sbjct: 240 -QRDNGYQPNYHAVNIVGYSNAQGVD-----YWIVRNSWDTNWGDN 279
Score = 118 bits (298), Expect = 1e-29
Identities = 31/160 (19%), Positives = 57/160 (35%), Gaps = 23/160 (14%)
Query: 40 SPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSD 99
S + F + + +K Y++ ED +NF+ +V+ + +N DLS
Sbjct: 3 SSIKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGGA--------INHLSDLSL 54
Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
+ + Q L A + N+ + P D R
Sbjct: 55 DEFKNRFLMSAEAFEHLKTQFDLNA--------------ETNACSINGNAPAEIDLRQMR 100
Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
++ ++ QG C WAFS V E+ + + +L+ Q
Sbjct: 101 TVTPIRMQGGCGSAWAFSGVAATESAYLAYRDQSLDLAEQ 140
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 242 bits (620), Expect = 4e-75
Identities = 91/295 (30%), Positives = 136/295 (46%), Gaps = 41/295 (13%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQ-LT 254
H K Y++ D + R + N++ + E G + +N D++ ++ Q +T
Sbjct: 17 THRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMT 76
Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
GL + + +L P P++ D+R +G ++ VK Q
Sbjct: 77 GLKVPLSHSRSNDTLYIP------------------EWEGRAPDSVDYRKKGYVTPVKNQ 118
Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNG 374
G+C CWAFS+VG +E + L LS Q LVDC N GC GG M +A QY+ N
Sbjct: 119 GQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNR 178
Query: 375 GVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMN 434
G+ S+ AYPY E C+ G K + Y IP G E+ +K+ VA GP+SV ++
Sbjct: 179 GIDSEDAYPYVGQEES--CMY-NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAID 235
Query: 435 ANG--LFYYSGGVID----LNQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
A+ +YS GV + L YG +WI+KNSWG +WG K
Sbjct: 236 ASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 290
Score = 165 bits (420), Expect = 3e-46
Identities = 64/164 (39%), Positives = 88/164 (53%), Gaps = 16/164 (9%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
TG L + L+ + LVDC N GC GG M +A QY+ N G+ S+ AYPY E
Sbjct: 141 TGKLLN----LSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES-- 194
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQ 608
C+ G K + Y IP G E+ +K+ VA GP+SV ++A+ +YS GV
Sbjct: 195 CMY-NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYY--D 251
Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
CN NHA++ VGYG ++ +WI+KNSWG +WG K
Sbjct: 252 ESCNSDNLNHAVLAVGYGIQKGNK-----HWIIKNSWGENWGNK 290
Score = 111 bits (279), Expect = 4e-27
Identities = 38/160 (23%), Positives = 68/160 (42%), Gaps = 22/160 (13%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
T + + + H K Y++ D + R + N++ + G +E +N D++
Sbjct: 9 THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68
Query: 100 SDLQQ-LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
++ Q +TGL + + +L P P++ D+R +G
Sbjct: 69 EEVVQKMTGLKVPLSHSRSNDTLYIP------------------EWEGRAPDSVDYRKKG 110
Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
++ VK QG+C CWAFS+VG +E + L LS Q
Sbjct: 111 YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 150
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 240 bits (615), Expect = 3e-74
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 38/290 (13%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESD-LQQLTGLN 257
+H+K Y +V++ L R E F N+ ++ +++ + G+N+F DLS + ++ G
Sbjct: 28 NHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWL-GLNEFADLSNDEFNEKYVGSL 86
Query: 258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
+D+T+E +LPE DWR +G ++ V+ QG C
Sbjct: 87 IDATIEQSYDEEFIN------------------EDIVNLPENVDWRKKGAVTPVRHQGSC 128
Query: 318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
CWAFSAV VE ++ I+ L ELS Q+LVDC+ + GC GG AL+Y+ N G+
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIH 187
Query: 378 SDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
YPYKA + C + G VK R+ E + +A P+SV + + G
Sbjct: 188 LRSKYPYKAKQGT--CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKG 244
Query: 438 L-F-YYSGGVID------LNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
F Y GG+ + ++ + YG S Y ++KNSWG+ WGEK
Sbjct: 245 RPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEK 294
Score = 153 bits (389), Expect = 7e-42
Identities = 54/158 (34%), Positives = 78/158 (49%), Gaps = 15/158 (9%)
Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEE 556
KL L+ ++LVDC+ + GC GG AL+Y+ N G+ YPYKA + C +
Sbjct: 150 KLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT--CRAKQV 206
Query: 557 EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPK 614
G VK R+ E + +A P+SV + + G F Y GG+ + C K
Sbjct: 207 GGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP---CGTK 262
Query: 615 AQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ A+ VGYG+ K Y ++KNSWG+ WGEK
Sbjct: 263 V-DGAVTAVGYGKSGGKG-----YILIKNSWGTAWGEK 294
Score = 113 bits (286), Expect = 5e-28
Identities = 46/157 (29%), Positives = 78/157 (49%), Gaps = 20/157 (12%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD- 101
F ++M +H+K Y +V++ L R E F N+ ++ ++++ + +N+F DLS+ +
Sbjct: 20 QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWL-GLNEFADLSNDEF 78
Query: 102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
++ G +D+T+E +LPE DWR +G ++
Sbjct: 79 NEKYVGSLIDATIEQSYDEEFIN------------------EDIVNLPENVDWRKKGAVT 120
Query: 162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
V+ QG C CWAFSAV VE ++ I+ L ELS Q
Sbjct: 121 PVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQ 157
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 233 bits (596), Expect = 6e-73
Identities = 68/193 (35%), Positives = 104/193 (53%), Gaps = 20/193 (10%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
P +DWR++G ++KVK+QG C CWAFS G VE + +L LS Q+L+DCD +
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61
Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
C GG +A I + GG+ ++ Y Y+ C E+ KV +++ + E
Sbjct: 62 ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS--CQFSAEKA-KVYIQDSVELS-QNE 117
Query: 417 EEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL--------------YGTS--IPYW 460
+++ W+A RGP+SV +NA G+ +Y G+ + L YG +P+W
Sbjct: 118 QKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGQRSDVPFW 177
Query: 461 IVKNSWGSDWGEK 473
+KNSWG+DWGEK
Sbjct: 178 AIKNSWGTDWGEK 190
Score = 182 bits (464), Expect = 9e-54
Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 13/162 (8%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
G L S L+ ++L+DCD + C GG +A I + GG+ ++ Y Y+
Sbjct: 42 QGTLLS----LSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS-- 95
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL 610
C E+ KV +++ + E+++ W+A RGP+SV +NA G+ +Y G+ + L
Sbjct: 96 CQFSAEKA-KVYIQDSVELS-QNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPL 153
Query: 611 CNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
C+P +HA+++VGYG+ +W +KNSWG+DWGEK
Sbjct: 154 CSPWLIDHAVLLVGYGQRSDVP-----FWAIKNSWGTDWGEK 190
Score = 82.5 bits (205), Expect = 4e-18
Identities = 23/50 (46%), Positives = 30/50 (60%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P +DWR++G ++KVK+QG C CWAFS G VE + L LS Q
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 51
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 235 bits (603), Expect = 1e-72
Identities = 97/297 (32%), Positives = 141/297 (47%), Gaps = 45/297 (15%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQL-T 254
+ K Y + R + N++ + E S G+N D++ ++ L +
Sbjct: 18 TYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMS 77
Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
L + S Q + SN LP++ DWR +G +++VK Q
Sbjct: 78 SLRVPS-----QWQRNITYKSN---------------PNRILPDSVDWREKGCVTEVKYQ 117
Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS---NGGCNGGRMDDALQYII 371
G C WAFSAVG +EA ++ L LS Q LVDC N GCNGG M A QYII
Sbjct: 118 GSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYII 177
Query: 372 DNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSV 431
DN G+ SD +YPYKA + + C + + +Y+ +PYG E+ +K+ VA +GP+SV
Sbjct: 178 DNKGIDSDASYPYKAMDQK--CQY-DSKYRAATCSKYTELPYGREDVLKEAVANKGPVSV 234
Query: 432 GMNANGL-F-YYSGGVIDL---NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
G++A F Y GV Q + YG YW+VKNSWG ++GE+
Sbjct: 235 GVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEE 291
Score = 155 bits (393), Expect = 2e-42
Identities = 68/167 (40%), Positives = 96/167 (57%), Gaps = 20/167 (11%)
Query: 491 TGVLPSKLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
TG L S L+ + LVDC N GCNGG M A QYIIDN G+ SD +YPYKA +
Sbjct: 140 TGKLVS----LSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ 195
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID 605
+ C + + +Y+ +PYG E+ +K+ VA +GP+SVG++A F Y GV
Sbjct: 196 K--CQY-DSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV-- 250
Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C NH +++VGYG+ K+ YW+VKNSWG ++GE+
Sbjct: 251 YYEPSCTQN-VNHGVLVVGYGDLNGKE-----YWLVKNSWGHNFGEE 291
Score = 110 bits (276), Expect = 1e-26
Identities = 38/160 (23%), Positives = 66/160 (41%), Gaps = 24/160 (15%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
+ + + + K Y + R + N++ + E G ++ +N D++
Sbjct: 10 HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69
Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
++ L + L + S Q + SN LP++ DWR +G
Sbjct: 70 EEVMSLMSSLRVPS-----QWQRNITYKSN---------------PNRILPDSVDWREKG 109
Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+++VK QG C WAFSAVG +EA ++ L LS Q
Sbjct: 110 CVTEVKYQGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQ 149
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 232 bits (594), Expect = 3e-72
Identities = 68/198 (34%), Positives = 103/198 (52%), Gaps = 20/198 (10%)
Query: 295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
D PE++DW +GVI+KVK QG+C WAFSA G +EA HAI +L LS Q+L+DC
Sbjct: 1 DAPESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVDE 60
Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE----SERGCLVGEEEGFKVKVKEYSR 410
+ GC G + ++++ +GG+ S+ YPYKA + + + + V++
Sbjct: 61 SEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNES 120
Query: 411 IPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVID-----LNQRL--------YGTS- 456
E ++ +V P+SV ++A +YSGG+ D + YG+
Sbjct: 121 TESEAESSLQSFVLE-QPISVSIDAKDFHFYSGGIYDGGNCSSPYGINHFVLIVGYGSED 179
Query: 457 -IPYWIVKNSWGSDWGEK 473
+ YWI KNSWG DWG
Sbjct: 180 GVDYWIAKNSWGEDWGID 197
Score = 173 bits (440), Expect = 7e-50
Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 15/166 (9%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE---- 546
TG L S L+ ++L+DC + GC G + ++++ +GG+ S+ YPYKA +
Sbjct: 43 TGNLVS----LSEQELIDCVDESEGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCK 98
Query: 547 SERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDL 606
+ + + V++ E ++ +V P+SV ++A +YSGG+ D
Sbjct: 99 ANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLE-QPISVSIDAKDFHFYSGGIYD- 156
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+P NH ++IVGYG E+ D YWI KNSWG DWG
Sbjct: 157 GGNCSSPYGINHFVLIVGYGSEDGVD-----YWIAKNSWGEDWGID 197
Score = 85.0 bits (211), Expect = 1e-18
Identities = 30/52 (57%), Positives = 36/52 (69%)
Query: 147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
D PE++DW +GVI+KVK QG+C WAFSA G +EA HAI NL LS Q
Sbjct: 1 DAPESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQ 52
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 234 bits (600), Expect = 3e-72
Identities = 81/295 (27%), Positives = 130/295 (44%), Gaps = 43/295 (14%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQLTG 255
++K Y+ +D RR + NV+ +++ V G+N+F D++ + +
Sbjct: 11 MYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYL 69
Query: 256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
+ +++ + + +P+ DWR G +++VK+QG
Sbjct: 70 ------------------TEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQG 111
Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
C WAFS G +E + + S QQLVDC N GC GG M++A QY +
Sbjct: 112 NCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQY-LKQ 170
Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
G+ ++ +YPY A E + C ++ G KV + + G E E+K V GP +V +
Sbjct: 171 FGLETESSYPYTAVEGQ--CRYNKQLG-VAKVTGFYTVHSGSEVELKNLVGAEGPAAVAV 227
Query: 434 NANGLF-YYSGGVID----LNQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
+ F Y G+ R+ YGT YWIVKNSWG WGE+
Sbjct: 228 DVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWGER 282
Score = 158 bits (401), Expect = 1e-43
Identities = 59/166 (35%), Positives = 83/166 (50%), Gaps = 20/166 (12%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
S S E+ LVDC N GC GG M++A QY+ G+ ++ +YPY A E
Sbjct: 133 ERTSIS-FS----EQQLVDCSRPWGNNGCGGGLMENAYQYL-KQFGLETESSYPYTAVEG 186
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
+ C ++ G KV + + G E E+K V GP +V ++ F Y G+
Sbjct: 187 Q--CRYNKQLG-VAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQ- 242
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C+P NHA++ VGYG + D YWIVKNSWG WGE+
Sbjct: 243 -SQTCSPLRVNHAVLAVGYGTQGGTD-----YWIVKNSWGLSWGER 282
Score = 108 bits (272), Expect = 3e-26
Identities = 31/159 (19%), Positives = 63/159 (39%), Gaps = 22/159 (13%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
+ + R ++K Y+ +D RR + NV+ +++ D G + +N+F D++
Sbjct: 3 DLWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 61
Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
+ + + +++ + + +P+ DWR G
Sbjct: 62 EEFKAKYL------------------TEMSRASDILSHGVPYEANNRAVPDKIDWRESGY 103
Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+++VK+QG C WAFS G +E + S Q
Sbjct: 104 VTEVKDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQ 142
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 230 bits (589), Expect = 2e-71
Identities = 71/206 (34%), Positives = 103/206 (50%), Gaps = 28/206 (13%)
Query: 291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
A+DWR G ++ VK+Q C CWAFS+VG VE+ +AI+ +L S Q+LVD
Sbjct: 15 ADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVD 74
Query: 351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSR 410
C + N GC GG + +A +ID GG+ S YPY ++ E C + + +K Y
Sbjct: 75 CSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-CNLKRCNE-RYTIKSYVS 132
Query: 411 IPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLN--QRL--------YGTS--- 456
IP +++ K+ + GP+S+ + A+ F +Y GG D YG
Sbjct: 133 IP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIY 189
Query: 457 ---------IPYWIVKNSWGSDWGEK 473
Y+I+KNSWGSDWGE
Sbjct: 190 NEDTGRMEKFYYYIIKNSWGSDWGEG 215
Score = 169 bits (431), Expect = 1e-48
Identities = 56/168 (33%), Positives = 87/168 (51%), Gaps = 19/168 (11%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
L + ++LVDC + N GC GG + +A +ID GG+ S YPY ++ E
Sbjct: 61 KKALFL----FSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET- 115
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQR 609
C + + +K Y IP +++ K+ + GP+S+ + A+ F +Y GG D
Sbjct: 116 CNLKRCNE-RYTIKSYVSIP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE-- 169
Query: 610 LCNPKAQNHALIIVGYGEEEKKDGTS-----IPYWIVKNSWGSDWGEK 652
C NHA+I+VGYG ++ + + Y+I+KNSWGSDWGE
Sbjct: 170 -CGAA-PNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEG 215
Score = 86.9 bits (216), Expect = 2e-19
Identities = 23/56 (41%), Positives = 31/56 (55%)
Query: 143 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
A+DWR G ++ VK+Q C CWAFS+VG VE+ +AI+ L S Q
Sbjct: 15 ADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 70
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 233 bits (596), Expect = 2e-71
Identities = 91/295 (30%), Positives = 133/295 (45%), Gaps = 42/295 (14%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQLTG 255
H K YSS + +RR F NV K ++ ++ + G + +N+F D+S+ +
Sbjct: 33 THKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSKEEFLAYVN 92
Query: 256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
Q + + L + DWR+ V S+VK+QG
Sbjct: 93 -----------------RGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNAV-SEVKDQG 134
Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
+C W+FS G VE A+Q LT LS Q L+DC S N GC+GG MD A YI +
Sbjct: 135 QCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIH-D 193
Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
G++S+ AYPY+A C + + Y +P G+E + V GP++V +
Sbjct: 194 YGIMSESAYPYEAQGDY--CRF-DSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAI 250
Query: 434 NANGLF-YYSGGVIDL----NQRL--------YGTS--IPYWIVKNSWGSDWGEK 473
+A +YSGG+ L YG+ YWI+KNSWGS WGE
Sbjct: 251 DATDELQFYSGGLFYDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWGES 305
Score = 157 bits (400), Expect = 2e-43
Identities = 62/166 (37%), Positives = 88/166 (53%), Gaps = 20/166 (12%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
G L S LS E+ L+DC S N GC+GG MD A YI + G++S+ AYPY+A
Sbjct: 156 RGRLTS-LS----EQNLIDCSSSYGNAGCDGGWMDSAFSYIH-DYGIMSESAYPYEAQGD 209
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
C + + Y +P G+E + V GP++V ++A +YSGG+
Sbjct: 210 Y--CRF-DSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGGL--F 264
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ CN NH +++VGYG + +D YWI+KNSWGS WGE
Sbjct: 265 YDQTCNQSDLNHGVLVVGYGSDNGQD-----YWILKNSWGSGWGES 305
Score = 109 bits (274), Expect = 2e-26
Identities = 42/159 (26%), Positives = 64/159 (40%), Gaps = 21/159 (13%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
++ F H K YSS + +RR F NV K ++ + + G + +N+F D+S
Sbjct: 25 EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84
Query: 100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
+ Q + + L + DWR+ V
Sbjct: 85 EEFLAYVN-----------------RGKAQKPKHPENLRMPYVSSKKPLAASVDWRSNAV 127
Query: 160 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
S+VK+QG+C W+FS G VE A+Q LT LS Q
Sbjct: 128 -SEVKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQ 165
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 228 bits (583), Expect = 1e-70
Identities = 65/204 (31%), Positives = 103/204 (50%), Gaps = 28/204 (13%)
Query: 293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
+ A+DWR ++ VK+Q C CWAFS++G VE+ +AI+ N L LS Q+LVDC
Sbjct: 15 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS 74
Query: 353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP 412
N GCNGG +++A + +I+ GG+ D YPY + + K +K Y +P
Sbjct: 75 FKNYGCNGGLINNAFEDMIELGGICPDGDYPYVS--DAPNLCNIDRCTEKYGIKNYLSVP 132
Query: 413 YGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLN--QRL--------YGTS----- 456
+ ++K+ + GP+S+ + + F +Y G+ D +L +G
Sbjct: 133 ---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNP 189
Query: 457 -------IPYWIVKNSWGSDWGEK 473
Y+I+KNSWG WGE+
Sbjct: 190 LTKKGEKHYYYIIKNSWGQQWGER 213
Score = 167 bits (426), Expect = 5e-48
Identities = 50/153 (32%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
LVDC N GCNGG +++A + +I+ GG+ D YPY + + K +K
Sbjct: 70 LVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVS--DAPNLCNIDRCTEKYGIKN 127
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVG 624
Y +P + ++K+ + GP+S+ + + F +Y G+ D C + NHA+++VG
Sbjct: 128 YLSVP---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---CGDQL-NHAVMLVG 180
Query: 625 YGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
+G +E K G Y+I+KNSWG WGE+
Sbjct: 181 FGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 213
Score = 85.7 bits (213), Expect = 5e-19
Identities = 23/54 (42%), Positives = 33/54 (61%)
Query: 145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+ A+DWR ++ VK+Q C CWAFS++G VE+ +AI+ N L LS Q
Sbjct: 15 ENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 68
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 226 bits (578), Expect = 3e-70
Identities = 71/191 (37%), Positives = 112/191 (58%), Gaps = 15/191 (7%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
P A DWRA G ++ VK+QG+C CWAFSA+G VE + G+ LT LS Q LV CD ++
Sbjct: 2 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 61
Query: 357 GCNGGRMDDALQYII--DNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
GC+GG M++A ++I+ +NG V ++ +YPY + E + + +P
Sbjct: 62 GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 121
Query: 415 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVID--LNQRL--------YGTS--IPYWIV 462
E ++ W+A GP++V ++A+ Y+GGV+ ++++L Y S +PYWI+
Sbjct: 122 -EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTSCVSEQLDHGVLLVGYNDSAAVPYWII 180
Query: 463 KNSWGSDWGEK 473
KNSW + WGE+
Sbjct: 181 KNSWTTQWGEE 191
Score = 169 bits (431), Expect = 6e-49
Identities = 47/164 (28%), Positives = 85/164 (51%), Gaps = 16/164 (9%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYII--DNGGVVSDQAYPYKASESE 548
L + L+ + LV CD ++ GC+GG M++A ++I+ +NG V ++ +YPY + E
Sbjct: 42 GHPLTN----LSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGI 97
Query: 549 RGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQ 608
+ + +P E ++ W+A GP++V ++A+ Y+GGV+
Sbjct: 98 SPPCTTSGHTVGATITGHVELPQD-EAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT--- 153
Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
C + +H +++VGY + YWI+KNSW + WGE+
Sbjct: 154 -SCVSEQLDHGVLLVGYNDSAAVP-----YWIIKNSWTTQWGEE 191
Score = 80.6 bits (200), Expect = 2e-17
Identities = 27/50 (54%), Positives = 34/50 (68%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P A DWRA G ++ VK+QG+C CWAFSA+G VE + G+ LT LS Q
Sbjct: 2 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQ 51
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 229 bits (587), Expect = 3e-70
Identities = 91/301 (30%), Positives = 138/301 (45%), Gaps = 52/301 (17%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV---FGVNKFFDLSESDLQQL-T 254
H+++Y E+ RR + N++ E + E +N F D++ + +Q+
Sbjct: 18 MHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMN 76
Query: 255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
G N+ + + FQ + P + DWR +G ++ VK Q
Sbjct: 77 GFQ------------------NRKPRKGKVFQE---PLFYEAPRSVDWREKGYVTPVKNQ 115
Query: 315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIID 372
G+C CWAFSA G +E + L LS Q LVDC G GCNGG MD A QY+ D
Sbjct: 116 GQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD 175
Query: 373 NGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVG 432
NGG+ S+++YPY+A+E C + + IP +E+ + K VAT GP+SV
Sbjct: 176 NGGLDSEESYPYEATEES--CKYNPKYS-VANDAGFVDIP-KQEKALMKAVATVGPISVA 231
Query: 433 MNANGL-F-YYSGGVID----LNQRL--------YGTSI------PYWIVKNSWGSDWGE 472
++A F +Y G+ ++ + YG YW+VKNSWG +WG
Sbjct: 232 IDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGM 291
Query: 473 K 473
Sbjct: 292 G 292
Score = 156 bits (396), Expect = 6e-43
Identities = 65/167 (38%), Positives = 94/167 (56%), Gaps = 17/167 (10%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
TG L S LS E+ LVDC G GCNGG MD A QY+ DNGG+ S+++YPY+A+E
Sbjct: 138 TGRLIS-LS----EQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE 192
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID 605
C + + IP +E+ + K VAT GP+SV ++A F +Y G+
Sbjct: 193 S--CKYNPKYS-VANDAGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYF 248
Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C+ + +H +++VGYG E + + YW+VKNSWG +WG
Sbjct: 249 --EPDCSSEDMDHGVLVVGYGFESTESDNN-KYWLVKNSWGEEWGMG 292
Score = 107 bits (270), Expect = 6e-26
Identities = 40/160 (25%), Positives = 69/160 (43%), Gaps = 26/160 (16%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFE--VNKFFDLSD 99
++ + H+++Y E+ RR + N++ E + +E G F +N F D++
Sbjct: 10 AQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTS 68
Query: 100 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 158
+ +Q+ G N+ + + FQ + P + DWR +G
Sbjct: 69 EEFRQVMNGFQ------------------NRKPRKGKVFQE---PLFYEAPRSVDWREKG 107
Query: 159 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
++ VK QG+C CWAFSA G +E + L LS Q
Sbjct: 108 YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 147
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 225 bits (576), Expect = 5e-70
Identities = 66/190 (34%), Positives = 99/190 (52%), Gaps = 18/190 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
+P + DWR +G ++ V+ QG C CW FS+V VE ++ I L LS Q+L+DC+ +
Sbjct: 1 IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCERRS 60
Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
GC GG ALQY+ N G+ Q YPY+ + + C + +G KVK R+P
Sbjct: 61 YGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQ--CRASQAKGPKVKTDGVGRVPRNN 117
Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTSIPYWIVK 463
E+ + + +A P+S+ + A G F Y GG+ + YG Y ++K
Sbjct: 118 EQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN--DYILIK 174
Query: 464 NSWGSDWGEK 473
NSWG+ WGE
Sbjct: 175 NSWGTGWGEG 184
Score = 156 bits (397), Expect = 3e-44
Identities = 50/149 (33%), Positives = 75/149 (50%), Gaps = 19/149 (12%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
L+DC+ + GC GG ALQY+ N G+ Q YPY+ + + C + +G KVK
Sbjct: 53 LLDCERRSYGCRGGFPLYALQYVA-NSGIHLRQYYPYEGVQRQ--CRASQAKGPKVKTDG 109
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
R+P E+ + + +A P+S+ + A G F Y GG+ C +HA+ V
Sbjct: 110 VGRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGP---CGTSI-DHAVAAV 164
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG + Y ++KNSWG+ WGE
Sbjct: 165 GYGND---------YILIKNSWGTGWGEG 184
Score = 84.0 bits (209), Expect = 1e-18
Identities = 21/51 (41%), Positives = 30/51 (58%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+P + DWR +G ++ V+ QG C CW FS+V VE ++ I L LS Q
Sbjct: 1 IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQ 51
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 224 bits (574), Expect = 1e-69
Identities = 72/192 (37%), Positives = 100/192 (52%), Gaps = 18/192 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
LPE DWR +G ++ V+ QG C CWAFSAV VE ++ I+ L ELS Q+LVDC+ +
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS 60
Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
GC GG AL+Y+ N G+ YPYKA + C + G VK R+
Sbjct: 61 HGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT--CRAKQVGGPIVKTSGVGRVQPNN 117
Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPYWI 461
E + +A P+SV + + G F Y GG+ + ++ YG S Y +
Sbjct: 118 EGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYIL 176
Query: 462 VKNSWGSDWGEK 473
+KNSWG+ WGEK
Sbjct: 177 IKNSWGTAWGEK 188
Score = 161 bits (411), Expect = 3e-46
Identities = 56/164 (34%), Positives = 80/164 (48%), Gaps = 19/164 (11%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
TG L L+ ++LVDC+ + GC GG AL+Y+ N G+ YPYKA +
Sbjct: 42 TGKLVE----LSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGT-- 94
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQ 608
C + G VK R+ E + +A P+SV + + G F Y GG+ +
Sbjct: 95 CRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGIFEGP- 152
Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
C K +HA+ VGYG+ K Y ++KNSWG+ WGEK
Sbjct: 153 --CGTKV-DHAVTAVGYGKSGGKG-----YILIKNSWGTAWGEK 188
Score = 83.3 bits (207), Expect = 2e-18
Identities = 26/51 (50%), Positives = 33/51 (64%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LPE DWR +G ++ V+ QG C CWAFSAV VE ++ I+ L ELS Q
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQ 51
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 224 bits (573), Expect = 2e-69
Identities = 64/190 (33%), Positives = 94/190 (49%), Gaps = 18/190 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
+PE DWR +G ++ VK QG C CWAFSAV +E + I+ +L + S Q+L+DCD +
Sbjct: 1 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRS 60
Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
GCNGG ALQ + G+ YPY+ + C E+ + K ++
Sbjct: 61 YGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRY--CRSREKGPYAAKTDGVRQVQPYN 117
Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTSIPYWIVK 463
E + +A P+SV + A G F Y GG+ ++ YG Y ++K
Sbjct: 118 EGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGP--NYILIK 174
Query: 464 NSWGSDWGEK 473
NSWG+ WGE
Sbjct: 175 NSWGTGWGEN 184
Score = 154 bits (392), Expect = 1e-43
Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 19/149 (12%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
L+DCD + GCNGG ALQ + G+ YPY+ + C E+ + K
Sbjct: 53 LLDCDRRSYGCNGGYPWSALQLVA-QYGIHYRNTYPYEGVQRY--CRSREKGPYAAKTDG 109
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
++ E + +A P+SV + A G F Y GG+ C K +HA+ V
Sbjct: 110 VRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP---CGNKV-DHAVAAV 164
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG Y ++KNSWG+ WGE
Sbjct: 165 GYGPN---------YILIKNSWGTGWGEN 184
Score = 84.0 bits (209), Expect = 1e-18
Identities = 24/51 (47%), Positives = 32/51 (62%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+PE DWR +G ++ VK QG C CWAFSAV +E + I+ NL + S Q
Sbjct: 1 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQ 51
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 224 bits (573), Expect = 2e-69
Identities = 78/194 (40%), Positives = 104/194 (53%), Gaps = 18/194 (9%)
Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
DDLP++ DWR G + VK QG C CWAFS V VE ++ I L LS QQLVDC
Sbjct: 1 DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT 60
Query: 354 SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
+N GC GG M+ A Q+I++NGG+ S++ YPY+ + C V + Y +P
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGI--CNSTVNAP-VVSIDSYENVPS 117
Query: 414 GEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS--IPY 459
E+ ++K VA P+SV M+A G F Y G+ + YGT +
Sbjct: 118 HNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDF 176
Query: 460 WIVKNSWGSDWGEK 473
WIVKNSWG +WGE
Sbjct: 177 WIVKNSWGKNWGES 190
Score = 157 bits (400), Expect = 1e-44
Identities = 62/149 (41%), Positives = 84/149 (56%), Gaps = 15/149 (10%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
LVDC +N GC GG M+ A Q+I++NGG+ S++ YPY+ + C V +
Sbjct: 55 LVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGI--CNSTVNAP-VVSIDS 111
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
Y +P E+ ++K VA P+SV M+A G F Y G+ + CN A NHAL +V
Sbjct: 112 YENVPSHNEQSLQKAVAN-QPVSVTMDAAGRDFQLYRSGIFTGS---CNISA-NHALTVV 166
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG E KD +WIVKNSWG +WGE
Sbjct: 167 GYGTENDKD-----FWIVKNSWGKNWGES 190
Score = 87.6 bits (218), Expect = 7e-20
Identities = 26/53 (49%), Positives = 32/53 (60%)
Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
DDLP++ DWR G + VK QG C CWAFS V VE ++ I +L LS Q
Sbjct: 1 DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQ 53
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 223 bits (570), Expect = 4e-69
Identities = 67/189 (35%), Positives = 105/189 (55%), Gaps = 18/189 (9%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
PE+ DWR +G ++ VK Q C CWAFS V +E ++ I L LS Q+L+DC+ +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRSH 61
Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
GC+GG +LQY++DN GV +++ YPY+ + C +++G KV + Y +P +E
Sbjct: 62 GCDGGYQTTSLQYVVDN-GVHTEREYPYEKKQGR--CRAKDKKGPKVYITGYKYVPANDE 118
Query: 417 EEMKKWVATRGPLSVGMNANGL-F-YYSGGVID--LNQRL--------YGTSIPYWIVKN 464
+ + +A P+SV ++ G F +Y GG+ + YG Y ++KN
Sbjct: 119 ISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGK--TYLLLKN 175
Query: 465 SWGSDWGEK 473
SWG +WGEK
Sbjct: 176 SWGPNWGEK 184
Score = 156 bits (398), Expect = 3e-44
Identities = 50/149 (33%), Positives = 83/149 (55%), Gaps = 19/149 (12%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
L+DC+ + GC+GG +LQY++DN GV +++ YPY+ + C +++G KV +
Sbjct: 53 LLDCERRSHGCDGGYQTTSLQYVVDN-GVHTEREYPYEKKQGR--CRAKDKKGPKVYITG 109
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
Y +P +E + + +A P+SV ++ G F +Y GG+ + C +HA+ V
Sbjct: 110 YKYVPANDEISLIQAIAN-QPVSVVTDSRGRGFQFYKGGIYEGP---CGTN-TDHAVTAV 164
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG+ Y ++KNSWG +WGEK
Sbjct: 165 GYGKT---------YLLLKNSWGPNWGEK 184
Score = 82.1 bits (204), Expect = 6e-18
Identities = 22/50 (44%), Positives = 29/50 (58%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
PE+ DWR +G ++ VK Q C CWAFS V +E ++ I L LS Q
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQ 51
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 220 bits (563), Expect = 5e-68
Identities = 69/197 (35%), Positives = 92/197 (46%), Gaps = 23/197 (11%)
Query: 297 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
P + DWR +G +S VK QG C CW FS G +E+ AI + L+ QQLVDC +
Sbjct: 2 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61
Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
N GC GG A +YI N G++ + YPYK + C ++ VK+ + I
Sbjct: 62 NNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDH--CKFQPDKA-IAFVKDVANITM 118
Query: 414 GEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID------LNQRL--------YGTS-- 456
+EE M + VA P+S F Y G+ ++ YG
Sbjct: 119 NDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENG 178
Query: 457 IPYWIVKNSWGSDWGEK 473
IPYWIVKNSWG WG
Sbjct: 179 IPYWIVKNSWGPQWGMN 195
Score = 170 bits (434), Expect = 2e-49
Identities = 58/166 (34%), Positives = 77/166 (46%), Gaps = 17/166 (10%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
TG + S L+ E+ LVDC + N GC GG A +YI N G++ + YPYK +
Sbjct: 43 TGKMLS-LA----EQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDD 97
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDL 606
C ++ VK+ + I +EE M + VA P+S F Y G+
Sbjct: 98 H--CKFQPDKA-IAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSS 154
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
P NHA++ VGYGEE YWIVKNSWG WG
Sbjct: 155 TSCHKTPDKVNHAVLAVGYGEENGIP-----YWIVKNSWGPQWGMN 195
Score = 75.2 bits (186), Expect = 1e-15
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 149 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P + DWR +G +S VK QG C CW FS G +E+ AI + L+ Q
Sbjct: 2 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 52
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 219 bits (561), Expect = 1e-67
Identities = 74/193 (38%), Positives = 102/193 (52%), Gaps = 19/193 (9%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
P++ D+R +G ++ VK QG+C CWAFS+VG +E + L LS Q LVDC N
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 61
Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
GC GG M +A QY+ N G+ S+ AYPY E C+ K + Y IP G E
Sbjct: 62 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES--CMYNPTGK-AAKCRGYREIPEGNE 118
Query: 417 EEMKKWVATRGPLSVGMNANG--LFYYSGGVID----LNQRL--------YGTS--IPYW 460
+ +K+ VA GP+SV ++A+ +YS GV + L YG +W
Sbjct: 119 KALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHW 178
Query: 461 IVKNSWGSDWGEK 473
I+KNSWG +WG K
Sbjct: 179 IIKNSWGENWGNK 191
Score = 165 bits (420), Expect = 2e-47
Identities = 64/165 (38%), Positives = 87/165 (52%), Gaps = 18/165 (10%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
TG L + LS + LVDC N GC GG M +A QY+ N G+ S+ AYPY E
Sbjct: 42 TGKLLN-LS----PQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES- 95
Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLN 607
C+ K + Y IP G E+ +K+ VA GP+SV ++A+ +YS GV
Sbjct: 96 -CMYNPTGK-AAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGV--YY 151
Query: 608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
CN NHA++ VGYG ++ +WI+KNSWG +WG K
Sbjct: 152 DESCNSDNLNHAVLAVGYGIQKGNK-----HWIIKNSWGENWGNK 191
Score = 79.8 bits (198), Expect = 3e-17
Identities = 21/50 (42%), Positives = 31/50 (62%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P++ D+R +G ++ VK QG+C CWAFS+VG +E + L LS Q
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQ 51
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 219 bits (561), Expect = 1e-67
Identities = 79/194 (40%), Positives = 111/194 (57%), Gaps = 19/194 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
+P + DWR +G ++ VK+QG+C CWAFS + VE ++ I+ N L LS Q+LVDCD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
N GCNGG MD A ++I GG+ ++ YPY+A + C V +E V + + +P
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGT--CDVSKENAPAVSIDGHENVPEN 119
Query: 415 EEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS---IPY 459
+E + K VA P+SV ++A G F +YS GV + L YGT+ Y
Sbjct: 120 DENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKY 178
Query: 460 WIVKNSWGSDWGEK 473
W VKNSWG +WGEK
Sbjct: 179 WTVKNSWGPEWGEK 192
Score = 157 bits (399), Expect = 2e-44
Identities = 62/150 (41%), Positives = 84/150 (56%), Gaps = 14/150 (9%)
Query: 506 LVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVK 564
LVDCD N GCNGG MD A ++I GG+ ++ YPY+A + C V +E V +
Sbjct: 54 LVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGT--CDVSKENAPAVSID 111
Query: 565 EYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALII 622
+ +P +E + K VA P+SV ++A G F +YS GV + C + +H + I
Sbjct: 112 GHENVPENDENALLKAVAN-QPVSVAIDAGGSDFQFYSEGVFTGS---CGTE-LDHGVAI 166
Query: 623 VGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
VGYG DGT YW VKNSWG +WGEK
Sbjct: 167 VGYGTTI--DGT--KYWTVKNSWGPEWGEK 192
Score = 84.1 bits (209), Expect = 1e-18
Identities = 23/51 (45%), Positives = 34/51 (66%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+P + DWR +G ++ VK+QG+C CWAFS + VE ++ I+ N L LS Q
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQ 52
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 219 bits (560), Expect = 1e-67
Identities = 76/192 (39%), Positives = 108/192 (56%), Gaps = 19/192 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
LP DWR++G ++ +K Q +C CWAFSAV VE+++ I+ L LS Q+LVDCD ++
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60
Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
GCNGG M++A QYII NGG+ + Q YPY A + C V + + R+
Sbjct: 61 HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGS--CKPYRLRV--VSINGFQRVTRNN 116
Query: 416 EEEMKKWVATRGPLSVGMNANGL-F-YYSGGVID------LNQRL----YGTS--IPYWI 461
E ++ VA+ P+SV + A G F +YS G+ N + YGT YWI
Sbjct: 117 ESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWI 175
Query: 462 VKNSWGSDWGEK 473
V+NSWG +WG +
Sbjct: 176 VRNSWGQNWGNQ 187
Score = 155 bits (394), Expect = 8e-44
Identities = 60/149 (40%), Positives = 84/149 (56%), Gaps = 16/149 (10%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
LVDCD ++ GCNGG M++A QYII NGG+ + Q YPY A + C V +
Sbjct: 53 LVDCDTASHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGS--CKPYRLRV--VSING 108
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
+ R+ E ++ VA+ P+SV + A G F +YS G+ C AQNH ++IV
Sbjct: 109 FQRVTRNNESALQSAVAS-QPVSVTVEAAGAPFQHYSSGIFTGP---CG-TAQNHGVVIV 163
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG + K+ YWIV+NSWG +WG +
Sbjct: 164 GYGTQSGKN-----YWIVRNSWGQNWGNQ 187
Score = 83.7 bits (208), Expect = 2e-18
Identities = 23/51 (45%), Positives = 33/51 (64%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LP DWR++G ++ +K Q +C CWAFSAV VE+++ I+ L LS Q
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQ 51
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 218 bits (557), Expect = 5e-67
Identities = 54/204 (26%), Positives = 88/204 (43%), Gaps = 24/204 (11%)
Query: 288 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQ 347
N+ + P D R ++ ++ QG C WAFS V E+ + +L+ Q+
Sbjct: 2 NACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQE 61
Query: 348 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 407
LVDC S GC+G + ++YI N GVV + Y Y A E C + +
Sbjct: 62 LVDCA-SQHGCHGDTIPRGIEYIQHN-GVVQESYYRYVAREQS--CRRPNAQR--FGISN 115
Query: 408 YSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL----NQRL------- 452
Y +I ++++ +A T ++V + L +Y G I Q
Sbjct: 116 YCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPNYHAVNIV 175
Query: 453 -YGTS--IPYWIVKNSWGSDWGEK 473
Y + + YWIV+NSW ++WG+
Sbjct: 176 GYSNAQGVDYWIVRNSWDTNWGDN 199
Score = 157 bits (400), Expect = 2e-44
Identities = 48/166 (28%), Positives = 74/166 (44%), Gaps = 21/166 (12%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
LA ++LVDC S GC+G + ++YI NG VV + Y Y A E
Sbjct: 51 RQQSLD----LAEQELVDCA-SQHGCHGDTIPRGIEYIQHNG-VVQESYYRYVAREQS-- 102
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVA-TRGPLSVGMNANGLF---YYSGGVIDL 606
C + + Y +I ++++ +A T ++V + L +Y G I
Sbjct: 103 CRRPNAQR--FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTII- 159
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
QR + HA+ IVGY + D YWIV+NSW ++WG+
Sbjct: 160 -QRDNGYQPNYHAVNIVGYSNAQGVD-----YWIVRNSWDTNWGDN 199
Score = 86.4 bits (215), Expect = 2e-19
Identities = 15/59 (25%), Positives = 25/59 (42%)
Query: 140 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
N+ + P D R ++ ++ QG C WAFS V E+ + +L+ Q
Sbjct: 2 NACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQ 60
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 219 bits (559), Expect = 7e-67
Identities = 80/196 (40%), Positives = 113/196 (57%), Gaps = 18/196 (9%)
Query: 295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
DLP + DWR +G ++ VK+QGKC CWAFS V VE ++AI+ SL LS Q+L+DCD +
Sbjct: 3 DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTA 62
Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-RGCLVGEEEGFKVKVKEYSRIP 412
N GC GG MD+A +YI +NGG++++ AYPY+A+ + V + + +P
Sbjct: 63 DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122
Query: 413 YGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS---I 457
EE++ + VA P+SV + A+G F +YS GV L YG +
Sbjct: 123 ANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGK 181
Query: 458 PYWIVKNSWGSDWGEK 473
YW VKNSWG WGE+
Sbjct: 182 AYWTVKNSWGPSWGEQ 197
Score = 154 bits (392), Expect = 5e-43
Identities = 58/151 (38%), Positives = 85/151 (56%), Gaps = 13/151 (8%)
Query: 506 LVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-RGCLVGEEEGFKVKV 563
L+DCD + N GC GG MD+A +YI +NGG++++ AYPY+A+ + V +
Sbjct: 56 LIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHI 115
Query: 564 KEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALI 621
+ +P EE++ + VA P+SV + A+G F +YS GV C + +H +
Sbjct: 116 DGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGE---CGTE-LDHGVA 170
Query: 622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+VGYG E DG YW VKNSWG WGE+
Sbjct: 171 VVGYGVAE--DGK--AYWTVKNSWGPSWGEQ 197
Score = 86.2 bits (214), Expect = 6e-19
Identities = 27/52 (51%), Positives = 36/52 (69%)
Query: 147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
DLP + DWR +G ++ VK+QGKC CWAFS V VE ++AI+ +L LS Q
Sbjct: 3 DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQ 54
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 216 bits (554), Expect = 1e-66
Identities = 75/194 (38%), Positives = 107/194 (55%), Gaps = 19/194 (9%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
LP+ DWR+ G + +K+QG+C WAFS + VE ++ I L LS Q+LVDC +
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPY 413
GC+GG M D Q+II+NGG+ ++ YPY A E + C + ++ V + Y +PY
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ--CNLDLQQEKYVSIDTYENVPY 118
Query: 414 GEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPY 459
E ++ VA P+SV + A G F +YS G+ + YGT I Y
Sbjct: 119 NNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDY 177
Query: 460 WIVKNSWGSDWGEK 473
WIVKNSWG+ WGE+
Sbjct: 178 WIVKNSWGTTWGEE 191
Score = 153 bits (390), Expect = 4e-43
Identities = 60/151 (39%), Positives = 83/151 (54%), Gaps = 16/151 (10%)
Query: 506 LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKV 563
LVDC + GC+GG M D Q+II+NGG+ ++ YPY A E + C + ++ V +
Sbjct: 53 LVDCGRTQNTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ--CNLDLQQEKYVSI 110
Query: 564 KEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALI 621
Y +PY E ++ VA P+SV + A G F +YS G+ C +HA+
Sbjct: 111 DTYENVPYNNEWALQTAVA-YQPVSVALEAAGYNFQHYSSGIFTGP---CGTAV-DHAVT 165
Query: 622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
IVGYG E D YWIVKNSWG+ WGE+
Sbjct: 166 IVGYGTEGGID-----YWIVKNSWGTTWGEE 191
Score = 83.3 bits (207), Expect = 3e-18
Identities = 21/51 (41%), Positives = 31/51 (60%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LP+ DWR+ G + +K+QG+C WAFS + VE ++ I +L LS Q
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQ 51
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 216 bits (553), Expect = 2e-66
Identities = 83/196 (42%), Positives = 112/196 (57%), Gaps = 21/196 (10%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
LP++ DWR +G +++VK QG C CWAFSAVG +EA ++ L LS Q LVDC
Sbjct: 2 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61
Query: 355 --NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP 412
N GCNGG M A QYIIDN G+ SD +YPYKA + + C + +Y+ +P
Sbjct: 62 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQK--CQYDSKYR-AATCSKYTELP 118
Query: 413 YGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL---NQRL--------YGTS--I 457
YG E+ +K+ VA +GP+SVG++A F Y GV Q + YG
Sbjct: 119 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGK 178
Query: 458 PYWIVKNSWGSDWGEK 473
YW+VKNSWG ++GE+
Sbjct: 179 EYWLVKNSWGHNFGEE 194
Score = 157 bits (399), Expect = 2e-44
Identities = 69/168 (41%), Positives = 95/168 (56%), Gaps = 22/168 (13%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
TG L S LS + LVDC N GCNGG M A QYIIDN G+ SD +YPYKA +
Sbjct: 43 TGKLVS-LS----AQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMD 97
Query: 547 SERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVI 604
+ C + +Y+ +PYG E+ +K+ VA +GP+SVG++A F Y GV
Sbjct: 98 QK--CQYDSKYR-AATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV- 153
Query: 605 DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C NH +++VGYG+ K+ YW+VKNSWG ++GE+
Sbjct: 154 -YYEPSCTQNV-NHGVLVVGYGDLNGKE-----YWLVKNSWGHNFGEE 194
Score = 81.4 bits (202), Expect = 1e-17
Identities = 25/51 (49%), Positives = 34/51 (66%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LP++ DWR +G +++VK QG C CWAFSAVG +EA ++ L LS Q
Sbjct: 2 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQ 52
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 216 bits (552), Expect = 2e-66
Identities = 80/191 (41%), Positives = 107/191 (56%), Gaps = 18/191 (9%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
P++ DWRA+G ++ VK QG C CWAFS + VE ++ I +L ELS Q+LVDCD +
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSY 61
Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
GC GG +LQY+ N GV + + YPY+A + + C ++ G KVK+ Y R+P E
Sbjct: 62 GCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYK--CRATDKPGPKVKITGYKRVPSNCE 118
Query: 417 EEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL--NQRL--------YGTS--IPYWIV 462
+A PLSV + A G F Y GV D +L YGTS Y I+
Sbjct: 119 TSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIII 177
Query: 463 KNSWGSDWGEK 473
KNSWG +WGEK
Sbjct: 178 KNSWGPNWGEK 188
Score = 155 bits (394), Expect = 1e-43
Identities = 60/149 (40%), Positives = 80/149 (53%), Gaps = 15/149 (10%)
Query: 506 LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 565
LVDCD + GC GG +LQY+ N GV + + YPY+A + + C ++ G KVK+
Sbjct: 53 LVDCDKHSYGCKGGYQTTSLQYVA-NNGVHTSKVYPYQAKQYK--CRATDKPGPKVKITG 109
Query: 566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNPKAQNHALIIV 623
Y R+P E +A PLSV + A G F Y GV D C K +HA+ V
Sbjct: 110 YKRVPSNCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGP---CGTKL-DHAVTAV 164
Query: 624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
GYG + K+ Y I+KNSWG +WGEK
Sbjct: 165 GYGTSDGKN-----YIIIKNSWGPNWGEK 188
Score = 81.8 bits (203), Expect = 8e-18
Identities = 25/50 (50%), Positives = 33/50 (66%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P++ DWRA+G ++ VK QG C CWAFS + VE ++ I NL ELS Q
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQ 51
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 214 bits (548), Expect = 9e-66
Identities = 74/198 (37%), Positives = 106/198 (53%), Gaps = 22/198 (11%)
Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
+LP DWR+ G ++ VK+Q C CWAFS G +E H + L LS Q+L+DC
Sbjct: 5 SELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSR 64
Query: 354 SNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRI 411
+ G C+GG M+DA QY++D+GG+ S+ AYPY A + E C E VK+ + +
Sbjct: 65 AEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEE--CRAQSCEK-VVKILGFKDV 121
Query: 412 PYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLN--QRL--------YGTS--- 456
P E MK +A P+S+ + A+ + F +Y GV D + L YGT
Sbjct: 122 PRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKES 180
Query: 457 -IPYWIVKNSWGSDWGEK 473
+WI+KNSWG+ WG
Sbjct: 181 KKDFWIMKNSWGTGWGRD 198
Score = 150 bits (380), Expect = 8e-42
Identities = 59/166 (35%), Positives = 93/166 (56%), Gaps = 19/166 (11%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
TG L S L+ ++L+DC + G C+GG M+DA QY++D+GG+ S+ AYPY A + E
Sbjct: 48 TGKLVS----LSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEE 103
Query: 549 RGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDL 606
C E VK+ + +P E MK +A P+S+ + A+ + F +Y GV D
Sbjct: 104 --CRAQSCEK-VVKILGFKDVPRRSEAAMKAALAK-SPVSIAIEADQMPFQFYHEGVFDA 159
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C +H +++VGYG +++ +WI+KNSWG+ WG
Sbjct: 160 S---CGTDL-DHGVLLVGYGTDKES---KKDFWIMKNSWGTGWGRD 198
Score = 84.1 bits (209), Expect = 1e-18
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+LP DWR+ G ++ VK+Q C CWAFS G +E H + L LS Q
Sbjct: 5 SELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQ 57
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 213 bits (545), Expect = 1e-65
Identities = 75/188 (39%), Positives = 105/188 (55%), Gaps = 16/188 (8%)
Query: 296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
LPE DWR +G ++ VK QG C CWAFS V VE+++ I+ +L LS Q+LVDCD N
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60
Query: 356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGE 415
GC GG A QYII+NGG+ + YPYKA + C + V + Y+ +P+
Sbjct: 61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP--CQAASK---VVSIDGYNGVPFCN 115
Query: 416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGV--------IDLNQRLYGTSIPYWIVKNS 465
E +K+ VA P +V ++A+ + YS G+ ++ + G YWIV+NS
Sbjct: 116 EXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQANYWIVRNS 174
Query: 466 WGSDWGEK 473
WG WGEK
Sbjct: 175 WGRYWGEK 182
Score = 142 bits (361), Expect = 2e-39
Identities = 60/164 (36%), Positives = 82/164 (50%), Gaps = 25/164 (15%)
Query: 491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
TG L S L+ ++LVDCD N GC GG A QYII+NGG+ + YPYKA +
Sbjct: 42 TGNLIS----LSEQELVDCDKKNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP-- 95
Query: 551 CLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQ 608
C + V + Y+ +P+ E +K+ VA P +V ++A+ + YS G+
Sbjct: 96 CQAASK---VVSIDGYNGVPFCNEXALKQAVA-VQPSTVAIDASSAQFQQYSSGIFS--- 148
Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
NH + IVGY YWIV+NSWG WGEK
Sbjct: 149 -GPCGTKLNHGVTIVGYQAN---------YWIVRNSWGRYWGEK 182
Score = 83.5 bits (207), Expect = 2e-18
Identities = 26/51 (50%), Positives = 33/51 (64%)
Query: 148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LPE DWR +G ++ VK QG C CWAFS V VE+++ I+ NL LS Q
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQ 51
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 209 bits (534), Expect = 8e-64
Identities = 74/199 (37%), Positives = 103/199 (51%), Gaps = 26/199 (13%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-- 354
P + DWR +G ++ VK QG+C CWAFSA G +E + L LS Q LVDC
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYG 414
N GCNGG MD A QY+ DNGG+ S+++YPY+A+E C + + IP
Sbjct: 62 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES--CKYNPKYS-VANDTGFVDIP-K 117
Query: 415 EEEEMKKWVATRGPLSVGMNANGLF--YYSGGVID----LNQRL--------YGTS---- 456
+E+ + K VAT GP+SV ++A +Y G+ ++ + YG
Sbjct: 118 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 177
Query: 457 --IPYWIVKNSWGSDWGEK 473
YW+VKNSWG +WG
Sbjct: 178 DNNKYWLVKNSWGEEWGMG 196
Score = 156 bits (397), Expect = 3e-44
Identities = 64/167 (38%), Positives = 93/167 (55%), Gaps = 17/167 (10%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
TG L S LS E+ LVDC N GCNGG MD A QY+ DNGG+ S+++YPY+A+E
Sbjct: 42 TGRLIS-LS----EQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE 96
Query: 548 ERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF--YYSGGVID 605
C + + IP +E+ + K VAT GP+SV ++A +Y G+
Sbjct: 97 S--CKYNPKYS-VANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGI-- 150
Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ C+ + +H +++VGYG E + + YW+VKNSWG +WG
Sbjct: 151 YFEPDCSSEDMDHGVLVVGYGFESTESDNN-KYWLVKNSWGEEWGMG 196
Score = 79.5 bits (197), Expect = 5e-17
Identities = 22/50 (44%), Positives = 29/50 (58%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P + DWR +G ++ VK QG+C CWAFSA G +E + L LS Q
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 51
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 215 bits (549), Expect = 3e-63
Identities = 62/306 (20%), Positives = 113/306 (36%), Gaps = 50/306 (16%)
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL 258
+ + + + + + + + ++ L+ D+ + +G
Sbjct: 126 YVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG--- 182
Query: 259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR---AEGVISKVKEQG 315
S + + LP ++DWR +S V+ Q
Sbjct: 183 -------------GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQA 229
Query: 316 KCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
C C++F+++G++EA I N+ LS Q++V C GC GG +
Sbjct: 230 SCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQD 289
Query: 374 GGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIP----YGEEEEMKKWVATRGPL 429
G+V + +PY ++S C + +E+ F+ EY + E MK + GP+
Sbjct: 290 FGLVEEACFPYTGTDSP--CKM-KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPM 346
Query: 430 SVGMNA-NGLFYYSGGV---------IDLNQRL--------YGTS----IPYWIVKNSWG 467
+V + +Y G+ + + YGT + YWIVKNSWG
Sbjct: 347 AVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWG 406
Query: 468 SDWGEK 473
+ WGE
Sbjct: 407 TGWGEN 412
Score = 163 bits (414), Expect = 3e-44
Identities = 46/164 (28%), Positives = 74/164 (45%), Gaps = 14/164 (8%)
Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEE 556
+ L+ +++V C GC GG + G+V + +PY ++S C + +E
Sbjct: 255 QTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSP--CKM-KE 311
Query: 557 EGFKVKVKEYSRIP----YGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV---IDLNQ 608
+ F+ EY + E MK + GP++V + +Y G+ L
Sbjct: 312 DCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRD 371
Query: 609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ NHA+++VGYG + G YWIVKNSWG+ WGE
Sbjct: 372 PFNPFELTNHAVLLVGYGTDS-ASGM--DYWIVKNSWGTGWGEN 412
Score = 82.1 bits (203), Expect = 8e-17
Identities = 24/153 (15%), Positives = 49/153 (32%), Gaps = 21/153 (13%)
Query: 52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLD 111
+ + + + + + ++ L+ D+ + +G
Sbjct: 127 VNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG---- 182
Query: 112 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR---AEGVISKVKEQGK 168
S + + LP ++DWR +S V+ Q
Sbjct: 183 ------------GHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHGINFVSPVRNQAS 230
Query: 169 CACCWAFSAVGVVEAMHAIQGNNLT--ELSVQH 199
C C++F+++G++EA I NN LS Q
Sbjct: 231 CGSCYSFASMGMLEARIRILTNNSQTPILSPQE 263
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 201 bits (512), Expect = 5e-60
Identities = 42/227 (18%), Positives = 78/227 (34%), Gaps = 47/227 (20%)
Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
+ D +V++QG C W F++ +E + ++G T++S + +C
Sbjct: 8 EYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYK 67
Query: 354 S--NGGCNGGRMDDA-LQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGF--------- 401
C+ G LQ I D G + ++ YPY + C E+
Sbjct: 68 GEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKIL 127
Query: 402 ---------------KVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL--FYYSGG 444
+ + + + +K V +G + + A + + +SG
Sbjct: 128 HNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGK 187
Query: 445 VIDLN---QRL--------YGTSI-------PYWIVKNSWGSDWGEK 473
+ YG + YWIV+NSWG WG++
Sbjct: 188 KVKNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDE 234
Score = 155 bits (395), Expect = 2e-43
Identities = 40/175 (22%), Positives = 63/175 (36%), Gaps = 30/175 (17%)
Query: 506 LVDCDMS--NGGCNGGRMDDA-LQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGF--- 559
+ +C C+ G LQ I D G + ++ YPY + C E+
Sbjct: 62 VANCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLW 121
Query: 560 ---------------------KVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANG-LF 597
+ + + + +K V +G + + A +
Sbjct: 122 DNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMG 181
Query: 598 YYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
Y G N LC +HA+ IVGYG +G YWIV+NSWG WG++
Sbjct: 182 YEFSGKKVKN--LCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDE 234
Score = 77.4 bits (191), Expect = 5e-16
Identities = 11/53 (20%), Positives = 23/53 (43%)
Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
+ D +V++QG C W F++ +E + ++G T++S
Sbjct: 8 EYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISAL 60
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 192 bits (489), Expect = 5e-56
Identities = 60/278 (21%), Positives = 97/278 (34%), Gaps = 59/278 (21%)
Query: 238 GVNKFF-DLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDL 296
+ +++ + ++L G+ N + + +F L
Sbjct: 29 KYDGVMQNITLREAKRLNGV----------------IKKNNNASILPKRRFTEEEARAPL 72
Query: 297 PEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ-GNSLTELSVQQLVDC 351
P +FD W I ++ +Q C CWA +A + G +S L+ C
Sbjct: 73 PSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLAC 132
Query: 352 DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKA------SESERGCLVGEEEGFK-- 402
G GCNGG D A Y G+VSD PY S+S+ G + F
Sbjct: 133 CSDCGDGCNGGDPDRAWAYFSST-GLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTP 191
Query: 403 -------------VKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV--- 445
V + ++ E++ + + RGP V + Y+ GV
Sbjct: 192 KCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYHH 251
Query: 446 ----IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
RL +GT +PYW + NSW ++WG
Sbjct: 252 VSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMD 289
Score = 139 bits (353), Expect = 7e-37
Identities = 44/179 (24%), Positives = 69/179 (38%), Gaps = 32/179 (17%)
Query: 497 KLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKA------SESER 549
+ ++ L+ C G GCNGG D A Y G +VSD PY S+S+
Sbjct: 120 QDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTG-LVSDYCQPYPFPHCSHHSKSKN 178
Query: 550 GCLVGEEEGFK---------------VKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA- 593
G + F V + ++ E++ + + RGP V +
Sbjct: 179 GYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVY 238
Query: 594 NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
Y+ GV + HA+ +VG+G +G PYW + NSW ++WG
Sbjct: 239 EDFIAYNSGVY---HHVSGQYLGGHAVRLVGWGTS---NGV--PYWKIANSWNTEWGMD 289
Score = 77.8 bits (192), Expect = 8e-16
Identities = 18/113 (15%), Positives = 35/113 (30%), Gaps = 21/113 (18%)
Query: 92 NKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEA 151
+++ + ++L G+ N + + +F LP +
Sbjct: 32 GVMQNITLREAKRLNGV----------------IKKNNNASILPKRRFTEEEARAPLPSS 75
Query: 152 FD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN-LTELSVQH 199
FD W I ++ +Q C CWA +A + G +S
Sbjct: 76 FDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGD 128
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 181 bits (461), Expect = 2e-52
Identities = 55/214 (25%), Positives = 83/214 (38%), Gaps = 35/214 (16%)
Query: 293 GDDLPEAFDWRAEG---VISKVKEQ---GKCACCWAFSAVGVVEAMHAIQGNSLTE---L 343
DLP+++DWR S + Q C CWA ++ + I+ L
Sbjct: 33 PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLL 92
Query: 344 SVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-------RGCLVG 396
SVQ ++DC + G C GG Y +G + + Y+A + E C
Sbjct: 93 SVQNVIDCG-NAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCGTCNEF 150
Query: 397 EEEGFKVKVKEYSRIPYGE---EEEMKKWVATRGPLSVGMNANGLFY-YSGGV---IDLN 449
+E + YG E+M + GP+S G+ A Y+GG+
Sbjct: 151 KECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDT 210
Query: 450 QRL--------YGTS--IPYWIVKNSWGSDWGEK 473
+ +G S YWIV+NSWG WGE+
Sbjct: 211 TYINHVVSVAGWGISDGTEYWIVRNSWGEPWGER 244
Score = 142 bits (360), Expect = 2e-38
Identities = 44/167 (26%), Positives = 69/167 (41%), Gaps = 21/167 (12%)
Query: 497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-------R 549
+ L+ + ++DC + G C GG Y +G + + Y+A + E
Sbjct: 88 PSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCG 145
Query: 550 GCLVGEEEGFKVKVKEYSRIPYGE---EEEMKKWVATRGPLSVGMNANGLFY-YSGGVID 605
C +E + YG E+M + GP+S G+ A Y+GG+
Sbjct: 146 TCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGI-- 203
Query: 606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ NH + + G+G DGT YWIV+NSWG WGE+
Sbjct: 204 YAEYQDTTYI-NHVVSVAGWGIS---DGT--EYWIVRNSWGEPWGER 244
Score = 64.6 bits (158), Expect = 1e-11
Identities = 15/73 (20%), Positives = 27/73 (36%), Gaps = 13/73 (17%)
Query: 145 GDDLPEAFDWRAEG---VISKVKEQ---GKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
DLP+++DWR S + Q C CWA ++ + I+
Sbjct: 33 PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPS--- 89
Query: 199 HHDKVYSSVEDLL 211
SV++++
Sbjct: 90 ----TLLSVQNVI 98
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 178 bits (455), Expect = 5e-52
Identities = 56/230 (24%), Positives = 88/230 (38%), Gaps = 52/230 (22%)
Query: 295 DLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQL 348
++P +FD R + I+ +++Q +C CWAF AV + IQ G ELS L
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61
Query: 349 VDCDMSNG-GCNGGRMDDALQYIIDNGGV--------VSDQAYPYKASES---------- 389
+ C S G GC GG + A Y + G V + YP+ E
Sbjct: 62 LSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCG 121
Query: 390 ---------ERGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA-N 436
++ C + + + K + Y +E+ ++K + GP+ G
Sbjct: 122 SKIYKTPRCKQTCQKKYKTPYT-QDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYE 180
Query: 437 GLFYYSGGV-------IDLNQ--RL--YGTS--IPYWIVKNSWGSDWGEK 473
Y G+ R+ +G PYW++ NSW DWGE
Sbjct: 181 DFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAPYWLIANSWNEDWGEN 230
Score = 135 bits (343), Expect = 3e-36
Identities = 42/189 (22%), Positives = 70/189 (37%), Gaps = 41/189 (21%)
Query: 496 SKLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGV--------VSDQAYPYKASE 546
+ L+ L+ C S G GC GG + A Y + G V + YP+ E
Sbjct: 51 KQNVELSAVDLLSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCE 110
Query: 547 S-------------------ERGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATR 584
++ C + + + K + Y +E+ ++K +
Sbjct: 111 HHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYT-QDKHRGKSSYNVKNDEKAIQKEIMKY 169
Query: 585 GPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKN 643
GP+ G Y G+ + + HA+ I+G+G E + PYW++ N
Sbjct: 170 GPVEAGFTVYEDFLNYKSGI---YKHITGETLGGHAIRIIGWGVE---NKA--PYWLIAN 221
Query: 644 SWGSDWGEK 652
SW DWGE
Sbjct: 222 SWNEDWGEN 230
Score = 73.0 bits (180), Expect = 1e-14
Identities = 18/59 (30%), Positives = 28/59 (47%), Gaps = 6/59 (10%)
Query: 147 DLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN--LTELSVQH 199
++P +FD R + I+ +++Q +C CWAF AV + IQ ELS
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVD 60
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 174 bits (443), Expect = 8e-51
Identities = 64/197 (32%), Positives = 96/197 (48%), Gaps = 26/197 (13%)
Query: 297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
P + DWR +G ++ VK+QG C CWAF A G +E + AI L +S QQ+VDCD
Sbjct: 2 PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCDTXXX 61
Query: 357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEE 416
GG DDA +++I NGG+ SD YPY + C + + ++ Y+ +P
Sbjct: 62 XXXGGDADDAFRWVITNGGIASDANYPYTGVDGT--CDLNKPIA--ARIDGYTNVP-NSS 116
Query: 417 EEMKKWVATRGPLSVGMNANGL---FYYSGGVI----------DLNQRL----YGTS--- 456
+ VA P+SV + + Y G+ ++ + YG++
Sbjct: 117 SALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGYGSNGTN 175
Query: 457 IPYWIVKNSWGSDWGEK 473
YWIVKNSWG++WG
Sbjct: 176 ADYWIVKNSWGTEWGID 192
Score = 117 bits (295), Expect = 3e-30
Identities = 54/166 (32%), Positives = 81/166 (48%), Gaps = 19/166 (11%)
Query: 491 TGVLPSKLSRLATEK-LVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
TG L S +S E+ +VDCD GG DDA +++I NGG+ SD YPY +
Sbjct: 42 TGRLIS-VS----EQQIVDCDTXXXXXXGGDADDAFRWVITNGGIASDANYPYTGVDGT- 95
Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDL 606
C + + ++ Y+ +P + VA P+SV + + Y G+
Sbjct: 96 -CDLNKPIA--ARIDGYTNVP-NSSSALLDAVAK-QPVSVNIYTSSTSFQLYTGPGIFAG 150
Query: 607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ +P +H ++IVGYG GT+ YWIVKNSWG++WG
Sbjct: 151 SSCSDDPATVDHTVLIVGYGSN----GTNADYWIVKNSWGTEWGID 192
Score = 81.4 bits (201), Expect = 1e-17
Identities = 22/50 (44%), Positives = 30/50 (60%)
Query: 149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
P + DWR +G ++ VK+QG C CWAF A G +E + AI L +S Q
Sbjct: 2 PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQ 51
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 173 bits (441), Expect = 7e-50
Identities = 56/232 (24%), Positives = 84/232 (36%), Gaps = 52/232 (22%)
Query: 293 GDDLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQ 346
LP +FD R + I ++++QG C WAF AV + I N+ E+S +
Sbjct: 4 DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63
Query: 347 QLVDCDMS--NGGCNGGRMDDALQYIIDNGGV------VSDQAYPYKASESE-------- 390
L+ C S GCNGG +A + G V PY E
Sbjct: 64 DLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARP 123
Query: 391 ------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA 435
+ C G +K + K Y Y E+++ + GP+ +
Sbjct: 124 PCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSV 182
Query: 436 NG-LFYYSGGV-------IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
Y GV + R+ +G PYW+V NSW +DWG+
Sbjct: 183 YSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDN 234
Score = 131 bits (332), Expect = 1e-34
Identities = 46/185 (24%), Positives = 68/185 (36%), Gaps = 41/185 (22%)
Query: 500 RLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGV------VSDQAYPYKASESE--- 548
++ E L+ C S GCNGG +A + G V PY E
Sbjct: 59 EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHV 118
Query: 549 -----------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLS 588
+ C G +K + K Y Y E+++ + GP+
Sbjct: 119 NGARPPCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVE 177
Query: 589 VGMNANG-LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGS 647
+ Y GV Q + HA+ I+G+G E +GT PYW+V NSW +
Sbjct: 178 GAFSVYSDFLLYKSGV---YQHVTGEMMGGHAIRILGWGVE---NGT--PYWLVANSWNT 229
Query: 648 DWGEK 652
DWG+
Sbjct: 230 DWGDN 234
Score = 70.4 bits (173), Expect = 1e-13
Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 6/61 (9%)
Query: 145 GDDLPEAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLT--ELSVQ 198
LP +FD R + I ++++QG C WAF AV + I N E+S +
Sbjct: 4 DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAE 63
Query: 199 H 199
Sbjct: 64 D 64
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 173 bits (440), Expect = 4e-49
Identities = 68/288 (23%), Positives = 104/288 (36%), Gaps = 76/288 (26%)
Query: 238 GVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLP 297
G N F+++ S L++L G L + Q LP
Sbjct: 28 GHN-FYNVDMSYLKRLCGTFLGGP---------------------KPPQRVMFTEDLKLP 65
Query: 298 EAFDWRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSL--TELSVQQLVDC 351
+FD R + I ++++QG C CWAF AV + I N+ E+S + L+ C
Sbjct: 66 ASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTC 125
Query: 352 DMS--NGGCNGGRMDDALQYIIDNGGVVSDQAY-------PYKASESE------------ 390
S GCNGG +A + G+VS Y PY E
Sbjct: 126 CGSMCGDGCNGGYPAEAWNFWTRK-GLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTG 184
Query: 391 --------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPLSVGMNA-NGL 438
+ C G +K + K Y Y E+++ + GP+ + +
Sbjct: 185 EGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 243
Query: 439 FYYSGGV-------IDLNQ--RL--YGT--SIPYWIVKNSWGSDWGEK 473
Y GV + R+ +G PYW+V NSW +DWG+
Sbjct: 244 LLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDN 291
Score = 129 bits (327), Expect = 2e-33
Identities = 48/186 (25%), Positives = 72/186 (38%), Gaps = 43/186 (23%)
Query: 500 RLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAY-------PYKASESE-- 548
++ E L+ C S GCNGG +A + G +VS Y PY E
Sbjct: 116 EVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG-LVSGGLYESHVGCRPYSIPPCEHH 174
Query: 549 ------------------RGCLVGEEEGFKVKVKEYSRIPY---GEEEEMKKWVATRGPL 587
+ C G +K + K Y Y E+++ + GP+
Sbjct: 175 VNGSRPPCTGEGDTPKCSKICEPGYSPTYK-QDKHYGYNSYSVSNSEKDIMAEIYKNGPV 233
Query: 588 SVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWG 646
+ + Y GV Q + HA+ I+G+G E +GT PYW+V NSW
Sbjct: 234 EGAFSVYSDFLLYKSGVY---QHVTGEMMGGHAIRILGWGVE---NGT--PYWLVANSWN 285
Query: 647 SDWGEK 652
+DWG+
Sbjct: 286 TDWGDN 291
Score = 70.5 bits (173), Expect = 2e-13
Identities = 26/112 (23%), Positives = 41/112 (36%), Gaps = 27/112 (24%)
Query: 94 FFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 153
F+++ S L++L G L + Q LP +FD
Sbjct: 31 FYNVDMSYLKRLCGTFLGGP---------------------KPPQRVMFTEDLKLPASFD 69
Query: 154 WRAE----GVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNL--TELSVQH 199
R + I ++++QG C CWAF AV + I N E+S +
Sbjct: 70 AREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 121
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 170 bits (432), Expect = 2e-48
Identities = 34/214 (15%), Positives = 69/214 (32%), Gaps = 38/214 (17%)
Query: 294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN--SLTELSVQQLVDC 351
LP D +V +QG+ C A + ++ + +
Sbjct: 55 AALPPKVDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIYYNE 111
Query: 352 --DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE--------------SERGCLV 395
+ + G M ++ GV ++ +PY + S++
Sbjct: 112 RKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKKPSDQ 171
Query: 396 GEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRL- 452
++ K+ EYSR+ + + +K +A P G + + + S V
Sbjct: 172 CYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTKN 230
Query: 453 -------------YGTSIPYWIVKNSWGSDWGEK 473
Y I ++ ++NSWG++ GE
Sbjct: 231 DTLEGGHAVLCVGYDDEIRHFRIRNSWGNNVGED 264
Score = 137 bits (347), Expect = 2e-36
Identities = 30/165 (18%), Positives = 63/165 (38%), Gaps = 26/165 (15%)
Query: 506 LVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE--------------SER 549
+ + + G M ++ GV ++ +PY + S++
Sbjct: 108 YYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWPYGDTPADPRTEEFPPGAPASKK 167
Query: 550 GCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLN 607
++ K+ EYSR+ + + +K +A P G + + + S V
Sbjct: 168 PSDQCYKDAQNYKITEYSRVA-QDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPL 226
Query: 608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+ HA++ VGY +E + ++ ++NSWG++ GE
Sbjct: 227 PTKNDTLEGGHAVLCVGYDDEIR-------HFRIRNSWGNNVGED 264
Score = 61.4 bits (149), Expect = 2e-10
Identities = 8/53 (15%), Positives = 17/53 (32%), Gaps = 3/53 (5%)
Query: 146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
LP D +V +QG+ C A + ++ + + +
Sbjct: 55 AALPPKVDLTPP---FQVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSR 104
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 82.0 bits (202), Expect = 3e-16
Identities = 108/611 (17%), Positives = 168/611 (27%), Gaps = 212/611 (34%)
Query: 63 LRRHENFVTNVEKA-EDYQREDSGTAVFE-VNKFFDLSDSDLQQLTGLNLDSTLEDIQPS 120
L+ E F + + E + +D T E V KF S ++ D L
Sbjct: 33 LQ--EQFNKILPEPTEGFAADDEPTTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLC--- 87
Query: 121 LQAPFSSNQTDTEMRAFQFNSLRHGD--DLPEAFDWRAEGVISKVKEQGKC---ACCWA- 174
+ F+ L D L + + K KE K A A
Sbjct: 88 -------------LTEFENCYLEGNDIHALAAKLLQENDTTLVKTKELIKNYITARIMAK 134
Query: 175 -----------FSAVGVVEA-MHAI---QGNN---LTELSVQHHDKVYSS-VEDLLRRH- 214
F AVG A + AI QGN EL + + Y V DL++
Sbjct: 135 RPFDKKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFEELRDLY--QTYHVLVGDLIKFSA 192
Query: 215 ---ENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLE--DIQPS- 268
+ AE ++ GLN+ LE P
Sbjct: 193 ETLSELIRTTLDAEKVFTQ------------------------GLNILEWLENPSNTPDK 228
Query: 269 ---LQAPFS------------------SNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 307
L P S T E+R++ + H L A
Sbjct: 229 DYLLSIPISCPLIGVIQLAHYVVTAKLLGFTPGELRSYLKGATGHSQGLVTAV------A 282
Query: 308 ISKVK------EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG----- 356
I++ + A F +GV A SL ++ ++ N
Sbjct: 283 IAETDSWESFFVSVRKAITVLFF-IGV-RCYEAYPNTSLPPSILEDSLE----NNEGVPS 336
Query: 357 ---GCNGGRMDDALQYIID-----------------NGG---VVSDQAYPYKASESERGC 393
+ + +Q ++ NG VVS P +S
Sbjct: 337 PMLSISNLT-QEQVQDYVNKTNSHLPAGKQVEISLVNGAKNLVVS--GPP----QS---- 385
Query: 394 LVGEEEGF-KVKV---KEYSRIPYGEEEEMKKWVATR-GPLSVGMNANGLF---YYSGGV 445
L G K K + SRIP+ E K + R P++ F
Sbjct: 386 LYGLNLTLRKAKAPSGLDQSRIPFSER---KLKFSNRFLPVASP------FHSHLLVPAS 436
Query: 446 IDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDL-ELTGVLPSKLSRLATE 504
+N+ L ++ + D V D G DL L+G + ++
Sbjct: 437 DLINKDLVKNNVSF------NAKDIQIPVYD--TFDG---SDLRVLSGSISERIVDCIIR 485
Query: 505 KLVDCDMSNGGCNGGRMDDALQ----YIIDNG-GVVSDQAYPYKASESERGC---LVG-- 554
V + Q +I+D G G S ++ G + G
Sbjct: 486 LPVK------------WETTTQFKATHILDFGPGGASGLGVLTHRNKDGTGVRVIVAGTL 533
Query: 555 -----EEEGFK 560
++ GFK
Sbjct: 534 DINPDDDYGFK 544
Score = 56.2 bits (135), Expect = 3e-08
Identities = 90/533 (16%), Positives = 168/533 (31%), Gaps = 173/533 (32%)
Query: 113 TLE--DIQPSLQAPFSSNQTDTEMRAFQFNSLRH-------GDDLPEAFDWRAE------ 157
TL ++ L P +S ++++ QFN + DD P AE
Sbjct: 10 TLSHGSLEHVLLVPTASFFIASQLQE-QFNKILPEPTEGFAADDEPTT---PAELVGKFL 65
Query: 158 GVISKVKEQGKCACCWAFSAV---GVVEAMHAI-QGNN---LTELSVQHHDKVYSSVEDL 210
G +S + E K F V + E + +GN+ L +Q +D ++L
Sbjct: 66 GYVSSLVEPSKVG---QFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDTTLVKTKEL 122
Query: 211 LRR--HENFVTN--VEKAED---YQSEDSGT----AVFG----VNKFFDLSESDLQQL-- 253
++ + +K + +++ G A+FG + +F+ +L+ L
Sbjct: 123 IKNYITARIMAKRPFDKKSNSALFRAVGEGNAQLVAIFGGQGNTDDYFE----ELRDLYQ 178
Query: 254 --TGLNLDSTLEDIQPSLQAPFSSNQTDTEM---RAFQFNS-LRHGDDLPEAFDWRAEGV 307
L + ++ +L D E + L + + P+ D+
Sbjct: 179 TYHVL-VGDLIKFSAETLS-ELIRTTLDAEKVFTQGLNILEWLENPSNTPDK-DYLLSIP 235
Query: 308 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDAL 367
IS +G + +L +V + G G + L
Sbjct: 236 IS-------------CPLIG------------VIQL-AHYVVTAKLL--GFTPGELRSYL 267
Query: 368 QYIIDNG-GVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATR 426
+ + G+V+ A ++S E F V V++ + + ++ R
Sbjct: 268 KGATGHSQGLVT--AVAIAETDSW--------ESFFVSVRKAITVLF--------FIGVR 309
Query: 427 GPLSVGMNANGLFYYSGGVIDLNQRLYG-TSIPYWIVKNSWGSDWGEKVEDKVGSSGNRT 485
Y TS+P I+++S + E G +
Sbjct: 310 C----------------------YEAYPNTSLPPSILEDSL--ENNE---------GVPS 336
Query: 486 RDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYI-IDNGG---VVSDQAYP 541
L ++ L++ + V+ N I + NG VVS P
Sbjct: 337 PMLSISN-----LTQEQVQDYVN------KTNSHLPAGKQVEISLVNGAKNLVVS--GPP 383
Query: 542 YKASESERGCLVGEEEGF-KVKV---KEYSRIPYGEEEEMKKWVATR-GPLSV 589
+S L G K K + SRIP+ E K + R P++
Sbjct: 384 ----QS----LYGLNLTLRKAKAPSGLDQSRIPFSER---KLKFSNRFLPVAS 425
Score = 45.8 bits (108), Expect = 4e-05
Identities = 89/549 (16%), Positives = 172/549 (31%), Gaps = 189/549 (34%)
Query: 134 MRAFQFNSLRHGD-----DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ 188
R +L HG +P A + A S+++EQ F+ + + E
Sbjct: 6 TRPL---TLSHGSLEHVLLVPTASFFIA----SQLQEQ--------FNKI-LPEP----- 44
Query: 189 GNNLTELSVQHHDKVYSSVEDLLRRHENFVTN-VEKAEDYQSEDSGTAVFGVNKFFDLSE 247
TE + ++ +L+ + +V++ VE ++ Q + +F E
Sbjct: 45 ----TEGFAADDEP--TTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCL--TEF----E 92
Query: 248 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 307
+ L G DI +L A T ++ + ++ + A +
Sbjct: 93 NCY--LEG-------NDIH-ALAAKLLQENDTTLVKTKEL--IK-------NY-ITARIM 132
Query: 308 ISK-VKEQGKCACCWAFSAVGVVEA-MHAI---QGNS---LTELS---------VQQLVD 350
+ ++ A F AVG A + AI QGN+ EL V L+
Sbjct: 133 AKRPFDKKSNSAL---FRAVGEGNAQLVAIFGGQGNTDDYFEELRDLYQTYHVLVGDLIK 189
Query: 351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVK--VKEY 408
+ L +I + +++ + +G + ++
Sbjct: 190 -----------FSAETLSELIRT-TLDAEKVFT---------------QGLNILEWLENP 222
Query: 409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGS 468
S P ++ ++ + P+S + GVI L Y + G
Sbjct: 223 SNTP-DKD-----YLLS-IPISCPLI---------GVIQLAH--------YVVTAKLLGF 258
Query: 469 DWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLAT--EKLVDCDMSNGGCNGGRMDDALQ 526
GE G++G ++ L +T V + E + + L
Sbjct: 259 TPGELRSYLKGATG-HSQGL-VTAVAIA----ETDSWESFFV--------SVRKAITVLF 304
Query: 527 YIIDNGGVVSDQAYPYKASESE--RGCLVGEEEGFK---VKVKEYSRIPYGEEEEMKKWV 581
+I GV +AYP + + EG + + ++ E+++ +V
Sbjct: 305 FI----GVRCYEAYPNTSLPPSILEDS-LENNEGVPSPMLSISNLTQ------EQVQDYV 353
Query: 582 A---TRGP----LSVGMNANGL--FYYSGGVIDL---NQRLCNPKAQNHALIIVGYGEEE 629
+ P + + + NG SG L N L KA +
Sbjct: 354 NKTNSHLPAGKQVEISL-VNGAKNLVVSGPPQSLYGLNLTLRKAKAPS------------ 400
Query: 630 KKDGTSIPY 638
D + IP+
Sbjct: 401 GLDQSRIPF 409
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 57.9 bits (139), Expect = 4e-09
Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 13/116 (11%)
Query: 288 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQ 347
++ + + F E I+ VK Q + CW +S+ +E+ G +LS
Sbjct: 2 DTEKKVSEEGFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMF 61
Query: 348 LVDCDMSNGG------------CNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 391
V + GG DAL Y ++ G+V ++ ++
Sbjct: 62 TVYNTYLDRADAAVRTHGDVSFSQGGSFYDAL-YGMETFGLVPEEEMRPGMMYADT 116
Score = 44.8 bits (105), Expect = 6e-05
Identities = 46/366 (12%), Positives = 100/366 (27%), Gaps = 51/366 (13%)
Query: 140 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
++ + + F E I+ VK Q + CW +S+ +E+ G +LS
Sbjct: 2 DTEKKVSEEGFVFTTVKENPITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMF 61
Query: 200 ------HDKVYSSVEDLLRRHEN-------FVTNVEK-----AEDYQSEDSGTAVFGVNK 241
D+ ++V + + +E E+ + +
Sbjct: 62 TVYNTYLDRADAAVRTHGDVSFSQGGSFYDALYGMETFGLVPEEEMRPGMMYADTLSNHT 121
Query: 242 FFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 301
+ L+ N +A + PE F
Sbjct: 122 ELSALTDAMVAAIAKGKLRKLQS---------DENNAMLWKKAVAAVHQIYLGVPPEKFT 172
Query: 302 WRAEGVISKVKEQGKCACCWAFSAVG-----------VVEAMHAIQGNSLTELSVQQL-- 348
++ + K + + ++ +E + L + +
Sbjct: 173 YKGKEYTPKSFFESTGLKASDYVSLTSYTHHPFYTQFPLEIQDNWRHGMSYNLPLDEFME 232
Query: 349 -VDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKE 407
D ++ G D + +G V + G + K + K+
Sbjct: 233 VFDNAINTGYTIAWGSDVSESGFTRDGVAVMPDDEKV---QELSGSDMAHWLKLKPEEKK 289
Query: 408 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWG 467
+ P ++ + + +G+ Y G D Y++VKNSWG
Sbjct: 290 LNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIY-GIAKDQEGN------EYYMVKNSWG 342
Query: 468 SDWGEK 473
++
Sbjct: 343 TNSKYN 348
Score = 39.8 bits (92), Expect = 0.002
Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 4/36 (11%)
Query: 617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
+H + I G ++++ Y++VKNSWG++
Sbjct: 317 DHGMQIYGIAKDQEG----NEYYMVKNSWGTNSKYN 348
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 52.4 bits (126), Expect = 9e-09
Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 2/73 (2%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102
F +F + K Y++ E+ RR+ F N+ + ++ + ++N F DLS +
Sbjct: 23 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEF 81
Query: 103 QQL-TGLNLDSTL 114
++ G L
Sbjct: 82 RRKYLGFKKSRNL 94
Score = 47.8 bits (114), Expect = 5e-07
Identities = 14/66 (21%), Positives = 27/66 (40%), Gaps = 2/66 (3%)
Query: 198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGL 256
+ K Y++ E+ RR+ F N+ + + + +N F DLS + ++ G
Sbjct: 30 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 88
Query: 257 NLDSTL 262
L
Sbjct: 89 KKSRNL 94
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
disorder P like protein, hydrolase; NMR {Drosophila
melanogaster}
Length = 80
Score = 42.0 bits (99), Expect = 3e-05
Identities = 15/71 (21%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDSGTAVFE--VNKFFDLSD 99
++ + DK Y + EDL+RR + + + E++ ++ + G ++ +N DL+
Sbjct: 8 EEWVEYKSKFDKNYEAEEDLMRR-RIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTP 66
Query: 100 SDLQQLTGLNL 110
+ Q +G +
Sbjct: 67 EEFAQRSGKKV 77
Score = 37.7 bits (88), Expect = 8e-04
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVF--GVNKFFDLSESDLQQLTGL 256
DK Y + EDL+RR + + + E++ + + G + G+N DL+ + Q +G
Sbjct: 17 FDKNYEAEEDLMRR-RIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPEEFAQRSGK 75
Query: 257 NL 258
+
Sbjct: 76 KV 77
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 44.5 bits (104), Expect = 1e-04
Identities = 81/531 (15%), Positives = 139/531 (26%), Gaps = 150/531 (28%)
Query: 193 TELSVQHHDKVYSSVEDLLRRH-ENFVTN--VEKAEDY-----QSED------SGTAVFG 238
E + +D+L + FV N + +D E+ S AV G
Sbjct: 9 FETGEHQY-----QYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSG 63
Query: 239 VNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLR---HGDD 295
+ F S +++ ++ L L +P + Q M + R + D
Sbjct: 64 TLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYND- 122
Query: 296 LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQG---NSLTELSVQQL 348
+ F R + K+++ A A V+ + + G +
Sbjct: 123 -NQVFAKYNVSRLQ-PYLKLRQ----ALLELRPAKNVL--IDGVLGSGKTWVALDVCLSY 174
Query: 349 -VDCDMSN-------GGCNGGR----MDDALQYIID-NGGVVSDQAYPYK---------- 385
V C M CN M L Y ID N SD + K
Sbjct: 175 KVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAEL 234
Query: 386 ----ASESERGCL-----VGEE---EGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGM 433
S+ CL V F + +I + TR
Sbjct: 235 RRLLKSKPYENCLLVLLNVQNAKAWNAFNLS----CKI----------LLTTR------- 273
Query: 434 NANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVE--DKVGSSGNRTRDL--E 489
V D T I + +S E K R +DL E
Sbjct: 274 --------FKQVTDFLSAATTTHIS--LDHHSMTLTPDEVKSLLLKY--LDCRPQDLPRE 321
Query: 490 LTGVLPSKLSRLATE-----------KLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQ 538
+ P +LS +A K V+CD ++ ++ ++ +
Sbjct: 322 VLTTNPRRLSIIAESIRDGLATWDNWKHVNCD---------KLTTIIESSLNVLEPAEYR 372
Query: 539 AYPYKASESERGCLVGEEEGFKVKVKEYSRI----PYGEEEEMKKWVATRGPLSVGMNAN 594
+ L + S I + + + L
Sbjct: 373 KM-FDR-------LSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYS-LVEKQPKE 423
Query: 595 GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIP------YW 639
++L +L N + H I+ Y + D + Y+
Sbjct: 424 STISIPSIYLELKVKLEN-EYALHRSIVDHYNIPKTFDSDDLIPPYLDQYF 473
Score = 36.8 bits (84), Expect = 0.025
Identities = 60/449 (13%), Positives = 113/449 (25%), Gaps = 144/449 (32%)
Query: 9 LGEKGLGY---LHTFM---IK------VALLESNIFQTRGY---LNSP-----VTRFLNF 48
GE Y L F + V + +I + S R
Sbjct: 11 TGEHQYQYKDILSVFEDAFVDNFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWT 70
Query: 49 MRDHDK--VYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFEVNKFFDLSDSD---- 101
+ + V VE++LR + F+ + K E Q + + + ++ + +D+
Sbjct: 71 LLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLY--NDNQVFAK 128
Query: 102 ---------------LQQL---TGLNLD------------STLEDIQPSLQAPFS----- 126
L +L + +D + + F
Sbjct: 129 YNVSRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKMDFKIFWLN 188
Query: 127 -SNQTDTEMRAFQFNSL------------RHGDDLPEAFDWRAEGVISKVKEQGKCAC-- 171
N E L H ++ + +K + C
Sbjct: 189 LKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLL 248
Query: 172 ---------CW-AFS----------AVGVVEAMHAIQGNNLTELSVQHHDKVYSSVE--D 209
W AF+ V + + T +S+ HH + E
Sbjct: 249 VLLNVQNAKAWNAFNLSCKILLTTRFKQVTD---FLSAATTTHISLDHHSMTLTPDEVKS 305
Query: 210 LLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDL---QQLTGLNLDSTLEDIQ 266
LL + + + +D E T ++ + L +N D I+
Sbjct: 306 LLLK----YLDC-RPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIE 360
Query: 267 PSLQAPFSSNQTDTEMRAFQFNSL---RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 323
SL E R F+ L +P ++S + W
Sbjct: 361 SSLN-----VLEPAEYRK-MFDRLSVFPPSAHIPTI-------LLSLI---------WFD 398
Query: 324 SAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
V + + +L LV+
Sbjct: 399 VIKSDVMVV-------VNKLHKYSLVEKQ 420
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 37.8 bits (87), Expect = 0.011
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
HA+ E++ +DG W V+NSWG D G K
Sbjct: 370 THAMTFTAVSEKDDQDGAFT-KWRVENSWGEDHGHK 404
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 37.0 bits (85), Expect = 0.017
Identities = 16/68 (23%), Positives = 28/68 (41%), Gaps = 7/68 (10%)
Query: 590 GMNANGLFYYSGGVIDLN----QRLCNPKAQ-NHALIIVGYGEEEKKDGTSIPYWIVKNS 644
G+ L+ Y +L R+ ++ A++I G +E + V+NS
Sbjct: 340 GVMDIELWNYPAIGYNLPQQKASRIRYHESLMTAAMLITGCHVDE--TSKLPLRYRVENS 397
Query: 645 WGSDWGEK 652
WG D G+
Sbjct: 398 WGKDSGKD 405
Score = 31.2 bits (70), Expect = 1.1
Identities = 18/106 (16%), Positives = 30/106 (28%), Gaps = 4/106 (3%)
Query: 139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
N R F+ + V Q CW F+A + + + NL E +
Sbjct: 44 LNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAATNQLRL-NVLSELNLKEFELS 102
Query: 199 HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFD 244
Y D L + F+ + + D + D
Sbjct: 103 Q---AYLFFYDKLEKANYFLDQIVSSADQDIDSRLVQYLLAAPTED 145
Score = 28.9 bits (64), Expect = 6.0
Identities = 15/68 (22%), Positives = 20/68 (29%), Gaps = 1/68 (1%)
Query: 287 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT-ELSV 345
N R F+ + V Q CW F+A + + N ELS
Sbjct: 44 LNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAATNQLRLNVLSELNLKEFELSQ 103
Query: 346 QQLVDCDM 353
L D
Sbjct: 104 AYLFFYDK 111
>2spc_A Spectrin; cytoskeleton; 1.80A {Drosophila melanogaster} SCOP:
a.7.1.1
Length = 107
Score = 29.4 bits (66), Expect = 1.0
Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 9/62 (14%)
Query: 17 LHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
L +M L ES + +LN+ D +VE L+++HE+F +
Sbjct: 5 LQLYMRDCELAESWMSAREAFLNA---------DDDANAGGNVEALIKKHEDFDKAINGH 55
Query: 77 ED 78
E
Sbjct: 56 EQ 57
>3tqc_A Pantothenate kinase; biosynthesis of cofactors, prosthetic groups,
carriers, TRAN; HET: ADP; 2.30A {Coxiella burnetii}
Length = 321
Score = 29.2 bits (65), Expect = 3.8
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 244 DLSESDLQQLTGLNLDSTLEDIQ 266
L+ESDL +L G +L+++
Sbjct: 34 TLTESDLDKLQGQIEIVSLKEVT 56
>3vcz_A Endoribonuclease L-PSP; virulence, pathogenesis, infectious
diseases, center for STR genomics of infectious
diseases, csgid, translation; HET: GOL; 1.80A {Vibrio
vulnificus}
Length = 153
Score = 28.0 bits (63), Expect = 6.2
Identities = 9/42 (21%), Positives = 15/42 (35%), Gaps = 6/42 (14%)
Query: 408 YSRIPYGEEEEMKKWVATR------GPLSVGMNANGLFYYSG 443
+ Y + M K + T GP G++ + SG
Sbjct: 14 GTENLYFQSNAMTKVLHTDSAPAAIGPYIQGVDLGNMVLTSG 55
Score = 28.0 bits (63), Expect = 6.2
Identities = 9/42 (21%), Positives = 15/42 (35%), Gaps = 6/42 (14%)
Query: 566 YSRIPYGEEEEMKKWVATR------GPLSVGMNANGLFYYSG 601
+ Y + M K + T GP G++ + SG
Sbjct: 14 GTENLYFQSNAMTKVLHTDSAPAAIGPYIQGVDLGNMVLTSG 55
>3fb2_A Spectrin alpha chain, brain spectrin; non-erythroid alpha chain
alpha-II spectrin, fordrin alpha chain, sptan1,
SPTA2_human, NESG, HR5563A; 2.30A {Homo sapiens}
Length = 218
Score = 28.3 bits (63), Expect = 6.5
Identities = 14/62 (22%), Positives = 24/62 (38%), Gaps = 9/62 (14%)
Query: 17 LHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
L F E+ + +LN+ D SVE L+++HE+F +
Sbjct: 121 LQLFHRDCEQAENWMAAREAFLNTE---------DKGDSLDSVEALIKKHEDFDKAINVQ 171
Query: 77 ED 78
E+
Sbjct: 172 EE 173
>3aez_A Pantothenate kinase; transferase, homodimer, COA biosynthesis,
nucleotide binding binding, cytoplasm,
nucleotide-binding; HET: GDP PAZ; 2.20A {Mycobacterium
tuberculosis} PDB: 2ges_A* 2geu_A* 2gev_A* 2zs7_A*
2zs8_A* 2zs9_A* 2zsa_A* 2zsb_A* 2zsd_A* 2zse_A* 2zsf_A*
2get_A* 3af0_A* 3af1_A* 3af2_A* 3af3_A* 3af4_A* 3avp_A*
3avo_A* 3avq_A*
Length = 312
Score = 28.3 bits (63), Expect = 7.6
Identities = 7/23 (30%), Positives = 12/23 (52%)
Query: 244 DLSESDLQQLTGLNLDSTLEDIQ 266
L+E +L L GL L +++
Sbjct: 28 ALTEEELVGLRGLGEQIDLLEVE 50
>4ezg_A Putative uncharacterized protein; internalin-A, leucine-rich repeat
protein, structural genomi center for structural
genomics, JCSG; HET: MSE; 1.50A {Listeria monocytogenes}
Length = 197
Score = 28.0 bits (63), Expect = 8.6
Identities = 6/31 (19%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 238 GVNKFFDLSESDLQQLTGLNLDST-LEDIQP 267
G + +++E+ + LT + L + + D+
Sbjct: 31 GQSSTANITEAQMNSLTYITLANINVTDLTG 61
>1sq5_A Pantothenate kinase; P-loop, transferase; HET: PAU ADP; 2.20A
{Escherichia coli} SCOP: c.37.1.6 PDB: 1esm_A* 1esn_A*
Length = 308
Score = 28.0 bits (62), Expect = 9.5
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 245 LSESDLQQLTGLNLDSTLEDIQ 266
LSE ++ +L G+N D +LE++
Sbjct: 23 LSEDEIARLKGINEDLSLEEVA 44
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.315 0.133 0.400
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 10,031,962
Number of extensions: 605563
Number of successful extensions: 1604
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1276
Number of HSP's successfully gapped: 155
Length of query: 655
Length of database: 6,701,793
Length adjustment: 100
Effective length of query: 555
Effective length of database: 3,909,693
Effective search space: 2169879615
Effective search space used: 2169879615
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.7 bits)